Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

hive-testbench doesn't generate database #31

Open
vivian5-git opened this issue Dec 2, 2020 · 9 comments
Open

hive-testbench doesn't generate database #31

vivian5-git opened this issue Dec 2, 2020 · 9 comments

Comments

@vivian5-git
Copy link

Hello,

I'm using hive-testbench-hdp3 on HDP3.1.4. A bunch of files were generated in target HDFS directory after running tpcds-setup.sh. However no database was generated. Any advice to address the issue?

My steps are like:

  1. run tpcds-build.sh

  2. run 'FORMAT=parquet ./tpcds-setup.sh 10 /benchmarks/tpcds'. Note it reported an error as below

TPC-DS text data generation complete.
Loading text data into external tables.
make: *** [date_dim] Error 1
Loading constraints
Data loaded into database tpcds_bin_partitioned_parquet_10.

  1. check databses in hive - there is no new databse generated.
@amitagarwal06
Copy link
Contributor

Most likely beeline command needs to be modified in line no 73 of tpcds-setup.sh as per your setup. Add DEBUG_SCRIPT=ON to get verbose logging, that should help you to resolve your issue.

@vivian5-git
Copy link
Author

vivian5-git commented Dec 3, 2020

Thanks for the advice.
I ran the commands in tpcds-setup.sh manually. It seems the issue is about "make -j 1 -f $LOAD_FILE". This line returns error as below. Do you know what the problem is about?

No implicit rule found for all'. Considering target file date_dim'.
File date_dim' does not exist. Finished prerequisites of target file date_dim'.
Must remake target date_dim'. Invoking recipe from load_parquet_2.mk:3 to update target date_dim'.
Putting child 0x24cc530 (date_dim) PID 21140 on the chain.
Live child 0x24cc530 (date_dim) PID 21140
Reaping losing child 0x24cc530 PID 21140
make: *** [date_dim] Error 1
Removing child 0x24cc530 PID 21140 from chain.

@vivian5-git
Copy link
Author

Finally I got all commands run successfully manually. Thanks.

@ssainz
Copy link

ssainz commented Mar 9, 2021

Hi @vivian5-git - how did you resolve the issue? I am facing similar issue.

@ergouy
Copy link

ergouy commented Mar 24, 2021

Finally I got all commands run successfully manually. Thanks.

Where did you modify it

@nuclearwu
Copy link

how did you resolve the issue? I am facing similar issue.

@lyzGithub
Copy link

@huanghuazhang
Copy link

The process of loading is text file table->orc table. The default table type (hive.default.fileformat) of my hive environment is orc, then the text file table ddl may become orc type by mistake, and then the same error will occur. This problem can be solved by changing the default type setting in /settings/load-flat.sql:
Set hive.default.fileformat=TextFile

@bazingagain
Copy link

you can find the error message in hive server2 log.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

8 participants