Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Unable to successfully run pyhs2 code beyond a certain set of rows and columns #66

Open
MyPythonGitHub opened this issue Jan 2, 2017 · 0 comments

Comments

@MyPythonGitHub
Copy link

Hi,
Wish you a very Happy and Prosperous New Year !

     I am trying to execute a query in Hive using pyhs2 library. My objective is to capture the results of the hive query into a Pandas dataframe. I am experiencing an error in my pyhs2 code beyond a limit (a specific set of rows and columns).

      In my Hive query, I am fetching 145 columns and around 4.5 million rows. My code is failing in the following circumstances:

a) When I fetch more than 52 rows and all the 145 columns, the code fails

b) If I reduced the column to 4 then , I can fetch upto a million rows but beyond that it's failing again.

        Will you please help me to resolve the error with a possible solution. Thanks in advance !

Cheers

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant