Indexing to a TRA (Time Routed Alias) is not supported? #63
Comments
Can you share what errors or output you see on either the Hive or Solr side? I don't know of any reason why TRA wouldn't be supported, but I do know it's never been tested, and there could be some hidden limitation there.
file.log
BTW, I'm currently using the first collection created by the TRA as the target of my external table. This way I'm able to index my data, and some of my documents are routed to the correct collections, but not all of them. That's a separate problem, actually, which I reported on the Solr users' mailing list.
Hey John, thanks for the logs and thanks for bringing this to our attention. I'm gonna take a look at reproducing and will post here if/when I find something. In the meantime, do you have the commands you used to set up your TRA collection group? Any other details you could provide would also help me check my work, so to speak.
Hi. The commands used are:
Thank you for your time.
Hey John, quick update on my progress: I'm not able to reproduce the particular error you're seeing. Here are the steps I'm trying:
Am I doing something different in my reproduction here that you're not doing on your end? If so, let me know and maybe we can understand what's going on. While I didn't find the particular error you mentioned, I did see a few other things that might be bugs. Specifically:
Curious, I tried hive-solr out with a standard alias and I see the same behavior. If I set up an alias 'A' to point at two collections (C1 and C2) which both have data, querying an external table linked with 'A' only returns the data from C2. I suspect this is a deficiency in how we turn a collection name into MapReduce splits. At first glance, the code that does this here doesn't appear to be alias-aware. Still testing though.
EDIT: found a bug in our querying logic and proposed a fix here. I'm still not able to reproduce the issue you reported though, John, so if you've got any more specific instructions for reproducing, please let me know.
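To illustrate what "alias-aware" split planning could look like, here is a minimal, self-contained sketch. It is not hive-solr's actual code: the class `AliasAwareSplits`, the methods `resolveCollections` and `planSplits`, and the in-memory maps standing in for cluster state are all hypothetical. The idea is only that a name is first expanded into every collection it points at, and splits are then planned per collection, so data in C1 is not silently skipped.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration only; none of these names come from hive-solr or SolrJ.
public class AliasAwareSplits {

    // Expand a name into concrete collections. A name that is not a known
    // alias is treated as a plain collection and maps to itself.
    static List<String> resolveCollections(String name, Map<String, List<String>> aliases) {
        List<String> targets = aliases.get(name);
        if (targets == null || targets.isEmpty()) {
            return Collections.singletonList(name);
        }
        return new ArrayList<>(targets);
    }

    // Plan one split per shard of every collection behind the name.
    // shardCounts is an assumed stand-in for cluster state; missing entries default to 1 shard.
    static List<String> planSplits(String name,
                                   Map<String, List<String>> aliases,
                                   Map<String, Integer> shardCounts) {
        List<String> splits = new ArrayList<>();
        for (String collection : resolveCollections(name, aliases)) {
            int shards = shardCounts.getOrDefault(collection, 1);
            for (int i = 1; i <= shards; i++) {
                splits.add(collection + "/shard" + i);
            }
        }
        return splits;
    }

    public static void main(String[] args) {
        Map<String, List<String>> aliases = new LinkedHashMap<>();
        aliases.put("A", Arrays.asList("C1", "C2"));

        Map<String, Integer> shardCounts = new LinkedHashMap<>();
        shardCounts.put("C1", 2);
        shardCounts.put("C2", 1);

        // Prints [C1/shard1, C1/shard2, C2/shard1]
        System.out.println(planSplits("A", aliases, shardCounts));
    }
}
```

With the alias 'A' from the comment above, both C1 and C2 contribute splits. A planner that treated 'A' as a single collection name would cover at most one of them, which would match the symptom of only seeing C2's data.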
Hi Jason, thank you for your attention. HDP version: 2.6.1.0 Here are my steps and output:
Excerpt of output related to the error:
Hey John, no problem. I think you're right about the versions being relevant here. Thanks for mentioning that; I should have asked earlier. hive-solr 3.0.0 is aimed at Solr 6.x. It was tested primarily against Solr 6.6.2 and bundles Solr 6.6.2 internally for making requests. Since Solr itself doesn't guarantee client/server compatibility across major releases, we can't guarantee that hive-solr 3.0.0 will work with Solr 7.5 in all situations, especially for features like TRAs, which were introduced in 7.x.
hive-solr 4.0.0 is our release targeting Solr 7.x, and I expect you'll see the correct behavior if you use that instead. If you can, give that a shot and let me know what happens. (I'll give 3.0.0 a shot as well, to double-check that this is version-specific behavior we're seeing.)
I've been using hive-solr to index my data into Solr collections successfully. Last week I wanted to try Solr's Time Routed Alias (TRA) feature, which behaved as expected when I manually indexed a few test documents. I prepared some data to index into a TRA, but hive-solr didn't seem to see the alias; it behaved as if I were indexing into a non-existent collection. Does hive-solr support the TRA feature at all?