coordinator bypass recovery SAI-4705 #164

Open
wants to merge 2 commits into
base: fs/branch_9_3

Conversation

noblepaul
Collaborator

@@ -328,6 +329,13 @@ public final void run() {
   }

   public final void doRecovery(SolrCore core) throws Exception {
     if (core.getCoreDescriptor()
Collaborator

Adding to this: in our previous queryaggregator implementation we avoided all of these checks by creating a proxy core (overriding a few methods in the SolrCore class). That avoided exposing the collection to ZooKeeper. Exposing the collection creates further issues (for example, if the app talks to ZK, or calls a Solr cluster API that returns that collection).
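
For readers unfamiliar with the proxy-core idea mentioned above, here is a minimal sketch of the general pattern, not the actual queryaggregator code. Every name in it (`Registry`, `Core`, `ProxyCore`, `recover`) is a hypothetical stand-in invented for illustration; none of them is part of the real SolrCore or ZkController API. It only shows how a subclass can override a lifecycle method so that nothing is ever published to a shared registry.

```java
import java.util.ArrayList;
import java.util.List;

public class ProxyCoreSketch {

    /** Hypothetical stand-in for a shared cluster-state store (the role ZooKeeper plays). */
    static class Registry {
        private final List<String> published = new ArrayList<>();

        void publish(String coreName, String state) {
            published.add(coreName + ":" + state);
        }

        List<String> published() {
            return published;
        }
    }

    /** Hypothetical regular core: recovery publishes its state transitions. */
    static class Core {
        protected final String name;
        protected final Registry registry;

        Core(String name, Registry registry) {
            this.name = name;
            this.registry = registry;
        }

        void recover() {
            registry.publish(name, "recovering");
            registry.publish(name, "active");
        }
    }

    /** Hypothetical proxy core: overrides recovery so it never touches the registry. */
    static class ProxyCore extends Core {
        ProxyCore(String name, Registry registry) {
            super(name, registry);
        }

        @Override
        void recover() {
            // Intentionally a no-op: the proxy is never exposed in cluster state.
        }
    }

    public static void main(String[] args) {
        Registry registry = new Registry();
        new Core("data_shard1_replica_n1", registry).recover();
        new ProxyCore("coordinator_proxy", registry).recover();
        // Only the regular core's transitions appear in the registry.
        System.out.println(registry.published());
    }
}
```

With this shape, callers need no per-call checks for the special core, since the override keeps it out of the shared state entirely. Whether that trade-off suits this PR is a separate question.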

Collaborator Author

These checks are not required. These are just optimisations.

@hiteshk25
Collaborator

@noblepaul I'm merging this into fs/branch_9_3. Can you open another PR to merge the same change into fs/branch_9x?

Collaborator

hiteshk25 left a comment

LGTM

@hiteshk25
Collaborator

Coordinator test is failing

135 actionable tasks: 133 executed, 2 up-to-date
  - org.apache.solr.search.TestCoordinatorRole.testMultiCollectionMultiNode (:solr:core)
    Test output: /home/runner/work/fullstory-solr/fullstory-solr/solr/core/build/test-results/test/outputs/OUTPUT-org.apache.solr.search.TestCoordinatorRole.txt
    Reproduce with: gradlew :solr:core:test --tests "org.apache.solr.search.TestCoordinatorRole.testMultiCollectionMultiNode" -Ptests.jvms=1 "-Ptests.jvmargs=-XX:TieredStopAtLevel=1 -XX:+UseParallelGC -XX:ActiveProcessorCount=1 -XX:ReservedCodeCacheSize=120m" -Ptests.seed=CB55CCE4A0420D83 -Ptests.file.encoding=US-ASCII

@hiteshk25
Collaborator

  2> 3510517 ERROR (recoveryExecutor-51805-thread-1-processing-127.0.0.1:46337_solr .sys.COORDINATOR-COLL-conf_shard1_replica_n3 .sys.COORDINATOR-COLL-conf shard1 core_node4) [n:127.0.0.1:46337_solr c:.sys.COORDINATOR-COLL-conf s:shard1 r:core_node4 x:.sys.COORDINATOR-COLL-conf_shard1_replica_n3] o.a.s.c.RecoveryStrategy 
  2>           => java.lang.NullPointerException
  2> 	at org.apache.solr.cloud.ZkController.publish(ZkController.java:1719)
  2> java.lang.NullPointerException: null
  2> 	at org.apache.solr.cloud.ZkController.publish(ZkController.java:1719) ~[main/:9.3.0-SNAPSHOT abc0655cce152ac6a4fda1b8632bb90667209294 [snapshot build, details omitted]]
  2> 	at org.apache.solr.cloud.ZkController.publish(ZkController.java:1708) ~[main/:9.3.0-SNAPSHOT abc0655cce152ac6a4fda1b8632bb90667209294 [snapshot build, details omitted]]
  2> 	at org.apache.solr.cloud.RecoveryStrategy.doRecovery(RecoveryStrategy.java:336) ~[main/:9.3.0-SNAPSHOT abc0655cce152ac6a4fda1b8632bb90667209294 [snapshot build, details omitted]]
  2> 	at org.apache.solr.cloud.RecoveryStrategy.run(RecoveryStrategy.java:312) [main/:9.3.0-SNAPSHOT abc0655cce152ac6a4fda1b8632bb90667209294 [snapshot build, details omitted]]
  2> 	at com.codahale.metrics.InstrumentedExecutorService$InstrumentedRunnable.run(InstrumentedExecutorService.java:212) [metrics-core-4.2.19.jar:4.2.19]
  2> 	at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:515) [?:?]
  2> 	at java.util.concurrent.FutureTask.run(FutureTask.java:264) [?:?]
  2> 	at org.apache.solr.common.util.ExecutorUtil$MDCAwareThreadPoolExecutor.lambda$execute$0(ExecutorUtil.java:289) [solr-solrj-9.3.0-SNAPSHOT.jar:9.3.0-SNAPSHOT abc0655cce152ac6a4fda1b8632bb90667209294 [snapshot build, details omitted]]
  2> 	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128) [?:?]
  2> 	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628) [?:?]
  2> 	at java.lang.Thread.run(Thread.java:829) [?:?]

2 participants