-
Notifications
You must be signed in to change notification settings - Fork 15
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Text Extracting #1
Comments
The Manifold Alfresco connector could invoke NodeContentGet (with http or https, both are available) during the manifold processDocument; this would imply:
|
But NodeContentGet is protected by solr-specific authentication mechanism (certificat). Is there another way to call this webscript in HTTP / without certificat ? |
You can run without SSL - https://wiki.alfresco.com/wiki/Alfresco_And_SOLR#Running_Without_SSL |
When SSL is disabled, Solr webscript are accessible without any authentication. Not sure it's good idea, and you need to protect another way these webscripts. I think exposing this webscript with the standard authentication mechanism can solve theses problem. |
The all-in-one archetype is configured to use http (nossl) for Alfresco-Solr comms (in both directions) |
the maven SDK disable SSL during development phase, not in production environment ... |
True, but it shows how you need to patch the Alfresco web.xml in order to disable SSL |
Hi,
ManifoldCF use extract update handler to handle binary content. Binary content is sent to solr, and tikka try to extract text content and some metadata (mime type).
For alfresco connector, Alfresco should be used to convert binary to text as official solr do (by calling NodeContentGet). Because alfresco already know how to convert document to text.
But NodeContentGet webscript is protected by Certificat, you have to clone this webscript.
(original issue - philipmeadows/alfresco-webscript-manifold-connector#21 by @alexist )
The text was updated successfully, but these errors were encountered: