Imported files are not indexed

cancel
Showing results for 
Search instead for 
Did you mean: 
ludoADV
Active Member II

Re: Imported files are not indexed

I confirm I used community-docker-compose.yml

 

See logs: https://pastebin.com/1jyzkAxZ 

ludoADV
Active Member II

Re: Imported files are not indexed

Also, where is stored status.xml ?

cristinamr
Advanced

Re: Imported files are not indexed


@ludoADV wrote:

I confirm I used community-docker-compose.yml


This is not an Alfresco version. In your log I can see it:

alfresco_1            | 2021-08-06 07:43:20,185  INFO  [service.descriptor.DescriptorService] [main] Alfresco Content Services started (Community). Current version: 7.0.0 (rdf924796-blocal) schema 14,002. Originally installed version: 7.0.0 (rdf924796-blocal) schema 14,002.

You are using Current version: 7.0.0 (rdf924796-blocal) schema 14,002.

In your log, I cannot see any error related to Solr (is it working today the indexation?) but, this morning you have a failure with your database:

alfresco_1            | 2021-08-06 07:41:10,226  ERROR [repo.transaction.RetryingTransactionHelper] [http-nio-8080-exec-8] Rollback failure.  Normal retry behaviour will resume.
alfresco_1            | org.springframework.transaction.TransactionSystemException: JDBC rollback failed; nested exception is org.postgresql.util.PSQLException: This connection has been closed.

Then I can see your alfresco restarted, I can see the following error:

alfresco_1            | 2021-08-06 09:05:51,333  ERROR [extensions.webscripts.AbstractRuntime] [http-nio-8080-exec-8] Exception from executeScript: 07060004 Script url /bulkfsimport/initiate does not support the method GET
alfresco_1            | org.springframework.extensions.webscripts.WebScriptException: 07060004 Script url /bulkfsimport/initiate does not support the method GET

But this issue is caused by bulk import tool, nothing to see with the original error.

The problem  I can see is that you have provided me a solr log from yesterday with a concrete error and today you have provided me an Alfresco log in a diferent day with nothing related to the original error (logs are getting from different days and time!).

When  you want to see a concrete error the best option is stop your alfresco, start it, reproduce the problem and paste Alfresco and Solr logs during this preiod of time. So if you are having troubles, it's provably the technical erro will appear on these lines. Could you please do this?

 

Also, where is stored status.xml ?

What is it related? Bulk import tool?

 

Cheers,

Cristina.

--
VenziaIT: helping companies since 2005! Our ECM products: AQuA & Seidoc
ludoADV
Active Member II

Re: Imported files are not indexed

cristinamr
Advanced

Re: Imported files are not indexed

Nothing related to Solr but you have several errors related to Bulk Import:

[36malfresco_1            | 2021-08-06 09:41:47,319  ERROR [extensions.webscripts.AbstractRuntime] [http-nio-8080-exec-4] Exception from executeScript: 07060000 Script url /bulkfsimport/initiate does not support the method GET
alfresco_1            | org.springframework.extensions.webscripts.WebScriptException: 07060000 Script url /bulkfsimport/initiate does not support the method GET

Your issue is related with those documents you are trying to import and is getting this error.

Which steps are you performing in order to use Bulk Import?

--
VenziaIT: helping companies since 2005! Our ECM products: AQuA & Seidoc
ludoADV
Active Member II

Re: Imported files are not indexed

I followed official doc using http://dms:8080/alfresco/service/bulkfsimport

 

Docs have been correctly imported, I can see and open them. But cannot search pattern in file content.

cristinamr
Advanced

Re: Imported files are not indexed

If you are using the console, there must be something wrong with the steps your are performing because you have several errors related to bulk import (and most provably that's why those documents are not indexed). Could you please try with curl command? An example.


Docs have been correctly imported, I can see and open them. But cannot search pattern in file content.


Maybe you have a problem with status.xml file.  BTW, http://dms:8080/alfresco/service/bulk/import/filesystem/status.xml  there you have the status.xml file (or similar to that path). If you want an indexation of the properties you'll need to prepare the xml in order to be injected with those documents.

--
VenziaIT: helping companies since 2005! Our ECM products: AQuA & Seidoc
ludoADV
Active Member II

Re: Imported files are not indexed

root@dms:/opt/reports# curl -v -u admin:admin -X POST "http://dms:8080/alfresco/service/bilkfsimport/initiate" --url "http://dms:8080/alfresco/s/bulkfsimport/initiate" --data "targetPath=/Sites/reports/documentLibrary/&sourceDirectory=/opt/reports/CERTs/Avatars-copycats_Felix-Aime_Ronan-Mouchoux.pdf&existingFileMode=REPLACE"

Note: Unnecessary use of -X or --request, POST is already inferred.

*   Trying 127.0.1.1:8080...

* TCP_NODELAY set

* Connected to dms (127.0.1.1) port 8080 (#0)

* Server auth using Basic with user 'admin'

> POST /alfresco/service/bilkfsimport/initiate HTTP/1.1

> Host: dms:8080

> Authorization: Basic YWRtaW46YWRtaW4=

> User-Agent: curl/7.68.0

> Accept: */*

> Content-Length: 149

> Content-Type: application/x-www-form-urlencoded

> 

* upload completely sent off: 149 out of 149 bytes

* Mark bundle as not supporting multiuse

< HTTP/1.1 404 

< Server: nginx/1.18.0

< Date: Fri, 06 Aug 2021 12:25:25 GMT

< Content-Type: text/html;charset=UTF-8

< Transfer-Encoding: chunked

< Connection: keep-alive

< X-Frame-Options: SAMEORIGIN

< Cache-Control: no-cache

< Expires: Thu, 01 Jan 1970 00:00:00 GMT

< Pragma: no-cache

< 

<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd">

<html xmlns="http://www.w3.org/1999/xhtml">

   <head>

      <title>Web Script Status 404 - Not Found</title>

      <link rel="stylesheet" href="/alfresco/css/webscripts.css" type="text/css" />

   </head>

   <body>

      <div>

         <table>

            <tr>

               <td><img src="/alfresco/images/logo/AlfrescoLogo32.png" alt="Alfresco" /></td>

               <td><span class="title">Web Script Status 404 - Not Found</span></td>

            </tr>

         </table>

         <br/>

         <table>

            <tr><td>The Web Script <a href="%2Falfresco%2Fservice%2Fbilkfsimport%2Finitiate">/alfresco/service/bilkfsimport/initiate</a> has responded with a status of 404 - Not Found.</td></tr>

         </table>

         <br/>

         <table>

            <tr><td><b>404 Description:</b></td><td> Requested resource is not available.</td></tr>

            <tr><td>&nbsp;</td></tr>

            <tr><td><b>Message:</b></td><td>07060004 Script url /bilkfsimport/initiate does not map to a Web Script.</td></tr>

            <tr><td><b>Server</b>:</td><td>Community v7.0.0 (rdf924796-blocal) schema 14,002</td></tr>

            <tr><td><b>Time</b>:</td><td>Aug 6, 2021, 12:25:25 PM</td></tr>

            <tr><td></td><td>&nbsp;</td></tr>

         </table>

      </div>

   </body>

</html>

 

* Connection #0 to host dms left intact

Note: Unnecessary use of -X or --request, POST is already inferred.

* Found bundle for host dms: 0x5626c5c97c70 [serially]

* Can not multiplex, even if we wanted to!

* Re-using existing connection! (#0) with host dms

* Connected to dms (127.0.1.1) port 8080 (#0)

* Server auth using Basic with user 'admin'

> POST /alfresco/s/bulkfsimport/initiate HTTP/1.1

> Host: dms:8080

> Authorization: Basic YWRtaW46YWRtaW4=

> User-Agent: curl/7.68.0

> Accept: */*

> Content-Length: 149

> Content-Type: application/x-www-form-urlencoded

> 

* upload completely sent off: 149 out of 149 bytes

* Mark bundle as not supporting multiuse

< HTTP/1.1 302 

< Server: nginx/1.18.0

< Date: Fri, 06 Aug 2021 12:25:26 GMT

< Content-Type: text/html;charset=UTF-8

< Transfer-Encoding: chunked

< Connection: keep-alive

< Cache-Control: no-cache

< Expires: Thu, 01 Jan 1970 00:00:00 GMT

< Pragma: no-cache

< Location: /alfresco/s/bulkfsimport/status

< 

<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd">

<html xmlns="http://www.w3.org/1999/xhtml">

   <head>

      <title>Web Script Status 302 - Moved Temporarily</title>

      <link rel="stylesheet" href="/alfresco/css/webscripts.css" type="text/css" />

   </head>

   <body>

      <div>

         <table>

            <tr>

               <td><img src="/alfresco/images/logo/AlfrescoLogo32.png" alt="Alfresco" /></td>

               <td><span class="title">Web Script Status 302 - Moved Temporarily</span></td>

            </tr>

         </table>

         <br/>

         <table>

            <tr><td>The Web Script <a href="%2Falfresco%2Fs%2Fbulkfsimport%2Finitiate">/alfresco/s/bulkfsimport/initiate</a> has responded with a status of 302 - Moved Temporarily.</td></tr>

         </table>

         <br/>

         <table>

            <tr><td><b>302 Description:</b></td><td> The resource has temporarily moved to another location, but that future references should still use the original URI to access the resource.</td></tr>

            <tr><td>&nbsp;</td></tr>

            <tr><td><b>Message:</b></td><td></td></tr>

            <tr><td><b>Server</b>:</td><td>Community v7.0.0 (rdf924796-blocal) schema 14,002</td></tr>

            <tr><td><b>Time</b>:</td><td>Aug 6, 2021, 12:25:26 PM</td></tr>

            <tr><td></td><td>&nbsp;</td></tr>

            <tr><td><b>Diagnostics</b>:</td><td><a href="/alfresco/s/script/org/alfresco/repository/bulkfilesystemimport/initiate.post">Inspect Web Script (org/alfresco/repository/bulkfilesystemimport/initiate.post)</a></td></tr>

         </table>

      </div>

   </body>

</html>

 

* Connection #0 to host dms left intact

ludoADV
Active Member II

Re: Imported files are not indexed

 About status.xml:

http://dms:8080/alfresco/service/bulk/import/filesystem/status.xml returns error 404 - File not found

abhinavmishra14
Advanced

Re: Imported files are not indexed


@ludoADV wrote:

Can you give me instructions to get log file(s) you desire ? (sorry, I'm newbie on this solution).


In additon try enabling these logs as debug and then share the logs as @cristinamr mentioned there is nothing specific to solr. The below logs would intercept all solr httpcalls and trackers 

Log settings on alfresco layer:

#Solr query and httpclient logs
log4j.logger.org.alfresco.solr.query.AbstractQParser=DEBUG
log4j.logger.org.alfresco.repo.search.impl.solr.SolrQueryHTTPClient=DEBUG

Logs settings on search service layer:

#Indexing tracker logs
log4j.logger.org.alfresco.solr.tracker.AclTracker=DEBUG
log4j.logger.org.alfresco.solr.tracker.ContentTracker=DEBUG
log4j.logger.org.alfresco.solr.tracker.MetadataTracker=DEBUG
log4j.logger.org.alfresco.solr.tracker.ModelTracker=DEBUG

The above logs may give some insights. 

~Abhinav
(ACSCE, AWS SAA, Azure Admin)