I am using Alfresco Community 5.2.0. I created a folder inside the repository and uploaded around 5,35000 files. Whenever I try to open the folder in the UI, Alfresco becomes very slow and almost inaccessible. I have also noticed that some queries run in Postgres indefinitely whenever I do this.
Alfresco's recommendations for best performance include a limit of 3,000 nodes per level. If you have a folder containing 5 million files, I recommend creating subfolders so that no level of the hierarchy has more than 3,000 elements.
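As a rough illustration of such a hierarchy, here is a minimal Python sketch that derives a subfolder path from a file name. The hashing scheme and the names `bucket_path` and `max_children` are assumptions made up for this example, not something Alfresco prescribes; you could just as well split by date or by business key. It only computes paths and does not call any Alfresco API.

```python
import hashlib
from collections import Counter


def bucket_path(filename: str, levels: int = 2, width: int = 2) -> str:
    """Map a file name to a nested folder path such as 'a3/0f'.

    Each level uses `width` hex characters, so with width=2 a folder has
    at most 256 subfolders, well below the 3,000-node guideline.
    """
    digest = hashlib.sha1(filename.encode("utf-8")).hexdigest()
    parts = [digest[i * width:(i + 1) * width] for i in range(levels)]
    return "/".join(parts)


def max_children(filenames, levels: int = 2, width: int = 2) -> int:
    """Return the size of the fullest leaf folder for the given names."""
    leaf_counts = Counter(bucket_path(n, levels, width) for n in filenames)
    return max(leaf_counts.values())


if __name__ == "__main__":
    # Hypothetical file names, only to show the resulting layout.
    for name in ("invoice-000001.pdf", "invoice-000002.pdf"):
        print(name, "->", bucket_path(name))
```

With two levels of 256 folders each there are 65,536 leaf folders, so 5 million files would average about 76 files per leaf if the hash spreads them evenly.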
Thanks a lot. But I am trying to understand how this will improve performance. Is there anything that would improve query performance at the Postgres DB level? Can you please provide some detail on this?
It doesn't make any sense to try to optimize PostgreSQL when you are grossly abusing the limits recommended by the software vendor. It would be like cramming four elephants into a compact car and then asking how to tune the engine to get better gas mileage.
"How this will improve the performance" = better distribution of data, so that a simple "list contents" operation does not need to perform a full table scan and process all 5 million files. Permission checking is done after loading from the database, so the database query itself cannot effectively use offset / pagination to reduce the amount of data loaded: individual elements may not be accessible to the current user. As a result, a simple list operation typically ends up being a "load all in this folder" operation at the DB layer, with performance corresponding to the amount of data in that folder (and relative to the overall distribution).
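To make the pagination point concrete, here is a toy Python model of "filter after load". It is not Alfresco's actual code: `list_page` and `can_read` are hypothetical names, and it assumes the listing also needs the total number of visible items. The thing to notice is that the number of rows loaded depends on the folder size, not on the page size.

```python
from typing import Callable, Iterable, List, Tuple


def list_page(
    rows: Iterable[str],
    can_read: Callable[[str], bool],
    skip: int,
    max_items: int,
) -> Tuple[List[str], int, int]:
    """Return (page, total_visible, rows_loaded) for one folder listing.

    `skip` counts visible items only, so it cannot be pushed down to the
    database as a plain OFFSET: hidden rows would shift the page.
    """
    page: List[str] = []
    total_visible = 0
    rows_loaded = 0
    for row in rows:
        rows_loaded += 1           # every child row is fetched
        if not can_read(row):      # permission check happens after load
            continue
        if total_visible >= skip and len(page) < max_items:
            page.append(row)
        total_visible += 1
    return page, total_visible, rows_loaded


if __name__ == "__main__":
    # Hypothetical folder where the user may read every second document.
    folder = [f"doc-{i}" for i in range(10_000)]
    readable = lambda name: int(name.split("-")[1]) % 2 == 0
    page, total, loaded = list_page(folder, readable, skip=0, max_items=50)
    print(len(page), total, loaded)
```

In this model, a page of 50 items from a folder of 10,000 children still touches all 10,000 rows. Keeping each folder small bounds that cost no matter how the database is tuned.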