Skip to content

bloubi/datafari

 
 

Repository files navigation

-------------------------------------------------- DATAFARI V. 2.0 --------------------------------------------------

Datafari is the perfect product for anyone who needs to search within its corporate big data,
based on the most advanced open source technologies.
Datafari combines both the Apache ManifoldCF and Solr products, and proposes to its users to search into file shares,
cloud shares (dropbox, google drive), databases, but also emails and many more sources. 
Available as community and enterprise edition, Datafari is different from the competition : 
- Its open source license is not aggressive, as it uses the Apache v2 license: you are free to do whatever you want with it,
you just need to mention that you are using it. 
- It combines two renowned Apache projects, namely ManifoldCF and Solr, which gives Datafari a long term vision.

Pre-Requirements:

- Debian Environment 64 bits (a Docker image is available if you are on Windows environment)
- Processor : 1GHZ and RAM : 2GB
- Ports 8080 and 5432 are opened
- Debian environment : requires curl, jq

How to install and  start Datafari :

You can build the Debian installer with the ant script Datafari/debian7/build.xml. You can download Debian installer and Docker image from www.datafari.com.

1) Install Datafari :
# dpkg -i datafari.deb
2) Start Datafari : 
# cd /opt/datafari/bin
# bash start-datafari.sh
3) Stop Datafari :
# cd /opt/datafari/bin
# bash stop-datafari.sh

- Search UI :
http://localhost:8080/Datafari/

- Admin UI :
http://localhost:8080/Datafari/admin

You can find video tutorials on how to install and start Datafari from the installer :
- Debian : https://www.youtube.com/watch?v=cekFICeTTTs


If you want to use the jcifs connector in ManifoldCF, download  jcifs-1.3.xx.jar from http://jcifs.samba.org/src/ to DATAFARI_SOURCE_DIR\mcf\mcf_home\connector-lib-proprietary
Then edit the file Datafari/mcf/mcf_home/connectors.xml and uncomment the line :
 <!--repositoryconnector name="Windows shares" class="org.apache.manifoldcf.crawler.connectors.sharedrive.SharedDriveConnector"/-->
 And restart Datafari

You have to configure your Repository connector and job to add documents to Datafari.
You can find a video tutorial on how to index local file share here :
https://www.youtube.com/watch?v=w0FtsvZO9SI
You can find documentation on how to create connectors and jobs here : 
http://manifoldcf.apache.org/release/trunk/en_US/end-user-documentation.html


Enjoy :-)

Packages

No packages published

Languages

  • Roff 22.3%
  • Java 18.8%
  • JavaScript 17.3%
  • HTML 15.1%
  • C 7.1%
  • C++ 6.1%
  • Other 13.3%