Research smarter, discover faster.
The researchify is designed to streamline and expedite the process of finding research papers. Leveraging the powerful combination of Streamlit and Xata, the application provides users with a user-friendly and efficient platform for accessing academic literature.
-
Efficient Search: Quickly locate relevant research papers based on keywords.
-
User-Friendly Interface: Enjoy an intuitive and visually appealing interface, ensuring a seamless user experience.
-
Xata Database Integration: Benefit from the reliability of the Xata database for efficient data storage, retrieval, and organization.
-
Ask AI: With the conversational interface, ask the question related to the database. Xata's AI has you covered!
-
Quick Access to Full Papers: Efficiently access the full texts of research papers for in-depth exploration.
Follow these steps to get started with researchify locally:
-
Importing the data: Download the dataset, create a database in Xata and import the CSV file.
-
Clone the Repository:
git clone https://github.com/dotAadarsh/researchify.git -
Install Dependencies:
pip install -r requirements.txt -
Add your Xata credentials: Get your API Key and Database URL from Xata and add it to the .streamlit/secrets.toml file
-
Run the App:
streamlit run app.py
Arxiv.org AI Research Papers Dataset ↗
This dataset is a valuable resource for researchers and practitioners in the field of AI. It can be used to track the latest research trends, identify emerging research areas, and find relevant papers. This dataset contains the metadata for 10,000 research papers in the field of artificial intelligence (AI) that were published on arXiv.org.
License: Attribution 4.0 International (CC BY 4.0)
If you are new to Xata I'd suggest you read Getting started with Xata | Aadarsh (aadarshkannan.tech). Sign up at Xata - The serverless database built for modern development and start creating a database and make a note of Xata API and Database URL which we will need later on to connect with our database through Python SDK.
Once you create a database you will see import CSV data. Click on that and upload the downloaded dataset. Xata can guess column types from your data automatically, and if needed to change you can choose the dropdown and select the desired type. Once the import is completed, you will be able to see the table.
As you insert data into Xata, it is automatically indexed for full-text search. Try exploring the Search Engine feature which you can find in the left side pane. Try changing various parameters and see what happens. Click on Get the code snippet, select the language Python and copy the code. We will be using this snippet to build the search feature through our app.
Now go to the playground and try out different operations you want to perform through programming. Xata's ask endpoint utilizes current data from your Xata database and leverages Xata's search functionalities to retrieve pertinent information from our database. It then utilizes the OpenAI's ChatGPT API to understand your query and produce natural language responses. Run the following example in the playground and see the result.
result = xata.data().ask("Table_name", "Question")Here is what the response might looks like:
The Researchify app is built upon Streamlit and Xata's Python SDK. It has two pages, one is for search and other one is for chatbot.






