This tool is designed to perform topic modeling and other various text analysis on textual data using R for core analysis and a Python-based user interface (UI) built with the Tkinter library. It is particularly effective with long textual responses and provides visual aids through word clouds for shorter surveys.
python setup_env.py
venv\Scripts\activate
python topic_modeling_app.py
- First, run the following script: setup_env.py
- Second, activate your virtual environment with the following command: venv\Scripts\activate
- Third, run the last script: topic_modeling_app.py
Are you running to any issues regarding Execution Policy? You can temporarily bypass the restriction for the current PowerShell session by running the following command in your terminal:
Set-ExecutionPolicy -Scope Process -ExecutionPolicy Bypass
Afterwards, try activating the virtual environment again:
venv\Scripts\activate
- Topic Modeling: Ideal for analyzing extensive text data.
- Word Cloud: Visualizes the most frequent terms in datasets, best suited for shorter surveys.
- More to follow
Before using this tool, please ensure the following steps are completed to set up your environment:
- R and Python: Ensure both R and Python are installed on your computer. Download them from their official websites if necessary.
- Library Installation:
- R Libraries: Open your R console, navigate to the directory containing
requirements.R
, and executesource('requirements.R')
. - Python Libraries: Open a command prompt or terminal, navigate to the directory containing
requirements.txt
, and executepip install -r requirements.txt
.
- R Libraries: Open your R console, navigate to the directory containing
- Verify that the TreeTagger tool is correctly installed and its path is appropriately set in both R scripts for text processing. Search for the term "teamIR" in the scripts to identify and update these paths.
- Open
topic_modeling_app.py
in your Python IDE (like IDLE or PyCharm) using the file browser.
- The UI is designed to be user-friendly:
- Use the Word Cloud option for shorter surveys to visualize key terms.
- Use the Topic Modeling option for detailed analysis of more complex text data.
- Once the analysis is complete, the tool automatically saves the results in an Excel file in the same directory as the script.
If you encounter any issues or need further assistance, please feel free to contact [email protected]