InfoBlueprint is a powerful data profiling tool that helps you analyze and understand your datasets through an interactive web interface. Built with Streamlit, it provides comprehensive insights into your data structure, quality, and patterns.
- Interactive data exploration
- Automated data profiling and statistics
- Pattern recognition
- Visualization capabilities
- Support for multiple data sources
- Python 3.8 or higher
- uv package manager
curl -LsSf https://astral.sh/uv/install.sh | sh
powershell -ExecutionPolicy ByPass -c "irm https://astral.sh/uv/install.ps1 | iex"
# Clone the repository
git clone https://github.com/yourusername/infoblueprint.git
cd infoblueprint
# Initialize virtual environment and install dependencies
uv init
uv add -r requirements.txt
source .venv/bin/activate
.venv\Scripts\activate
streamlit run 01_connector.py
- Open your web browser and navigate to
http://localhost:8501
- Upload your dataset through the web interface
- Configure profiling parameters
- Explore the generated insights and visualizations
-
Installation Fails
- Ensure Python version compatibility
- Check system requirements
- Verify internet connection
-
Application Won't Start
- Confirm virtual environment is activated
- Check port 8501 availability
- Verify dependencies installation
-
Memory Issues
- Increase available RAM
- Reduce dataset size