Skip to content

Training models on images of cancer cells to recognize sick and healthy samples. Model training

License

Notifications You must be signed in to change notification settings

OlenaTokova/BreastCancer_project

Repository files navigation

BreastCancer_project

The project in the provided Jupyter notebook is focused on the development and application of a deep learning model for semantic segmentation of breast cancer imagery. The goal is to analyze medical images to identify and segment cancerous tissues effectively, which is crucial for diagnosis and treatment planning.

Main Goal

The main goal is to leverage advanced machine learning techniques, specifically deep learning, to improve the accuracy and efficiency of breast cancer diagnosis through automated image segmentation.

How It Works

Environment Setup: The notebook begins by setting up the necessary environment, including installing TensorFlow and other libraries, and preparing the directory structure for data management. Data Processing: Data is sourced from a public dataset, transferred from Google Drive to the local environment, and then unzipped for processing.

  • Model Training and Loading: A model is trained, saved, and later loaded for making predictions.
  • Prediction: The loaded model is used to predict test data, specifically segmenting breast cancer imagery into relevant classifications based on learned patterns.

Data Processing

The data processing involves downloading a dataset from Google Drive, unzipping it, and preparing it for input into the neural network. The images are pre-processed to a uniform size and normalized before being fed into the model.

Methods and Mathematical Approaches Used

Deep Learning: Utilizing a convolutional neural network (CNN) architecture to perform image segmentation. Batch Processing: Images are processed in batches to optimize memory usage and computational efficiency. Normalization: Image pixel values are normalized to a range between 0 and 1 to facilitate neural network training. Visualization and Image Processing Images are processed using TensorFlow's load_img and img_to_array functions, which handle loading and converting images to a suitable format for model input. Results are visualized using matplotlib to display original images alongside their predicted segmentation masks.

Future Use and Utility

The results from this project can be utilized in clinical settings to assist radiologists and oncologists by providing more accurate diagnostic tools, thus potentially improving patient outcomes. The automated system could reduce the time required for image analysis and increase the precision of tumor localization and segmentation.

Main Conclusions

The project demonstrates the application of deep learning to medical image analysis, particularly in the challenging area of breast cancer detection and segmentation.

Future Improvements

Error Handling: More robust error handling throughout the data loading and processing steps could prevent common issues like file not found or corrupted data errors. Model Performance: Including more details on the training process, such as validation loss and accuracy, would help in understanding model performance better. Scalability: Implementing more dynamic batch processing and memory management techniques could enhance the scalability of the application. User Interface: Developing a more sophisticated user interface could make the tool more accessible to non-technical medical professionals. Overall, the project provides a strong foundation for leveraging deep learning in medical imaging, with room for enhancements to increase its robustness and usability in clinical environments.

About

Training models on images of cancer cells to recognize sick and healthy samples. Model training

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published