Collection of Python scripts and utilities - Dominique S. Loyer
A collection of Python scripts and utilities developed for various research and development projects, including:
- Information Retrieval configurations for TREC/Pyserini
- NLP preprocessing and analysis tools
- Data processing utilities
- ML/AI helper functions
# Clone the repository
git clone https://github.com/DominiqueLoyer/my_code_python.git
cd my_code_python
# Install dependencies
pip install numpy pandas nltk tqdm
# Use the scripts
python mycode.py| File | Description |
|---|---|
mycode.py |
Main collection of Python utilities (~2MB) |
LICENSE |
Apache License 2.0 |
README.md |
This documentation |
- Java 21 + Pyserini configuration
- TREC topic parsing
- BM25 and TF-IDF implementations
- Query expansion techniques
- NLTK resource management
- Text preprocessing pipelines
- Tokenization and stemming
- Configuration scripts for Colab/Kaggle
- Batch processing helpers
- Result formatting tools
@software{loyer2025mycodepython,
author = {Loyer, Dominique S.},
title = {My Code Python: Research Utilities Collection},
year = {2025},
publisher = {Zenodo},
doi = {10.5281/zenodo.17917855},
url = {https://github.com/DominiqueLoyer/my_code_python}
}Dominique S. Loyer
- ORCID: 0009-0003-9713-7109
- GitHub: @DominiqueLoyer
- Affiliation: Université du Québec à Montréal (UQAM)
Apache License 2.0 - See LICENSE for details.
Code provided as-is for research and educational purposes. Use at your own risk!
Last Updated: January 2026