TabSketchFM: Sketch-based Tabular Representation Learning for Data Discovery over Data Lakes. Aamod Khatiwada, Harsha Kokel, et al. ICDE 2025.
TabSketchFM: Sketch-based Tabular Representation Learning for Data Discovery over Data Lakes. Aamod Khatiwada, Harsha Kokel, et al. NeurIPS 2024.
DataRinse: Semantic Transforms for Data preparation based on Code Mining. Ibrahim Abdelaziz, Julian Dolby, et al. VLDB 2023.
SemFORMS: Automatic Generation of Semantic Transforms By Mining Data Science Code. Ibrahim Abdelaziz, Julian Dolby, et al. IJCAI 2023.
A Scalable AutoML Approach Based on Graph Neural Networks. Mossad Helali, Essam Mansour, et al. VLDB 2022.
Automatically Debugging AutoML Pipelines Using Maro: ML Automated Remediation Oracle. Julian Dolby, Jason Tsay, et al. MAPS 2022.
CodeNet: A Large-Scale AI for Code Dataset for Learning a Diversity of Coding Tasks. Ruchir Puri, David Kung, et al. NeurIPS 2021.