|
- Online:
- Address:
-
1101 Kitchawan Road
IBM T.J. Watson Research Center
Yorktown Heights, NY 10598
|
Navigation: News, Bio, Papers, Awards, Patents, Teaching, Misc, Copyright.
News
Bio
Martin Hirzel is a researcher and the manager of the AI
Programming Models team at IBM Research AI. Martin received his PhD
from the University of Colorado at Boulder in 2004; his thesis adviser
was Amer Diwan. At IBM, Martin works on tools and languages for
artificial intelligence and streaming systems. Martin's papers won
awards at several conferences and he is an ACM
Distinguished Scientist.
Papers
See also: DBLP, Google Scholar.
-
TDD-Bench Verified: Can LLMs Generate Tests for Issues Before They Get Resolved?
(pdf,
arXiv,
open-source code),
Toufique Ahmed, Martin Hirzel, Rangeet Pan, Avraham Shinnar, and Saurabh Sinha.
arXiv:2412.02883 [cs.SE], December 2024.
-
Transforming the Hybrid Cloud for Emerging AI Workloads
(pdf,
arXiv),
Deming Chen, Alaa Youssef, Ruchi Pendse, André Schleife, Bryan K. Clark, Hendrik Hamann, Jingrui He, Teodoro Laino, Lav Varshney, Yuxiong Wang, Avirup Sil, Reyhaneh Jabbarvand, Tianyin Xu, Volodymyr Kindratenko, Carlos Costa, Sarita Adve, Charith Mendis, Minjia Zhang, Santiago Núñez-Corrales, Raghu Ganti, Mudhakar Srivatsa, Nam Sung Kim, Josep Torrellas, Jian Huang, Seetharami Seelam, Klara Nahrstedt, Tarek Abdelzaher, Tamar Eilam, Huimin Zhao, Matteo Manica, Ravishankar Iyer, Martin Hirzel, Vikram Adve, Darko Marinov, Hubertus Franke, Hanghang Tong, Elizabeth Ainsworth, Han Zhao, Deepak Vasisht, Minh Do, Fabio Oliveira, Giovanni Pacifici, Ruchir Puri, and Priya Nagpurkar.
arXiv:2411.13239 [cs.DC], November 2024.
-
PDL: A Declarative Prompt Programming Language
(pdf,
arXiv,
open-source code),
Mandana Vaziri, Louis Mandel, Claudio Spiess, and Martin Hirzel.
arXiv:2410.19135 [cs.AI], October 2024.
-
Training and Cross-Validating Machine Learning Pipelines with Limited Memory
(pdf,
@AutoML,
video,
open-source code),
Martin Hirzel, Kiran Kate, Louis Mandel, and Avraham Shinnar.
Conference on Automated Machine Learning (AutoML),
September 2024.
-
PoTo: A Hybrid Andersen's Points-to Analysis for Python
(pdf,
arXiv),
Ingkarat Rak-amnouykit, Ana Milanova, Guillaume Baudart, Martin Hirzel, and Julian Dolby.
arXiv:2409.03918 [cs.PL], September 2024.
-
Principled and Practical Static Analysis for Python: Weakest Precondition Inference of Hyperparameter Constraints
(pdf,
doi),
Ingkarat Rak-amnouykit, Ana Milanova, Guillaume Baudart, Martin Hirzel, and Julian Dolby.
Software: Practice and Experience (SP&E),
54(3), pages 363-393, March 2024.
-
AI for Low-Code for AI
(pdf,
doi,
abstract),
Nikitha Rao, Jason Tsay, Kiran Kate, Vincent J. Hellendoorn,
and Martin Hirzel.
Conference on Intelligent User Interfaces (IUI), pages 837-852, March 2024.
- Supersedes earlier version:
AI for Low-Code for AI
(pdf,
arXiv,
artifact),
Nikitha Rao, Jason Tsay, Kiran Kate, Vincent J. Hellendoorn,
and Martin Hirzel.
arXiv:2305.20015 [cs.SE], May 2023.
-
Learning Transfers over Several Programming Languages
(pdf,
arXiv),
Razan Baltaji, Saurabh Pujar, Louis Mandel, Martin Hirzel, Luca Buratti, and Lav Varshney.
arXiv:2310.16937 [cs.CL], October 2023.
-
Low-Code Programming Models
(pdf,
doi,
abstract,
video),
Martin Hirzel.
Communications of the ACM (CACM),
66(10), pages 76-85, October 2023.
-
Searching for Fairer Machine Learning Ensembles
(pdf,
PMLR,
open-source code),
Michael Feffer, Martin Hirzel, Samuel C. Hoffman, Kiran Kate,
Parikshit Ram, and Avraham Shinnar.
Conference on Automated Machine Learning (AutoML),
September 2023.
-
Oversampling to Repair Bias and Imbalance Simultaneously
(pdf,
OpenReview,
talk,
open-source code),
Martin Hirzel and Parikshit Ram.
AutoML Workshop Track,
September 2023.
-
Out-of-Order Sliding-Window Aggregation with Efficient Bulk Evictions and Insertions
(pdf,
doi,
talk,
open-source code),
Kanat Tangwongsan, Martin Hirzel, and Scott Schneider.
Conference on Very Large Data Bases (VLDB),
pages 3227-3239, August 2023.
- See also extended version:
Out-of-Order Sliding-Window Aggregation with Efficient Bulk Evictions and Insertions (Extended Version)
(pdf,
arXiv),
Kanat Tangwongsan, Martin Hirzel, and Scott Schneider.
arXiv:2307.11210 [cs.DB], July 2023.
-
A Suite of Fairness Datasets for Tabular Classification
(pdf,
arXiv,
open-source code),
Martin Hirzel and Michael Feffer.
arXiv:2308.00133 [cs.LG], July 2023.
-
Extracting Enhanced Artifical Intelligence Model Metadata from Software Repositories
(pdf,
doi),
Jason Tsay, Alan Braz, Martin Hirzel, Avraham Shinnar, and Todd Mummert.
Journal on Empirical Software Engineering (EMSE),
27, article 176, September 2022.
-
Gradual AutoML using Lale
(pdf,
doi,
open-source code),
Kiran Kate, Martin Hirzel, Parikshit Ram, Avraham Shinnar, and Jason Tsay.
Tutorial at the
Conference on Knowledge Discovery and Data Mining (KDD-Tutorial),
pages 4794-4795, August 2022.
-
An Empirical Study of Modular Bias Mitigators and Ensembles
(pdf,
open-source code,
talk),
Michael Feffer, Martin Hirzel, Samuel C. Hoffman, Kiran Kate,
Parikshit Ram, and Avraham Shinnar.
Workshop on Benchmarking Data for Data-Centric AI (DataPerf@ICML),
July 2022.
-
The Raise of Machine Learning Hyperparameter Constraints in Python Code
(pdf,
doi,
abstract,
artifact),
Ingkarat Rak-amnouykit, Ana Milanova, Guillaume Baudart, Martin Hirzel, and Julian Dolby.
International Symposium on Software Testing and Analysis (ISSTA),
pages 580-592, July 2022.
-
Automatically Debugging AutoML Pipelines Using Maro: ML Automated Remediation Oracle
(pdf,
doi,
abstract,
open-source code),
Julian Dolby, Jason Tsay, and Martin Hirzel.
Symposium on Machine Programming (MAPS),
pages 60-69, June 2022.
- See also extended version:
Automatically Debugging AutoML Pipelines Using Maro: ML Automated Remediation Oracle (Extended Version)
(pdf,
arXiv),
Julian Dolby, Jason Tsay, and Martin Hirzel.
arXiv:2205.01311 [cs.SE], May 2022.
-
Complex Python Features in the Wild
(pdf,
doi,
abstract),
Yi Yang, Ana Milanova, and Martin Hirzel.
Conference on Mining Software Repositories (MSR),
pages 282-293, May 2022.
-
Comments on Comments: Where Code Review and Documentation Meet
(pdf,
doi,
abstract,
arXiv),
Nikitha Rao, Jason Tsay, Martin Hirzel, and Vincent Hellendoorn.
Short paper at Conference on Mining Software Repositories (MSR),
pages 18-22, May 2022.
-
Sliding-Window Aggregation Algorithms
(pdf,
doi),
Kanat Tangwongsan, Martin Hirzel, and Scott Schneider.
Encyclopedia of Big Data Technologies, March 2022.
- Supersedes earlier version:
Sliding-Window Aggregation Algorithms
(pdf,
doi),
Kanat Tangwongsan, Martin Hirzel, and Scott Schneider.
Encyclopedia of Big Data Technologies, February 2018.
-
Pipeline Combinators for Gradual AutoML
(pdf,
@NeurIPS,
open-source code),
Guillaume Baudart, Martin Hirzel, Kiran Kate, Parikshit Ram,
Avraham Shinnar, and Jason Tsay.
Conference on Neural Information Processing Systems (NeurIPS),
pages 19705-19718, December 2021.
-
RASL: Relational Algebra in Scikit-Learn Pipelines
(pdf,
event,
OpenReview,
open-source code),
Chirag Sahni, Kiran Kate, Avraham Shinnar, Hoang Thanh Lam, and Martin Hirzel.
Workshop on Databases and AI (DBAI@NeurIPS),
December 2021.
-
Learning GraphQL Query Cost
(pdf,
doi,
abstract,
dataset),
Georgios Mavroudeas, Guillaume Baudart, Alan Cha, Martin Hirzel,
Jim A. Laredo, Malik Magdon-Ismail, Louis Mandel, and Erik Wittern.
Automated Software Engineering (ASE) Industry Track,
pages 1146-1150, November 2021.
- See also extended version:
Learning GraphQL Query Costs (Extended Version)
(pdf,
arXiv),
Georgios Mavroudeas, Guillaume Baudart, Alan Cha, Martin Hirzel,
Jim A. Laredo, Malik Magdon-Ismail, Louis Mandel, and Erik Wittern.
arXiv:2108.11139 [cs.SE], August 2021.
-
Towards Automating Code Review at Scale
(pdf,
doi,
abstract),
Vincent Hellendoorn, Jason Tsay, Manisha Mukherjee, and Martin Hirzel.
Symposium on the Foundations of Software Engineering: Ideas, Visions and Reflections Track (FSE-IVR),
pages 1479-1482, August 2021.
-
Finding Data Compatibility Bugs with JSON Subschema Checking
(pdf,
doi,
abstract,
open-source code),
Andrew Habib, Avraham Shinnar, Martin Hirzel, and Michael Pradel.
International Symposium on Software Testing and Analysis (ISSTA),
pages 620-632, July 2021.
- Winner of a Distinguished Artifact Award at ISSTA 2021.
- Supersedes earlier version:
Type Safety with JSON Subschema
(pdf,
arXiv),
Andrew Habib, Avraham Shinnar, Martin Hirzel, and Michael Pradel.
arXiv:1911.12651 [cs.PL], November 2019.
-
Compiling Stan to Generative Probabilistic Languages and Extension to Deep Probabilistic Programming
(pdf,
doi,
open-source code,
abstract),
Guillaume Baudart, Javier Burroni, Martin Hirzel, Louis Mandel, and Avraham Shinnar.
Conference on Programming Language Design and Implementation (PLDI),
pages 497-510, June 2021.
- Supersedes earlier version:
Compiling Stan to Generative Probabilistic Programming Languages
(poster),
Guillaume Baudart, Javier Burroni, Martin Hirzel, Kiran Kate, Louis Mandel, and Avraham Shinnar.
Poster at the International Conference on
Probabilistic Programming (PROBPROG), October 2020.
- Supersedes earlier version:
Extending Stan for Deep Probabilistic Programming,
Javier Burroni, Guillaume Baudart, Louis Mandel, Martin Hirzel, and Avraham Shinnar.
arXiv:1810.00873 [cs.AI], September 2018
(pdf,
arXiv)
-
In-Order Sliding-Window Aggregation in Worst-Case Constant Time
(pdf,
doi,
open-source code),
Kanat Tangwongsan, Martin Hirzel, and Scott Schneider.
The VLDB Journal, 30, pages 933-957, June 2021.
- Supersedes earlier version:
arXiv:2009.13768 [cs.DB], September 2020
(pdf,
arXiv).
-
Engineering Fair Machine Learning Pipelines
(pdf,
open-source code,
video),
Martin Hirzel, Kiran Kate, and Parikshit Ram.
Workshop on Responsible AI (RAI@ICLR), May 2021.
-
Extracting Hyperparameter Constraints from Code
(pdf,
contents),
Ingkarat Rak-amnouykit, Ana Milanova, Guillaume Baudart,
Martin Hirzel, and Julian Dolby.
Workshop on Security and Safety in Machine Learning Systems (SecML@ICLR),
May 2021.
-
Python 3 Types in the Wild: A Tale of Two Type Systems
(pdf,
doi,
abstract,
video),
Ingkarat Rak-amnouykit, Daniel McCrevan, Ana Milanova, Martin Hirzel,
and Julian Dolby.
Dynamic Languages Symposium (DLS),
pages 57-70, November 2020.
-
Lale: Consistent Automated Machine Learning
(pdf,
arXiv,
talk,
open-source code,
contents),
Guillaume Baudart, Martin Hirzel, Kiran Kate, Parikshit Ram,
and Avraham Shinnar.
KDD Workshop on Automation in Machine Learning (AutoML@KDD),
August 2020.
-
Mining Documentation to Extract Hyperparameter Schemas
(pdf,
arXiv,
talk,
contents),
Guillaume Baudart, Peter Kirchner, Martin Hirzel, and Kiran Kate.
ICML Workshop on Automated Machine Learning (AutoML@ICML),
July 2020.
- The resulting extracted schemas are available as open-source.
-
AIMMX: Artificial Intelligence Model Metadata Extractor
(pdf,
doi,
abstract,
video,
open-source code),
Jason Tsay, Alan Braz, Martin Hirzel, Todd Mummert, and Avraham Shinnar.
Conference on Mining Software Repositories (MSR),
pages 81-92, June 2020.
-
A semi-supervised Deep Learning Algorithm for Abnormal EEG Identification
(pdf,
arXiv,
contents),
Subhrajit Roy, Kiran Kate, and Martin Hirzel.
Machine Learning for Health Workshop at NeurIPS (ML4H),
December 2019.
- Supersedes earlier version:
arXiv:1903.07822v1 [cs.LG],
March 2019
(pdf,
arXiv).
-
Optimal and General Out-of-Order Sliding-Window Aggregation
(pdf,
@vldb,
open-source code),
Kanat Tangwongsan, Martin Hirzel, and Scott Schneider.
Conference on Very Large Data Bases (VLDB),
pages 1167-1180, August 2019.
- Supersedes earlier version:
Sub-O(log n) Out-of-Order Sliding-Window Aggregation,
Kanat Tangwongsan, Martin Hirzel, and Scott Schneider.
arXiv:1810.11308 [cs.DS, cs.DB], October 2018
(pdf,
arXiv).
-
Machine Learning in Python with No Strings Attached
(pdf,
doi,
talk,
open-source code),
Guillaume Baudart, Martin Hirzel, Kiran Kate, Louis Mandel, and Avraham Shinnar.
Workshop on Machine Learning and Programming Languages (MAPL),
pages 1-9, June 2019.
- Supersedes earlier version:
Yaps: Python Frontend to Stan,
Guillaume Baudart, Martin Hirzel, Kiran Kate, Louis Mandel, and
Avraham Shinnar. arXiv:1812.04125 [cs.AI], December 2018
(pdf,
arXiv)
-
Type-Driven Automated Learning with Lale
(pdf,
arXiv,
open-source code),
Martin Hirzel, Kiran Kate, Avraham Shinnar, Subhrajit Roy,
and Parikshit Ram.
arXiv:1906.03957 [cs.PL], May 2019.
-
Protecting Chatbots from Toxic Content
(pdf,
doi,
abstract),
Guillaume Baudart, Julian Dolby, Evelyn Duesterwald, Martin Hirzel,
and Avraham Shinnar.
Symposium on New Ideas, New Paradigms, and Reflections on Programming and Software (Onward!), pages 99-110, November 2018.
-
Reactive Chatbot Programming
(pdf,
doi,
abstract),
Guillaume Baudart, Martin Hirzel, Louis Mandel, Avraham Shinnar, and Jérôme Siméon.
Workshop on Reactive and Event-based Languages and Systems (REBLS),
pages 21-30, November 2018.
-
Probabilistic Reactive Programming
(open-source code),
Louis Mandel, Guillaume Baudart, Avraham Shinnar, Kiran Kate, and
Martin Hirzel.
Poster at the International Conference on
Probabilistic Programming (PROBPROG), October 2018.
-
Watertight Probabilistic Abstractions in Python,
Guillaume Baudart, Avraham Shinnar, Martin Hirzel, and Louis Mandel.
Poster at the International Conference on
Probabilistic Programming (PROBPROG), October 2018.
-
Dagstuhl Seminar on Big Stream Processing
(pdf,
sigmod record),
Sherif Sakr, Tilmann Rabl, Martin Hirzel, Paris Carbone, and Martin Strohbach.
SIGMOD Record, 47 (3), pages 36-39, September 2018.
-
Stream Processing Languages in the Big Data Era
(pdf,
sigmod record),
Martin Hirzel, Guillaume Baudart, Angela Bonifati, Emanuele Della Valle, Sherif Sakr, and Akrivi Vlachou.
SIGMOD Record, 47 (2), pages 29-40, June 2018.
-
Deep Probabilistic Programming Languages: A Qualitative Study
(pdf,
poster,
arXiv),
Guillaume Baudart, Martin Hirzel, and Louis Mandel.
arXiv:1804.06458 [cs.AI], April 2018.
- Also a poster
at the Workshop on Machine Learning and Programming Languages
(MAPL),
June 2018.
-
Continuous Queries
(pdf,
doi),
Martin Hirzel.
Encyclopedia of Big Data Technologies, May 2018.
-
Stream Processing Languages and Abstractions
(pdf,
doi),
Martin Hirzel and Guillaume Baudart.
Encyclopedia of Big Data Technologies, March 2018.
-
Stream Query Optimization
(pdf,
doi),
Martin Hirzel, Robert Soulé, Buğra Gedik, and Scott Schneider.
Encyclopedia of Big Data Technologies, March 2018.
-
Big Stream Processing Systems (Dagstuhl Seminar 17441)
(pdf,
doi),
Tilmann Rabl, Sherif Sakr, and Martin Hirzel.
Dagstuhl Reports, 7(10), pages 111-138, March 2018.
-
Runway: Machine Learning Model Experiment Management Tool
(pdf,
@sysml),
Jason Tsay, Todd Mummert, Norman Bobroff, Alan Braz, and Martin Hirzel.
SysML Conference, February 2018.
-
Bootstrapping Chatbots for Novel Domains
(pdf,
@lld),
Petr Babkin, Md. Faisal Mahbub Chowdhury, Alfio Gliozzo, Martin Hirzel,
and Avraham Shinnar.
Workshop at NIPS on Learning with Limited Labeled Data (LLD@NIPS), December 2017.
-
Generating Chat Bots from Web API Specifications
(pdf,
talk,
abstract),
Mandana Vaziri, Louis Mandel, Avraham Shinnar,
Jérôme Siméon, and Martin Hirzel.
Symposium on New Ideas, New Paradigms, and Reflections on Programming and Software (Onward!), pages 44-57, October 2017.
-
Experience Report: Prototyping a Query Compiler Using Coq
(pdf,
talk,
@acm,
open-source code),
Joshua Auerbach, Martin Hirzel, Louis Mandel, Avraham Shinnar, and
Jérôme Siméon.
International Conference on Functional Programming (ICFP),
pages 9:1-9:15, September 2017.
- Supersedes earlier version:
Prototyper un compilateur de requêtes avec Coq,
Joshua Auerbach, Martin Hirzel, Louis Mandel, Avraham Shinnar
et Jérôme Siméon.
Journées Francophones des Langages Applicatifs (JFLA),
January 2017 (pdf)
-
Low-Latency Sliding-Window Aggregation in Worst-Case Constant Time
(pdf,
talk,
@acm,
open-source code),
Kanat Tangwongsan, Martin Hirzel, and Scott Schneider.
Conference on Distributed Event-Based Systems (DEBS),
pages 66-77, June 2017.
- Winner,
Best Paper Award,
DEBS 2017.
- Supersedes earlier version:
IBM Research Report RC25574,
Kanat Tangwongsan, Martin Hirzel, and Scott Schneider.
11 November 2015
(pdf)
-
Tutorial: Sliding-Window Aggregation Algorithms
(pdf,
@acm),
Martin Hirzel, Scott Schneider, and Kanat Tangwongsan.
Conference on Distributed Event-Based Systems (DEBS),
pages 11-14, June 2017.
-
Handling Environments in a Nested Relational Algebra with Combinators
and an Implementation in a Verified Query Compiler
(pdf,
talk,
github,
@acm,
open-source code),
Joshua Auerbach, Martin Hirzel, Louis Mandel, Avraham Shinnar, and
Jérôme Siméon.
International Conference on Management of Data (SIGMOD),
pages 1555-1569, May 2017.
- Artifacts Evaluated - Reusable
- Results Replicated
-
Q*cert: A Platform for Implementing and Verifying Query Compilers
(pdf,
github,
@acm,
open-source code),
Joshua Auerbach, Martin Hirzel, Louis Mandel, Avraham Shinnar, and
Jérôme Siméon.
Demonstration at the International Conference on Management of Data (SIGMOD-Demo),
pages 1703-1706, May 2017.
-
I Can Parse You: Grammars for Dialogs
(pdf,
talk,
@LIPIcs),
Martin Hirzel, Louis Mandel, Avraham Shinnar,
Jérôme Siméon, and Mandana Vaziri.
Summit oN Advances in Programming Languages (SNAPL),
pages 6:1-6:15, May 2017.
-
SPL: An Extensible Language for Distributed Stream Processing
(pdf,
doi,
abstract,
open-source code),
Martin Hirzel, Scott Schneider, and Buğra Gedik.
Transactions on Programming Languages and Systems (TOPLAS),
39(1), pages 5:1-5:39, March 2017.
- Supersedes earlier version:
IBM Research Report RC25486,
Martin Hirzel, Scott Schneider, and Buğra Gedik. 30 July 2014
(pdf).
- SPL is the language for the IBM Streams product (see documentation).
-
River: An Intermediate Language for Stream Processing
(pdf,
abstract),
Robert Soulé, Martin Hirzel, Buğra Gedik, and Robert Grimm.
Software: Practice and Experience (SP&E), 46(7), pages 891-929, July 2016.
-
AQuA: Adaptive Quality Analytics
(pdf,
talk,
abstract),
Wei Zhang, Martin Hirzel, and David Grove.
Conference on Distributed Event-Based Systems (DEBS),
pages 169-180, June 2016.
-
Spreadsheets for Stream Processing with Unbounded Windows and Partitions
(pdf,
talk,
abstract),
Martin Hirzel, Rodric Rabbah, Philippe Suter, Olivier Tardieu, and
Mandana Vaziri.
Conference on Distributed Event-Based Systems (DEBS),
pages 49-60, June 2016.
-
Virtual Lightweight Snapshots for Consistent Analytics in NoSQL Stores
(pdf,
talk,
abstract),
Fernando Chirigati, Jérôme Siméon, Martin Hirzel, and Juliana Freire.
International Conference on Data Engineering (ICDE), Industrial Track, pages 1310-1321, May 2016.
-
META: Middleware for Events, Transactions, and Analytics
(pdf,
abstract),
Matthew Arnold, David Grove, Benjamin Herta, Michael Hind, Martin Hirzel,
Arun Iyengar, Louis Mandel, Vijay A. Saraswat, Avraham Shinnar,
Jérôme Siméon, Mikio Takeuchi, Olivier Tardieu, and
Wei Zhang.
IBM Journal of Research and Development (IBMRD),
60(2-3), pages 15:1-15:10, March-May 2016.
-
Language Runtime and Optimizations in IBM Streams
(pdf,
abstract,
open-source code),
Scott Schneider, Buğra Gedik, and Martin Hirzel.
IEEE Database Engineering Bulletin,
38(4), pages 61-72, December 2015.
-
General Incremental Sliding-Window Aggregation
(pdf,
talk,
@vldb,
open-source code),
Kanat Tangwongsan, Martin Hirzel, Scott Schneider, and Kun-Lung Wu.
Conference on Very Large Data Bases (VLDB),
pages 702-713, September 2015.
-
Debugging Mixed-Environment Programs with Blink
(pdf,
abstract),
Byeongcheol Lee, Martin Hirzel, Robert Grimm, and Kathryn S. McKinley.
Software: Practice and Experience (SP&E),
45(9), pages 1277-1306, September 2015.
-
A Pattern Calculus for Rule Languages: Expressiveness, Compilation, and Mechanization
(pdf,
talk,
abstract,
artifact),
Avraham Shinnar, Jérôme Siméon, and Martin Hirzel.
European Conference on Object-Oriented Programming (ECOOP), pages 542-567, July 2015.
-
Spreadsheets for Stream Partitions and Windows
(pdf,
talk,
proceedings,
arXiv),
Martin Hirzel, Rodric Rabbah, Philippe Suter, Olivier Tardieu, and Mandana Vaziri.
Workshop on Software Engineering Methods in Spreadsheets (SEMS@ICSE),
pages 39-40, May 2015.
-
Safe Data Parallelism for General Streaming
(pdf,
abstract),
Scott Schneider, Martin Hirzel, Buğra Gedik, and Kun-Lung Wu.
IEEE Transactions on Computers (TC),
64(2), pages 504-517, February 2015.
-
Event Processing over a Distributed JSON Store: Design and Performance
(pdf,
talk,
abstract),
Miki Enoki, Jérôme Siméon, Hiroshi Horii, and Martin Hirzel.
Conference on Web Information System Engineering (WISE),
pages 395-404, October 2014.
-
Stream Processing with a Spreadsheet
(pdf,
talk,
abstract),
Mandana Vaziri, Olivier Tardieu, Rodric Rabbah, Philippe Suter,
and Martin Hirzel.
European Conference on Object-Oriented Programming (ECOOP),
pages 360-384, July 2014.
-
Elastic Scaling for Data Stream Processing
(pdf,
abstract),
Buğra Gedik, Scott Schneider, Martin Hirzel, and Kun-Lung Wu.
Transactions on Parallel and Distributed Systems (TPDS), 25(6),
pages 1447-1463, June 2014.
-
A Catalog of Stream Processing Optimizations
(pdf,
abstract),
Martin Hirzel, Robert Soulé, Scott Schneider, Buğra Gedik,
and Robert Grimm.
ACM Computing Surveys (CSUR), 46(4), April 2014.
- Supersedes earlier version:
IBM Research Report RC25215,
Martin Hirzel, Robert Soulé, Scott Schneider, Buğra Gedik,
and Robert Grimm. 28 September 2011
(pdf).
-
Testing Properties of Dataflow Program Operators
(pdf,
talk),
Zhihong Xu, Martin Hirzel, Gregg Rothermel, and Kun-Lung Wu.
Conference on Automated Software Engineering (ASE),
pages 103-113, November 2013.
-
Semantic Characterization of MapReduce Workloads
(pdf,
talk),
Zhihong Xu, Martin Hirzel, and Gregg Rothermel.
International Symposium on Workload Characterization (IISWC),
pages 87-97, September 2013.
-
Dynamic Expressivity with Static Optimization for Streaming Languages
(pdf,
talk),
Robert Soulé, Michael I. Gordon, Saman Amarasinghe, Robert Grimm, and Martin Hirzel.
Conference on Distributed Event-Based Systems (DEBS),
pages 159-170, June 2013.
-
Tutorial: Stream Processing Optimizations
(pdf,
talk),
Scott Schneider, Buğra Gedik, and Martin Hirzel.
Conference on Distributed Event-Based Systems (DEBS),
pages 249-258, June 2013.
-
IBM Streams Processing Language: Analyzing Big Data in Motion
(pdf,
abstract,
open-source code),
Martin Hirzel, Henrique Andrade, Buğra Gedik,
Gabriela Jacques-Silva, Rohit Khandekar, Vibhore Kumar, Mark Mendell,
Howard Nasgaard, Scott Schneider, Robert Soulé, and Kun-Lung Wu.
IBM Journal of Research and Development (IBMRD),
57(3-4), pages 7:1-7:11, May-July 2013.
- SPL is the language for the IBM Streams product (see documentation).
-
Robust Scripting via Patterns
(pdf,
talk),
Bard Bloom and Martin Hirzel.
Dynamic Languages Symposium (DLS), pages 29-40,
October 2012.
-
Auto-Parallelizing Stateful Distributed Streaming Applications
(pdf,
talk),
Scott Schneider, Martin Hirzel, Buğra Gedik, and Kun-Lung Wu.
International Conference on Parallel Architectures and Compilation Techniques (PACT),
pages 53-64, September 2012.
-
From a Calculus to an Execution Environment for Stream Processing
(pdf,
talk),
Robert Soulé, Martin Hirzel, Buğra Gedik, and Robert Grimm.
Conference on Distributed Event-Based Systems (DEBS),
pages 20-31, July 2012.
-
Partition and Compose: Parallel Complex Event Processing
(pdf,
talk,
award talk),
Martin Hirzel.
Conference on Distributed Event-Based Systems (DEBS),
pages 191-200, July 2012.
-
CAPSULE: Language and System Support for Efficient State Sharing in Distributed Stream Processing Systems
(pdf,
talk),
Giuliano Losa, Vibhore Kumar, Henrique Andrade, Buğra Gedik, Martin Hirzel, Robert Soulé, and Kun-Lung Wu.
Conference on Distributed Event-Based Systems (DEBS), Industrial Track,
pages 268-277, July 2012.
-
Marco: Safe, Expressive Macros for any Language
(pdf,
talk),
Byeongcheol Lee, Robert Grimm, Martin Hirzel, and Kathryn S. McKinley.
European Conference on Object-Oriented Programming (ECOOP),
LNCS 7313 pages 589-613, June 2012.
-
Extending a General-Purpose Streaming System for XML
(pdf,
talk,
open-source code),
Mark Mendell, Howard Nasgaard, Eric Bouillet, Martin Hirzel, and Buğra Gedik.
International Conference on Extending Database Technology (EDBT), Industrial Track,
pages 534-539, March 2012.
-
Streams that Compose using Macros that Oblige
(pdf,
talk),
Martin Hirzel and Buğra Gedik.
Workshop on Partial Evaluation and Program Manipulation (PEPM),
pages 141-150, January 2012.
- Streams Processing Language (SPL) reference
(html),
IBM Streams product documentation, 15 April 2011.
- Supersedes earlier version:
IBM Research Report RC24897,
Martin Hirzel, Henrique Andrade, Buğra Gedik, Vibhore Kumar,
Giuliano Losa, Mark Mendell, Howard Nasgaard, Robert Soulé,
and Kun-Lung Wu. 5 November 2009
(pdf).
- Supersedes earlier version:
IBM Research Report RC24830, 29 July 2009.
- Supersedes earlier version:
IBM Research Report RC24760, 11 March 2009.
- From a Stream of Relational Queries to Distributed Stream Processing
(pdf,
talk),
Qiong Zou, Huayong Wang, Robert Soulé,
Martin Hirzel, Henrique Andrade,
Buğra Gedik, and
Kun-Lung Wu.
Conference on Very Large Data Bases (VLDB), Industrial Track, pages 1394-1405, September 2010.
- Synthesizing Dynamic Bug Detectors for Foreign Language Interfaces
(pdf,
talk,
software),
Byeongcheol Lee, Ben Wiedermann, Martin Hirzel, Robert Grimm, and Kathryn S. McKinley.
Conference on Programming Language Design and Implementation (PLDI), pages 36-49, June 2010.
-
Z-Rays: Divide Arrays and Conquer Speed and Flexibility
(pdf,
talk),
Jennifer B. Sartor, Stephen M. Blackburn, Daniel Frampton, Martin Hirzel, and Kathryn S. McKinley.
Conference on Programming Language Design and Implementation (PLDI), pages 471-482, June 2010.
- Dualities in Programming Languages
(pdf,
talk,
abstract),
Martin Hirzel and Priya Nagpurkar.
Fun and Interesting Thoughts (FIT) Session at PLDI, June 2010.
-
A Universal Calculus for Stream Processing Languages
(pdf,
talk),
Robert Soulé, Martin Hirzel, Robert Grimm, Buğra Gedik, Henrique Andrade, Vibhore Kumar, and Kun-Lung Wu.
European Symposium on Programming (ESOP), LNCS 6012 pages 507-528, March 2010.
- Extended version available as “NYU Technical Report TR2010-924”
(pdf),
Department of Computer Science, New York University,
March 2010.
-
Debug All Your Code: Portable Mixed-Environment Debugging
(pdf,
talk)
Byeongcheol Lee, Martin Hirzel, Robert Grimm, and Kathryn McKinley.
Object-Oriented Programming, Systems, Languages, and Applications (OOPSLA), pages 207-225, October 2009.
-
Efficient Memory Management for Long-Lived Objects
(pdf,
@ibm),
Ronny Morad, Martin Hirzel, Elliot K. Kolodner, and Mooly Sagiv.
IBM Research Report RC24794, 7 May 2009.
-
Wake Up and Smell the Coffee: Evaluation Methodology for the 21st Century
(pdf,
@acm),
Stephen M. Blackburn, Kathryn S. McKinley, Robin Garner, Chris Hoffmann, Asjad M. Khan, Rotem Bentzur, Amer Diwan, Daniel Feinberg, Daniel Frampton, Samuel Z. Guyer, Martin Hirzel, Antony Hosking, Maria Jump, Han Lee, J. Eliot B. Moss, Aashish Phansalkar, Darko Stefanović, Thomas VanDrunen, Daniel von Dincklage, Ben Wiedermann.
Communications of the ACM (CACM) Research Highlights (Invited),
51(8), pages 83-89, August 2008.
-
Online Phase-Adaptive Data Layout Selection
(pdf,
talk,
abstract),
Chengliang Zhang and Martin Hirzel.
European Conference on Object-Oriented Programming (ECOOP),
LNCS 5142 pages 309-334, July 2008.
-
No Bit Left Behind: The Limits of Heap Data Compression
(pdf,
talk,
abstract),
Jennifer B. Sartor, Martin Hirzel, and Kathryn S. McKinley.
International Symposium on Memory Management (ISMM), pages 111-120,
June 2008.
- Winner, Best Presentation Award (First Place), ISMM 2008.
- Extended version available as “Technical Report TR-08-17”
(pdf),
Department of Computer Science, University of Texas at Austin,
June 2008.
-
Matchete: Paths through the Pattern Matching Jungle
(pdf,
talk,
abstract),
Martin Hirzel, Nathaniel Nystrom, Bard Bloom, and Jan Vitek.
International Symposium on Practical Aspects of Declarative Languages (PADL),
LNCS 4902 pages 150-166, January 2008.
-
Jeannie: Granting Java Native Interface Developers their Wishes
(pdf,
talk,
abstract),
Martin Hirzel and Robert Grimm.
Object-Oriented Programming, Systems, Languages, and Applications (OOPSLA), pages 19-38, October 2007.
-
Data Layouts for Object-Oriented Programs
(pdf,
talk,
abstract),
Martin Hirzel.
International Conference on Measurement and Modeling of Computer Systems (SIGMETRICS), pages 265-276, June 2007.
- Extended version available as “IBM Research Report RC24218”
(pdf),
Computer Science, IBM Research Division,
January 2007.
-
Fast Online Pointer Analysis
(pdf,
abstract),
Martin Hirzel, Daniel von Dincklage, Amer Diwan, and Michael Hind.
Transactions on Programming Languages and Systems (TOPLAS),
29(2), April 2007.
-
The DaCapo Benchmarks: Java Benchmarking Development and Analysis
(pdf,
org,
abstract),
Stephen M. Blackburn, Robin Garner, Chris Hoffmann, Asjad M. Khan, Kathryn S. McKinley, Rotem Bentzur, Amer Diwan, Daniel Feinberg, Daniel Frampton, Samuel Z. Guyer, Martin Hirzel, Antony Hosking, Maria Jump, Han Lee, J. Eliot B. Moss, Aashish Phansalkar, Darko Stefanović, Thomas VanDrunen, Daniel von Dincklage, Ben Wiedermann.
Object-Oriented Programming, Systems, Languages, and Applications
(OOPSLA), pages 169-190, October 2006.
-
Improving Locality with Parallel Hierarchical Copying GC
(pdf,
talk,
abstract),
David Siegwart and Martin Hirzel.
International Symposium on Memory Management (ISMM), pages 52-63,
June 2006.
-
Parley: Federated Virtual Machines
(slides),
Perry Cheng, Dave Grove, Martin Hirzel, Rob O'Callahan, and Nikhil Swamy.
Invitational Workshop on the Future of Virtual Execution Environments, hosted by IBM Research, September 2004.
-
Connectivity-Based Garbage Collection
(pdf,
pdf-2up,
abstract),
Martin Hirzel. Ph.D. Dissertation Thesis, Department of Computer Science,
University of Colorado at Boulder, July 2004.
-
Pointer Analysis in the Presence of Dynamic Class Loading
(pdf,
pdf-2up,
talk,
abstract),
Martin Hirzel, Amer Diwan, and Michael Hind.
European Conference on Object-Oriented Programming (ECOOP),
LNCS 3086 pages 96-122, June 2004.
-
Connectivity-Based Garbage Collection
(pdf,
abstract),
Martin Hirzel, Amer Diwan, and Matthew Hertz.
Object-Oriented Programming, Systems, Languages, and Applications
(OOPSLA), pages 359-373, October 2003.
-
Choosing a Set of Partitions to Collect in a Connectivity-Based Garbage Collector
(pdf),
Martin Hirzel, Harold N. Gabow, and Amer Diwan. University of
Colorado Technical Report CU-CS-958-03, August 2003.
-
On the Usefulness of Type and Liveness Accuracy for Garbage Collection and Leak Detection
(pdf,
pdf-2up,
abstract),
Martin Hirzel, Amer Diwan, and Johannes Henkel.
Transactions on Programming Languages and Systems (TOPLAS),
24(6), pages 593-624, November 2002.
-
Understanding the Connectivity of Heap Objects
(pdf,
talk,
abstract),
Martin Hirzel, Johannes Henkel, Amer Diwan, and Michael Hind.
International Symposium on Memory Management (ISMM), pages 36-49,
June 2002.
-
Dynamic Hot Data Stream Prefetching for General-Purpose Programs
(pdf,
abstract),
Trishul M. Chilimbi and Martin Hirzel. Conference on Programming
Language Design and Implementation (PLDI), pages 199-209, June 2002.
-
Bursty Tracing: A Framework for Low-Overhead Temporal Profiling
(pdf,
talk,
@fddo),
Martin Hirzel and Trishul M. Chilimbi. 4th Workshop on
Feedback-Directed and Dynamic Optimization (FDDO),
pages 117-126, December 2001.
-
On the Usefulness of Liveness for Garbage Collection and Leak Detection
(pdf,
pdf-2up,
talk,
abstract),
Martin Hirzel, Amer Diwan, and Antony Hosking.
European Conference on Object-Oriented Programming (ECOOP),
LNCS 2072 pages 181-206, June 2001.
-
On the Type Accuracy of Garbage Collection
(pdf,
ps,
abstract),
Martin Hirzel and Amer Diwan. International Symposium on
Memory Management (ISMM), pages 1-11, October 2000.
-
Effectiveness of Garbage Collection and Explicit Deallocation
(pdf,
@boulder),
Martin Hirzel. Master's Thesis, Department of Computer Science,
University of Colorado at Boulder, May 2000.
Awards
Paper awards:
Professional society advanced member grades:
Patents
-
Repairing machine learning pipelines,
Julian Dolby, Jason Tsay, and Martin Hirzel.
US Patent 11,868,166, January 2024.
-
Automatic conversion of data within data pipeline,
Kiran Kate, Martin Hirzel, and Avraham Shinnar.
US Patent 11,720,586, August 2023.
-
Schema-based machine-learning model task deduction,
Alan Braz, Martin Hirzel, Avraham Shinnar, Jason Tsay, and Todd Mummert.
US Patent 11,599,357, March 2023.
-
Guiding machine learning models and related components,
Norman Bobroff, Alan Braz, Martin Hirzel, Todd Mummert, and Peter Westerink.
US Patent 11,537,932, December 2022.
-
Reducing semantic errors in code generated by machine learning models,
Julian Dolby, Martin Hirzel, Kiran Kate, Louis Mandel, Avraham Shinnar,
and Kavitha Srinivas.
US Patent 11,507,352, November 2022.
-
Database algebra and compiler with environments,
Martin Hirzel, Louis Mandel, Avraham Shinnar, and Jerome Simeon.
US Patent 10,983,966, April 2021.
-
Configurable and non-invasive protection of private information for conversational agents,
Guillaume Baudart, Evelyn Duesterwald, Martin Hirzel, Avraham Shinnar, and Julian Dolby.
US Patent 10,977,385, April 2021.
-
Conversational optimization of cognitive models,
Martin Hirzel, Harold L. Ossher, David J. Piorkowski, and Peri Tarr.
US Patent 10,810,994, October 2020.
-
Specifying a conversational computer agent and its outcome with a grammar,
Martin Hirzel, Louis Mandel, Avraham Shinnar, Jerome Simeon, and Mandana Vaziri.
US Patent 10,796,088, October 2020.
-
Spreadsheet compiler for stream processing,
Martin Hirzel, Rodric Rabbah, Philippe Suter, Olivier Tardieu,
and Mandana Vaziri.
US Patent 10,691,881, June 2020.
-
Interpreting conversational authoring of information models,
Martin Hirzel, Avraham E. Shinnar, and Jerome Simeon.
US Patent 10,679,000, June 2020.
-
Personalization of conversational agents through macro recording,
Martin Hirzel, Louis Mandel, Avraham E. Shinnar, Jerome Simeon, and
Mandana Vaziri.
US Patent 10,553,204, February 2020.
-
Conversational authoring of event processing application,
Martin Hirzel, Avraham E. Shinnar, and Jerome Simeon.
US Patent 10,552,543, February 2020.
-
Information flow analysis for conversational agents,
Matthew Arnold, Julian Dolby, and Martin Hirzel.
US Patent 10,423,873, September 2019.
-
Generating chat bots from web API specifications,
Martin Hirzel, Louis Mandel, Avraham Shinnar, Jérôme
Siméon, Mandana Vaziri, and Charles Wiecha.
US Patent 10,388,285, August 2019.
-
Encoding a spreadsheet program for stream processing,
Martin Hirzel, Rodric Rabbah, Philippe Suter, Olivier Tardieu,
and Mandana Vaziri.
US Patent 10,255,262, April 2019.
-
Quality-directed adaptive analytic retraining,
David Grove, Martin Hirzel, and Wei Zhang.
US Patent 10,163,061, December 2018.
-
Discovering windows in temporal predicates,
Martin Hirzel, Christopher Hyland, and Nicolas Ke.
US Patent 10,127,903, November 2018.
-
Global production rules for distributed data,
Matthew Arnold, Martin Hirzel, Vijay Saraswat, Avraham Shinnar,
Jérôme Siméon, and Lionel Villard.
US Patent 9,584,358, February 2017.
-
Stream-enabled spreadsheet as a circuit,
Martin Hirzel, Rodric Rabbah, Olivier Tardieu, and Mandana Vaziri.
US Patent 9,569,418, February 2017.
-
System and method for using development objectives to guide implementation of source code,
Andrew L. Frenkiel and Martin Hirzel.
US Patent 9,361,068, June 2016.
-
Automatic exploitation of data parallelism in streaming applications,
Bugra Gedik, Martin Hirzel, Scott A. Schneider, and Kun-Lung Wu.
US patent 9,170,794, October 2015.
-
Incremental aggregation-based event pattern matching,
Martin Hirzel.
US patent 9,158,824, October 2015.
-
Elastic auto-parallelization for stream processing applications based on a measured throughput and congestion,
Bugra Gedik, Martin Hirzel, Scott A. Schneider, and Kun-Lung Wu.
US patent 9,106,391, August 2015.
-
Automating the creation of an application provisioning model,
Henrique Andrade, Judah M. Diament, Andrew L. Frenkiel, Bugra Gedik,
Martin Hirzel, and Anton V. Riabov.
US patent 8,589,864, November 2013.
-
Data sharing in a stream processing system,
Henrique Andrade, Bugra Gedik, Martin Hirzel, Vibhore Kumar, Giuliano Losa, Robert Soule, and Kun-Lung Wu.
US patent 8,560,602, October 2013.
-
Virtual execution environment for streaming languages,
Henrique Andrade, Jim Challenger, Bugra Gedik, Robert Grimm, Martin Hirzel, Vibhore Kumar, Robert Soule, and Kun-Lung Wu.
US patent 8,499,292, July 2013.
-
Transforming relational queries into stream processing,
Henrique Andrade, Bugra Gedik, Martin Hirzel, Robert Soule, Hua Yong Wang, Kun-Lung Wu, and Qiong Zou.
US patent 8,326,821, December 2012.
-
Proxying open database connectivity (ODBC) calls,
Henrique Andrade, Bugra Gedik, Martin Hirzel, Robert Soule, Huayong Wang, Kun-Lung Wu, and Qiong Zou.
US patent 8,321,443, November 2012.
-
State sharing in a distributed data stream processing system,
Henrique Andrade, Bugra Gedik, Martin Hirzel, Vibhore Kumar, Giuliano Losa, Robert Soule, and Kun-Lung Wu.
US patent 8,285,780, October 2012.
-
Improving locality with parallel hierarchical copying garbage collection,
David K. Siegwart and Martin Hirzel.
US patent 8,108,448, January 2012.
-
Dynamic prefetching of hot data streams,
Trishul Chilimbi and Martin Hirzel.
US patent 8,046,752, October 2011.
-
Method and system for determining optimal data layout using blind justice,
Martin Hirzel and Chengliang Zhang.
US patent 7,805,580, September 2010.
-
Dynamic temporal optimization framework,
Trishul Chilimbi and Martin Hirzel.
US patent 7,140,008, November 2006.
University classes taught
- Compiler Construction
(home),
Taught together with Kristoffer H. Rose.
Cap-stone graduate class, enrollment of 6 students.
New York University (NYU), number CSCI-GA.2130-001, Fall 2013.
- Scripting Languages
(home),
Special Topics in Computer Science graduate class, enrollment of 45 students.
New York University (NYU), number CSCI-GA.3033-003, Summer 2012.
- Compiler Construction
(home),
Cap-stone graduate class, enrollment of 16 students.
New York University (NYU), number CSCI-GA.2130-001, Fall 2011.
- Scripting Languages
(home),
Special Topics in Computer Science graduate class, enrollment of 12 students.
New York University (NYU), number G22.3033-002, Summer 2008.
-
Programming Languages
(home),
Entry-level graduate class, enrollment of 15 students.
New York University (NYU), number G22.2110-001, Summer 2007.
-
Programming Languages
(home),
Entry-level graduate class, enrollment of 27 students.
New York University (NYU), number G22.2110-001, Spring 2007.
-
Principles of Programming Languages
(blurb),
Third-year undergraduate class, enrollment of 128 students.
University of Colorado at Boulder (CU), number CSCI 3155, Fall 2002.
Miscellaneous other content
- Jeannie user guide
(html, pdf) and open source compiler contributed to xtc.
- Über formal unentscheidbare Sätze der Principia Mathematica und verwandter Systeme I (On formally undecidable propositions of Principia Mathematica and related systems I) translation of Kurt Gödel's theorem and proof from 1931
(pdf,
errata).
Copyright notices
- Papers published by the Association for Computing Machinery
(ACM) are Copyright © by the Association for Computing
Machinery, Inc. Permission to make digital or hard copies of part
or all of this work for personal or classroom use is granted
without fee provided that copies are not made or distributed for
profit or commercial advantage and that copies bear this notice
and the full citation on the first page. Copyrights for
components of this work owned by others than ACM must be
honored. Abstracting with credit is permitted. To copy otherwise,
to republish, to post on servers, or to redistribute to lists,
requires prior specific permission and/or a fee. Request
permissions from Publications Dept, ACM Inc., fax +1 (212)
869-0481, or permissions@acm.org.
- Papers published by the Institute of Electrical and Electronics
Engineers, Inc. (IEEE) are Copyright © by IEEE. Personal use
of this material is permitted. However, permission to
reprint/republish this material for advertising or promotional
purposes or for creating new collective works for resale or
redistribution to servers or lists, or to reuse any copyrighted
component of this work in other works must be obtained from the
IEEE.
- Papers published in the Lecture Notes
in Computer Science (LNCS), which include the proceedings for
ECOOP, ESOP, and PADL, are Copyright © by Springer-Verlag.
http://hirzels.com/martin/