On Leveraging Tests to Infer Nullable Annotations

Dietrich, Jens; Pearce, David J.; Chandramohan, Mahin

doi:10.4230/LIPIcs.ECOOP.2023.10

Abstract

Issues related to the dereferencing of null pointers are a pervasive and widely studied problem, and numerous static analyses have been proposed for this purpose. These are typically based on dataflow analysis, and take advantage of annotations indicating whether a type is nullable or not. The presence of such annotations can significantly improve the accuracy of null checkers. However, most code found in the wild is not annotated, and tools must fall back on default assumptions, leading to both false positives and false negatives. Manually annotating code is a laborious task and requires deep knowledge of how a program interacts with clients and components. We propose to infer nullable annotations from an analysis of existing test cases. For this purpose, we execute instrumented tests and capture nullable API interactions. Those recorded interactions are then refined (santitised and propagated) in order to improve their precision and recall. We evaluate our approach on seven projects from the spring ecosystems and two google projects which have been extensively manually annotated with thousands of @Nullable annotations. We find that our approach has a high precision, and can find around half of the existing @Nullable annotations. This suggests that the method proposed is useful to mechanise a significant part of the very labour-intensive annotation task.

A tool to detect bugs in Java and C/C++/Objective-C code before it ships. URL: https://fbinfer.com/.
JetBrain Developer Ecosystem Survey 2021, 2021. URL: https://www.jetbrains.com/lp/devecosystem-2021/java/.
C. Andreae, J. Noble, S. Markstrum, and T. Millstein. A framework for implementing pluggable type systems. In Proc. OOPSLA, pages 57-74, 2006.
Nathaniel Ayewah and William Pugh. Null dereference analysis in practice. In Proc. PASTE, pages 65-72. ACM Press, 2010.
Subarno Banerjee, Lazaro Clapp, and Manu Sridharan. Nullaway: Practical type-based null safety for java. In Proc. ESEC/FSE'19, pages 740-750. ACM, 2019.
Al Bessey, Ken Block, Ben Chelf, Andy Chou, Bryan Fulton, Seth Hallem, Charles Henri-Gros, Asya Kamsky, Scott McPeak, and Dawson Engler. A few billion lines of code later: using static analysis to find bugs in the real world. Communications of the ACM, 53(2):66-75, 2010.
Eric Bodden, Andreas Sewe, Jan Sinschek, Hela Oueslati, and Mira Mezini. Taming reflection: Aiding static analysis in the presence of reflection and custom class loaders. In Proc. ICSE'11, pages 241-250. IEEE, 2011.
Mehdi Bouaziz, Francesco Logozzo, and Manuel Fähndrich. Inference of necessary field conditions with abstract interpretation. In Proc. APLAS, pages 173-189. Springer-Verlag, 2012.
Gilad Bracha. Pluggable type systems. In Proc. Workshop on Revival of Dynamic Languages, 2004.
Dan Brotherston, Werner Dietl, and Ondrej Lhoták. Granullar: gradual nullable types for java. In Proc. CC, pages 87-97. ACM Press, 2017.
Eric Bruneton, Romain Lenglet, and Thierry Coupaye. Asm: a code manipulation tool to implement adaptable systems. Adaptable and extensible component systems, 30(19), 2002.
Cristiano Calcagno, Dino Distefano, Jérémy Dubreil, Dominik Gabi, Pieter Hooimeijer, Martino Luca, Peter W. O'Hearn, Irene Papakonstantinou, Jim Purbrick, and Dulma Rodriguez. Moving fast with software verification. In Proc. NFM, pages 3-11. Springer-Verlag, 2015.
Patrice Chalin and Perry R James. Non-null references by default in java: Alleviating the nullity annotation burden. In European Conference on Object-Oriented Programming, pages 227-247. Springer, 2007.
Patrice Chalin, Perry R James, and Frédéric Rioux. Reducing the use of nullable types through non-null by default and monotonic non-null. IET Software, 2(6):515-531, 2008.
Brian Chin, Shane Markstrum, and Todd Millstein. Semantic type qualifiers. In Proc. PLDI, pages 85-95. ACM Press, 2005.
Brian Chin, Shane Markstrum, Todd Millstein, and Jens Palsberg. Inference of user-defined type qualifiers and qualifier rules. In Proc. ESOP, 2006.
Jens Dietrich, David J Pearce, Kamil Jezek, and Premek Brada. Contracts in the wild: A study of java programs. In Proc. ECOOP'17. Schloss Dagstuhl-Leibniz-Zentrum fuer Informatik, 2017.
Jens Dietrich, Henrik Schole, Li Sui, and Ewan Tempero. XCorpus - An executable corpus of Java programs. Journal of Object Technology, 16(4):1:1-24, August 2017. URL: https://doi.org/10.5381/jot.2017.16.4.a1.
Dino Distefano, Manuel Fähndrich, Francesco Logozzo, and Peter W. O'Hearn. Scaling static analyses at facebook. CACM, 62(8):62-70, 2019.
Kinga Dobolyi and Westley Weimer. Changing Java’s semantics for handling null pointer exceptions. In Proc. ISSRE, pages 47-56. IEEE Computer Society, 2008.
T. Ekman and G. Hedin. Pluggable checking and inferencing of non-null types for Java. JOT, 6(9):455-475, 2007.
Sam Estep, Jenna Wise, Jonathan Aldrich, Éric Tanter, Johannes Bader, and Joshua Sunshine. Gradual program analysis for null pointers. In European Conference on Object-Oriented Programming (ECOOP), pages 3:1-3:25, 2021.
Manuel Fähndrich and K Rustan M Leino. Declaring and checking non-null types in an object-oriented language. In Proceedings of the 18th annual ACM SIGPLAN conference on Object-oriented programing, systems, languages, and applications, pages 302-312, 2003.
C. Flanagan, K. Leino, M. Lillibridge, G. Nelson, J. B. Saxe, and R. Stata. Extended static checking for Java. In Proc. PLDI, pages 234-245, 2002.
Cormac Flanagan and K Rustan M Leino. Houdini, an annotation assistant for esc/java. In International Symposium of Formal Methods Europe, pages 500-517. Springer, 2001.
Jeffrey S. Foster, Manuel Fähndrich, and Alexander Aiken. A theory of type qualifiers. In Proc. PLDI, pages 192-203, 1999.
Jeffrey S. Foster, Tachio Terauchi, and Alex Aiken. Flow-sensitive type qualifiers. In Proc. PLDI, pages 1-12, 2002.
Gordon Fraser and Andrea Arcuri. Evosuite: automatic test suite generation for object-oriented software. In Proc. FSE'11, pages 416-419, 2011.
Maria Anna G. Gaitani, Vassilis Zafeiris, N. A. Diamantidis, and Emmanouel A. Giakoumakis. Automated refactoring to the null object design pattern. Inf. Softw. Technol, 59, 2015.
Adele Goldberg and David Robson. Smalltalk-80: the language and its implementation. Addison-Wesley Longman Publishing Co., Inc., 1983.
Neville Grech, George Fourtounis, Adrian Francalanza, and Yannis Smaragdakis. Heaps don't lie: countering unsoundness with heap snapshots. In Proc. OOPSLA'17, pages 1-27. ACM, 2017.
Warren Harrison. Eating your own dog food. IEEE Software, 23(3):5-7, 2006.
David Hovemeyer, Jaime Spacco, and William Pugh. Evaluating and tuning a static analysis to find null pointer bugs. In Proc. PASTE, pages 13-19. ACM Press, 2005.
Laurent Hubert. A non-null annotation inferencer for java bytecode. In Proceedings of the 8th ACM SIGPLAN-SIGSOFT workshop on Program analysis for software tools and engineering, pages 36-42, 2008.
Laurent Hubert, Thomas Jensen, and David Pichardie. Semantic foundations and inference of non-null annotations. In Proceedings of the International conference on Formal Methods for Open Object-Based Distributed Systems (FMOODS), pages 132-149. Springer-Verlag, 2008.
Laurent Hubert and David Pichardie. Soundly handling static fields: Issues, semantics and analysis. ENTCS, 253(5):15-30, 2009.
Shuhei Kimura, Keisuke Hotta, Yoshiki Higo, Hiroshi Igaki, and Shinji Kusumoto. Does return null matter? In Conference on Software Maintenance, Reengineering, and Reverse Engineering (CSMR-WCRE), pages 244-253. IEEE, 2014.
Alexander Kogtenkov. Practical void safety. In Proc. VSTTE, pages 132-151. Springer-Verlag, 2017.
Manuel Leuenberger, Haidar Osman, Mohammad Ghafari, and Oscar Nierstrasz. Harvesting the wisdom of the crowd to infer method nullness in java. In 2017 IEEE 17th International Working Conference on Source Code Analysis and Manipulation (SCAM), pages 71-80. IEEE, 2017.
Zhenmin Li, Lin Tan, Xuanhui Wang, Shan Lu, Yuanyuan Zhou, and Chengxiang Zhai. Have things changed now?: an empirical study of bug characteristics in modern open source software. In Proceedings of the Workshop on Architectural and System Support for Improving Software Dependability (ASID), pages 25-33. ACM Press, 2006.
Barbara H Liskov and Jeannette M Wing. A behavioral notion of subtyping. ACM Transactions on Programming Languages and Systems (TOPLAS), 16(6):1811-1841, 1994.
Fengyun Liu, Ondrej Lhoták, Aggelos Biboudis, Paolo G. Giarrusso, and Martin Odersky. A type-and-effect system for object initialization. In Proc. OOPSLA, pages 175:1-175:28, 2020.
Fengyun Liu, Ondrej Lhoták, Enze Xing, and Nguyen Cao Pham. Safe object initialization, abstractly. In Proceedings of the Symposium on Scala, pages 33-43. ACM Press, 2021.
Magnus Madsen and Jaco van de Pol. Relational nullable types with boolean unification. In Proc. OOPSLA, pages 1-28, 2021.
C. Male, D.J. Pearce, A. Potanin, and C. Dymnikov. Java bytecode verification for @NonNull types. In Proc. CC, pages 229-244, 2008.
Gianluca Mezzetti, Anders Møller, and Martin Toldam Torp. Type regression testing to detect breaking changes in node. js libraries. In proc. ECOOP'18. Schloss Dagstuhl-Leibniz-Zentrum fuer Informatik, 2018.
Ana Milanova and Wei Huang. Inference and checking of context-sensitive pluggable types. In Proc. ESEC/FSE, page 26. ACM Press, 2012.
Fabian Muehlboeck and Ross Tate. Empowering union and intersection types with integrated subtyping. PACMPL, 2(OOPSLA):112:1-112:29, 2018.
Mangala Gowri Nanda and Saurabh Sinha. Accurate interprocedural null-dereference analysis for java. In 2009 IEEE 31st International Conference on Software Engineering, pages 133-143. IEEE, 2009.
Abel Nieto, Marianna Rapoport, Gregor Richards, and Ondrej Lhoták. Blame for null. In Proc. ECOOP, pages 3:1-3:28, 2020.
Abel Nieto, Yaoyu Zhao, Ondřej Lhoták, Angela Chang, and Justin Pu. Scala with Explicit Nulls. In Proc. ECOOP, volume 166, pages 25:1-25:26, 2020.
Haidar Osman, Manuel Leuenberger, Mircea Lungu, and Oscar Nierstrasz. Tracking null checks in open-source java systems. In 2016 IEEE 23rd International Conference on Software Analysis, Evolution, and Reengineering (SANER), volume 1, pages 304-313. IEEE, 2016.
Carlos Pacheco and Michael D Ernst. Randoop: feedback-directed random testing for java. In Proc. OOPSLA'07, pages 815-816. ACM, 2007.
Matthew M Papi, Mahmood Ali, Telmo Luis Correa Jr, Jeff H Perkins, and Michael D Ernst. Practical pluggable types for java. In Proc. ISSTA'08, pages 201-212. ACM, 2008.
D. J. Pearce. Sound and complete flow typing with unions, intersections and negations. In Proc. VMCAI, pages 335-354, 2013.
P. Pominville, F. Qian, R. Vallée-Rai, L. Hendren, and C. Verbrugge. A framework for optimizing Java using attributes. In Proc. CC, pages 334-554, 2001.
Xin Qi and Andrew C. Myers. Masked types for sound object initialization. In Proc. POPL, pages 53-65. ACM Press, 2009.
Baber Rehman, Xuejing Huang, Ningning Xie, and Bruno C. d. S. Oliveira. Union Types with Disjoint Switches. In Proc. ECOOP, volume 222, pages 25:1-25:31, 2022.
Caitlin Sadowski, Edward Aftandilian, Alex Eagle, Liam Miller-Cushon, and Ciera Jaspan. Lessons from building static analysis tools at google. Communications of the ACM, 61(4):58-66, 2018.
Marco Servetto, Julian Mackay, Alex Potanin, and James Noble. The billion-dollar fix - safe modular circular initialisation with placeholders and placeholder types. In Proc. ECOOP, volume 7920, pages 205-229. Springer-Verlag, 2013.
Jeremy Siek and Walid Taha. Gradual typing for objects. In Proc. ECOOP, pages 151-175. Springer-Verlag, 2007.
Nicholas Smith, Danny Van Bruggen, and Federico Tomassetti. Javaparser: visited. Leanpub, oct. de, 2017.
Fausto Spoto. Nullness analysis in boolean form. In Proc. SEFM, pages 21-30. IEEE, 2008.
Fausto Spoto and Michael D. Ernst. Inference of field initialization. In Proc. ICSE, pages 231-240. ACM Press, 2011.
Li Sui, Jens Dietrich, Amjed Tahir, and George Fourtounis. On the recall of static call graph construction in practice. In Proc. ICSE'20, pages 1049-1060. IEEE, 2020.
Alexander J. Summers and Peter Mueller. Freedom before commitment: A lightweight type system for object initialisation. In Proc. OOPSLA, pages 1013-1032, 2011.
Timothy A. V. Teatro, J. Mikael Eklund, and Ruth Milman. Maybe and either monads in plain C++ 17. In Proc. CCECE, pages 1-4. IEEE, 2018.
Brian Vermeer. Spring dominates the java ecosystem with 60% using it for their main applications, 2020. URL: https://snyk.io/blog/spring-dominates-the-java-ecosystem-with-60-using-it-for-their-main-applications/.
Ying Wang, Ming Wen, Zhenwei Liu, Rongxin Wu, Rui Wang, Bo Yang, Hai Yu, Zhiliang Zhu, and Shing-Chi Cheung. Do the dependency conflicts in my project matter? In Proc. ESEC/FSE'18, pages 319-330, 2018.
Rafael Winterhalter. Byte Buddy - A code generation and manipulation library for creating and modifying Java classes during the runtime, 2014. URL: https://bytebuddy.net/.

On Leveraging Tests to Infer Nullable Annotations

Authors Jens Dietrich , David J. Pearce , Mahin Chandramohan

File

Document Identifiers

Author Details

Acknowledgements

Cite AsGet BibTex

Abstract

Subject Classification

ACM Subject Classification

Keywords

Metrics

References

On Leveraging Tests to Infer Nullable Annotations

Authors Jens Dietrich , David J. Pearce , Mahin Chandramohan

File

Document Identifiers

Author Details

Funding

Acknowledgements

Cite AsGet BibTex

Abstract

Subject Classification

ACM Subject Classification

Keywords

Metrics

Supplementary Materials

References