What is a
linguistic
antipattern
?

Have you ever had a gnarly bug, or even just a frustrating coding session, that could be ultimately traced back to something that just didn't do what you thought it did based on the name? We certainly have. These can be caused by two people who interpret a word differently, or one person making too many assumptions by themselves. But more often than not, they're caused by a problem where the name predictably leads people to believe a function does something it simply doesn't. The ways in which this happens are linqusitic antipatterns. As defined by the original researchers:

Linguistic Antipatterns (LAs) in software systems are recurring poor practices in the naming, documentation, and choice of identifiers in the implementation of an entity, thus possibly impairing program understanding.

This website is dedicated to cataloguing types of linguistic antipatterns and discussing the deeper reasons they cause problems and how to fix them.

Multiple methods with confusably-similar names and effects

Description

A class or namespace has two functions with similar names. A programmer who wants the functionality of the first function may mistakenly call the second. If the effects are similar, casual testing may mislead the programmer into thinking they had called the correct function, even if the two functions differ in some important way.

Examples

The classic example of this is the confusion between Thread.start() and Thread.run() in Java. Python's standard thread package was based on Java's, and also has this problem. For example, consider these snippets:

Thread myThread = new Thread(() => doSomethingExpensive());
myThread.run();

class MyThread(thread):
  def run(self):
    doSomethingExpensive()
myThread = MyThread()
myThread.run()

In both of these, the programmer intended to call myThread.start(), which creates a new background thread and then runs doSomethingExpensive on that background thread. Instead, they have called myThread.run(), which runs doSomethingExpensive() in the current thread. This happened because start and run are confusingly similar names. Further, the application will appear to work, but it will be slower because something which should be done in the background is blocking important behavior. Because of that, this bug can go undetected in a codebase for a long time.

For another example, consider the battle between the various load functions in PyYAML. Originally, in versions 3.12 and below, PyYAML had two functions called load and safe_load, where safe_load had a safe behavior while load could execute arbitrary code. In PyYAML 4.1, they renamed the old load to danger_load. They removed the old safe_load function and created a new load function, which is also unsafe. This caused significant controversy. The story is chronicled in this blog post.

Both pairs of functions raised issues of confusability. Users of a function called load will see YAML parse correctly and believe they have implemented this functionality correctly, but they have in fact introduced an arbitrary-code-execution vulnerability into their software. And when there is a function called danger_load, a programmer may be tempted to think that the load function implements the safer options, but in this library load was in some ways more dangerous than danger_load.

Discussion and Lessons

Having confusable methods is one way to violate the Representable/Valid Principle, that there should be a 1-1 mapping between representable and valid states of the program. For the thread example, the state where doSomethingExpensive() has been run in the main thread is an error state which should never occur. It should not be possible to call the Thread.run() method. In PyYAML 3.12, calling load enters a state where arbitrary code may have been executed, which is invalid; it should therefore not be possible for a programmer to inadvertently call it. In comparison, after the changes of version 4.1, an ordinary programmer would only call load and never danger_load, meaning the result of having called danger_load is not representable in any program surviving minimal code review. However, this only makes uses of the load function more likely to pass code review, even though it is also insecure.

What is a
linguistic
antipattern
?

Origin

How does this website differ from the original
Linguistic
Antipatterns
papers ?

Who are we !?

Linguistic
Antipatterns

Multiple methods with confusably-similar names and effects

Description

Examples

Discussion and Lessons

What is a linguisticantipattern ?

Origin

How does this website differ from the original LinguisticAntipatterns papers ?

Who are we !?

LinguisticAntipatterns

Multiple methods with confusably-similar names and effects

Description

Examples

Discussion and Lessons

What is a
linguistic
antipattern
?

How does this website differ from the original
Linguistic
Antipatterns
papers ?

Linguistic
Antipatterns