Beethoven: A library for making Java bytecode instrumentation easier
The point of this library is to offer a simple, annotation-driven way to instrument classes. What this means is that a user of this library should never have to see so much as a byte. Everything should be abstracted to the level of classes, methods and fields.
Version 2
Implemented
-
Per-instruction method frame state analysis tools
-
Method insertion
-
Method replacement
-
In-place method appending
- Accepting return value as argument
- Overwriting return value
-
In-place method prepending
-
Automatic injection of INVOKEDYNAMIC lambda targets
-
Method injection priority
-
Inject superclasses
-
Inject interfaces
-
Inject fields
TODO
-
Better tests
-
Execution path optimization (e.g. remove unnecessary GOTOs)
-
Implement exceptions
-
Implement try-catch
How do I use this?
Beethoven is designed to be relatively accessible, so you should never have to see raw JVM instructions. Let's use an example:
Given the following class:
public final class Foo {
private final int myNumber;
public Foo(int myNumber) {
this.myNumber = myNumber;
}
public int addMyNumber(int addTo) {
return addTo + this.myNumber;
}
}
Say we would like to modify the behaviour of addMyNumber
to print the value of addTo
and return the value as defined
above, except divided by two. Under normal conditions where the Foo
source code could be modified, this might looks
as follows:
public int addMyNumber(int addTo) {
System.out.println(addTo);
return (addTo + this.myNumber) / 2;
}
But if we for some reason cannot change the source code of Foo
, we can instead use Beethoven to modify the bytecode at
runtime by defining a class which specifies how we would like to modify the behaviour. In our example, this would look
as follows:
import dev.w1zzrd.asm.InPlaceInjection;
import dev.w1zzrd.asm.Inject;
public class TweakFoo {
@Inject(value = InPlaceInjection.AFTER, target = "addMyNumber(I)I", acceptOriginalReturn = true)
public int addMyNumber(int addTo, int ret) {
System.out.println(addTo);
return ret / 2;
}
}
We specify using the @Inject
annotation that we would like to inject the annotated method into the targeted class (
we'll get to how we target a class in a sec), that we would like to append the code AFTER
the existing code, that we
are targeting a method named addMyNumber
which accepts an int (specified by (I)
) and returns an int (specified by
the final I
), as well as that we would like to accept the original method's return value as an argument to the tweak
method (named ret
in the example code above). If we hadn't cared about what the original method returned, we could
simply ignore the acceptOriginalReturn
parameter and omit the int ret
argument from the method.
Type signatures
Method signatures, as used in the example code above (the target
value in the @Inject
annotation), are specified in
the following format: methodName(argumentTypes)returnType
Types follow the JNI type naming standard:
Type | Name |
---|---|
Boolean | Z |
Byte | B |
Short | S |
Int | I |
Long | J |
Float | F |
Double | D |
Object | Lpath ; |
Array | [Type |
For objects, the type is specified as the full path of the type, where all .
are replaced with /
. So, for example,
a Java string would be written as Ljava/lang/String;
. Generics are ignored when describing an objects type.
Array types are defined as a [
followed by any JNI type (including another array type). Nested arrays are written as
consecutive [
. For example, a three-dimensional Throwable
array (Throwable[][][]
) would be written as
[[[Ljava/lang/Throwable;
.
Finally, this means that, for example, the method
public Class<?> getClass(
String name,
int index,
float crashProbability,
double[] numbers,
String[][] crashMap
) {
// ...
}
would have the target
value getClass(Ljava/lang/String;IF[D[[Ljava/lang/String;)Ljava/lang/Class;
Targeting classes
To target a class, simply create an instance of the dev.w1zzrd.asm.Combine
class targeting it. After that, you feed it
annotated MethodNodes. For example, using the example code described earlier, we could do as follows:
import dev.w1zzrd.asm.Combine;
import dev.w1zzrd.asm.GraftSource;
import dev.w1zzrd.asm.Loader;
public class Run {
public static void main(String[] args) {
// This is the class we would like to inject code into
Combine target = new Combine(Loader.getClassNode("Foo"));
// This is the class we want to inject code from
GraftSource source = new GraftSource(Loader.getClassNode("TweakFoo"));
// Inject all annotated methods
for (MethodNode toInject : source.getInjectMethods()) {
// Inject an annotated method into the targeted class
target.inject(toInject, source);
}
// Now we load the tweaked Foo class into the JVM
target.compile();
// Done! To see that it has worked, we can try running the tweaked method:
int result = new Foo(5).addMyNumber(15); // Prints "5"
System.out.println(result); // Prints "10"
}
}
To inject code from multiple tweak classes, simply define more GraftSource
instances and inject them into the target.
Currently, Beethoven supports injecting code before and after the original method, as well as overwriting existing methods entirely, as well as adding new methods.
In fact, there is a drastically simpler way to inject code into a target. Say we know that the above classes are all available to the default ClassLoader's classpath. In this case, we can simply do as follows:
import dev.w1zzrd.asm.Injector;
public class Run {
public static void main(String[] args) {
// Locates all necessary tweaks and injects them into Foo
Injector.injectAll("Foo").compile();
// Done! To see that it has worked, we can try running the tweaked method:
int result = new Foo(5).addMyNumber(15); // Prints "5"
System.out.println(result); // Prints "10"
}
}
The only caveat is that classes injected this way must have an @InjectClass
annotation declaring the targeted class.
In our example, this would mean that the TweakFoo
class would look as follows:
import dev.w1zzrd.asm.InPlaceInjection;
import dev.w1zzrd.asm.Inject;
import dev.w1zzrd.asm.InjectClass;
// This marks the class as targeting Foo
@InjectClass(Foo.class)
public class TweakFoo {
@Inject(value = InPlaceInjection.AFTER, target = "addMyNumber(I)I", acceptOriginalReturn = true)
public int addMyNumber(int addTo, int ret) {
System.out.println(addTo);
return ret / 2;
}
}
Additionally, method resolution is relatively intelligent, so one can omit the target
parameter of the @Inject
annotation in cases where the target is unambiguous. As long as the tweak method has the same name as the targeted
method, the target should never be ambiguous.