Calling a function with an “unknown” name in C ++. Part 1 - cdecl

Formulation of the problem

What did I mean when I wrote the “unknown” function name? This means that the function name, its parameters and, finally, the calling convention, become known only during the execution of the program. Let's take her call! =)

Now we will try to call the function according to the cdecl standard.
Excerpt from Wikipedia:

CD system for the x86 architecture. In cdecl, right-to-left order. The EAX register (except for the floating point values) has been returned. Registers EAX, ECX, and EDX are available for use in the function.

In general, parameters are passed through the stack in the reverse order, the resulting value will be in EAX except for floating-point numbers — they will be in the x87 pseudo-stack.

Make a plan of work:
1) Generate a buffer in memory that can be unchanged, word for word (4 bytes) pushed onto the stack.
2) Find out the address of the function that will be called
3) Put a word buffer on the stack.
4) Call the function
5) pull the result

Go!

')
What we have:
1) char * sName - here is the name of the function
2) int N - the number of parameters
3) enum CParamType {cptNone = 0, cptPointer, cptInt, cptDouble} - possible data types - for now let's confine ourselves to these
4) CParamType Params [] - list of parameter types
5) void * ParamList [] - in fact, pointers to variables with parameters
6) CParamType RetType - result data type
7) void * Ret - a pointer to the memory where you want to throw the result
8) enum CCallConvention {cccNone = 0, cccCDecl, cccStdCall, cccFastCall} - types of calling conventions
9) CCallConvention conv - calling convention. First we will call only cdecl functions

This is a necessary and sufficient list of ads that we need to call.
C / C ++ does not have the means to perform this operation, so you have to turn to assembler.

1. Create a buffer

First, let's count the number of words. Everything is simple - void *, int - 4 bytes - 1 word, double - 8 bytes - 2 words.

Copy Source | Copy HTML int WordCount= 0 ; for ( int i= 0 ,i<N,i++) { switch (Params[i]) { case cptPointer: case cptInt: WordCount++; break ; case cptDouble: WordCount+= 2 ; break ; } }
Copy Source | Copy HTML int WordCount= 0 ; for ( int i= 0 ,i<N,i++) { switch (Params[i]) { case cptPointer: case cptInt: WordCount++; break ; case cptDouble: WordCount+= 2 ; break ; } }
Copy Source | Copy HTML int WordCount= 0 ; for ( int i= 0 ,i<N,i++) { switch (Params[i]) { case cptPointer: case cptInt: WordCount++; break ; case cptDouble: WordCount+= 2 ; break ; } }
Copy Source | Copy HTML int WordCount= 0 ; for ( int i= 0 ,i<N,i++) { switch (Params[i]) { case cptPointer: case cptInt: WordCount++; break ; case cptDouble: WordCount+= 2 ; break ; } }
Copy Source | Copy HTML int WordCount= 0 ; for ( int i= 0 ,i<N,i++) { switch (Params[i]) { case cptPointer: case cptInt: WordCount++; break ; case cptDouble: WordCount+= 2 ; break ; } }
Copy Source | Copy HTML int WordCount= 0 ; for ( int i= 0 ,i<N,i++) { switch (Params[i]) { case cptPointer: case cptInt: WordCount++; break ; case cptDouble: WordCount+= 2 ; break ; } }
Copy Source | Copy HTML int WordCount= 0 ; for ( int i= 0 ,i<N,i++) { switch (Params[i]) { case cptPointer: case cptInt: WordCount++; break ; case cptDouble: WordCount+= 2 ; break ; } }
Copy Source | Copy HTML int WordCount= 0 ; for ( int i= 0 ,i<N,i++) { switch (Params[i]) { case cptPointer: case cptInt: WordCount++; break ; case cptDouble: WordCount+= 2 ; break ; } }
Copy Source | Copy HTML int WordCount= 0 ; for ( int i= 0 ,i<N,i++) { switch (Params[i]) { case cptPointer: case cptInt: WordCount++; break ; case cptDouble: WordCount+= 2 ; break ; } }
Copy Source | Copy HTML int WordCount= 0 ; for ( int i= 0 ,i<N,i++) { switch (Params[i]) { case cptPointer: case cptInt: WordCount++; break ; case cptDouble: WordCount+= 2 ; break ; } }
Copy Source | Copy HTML int WordCount= 0 ; for ( int i= 0 ,i<N,i++) { switch (Params[i]) { case cptPointer: case cptInt: WordCount++; break ; case cptDouble: WordCount+= 2 ; break ; } }
Copy Source | Copy HTML int WordCount= 0 ; for ( int i= 0 ,i<N,i++) { switch (Params[i]) { case cptPointer: case cptInt: WordCount++; break ; case cptDouble: WordCount+= 2 ; break ; } }
Copy Source | Copy HTML int WordCount= 0 ; for ( int i= 0 ,i<N,i++) { switch (Params[i]) { case cptPointer: case cptInt: WordCount++; break ; case cptDouble: WordCount+= 2 ; break ; } }
Copy Source | Copy HTML int WordCount= 0 ; for ( int i= 0 ,i<N,i++) { switch (Params[i]) { case cptPointer: case cptInt: WordCount++; break ; case cptDouble: WordCount+= 2 ; break ; } }
Copy Source | Copy HTML int WordCount= 0 ; for ( int i= 0 ,i<N,i++) { switch (Params[i]) { case cptPointer: case cptInt: WordCount++; break ; case cptDouble: WordCount+= 2 ; break ; } }

Counted. Memory allocation:
void* Buffer = new char[4*WordCount];

Fill the buffer: void *, int - put it unchanged, and swap words in double.

Copy Source | Copy HTML
int offset = 0 ;
double x;
for ( int i = 0 , i <n, i ++)
{
switch (Params [i])
{
case cptPointer:
case cptInt:
* ( int *) (buf + offset) = * (( int *) (ParamList [i]));
offset + = 4 ;
break ;
case cptDouble:
x = * (( double *) (((DTMain *) (v-> T)) -> pData));
memcpy (buf + offset + 4 , & x, 4 );
memcpy (buf + offset, ( char *) & x + 4 , 4 );
offset + = 8 ;
break ;
}
}

I think there is nothing to comment on. offset - offset buffer.

2. We learn the function address

It's all quite simple.
void* addr = dlsym(NULL,sName);
Where the first parameter is the library descriptor. NULL to search in current context.
We connect dlfcn.h and do not forget to add -ldl to the linking parameters.

3. We put the buffer on the stack by words

Fuh. The most interesting.
To work with the stack, we naturally need an assembler. I use the gnu compiler, so the assembler with the AT & T syntax does not kick my feet, I don’t really like it myself, but I don’t have to choose.

Copy Source | Copy HTML
asm ( "\ movl $ 0, %% eax; \ movl% 2, %% ebx; \ movl% 3, %% ecx; \ l1: cmpl% % ecx, %% eax; \ je l2; \ pushl (%% ebx, %% eax, 4); \ addl $ 1, %% eax; \ jmp l1; "
: "= r" (b)
: "r" (addr), "r" ( Buffer ), "g" (WordCount)
: "% eax"
);

We do a cycle: until ecx (WordCount) becomes 0, put the word on the stack and reduce ecx.

4. Call the function

Do
l2: call *%1;
after filling the stack. % 1 is a pointer to the function (addr).

5. Return the result

There are 2 options: whole result or fractional. According to the agreement, by default the result will be in% eax, but if with a floating point, it will be x87 in the vsevdo-stack.
1) Whole result
movl %%eax, %0;
where% 0 is the result variable.

2) Floating point option
The idea here is to remove the answer from the ST (0). So far I have not managed to do this. I would like to see possible solutions in the comments. Thank you in advance.

Well that's all! The task was really not trivial. I hope someone will need this post.

PS You need all this to write an interpreter.
_________
The text was prepared in Habra Editor

UPD: Highlight the source

Source: https://habr.com/ru/post/78886/

All Articles