求高效的基础运算库~~~

yxyxyx1700 · 发表于 2012-2-8 20:06:51

马上注册，结交更多好友，享用更多功能，让你轻松玩转社区。

您需要登录才可以下载或查看，没有账号？欢迎注册

×

本人新手，请问VC++下有没有比math.h库更加高效的基础运算库，或者对pow、log、sin、cos、sqrt等函数的优化方法~~(只考虑一般情况即可，不需要考虑大数及高精度)，请各位高手赐教！！！万分感谢！！！

liangbch · 发表于 2012-2-9 13:46:16

这些数学函数的实现是通过CPU/FPU指令实现的，已经非常快了，应该能够满足绝大多数需要,而楼主似乎对其性能不满意却又提到不考虑高精度和大数，这就让人搞不懂了。个人认为就double精度而言，这些函数很难有实质的改进了，任何算法的实现都应该快不过CPU指令。

sielw · 发表于 2012-2-13 09:53:32

弱弱问一句，哪里能看到math.c？

liangbch · 发表于 2012-2-13 13:35:19

2# liangbch 刚才写了一个程序，对VC6中的数学函数的性能进行了测试。结果如下，我的CPU是Intel E8500, CPU频率是3.16GH sqrt() 花费时间约4.3 CPU 周期 sin() 花费时间约13 CPU周期 log() 花费时间约10 CPU周期

liangbch · 发表于 2012-2-13 13:38:33

这里贴出所有的代码，注意，需要修改 CPU_FREQ的定义，使得它的值和你的CPU一致。编译环境为VC6.0

#include <stdio.h>
#include <stdlib.h>
#include <math.h>
#include <windows.h>
#define PI 3.1415926535897932384626433832795
#define DATA_COUNT 1000
#define CPU_FREQ (3.16e9) //CPU frequency, need always change this definition for match your CPU
typedef double ( *lpfn_math )(double);
unsigned __int64 s_u64Frequency = 1;
unsigned __int64 s_u64Start, s_u64End;
double data[DATA_COUNT];
double result[DATA_COUNT];
void testFun( const lpfn_math pFun,
const char* lpszFunName,
const int testTimes )
{
int i,j;
double t;
double ns;
QueryPerformanceFrequency((LARGE_INTEGER *)&s_u64Frequency );
printf( "\nTest function: %s() %u times\n",
lpszFunName, testTimes*DATA_COUNT );
QueryPerformanceCounter((LARGE_INTEGER *)&s_u64Start );
for (i=0;i<testTimes;i++)
{
for (j=0;j<DATA_COUNT;j++)
{
result[j]=pFun(data[j]);
}
}
QueryPerformanceCounter((LARGE_INTEGER *)&s_u64End );
t = (double)(( s_u64End - s_u64Start ) / (double) s_u64Frequency );
printf( "Elapsed time: %d ms\n", (int)(t*1000));
ns=(t * 1e9) /(double)(DATA_COUNT *testTimes); //calc the time of one sqrt function call in nanosecond
printf("%s() spend %f ns\n", lpszFunName,ns );
printf("%s() spend %f cpu cycle\n", lpszFunName, (ns*1e9)/CPU_FREQ);
}
void test_sqrt()
{
int c,n1,n2;
c=0;
while (c<DATA_COUNT)
{
n1=rand();
n2=rand();
if (n1!=0 && n2!=0)
{
data[c]=double(n1) / double(n2);
c++;
}
}
testFun( sqrt,"sqrt",1000);
}
void test_sine()
{
int c,n;
c=0;
while (c<DATA_COUNT)
{
n=(rand() % 1000)+1;
data[c]=double(n) / 1000.00 * PI * 0.5; //data[c] is i/1000*PI/2, i=1..1000
c++;
}
testFun(sin,"sin",1000);
}
void test_log()
{
int c,n1,n2;
c=0;
while (c<DATA_COUNT)
{
n1=(rand() % 1000)+1;
n2=rand() % 1000;
data[c]=double(n1) + double(n2) * 0.001; //data[ i ] always between 1.000 to 1000.999
c++;
}
testFun( log,"log",1000);
}
int main(int argc, char* argv[])
{
test_sqrt();
test_sine();
test_log();
return 0;
}

复制代码

gxqcn · 发表于 2012-2-13 14:19:36

2# liangbch 刚才写了一个程序，对VC6中的数学函数的性能进行了测试。结果如下，我的CPU是Intel E8500, CPU频率是3.16GH sqrt() 花费时间约4.3 CPU 周期 sin() 花费时间约13 CPU周期 log() 花费时间约10 CPU周 ... liangbch 发表于 2012-2-13 13:35

感觉比我想象的快多了，我原本以为它们都得耗上百余个指令周期呢。

liangbch · 发表于 2012-2-13 14:32:55

很不幸，VC6中没有数学函数的源代码，即使在安装了VC是选择 VC++ Runtime Libraries ->CRT Source Code 也是如此。我能够在调试时进入其他库函数的源文件，如memset,但是不能进入数学函数所在的源文件。搜索CRT源码目录,仅仅找到2个导入库的定义文件，见下。 C:\Program Files\Microsoft Visual Studio\VC98\CRT\SRC\Intel\_SAMPLD_.DEF C:\Program Files\Microsoft Visual Studio\VC98\CRT\SRC\Intel\_SAMPLE_.DEF

G-Spider · 发表于 2012-2-13 16:43:45

查看了一下，msvcrt.dll 和相关的LIBC.LIB 库，以上函数还是对fsqrt , fsin ,fcos 的封装。不知是否还有用到其它库函数。

_sqrt:
var_4 = word ptr -4
arg_0 = dword ptr 4
arg_4 = dword ptr 8
lea edx, [esp+arg_0]
call __fload_withFB
loc_77C1D52D:
push edx
fstcw [esp+4+var_4]
mov eax, [esp+4+arg_4]
jz short loc_77C1D589
cmp [esp+4+var_4], 27Fh
jz short loc_77C1D545
call __load_CW
loc_77C1D545:
test eax, 80000000h
jnz short loc_77C1D56B
fsqrt
....
;=======================================
__fload_withFB:
var_A = tbyte ptr -0Ah
mov eax, [edx+4]
and eax, 7FF00000h
cmp eax, 7FF00000h
jz short loc_77C20C37
fld qword ptr [edx]
retn
; --------------------------------
loc_77C20C37:
mov eax, [edx+4]
sub esp, 0Ah
or eax, 7FFF0000h
mov dword ptr [esp+0Ah+var_A+6], eax
mov eax, [edx+4]
mov ecx, [edx]
shld eax, ecx, 0Bh
shl ecx, 0Bh
mov [esp+4], eax
mov dword ptr [esp+0Ah+var_A], ecx
fld [esp+0Ah+var_A]
add esp, 0Ah
test eax, 0
mov eax, [edx+4]
retn
__fload_withFB endp

复制代码

liangbch · 发表于 2012-2-13 22:33:41

关于求浮点数（单精度和双精度）的平方根，下面的文章值得一看，它利用IEEE754浮点数的格式，直接求出一个比较好的初值，然后使用牛顿迭代求精。 http://ilab.usc.edu/wiki/index.php/Fast_Square_Root

wayne · 发表于 2012-2-15 11:37:45

在 Android 源码目录的 bionic/libm/src路径下找到平方根的实现e_sqrt.c：感兴趣的可以看看：

e_sqrt.c (14.1 KB, 下载次数: 1)

账号		自动登录	找回密码
密码			欢迎注册

[求助] 求高效的基础运算库~~~

马上注册，结交更多好友，享用更多功能，让你轻松玩转社区。