SynDataGenerator-Cat
Category: Supercomputing / Parallel Computing
Development tool: Visual C++
File size: 878 KB
Downloads: 5
Upload date: 2010-11-07 18:54:48
Uploader: sunda
Description: Mining Favorable Facets
File list:
Debug (0, 2006-08-27)
Debug\generate.exe (204882, 2006-06-04)
Debug\generate.ilk (237968, 2006-06-04)
Debug\generate.obj (23334, 2006-06-04)
Debug\generate.pch (237044, 2006-06-04)
Debug\generate.pdb (484352, 2006-06-04)
Debug\vc60.idb (41984, 2007-05-25)
Debug\vc60.pdb (53248, 2006-06-04)
generate.cpp (10987, 2006-06-04)
generate.dsp (3425, 2006-06-04)
generate.dsw (541, 2006-06-04)
generate.ncb (50176, 2007-05-25)
generate.opt (49664, 2007-05-25)
generate.plg (1160, 2006-06-04)
test.txt (1580556, 2007-05-25)
Readme for Synthetic Dataset Generator of Data Containing Nominal Attributes and Numeric Attributes
===================================================================================================
Running Environment: Microsoft Visual C++
How to run
----------
Open "generate.dsw".
Press "Execute".
The program should then run.
You can also change the parameters of the data set generator.
The generator has the following command-line syntax.
Syntax: generate
where Distribution = E(qually) | C(orrelated) | A(nti-correlated)
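As a rough illustration of how such a command line could be read, here is a minimal C++ sketch. It is not taken from generate.cpp: the struct, enum, and function names are hypothetical, the argument order is assumed from the example values given in this readme ("3 1 5 1 A 100000 test.txt"), and treating the Zipfian parameter as a floating-point number is also an assumption.

```cpp
#include <stdexcept>
#include <string>

// Hypothetical names for illustration; none of them come from generate.cpp.
enum class Distribution { Equal, Correlated, AntiCorrelated };

struct GeneratorConfig {
    int numericAttributes;      // number of numeric attributes
    int nominalAttributes;      // number of nominal attributes
    int nominalValues;          // number of values per nominal attribute
    double zipfParameter;       // Zipfian distribution parameter (type assumed)
    Distribution distribution;  // E, C, or A
    long tuples;                // number of tuples to generate
    std::string outputFile;     // name of the generated data set
};

// Map the one-letter Distribution code to an enum value.
Distribution parseDistribution(const std::string& code) {
    if (code == "E") return Distribution::Equal;
    if (code == "C") return Distribution::Correlated;
    if (code == "A") return Distribution::AntiCorrelated;
    throw std::invalid_argument("Distribution must be E, C, or A");
}

// Parse argv, where argv[0] is the program name and seven arguments follow.
// The order of the arguments is an assumption based on the readme's example.
GeneratorConfig parseArguments(int argc, const char* const argv[]) {
    if (argc != 8) {
        throw std::invalid_argument("expected 7 arguments");
    }
    GeneratorConfig config;
    config.numericAttributes = std::stoi(argv[1]);
    config.nominalAttributes = std::stoi(argv[2]);
    config.nominalValues = std::stoi(argv[3]);
    config.zipfParameter = std::stod(argv[4]);
    config.distribution = parseDistribution(argv[5]);
    config.tuples = std::stol(argv[6]);
    config.outputFile = argv[7];
    return config;
}
```

Rejecting anything other than exactly seven arguments keeps the sketch simple; the real program may handle defaults or errors differently.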
You can modify the above parameters as follows.
1. Choose the "Project->Settings" menu.
2. Choose the "Debug" tab.
3. Change the values in the "Program arguments" text field.
For example, the current value is
"3 1 5 1 A 100000 test.txt", which means
that the generator produces a data set named "test.txt" containing 100000 tuples with 3 numeric attributes
and 1 nominal (categorical) attribute with 5 values, where the Zipfian distribution parameter
is equal to 1 and the data distribution is anti-correlated.
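To make the Zipfian parameter concrete, the sketch below computes the probability of each nominal value under the usual Zipf definition, where the k-th most frequent of n values has probability proportional to 1 / k^theta. This readme does not say how generate.cpp applies the parameter, so this is an assumption about the standard form, and the function names are hypothetical.

```cpp
#include <cmath>
#include <random>
#include <vector>

// Probabilities of n ranked values under a Zipfian distribution with
// parameter theta: p(k) is proportional to 1 / k^theta for k = 1..n.
// Hypothetical helper; not taken from generate.cpp.
std::vector<double> zipfProbabilities(int n, double theta) {
    std::vector<double> probabilities(n);
    double sum = 0.0;
    for (int k = 1; k <= n; ++k) {
        probabilities[k - 1] = 1.0 / std::pow(static_cast<double>(k), theta);
        sum += probabilities[k - 1];
    }
    for (double& p : probabilities) {
        p /= sum;  // normalize so the probabilities add up to 1
    }
    return probabilities;
}

// Draw one value index in [0, n) according to the Zipfian probabilities.
int sampleZipf(const std::vector<double>& probabilities, std::mt19937& rng) {
    std::discrete_distribution<int> dist(probabilities.begin(),
                                         probabilities.end());
    return dist(rng);
}
```

With 5 values and a parameter of 1, as in the example above, the unnormalized weights are 1, 1/2, 1/3, 1/4, and 1/5, so the most frequent value has probability 60/137 (about 0.438) and the least frequent has 12/137 (about 0.088). A parameter of 0 would make all values equally likely.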