SynDataGenerator-Cat

所属分类:超算/并行计算
开发工具:Visual C++
文件大小:878KB
下载次数:5
上传日期:2010-11-07 18:54:48
上 传 者sunda
说明:  Mining Favorable Facets

文件列表:
Debug (0, 2006-08-27)
Debug\generate.exe (204882, 2006-06-04)
Debug\generate.ilk (237968, 2006-06-04)
Debug\generate.obj (23334, 2006-06-04)
Debug\generate.pch (237044, 2006-06-04)
Debug\generate.pdb (484352, 2006-06-04)
Debug\vc60.idb (41984, 2007-05-25)
Debug\vc60.pdb (53248, 2006-06-04)
generate.cpp (10987, 2006-06-04)
generate.dsp (3425, 2006-06-04)
generate.dsw (541, 2006-06-04)
generate.ncb (50176, 2007-05-25)
generate.opt (49664, 2007-05-25)
generate.plg (1160, 2006-06-04)
test.txt (1580556, 2007-05-25)

Readme for Synthetic Dataset Generator of Data Containing Nominal Attributes and Numeric Attributes =================================================================================================== Running Environment: Microsoft Visual C++ How to run ---------- Open "generate.dsw" Press "Execute" You can run the program successfully. You can also change the parameters of the data set generator. We have a command-line syntax for the generator as follows. Syntax: generate where Distribution = E(qually) | C(orrelated) | A(nti-correlated) You can modify the above parameters by the following methods. 1. choose "Project->Settings" menu 2. choose "Debug" tag 3. change the values in "Program arguments" textfield. e.g. current values are "3 1 5 1 A 100000 test.txt", which means that we will generate the data set named "test.txt" containing 100000 tuples with 3 numeric attributes and 1 nominal/categorial attribute with 5 values where the Zipfian distribution parameter is equal to 1 and the data distribution is anti-correlated.

近期下载者

相关文件


收藏者