Challenge_Data

所属分类:文件格式
开发工具:matlab
文件大小:5706KB
下载次数:72
上传日期:2017-03-07 09:46:47
上 传 者根哥
说明:  故障预测与健康管理数据PHM,数据格式为txt,希望对有需要的同志有帮助。英文描述建议大家看看,说不定会有帮助。
(5. PHM08 Challenge data set the data challenge competition held at the 1st international conference on Prognostics and Health Management (PHM08) is being made publicly available. The dataset is similar to the one posted above (see Turbofan engine degradation simulation data set) except the true RUL values are not revealed. Users are expected to develop their algorithms using training and test sets provided in the package. The set is in text format and has been zipped including a readme file. Please cite: A. Saxena and K. Goebel (2008). PHM08 Challenge Data Set )

文件列表:
Challenge Data Description 2016.pdf (326053, 2016-07-19)
final_test.txt (9268393, 2008-05-20)
test.txt (5004220, 2010-05-03)
train.txt (7714776, 2010-05-03)

updated: May 04, 2010 PHM08 Prognostics Data Challenge Dataset Description This dataset was used for the prognostics challenge competition at the International Conference on Prognostics and Health Management (PHM08). The challenge is still open for the researchers to develop and compare their efforts against the winners of the challenge in 2008. References to the three winner papers are provided below. [1] Heimes, F.O., “Recurrent neural networks for remaining useful life estimation”, in the Proceedings of the 1st International Conference on Prognostics and Health Management (PHM08), Denver CO, Oct 2008. [2] Tianyi Wang, Jianbo Yu, Siegel, D., Lee, J., “A similarity-based prognostics approach for Remaining Useful Life estimation of engineered systems”, in the Proceedings of the 1st International Conference on Prognostics and Health Management (PHM08), Denver CO, Oct 2008. [3] Peel, L., “Recurrent neural networks for remaining useful life estimation”, in the Proceedings of the 1st International Conference on Prognostics and Health Management (PHM08), Denver CO, Oct 2008. Experimental Scenario Data sets consist of multiple multivariate time series. Each data set is further divided into training and test subsets. Each time series is from a different engine – i.e., the data can be considered to be from a fleet of engines of the same type. Each engine starts with different degrees of initial wear and manufacturing variation which is unknown to the user. This wear and variation is considered normal, i.e., it is not considered a fault condition. There are three operational settings that have a substantial effect on engine performance. These settings are also included in the data. The data are contaminated with sensor noise. The engine is operating normally at the start of each time series, and starts to degrade at some point during the series. In the training set, the degradation grows in magnitude until a predefined threshold is reached beyond which it is not preferable to operate the engine. In the test set, the time series ends some time prior to complete degradation. The objective of the competition is to predict the number of remaining operational cycles before in the test set, i.e., the number of operational cycles after the last cycle that the engine will continue to operate properly. Usage The data are provided as a zip-compressed text file with 26 columns of numbers, separated by spaces. Each row is a snapshot of data taken during a single operational cycle; each column is a different variable. The columns correspond to: 1) unit number 2) time, in cycles 3) operational setting 1 4) operational setting 2 5) operational setting 3 6) sensor measurement 1 7) sensor measurement 2 ... 26) sensor measurement 26 Users are expected to train their algorithms using data in the file named train.txt. They must then evaluate the RUL prediction performance on data provided in file test.txt. Associated true RUL values are not being revealed just like in the competition. Very soon the users will be able to use a web application to upload their results and get an aggregate score feedback. Please check back in June 2010 to get more details on the web application for automated scoring. Until then they can get a feedback by emailing their results through simple text files to any of the email addresses provided below. A set of representative top 20 scores obtained during the competition are included here to provide a reference to everyone. No. Score 1 436.841 2 512.426 3 737.769 4 809.757 5 908.588 6 975.586 7 1,049.57 8 1,051.88 9 1,075.16 10 1,083.91 11 1,127.95 12 1,139.83 13 1,219.61 14 1,263.02 15 1,557.61 16 1,808.75 17 1,966.38 18 2,065.47 19 2,399.88 20 2,430.42 Evaluation The final score is a weighted sum of RUL errors. The scoring function is an asymmetric function that penalizes late predictions more than the early predictions. (Please see attached documentation for details) Once algorithms are trained to satisfaction, users can apply them to the final test dataset contained in the file named final_test.txt. Users should send the vector of RULs for the final test set to the PHM Society for evaluation. A score will be mailed back soon. Researchers are encouraged to publish their results regardless of the absolute performance if they believe there is novelty in their algorithm. The intent is to develop innovative approaches for prognostics. Note: Any team or individual is allowed to submit their results on the final test set ONLY ONCE. Data Set: train.txt, test.txt Train trjectories: 218 Test trajectories: 218 final_test trajectories: 435 Contacts Abhinav Saxena – abhinav.saxena@nasa.gov, 650-604-3208 Kai Goebel – kai.goebel@nasa.gov References A. Saxena, K. Goebel, D. Simon, and N. Eklund, “Damage Propagation Modeling for Aircraft Engine Run-to-Failure Simulation”, in the Proceedings of the 1st International Conference on Prognostics and Health Management (PHM08), Denver CO, Oct 2008.

近期下载者

相关文件


收藏者