54 results
  • Abstract: A semi-structured document has more structured information than an ordinary document, and the relations among semi-structured documents can be fully utilized. In order to take advantage of the structure and link information in a semi-structured document for better mining, a structured link vector model (SLVM) is presented in this paper, where a vector represents a document, and the vector's elements are determined by terms, document structure and neighboring documents. Text mining based on SLVM is described in terms of the K-means procedure for brevity and clarity: calculating document similarity and calculating the cluster center. In the experiments, clustering based on SLVM performs significantly better than clustering based on a conventional vector space model, and its F value increases from 0.65-0.73 to 0.82-0.86.

  • Tags: HTML, XML, semi-structured document model, text mining, structural information
  • Abstract: Sequential pattern mining is an important data mining problem with broad applications. However, it is also a challenging problem, since the mining may have to generate or examine a combinatorially explosive number of intermediate subsequences. Recent studies have developed two major classes of sequential pattern mining methods: (1) a candidate generation-and-test approach, represented by (i) GSP, a horizontal format-based sequential pattern mining method, and (ii) SPADE, a vertical format-based method; and (2) a pattern-growth method, represented by PrefixSpan and its further extensions, such as gSpan for mining structured patterns. In this study, we give a systematic introduction to and presentation of the pattern-growth methodology and study its principles and extensions. We first introduce two interesting pattern-growth algorithms, FreeSpan and PrefixSpan, for efficient sequential pattern mining. Then we introduce gSpan for mining structured patterns using the same methodology. Their relative performance in large databases is presented and analyzed. Several extensions of these methods are also discussed in the paper, including mining multi-level, multi-dimensional patterns and mining constraint-based patterns.

  • Tags: data mining, sequential pattern mining, scalability, performance analysis
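The pattern-growth idea named in the abstract above can be illustrated with a minimal sketch in the spirit of PrefixSpan. It is not the authors' implementation: it assumes each sequence is a list of single items (no itemsets), and the names `prefixspan`, `grow`, and `min_support` are hypothetical. Each frequent prefix is extended only with items found in its projected suffixes, so no candidate set is generated and tested.

```python
from collections import defaultdict


def prefixspan(sequences, min_support):
    """Return {pattern tuple: support} for sequences of single items.

    Simplified illustration: every element of a sequence is one item,
    and support is the number of sequences containing the pattern.
    """
    results = {}

    def grow(prefix, projected):
        # projected holds (sequence index, offset) pairs; the suffix
        # sequences[sid][offset:] is what remains after matching prefix.
        supporters = defaultdict(set)
        for sid, start in projected:
            for item in set(sequences[sid][start:]):
                supporters[item].add(sid)

        for item, sids in sorted(supporters.items()):
            if len(sids) < min_support:
                continue
            pattern = prefix + (item,)
            results[pattern] = len(sids)

            # Build the projected database for the extended pattern by
            # skipping past the first occurrence of item in each suffix.
            next_projected = []
            for sid, start in projected:
                seq = sequences[sid]
                try:
                    pos = seq.index(item, start)
                except ValueError:
                    continue
                next_projected.append((sid, pos + 1))
            grow(pattern, next_projected)

    grow((), [(sid, 0) for sid in range(len(sequences))])
    return results


if __name__ == "__main__":
    db = [list("abc"), list("acb"), list("ab")]
    for pattern, support in sorted(prefixspan(db, 2).items()):
        print("".join(pattern), support)
```

The recursion only ever scans suffixes that already contain the current prefix, which is the property the abstract contrasts with candidate generation-and-test methods such as GSP.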
  • Abstract: Geological Prospecting and Mining in Tibet. DONDUINAMGYI. September 1, 1995 marked the 30th anniv...

  • Tags:
  • Abstract: Huainan Coal Mining Bureau, a very large coal enterprise and a state key coal production base, is situated in the central-northern part of Anhui Province. The area, well known as "the coal capital of East China", abounds in coal resources, and the proven coal reserve is estimated at up to 70 billion tons, with a complete range of varieties and superior quality. By the year 2010, the annual production capacity will reach 30 million tons. There are an excellent investment environment and convenient communication and transportation

  • Tags:
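  • Abstract: Text linguistics and translation studies; the discussion then turns to the text-linguistic approach to translation studies, as well as the scope, research focus and research methods of text-based translation studies, that is, the text-linguistic approach to translation studies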

  • Tags:
  • Abstract: This paper examines the approach used by middle school teachers to teaching texts and argues against the traditional practice of exploiting texts just to teach grammar and vocabulary. A more balanced approach is presented, involving all four language skills, concentrating on output as well as input, and training students to extract relevant information from texts, making them more efficient readers.

  • Tags: 群口
  • Abstract: The paper describes a texture-based fast text location scheme which operates directly in the Discrete Wavelet Transform (DWT) domain. Using the distinguishing texture characteristics encoded in the wavelet transform domain, text is quickly detected in complex background images stored in compressed formats such as JPEG2000, without full decompression. Compared with some traditional character location methods, the proposed scheme has the advantages of low computational cost, robustness to character size and font, and high accuracy. Preliminary experimental results show that the proposed scheme is efficient and effective.

  • Tags: discrete wavelet transform, semantic content, texture analysis, image indexing
  • Abstract: With massive amounts of data stored in databases, mining information and knowledge in databases has become an important issue in recent research. Researchers in many different fields have shown great interest in data mining and knowledge discovery in databases. Several emerging applications in information providing services, such as data warehousing and on-line services over the Internet, also call for various data mining and knowledge discovery techniques to understand user behavior better, to improve the service provided, and to increase business opportunities. In response to such a demand, this article provides a comprehensive survey of recently developed data mining and knowledge discovery techniques and introduces some real application systems as well. In conclusion, the article also lists some problems and challenges for further research.

  • Tags: database, knowledge discovery, machine learning, data mining
  • Abstract: This paper presents a new way to extract concepts that can be used to improve text classification performance (precision and recall). The computational measure is divided into two layers. The bottom layer, called the document layer, is concerned with extracting the concepts of a particular document, and the upper layer, called the category layer, is concerned with finding the description and subject concepts of a particular category. The relevant implementation algorithm, which dramatically decreases the search space, is discussed in detail. The experiment, based on real-world data collected from InfoBank, shows that the approach is superior to the traditional ones.

  • Tags: concept, computational method, algorithm, text, classification, effectiveness
  • Abstract: The one thing that most interferes with the reading of learners of English as a Second Language is unknown vocabulary. This refers to any word which blocks the reader's understanding in the process of reading. When reading, one will inevitably meet unknown vocabulary no matter how large one's mental lexicon is. Often the density of unfamiliar words makes the reader feel greatly frustrated and give up in the end, having already lost the train of thought while looking up words in the dictionary. It is clear that too great a density of unknown lexical items slows down reading speed, which leads to poor comprehension of the text. Of course, one's knowledge of English vocabulary is bound to be limited; but is there a way one can efficiently cope with the unknown words one comes across in reading without having to stop and look them up in the dictionary?

  • Tags: fishery, yucca
  • Abstract: Land resources are facing the crisis of being misused, especially in the intersection area between town and country, and land control has to be enforced. This paper presents the development of a data mining method for land control. A vector-match method is proposed for data cleaning, the prerequisite of data mining; it deals with both character and numeric data by vectorizing character strings and matching numbers. A minimal decision algorithm of rough sets is used to discover the knowledge hidden in the data warehouse. In order to monitor land use dynamically and accurately, it is suggested that a real-time land control system be set up based on GPS, digital photogrammetry and online data mining. Finally, the method is applied to the intersection area between town and country of Wuhan city, and a set of knowledge about land control is discovered.

  • Tags: land control, data mining, vector-match method
  • Abstract: This paper presents a fault-detection method for complex electronic systems based on phase space reconstruction and data mining approaches. The approach for the phase space reconstruction of chaotic time series is a combination algorithm of multiple autocorrelation and the Γ-test, by which the quasi-optimal embedding dimension and time delay can be obtained. The data mining algorithm, which calculates the radius of gyration of unit-mass points around the centre of mass in the phase space, can distinguish the fault parameter from the chaotic time series output by the tested system. The experimental results show that this fault detection method can correctly detect the fault phenomena of electronic systems.

  • Tags: data acquisition, fault detection, chaotic time series, phase space reconstruction, topological structure
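The two steps named in the abstract above, phase space reconstruction and the radius of gyration about the centre of mass, can be sketched as follows. This is an illustration under stated assumptions, not the paper's algorithm: the embedding dimension `dim` and time delay `delay` are taken as given (the paper selects them with multiple autocorrelation and the Γ-test, which is not reproduced here), and the function names are hypothetical.

```python
import math


def delay_embed(series, dim, delay):
    """Time-delay embedding: point i is
    (x[i], x[i + delay], ..., x[i + (dim - 1) * delay])."""
    count = len(series) - (dim - 1) * delay
    if count <= 0:
        raise ValueError("series too short for this dim and delay")
    return [[series[i + k * delay] for k in range(dim)] for i in range(count)]


def radius_of_gyration(points):
    """Root-mean-square distance of unit-mass points from their centroid."""
    n = len(points)
    dim = len(points[0])
    centre = [sum(p[k] for p in points) / n for k in range(dim)]
    mean_sq = sum(
        sum((p[k] - centre[k]) ** 2 for k in range(dim)) for p in points
    ) / n
    return math.sqrt(mean_sq)


if __name__ == "__main__":
    signal = [math.sin(0.3 * t) for t in range(200)]
    cloud = delay_embed(signal, dim=3, delay=5)
    print(len(cloud), radius_of_gyration(cloud))
```

Under this reading, a change in the radius computed over successive windows of the output signal would be the quantity compared against a fault-free baseline; how the paper thresholds it is not stated in the abstract.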
  • Abstract: Outlier mining is an important aspect of data mining, and outlier mining based on the Cook distance is the most commonly used. However, when the data exhibit multicollinearity, the traditional Cook method is no longer effective. Considering the advantages of principal component estimation, we use it in place of least squares estimation and then give a Cook distance measure based on principal component estimation, which can be used in outlier mining. We have also done some research on related theories and application problems.

  • Tags: outlier mining, principal component estimation, Cook distance, digital mining, linear regression model
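For reference, the traditional least-squares Cook distance that the abstract above takes as its starting point can be sketched for the simplest case of one predictor plus an intercept. This is the classical measure only; the principal-component variant the paper proposes is not specified in the abstract and is not attempted here. The function name `cooks_distances` is hypothetical.

```python
def cooks_distances(x, y):
    """Cook's distance for each point of a simple linear regression y ~ a + b*x.

    Uses D_i = e_i^2 / (p * s^2) * h_i / (1 - h_i)^2 with p = 2 parameters,
    leverage h_i = 1/n + (x_i - mean(x))^2 / Sxx, and s^2 = SSE / (n - p).
    Assumes n > 2, x not constant, and a fit that is not exact (s^2 > 0).
    """
    n = len(x)
    p = 2
    x_mean = sum(x) / n
    y_mean = sum(y) / n
    sxx = sum((xi - x_mean) ** 2 for xi in x)
    sxy = sum((xi - x_mean) * (yi - y_mean) for xi, yi in zip(x, y))

    slope = sxy / sxx
    intercept = y_mean - slope * x_mean
    residuals = [yi - (intercept + slope * xi) for xi, yi in zip(x, y)]
    s2 = sum(e * e for e in residuals) / (n - p)

    distances = []
    for xi, e in zip(x, residuals):
        leverage = 1 / n + (xi - x_mean) ** 2 / sxx
        distances.append(e * e / (p * s2) * leverage / (1 - leverage) ** 2)
    return distances


if __name__ == "__main__":
    xs = [1, 2, 3, 4, 5, 6]
    ys = [2, 4, 6, 8, 10, 30]  # the last point departs from y = 2x
    for xi, d in zip(xs, cooks_distances(xs, ys)):
        print(xi, round(d, 3))
```

With several correlated predictors the leverage and variance terms come from the full design matrix, which is where multicollinearity makes the least-squares version unstable, as the abstract notes.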
  • Abstract: Recent years have witnessed a resurgence of interest in the epic poetry of Valerius Flaccus and his contemporaries in the Silver Age, leading to the appreciation of these later epics not only for their intrinsic value but also for their contribution to our understanding of epic in the preceding Golden Age, the principal example of which is, of course, Vergil's Aeneid. For its rehabilitation to popularity, Valerius Flaccus's Argonautica is indebted to the perceptive analysis of the practitioners of literary criticism. With respect to ancient texts, however, literary criticism

  • Tags: 贝朋
  • Abstract: In this paper, ARMiner, a data mining tool based on association rules, is introduced. Beginning with the system architecture, the characteristics and functions are discussed in detail, including data transfer, concept hierarchy generalization, mining rules with negative items and the re-development of the system. An example of the tool's application is also shown. Finally, some issues for future research are presented.

  • Tags: ARMiner, data mining tool, machine learning
  • Abstract: A backdoor or information leak in Web servers can be detected by applying Web mining techniques to abnormal Web log and Web application log data. The security of Web servers can thereby be enhanced and the damage of illegal access avoided. First, a system for discovering the patterns of information leakage in CGI scripts from Web log data is proposed. Second, those patterns are provided to system administrators so that they can modify their code and enhance their Web site security. The following aspects are described: one is to combine the Web application log with the Web log to extract more information, so that Web data mining can be used to mine the Web log for information that the firewall and Information Detection System cannot find. Another is to propose an operation module for the Web site to enhance Web site security. In the cluster server session, a density-based clustering technique is used to reduce resource cost and obtain better efficiency.

  • Tags: Web, network security, data mining, computer networks, logical reasoning