电脑智能化融入博客圈
文章来源:未知 文章作者:enread 发布时间:2012-09-07 07:16 字体: [ ]  进入论坛
(单词翻译:双击或拖选)
Can a computer "read" an online blog and understand it? Several Concordia computer scientists are helping1 to get closer to that goal. Leila Kosseim, associate professor in Concordia's Faculty2 of Engineering and Computer Science, and a recently-graduated doctoral student, Shamima Mithun, have developed a system called BlogSum that has potentially vast applications. It allows an organization to pose a question and then find out how a large number of people talking online would respond. The system is capable of gauging3 things like consumer preferences and voter intentions by sorting through websites, examining real-life self-expression and conversation, and producing summaries that focus exclusively on the original question.
 
"Huge quantities of electronic texts have become easily available on the Internet, but people can be overwhelmed, and they need help to find the real content hiding in the mass of information," explains Kosseim, one of the lead researchers at Concordia's Computational Linguistics4 Laboratory (CLaC lab).
 
Analyzing5 informally-written language poses unique challenges compared to analyzing, for example, a news article. Blogs, forums6 and the like contain opinions, emotions and speculations7, not to mention spelling errors and poor grammar. A summarization tool must address two particular problems, question irrelevance8 (sentences that are not relevant to the main question), and discourse9 incoherence, (sentences in which the intent of the writer is unclear).
 
BlogSum met these challenges with demonstrable efficiency. The researchers developed and tested their tool by examining a set of blogs and review sites. BlogSum used "discourse relations" to crunch10 the data -- ways of filtering and ordering sentences into coherent summaries. BlogSum was measured against prior computational rankings and achieved mostly superior results. In addition, it was evaluated by actual human subjects, who also found it to be superior. Summaries produced by BlogSum reduced question irrelevance and discourse incoherence, successfully distilling11(蒸馏的) large amounts of text into highly readable summaries.
 
This study is an example of Natural Language Processing (NLP), in which Concordia, through the CLaC lab, is a leader. NLP stands at the intersection12 of artificial intelligence and linguistics, seeking to enable computers to derive13 meaning from human language.
 
"The field of natural language processing is starting to become fundamental to computer science, with many everyday applications -- making search engines find more relevant documents or making smart phones even smarter," explained Kosseim.


点击收听单词发音收听单词发音  

1 helping 2rGzDc     
n.食物的一份&adj.帮助人的,辅助的
参考例句:
  • The poor children regularly pony up for a second helping of my hamburger. 那些可怜的孩子们总是要求我把我的汉堡包再给他们一份。
  • By doing this, they may at times be helping to restore competition. 这样一来, 他在某些时候,有助于竞争的加强。
2 faculty HhkzK     
n.才能;学院,系;(学院或系的)全体教学人员
参考例句:
  • He has a great faculty for learning foreign languages.他有学习外语的天赋。
  • He has the faculty of saying the right thing at the right time.他有在恰当的时候说恰当的话的才智。
3 gauging 43b7cd74ff2d7de0267e44c307ca3757     
n.测量[试],测定,计量v.(用仪器)测量( gauge的现在分词 );估计;计量;划分
参考例句:
  • The method is especially attractive for gauging natural streams. 该方法对于测量天然的流注具有特殊的吸引力。 来自辞典例句
  • Incommunicative as he was, some time elapsed before I had an opportunity of gauging his mind. 由于他不爱说话,我过了一些时候才有机会探测他的心灵。 来自辞典例句
4 linguistics f0Gxm     
n.语言学
参考例句:
  • She plans to take a course in applied linguistics.她打算学习应用语言学课程。
  • Linguistics is a scientific study of the property of language.语言学是指对语言的性质所作的系统研究。
5 analyzing be408cc8d92ec310bb6260bc127c162b     
v.分析;分析( analyze的现在分词 );分解;解释;对…进行心理分析n.分析
参考例句:
  • Analyzing the date of some socialist countries presents even greater problem s. 分析某些社会主义国家的统计数据,暴露出的问题甚至更大。 来自辞典例句
  • He undoubtedly was not far off the mark in analyzing its predictions. 当然,他对其预测所作的分析倒也八九不离十。 来自辞典例句
6 forums 68daf8bdc8755fe8f4859024b3054fb8     
讨论会; 座谈会; 广播专题讲话节目; 集会的公共场所( forum的名词复数 ); 论坛,讨论会,专题讨论节目; 法庭
参考例句:
  • A few of the forums were being closely monitored by the administrators. 有些论坛被管理员严密监控。
  • It can cast a dark cloud over these forums. 它将是的论坛上空布满乌云。
7 speculations da17a00acfa088f5ac0adab7a30990eb     
n.投机买卖( speculation的名词复数 );思考;投机活动;推断
参考例句:
  • Your speculations were all quite close to the truth. 你的揣测都很接近于事实。 来自《现代英汉综合大词典》
  • This possibility gives rise to interesting speculations. 这种可能性引起了有趣的推测。 来自《用法词典》
8 irrelevance 05a49ed6c47c5122b073e2b73db64391     
n.无关紧要;不相关;不相关的事物
参考例句:
  • the irrelevance of the curriculum to children's daily life 课程与孩子们日常生活的脱节
  • A President who identifies leadership with public opinion polls dooms himself to irrelevance. 一位总统如果把他的领导和民意测验投票结果等同起来,那么他注定将成为一个可有可无的人物。 来自辞典例句
9 discourse 2lGz0     
n.论文,演说;谈话;话语;vi.讲述,著述
参考例句:
  • We'll discourse on the subject tonight.我们今晚要谈论这个问题。
  • He fell into discourse with the customers who were drinking at the counter.他和站在柜台旁的酒客谈了起来。
10 crunch uOgzM     
n.关键时刻;艰难局面;v.发出碎裂声
参考例句:
  • If it comes to the crunch they'll support us.关键时刻他们是会支持我们的。
  • People who crunch nuts at the movies can be very annoying.看电影时嘎吱作声地嚼干果的人会使人十分讨厌。
11 distilling f3783a7378d04a2dd506fe5837220cb7     
n.蒸馏(作用)v.蒸馏( distil的过去式和过去分词 )( distilled的过去分词 );从…提取精华
参考例句:
  • Water can be made pure by distilling it. 水经蒸馏可变得纯净。 来自《简明英汉词典》
  • More ammonium sulphate solution is being recovered in the process of distilling oil shale. 在提炼油页岩的过程中回收的硫酸铵液比过去多了。 来自《简明英汉词典》
12 intersection w54xV     
n.交集,十字路口,交叉点;[计算机] 交集
参考例句:
  • There is a stop sign at an intersection.在交叉路口处有停车标志。
  • Bridges are used to avoid the intersection of a railway and a highway.桥用来避免铁路和公路直接交叉。
13 derive hmLzH     
v.取得;导出;引申;来自;源自;出自
参考例句:
  • We derive our sustenance from the land.我们从土地获取食物。
  • We shall derive much benefit from reading good novels.我们将从优秀小说中获得很大好处。
TAG标签: computer internet blog
发表评论
请自觉遵守互联网相关的政策法规,严禁发布色情、暴力、反动的言论。
评价:
表情:
验证码:点击我更换图片