网站的规划建设与分析,网站推广国外,怎么做网站的搜索引擎,佛山旺道seo为什么“单词”被省略了#xff1a;单词的本质是含义简单且可以高频重复的信息#xff0c;句子的本质是经过多个单词不断消歧最终包含指向性含义的信息。从基因角度来看#xff0c;大的片段相当于句子#xff0c;对这些片段再分段起单词作用#xff0c;密码子#xff08;… 为什么“单词”被省略了单词的本质是含义简单且可以高频重复的信息句子的本质是经过多个单词不断消歧最终包含指向性含义的信息。从基因角度来看大的片段相当于句子对这些片段再分段起单词作用密码子每三个核苷酸对应一个氨基酸本质上还是字母。从蛋白质角度来看二级结构中由氢键造成的较为规律的折叠、螺旋可以视作单词能实现特定功能的蛋白质才称得上句子。参考文献理论基础思想很重要但论证得并不好Cadeddu, A., Wylie, E. K., Jurczak, J., Wampler‐Doty, M., Grzybowski, B. A. (2014). Organic chemistry as a language and the implications of chemical linguistics for structural and retrosynthetic analyses. Angewandte Chemie International Edition, 53(31), 8108-8112.综述类关联NLP方法和应用领域的表格挺有价值的Öztürk, H., Özgür, A., Schwaller, P., Laino, T., Ozkirimli, E. (2020). Exploring chemical space using natural language processing methodologies for drug discovery. Drug Discovery Today, 25(4), 689-705.首度提出Protein Vector(Protvec)和Gene Vector(Genevec)的概念Asgari, E., Mofrad, M. R. K. (2015). Continuous distributed representation of biological sequences for deep proteomics and genomics. PLoS ONE, 10(11), 1–15.Protein与word embedding的结合Bepler, T., Berger, B. (2019). Learning protein sequence embeddings using information from structure. 7th International Conference on Learning Representations, ICLR 2019, 1–17.虽然漫画中将2018年Schwaller发表的Seq2Seq被期刊接收且效果好见6视作这个方法在生物分子领域的第一次成功应用但做这方面的论文一般都会引用这篇作为一切故事的开端。两个韩国高中生的作业能做到这样真的很厉害了Nam, J., Kim, J. (2016). Linking the neural machine translation and the prediction of organic chemistry reactions. arXiv preprint arXiv:1612.09529.Seq2Seq最佳Schwaller, P., Gaudin, T., Lanyi, D., Bekas, C., Laino, T. (2018). “Found in Translation”: predicting outcomes of complex organic chemistry reactions using neural sequence-to-sequence models. Chemical science, 9(28), 6091-6098.另一篇比较有价值的Seq2SeqKarimi, M., Wu, D., Wang, Z., Shen, Y. (2019). DeepAffinity: Interpretable deep learning of compound-protein affinity through unified recurrent and convolutional neural networks. Bioinformatics, 35(18), 3329–3338.漂亮的标题漂亮的intro但内容不是很惊艳的BERT应用Vig, J., Madani, A., Varshney, L. R., Xiong, C., Socher, R., Rajani, N. F. (2020). Bertology meets biology: Interpreting attention in protein language models. arXiv preprint arXiv:2006.15222.萌屋作者白鹡鸰白鹡鸰jí líng是一种候鸟天性决定了会横跨很多领域。已在上海交大栖息四年目前以图像语义为食但私下也对自然语言很感兴趣喜欢在卖萌屋轻松不失严谨的氛围里浪~~形~~飞~~翔~~因为刚开始Ph.D.文章还统统是放在天上的卫星接下来会尽早与大家正式见面的知乎ID也是白鹡鸰欢迎造访。后台回复关键词【入群】加入卖萌屋NLP/IR/Rec与求职讨论群有顶会审稿人、大厂研究员、知乎大V和妹纸等你来撩哦~