WEBVTT 1 00:00:14.100 --> 00:00:15.520 当然你要不断的精炼 Of course, you have to constantly refine. 2 00:00:15.520 --> 00:00:19.450 比如说通过学科领域来经验 For example, experience through the subject area. 3 00:00:19.830 --> 00:00:22.650 我前面可能没有讲 I may not have said it before. 4 00:00:22.910 --> 00:00:25.600 大家比如说我是城市规划的 For example, I am a city planner. 5 00:00:25.940 --> 00:00:33.190 我就通过奔到底死了纪念本思代得只有73篇钟数 I was dead by running to the end to commemorate Ben instead of only 73 hours. 6 00:00:34.180 --> 00:00:35.740 我点一下精炼一下 I'll have it refined. 7 00:00:35.740 --> 00:00:38.750 你就是城市研究方面的 You are a city researcher 8 00:00:38.750 --> 00:00:41.080 然后如果说我是环境的学生 And if I'm an environmental student 9 00:00:41.210 --> 00:00:43.910 我就看环境的75篇对吧 I will look at the 75 articles of the environment, right? 10 00:00:43.910 --> 00:00:46.710 那么就是这样七十几篇可能做出来 Well, that's it. 11 00:00:46.710 --> 00:00:50.860 还不一定是我所不是所有的都是我要的 Not all I want. 12 00:00:50.860 --> 00:00:55.420 那么答比方说你就选择了其中的一篇了 So for example, you chose one of them. 13 00:00:55.950 --> 00:00:58.960 比如说选择了这一篇的中书文献 For example, selected this one of the Chinese literature. 14 00:00:58.960 --> 00:01:02.350 发现跟你兴趣特别相关 Find it particularly relevant to your interests. 15 00:01:02.550 --> 00:01:04.440 这个时候我们就要看 We'll see it at this time. 16 00:01:05.410 --> 00:01:09.040 比如说他这里有479篇的背影文献 For example, here he has 479 back documents. 17 00:01:10.390 --> 00:01:14.910 被引频次也就是说有他被引用了479次 Was cited 479 times, which means he was quoted times. 18 00:01:14.910 --> 00:01:17.980 这是一个比较高的数字也 This is a higher number, too. 19 00:01:18.240 --> 00:01:20.360 一般来讲就是一篇文献 Generally speaking, it is a document. 20 00:01:22.010 --> 00:01:24.320 综述文献很容易被背影 Literature review is easy to be viewed from behind. 21 00:01:24.520 --> 00:01:28.260 那么479还是比较多的一个文献 Then 479 is still a relatively large document. 22 00:01:28.260 --> 00:01:30.290 那点一下的话 At that point 23 00:01:40.330 --> 00:01:41.040 不好意思 Excuse me 24 00:01:41.040 --> 00:01:42.980 我这两个数字不太对 My two figures are not quite right. 25 00:01:43.040 --> 00:01:45.220 但是不影响大家的理解 But it does not affect our understanding. 26 00:01:45.360 --> 00:01:51.810 就是说点进去以后就是有这么多引用他的文献会出来 That is to say, after the point, there will be so many references to him. 27 00:01:51.810 --> 00:01:55.620 这个就是我刚才说的通过文献 This is what I just said through the literature. 28 00:01:56.290 --> 00:01:57.010 背影文献 Background document 29 00:01:57.010 --> 00:02:00.980 我们可以去往深里头挖很多东西 We can dig a lot deeper. 30 00:02:00.980 --> 00:02:04.620 那么我自己的研究生写论文时候 So when my own graduate students write papers 31 00:02:04.620 --> 00:02:07.630 因为我们有时候做的题目会比较偏 Because we sometimes do the topic will be more biased. 32 00:02:07.630 --> 00:02:13.150 这样的话会积几乎很少找到这种相关文献 In this way, the relevant literature is rarely found. 33 00:02:13.150 --> 00:02:16.720 甚至可能就会发现说我用的词可能不太对 I may even find that the words I use may not be quite right. 34 00:02:16.890 --> 00:02:18.600 那么就要通过这种方法 Then this is the way 35 00:02:18.600 --> 00:02:21.090 就是你好不容易找到一篇以后 After you finally found one 36 00:02:21.090 --> 00:02:28.130 你就要通过被引文献是文献这样的一些角度去深入的去挖 You'll have to dig deep through the point that the cited document is a document. 37 00:02:28.130 --> 00:02:31.260 最后还是会挖出来很读者观文献的 In the end, we will dig out the literature of the reader. 38 00:02:33.110 --> 00:02:38.390 这个前面讲的就是webofscienceofnow This is about webofscienceofnow 39 00:02:38.390 --> 00:02:39.470 理解他 Understand him 40 00:02:39.570 --> 00:02:45.550 能够帮助我们做研究它可能的一些功能非常简单 It can help us to do research, and some of its possible functions are very simple. 41 00:02:45.850 --> 00:02:50.980 但是我讲的是非常少的一部分功能 But I'm talking about a very small part of the function. 42 00:02:50.980 --> 00:02:55.670 你们去自己到网站上去摸索一下 You go to the website and find out for yourself. 43 00:02:55.750 --> 00:02:57.730 马上太容易了 It will be too easy soon 44 00:02:57.850 --> 00:02:59.050 马上就可以知道 We'll find out right away 45 00:02:59.050 --> 00:03:02.420 我今天这个课无非就告诉你们有这么一些功能 I just want to tell you that there are some functions in this class today. 46 00:03:03.040 --> 00:03:05.050 还有一些其他的工具 There are other tools 47 00:03:05.090 --> 00:03:08.010 可能对学生来讲不是非常重要 It may not be very important to the students. 48 00:03:08.010 --> 00:03:16.850 但是对老师和对博士生可能会比较重要的接C或者是1SA也是 But for teachers and doctoral students may be more important to receive C or 1sa is also. 49 00:03:16.850 --> 00:03:20.000 webofnot的平台里头的东西 What's inside the webofnot platform 50 00:03:21.630 --> 00:03:25.370 有老师或者有研究生在这里的话 If there is a teacher or a graduate student here 51 00:03:25.370 --> 00:03:27.530 你们可以去自己去摸一下 You can go and touch it yourself. 52 00:03:27.630 --> 00:03:32.170 因为jc是对期刊的一些分类的 Because JC has some sort of journal 53 00:03:32.440 --> 00:03:33.500 这个研究 This research 54 00:03:34.100 --> 00:03:38.800 尤其是对你去研究这个领域的哪个期刊影响因子高 In particular, which journals you study in this field have a high influence factor. 55 00:03:39.290 --> 00:03:40.960 我要投在哪个区 Which area should I vote in? 56 00:03:41.350 --> 00:03:43.810 因为现在还讲一区二区三区四区 Because now we are talking about district one, district two, district three and district four. 57 00:03:43.810 --> 00:03:48.460 那么杰CR也是在外包和劳累机里头的一个数据库 So, JCR is also a database in the outsourcing and labor machines. 58 00:03:48.840 --> 00:03:51.010 大家可以回去看一下 Everyone can go back and have a look. 59 00:03:51.010 --> 00:03:57.940 然后ESCA它可以去研究机构和学科 Then ESCA, it can go to research institutions and disciplines. 60 00:03:58.130 --> 00:04:02.590 现在校长们会比较关心 Now the principals will be more concerned 61 00:04:02.750 --> 00:04:07.500 我们学校有几个学科进入了全球的前10% Our school has several subjects in the top 10% in the world. 62 00:04:07.910 --> 00:04:09.800 前1%钱 The first 1 63 00:04:09.800 --> 00:04:10.860 1‰ 1 per thousand 64 00:04:11.240 --> 00:04:17.480 还有有哪些高就是叫高被引论文就叫热热点任务 There are also what is called the high-cited papers called hot hot task. 65 00:04:18.120 --> 00:04:20.810 哪些是需要讲的 What needs to be said? 66 00:04:20.810 --> 00:04:27.790 因为好像说科研处已经开始要制定这样的这个奖励计划 Because it seems that the scientific research department has begun to develop such a reward plan. 67 00:04:27.790 --> 00:04:28.430 对 Right 68 00:04:30.060 --> 00:04:32.590 叫做高被引论文 It's called a high-cited paper 69 00:04:33.180 --> 00:04:34.920 热点要做奖励 Hot spots need to be rewarded 70 00:04:35.210 --> 00:04:38.130 那么如果说我们老师自己想要看一看 So if our teacher wants to have a look of his own 71 00:04:38.130 --> 00:04:42.330 说我的论文量跟高被引论文的差距在哪里等等的 What is the difference between my essay volume and the highly cited paper? 72 00:04:42.570 --> 00:04:43.060 可以看 Can see 73 00:04:43.060 --> 00:04:46.230 ESI那么在学生这里我就不再讲 ESI then I'll stop talking about it here in the students. 74 00:04:46.960 --> 00:04:53.030 然后除了我们图书馆有的数据库的分析工具 And then besides the analytical tools of the databases that we have in our library 75 00:04:53.570 --> 00:04:55.430 我刚才讲的是webofscience I was talking about webofscience 76 00:04:55.430 --> 00:04:57.980 然后大家中文的可以去看一下 Then everyone in Chinese can go and have a look. 77 00:04:57.980 --> 00:05:00.440 CNKIR尤其是本科生 Cnkir, especially undergraduates 78 00:05:00.440 --> 00:05:02.030 可能你们会用到 Maybe you'll use it 79 00:05:02.670 --> 00:05:03.500 挺好玩的 It's fun 80 00:05:03.500 --> 00:05:06.250 其实这个大家去走一走 In fact, let's go for a walk. 81 00:05:07.580 --> 00:05:10.290 这些功能还是很有意思的 These functions are still very interesting. 82 00:05:10.390 --> 00:05:12.310 那么除了这个以外 So in addition to this 83 00:05:12.750 --> 00:05:14.080 还有两个 There are two more 84 00:05:14.080 --> 00:05:18.880 我今天这个课里头还会介绍两个免费的工具 I will introduce two free tools in this class today. 85 00:05:18.880 --> 00:05:25.910 这两个工具可能比刚才我说的这些功能要更先进一些 These two tools may be more advanced than what I said just now. 86 00:05:25.910 --> 00:05:34.810 他们是用到了一些计算机文本挖掘和数据挖掘的技术 They are used in some computer text mining and data mining technology. 87 00:05:35.270 --> 00:05:38.430 那么是免费的开放的软件 So it's free and open software. 88 00:05:40.120 --> 00:05:42.020 有些学生会知道site Some students know site 89 00:05:42.020 --> 00:05:43.270 space是吧 Space is it 90 00:05:45.090 --> 00:05:46.790 大概的介绍都在这里 The general introduction is here. 91 00:05:46.920 --> 00:05:51.840 那么它其实是对一种知识图谱的一种展示 So it's actually a demonstration of a knowledge map 92 00:05:51.840 --> 00:05:53.120 一种显示 A display 93 00:05:53.940 --> 00:05:55.020 也是很好玩 It's fun, too. 94 00:05:56.320 --> 00:05:56.740 可能 Possibly 95 00:05:57.310 --> 00:06:01.730 我觉得我的学生好像都比较喜欢去玩这些东西 I think my students like to play with these things. 96 00:06:03.270 --> 00:06:05.770 做出来的图也比较炫 The picture made is also more dazzling. 97 00:06:06.250 --> 00:06:10.150 假如说在你们的专业研究的过程当中 In the course of your professional research 98 00:06:10.620 --> 00:06:14.500 利用一两幅这样的图去去说明 Use one or two of these to illustrate 99 00:06:14.500 --> 00:06:16.280 比如说你开题的时候 For example, when you open the question. 100 00:06:16.280 --> 00:06:24.330 你用这样的图去说明你题目的热门程度还是会比较炫的 You use such a picture to illustrate the popular degree of your topic will be more dazzling. 101 00:06:24.330 --> 00:06:26.470 因为据我所知 Because as far as I know 102 00:06:26.470 --> 00:06:30.990 用这些工具的还是在我们这个领域的人会比较多一点 There will be more people in our field with these tools. 103 00:06:32.720 --> 00:06:35.250 因为这个PPT是可以分享的 Because this PowerPoint can be shared 104 00:06:35.780 --> 00:06:37.580 大家也可以去看一下 You can also go and have a look. 105 00:06:38.160 --> 00:06:42.310 就是PPT这些帮助文档等等 Is PPT these help documents and so on 106 00:06:44.040 --> 00:06:46.390 这是一个系统界面的情况 This is a system interface case. 107 00:06:46.390 --> 00:06:50.660 反正所有的这种工具的界面都差不多 Anyway, the interface of all these tools is similar. 108 00:06:50.980 --> 00:06:55.780 然后你们这个年轻的话就是随便一摸就全部懂了 Then you, this young man, just feel it and understand it all. 109 00:06:56.560 --> 00:07:00.880 然后我在这里就是要给大家展示一下 And I'm here to show you 110 00:07:00.880 --> 00:07:02.840 就这个东西能做成啥样子 What can this thing do? 111 00:07:03.740 --> 00:07:11.690 我们做了一个一个自己做了一个项目 We did a project of our own. 112 00:07:11.830 --> 00:07:12.690 做了一个项目 Did a project 113 00:07:12.690 --> 00:07:18.560 然后我展示一下这个项目它的一些图片 Then I show you some pictures of this project. 114 00:07:18.990 --> 00:07:26.350 也就是说通过一个例子来告诉大家sex贝斯可以做哪些事情 That is to say, through an example, to tell you what sexbeth can do. 115 00:07:26.660 --> 00:07:28.650 我们做的是技术 What we do is technology 116 00:07:29.210 --> 00:07:33.390 技术竞争情报的研究的研究 Research on technical competitive intelligence 117 00:07:33.390 --> 00:07:34.130 前沿研究 Frontier research 118 00:07:34.130 --> 00:07:36.520 热点研究机构和期刊的分析 Analysis of hot research institutions and periodicals 119 00:07:36.940 --> 00:07:42.720 这个工具做出来的东西肯定会比webofsun私钥更好一些 This tool will certainly do better than the webofsun private key. 120 00:07:42.720 --> 00:07:44.670 更漂亮更深一些 More beautiful and deeper 121 00:07:45.550 --> 00:07:52.060 然后主要的发文机构可能乍一看很乱是吧 And then the main agencies may look messy at first, right? 122 00:07:52.060 --> 00:07:56.830 但是实际上它还是这个可以看出来一些规律 But in fact, it still shows some rules. 123 00:07:57.460 --> 00:07:58.720 圆圈的大小 The size of a circle 124 00:07:59.000 --> 00:08:01.560 表示的是论文数量的多少 It means the number of papers. 125 00:08:01.560 --> 00:08:06.480 然后这些线是表示了这些机构之间合作关系 Then these lines are the expressions of cooperation between these agencies. 126 00:08:09.220 --> 00:08:16.680 那么这种核心的节点肯定就是是这个领域的最重要的研究机构 So this core node is certainly the most important research institution in this field. 127 00:08:16.910 --> 00:08:19.890 那么比如说我们这个主题的话 So, for example, what we say on this subject. 128 00:08:19.890 --> 00:08:28.540 就会发现武汉大学北京科技情报所哈尔滨工大的管理学院等等会是 It will be found that Wuhan university, Beijing science and technology information Institute, Harbin Institute of technology management, and so on will be. 129 00:08:28.540 --> 00:08:32.010 比较重要的研究者 A more important researcher 130 00:08:32.010 --> 00:08:35.570 那么大家可能可以看到南京大学信息管理系 Then you may see the department of information management of Nanjing university. 131 00:08:36.070 --> 00:08:38.360 他发论文不少 He issues many papers. 132 00:08:38.360 --> 00:08:40.920 但是他很少跟别人做合作 But he rarely cooperates with others. 133 00:08:42.520 --> 00:08:51.090 这种就是非常形象的展示了研究机构他的合作关系 This is the very image of the research Institute, his partnership. 134 00:08:51.640 --> 00:08:56.060 他就跟人家都不搭嘎在这种地方 He doesn't even talk to people in places like this. 135 00:08:56.700 --> 00:08:58.180 那么这里头的机构 So the organization in here 136 00:08:58.370 --> 00:09:00.860 它的合作研究就比较多 It has more cooperative research. 137 00:09:01.220 --> 00:09:04.500 像武大武汉大学管理信息管理学院他就比较多 He is more like the school of management information management at Wuhan university of Wuhan. 138 00:09:10.100 --> 00:09:14.370 然后我们看一下是研究的知识基础 Then we look at the knowledge base of research. 139 00:09:14.620 --> 00:09:18.060 这个表格如果大家这样看一下的话 If you look at this form like this 140 00:09:18.250 --> 00:09:20.360 可能是觉得没啥了不起 Maybe it's nothing. 141 00:09:20.360 --> 00:09:20.840 对吧 Right 142 00:09:20.840 --> 00:09:25.600 那么这里的一个最关键的指标就是它的中心性 So one of the key indicators here is its centrality. 143 00:09:25.820 --> 00:09:29.090 根据中心性的排序 Sort by centricity 144 00:09:29.090 --> 00:09:31.920 就是它的大小的排序 Is the sort of size 145 00:09:32.270 --> 00:09:36.860 我们可以得出来这样一个表格 We can come up with such a form. 146 00:09:36.860 --> 00:09:39.020 这个表格排在最上面的 This form is on the top. 147 00:09:39.020 --> 00:09:48.910 也就是说中心性最高的这个值最高的文献是在这个领域最重要的一个 In other words, the central highest value of the highest literature in this field is the most important one. 148 00:09:48.910 --> 00:09:49.710 文献 Literature 149 00:09:49.890 --> 00:09:52.910 它是这个领域的研究知识基础 It is the basis of research knowledge in this field. 150 00:09:53.330 --> 00:09:57.400 那么这个跟我刚才说的背影又不完全一样 So this is not exactly the same as the back I just said. 151 00:09:57.670 --> 00:10:01.060 背影其实实际上还是比较外在的 In fact, the shadow is actually more external. 152 00:10:01.310 --> 00:10:03.180 他不涉及内容 He didn't touch on the content. 153 00:10:03.180 --> 00:10:06.910 只是说某一个作者在发表论文的时候 Just to say that an author is publishing a paper 154 00:10:06.910 --> 00:10:08.530 写了一下我的参考文献 Wrote my references 155 00:10:08.530 --> 00:10:10.100 我的引用文献是什么 What is my citation? 156 00:10:10.260 --> 00:10:15.130 是非常外外在的这样的一种关系 Is a very external such relationship. 157 00:10:15.130 --> 00:10:20.020 但是赛CIS中的中心性的分析 But the analysis of centricity in cicis 158 00:10:20.020 --> 00:10:25.100 它是基于内容的基于所有文件的内容分析来做的 It is based on content analysis of all files. 159 00:10:25.100 --> 00:10:28.670 它用了文本挖掘的这样的一些技术 It uses such techniques as text mining. 160 00:10:28.670 --> 00:10:32.500 所以我们发现因为这个领域是我熟悉的领域 So we found that because this is a field that I am familiar with. 161 00:10:32.840 --> 00:10:36.340 然后我们发现做出来确实比较准 Then we found that it was really accurate. 162 00:10:37.320 --> 00:10:40.600 这些文献确实是我们这个领域 These documents are really our field. 163 00:10:40.600 --> 00:10:46.250 因为我前一阵我在做一个就是这个领域的准备 Because I was preparing for this field a while ago. 164 00:10:46.250 --> 00:10:52.100 要写一本这样的书肯定是我们要把所有的这个领域的文献全部收罗 To write a book like this, we must collect all the literature in this field. 165 00:10:52.100 --> 00:10:52.480 进来 Come in 166 00:10:52.480 --> 00:10:54.620 全部要读 All to read 167 00:10:56.250 --> 00:10:59.800 这个排序相对来说是真的是比较准 This sort is relatively true and accurate. 168 00:10:59.800 --> 00:11:01.510 我为什么在这里给大家讲 Why am I here to tell you? 169 00:11:01.510 --> 00:11:03.760 因为如果我要去做城镇化 Because if I had to do urbanization 170 00:11:03.990 --> 00:11:05.660 我是不知道它准不准的 I don't know if it's accurate. 171 00:11:05.680 --> 00:11:07.620 因为我不是城镇化领域的专家 Because I am not an expert in the field of urbanization. 172 00:11:07.620 --> 00:11:10.180 但是至少技术竞争情报这个领域 But at least in the field of technical competitive intelligence 173 00:11:10.220 --> 00:11:13.990 我都知道就是哪些人哪些文献是最经典的 I know who it is and what it is. 174 00:11:13.990 --> 00:11:16.430 然后确实是最好的文献 And then really the best literature 175 00:11:16.620 --> 00:11:17.420 通过cs Via CS 176 00:11:17.420 --> 00:11:18.740 cs跑下来 CS runs down 177 00:11:19.430 --> 00:11:21.930 确实这个表格是比较准的 It is true that this form is quite accurate. 178 00:11:23.560 --> 00:11:25.650 然后技术竞争 Then technical competition 179 00:11:25.650 --> 00:11:29.010 情报的研究前沿很好玩 The frontiers of intelligence research are fun. 180 00:11:29.010 --> 00:11:30.070 这是一个年代 This is an age. 181 00:11:30.070 --> 00:11:32.080 大家看的清楚吗 Can you see it clearly? 182 00:11:34.940 --> 00:11:36.620 这个投影好差 This projection is poor. 183 00:11:41.330 --> 00:11:44.320 你们大概看个大概这十年代 You can see about this decade 184 00:11:44.320 --> 00:11:48.790 98年01年004年这样三年一个条 In '98,' 01, '004. 185 00:11:48.990 --> 00:11:52.740 那么从这篇文献开始这样过来 So this is the way to start with this document 186 00:11:52.910 --> 00:11:53.680 过来 Come here 187 00:11:53.780 --> 00:11:55.980 然后到了研究前沿 And then to the research frontier 188 00:11:55.980 --> 00:12:02.370 实际上这个地方的研究前沿就是到了07年的时候 In fact, the research front of this place was in 2007. 189 00:12:02.940 --> 00:12:05.630 或者是说到了10年的时候 Or 10 years 190 00:12:05.860 --> 00:12:14.170 这些文献就是李燕和刘锡文这两位作者的文献确实是属于这个领域的 The literature of these two authors, Li Yan and Liu Xiwen, really belongs to this field. 191 00:12:14.170 --> 00:12:15.690 前沿的 Forward 192 00:12:17.870 --> 00:12:21.430 那么可能大家要说前言不就是最新的就行 Then perhaps we should say that the preface is not the latest on the line. 193 00:12:21.780 --> 00:12:22.640 不是这样的 That's not true 194 00:12:22.880 --> 00:12:27.430 就说06年07年发表了很多文献 In 2007, a lot of literature was published. 195 00:12:27.820 --> 00:12:31.610 但是通过斯贝斯这样跑下来 But run down through sibeisi like this 196 00:12:31.610 --> 00:12:34.690 实际上这两个人的研究确实是比较前沿 In fact, these two people's research is indeed relatively forward. 197 00:12:35.700 --> 00:12:38.570 当然它内部有很多很多规则 Of course it has a lot of rules inside 198 00:12:38.730 --> 00:12:43.390 就是文本分析的时候有很多规则更复杂了 There are a lot more complicated rules when it comes to text analysis. 199 00:12:43.390 --> 00:12:49.160 那么比如说他们的用的一些词在其他文献里头 So, for example, some of the words they use are in other documents. 200 00:12:49.160 --> 00:12:51.680 就在他们之后的文献里头 In the literature that followed them 201 00:12:52.210 --> 00:12:54.000 出现的频次可能比较多 There may be more frequency. 202 00:12:54.000 --> 00:12:58.470 或者说这样一个词验的文献里头是第一次出现 Or, it is the first time that such a word has appeared in the literature. 203 00:12:58.470 --> 00:12:59.880 但是他出现以后 But after he showed up 204 00:12:59.880 --> 00:13:04.300 这些其他的相关文献有很多跟他时间比较接近 Many of these other related documents are close to his time. 205 00:13:04.300 --> 00:13:07.900 但是比它后面的文献就会出现了等等 But the literature behind it will appear and so on. 206 00:13:07.900 --> 00:13:12.900 这样一些算法在文本挖掘的文本分析的算法里头会用到 Such algorithms will be used in text analysis algorithms for text mining. 207 00:13:13.190 --> 00:13:15.500 所以我刚刚也讲site So I just talked about site 208 00:13:15.500 --> 00:13:22.200 CS会比我们前面讲的webofscience要更就是更先进 CS will be even more advanced than what we talked about before. 209 00:13:22.200 --> 00:13:22.560 一些 Some 210 00:13:22.560 --> 00:13:25.050 它用到的技术也更好一些 It also uses better technology. 211 00:13:26.290 --> 00:13:26.570 好 Well 212 00:13:26.570 --> 00:13:30.320 然后这里是研究热点 And here is the research hotspot 213 00:13:31.090 --> 00:13:37.180 圆圈大小也是表示了他论文的数量 The size of the circle also means the number of his papers. 214 00:13:37.180 --> 00:13:40.160 就是有多少篇论文在讲这些词 How many papers are talking about these words? 215 00:13:40.160 --> 00:13:41.990 这个做得不是很好 This is not done very well 216 00:13:42.280 --> 00:13:42.680 为什么 Why 217 00:13:43.220 --> 00:13:47.810 实际上我们在做分析的时候要对数据做一些清洗 In fact, when we do analysis, we have to do some data cleaning. 218 00:13:47.810 --> 00:13:51.800 那么比如说像技术竞争情报这种词就可以不要放进去 So for example, such as technical competitive intelligence can not be put in the word. 219 00:13:51.800 --> 00:13:55.650 因为我本身就是在分析技术竞争情报是吧 Because I'm just analyzing technical competitive intelligence, right? 220 00:13:55.650 --> 00:13:59.020 那么大家如果说不是把这个去掉的话 So, if we don't take this out, 221 00:13:59.020 --> 00:13:59.780 大家可以看到 You can see 222 00:13:59.780 --> 00:14:03.460 比如说技术竞争是很重要的一个词是吧 For example, technical competition is a very important word, right? 223 00:14:03.460 --> 00:14:08.440 然后技术创新也是在这个领域很重要的词 Then technological innovation is also a very important word in this field. 224 00:14:08.440 --> 00:14:14.320 然后这些这些线就是表示它们的贡献关系 Then these lines represent their contribution relationship. 225 00:14:14.320 --> 00:14:17.800 就说一篇文献可能在讲到技术竞争的时候 Just say that a literature may be talking about technical competition. 226 00:14:17.800 --> 00:14:20.200 可能经常也会讲到情报研究 Information research may also often be mentioned. 227 00:14:20.560 --> 00:14:24.200 还有就是说会讲到这样的一些词 There are some words that will be said. 228 00:14:24.300 --> 00:14:28.140 大家可能科技创新技术竞争力或者技术转移 We may be scientific and technological innovation, technological competitiveness or technology transfer. 229 00:14:28.500 --> 00:14:32.320 这些就属于线条互相之间的关系 These are the relationships between the lines. 230 00:14:32.690 --> 00:14:35.200 那么从这种图我们可以看到 So we can see from this graph 231 00:14:35.200 --> 00:14:37.400 一看到研究热点 On seeing the research hotspot 232 00:14:38.630 --> 00:14:40.740 颜色大小 Color size 233 00:14:41.080 --> 00:14:41.420 对吧 Right 234 00:14:41.420 --> 00:14:42.820 这些都是研究热点 These are the research hot spots. 235 00:14:42.820 --> 00:14:46.800 还有就是词与词之间的这种关系贡献的关系 There is also the relationship between words and the contribution of the relationship between words. 236 00:14:46.800 --> 00:14:49.610 就是共同出现共同出现的关系 Is the common emergence of a common relationship. 237 00:14:49.610 --> 00:14:56.940 那么表示说很多研究可能就是开放式创新和技术创新以及科技信息 In that case, a lot of research may be open innovation and technological innovation, as well as scientific and technological information. 238 00:14:56.940 --> 00:14:59.910 分析这样的一些东西结合在一起 To analyze such things together. 239 00:15:03.400 --> 00:15:07.010 这个是反映期刊之间的关系的 This is a reflection of the relationship between journals. 240 00:15:07.010 --> 00:15:16.270 比如说科技管理研究和图书情报工作 For example, science and technology management, research and library and information work. 241 00:15:16.540 --> 00:15:18.520 科研管理这样的一些文献 ... documents such as scientific research management. 242 00:15:18.520 --> 00:15:21.770 它互相之间饮用的关系比较多 It has more to do with each other's drinking. 243 00:15:22.040 --> 00:15:24.970 供不是互相之间被引用的关系比较多 There are more relationships between references than references. 244 00:15:26.290 --> 00:15:28.770 我这里不解释共被引的概念 I do not explain the concept of co-citation here. 245 00:15:29.130 --> 00:15:34.950 只是说从这个图可以大家可以看到说在一起比较近的就说明这两本 Just to say that from this map, you can see that together, the relatively close on the two. 246 00:15:34.950 --> 00:15:37.280 刊物的刊文的特点 Features of publication 247 00:15:37.280 --> 00:15:42.220 他们刊发表论文的主题可能比较接近 The subject of their paper may be closer. 248 00:15:43.900 --> 00:15:45.320 这是space This is space 249 00:15:45.320 --> 00:15:47.260 它还有其他很多功能 It has many other functions. 250 00:15:47.340 --> 00:15:50.070 大家有兴趣可以去玩一玩 If you are interested, you can go and play. 251 00:15:50.170 --> 00:15:51.750 还是比较好玩的 It's more fun. 252 00:15:53.110 --> 00:16:01.610 然后还有一个就是OOCD这个工具更新可能用的人很少 Then there is the CD, a tool that can be updated with very few people. 253 00:16:01.850 --> 00:16:04.320 在我们这个领域用的也比较少 We also use less in this field. 254 00:16:04.320 --> 00:16:07.050 但他也有他自己的一些特点 But he also has some of his own characteristics. 255 00:16:09.070 --> 00:16:10.800 他也是做知识图谱的 He also does the knowledge map. 256 00:16:10.800 --> 00:16:13.300 然后是荷兰的两个教授 Then there are two professors in the Netherlands. 257 00:16:13.300 --> 00:16:15.530 就是雷登大学的两个教授 Two professors at the university of Leiden 258 00:16:17.460 --> 00:16:19.160 开发的一个免费软件 A free software developed. 259 00:16:19.520 --> 00:16:21.800 国外应用的比较多 There are many applications abroad. 260 00:16:21.800 --> 00:16:23.870 然后国内还是比较少 Then there is still less at home. 261 00:16:23.870 --> 00:16:27.620 它主要是做这些的研究 It's mainly doing these research. 262 00:16:28.260 --> 00:16:30.250 关键词贡献聚类等等的 Keyword contribution clustering, etc. 263 00:16:30.520 --> 00:16:31.360 这些可能 These possibilities 264 00:16:31.930 --> 00:16:36.160 专业术语大家不一定能够马上能够理解 The technical terms are not necessarily immediately understandable. 265 00:16:36.160 --> 00:16:40.280 但是你们看到就是可以对技术热点机构合作 But what you can see is that you can cooperate with the technology hot spots. 266 00:16:40.510 --> 00:16:44.690 作者合作以及技术积累起到比较好的分析作用 The author's cooperation and technology accumulation play a better analytical role. 267 00:16:45.510 --> 00:16:47.190 这些图都比较炫 These pictures are all pretty cool. 268 00:16:47.190 --> 00:16:52.210 然后但是必须要专业人士去解释 Then, but it has to be explained by professionals. 269 00:16:52.760 --> 00:16:53.820 如果我是外行 If I was a layman 270 00:16:53.820 --> 00:16:55.840 我是解释不了这些东西的 I can't explain these things. 271 00:16:56.570 --> 00:17:00.580 那么这个是它的网址 So this is its web site 272 00:17:00.850 --> 00:17:04.630 这个是他的主页 This is his home page. 273 00:17:04.630 --> 00:17:05.450 就后面 Just behind 274 00:17:08.360 --> 00:17:13.260 然后它的特点还是为什么会有人去用 Then it's characteristic or why someone will use it. 275 00:17:13.350 --> 00:17:14.890 或者说比较喜欢用 Or rather like to use. 276 00:17:14.890 --> 00:17:16.520 我们是做过一次 We did it once 277 00:17:16.520 --> 00:17:18.230 我后面的一个例子也可以 The next example I'll give you is fine. 278 00:17:18.230 --> 00:17:19.140 大家看一下 Let's have a look 279 00:17:19.210 --> 00:17:22.410 我们在一个项目里头用过一次 We used it once in a project 280 00:17:23.460 --> 00:17:24.870 首先它是免费使用 First, it is free to use. 281 00:17:24.870 --> 00:17:27.560 还有就是离线在线都可以使用 There is also offline online can be used. 282 00:17:27.690 --> 00:17:30.380 另外使用起来也比较方便 In addition, it is more convenient to use. 283 00:17:30.380 --> 00:17:32.370 支持多种数据格式 Support multiple data formats 284 00:17:32.370 --> 00:17:35.850 我要说的是沃斯的格式它也是支持的 I'm talking about the Voss format, which is also supported. 285 00:17:35.850 --> 00:17:39.140 就是webofscience的格式它是可以支持的 Is the format of the web science, it can be supported. 286 00:17:39.250 --> 00:17:41.590 另外它可以对专利文献进行分析 In addition, it can analyze the patent literature. 287 00:17:41.590 --> 00:17:44.590 因为我本人做很多专利分析 Because I do a lot of patent analysis myself 288 00:17:44.590 --> 00:17:49.040 所以就比较比较对我们的胃口 So it's better for us. 289 00:17:49.440 --> 00:17:54.190 然后可以做很多的OPPO和贡献分析 Then you can do a lot of oppo and contribution analysis. 290 00:17:54.460 --> 00:17:58.280 另外它参数设置也比较灵活 In addition, it is more flexible parameter settings. 291 00:17:58.880 --> 00:18:02.660 具有数据清洗的这样的功能等等 Have the function of data cleaning, etc. 292 00:18:02.760 --> 00:18:08.360 那么可视化的效果它也是可以有很多种的效果呈现 Then the visual effect, it can also have many kinds of effects. 293 00:18:09.750 --> 00:18:15.860 我们这里做了一个某药品的一个发展的研究 We have done a research on the development of a drug here. 294 00:18:15.860 --> 00:18:18.760 这个是我们的一个项目 This is one of our projects. 295 00:18:21.340 --> 00:18:26.840 一个国外的额外的期刊编辑约我们约稿 An additional foreign journal editor about our draft. 296 00:18:26.840 --> 00:18:31.500 做中国的抗癌药的专利分析 Patent analysis of making Chinese anti-cancer drugs 297 00:18:32.730 --> 00:18:43.710 其实他是想要给国际上的这个药物研究者就是展示中国的抗癌药的 In fact, he wants to show the international drug researchers that China's anti-cancer drugs. 298 00:18:43.710 --> 00:18:46.120 一些专利的布局情况 Layout of some patents 299 00:18:46.390 --> 00:18:56.090 那么这是我们通过过沃斯比尔来做出来的一些一些图 So here are some of the pictures we made through bill vosbier 300 00:18:56.350 --> 00:18:59.150 就主要的研发方向 On the main direction of research and development 301 00:18:59.150 --> 00:19:04.780 竞争态势和专利布局等等研究领域 Competition situation, patent layout, etc. 302 00:19:06.740 --> 00:19:07.390 药品的 Pharmaceutical 303 00:19:07.910 --> 00:19:09.810 可能在座的没有学医的 Maybe none of you are studying medicine. 304 00:19:12.340 --> 00:19:15.760 不同的颜色其实代表了不同的领域 Different colors actually represent different areas. 305 00:19:17.010 --> 00:19:18.080 这样四个领域 These four areas 306 00:19:18.080 --> 00:19:20.320 就说颜色一样的 Just say the same color 307 00:19:20.320 --> 00:19:23.630 说明它们是相近的这样的领域 It shows that they are similar to such fields. 308 00:19:25.760 --> 00:19:27.890 这是机构和机构和 This is institutions and institutions and 309 00:19:27.890 --> 00:19:29.720 因为这都是中国的 Because it's all Chinese. 310 00:19:30.720 --> 00:19:32.620 大家看到中山大学 You can see sun Yat-sen university 311 00:19:32.880 --> 00:19:36.190 上海交大药科大学 University of pharmacy, Shanghai Jiaotong university 312 00:19:36.610 --> 00:19:37.990 北京大学是吧 Peking university, right? 313 00:19:38.970 --> 00:19:39.910 中科院的话 The words of the Chinese Academy of sciences 314 00:19:39.910 --> 00:19:43.290 它下面有很多药物的研究所 There's a lot of drug research under it 315 00:19:43.510 --> 00:19:46.670 这些机构的合作都是比较多的 The cooperation of these organizations is more extensive. 316 00:19:47.310 --> 00:19:53.600 然后中国这个大学 Then the university of China 317 00:19:54.970 --> 00:19:57.820 中国药科大学 China pharmaceutical university 318 00:19:58.000 --> 00:19:58.740 在哪里 Where is it 319 00:19:59.510 --> 00:20:00.350 在南京是吧 In Nanjing, right? 320 00:20:00.570 --> 00:20:08.130 对中国药科大学相对来说就比较比较有个性 For the Chinese pharmaceutical university is relatively more personality. 321 00:20:08.250 --> 00:20:09.980 比较独立 Relatively independent 322 00:20:10.080 --> 00:20:12.500 可以看到这个机构的情况 We can see the situation of this organization. 323 00:20:14.090 --> 00:20:17.010 还有很多其他的我可能没有录下来 There are many others that I may not have recorded. 324 00:20:17.010 --> 00:20:20.890 就是我们这个项目做的时候还是有很多其他的一些分析 There are a lot of other analyses when we do this project. 325 00:20:21.050 --> 00:20:26.470 那么如果有兴趣的话可以去玩一玩 Then you can go and play if you are interested. 326 00:20:26.470 --> 00:20:29.350 就是你们找一个自己的领域 Is that you find a field of your own? 327 00:20:30.130 --> 00:20:33.320 当然导数据清洗数据这些还是有点麻烦 Of course, wizard data cleaning data, these are still a bit of trouble. 328 00:20:35.460 --> 00:20:39.350 可以看到一些好玩的这种图 You can see some interesting pictures of this kind. 329 00:20:39.700 --> 00:20:42.630 那么因为你对那个领域比较了解 So because you know more about that field 330 00:20:42.630 --> 00:20:48.050 所以你可以做一些这个解读 So you can do some of this reading 331 00:20:48.050 --> 00:20:53.390 可能会发现一些你可能意想不到的这种结果 You may find some of these results that you might not expect. 332 00:20:53.390 --> 00:20:54.930 还是很有意思的 It's still very interesting. 333 00:20:55.980 --> 00:21:01.600 我今天要给大家讲的就这么一些 That's all I have to tell you today. 334 00:21:02.010 --> 00:21:03.520 非常感谢大家 Thank you very much. 335 00:21:03.520 --> 00:21:07.810 然后时间的话还有时间吗 Then is there still time if there is time? 336 00:21:07.910 --> 00:21:08.810 长时间可以 Long enough 337 00:21:09.500 --> 00:21:10.440 对 Right 338 00:21:10.960 --> 00:21:12.000 如果有问题 If there is a problem 339 00:21:12.000 --> 00:21:15.600 我们好像还有三三分钟是8到10分是吧 We seem to have three minutes, eight to ten, right? 340 00:21:16.200 --> 00:21:17.260 如果有问题的话 If there is a problem 341 00:21:17.260 --> 00:21:20.160 也欢迎大家跟我一起来交流 Also welcome everyone to communicate with me. 342 00:21:20.350 --> 00:21:24.340 或者是说哪些问题觉得讲的不够全面 Or what problems do you think are not comprehensive enough? 343 00:21:24.570 --> 00:21:27.160 或者是或者甚至是不对的地方 Or maybe or even wrong 344 00:21:27.530 --> 00:21:29.090 也希望大家来批评 I also hope that we can criticize it.