Tag Archives: Migration

What’s in a scientist name? Applying onomastics in scientometrics: the case of Cancer Research

The IREG Observatory on Academic Ranking and Excellence is an international institutional non-profit association of ranking organizations, universities and other bodies interested in university rankings and academic excellence.

Our friend Tania Vichnevskaia of the French National Institute for Health (INSERM) presented the following paper ‘Applying onomastics to scientometrics’ on Monday at IREG International symposium organised by University of Maribor and Shanghai Jiao Tong University.

Download PDF 20150119_IREG2015_INSERM_NamSor_vF.pdf

On this same topic:

About NamSor

NamSor™ Applied Onomastics is a European vendor of Name Recognition Software (NamSor sorts names). NamSor mission is to help understand international flows of money, ideas and people.

NamSor launched FDI Magnet,  a consulting offering to help Investment Promotion Agencies and High-Tech Clusters leverage a Diaspora to connect with business and scientific communities abroad.

Leave a comment

Filed under FDI Magnet, General

NamSor presented during Symposium on Academic Excellence

Our friend Tania Vichnevskaia of the French National Institute for Health (INSERM) presented the paper ‘Applying onomastics to scientometrics‘ yesterday at IREG International symposium organised by University of Maribor and Shanghai Jiao Tong University.

NamSor as a private start-up company has been solicited in 2014 by a European country to help measure the ‘brain drain’ affecting its competitiveness in the BioTech sector and to produce a global map of its scientific Diaspora (who are they, where are they and what are they doing). The objective was to build up the country’s scientific international cooperation and to engage its Diaspora.

Serendipity led analysts to discover interesting patterns in the way scientists names affect co-authorship and citation – not just for this particular country, but globally.

Last year, during ICOS2014 conference at Glasgow University, we presented how data mining millions of scientific articles in PubMed/PMC LifeSciences database uncovered amazing patterns in the way scientists names correlate with whom they publish, and who they cite in their papers.

We were interested to mine the large commercial bibliographic databases (Thomson WoS, Scopus) because they offer better data quality on citations and useful additional information, compared to PubMed:

– firstly, they have the full name in addition to the short name cited with just initials; this significantly reduces the error rate of onomastic classification

– secondly, they link scientists to research institutions (affiliations) and geographies (country of affiliation) ; this allows additional analysis on the topic of Diasporas and brain drain, comparing -for example- the research output of Chinese / Chinese American scientists in the US with that of scientists of Mainland China;

– thirdly, those databases have a larger coverage in terms of scientific disciplines, allowing comparison between different fields of research.

So collaboration started between NamSor and bibliometric experts at INSERM –the French National Institute for Health- to evaluate and visualize the effects of migration, Diaspora engagement and possibly cultural biases in Science.

This is Tania’s presentation at the conference:

What does the ‘onomastic millefeuille‘ of the global Cancer Research community look like?

201501_ThomsonWoS_CancerResearchOn this same topic:

The agenda of the Symposium is presented below

2nd Maribor Academicus Event

Academic Excellence: BETWEEN HOLY GRAIL AND MEASURABLE OBJECTIVES

International symposium  organised by University of Maribor and Shanghai Jiao Tong University

within the IREG Project on Academic Excellence

19-20 January 2015, Maribor, Slovenia

Higher education can importantly benefit from the rankings and league tables when used in a context with clear perspective of what ranking actually reflects (Prof. Jan Sadlak, President of IREG)
Active participants at the conference will be:

  •            Prof. Jan Sadlak, President of IREG,
  •            Prof. Gero Federkeil, CHE (Coordinator of Multi-Ranking),
  •            Prof. Nian Cai Liu,  Jiao Tong University in Shanghai (Author of the Shanghai ranking list),
  •            Prof. Seeram Ramakrishna,  National University of Singapore,
  •            Prof. Santo Fortunato,  Aalto University,
  •            Prof. Karin Stana Kleinschek, University of Maribor,
  •            Prof. Henryk Ratajczak, member of Czech Academy of Sciences,
  •            Prof. Edvard Kobal, Slovenian Science Foundation,
  •            Roberta Sinatra, PhD, Northeastern University,
  •            Tania Vichnevskaia, French National Institute for health (INSERM),
  •            Prof. Andrée Sursock, Senior Adviser at EUA,
  •            Prof. Øivind Andersen, University of Oslo.

About NamSor

NamSor™ Applied Onomastics is a European vendor of Name Recognition Software (NamSor sorts names). NamSor mission is to help understand international flows of money, ideas and people.

NamSor launched FDIMagnet,  a consulting offering to help Investment Promotion Agencies and High-Tech Clusters leverage a Diaspora to connect with business and scientific communities abroad.

Leave a comment

Filed under FDI Magnet, General

Onomastics to measure cultural bias in medical research

Elian CARSENAT, NamSor Applied Onomastics

Dr. Evgeny Shokhenmayer, e-onomastics

Abstract

This project involves the analysis of about over ten million medical research articles from PubMed, over three million names of scientists, authors or mentioned in citations. We propose to evaluate the correlation between the onomastic class of the article authors and that of the citation authors. We will demonstrate that the cultural bias exists and also that it evolves in time. Between 2007 and 2008, the ratio of articles authored by Chinese scientists (or scientists with Chinese names) nearly tripled. We will evaluate how fast this surge in Chinese research material (or research material produced by scientists of Chinese origin) became cross-referenced by other authors with Chinese or non-Chinese names. We hope to find that the onomastics provide a good enough estimation of the cultural bias of a research community. The findings can improve the efficiency of a particular research community, for the benefit of Science and the whole humanity.

This paper was prepared for ICOS2014, the 25th International Congress of Onomastic Sciences, the premier conference in the field of name studies

Introduction

PubMed/PMC is a large collection of scientific publication in LifeSciences. We used the 2013 data dump for data mining, with 14 million articles and 3.3 million author names. Some of the names are duplicates due to different orthographies, inconsistent use of initials and other data quality issues.

We used NamSor software to allocate an onomastic class to each author name. NamSor software with initially designed to analyse the big data in the field of economic development[1], business and marketing. The method for anthroponomical classification can be summarized as follow: judging from the name only and the publicly available list of all ~150k Olympic athletes since 1896 (and other similar lists of names), for which national team would the person most likely run? Here, the United-States are typically considered as a melting pot of other ‘cultural origins’: Ireland, Germany, etc. and not as a onomastic class on its own.

The breakdown of author names by onomastic classes is represented below :

2014_ICOS_NamSor_paper_vF_pic1

The largest groups of unique names in PubMed are British, French, German, Italian, Indian, Spanish, Dutch, etc.

An author with a French name might have a name from Brittany, Corsica or Limousin … or he might have a Canadian French name, or a Belgium French name. Or he might be an American professor with a French ancestry.

Scientists performance is often measured according to the number of publications, and the number of times a publication is cited by other publications (bibliometric rankings).

The table below shows the number of publications and the number of citations, by onomastic classes (top 20), as well as the ratio between the two metrics:

Onoma A C Ratio (C/A)
(GB,LATIN) 557,177 1,664,415 3.0
(FR,LATIN) 272,150 743,471 2.7
(DE,LATIN) 192,778 448,103 2.3
(JP,LATIN) 172,866 361,682 2.1
(IT,LATIN) 187,564 323,771 1.7
(IE,LATIN)   86,161 422,103 4.9
(NL,LATIN) 102,982 321,787 3.1
(AT,LATIN)   78,199 339,819 4.3
(CN,LATIN)* 219,040 186,464 0.9
(IN,LATIN) 153,555 221,332 1.4
(ES,LATIN) 113,407 228,650 2.0
(PL,LATIN)   47,961 268,115 5.6
(SE,LATIN)   65,717 237,017 3.6
(FI,LATIN)   35,533 247,231 7.0
(KR,LATIN) 146,444 105,605 0.7
(TW,LATIN)*   88,822 162,132 1.8
(GR,LATIN)   51,564 196,056 3.8
(DK,LATIN)   42,403 181,199 4.3
(BE,LATIN)   44,647 162,146 3.6
(CH,LATIN)   32,295 162,495 5.0
*CN+TW    307,862       348,596 1.1

This table tell us that scientists with British names have published 557 thousand articles in PubMed and have been cited 1.6 million times in other PubMed articles: the ratio is 3.

Articles written by authors with Italian names have been relatively less cited (with a ratio of 1.7) while the articles written by authors with Irish names or Finnish names have been more cited (with ratios respectively 4.9 and 7).

We cannot conclude on the overall performance of British, Italian or Finish scientists (many of them might be American scientists), but already we can observe interesting cultural biases emerging that cannot be explained by the imprecision of onomastic classification only. They raise interesting questions:

– can linguistic mastery of the English language explain why authors with British or Irish names have more citations?

– can features of a particular culture (ex. the Irish are excellent networkers and have great pubs) explain why scientific articles are more cited?

– do scientists with Italian names tend to cite more scientists with Foreign sounding names (English, Irish, etc.)?

– do scientists with Finish names tend to cite more scientists with Finish names?

– are there additional cultural biases in the publication process itself (selection, curation, promotion of scientific publications)?

– is there a gender bias worth noting (ex. male scientists are more cited; a culture with less female scientists would get a higher ratio) ?

Altogether, scientists with Chinese names -with names from mainland China or Taiwan- have altogether produced 307 thousand articles and been cited 348 thousand times: a ratio of 1.1, in the low range. We will now focus the rest of this paper on Chinese names: publications authored by a scientist with a Chinese name, or citations of scientists with Chinese names.

Scientists with Chinese names in PubMed

Globally, the number of publications in life sciences has been growing exponentially. Many countries and institutions encourage scientists to publish and link performance to bibliometric rankings (ie. publications in reputable journals, number of citations, etc.)

2014_ICOS_NamSor_paper_vF_pic2

From this chart, we can observe,

– that the absolute number of publications authored by scientists with a Chinese name has nearly tripled between 2007 and 2008 (x2.5, from 7k to 17k);

– that the relative share of publications authored by scientists with a Chinese name (compared to other onomastic classes) is also growing steadily.

This growth in the number of publications by authors with Chinese names, in absolute and relative terms, is matched by a drop in the ratio of citation/authorship :

2014_ICOS_NamSor_paper_vF_pic3

Year A C Ratio (C/A)
2012 81326 68038  0.8
2011 52396 42371  0.8
2010 33821 49260  1.5
2009 24726 35715  1.4
2008 17258 26321  1.5
2007 6944 17234  2.5
2006 4770 11299  2.4
2005 3260 6910  2.1
2004 1830 3782  2.1
2003 1195 2211  1.9
2002 849 1436  1.7
Before 3477 3823  1.1

Next, we will look at co-authorships. We do expect co-authorships to be more frequent within a same onomastic class, because of the correlation with geography : scientists with an Italian name might live in Italy, work in the same University on a research project, publish together the result of their research. We also expect to find diversity: many publications are the result of an international cooperation ; scientists are internationally mobile; last but not least countries like the US, Switzerland attract talents from everywhere and as a result of this global ‘brain drain’ produce very international research teams.

Both aspects, affinity and diversity, are reflected in the following matrix – displaying the number of co-authorships between onomastic classes:

2014_ICOS_NamSor_paper_vF_pic4

For example, the first column of the matrix (reflected in the pie chart below) shows that scientists with British names have a strong affinity to be co-author with scientists with British names, but also that they are likely to publish (in order) with scientists with French names, German names, Irish names, Italian names etc.

2014_ICOS_NamSor_paper_vF_pic5

Scientists with Chinese names have an even stronger affinity to be co-authors with scientists with Chinese names; they are likely to publish (in order) with scientists with British names, French names, German names, Italian names, Irish names, Korean names etc.

2014_ICOS_NamSor_paper_vF_pic6

Next, we will look at citations. In a perfect world, we expect citations to made based on the merits of scientific research only. We assume some ‘invisible hand’ will self-regulate the visibility of publications among research communities -so all relevant research is known by the experts of the field. If scientific excellence is equally distributed, we expect the number of publications citing authors of a particular onomastic class to be proportional to the number of authors of that particular onomastic class.  However, the following table tells a different story.

Onomastic Class Onoma Authored % Onoma
Self Citations %
Bias Factor
(GB,LATIN) 16.6% 17.0% 1.02
(FR,LATIN) 8.1% 7.6% 0.94
(IT,LATIN) 5.6% 3.8% 0.68
(DE,LATIN) 5.8% 6.1% 1.05
(CN+TW,LATIN) 9.2% 12.1% 1.32
(ES,LATIN) 3.4% 3.8% 1.13
(JP,LATIN) 5.2% 19.3% 3.73
(IE,LATIN) 2.6% 4.4% 1.73
(NL,LATIN) 3.1% 5.6% 1.83
(AT,LATIN) 2.3% 4.2% 1.79
(SE,LATIN) 2.0% 3.5% 1.76
(IN,LATIN) 4.6% 4.1% 0.89
(PT,LATIN) 1.9% 2.3% 1.17
(GR,LATIN) 1.5% 2.8% 1.82
(KR,LATIN) 4.4% 3.0% 0.68
(BE,LATIN) 1.3% 2.6% 1.98
(DK,LATIN) 1.3% 3.4% 2.65

In this table, we observe that authors with British names represent 16.6% of publications, but 17% of their citations : a bias factor of 1.02 (almost no bias). Conversely, we observe that authors with French names represent 8.1% of publications, but only 7.6% of their citations : a bias factor of 0.94 indicating that authors with French names tend to cite authors with foreign names more.

As for authors with Chinese names, they represent 9.2% of the publications, but 12.1% of their citations : a bias factor of 1.32 indicating that they tend to cite authors with Chinese names more.

Authors with Chinese names have a positive bias in citing authors with Chinese names, however we can see other cases where the bias is even stronger: authors with Japanese names citing authors with Japanese names, authors with Danish names…

More interesting, the following table shows that -apart from authors with a Chinese name- every other onomastic class (British, French, Italian, German etc.) have a negative bias towards citing authors with a Chinese name.

Onomastic class Chinese Onoma Citation Pct% Bias Factor
(GB,LATIN) 3.9% 0.43
(FR,LATIN) 3.9% 0.42
(IT,LATIN) 3.9% 0.43
(DE,LATIN) 4.1% 0.44
(CN+TW,LATIN) 12.1% 1.32
(ES,LATIN) 4.0% 0.43
(JP,LATIN) 5.2% 0.56
(IE,LATIN) 4.0% 0.44
(NL,LATIN) 3.5% 0.38
(AT,LATIN) 4.1% 0.44
(SE,LATIN) 3.6% 0.40
(IN,LATIN) 5.9% 0.65
(PT,LATIN) 4.0% 0.43
(GR,LATIN) 3.9% 0.42
(KR,LATIN) 6.8% 0.74
(BE,LATIN) 3.8% 0.42
(DK,LATIN) 3.9% 0.42

Authors with a Chinese name tend to cite authors with a Chinese name more. Comparatively, scientists with non Chinese names (British, French, Italian, German etc.) have a bias factor of 0.46 and are 3 times less likely to cite publications authored by a scientist with a Chinese name.

We will now see of the biases factors evolve between 2002 and 2012:

2014_ICOS_NamSor_paper_vF_pic7

According to this table, the positive bias factor of authors with Chinese names in citing other authors with Chinese names remains roughly stable. On the other hand, the negative bias factor of scientists with non-Chinese names in citing authors with Chinese names is generally increasing.

Manual controls

Given the large number of names automatically classified in a taxonomy based on geographic origin (China, etc.) we could not verify manually the entire database. We verified manually two randomly selected subsets:

– firstly, a list of 1280 names recognized by the software as Chinese names;

– secondly, a list of ~10000 names classified by the software into the full taxonomy (over 100 onomastic classes, corresponding to different countries of origin)

According to the first validation method, 83% of names the software recognized as Chinese were manually verified as Chinese; 2% unknown; 15% as non-Chinese (ie. mis-classifications).

The software outputs a confidence level. 76% of the names were classified with positive confidence. For the names recognized as Chinese with a positive confidence, 94% were manually verified as Chinese; 1% unknown; 4% as non-Chinese (ie. mis-classification).

2014_ICOS_NamSor_paper_vF_pic8

In PubMed, many names do not have a full first name, only initials.

For names classified with positive confidence, we found that first names of just one or two character (ex. J or JH) accounted for 90% of mis-classifications. When the input includes a full name (as would generally be the case with other bibliometric sources such as Thomson WoS, Scopus or ORCID) the accuracy is 99%.

2014_ICOS_NamSor_paper_vF_pic9

According to the second validation method, we can calculate the usual metrics used in classification : precision and recall.

10172 names were manually classified by a manual operator independently. In this method, errors could be made by the computer and also by the manual operator.

For the calculations below, we assume the assume the manual operator made no mistakes (this is not the case, error is human). The manual operator could classify 50% of names, left the rest as ‘Not Sure’.

For Chinese, non Chinese names, the software precision was respectively 81% and 97% and the recall was 59% and 99%. For names classified by the software with positive confidence (52% of all names), the precision was 93% and the recall was 69%. Excluding the names with first name length < 2 (initials, such as J or JH) the precision was 97% and the recall was 72%.

If conversely, we assume that the computer made no mistakes, then we can compare the precision and recall of the operator with that of the computer:

Chinese Names Non Chinese Names
All Names Computer Human Computer Human
Precision 81% 59% 97% 99%
Recall 59% 42% 99% 48%
Chinese Names Non Chinese Names
Confidence>0 Computer Human Computer Human
Precision 93% 69% 96% 99%
Recall 69% 49% 99% 48%
Chinese Names Non Chinese Names
Confidence>0 && Len(firstName)>2 Computer Human Computer Human
Precision 97% 72% 96% 100%
Recall 72% 51% 100% 48%

This method of cross validation between computer and human could be improved by having several manual checks by different operators to obtain a good validation sample.

Future work

For future work, we would data mine the large commercial bibliographic databases (Thomson WoS, Scopus and possibly ORCID) because they offer better data quality and useful additional information:

– firstly, they have the full name in addition to the short name cited with just initials; this significantly reduces the error rate of onomastic classification

– secondly, they link scientists to research institutions (affiliations) and geographies (country of affiliation) ; this allows additional analysis on the topic of Diasporas and brain drain, comparing -for example- the research output of Chinese / Chinese American scientists in the US with that of scientists of Mainland China;

– thirdly, those databases have a larger coverage in terms of scientific disciplines, allowing comparison between different fields of research.

Conclusions

Significant cultural biases exist, not only in the way scientists co-author publications together, but also in the way they make citations. Scientific publications authored by scientists with Chinese names are three times less cited by the international research community that they are cited by other scientists with Chinese names. We cannot conclude on the quality of Chinese research but we can challenge the commonly accepted idea that the volume of publications and citations alone indicate that China is becoming a superpower in Science and Technology.

Given the importance of bibliometric rankings in the way countries build and monitor public policies on Science and Education or international cooperation; in the way research institutions measure and reward scientific excellence of researchers and teams,  those biases should be accounted for. Otherwise, international comparisons are not ‘scientific’, not fair and can lead to wrong decisions.

[PDF 2014_ICOS_NamSor_paper_vF.pdf] [Pitch 20140828_ICOS2014_Pitch_vF.pdf]

[1] Onomastics and Big Data Mining, ParisTech Review 2013, arXiv:1310.6311 [cs.CY]

Source Data

2 Comments

Filed under FDI Magnet, General

Is China really becoming a science and technology superpower?

[read the FULL PAPER : ‘Measuring cultural biases in medical research‘] [in French]

Every now and then we hear stories about the making of China as a scientific superpower. How it overtook France in the global University Rankings:

‘In the 2014 edition of the world 500 top research universities, China keeps progressing. China has 9 universities among the top 200 (77 in the USA, 20 in Great-Britain, 14 in Germany, 8 in France). The trend is impressive: in 2004, China had only one University in the to 200.’ Source: LesEchos.fr

How it keeps increasing its volume of scientific publications produced:

‘The increased share of scientific publications produced by China (+231% between 2002 and 2012) is another indicator of the Chinese scientific growth. According to Ghislaine Filliatreau [of the French OST], it’s not only just an augmentation in volume but also in quality.’ LesEchos.fr

In China really becoming a scientific superpower? It may be so. We should however be careful how we interpret bibliometric information (volume and quality of scientific publications). There could be huge cultural biases currently unaccounted for, impacting international bibliometric rankings.

For example, let’s look at Scimago Journal and Country Ranking, an index based on the Scopus® database (Elsevier B.V.). China is already the second country in the world according to the number of scientific publications produced between 1996 and 2003. But if we consider the number of citations excluding self-citations, then China comes after Spain. The reason is that the ratio of citations per citable document (excluding self-citations) is lower than average.

Scimago Country Rankings

Next week, at ICOS2014 (the 25th International Congress of Onomastic Sciences and premier conference in the field of name studies), we will explore some of the cultural biases at play in LifeSciences. A presentation of PubMed (MedLine/PMC) data mining using NamSor software, conducted with onomast Eugène Schochenmayer, will take place at Glasgow University on the 28th of August.

Onomastics to measure cultural bias in medical research (ABSTRACT)

This project involves the analysis of about one million medical research articles from PubMed. We propose to evaluate the correlation between the onomastic class of the article authors and that of the citation authors. We will demonstrate that the cultural bias exists and also that it evolves in time. Between 2007 and 2008, the ratio of articles authored by Chinese scientists (or scientists with Chinese names) nearly tripled. We will evaluate how fast this surge in Chinese research material (or research material produced by scientists of Chinese origin) became cross-referenced by other authors with Chinese or non-Chinese names. We hope to find that the onomastics provide a good enough estimation of the cultural bias of a research community. The findings can improve the efficiency of a particular research community, for the benefit of Science and the whole humanity.

Some of the tools we’ve used to produce this research:

  • MonetDB, the open-source column-store pioneer; due to the multiplicative aspect of some queries (ex. counting articles authored by a scientist with a Chinese name, cited by a scientist with -say- an Italian name) the volume was huge and we couldn’t do with a classic database
  • RapidMiner, a leading open-source data mining and predictive analytics software
  • Our own RapidMiner Onomastics Extension, to predict the gender and likely origin of personal names

About Evgeny Shokhenmayer

Dr. Evgeny Shokhenmayer (MoDyCo, Paris 10), editor of e-Onomastics, online blog focused on onomastics research and publications.
http://e-onomastics.blogspot.fr/

About NamSor

NamSor™ Applied Onomastics is a European vendor of Name Recognition Software (NamSor sorts names). NamSor mission is to help understand international flows of money, ideas and people. NamSor launched @FDIMagnet,  a consulting offering to help Investment Promotion Agencies and High-Tech Clusters leverage a Diaspora to connect with business and scientific communities abroad.
http://namesorts.com/onomastics/fdi-magnet/

1 Comment

Filed under FDI Magnet, General

Turkish Onomastics and Migration Patterns

Next week at Regent’s University Turkish Migration Conference (TMC2014, London), Elian Carsenat will present breakthrough data mining technology to apply onomastics (the recognition of personal names) to the discovery of new migration patterns.

20140522_TMC_Flyer

As states struggle to provide timely and accurate data to international organizations (such as the OECD, IOM, United Nations High Commissioner for Refugees UNHCR, …), these organizations can turn to the Big Data to identify and monitor new trends. What can Twitter, LinkedIn, Google, Facebook, D&B, Thomson WoS … tell us about the changing migration patterns of highly educated professionals, entrepreneurs? We’ll present how applied onomastics and the Big Data can be a game changer in migration studies, with vast implications on how countries or even regions can engage their Diaspora (to attract FDI, remittances, to build networks of expertise, …)

We look forward to see you at Regent’s University Turkish Migration Conference (TMC2014, London). Full program here.

To download the supporting presentation 20140530_TMS2014_Pitch_vFf.pdf

Further reading:

Leave a comment

Filed under EthnoViz

Announcing NamSor™ for Life Sciences

Life Sciences is a key sector for countries to create organic growth, as well as to attract Foreign Direct Investments (FDIs). It is a sector of constant innovation: public institutions and private companies need access to local and international talents to thrive and remain competitive.

Having an accurate picture of a country’s local talent is critical to make strong local industries stronger (in BioTech, MedTech, Pharma, AgroTech, Industrial BioTech, …)

Talent in Life Sciences is global: the key people who can help design new products, make scientific or technical breakthroughs are mobile: they may be in the US, Switzerland, Denmark or somewhere else. How do you access international talent? Your best ambassadors to attract world class scientists are the people who are already working in your company or your country today.

Read further:

About NamSor

NamSor™ Applied Onomastics is a European vendor of Name Recognition Software (NamSor sorts names). NamSor mission is to help understand international flows of money, ideas and people.

NamSor launched FDIMagnet,  a consulting offering to help Investment Promotion Agencies and High-Tech Clusters leverage a Diaspora to connect with business and scientific communities abroad.

Leave a comment

Filed under FDI Magnet, General

NamSor Applied Onomastics to help Lithuania become a talent magnet in BioTech

LTUWorld reblog

Recognizing names and their origin in global scientific and technology databases allows research institutes and high-tech companies identify where the ‘brain juice‘ flows internationally. What are the main talent magnets (universities, companies) in a particular field? How best can a growing biotechnology industrial cluster retain local talent, as well as attract international professors?

Professor Daumantas Matulis, Head of Department of Biothermodynamics and Drug Design at the Institute of Biotechnology (Vilnius University) said ‘NamSor did a pilot project for us and helped us better understand the depth and breadth of Lithuanian talent deployed internationally, in the field of BioTech and Medical Research.’

Kotryna Stankute, Acting Director at Global Lithuania Leaders (GLL), network of international Lithuania-affiliated professionals, said it is crucially important to each country to be aware about their talent pool based out of the country. A strategy of networking with the world-wide Lithuanian professionals’ Diaspora and their engagement can turn brain drain to brain exchange.

Elian Carsenat, founder of NamSor, said ‘We data mined millions of research articles from PubMed Central® (PMC) to recognize Lithuanian names (or names related to Lithuania). By a happy accident, we also discovered that there exist cultural biases in medical research : a scientist of a given culture or origin is more likely to be interested in a particular topic; some scientific communities are more or less likely to be cited by another community. Such cultural biases could even impact the overall bibliometric ranking of a country’s scientific research. We turned this serendipity into a paper, which will share at the International Congress of Onomastic Sciences (ICOS 2014, Glasgow, August).’

In September 2014, Lithuania will host Life Sciences Baltics 2014, the only international forum in the Baltic region for world-class biotechnology, pharma and medical devices experts worldwide.

About the Institute of Biotechnology
Founded in 1975 as the all Union Research Institute of Applied Enzimology, currently,  the Institute of Biotechnology is mainly involved in research and training in the fields of biotechnology and molecular biology, including research and development of recombinant biomedical proteins, genetic and molecular studies of restriction modification phenomenon, developing of viruses diagnostics, epigenetic study of small RNA, drug design and synthesis, bioinformatics.
http://www.ibt.lt/

About NamSor
NamSor™ Applied Onomastics[i] is a European vendor of Name Recognition Software (NamSor sorts names). NamSor mission is to help understand international flows of money, ideas and people.

NamSor launched FDIMagnet,  a consulting offering to help Investment Promotion Agencies and High-Tech Clusters leverage a Diaspora to connect with business and scientific communities abroad.


[i] Onomastics (or onomatology) is the science of proper names. NamSor and NomTri are registered trademarks.

PDF document 20140101_NamSor_BioTech_offering_vFinaf.pdf

Further reading : [Leveraging a large Diaspora for Economic Growth | Investment Promotion Agency innovates to attract Foreign Direct Investments | Onomastics for Business]

1 Comment

Filed under FDI Magnet, General

Leveraging a large Diaspora for Economic growth

Any country with a strong outflows of bright young people faces the following dilemma: is it a threat or an opportunity?

On one hand, young graduates seek abroad the experience they think they couldn’t have at home. This is an opportunity loss for the country that raised them, trained them and hoped for a future return on that investment. It can unbalance the country demographics.

On the other hand, having a large Diaspora offers many benefits: money flows to families at home, cultural and scientific transfers (including bringing home a more entrepreneurial culture), inbound Foreign Direct Investments (FDIs), access to a large global network of well-connected people…

Lithuania economic growth: role of Diaspora / brain circulation

We had the opportunity to participate to the last panel at the World Lithuanian Economic Forum 2013 (WLEF / PLEF) and to discuss the following topic ‘Country economic growth: role of Diaspora / brain circulation’. We used onomastics to share several insights into the Lithuanian Diaspora with the other panellists and the participants: how Global Lithuanian Leaders can be leveraged in the fields of Innovation, Business and Communication (Softpower).

You will find our presentation below for download (in PDF):

20130603 Presentation To WLEF vF.pdf

We are very grateful to Global Lithuanian Leaders (GLL) for inviting us at ForumOne and WLEF.

About NamSor™ Applied Onomastics

We design name recognition software to help countries and international companies better understand global flows of money, ideas and people. We also offer consulting services to help Investment Promotion Agencies and countries reconnect with their business communities abroad.

Leave a comment

Filed under FDI Magnet, General

About travelling IDEAS and the Global Innovation GAP

[Jan 2014, on this topic : NamSor Applied Onomastics to help Lithuania become a talent magnet in BioTech]

Recently, the French-based INSEAD and the World Intellectual Property Organization (WIPO) published The Global Innovation Index 2012 (GII), in collaboration with Alcatel-Lucent, Booz&Co, the Confederation of Indian Industry (CII) and the European Commission’s Joint Research Centre.

Why is innovation is such a strategic issue? For different reasons depending on the country’s economic situation:

– traditional industrial countries would like to maintain their lead : for example Switzerland leads the pack, Germany keeps it’s pace, France is slowly loosing it;

– some countries depend too much on a specific sector and would like to diversify their economy : for example the United-Kingdom relies too much on Financial Services, the Russian Federation relies too much on Oil, Gas and other commodities;

– the emerging countries that have big service and manufacturing industries would like to join the club of advanced countries, through innovation : for example Brazil, China and India.

It’s not just a matter for one country to reach sustainable economic growth : the world’s economic balance is at stake. Should China for example, with its immense trade surpluses and its population, reach the status of a innovative country – the economic balance as well as the strategic balance with its neighbours will have definitively shifted.

The Global Innovation Index 2012 report is an excellent document, very useful to guide governmental policies on innovation. However, we believe something is missing : innovation, like financial resources, moves around. Students, researchers, inventors, entrepreneurs travel. Ideas travel with them and sometimes without them. This is not accounted in the Global Innovation Index report.

On page 6 (and excerpt below), we can see the overall structure of the Global Innovation Index : it distinguishes Innovation Input (for example, Education and Investment) and Innovation Output (for example, Knowledge creation and Online creativity). But what if a country has a good education system, but not enough Investment or a poor Business environment? You guessed right : students, researchers, inventors, entrepreneurs will go somewhere else and their ideas will grow some other country’s Innovation Output. This natural movement can be reinforced by an active brain drain by a country with the means to do it.

We believe Onomastics can help account for those movements of ideas across borders. Below, a (very draft) account of where the Polish brain juice went over the past 20 years.

Polish brain juice top recipients (1993-2013)

Over the past 20 years, Polish ideas went to grow the following countries’ innovation output : the United-States, Germany, Poland, Canada, Great-Britain, Israel, Switzerland, Netherland, France … and some other countries.

Call for collaboration : we would be delighted to work with a Consulting firm, a Think tank or a University to help and prepare a comprehensive report on the global gaps between Innovation input and Innovation output, using onomastics analysis. We believe it will help countries tune their innovation policy and reduce the innovation IO gap : with great business value and strategic benefits in the long term.

Follow us on Twitter

©2013 NameSorts.com – NamSor™ NomTri™

— EXCERPT – slide P6 – Structure of the Global Innovation Index —

Innovation Input

Institutions (Political environment, Regulatory environment, Business environment)
Human capital & research (Education, Tertiary education, Research & development)
Infrastructure (ICT, General infrastructure,Ecological sustainability)
Market sophistication (Credit Investment, Trade & competition)
Business sophistication (Knowledge workers, Innovation linkages, Knowledge absorption)

Innovation Output

Knowledge & technology outputs (Knowledge creationn Knowledge impact, Knowledge diffusion)
Creative outputs (Creative intangibles, Creative goods & services, Online creativity)

2 Comments

Filed under EthnoViz, FDI Magnet