Click here to load reader
Upload
xuefeng-fang
View
215
Download
0
Embed Size (px)
Citation preview
RESEARCH ARTICLE
Landscape of the SOX2 protein–protein interactome
Xuefeng Fang1,2,3,4, Jae-Geun Yoon1, Lisha Li2, Yihsuan S. Tsai3, Shu Zheng4, Leroy Hood5,David R. Goodlett3, Gregory Foltz1 and Biaoyang Lin1
1 Swedish Neuroscience Institute, Swedish Medical Center, Seattle, WA, USA2 Zhejiang-California International NanoSystems Institute, Zhejiang University, Hangzhou, Zhejiang, P. R. China3 Department of Medicinal Chemistry, University of Washington, Seattle, WA, USA4 Cancer Institute (Key Laboratory of Cancer Prevention and Intervention, China National Ministry of Education),
The Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, P. R. China5 The Institute for Systems Biology, Seattle, WA, USA
Received: July 7, 2010
Revised: November 19, 2010
Accepted: December 5, 2010
SOX2 is a key gene implicated in maintaining the stemness of embryonic and adult stem
cells that appears to re-activate in several human cancers including glioblastoma multiforme.
Using immunoprecipitation (IP)/MS/MS, we identified 144 proteins that are putative SOX2
interacting proteins. Of note, SOX2 was found to interact with several heterogeneous nuclear
ribonucleoprotein family proteins, including HNRNPA2B1, HNRNPA3, HNRNPC,
HNRNPK, HNRNPL, HNRNPM, HNRNPR, HNRNPU, as well as other ribonucleoproteins,
DNA repair proteins and helicases. Gene ontology (GO) analysis revealed that the SOX2
interactome was enriched for GO terms GO:0030529 ribonucleoprotein complex and
GO:0004386 helicase activity. These findings indicate that SOX2 associates with the hetero-
geneous nuclear ribonucleoprotein complex, suggesting a possible role for SOX2 in post-
transcriptional regulation in addition to its function as a transcription factor.
Keywords:
Biomedicine / Glioblastoma multiforme / Heterogeneous nuclear ribonucleoprotein /
Mass spectrometry / SOX2
1 Introduction
SOX (SRY-like HMG box) gene family represents a family of
transcriptional factors characterized by the presence of a
homologous sequence called the HMG (high mobility group)
box in their genes. SOX2, one of the key members of the SOX
family gene, is highly expressed in embryonic stem cells
(ESCs) [1]. Recently, Takahashi et al. showed that SOX2 is a
key transcription factor, in conjunction with KLK4, Oct4 and c-
Myc, whose overexpression can induce pluripotency in both
mouse and human somatic cells [2, 3]. SOX2 is one of the four
factors (OCT4, SOX2, NANOG, and LIN28) that Yu et al. used
to reprogram human somatic cells to pluripotent stem cells
that exhibit the essential characteristics of ESCs [4]. SOX2 is
one of the two factors (SOX2 and OCT4) that were sufficient
to generate induced pluripotent stem cells from human cord
blood cells [5]. These suggest that SOX2 is the key gene in
conferring stemness of cells. The stemness program can also
play an important role in cancer because self-renewal is a
hallmark of for both stem cells and cancer cells.
SOX2 are overexpressed in malignant glioma while
displaying minimal expression in normal tissues [6, 7].
We previously completed massively parallel signatureAbbreviations: ESC, embryonic stem cell; GBM, glioblastoma
multiforme; GO, gene ontology; HMG, high mobility group;
HNRNPU, heterogeneous nuclear ribonucleoprotein U; IP,
immunoprecipitation; mRNP, messenger ribonucleoprotein;
RBM14, RNA-binding motif protein 14; SOX2, SRY (sex-deter-
mining region Y)-box 2
�Additional corresponding author: Dr. Gregory Foltz
E-mail: [email protected]
Colour Online: See the article online to view Figs. 2 and 3 in colour
Correspondence: Dr. Biaoyang Lin, Swedish Neuroscience
Institute, Swedish Medical Center, 550 17th Ave., James Tower,
Suite 570, Seattle, WA 98122, USA
E-mail: [email protected]
Fax: 11-206-320-3166
& 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim www.proteomics-journal.com
Proteomics 2011, 11, 921–934 921DOI 10.1002/pmic.201000419
sequencing (MPSS) and identified SOX2 as significantly
overexpressed in glioblastoma multiforme (GBM) tissues
compared to normal brain tissues [8]. We identified two
MPSS tags that correspond to different polyadenylated
isoform, and both are up-regulated (unpublished result, data
not shown). More recently, Gangemi et al. showed that
silencing of the SOX2 in freshly derived glioblastoma tumor
initiating cells (TICs) stopped proliferation and the resulting
cells lost tumorigenicity in immunodeficient mice [9]. More
recently, Ikushima et al. showed that inhibition of TGF-bsignaling drastically deprived tumorigenicity of glioma-
initiating cells (GICs) by promoting their differentiation,
and that these effects were attenuated in GICs transduced
with SOX2 or SOX4 [10]. Their data again demonstrated that
SOX2 is a key gene in maintaining the stemness of glioma
stem cells. Therefore, SOX2 may be a gene that is predo-
minantly expressed in embryonic and adult stems cells
including neural progenitor cells and re-activates in cancers
including brain tumors. It is also a key gene involved in
tumorigenesis of gliomas and in the maintenance of glioma
stem cells.
Despite its importance, what other proteins interact
with SOX2 in glioma cells to achieve its functions
was not known. We therefore analyzed the SOX2 inter-
actome by immunoprecipitation (IP) coupled with MS
analysis.
2 Materials and methods
2.1 Immunoprecipitation (IP)
The LN229 cells were grown in DMEM with 10% FBS.
When the cells’ growth reached 80% confluence 4 days after
sub-culturing, the cells were washed three times with ice-
cold PBS. The cells were lysated in ice-cold lysis buffer
(20 mM Tris-HCl, pH 7.4, 150 mM NaCl, 10% glycerol, 1%
NP-40, and complete protease inhibitor from Roche Applied
Sciences). We used the Immunoprecipitation Kit – Dyna-
beadss Protein A for the IP. Fifty microliters of protein-A
Dynabeads (Invitrogen) and 5mg SOX2 antibody (ab59776,
Abcam) or heterogeneous nuclear ribonucleoprotein U
(HNRNPU) antibody [3G6] (ab10297, Abcam) were,
respectively, incubated to generate Protein-A–Dynabea-
d–antibody complex. In brief, the Protein-A–Dynabead and
antibody were incubated in pH 5.0 citrate phosphate buffer
at room temperature for 60 min with tilting/rotation.
Protein-A–Dynabead–antibody complex was washed twice in
cell lysis buffer and then resuspended in 100mL cell lysis
buffer. The supernatants of 1 mg cell lysate were incubated
with the Protein A–Dynabeads–Ab complex at 41C over-
night. The beads were washed three times with PBS
containing 0.05% Tween-20 at 41C. After washing, the
protein complex was eluted with 0.1 M citrate (pH 2.3) and
analyzed by Western blot with the SOX2 or the HNRNPU
antibody.
2.2 Mass spectrometry and data analysis
In-solution and in-gel digestion was carried out as described
previously [11]. Detailed MS analysis and its downstream
data processing are shown in Supporting Information 1.
High-Throughput GoMiner [12] was used to find statis-
tically overrepresented gene ontology (GO) terms using all
evidence levels and categories. One-sided Fisher’s exact
p-value corrected for multiple comparisons was used to
calculate the FDR (false discovery rate).
2.3 Western blot analysis
Primary antibodies used were: rabbit polyclonal Ku70 anti-
body (ab10878, Abcam), mouse monoclonal Ku80 antibody
[S10B1] (ab2173, Abcam), rabbit polyclonal TLS/FUS
(fusion gene in myxoid liposarcoma 2) antibody (ab70381,
Abcam), and mouse monoclonal HNRNPU antibody [3G6]
(ab10297, Abcam). Standard immunoblot protocol was used.
2.4 Taqman assays
The SOX2 taqman assay (Hs01053049_s1) and the
normalization control Human GUSB (b-glucuronidase)
(Life Technologies) were used. Standard protocol according
to the manufacturer’s instruction was used.
3 Results
3.1 IP-MS analysis of the SOX2 interactome
We screened by real-time PCR the SOX2 expression in eight
commonly used glioblastoma cell lines T98, U118, U87,
H683, M059J, M059K, LN229, and LN18. We found that
LN229 expressed the highest amount of SOX2 (Fig. 1). We
also compared the expression levels (using the Ct values of
the real-time quantitative PCR data) of LN229 and 19 GBM
tissue specimens (Supporting Information Fig. 1) and
showed that LN229 has comparable expression levels of
SOX2 with GBM tissues. We, therefore, decided to use
LN229 for our SOX2-IP experiments.
We first tested the SOX2 antibody for its specificity
using cell lysates from glioblastoma cell line LN229. We
showed that SOX2 antibody (ab59776, Abcam) detected
two bands, one at the right predicted size of 34 kDa and
another probably representing the post-translationally
modified form of around 36 kDa (Fig. 2B). For the IgG
control, we used the normal rabbit IgG (sc-2027, Santa Cruz
Biotechnology), which was widely used as a negative IP
control [13, 14]. We then performed two replicate IP
experiments for both SOX2 and IgG controls using LN229
glioblastoma cell line obtained from the American Type
Culture Collection.
922 X. Fang et al. Proteomics 2011, 11, 921–934
& 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim www.proteomics-journal.com
A Coomassie-stained protein gel revealed that there are
14 additional bands specific to SOX2-IP, which were not
found in the IgG-IP (Fig. 2A). Three major non-specific
bands were evident in the control IgG-IP, and additional
background was observable after adjusting the contrast of
the image for Supporting Information Fig. 2 (data not
shown). Western blot analysis confirmed that the SOX2-IP
product contained the SOX2 protein (Fig. 2B) with a specific
band at 34 kDa, whereas the IgG-IP did not contain the
SOX2 product, suggesting that the SOX2-IP is specific for
SOX2.
We then performed MS analysis. We carried out in-
solution trypsin digestion of the SOX2-IP and the IgG-IP
products, as well as in-gel digestion of individual bands
identified (Fig. 2A). After MS analysis, we identified a total
of 11 peptides that corresponded to the SOX2 protein
(Supporting Information Table 1), suggesting that our
SOX2-IP product contains the SOX2 protein, which is
consistent with our Western blot analysis data shown above
(Fig. 2B). For the SOX2-IP replicate 1, we obtained 321
entries (70 single hits) with Protein prophet p40.9 (with
estimated error rate of 0.8%). For the SOX2-IP replicate 2,
we obtained 294 entries (57 single hits) with Protein prophet
p40.9 (with estimated error rate of 0.6%). To remove as
much as potential contamination and/or non-specific bind-
ing of proteins from the IgG portion of the antibody, we
removed those proteins that were identified in the IgG-IP
with Protein Prophet p40.5 (with error rate of 6.7%). In
addition, we removed proteins with single peptide hits from
the SOX2-IP list and then obtained a list for the intersection
of two SOX2-IP analyses. In the end, we identified a total of
144 proteins that are putative SOX2 interacting proteins
(Table 1) in GBM cells. Supporting Information Table 2 lists
the number of peptides, the sequence and charge state of
each peptide, and other additional information for the
proteins identified.
We also analyzed the MS data separately for each indi-
vidual band. The Supporting Information Table 3 lists the
plausible identifications of the most abundant proteins from
each band: HNRNPU could be found in band 5, HNRMPR
and HNRNPL and FUS (HNRNP-P2) in band 8; SOX2,
HNRNPC, and HNRNP2B1 in band 9.
To confirm that our approach and analysis pipeline were
able to identify SOX2-interacting proteins, we randomly
picked several proteins for which antibodies were available
and good for Western blot analysis. The proteins are
HNRNPU, Ku70 (XRCC6), and TLS (FUS, hnRNP-P2).
Figure 2C–F shows that only the SOX2-IP product
contained these proteins, whereas the IgG-IP product did
not, confirming that our SOX2-IP was able to pull down
SOX2-interacting proteins specifically.
3.2 GO analysis of the SOX2 interactome
We found that the SOX2 interactome was enriched for GO
terms: GO:0030529 ribonucleoprotein complex, GO:0030530
heterogeneous nuclear ribonucleoprotein complex, GO:0031981
nuclear lumen, GO:0005730 nucleolus (Table 2), suggesting
that SOX2 is a protein involved in forming heterogeneous
nuclear ribonucleoprotein complex at the nucleolus. To further
confirm our finding that SOX2 is an HNRNP complex protein,
we performed reciprocal IP for HNRNPU using HNRNPU
mouse monoclonal antibody (ab10297, Abcam). We confirmed
that SOX2 could be detected in the HNRNPU-IP product
(Fig. 2G).
Other interesting enriched GO terms included the GO
terms related to helicase activity, which include GO:0004386
helicase activity, GO:0003724 RNA helicase activity, and
GO:0003678 DNA helicase activity (Table 2).
4 Discussion
The SOX2 interactome contains many proteins that belong
to the heterogeneous nuclear ribonucleoprotein family
Figure 1. Real-time quantitative PCR of SOX2
expression in eight glioblastoma cell lines
T98, U118, U87, H683, M059J, M059K, LN229,
and LN18. Relative fold of expression
normalized to GUSB was shown as bar chart.
Standard deviations of three replicate assays
were indicated as error bars.
Proteomics 2011, 11, 921–934 923
& 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim www.proteomics-journal.com
including HNRNPA2B1, HNRNPA3, HNRNPK, HNRNPL,
heterogeneous nuclear ribonucleoprotein M (HNRNPM),
HNRNPR, HNRNPU, FUS (which is heterogeneous nuclear
ribonucleoprotein P2), other ribonucleoproteins, DNA
repair proteins, and helicases (Table 1). HNRNPK is a
protein that is found at higher density at transcribed gene
loci compared with silent gene loci [15]. Mikula et al. used
the IP-MS/MS approach to identify proteins that interact
with hnrnpk in rat hepatoma cells and they identified 89
unique proteins from the hnrnpk protein complex (the
union of Supporting Information Tables 1–3 of their paper)
[16]. They identified several hnRNP proteins including
hnrnpl, hnrnpg, hnrnp2b1, and several dead box containing
proteins ddx1, ddx5, ddx17. Using the homologene table
from NCBI (www.ncbi.nlm.nih.gov/homologene), we were
able to find 74 human orthologs for hnrnpk protein
complexes. We then compared these 74 human orthologs
with our SOX2 interactome list, and we identified 16
proteins (about 22%) that are in common, which included
three HNRNP proteins, three DDX proteins, and two heat
shock proteins (Table 1). The overlap could be bigger as
several proteins have similar functions but those were given
different protein names and we did not consider them to be
exact orthologs based on the homologene table from NCBI.
Previous protein–protein interaction analysis revealed that
HNRNPK interacts with multiple proteins involved in
multiple steps in the gene expression including chromatin
remodeling, transcription, RNA processing, and translation
[16].
Many other ribonucleoproteins (Table 1) were identified
to be SOX2 binding partners, including PTBP1, YBX1,
RNA-binding motif protein 14 (RBM14), STAU1, and
KHDRBS1. PTBP1 is a multi-functional RNA-binding
protein that is aberrantly overexpressed in glioma. Knock-
down of PTBP1 in glioma cells slowed cell proliferation,
inhibited cell migration, and increased adhesion of cells to
fibronectin and vitronectin [17]. YBX1 is highly expressed
in primary GBM but not in normal brain tissues [18].
Figure 2. (A) A Coomassie-stained protein gel
of IP products for SOX2 and IgG control
revealed that there are about 14 additional
distinct bands specific to SOX2-IP, which
were not found in the IgG-IP. Positions
marked as 15–18 correspond to blank areas
between distinct bands. (B) Western blot
analysis of the cell lysate (left), IgG-IP
(middle), and SOX2-IP (right) with the SOX2
antibody. (C–F) Western blot confirmation of
HNRNPU, XRCC6, TLS, and XRCC5 proteins.
IP-products using IgG antibody and SOX2
antibody were analyzed by Western blot
using the corresponding antibodies.
924 X. Fang et al. Proteomics 2011, 11, 921–934
& 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim www.proteomics-journal.com
Tab
le1.
SO
X2
inte
ract
ing
pro
tein
sid
en
tifi
ed
inre
pli
cate
SO
X2-I
Pb
yM
S
IPI
nu
mb
er
Nu
mb
er
of
un
iqu
ep
ep
tid
eh
its
Desc
rip
tio
n%
Seq
uen
ceco
vera
ge
iden
tifi
ed
fro
mM
S/M
Sd
ata
Gen
en
am
eId
en
tifi
ed
as
hn
RN
PK
inte
ract
ing
inm
ou
se
Iden
tifi
ed
inm
RN
Pco
mp
lexes
inra
t
Pu
tati
ve
dir
ect
or
ind
irect
?
Hete
rog
en
eo
us
nu
clear
rib
on
ucle
op
rote
ins
IPI0
0216592
8H
ete
rog
en
eo
us
nu
clear
rib
on
ucl
eo
pro
tein
sC
28
HN
RN
PC
Yes
IPI0
0396378
7H
ete
rog
en
eo
us
nu
clear
rib
on
ucl
eo
pro
tein
sA
2/B
121.2
HN
RN
PA
2B
1IP
I00419373
5H
ete
rog
en
eo
us
nu
clear
rib
on
ucl
eo
pro
tein
A3
19.5
HN
RN
PA
3IP
I00216049
9H
ete
rog
en
eo
us
nu
clear
rib
on
ucl
eo
pro
tein
K20.1
HN
RN
PK
Yes
IPI0
0027834
7H
ete
rog
en
eo
us
nu
clear
rib
on
ucl
eo
pro
tein
L21.5
HN
RN
PL
Yes
Yes
Ind
irect
IPI0
0171903
29
Hete
rog
en
eo
us
nu
clear
rib
on
ucl
eo
pro
tein
M29.4
HN
RN
PM
IPI0
0012074
8H
ete
rog
en
eo
us
nu
clear
rib
on
ucl
eo
pro
tein
R24.2
HN
RN
PR
Ind
irect
IPI0
0479217
13
Hete
rog
en
eo
us
nu
clear
rib
on
ucl
eo
pro
tein
U17.2
HN
RN
PU
Yes
Yes
IPI0
0221354
11
Fu
sio
ng
en
ein
myxo
idli
po
sarc
om
a2
14
TLS
/FU
S(H
nR
NP
-P2)
Rib
on
ucle
op
rote
ins
an
do
ther
RN
Ab
ind
ing
pro
tein
s
IPI0
0444262
9N
ucl
eo
lin
16.9
NC
LY
es
Ind
irect
IPI0
0041325
3N
ucl
eo
lar
pro
tein
fam
ily
A,
mem
ber
2(H
/AC
Asm
all
nu
cleo
lar
RN
Ps)
25.5
NO
LA
2
IPI0
0006379
6N
OP
5/N
OP
58
nu
cleo
lar
pro
tein
515.3
NO
P5/N
OP
58
IPI0
0220740
5N
ucl
eo
ph
osm
in(n
ucl
eo
lar
ph
osp
ho
pro
tein
B23,
nu
matr
in)
23
NP
M1
Ind
irect
IPI0
0215914
2A
DP
-rib
osy
lati
on
fact
or
319.3
AR
F3
IPI0
0215918
4A
DP
-rib
osy
lati
on
fact
or
425.6
AR
F4
IPI0
0002349
24
Nu
clear
frag
ile
Xm
en
tal
reta
rdati
on
pro
tein
inte
ract
ing
pro
tein
230.1
NU
FIP
2Y
es
IPI0
0179964
4P
oly
pyri
mid
ine
tract
bin
din
gp
rote
in1
12.2
PT
BP
1In
dir
ect
IPI0
0013174
15
RN
A-b
ind
ing
mo
tif
pro
tein
14
24.1
RB
M14
Ind
irect
IPI0
0008575
3K
Hd
om
ain
con
tain
ing
,R
NA
bin
din
g,
sig
nal
tran
sdu
ctio
nass
oci
ate
d1
6.9
KH
DR
BS
1In
dir
ect
IPI0
0000001
4S
tau
fen
,R
NA
-bin
din
gp
rote
in,h
om
olo
g1
(Dro
sop
hil
a)
8.5
ST
AU
1S
TA
U2
Ind
irect
IPI0
0031812
3Y
bo
xb
ind
ing
pro
tein
120.5
YB
X1
Yes
DN
Are
pair
pro
tein
san
dh
eli
cases
IPI0
0002564
4X
-ray
rep
air
com
ple
men
tin
gd
efe
ctiv
ere
pair
inC
hin
ese
ham
ster
cells
15.7
XR
CC
1In
dir
ect
IPI0
0220834
42
X-r
ay
rep
air
com
ple
men
tin
gd
efe
ctiv
ere
pair
inC
hin
ese
ham
ster
cells
5(d
ou
ble
-str
an
d-b
reak
rejo
inin
g;
Ku
au
toan
tig
en
,80
kDa)
39.5
XR
CC
5In
dir
ect
IPI0
0644712
35
X-r
ay
rep
air
com
ple
men
tin
gd
efe
ctiv
ere
pair
inC
hin
ese
ham
ster
cells
6(K
uau
toan
tig
en
,70
kDa)
48.8
XR
CC
6In
dir
ect
IPI0
0009841
12
Ew
ing
sarc
om
ab
reakp
oin
tre
gio
n1
15
EW
SR
1IP
I00293655
30
DE
AD
(Asp
-Glu
-Ala
-Asp
)b
ox
po
lyp
ep
tid
e1
42.6
DD
X1
Yes
IPI0
0215637
17
DE
AD
(Asp
-Glu
-Ala
-Asp
)b
ox
po
lyp
ep
tid
e3,
X-l
inke
d31.6
DD
X3X
IPI0
0007208
7D
EA
D(A
sp-G
lu-A
la-A
sp)
bo
xp
oly
pep
tid
e41
14.3
DD
X41
Proteomics 2011, 11, 921–934 925
& 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim www.proteomics-journal.com
Tab
le1.
Co
nti
nu
ed
IPI
nu
mb
er
Nu
mb
er
of
un
iqu
ep
ep
tid
eh
its
Desc
rip
tio
n%
Seq
uen
ceco
vera
ge
iden
tifi
ed
fro
mM
S/M
Sd
ata
Gen
en
am
eId
en
tifi
ed
as
hn
RN
PK
inte
ract
ing
inm
ou
se
Iden
tifi
ed
inm
RN
Pco
mp
lexes
inra
t
Pu
tati
ve
dir
ect
or
ind
irect
?
IPI0
0017617
12
DE
AD
(Asp
-Glu
-Ala
-Asp
)b
ox
po
lyp
ep
tid
e5
34.2
DD
X5
Yes
Yes
IPI0
0023785
8D
EA
D(A
sp-G
lu-A
la-A
sp)
bo
xp
oly
pep
tid
e17
23.3
DD
X17
Yes
Ind
irect
IPI0
0411733
3D
EA
H(A
sp-G
lu-A
la-H
is)
bo
xp
oly
pep
tid
e30
4.2
DH
X30
Yes
IPI0
0396435
8D
EA
H(A
sp-G
lu-A
la-H
is)
bo
xp
oly
pep
tid
e15
11.9
DH
X15
IPI0
0844578
30
DE
AH
(Asp
-Glu
-Ala
-His
)b
ox
po
lyp
ep
tid
e9
24.1
DH
X9
IPI0
0375358
7R
ep
lica
tio
nfa
cto
rC
(act
ivato
r1)
1,
145
kDa
8.1
RFC
1IP
I00029744
5S
ing
le-s
tran
ded
DN
Ab
ind
ing
pro
tein
122.3
SS
BP
1
Rib
oso
mal
pro
tein
s
IPI0
0376798
3R
ibo
som
al
pro
tein
L11
13
RP
L11
Yes
IPI0
0024933
6R
ibo
som
al
pro
tein
L12
54.5
RP
L12
IPI0
0026202
9R
ibo
som
al
pro
tein
L18a
38.1
RP
L18A
IPI0
0306332
5R
ibo
som
al
pro
tein
L24
36.4
RP
L24
IPI0
0007144
3R
ibo
som
al
pro
tein
L26
15.5
RP
L26
IPI0
0219155
7R
ibo
som
al
pro
tein
L27
36
RP
L27
IPI0
0398135
3R
ibo
som
al
pro
tein
L27a
22.3
RP
L27A
IPI0
0550021
5R
ibo
som
al
pro
tein
L3
13
RP
L3
IPI0
0219156
5R
ibo
som
al
pro
tein
L30
40.9
RP
L30
Yes
Ind
irect
IPI0
0219160
2R
ibo
som
al
pro
tein
L34
12.8
RP
L34
IPI0
0029731
3R
ibo
som
al
pro
tein
L35a
17
RP
L35A
IPI0
0003918
2R
ibo
som
al
pro
tein
L4
6.8
RP
L4
Yes
Ind
irect
IPI0
0025091
9R
ibo
som
al
pro
tein
S11
34.2
RP
S11
Ind
irect
IPI0
0013917
2R
ibo
som
al
pro
tein
S12
12.9
RP
S12
IPI0
0221089
9R
ibo
som
al
pro
tein
S13
42.4
RP
S13
Ind
irect
IPI0
0026271
4R
ibo
som
al
pro
tein
S14
29.1
RP
S14
Yes
IPI0
0221091
5R
ibo
som
al
pro
tein
S15a
29.2
RP
S15A
Ind
irect
IPI0
0221092
10
Rib
oso
mal
pro
tein
S16
48.6
RP
S16
IPI0
0221093
7R
ibo
som
al
pro
tein
S17
48.1
RP
S17
IPI0
0215780
6R
ibo
som
al
pro
tein
S19
28.3
RP
S19
Yes
IPI0
0012493
5R
ibo
som
al
pro
tein
S20
22.7
RP
S20
Yes
IPI0
0011253
14
Rib
oso
mal
pro
tein
S3
51.9
RP
S3
IPI0
0419880
7R
ibo
som
al
pro
tein
S3A
20.8
RP
S3A
Yes
Ind
irect
IPI0
0217030
15
Rib
oso
mal
pro
tein
S4,
X-l
inke
d42.2
RP
S4X
IPI0
0021840
6R
ibo
som
al
pro
tein
S6
18.1
RP
S6
IPI0
0012772
2R
ibo
som
al
pro
tein
L8
8.5
RP
L8
Ind
irect
IPI0
0013296
5R
ibo
som
al
pro
tein
S18
23.7
RP
S18
Tra
nsp
ort
er
an
dG
TP
-bin
din
gp
rote
ins
IPI0
0007188
7S
olu
teca
rrie
rfa
mil
y25
(mit
och
on
dri
alca
rrie
r;ad
en
ine
nu
cleo
tid
etr
an
slo
cato
r),
mem
ber
524.5
SLC
25A
5
IPI0
0012442
18
GT
Pase
act
ivati
ng
pro
tein
(SH
3d
om
ain
)b
ind
ing
pro
tein
139.3
G3B
P1
926 X. Fang et al. Proteomics 2011, 11, 921–934
& 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim www.proteomics-journal.com
Tab
le1.
Co
nti
nu
ed
IPI
nu
mb
er
Nu
mb
er
of
un
iqu
ep
ep
tid
eh
its
Desc
rip
tio
n%
Seq
uen
ceco
vera
ge
iden
tifi
ed
fro
mM
S/M
Sd
ata
Gen
en
am
eId
en
tifi
ed
as
hn
RN
PK
inte
ract
ing
inm
ou
se
Iden
tifi
ed
inm
RN
Pco
mp
lexes
inra
t
Pu
tati
ve
dir
ect
or
ind
irect
?
IPI0
0009057
13
GT
Pase
act
ivati
ng
pro
tein
(SH
3d
om
ain
)b
ind
ing
pro
tein
224.5
G3B
P2
IPI0
0465121
3G
uan
ine
nu
cleo
tid
eb
ind
ing
pro
tein
(Gp
rote
in),a
inh
ibit
ing
act
ivit
yp
oly
pep
tid
e2
14.6
GN
AI2
Oth
er
TFs
IPI0
0019996
5S
AFB
-lik
e,
tran
scri
pti
on
mo
du
lato
r8.3
SLT
MIP
I00009703
16
SR
Y(s
ex
dete
rmin
ing
reg
ion
Y)-
bo
x2
25.2
SO
X2
IPI0
0006079
6B
CL2-a
sso
ciate
dtr
an
scri
pti
on
fact
or
110.6
BC
LA
F1
Str
uctu
rep
rs
IPI0
0005087
25
Tro
po
mo
du
lin
3(u
biq
uit
ou
s)52.3
TM
OD
3IP
I00216230
8T
hym
op
oie
tin
22.9
TM
PO
IPI0
0554811
4A
RP
C4
act
in-r
ela
ted
pro
tein
2/3
com
ple
xsu
bu
nit
422.6
TT
LL3;A
RP
C4
IPI0
0376344
39
Myo
sin
IB39.9
MY
O1B
IPI0
0414980
39
Myo
sin
IB41
MY
O1B
IPI0
0743335
16
Myo
sin
IC36.8
MY
O1C
IPI0
0010418
15
Myo
sin
IC36.4
MY
O1C
IPI0
0329672
29
Myo
sin
IE28.9
MY
O1E
IPI0
0302592
52
Fil
am
inA
,a
(act
in-b
ind
ing
pro
tein
280)
26.6
FLN
AIP
I00005162
5cD
NA
FLJ51245,
hig
hly
sim
ilar
toact
in-r
ela
ted
pro
tein
2/3
com
ple
xsu
bu
nit
322.2
–
IPI0
0003269
16
AC
TB
L2b-
act
in-l
ike
pro
tein
229.5
AC
TB
L2
IPI0
0013808
9A
ctin
in,a
411.2
AC
TN
4IP
I00028091
16
AR
P3
act
in-r
ela
ted
pro
tein
3h
om
olo
g(y
east
)40.9
AC
TR
3In
dir
ect
IPI0
0333068
2A
ctin
-rela
ted
pro
tein
2/3
com
ple
x,
sub
un
it1A
,41
kDa
5.9
AR
PC
1A
IPI0
0005161
5A
ctin
-rela
ted
pro
tein
2/3
com
ple
x,
sub
un
it2,
34
kDa
15.7
AR
PC
2In
dir
ect
IPI0
0550234
4A
ctin
-rela
ted
pro
tein
2/3
com
ple
x,
sub
un
it5,
16
kDa
34.4
AR
PC
5In
dir
ect
IPI0
0414554
4A
ctin
-rela
ted
pro
tein
2/3
com
ple
x,
sub
un
it5-l
ike
34
AR
PC
5L
IPI0
0012011
4C
ofi
lin
1(n
on
-mu
scle
)23.5
CFL1
IPI0
0291136
6C
oll
ag
en
,ty
pe
VI,a
18
CO
L6A
1IP
I00022200
28
Co
llag
en
,ty
pe
VI,a
310.8
CO
L6A
3IP
I00013933
26
Desm
op
laki
n8.4
DS
P
Un
kn
ow
nfu
ncti
on
s
IPI0
0012750
414
kDa
pro
tein
24.2
–IP
I00013415
319
kDa
pro
tein
20.1
–IP
I00174442
13
Hyp
oth
eti
cal
pro
tein
LO
C25940
25.3
FA
M98A
Oth
ers
IPI0
0182938
5S
-Ad
en
osy
lho
mo
cyst
ein
eh
yd
rola
se-l
ike
19.6
AH
CY
L1
IPI0
0418169
5A
nn
exin
A2
16.3
AN
XA
2Y
es
IPI0
0456359
25
Ata
xin
2-l
ike
23.3
AT
XN
2L
Proteomics 2011, 11, 921–934 927
& 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim www.proteomics-journal.com
Tab
le1.
Co
nti
nu
ed
IPI
nu
mb
er
Nu
mb
er
of
un
iqu
ep
ep
tid
eh
its
Desc
rip
tio
n%
Seq
uen
ceco
vera
ge
iden
tifi
ed
fro
mM
S/M
Sd
ata
Gen
en
am
eId
en
tifi
ed
as
hn
RN
PK
inte
ract
ing
inm
ou
se
Iden
tifi
ed
inm
RN
Pco
mp
lexes
inra
t
Pu
tati
ve
dir
ect
or
ind
irect
?
IPI0
0783872
7C
ell
-cycl
e-a
sso
ciate
dp
rote
in1
12.5
CA
PR
IN1
IPI0
0005969
9C
ap
pin
gp
rote
in(a
ctin
fila
men
t)m
usc
leZ
-lin
e,a
135.5
CA
PZ
A1
IPI0
0218782
13
Cap
pin
gp
rote
in(a
ctin
fila
men
t)m
usc
leZ
-lin
e,b
33.1
CA
PZ
BIP
I00290770
3C
hap
ero
nin
con
tain
ing
TC
P1,
sub
un
it3
(gam
ma)
6.5
CC
T3
Ind
irect
IPI0
0011302
2C
D59
mo
lecu
le,
com
ple
men
tre
gu
lato
ryp
rote
in9.4
CD
59
IPI0
0289773
3C
CA
AT
/en
han
cer
bin
din
gp
rote
in(C
/EB
P),b
8.1
CE
BP
BIP
I00295857
5C
oato
mer
pro
tein
com
ple
x,
sub
un
ita
4.8
CO
PA
IPI0
0029601
5C
ort
act
in15.1
CT
TN
IPI0
0013933
26
Desm
op
laki
n8.4
DS
PIP
I00186290
6E
uka
ryo
tic
tran
slati
on
elo
ng
ati
on
fact
or
28.3
EE
F2
IPI0
0060181
14
EF-h
an
dd
om
ain
fam
ily,
mem
ber
D2
39.6
EFH
D2
IPI0
0025491
3E
uka
ryo
tic
tran
slati
on
init
iati
on
fact
or
4A
,is
ofo
rm1
8.9
EIF
4A
1IP
I00039626
7Fam
ily
wit
hse
qu
en
cesi
mil
ari
ty120A
7.3
FA
M120A
IPI0
0031820
3P
hen
yla
lan
yl-
tRN
Asy
nth
eta
se,a
sub
un
it6.7
FA
RS
AIP
I00031023
31
Fli
gh
tless
Ih
om
olo
g(D
roso
ph
ila)
24.9
FLII
IPI0
0059366
7H
2A
his
ton
efa
mily,
mem
ber
Y26.6
H2A
FY
IPI0
0382470
4H
eat
sho
ckp
rote
in90
kDaa
(cyto
soli
c),
class
Am
em
ber
119.1
HS
P90A
A1
IPI0
0414676
9H
eat
sho
ckp
rote
in90
kDaa
(cyto
soli
c),
class
Bm
em
ber
124.9
HS
P90A
B1
Ind
irect
IPI0
0007765
9H
eat
sho
ck70
kDa
pro
tein
9(m
ort
alin
)18.4
HS
PA
9Y
es
IPI0
0784154
5H
eat
sho
ck60
kDa
pro
tein
1(c
hap
ero
nin
)13.3
HS
PD
1Y
es
IPI0
0179713
5In
sulin
-lik
eg
row
thfa
cto
r2
mR
NA
bin
din
gp
rote
in2
16.7
IGF2B
P2
IPI0
0005198
10
Inte
rleu
kin
en
han
cer
bin
din
gfa
cto
r2,
45
kDa
23.8
ILF2
IPI0
0298788
8In
terl
eu
kin
en
han
cer
bin
din
gfa
cto
r3,
90
kDa
11.7
ILF3
Yes
Ind
irect
IPI0
0291579
4K
inesi
nfa
mil
ym
em
ber
23
5.6
KIF
23
Ind
irect
IPI0
0294186
5Lact
am
ase
,b
16.3
LA
CT
BIP
I00219219
3Lect
in,
gala
cto
sid
e-b
ind
ing
,so
lub
le,
1(g
ale
ctin
1)
31.9
LG
ALS
1IP
I00008918
13
LIM
do
main
an
dact
inb
ind
ing
123.4
LIM
A1
IPI0
0016373
2LO
C100131294
sim
ilar
toR
AB
13
pro
tein
15.8
LO
C100131294
IPI0
0005160
4LO
C653888
sim
ilar
top
41-A
rc10.5
LO
C653888
IPI0
0296830
12
Leu
cin
ezi
pp
er
pro
tein
113.7
LU
ZP
1IP
I00008868
11
Mic
rotu
bu
le-a
sso
ciate
dp
rote
in1B
7.3
MA
P1B
IPI0
0017297
11
Matr
in3
19
MA
TR
3IP
I00335168
14
Myo
sin
,li
gh
tch
ain
6B
,alk
ali
,sm
oo
thm
usc
lean
dn
on
-m
usc
le60.7
MY
L6B
IPI0
0300127
5N
-ace
tylt
ran
sfera
se10
5.5
NA
T10
IPI0
0009456
12
50 -
Nu
cleo
tid
ase
,ect
o(C
D73)
29.8
NT
5E
IPI0
0012726
25
Po
ly(A
)b
ind
ing
pro
tein
,cy
top
lasm
ic4
(in
du
cib
lefo
rm)
35
PA
BP
C4
Yes
IPI0
0449049
40
Po
ly(A
DP
-rib
ose
)p
oly
mera
sefa
mil
y,
mem
ber
132.1
PA
RP
1
928 X. Fang et al. Proteomics 2011, 11, 921–934
& 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim www.proteomics-journal.com
Inhibition of YBX1 in GBM cells reduced tumor cell inva-
sion and growth in monolayer as well as in soft agar, and
delayed tumor onset in mice [19]. In addition, inhibition of
YBX1 enhanced temozolomide sensitivity in a manner that
was independent of MGMT [19].
RBM14 (also named CoAA, coactivator activator) is a
nuclear receptor coactivator protein involved in transcrip-
tional coactivation and RNA splicing [20]. It also plays a role
in regulating stem/progenitor cell differentiation [21]. Kang
et al. recently showed that RBM14 (CoAA) is a potential
tumor suppressor in renal carcinoma and it represses the
proto-oncogene c-myc by recruiting HDAC3 protein and
decreasing both the acetylation of histone H3 and the
presence of RNA polymerase II on the c-myc promoter [22].
STAU1 is an RNA-binding protein that is required for the
survival and migration of primordial germ cells in zebrafish
and is required for ESC differentiation in mouse [23, 24].
KHDRBS1 (also named Sam68, Src-associated in mitosis of
68 kDa) is a known RNA-binding protein that involves in
modulating splicing of genes [25]. In prostate cancer, Sam68
expression supports proliferation and survival to cytotoxic
agents [26].
Another interesting family of proteins we identified as
SOX2 binding include nine members of the DEAD (Asp-
Glu-Ala-Asp) box containing genes (Table 1), which are
human RNA helicases participating in transcription,
splicing, and translation by modulating the structure of
RNA [27]. In addition, XRCC6 (Ku autoantigen, 70 kDa) and
XRCC5 (Ku autoantigen, 80 kDa) are single-stranded DNA-
dependent ATP-dependent helicases.
SOX2, together with Oct4, Klf4, c-Myc, Nanog, and
Lin28, are ESC markers [28] and different combination of
these genes were shown to be able to generate induced
pluripotent stem (iPS) cells [4, 29–31]. Physical interactions
among these members were shown previously. Wei et al.
showed that Klf4 could interact directly with Oct4 and Sox2
in iPS cells and mouse ESCs [32]. SOX2 and OCT4 interact
with each other in mouse ESCs [32]. However, we did not
detect OCT4 in our IP-MS analysis of SOX2. This could be
due to different cellular context of mouse ES and human
glioma cells, or due to the fact that Oct4 and Sox2 have low
affinity for each other in solution [33, 34]. Interestingly, in a
recent proteomics studies of the Sox2-IP in mouse ESCs, the
Oct4 protein was not identified either [35].
Transcriptional regulation and post-transcriptional regu-
lation may also exist among these six ESC markers. For
example, Sox2 and Oct4 bind to the promoter of Nanog [36].
Lin28 encodes a cytoplasmic mRNA-binding protein in
messenger ribonucleoprotein (mRNP) complexes [37].
Interestingly, many SOX2-binding proteins that we identi-
fied also belong to mRNP complexes. Angenstein et al.
conducted a proteomic analysis of mRNP complexes in the
rat cerebral cortex and they identified 30 proteins in the
mRNP complexes [38]. Comparing their list with our SOX2
binding protein list, we identified 10 proteins that are in
common including HNRNPC, HNRNPL, HNRNPU,Tab
le1.
Co
nti
nu
ed
IPI
nu
mb
er
Nu
mb
er
of
un
iqu
ep
ep
tid
eh
its
Desc
rip
tio
n%
Seq
uen
ceco
vera
ge
iden
tifi
ed
fro
mM
S/M
Sd
ata
Gen
en
am
eId
en
tifi
ed
as
hn
RN
PK
inte
ract
ing
inm
ou
se
Iden
tifi
ed
inm
RN
Pco
mp
lexes
inra
t
Pu
tati
ve
dir
ect
or
ind
irect
?
IPI0
0183002
11
Pro
tein
ph
osp
hata
se1,
reg
ula
tory
(in
hib
ito
r)su
bu
nit
12A
11.8
PP
P1R
12A
IPI0
0045550
5P
rote
inp
ho
sph
ata
se1,
reg
ula
tory
(in
hib
ito
r)su
bu
nit
9B
5.9
PP
P1R
9B
IPI0
0000874
8P
ero
xir
ed
oxin
144.2
PR
DX
1IP
I00292953
41
Reti
no
icaci
din
du
ced
14
38.2
RA
I14
IPI0
0297211
4S
WI/
SN
F-r
ela
ted
,m
atr
ix-a
sso
ciate
d,
act
in-d
ep
en
den
tre
gu
lato
ro
fch
rom
ati
n,
sub
fam
ily
a,
mem
ber
54.5
SM
AR
CA
5
IPI0
0178072
13
SP
EC
C1-l
ike
13.5
SP
EC
C1L
IPI0
0005154
11
Str
uct
ure
-sp
eci
fic
reco
gn
itio
np
rote
in1
15.2
SS
RP
1In
dir
ect
IPI0
0026970
10
Su
pp
ress
or
of
Ty
16
ho
mo
log
(S.
cere
vis
iae)
12.3
SU
PT
16H
IPI0
0020194
10
TA
F15
RN
Ap
oly
mera
seII,
TA
TA
bo
xb
ind
ing
pro
tein
(TB
P)-
ass
oci
ate
dfa
cto
r,68
kDa
20.5
TA
F15
Yes
IPI0
0104050
6T
hyro
idh
orm
on
ere
cep
tor
ass
oci
ate
dp
rote
in3
7.4
TH
RA
P3
Proteomics 2011, 11, 921–934 929
& 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim www.proteomics-journal.com
NUFIP2, YBX1, DDX5, DHX30, and ILF3 (Table 1). The
hypergeometric probability of the overlap (10 overlap genes
in two lists of 30 and 144 genes) with 17 512 human-rat
homologs at NCBI’s database is highly significant
(po2.690e-14). This suggests that SOX2 might also be a
member of mRNP complexes. Lin28 binds to IGF2 mRNA
and regulated its translation efficiency [38]. Although we
showed that SOX2-binding proteins or SOX2 might belong
to mRNP, whether SOX2 binds directly to any specific
species of mRNA remains to be determined.
The limitation of the IP/MS/MS approach is that it
cannot distinguish between direct and indirect interacting
partners of a protein as both can be immunoprecipitated
and therefore identified. The proteins that we identified
by SOX2-IP/MS analysis may include both direct SOX2
binding proteins and indirect SOX2-binding proteins. We
were not able to differentiate these two possibilities and
further experimentation is necessary to distinguish
them. We found that many proteins that we identified
(Table 1) show existing protein–protein interactions. For
example, hnRNP A2/B1 and TLS/FUS are interacting
partners in a mass spectrometry analysis [39]. XRCC5 and
XRCC6 form heterodimers [40]. KHDRBS1 (Sam68) was
found to be associated with heterogeneous nuclear ribonu-
cleoproteins hnRNP A1, A2/B1, G and L [41], and with
hnRNP K [42]. To systematically identifying all existing
protein–protein interactions among the SOX2-binding
proteins that identified, we searched the human HPRD
(www.hprd.org) using the Cytoscape (www.cytoscape.org)
MiMI plugin [43]. A protein–protein interacting map
showing SOX2 interactome as well as existing protein–-
protein interaction is shown in Fig. 3A. The protein–protein
interaction that was confirmed by us or in the literature is
shown in Fig. 3B.
Table 2. Interesting GO terms that are enriched in SOX2 interactome
GO category Totalgenes
Changedgenes
Enrichmentfolds
p-Value(log 10)
FDR
GO:0030529_ribonucleoprotein_complex 406 36 10.59 �26.59 0.0000GO:0030530_heterogeneous_nuclear_ribonucleoprotein_complex 10 5 59.69 �8.03 0.0000GO:0031981_nuclear_lumen 608 19 3.73 �6.17 0.0000GO:0005730_nucleolus 159 10 7.51 �6.06 0.0000GO:0003723_RNA_binding 645 55 10.05 �41.12 0.0000GO:0003735_structural_constituent_of_ribosome 159 25 18.53 �24.34 0.0000GO:0003676_nucleic_acid_binding 3070 70 2.69 �17.57 0.0000GO:0005198_structural_molecule_activity 656 33 5.93 �16.45 0.0000GO:0005515_protein_binding 6544 94 1.69 �12.27 0.0000GO:0008026_ATP-dependent_helicase_activity 89 11 14.57 �9.67 0.0000GO:0003779_actin_binding 287 17 6.98 �9.52 0.0000GO:0000166_nucleotide_binding 1889 43 2.68 �9.47 0.0000GO:0004386_helicase_activity 131 12 10.80 �8.97 0.0000GO:0003724_RNA_helicase_activity 28 7 29.47 �8.57 0.0000GO:0008092_cytoskeletal_protein_binding 403 18 5.26 �8.10 0.0000GO:0005488_binding 11 156 116 1.23 �7.73 0.0000GO:0042623_ATPase_activity__coupled 251 13 6.11 �6.69 0.0000GO:0016887_ATPase_activity 304 14 5.43 �6.54 0.0000GO:0017111_nucleoside-triphosphatase_activity 535 18 3.97 �6.24 0.0007GO:0016462_pyrophosphatase_activity 562 18 3.78 �5.94 0.0006GO:0016818_hydrolase_activity__acting_on_acid_anhydrides__in_phosphorus-
containing_anhydrides564 18 3.76 �5.91 0.0006
GO:0016817_hydrolase_activity__acting_on_acid_anhydrides 567 18 3.74 �5.88 0.0006GO:0003725_double-stranded_RNA_binding 31 5 19.01 �5.24 0.0005GO:0005524_ATP_binding 1240 26 2.47 �4.92 0.0005GO:0032553_ribonucleotide_binding 1577 30 2.24 �4.82 0.0005GO:0032555_purine_ribonucleotide_binding 1577 30 2.24 �4.82 0.0005GO:0032559_adenyl_ribonucleotide_binding 1259 26 2.43 �4.81 0.0004GO:0019843_rRNA_binding 19 4 24.82 �4.76 0.0004GO:0051082_unfolded_protein_binding 102 7 8.09 �4.60 0.0004GO:0008186_RNA-dependent_ATPase_activity 21 4 22.45 �4.58 0.0004GO:0017076_purine_nucleotide_binding 1648 30 2.15 �4.46 0.0004GO:0030554_adenyl_nucleotide_binding 1330 26 2.30 �4.39 0.0004GO:0004003_ATP-dependent_DNA_helicase_activity 19 3 18.61 �3.28 0.0048GO:0004004_ATP-dependent_RNA_helicase_activity 20 3 17.68 �3.21 0.0063GO:0003697_single-stranded_DNA_binding 47 4 10.03 �3.18 0.0066GO:0008094_DNA-dependent_ATPase_activity 47 4 10.03 �3.18 0.0066GO:0003678_DNA_helicase_activity 34 3 10.40 �2.53 0.0444
930 X. Fang et al. Proteomics 2011, 11, 921–934
& 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim www.proteomics-journal.com
Recently, Mallanna et al. conducted a proteomic analysis
of SOX2-associated proteins during early stages of mouse
ESC differentiation [35], and they identified 60 nuclear
proteins that associate with Sox2 during early ESC differ-
entiation with high confidence scores. Additionally, they
identified 194 less confident but probable binding partners
of Sox2 in mouse ESC cells. Fifty-eight of the 60 high
confident mouse Sox2-associating proteins have human
homologs. There are 11 proteins that are common between
the 58 Sox2-associated proteins in mouse ESC and
the SOX2-binding proteins in glioblastoma cells. These
proteins are listed in Table 3, which include H2A histone
Figure 3. (A) A cytoscape map of the SOX2 interactome. Nodes indicate proteins and the edges indicate protein–protein interactions.
(B) The protein–protein interaction that was confirmed by us or in the literature. Nodes indicate proteins and the edges (marked by pp)
indicate protein–protein interactions.
Proteomics 2011, 11, 921–934 931
& 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim www.proteomics-journal.com
family, member Y (H2AFY), single-stranded DNA binding
protein 1 (SSBP1), RBM14, XRCC5 (double-strand-break
rejoining; Ku autoantigen, 80 kDa), XRCC6 (Ku autoanti-
gen, 70 kDa), and HNRNPM. Additionally, the mouse
heterogeneous nuclear ribonucleoprotein U-like 2
(Hnrnpul2) was identified in the mouse Sox2-IP, whereas in
human the HNRNPU was identified. The hypergeometric
probability of the overlap (12 genes) with 19 570 human–
mouse homologs at NCBI’s database is highly significant
(po1.056e–14).
The difference is not surprising as the SOX2-associated
proteins are from different cell types and different species.
Furthermore, the experimental approaches are different.
While we used natural SOX2 expression cancer cell line and
used SOX2 antibody to isolate SOX2 binding proteins,
Mallanna et al. overexpressed the SOX2 protein as a Flag-
Sox2 fusion protein in an inducible system and used anti-
Flag M2 affinity beads to isolate SOX2-binding proteins.
Finally, it is possible that some of the proteins are non-
specific binding proteins that we failed to filter out even
when we used stringent filtering criteria. Additional
confirmation studies are necessary when one wants to focus
on studying a specific interaction between SOX2 and a
protein.
4 Concluding remarks
A critical discovery in our analysis is that SOX2 belongs to the
hnRNP or mRNP complexes, which are key components of
transcription and post-transcription regulations. This suggests
a potential new role for SOX2 as not only a transcription factor
but also a factor involved in post-transcriptional regulation.
Such examples of dual functions of transcriptional factors have
been documented previously. Cassiday and Maher surveyed
and identified many examples of transcription factors that
bind to both DNAs and RNAs [44], including the prototypic
Xenopus TFIIIA protein, the Caenorhabditis elegans TRA-1, the
Drosophila bicoid, and the mammalian P53, WT-1, STAT1, and
TLS/FUS. The mammalian p53 binds to cognate site in
promoters target genes involved in growth arrest or apoptosis
[45]. Yoshida et al. showed that p53 interacts with RNA via its
C-terminal domain, which regulate its DNA-binding activity
and its oligomerization [46]. Recently, Grinberg et al. showed
that p52 could bind to double-stranded RNA and destroy them
[47]. The Ku protein, consists of a heterodimeric complex of 70
(Ku70) and 80 kDa (Ku80) subunits, is also both a DNA- and
RNA-binding protein in yeast [48]. It binds to double-stranded
DNA to repair double-stranded DNA breaks and it also binds
to telomerase RNA to promote telomere addition [48]. We also
identified Ku70 (XRCC6) and Ku80 (XRCC5) as SOX2-binding
proteins. TLS/FUS is another protein that possesses both
DNA- and RNA-binding activities [49, 50] and we also identi-
fied it as a SOX2-binding protein.
Tung et al. recently showed that SOX2 has RNA-binding
activity and the binding needs the HMG domains using an in
vitro pull down assay in transitional cell carcinoma (TCC).
They further showed that ectopic expression of SOX2 modu-
lates alternative splicing of genes [51]. They postulated that
SOX2 is an RNA splicer. Our data support this speculation.
Further experiments were necessary to establish the functional
consequences and detailed mechanisms of these interactions,
and SOX2’s exact roles in post-transcriptional regulation.
The authors have declared no conflict of interest
5 References
[1] Avilion, A. A., Nicolis, S. K., Pevny, L. H., Perez, L. et al.,
Multipotent cell lineages in early mouse development
depend on SOX2 function. Genes Dev. 2003, 17, 126–140.
[2] Takahashi, K., Yamanaka, S., Induction of pluripotent stem
cells from mouse embryonic and adult fibroblast cultures
by defined factors. Cell 2006, 126, 663–676.
[3] Takahashi, K., Tanabe, K., Ohnuki, M., Narita, M. et al.,
Induction of pluripotent stem cells from adult human
fibroblasts by defined factors. Cell 2007, 131, 861–872.
[4] Yu, J., Vodyanik, M. A., Smuga-Otto, K., Antosiewicz-
Bourget, J. et al., Induced pluripotent stem cell lines
derived from human somatic cells. Science 2007, 318,
1917–1920.
Table 3. Common SOX2-interacting proteins between humanglioblastoma and mouse ESC cells
Genename
Description
H2AFY H2A histone family, member Y ��
HNRNPM Heterogeneous nuclear ribonucleoprotein M ��
ILF3 Interleukin enhancer binding factor 3, 90 kDa ��
PARP1 Poly (ADP-ribose) polymerase family,member 1
��
RBM14 RNA-binding motif protein 14 ��
RPS11 Ribosomal protein S11 ��
SSBP1 Single-stranded DNA-binding protein 1 ��
SSRP1 Structure-specific recognition protein 1 ��
SUPT16H Suppressor of Ty 16 homolog (S. cerevisiae) ��
XRCC5 X-ray repair complementing defective repairin Chinese hamster cells 5 (double-strand-break rejoining; Ku autoantigen, 80 kDa)
��
XRCC6 X-ray repair complementing defective repairin Chinese hamster cells 6 (Kuautoantigen, 70 kDa)
��
AHCYL1 S-adenosylhomocysteine hydrolase-like 1 �
CCT3 Chaperonin-containing TCP1, subunit 3(gamma)
�
KIF23 Kinesin family member 23 �
SMARCA5 SWI/SNF-related, matrix-associated, actin-dependent regulator of chromatin,subfamily a, member 5
�
TMPO Thymopoietin �
�� Identified in mouse with high-frequency and enrichmentvalues.
� Identified in mouse with low-frequency and enrichmentvalues.
932 X. Fang et al. Proteomics 2011, 11, 921–934
& 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim www.proteomics-journal.com
[5] Giorgetti, A., Montserrat, N., Rodriguez-Piza, I., Azqueta, C.
et al., Generation of induced pluripotent stem cells from
human cord blood cells with only two factors: Oct4 and
Sox2. Nat. Protoc. 2010, 5, 811–820.
[6] Schmitz, M., Temme, A., Senner, V., Ebner, R. et al., Iden-
tification of SOX2 as a novel glioma-associated antigen and
potential target for T cell-based immunotherapy. Br.
J. Cancer 2007, 96, 1293–1301.
[7] Phi, J. H., Park, S. H., Kim, S. K., Paek, S. H. et al., Sox2
expression in brain tumors: a reflection of the neuroglial
differentiation pathway. Am. J. Surg. Pathol. 2008, 32,
103–112.
[8] Lin, B., Madan, A., Yoon, J. G., Fang, X. et al., Massively
parallel signature sequencing and bioinformatics analysis
identifies up-regulation of TGFBI and SOX4 in human glio-
blastoma. PLoS ONE 2010, 5, e10210.
[9] Gangemi, R. M., Griffero, F., Marubbi, D., Perera, M. et al.,
SOX2 silencing in glioblastoma tumor initiating cells causes
stop of proliferation and loss of tumorigenicity. Stem Cells
2009, 27, 40–48.
[10] Ikushima, H., Todo, T., Ino, Y., Takahashi, M. et al., Auto-
crine TGF-beta signaling maintains tumorigenicity of
glioma-initiating cells through Sry-related HMG-box
factors. Cell Stem Cell 2009, 5, 504–514.
[11] Shevchenko, A., Wilm, M., Vorm, O., Mann, M., Mass
spectrometric sequencing of proteins silver-stained poly-
acrylamide gels. Anal. Chem. 1996, 68, 850–858.
[12] Zeeberg, B. R., Qin, H., Narasimhan, S., Sunshine, M. et al.,
High-throughput GoMiner, an ‘industrial-strength’ inte-
grative gene ontology tool for interpretation of multiple-
microarray experiments, with application to studies of
common variable immune deficiency (CVID). BMC Bioin-
formatics 2005, 6, 168.
[13] Dey, S., Curtis, D. J., Jane, S. M., Brandt, S. J., The TAL1/
SCL transcription factor regulates cell cycle progression
and proliferation in differentiating murine bone
marrow monocyte precursors. Mol. Cell Biol. 2010, 30,
2181–2192.
[14] Verzi, M. P., Hatzis, P., Sulahian, R., Philips, J. et al., TCF4
and CDX2, major transcription factors for intestinal func-
tion, converge on the same cis-regulatory regions. Proc.
Natl. Acad. Sci. USA 2010, 107, 15157–15162.
[15] Ostrowski, J., Kawata, Y., Schullery, D. S., Denisenko, O. N.,
Bomsztyk, K., Transient recruitment of the hnRNP K protein
to inducibly transcribed gene loci. Nucleic Acids Res. 2003,
31, 3954–3962.
[16] Mikula, M., Dzwonek, A., Karczmarski, J., Rubel, T. et al.,
Landscape of the hnRNP K protein–protein interactome.
Proteomics 2006, 6, 2395–2406.
[17] Cheung, H. C., Hai, T., Zhu, W., Baggerly, K. A. et al., Spli-
cing factors PTBP1 and PTBP2 promote proliferation
and migration of glioma cell lines. Brain 2009, 132,
2277–2288.
[18] Faury, D., Nantel, A., Dunn, S. E., Guiot, M. C. et al., Mole-
cular profiling identifies prognostic subgroups of pediatric
glioblastoma and shows increased YB-1 expression in
tumors. J. Clin. Oncol. 2007, 25, 1196–1208.
[19] Gao, Y., Fotovati, A., Lee, C., Wang, M. et al., Inhibition of
Y-box binding protein-1 slows the growth of glioblastoma
multiforme and sensitizes to temozolomide independent
O6-methylguanine-DNA methyltransferase. Mol. Cancer
Ther. 2009, 8, 3276–3284.
[20] Auboeuf, D., Dowhan, D. H., Li, X., Larkin, K. et al., CoAA, a
nuclear receptor coactivator protein at the interface of
transcriptional coactivation and RNA splicing. Mol. Cell
Biol. 2004, 24, 442–453.
[21] Brooks, Y. S., Wang, G., Yang, Z., Smith, K. K. et al., Func-
tional pre-mRNA trans-splicing of coactivator CoAA and
corepressor RBM4 during stem/progenitor cell differentia-
tion. J. Biol. Chem. 2009, 284, 18033–18046.
[22] Kang, Y. K., Schiff, R., Ko, L., Wang, T. et al., Dual roles for
coactivator activator and its counterbalancing isoform
coactivator modulator in human kidney cell tumorigenesis.
Cancer Res. 2008, 68, 7887–7896.
[23] Ramasamy, S., Wang, H., Quach, H. N., Sampath, K.,
Zebrafish Staufen1 and Staufen2 are required for the
survival and migration of primordial germ cells. Dev. Biol.
2006, 292, 393–406.
[24] Gautrey, H., McConnell, J., Lako, M., Hall, J., Hesketh, J.,
Staufen1 is expressed in preimplantation mouse embryos
and is required for embryonic stem cell differentiation.
Biochim. Biophys. Acta 2008, 1783, 1935–1942.
[25] Lukong, K. E., Richard, S., Targeting the RNA-binding
protein Sam68 as a treatment for cancer? Future Oncol.
2007, 3, 539–544.
[26] Busa, R., Paronetto, M. P., Farini, D., Pierantozzi, E. et al.,
The RNA-binding protein Sam68 contributes to prolifera-
tion and survival of human prostate cancer cells. Oncogene
2007, 26, 4372–4382.
[27] Abdelhaleem, M., RNA helicases: regulators of differentia-
tion. Clin. Biochem. 2005, 38, 499–503.
[28] Hu, G., Kim, J., Xu, Q., Leng, Y. et al., A genome-wide RNAi
screen identifies a new transcriptional module required for
self-renewal. Genes Dev. 2009, 23, 837–848.
[29] Zaehres, H., Kim, J. B., Scholer, H. R., Induced pluripotent
stem cells. Methods Enzymol. 2010, 476, 309–325.
[30] Okita, K., Ichisaka, T., Yamanaka, S., Generation of germ-
line-competent induced pluripotent stem cells. Nature 2007,
448, 313–317.
[31] Wernig, M., Meissner, A., Foreman, R., Brambrink, T. et al.,
In vitro reprogramming of fibroblasts into a pluripotent ES-
cell-like state. Nature 2007, 448, 318–324.
[32] Wei, Z., Yang, Y., Zhang, P., Andrianakos, R. et al., Klf4
interacts directly with Oct4 and Sox2 to promote repro-
gramming. Stem Cells 2009, 27, 2969–2978.
[33] Ambrosetti, D. C., Basilico, C., Dailey, L., Synergistic
activation of the fibroblast growth factor 4 enhancer
by Sox2 and Oct-3 depends on protein–protein
interactions facilitated by a specific spatial arrangement
of factor binding sites. Mol. Cell Biol. 1997, 17,
6321–6329.
[34] Wissmuller, S., Kosian, T., Wolf, M., Finzsch, M., Wegner,
M., The high-mobility-group domain of Sox proteins
Proteomics 2011, 11, 921–934 933
& 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim www.proteomics-journal.com
interacts with DNA-binding domains of many transcription
factors. Nucleic Acids Res. 2006, 34, 1735–1744.
[35] Mallanna, S. K., Ormsbee, B. D., Iacovino, M., Gilmore, J. M.
et al., Proteomic analysis of Sox2-associated proteins
during early stages of mouse embryonic stem cell differ-
entiation identifies Sox21 as a novel regulator of stem cell
fate. Stem Cells 2010, 28, 1715–1727.
[36] Rodda, D. J., Chew, J. L., Lim, L. H., Loh, Y. H. et al., Tran-
scriptional regulation of nanog by OCT4 and SOX2. J. Biol.
Chem. 2005, 280, 24731–24737.
[37] Balzer, E., Moss, E. G., Localization of the developmental
timing regulator Lin28 to mRNP complexes, P-bodies and
stress granules. RNA Biol. 2007, 4, 16–25.
[38] Angenstein, F., Evans, A. M., Ling, S. C., Settlage, R. E. et al.,
Proteomic characterization of messenger ribonucleoprotein
complexes bound to nontranslated or translated poly(A)
mRNAs in the rat cerebral cortex. J. Biol. Chem. 2005, 280,
6496–6503.
[39] Kasyapa, C. S., Kunapuli, P., Cowell, J. K., Mass spectroscopy
identifies the splicing-associated proteins, PSF, hnRNP H3,
hnRNP A2/B1, and TLS/FUS as interacting partners of the
ZNF198 protein associated with rearrangement in myelo-
proliferative disease. Exp. Cell Res. 2005, 309, 78–85.
[40] Jin, S., Weaver, D. T., Double-strand break repair by Ku70
requires heterodimerization with Ku80 and DNA binding
functions. EMBO J. 1997, 16, 6874–6885.
[41] Rajan, P., Dalgliesh, C., Bourgeois, C. F., Heiner, M. et al.,
Proteomic identification of heterogeneous nuclear ribonu-
cleoprotein L as a novel component of SLM/Sam68 Nuclear
Bodies. BMC Cell Biol. 2009, 10, 82.
[42] Yang, J. P., Reddy, T. R., Truong, K. T., Suhasini, M., Wong-
Staal, F., Functional interaction of Sam68 and hetero-
geneous nuclear ribonucleoprotein K. Oncogene 2002, 21,
7187–7194.
[43] Gao, J., Ade, A. S., Tarcea, V. G., Weymouth, T. E.
et al., Integrating and annotating the interactome using the
MiMI plugin for cytoscape. Bioinformatics 2009, 25,
137–138.
[44] Cassiday, L. A., Maher, L. J., III, Having it both ways: tran-
scription factors that bind DNA and RNA. Nucleic Acids Res.
2002, 30, 4118–4126.
[45] Vousden, K. H., Prives, C., Blinded by the light: the growing
complexity of p53. Cell 2009, 137, 413–431.
[46] Yoshida, Y., Izumi, H., Torigoe, T., Ishiguchi, H. et al.,
Binding of RNA to p53 regulates its oligomerization and
DNA-binding activity. Oncogene 2004, 23, 4371–4379.
[47] Grinberg, S., Teiblum, G., Rahav, G., Bakhanashvili, M., p53
in cytoplasm exerts 30-50 exonuclease activity with dsRNA.
Cell Cycle 2010, 9, 2442–2455.
[48] Fisher, T. S., Zakian, V. A., Ku: a multifunctional protein
involved in telomere maintenance. DNA Repair (Amst)
2005, 4, 1215–1226.
[49] Bertrand, P., Akhmedov, A. T., Delacote, F., Durrbach, A.,
Lopez, B. S., Human POMp75 is identified as the pro-
oncoprotein TLS/FUS: both POMp75 and POMp100 DNA
homologous pairing activities are associated to cell prolif-
eration. Oncogene 1999, 18, 4515–4521.
[50] Iko, Y., Kodama, T. S., Kasai, N., Oyama, T. et al., Domain
architectures and characterization of an RNA-binding
protein, TLS. J. Biol. Chem. 2004, 279, 44834–44840.
[51] Tung, C. L., Hou, P. H., Kao, Y. L., Huang, Y. W. et al., SOX2
modulates alternative splicing in transitional cell carcinoma.
Biochem. Biophys. Res. Commun. 2010, 393, 420–425.
934 X. Fang et al. Proteomics 2011, 11, 921–934
& 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim www.proteomics-journal.com