CuGenDBv2

CSPI04G13700 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI04G13700
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Genome location: Chr4:11657093..11657680
RNA-Seq Expression: CSPI04G13700
Synteny: CSPI04G13700
Gene Ontology terms:
GO:0007165 - signal transduction (biological process)
GO:0015074 - DNA integration (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003953 - NAD+ nucleosidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR043502 - DNA/RNA polymerase superfamily
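
The genome location field above packs chromosome, start, and end into one string. A minimal sketch of splitting it and computing the span length follows; the helper name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location):
    """Split a 'Chr4:11657093..11657680' style string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("Chr4:11657093..11657680")

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
span_bp = end - start + 1
print(chrom, start, end, span_bp)
```

Under that assumption the span is 588 bp.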


Homology
GenBank top hits | e value | % identity | Alignment
XP_031744753.1 uncharacterized protein LOC101212255 isoform X1 [Cucumis sativus] | 3.0e-107 | 96.41 | Alignment:
Query:  MSSPTVDHWAAVEQILCYLKVAPRRGILYRDHGHTRIECFSNADWTGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH
        MSSPTVDHWAAVEQILCYLK AP RGILY+DHGHTR+ECFS+ADW GSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH
Subjt:  MSSPTVDHWAAVEQILCYLKVAPRRGILYRDHGHTRIECFSNADWTGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH

Query:  QLLSEIGFSITVPAKLWCDNQVALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA
        QLLSEIGFSITVPAKLWCDNQ ALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA
Subjt:  QLLSEIGFSITVPAKLWCDNQVALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA

XP_031744754.1 uncharacterized protein LOC101212255 isoform X2 [Cucumis sativus] | 3.0e-107 | 96.41 | Alignment:
Query:  MSSPTVDHWAAVEQILCYLKVAPRRGILYRDHGHTRIECFSNADWTGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH
        MSSPTVDHWAAVEQILCYLK AP RGILY+DHGHTR+ECFS+ADW GSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH
Subjt:  MSSPTVDHWAAVEQILCYLKVAPRRGILYRDHGHTRIECFSNADWTGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH

Query:  QLLSEIGFSITVPAKLWCDNQVALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA
        QLLSEIGFSITVPAKLWCDNQ ALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA
Subjt:  QLLSEIGFSITVPAKLWCDNQVALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA

XP_031744755.1 uncharacterized protein LOC101212255 isoform X3 [Cucumis sativus] | 3.0e-107 | 96.41 | Alignment:
Query:  MSSPTVDHWAAVEQILCYLKVAPRRGILYRDHGHTRIECFSNADWTGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH
        MSSPTVDHWAAVEQILCYLK AP RGILY+DHGHTR+ECFS+ADW GSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH
Subjt:  MSSPTVDHWAAVEQILCYLKVAPRRGILYRDHGHTRIECFSNADWTGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH

Query:  QLLSEIGFSITVPAKLWCDNQVALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA
        QLLSEIGFSITVPAKLWCDNQ ALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA
Subjt:  QLLSEIGFSITVPAKLWCDNQVALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA

XP_031744756.1 uncharacterized protein LOC101212255 isoform X4 [Cucumis sativus] | 3.0e-107 | 96.41 | Alignment:
Query:  MSSPTVDHWAAVEQILCYLKVAPRRGILYRDHGHTRIECFSNADWTGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH
        MSSPTVDHWAAVEQILCYLK AP RGILY+DHGHTR+ECFS+ADW GSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH
Subjt:  MSSPTVDHWAAVEQILCYLKVAPRRGILYRDHGHTRIECFSNADWTGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH

Query:  QLLSEIGFSITVPAKLWCDNQVALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA
        QLLSEIGFSITVPAKLWCDNQ ALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA
Subjt:  QLLSEIGFSITVPAKLWCDNQVALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA

XP_031744758.1 uncharacterized protein LOC101212255 isoform X5 [Cucumis sativus] | 3.0e-107 | 96.41 | Alignment:
Query:  MSSPTVDHWAAVEQILCYLKVAPRRGILYRDHGHTRIECFSNADWTGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH
        MSSPTVDHWAAVEQILCYLK AP RGILY+DHGHTR+ECFS+ADW GSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH
Subjt:  MSSPTVDHWAAVEQILCYLKVAPRRGILYRDHGHTRIECFSNADWTGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH

Query:  QLLSEIGFSITVPAKLWCDNQVALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA
        QLLSEIGFSITVPAKLWCDNQ ALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA
Subjt:  QLLSEIGFSITVPAKLWCDNQVALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA

TrEMBL top hits | e value | % identity | Alignment
A0A5A7T406 Copia protein | 1.1e-94 | 87.37 | Alignment:
Query:  VDHWAAVEQILCYLKVAPRRGILYRDHGHTRIECFSNADWTGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSE
        VDHWA VEQILCY K AP RGILYRDHG+TR+ECFS+ADW GSR+D+RSTSGYCVFVG NLV WKSKKQNV+SRSSAESEYRAM QSVC IVWIHQLLSE
Subjt:  VDHWAAVEQILCYLKVAPRRGILYRDHGHTRIECFSNADWTGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSE

Query:  IGFSITVPAKLWCDNQVALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA
          FSITVPAKLWCDNQVALHIASNPVFHE+TK++EVDCHFIREKIQDGLVSTGYVKTGE+LGDILTKA+NG RISYLCNKL MIDIFAPA
Subjt:  IGFSITVPAKLWCDNQVALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA

A0A5A7UHS1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 7.9e-98 | 88.72 | Alignment:
Query:  MSSPTVDHWAAVEQILCYLKVAPRRGILYRDHGHTRIECFSNADWTGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH
        MS PTVDHWAAVEQILCYLK A  RGILY+DHGHT+++CFS+ADW GSREDRRS SGYCVFVGGNLVSWKSKKQNVVS SSA+SEYRAMAQSVCEIVWIH
Subjt:  MSSPTVDHWAAVEQILCYLKVAPRRGILYRDHGHTRIECFSNADWTGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH

Query:  QLLSEIGFSITVPAKLWCDNQVALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA
        QLLSEIGFSITVP KLWCDNQVALHIASNPVFHE+TKHIEVDCHFIREKIQDGL+STGYVKTGEQLGDILTK +NG RISYL  KL MIDIFAPA
Subjt:  QLLSEIGFSITVPAKLWCDNQVALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA

A0A5D3DZU1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 7.9e-98 | 88.72 | Alignment:
Query:  MSSPTVDHWAAVEQILCYLKVAPRRGILYRDHGHTRIECFSNADWTGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH
        MS PTVDHWAAVEQILCYLK A  RGILY+DHGHT+++CFS+ADW GSREDRRS SGYCVFVGGNLVSWKSKKQNVVS SSA+SEYRAMAQSVCEIVWIH
Subjt:  MSSPTVDHWAAVEQILCYLKVAPRRGILYRDHGHTRIECFSNADWTGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH

Query:  QLLSEIGFSITVPAKLWCDNQVALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA
        QLLSEIGFSITVP KLWCDNQVALHIASNPVFHE+TKHIEVDCHFIREKIQDGL+STGYVKTGEQLGDILTK +NG RISYL  KL MIDIFAPA
Subjt:  QLLSEIGFSITVPAKLWCDNQVALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA

A0A5D3E5M8 Copia protein | 2.4e-94 | 86.84 | Alignment:
Query:  VDHWAAVEQILCYLKVAPRRGILYRDHGHTRIECFSNADWTGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSE
        VDHWA VEQILCY K AP RGILY+DHG+TR+ECFS+ADW GSR+D+RSTSGYCVFVG NLV WKSKKQNV+SRSSAESEYRAM QSVC IVWIHQLLSE
Subjt:  VDHWAAVEQILCYLKVAPRRGILYRDHGHTRIECFSNADWTGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSE

Query:  IGFSITVPAKLWCDNQVALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA
          FSITVPAKLWCDNQVALHIASNPVFHE+TK++EVDCHFIREKIQDGLVSTGYVKTGE+LGDILTKA+NG RISYLCNKL MIDIFAPA
Subjt:  IGFSITVPAKLWCDNQVALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA

A0A6P6VCZ7 uncharacterized protein LOC113719488 | 1.3e-95 | 82.56 | Alignment:
Query:  MSSPTVDHWAAVEQILCYLKVAPRRGILYRDHGHTRIECFSNADWTGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH
        MSSPTVDHW AVEQ+L YLK AP RGILY +HGHTRIECFS++DW G +EDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAE+EYRAMA+SVCE++W++
Subjt:  MSSPTVDHWAAVEQILCYLKVAPRRGILYRDHGHTRIECFSNADWTGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH

Query:  QLLSEIGFSITVPAKLWCDNQVALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA
        QLLSE+G  ++VPAKLWCDNQ ALHIASNPVFHERTKHIE+DCHF+REKIQ GL++TGYVKTGEQLGDI TKALNG RI YLCNKLGMI+I+APA
Subjt:  QLLSEIGFSITVPAKLWCDNQVALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA

SwissProt top hits | e value | % identity | Alignment
P04146 Copia protein | 2.8e-31 | 36.96 | Alignment:
Query:  WAAVEQILCYLKVAPRRGILYRDH--GHTRIECFSNADWTGSREDRRSTSGYCV-FVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSE
        W  ++++L YLK      ++++ +     +I  + ++DW GS  DR+ST+GY       NL+ W +K+QN V+ SS E+EY A+ ++V E +W+  LL+ 
Subjt:  WAAVEQILCYLKVAPRRGILYRDH--GHTRIECFSNADWTGSREDRRSTSGYCV-FVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSE

Query:  IGFSITVPAKLWCDNQVALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMI
        I   +  P K++ DNQ  + IA+NP  H+R KHI++  HF RE++Q+ ++   Y+ T  QL DI TK L   R   L +KLG++
Subjt:  IGFSITVPAKLWCDNQVALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMI

P0CV72 Secreted RxLR effector protein 161 | 5.1e-17 | 41.84 | Alignment:
Query:  SSPTVDHWAAVEQILCYLKVAPRRGILYRDHGHTRIECFSNADWTGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWI
        S P   HW A++++L YL+     G+ +   G  ++  +S+ADW G  E RRSTSGY   + G  VSW+SKKQ  V+ SS E EY A++++  E VW+
Subjt:  SSPTVDHWAAVEQILCYLKVAPRRGILYRDHGHTRIECFSNADWTGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 1.5e-24 | 32.26 | Alignment:
Query:  MSSPTVDHWAAVEQILCYLKVAPRRGILYRDHGHTRIECFSNADWTGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH
        + +P  +HW AV+ IL YL+       L        ++ +++AD  G  ++R+S++GY     G  +SW+SK Q  V+ S+ E+EY A  ++  E++W+ 
Subjt:  MSSPTVDHWAAVEQILCYLKVAPRRGILYRDHGHTRIECFSNADWTGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH

Query:  QLLSEIGFSITVPAKLWCDNQVALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKL
        + L E+G        ++CD+Q A+ ++ N ++H RTKHI+V  H+IRE + D  +    + T E   D+LTK +   +   LC +L
Subjt:  QLLSEIGFSITVPAKLWCDNQVALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 5.9e-42 | 42.55 | Alignment:
Query:  MSSPTVDHWAAVEQILCYLKVAPRRGILYRDHGHTRIECFSNADWTGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH
        M  PT +H  A+++IL YL   P  GI  +      +  +S+ADW G ++D  ST+GY V++G + +SW SKKQ  V RSS E+EYR++A +  E+ WI 
Subjt:  MSSPTVDHWAAVEQILCYLKVAPRRGILYRDHGHTRIECFSNADWTGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH

Query:  QLLSEIGFSITVPAKLWCDNQVALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGM
         LL+E+G  +T P  ++CDN  A ++ +NPVFH R KHI +D HFIR ++Q G +   +V T +QL D LTK L+ T      +K+G+
Subjt:  QLLSEIGFSITVPAKLWCDNQVALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGM

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 2.0e-42 | 41.88 | Alignment:
Query:  MSSPTVDHWAAVEQILCYLKVAPRRGILYRDHGHTRIECFSNADWTGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH
        M  PT DHW A++++L YL   P  GI  +      +  +S+ADW G  +D  ST+GY V++G + +SW SKKQ  V RSS E+EYR++A +  E+ WI 
Subjt:  MSSPTVDHWAAVEQILCYLKVAPRRGILYRDHGHTRIECFSNADWTGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH

Query:  QLLSEIGFSITVPAKLWCDNQVALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDI
         LL+E+G  ++ P  ++CDN  A ++ +NPVFH R KHI +D HFIR ++Q G +   +V T +QL D LTK L+         K+G+I +
Subjt:  QLLSEIGFSITVPAKLWCDNQVALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDI

Arabidopsis top hits | e value | % identity | Alignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | 3.9e-41 | 42.27 | Alignment:
Query:  SPTVDHWAAVEQILCYLKVAPRRGILYRDHGHTRIECFSNADWTGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQL
        +P + H  AV +IL Y+K    +G+ Y      +++ FS+A +   ++ RRST+GYC+F+G +L+SWKSKKQ VVS+SSAE+EYRA++ +  E++W+ Q 
Subjt:  SPTVDHWAAVEQILCYLKVAPRRGILYRDHGHTRIECFSNADWTGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQL

Query:  LSEIGFSITVPAKLWCDNQVALHIASNPVFHERTKHIEVDCHFIREK-IQDGLVSTGYVKTGEQLG--DILTKALNGTRISYLCNKLGMIDIFA
          E+   ++ P  L+CDN  A+HIA+N VFHERTKHIE DCH +RE+ +    +S  +    EQ G  + L+  L GT I Y+ +  G+  + A
Subjt:  LSEIGFSITVPAKLWCDNQVALHIASNPVFHERTKHIEVDCHFIREK-IQDGLVSTGYVKTGEQLG--DILTKALNGTRISYLCNKLGMIDIFA

ATMG00810.1 DNA/RNA polymerases superfamily protein | 6.1e-18 | 37.76 | Alignment:
Query:  MSSPTVDHWAAVEQILCYLKVAPRRGILYRDHGHTRIECFSNADWTGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVW
        M  PT+  +  ++++L Y+K     G+    +    ++ F ++DW G    RRST+G+C F+G N++SW +K+Q  VSRSS E+EYRA+A +  E+ W
Subjt:  MSSPTVDHWAAVEQILCYLKVAPRRGILYRDHGHTRIECFSNADWTGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVW
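
The % identity values in the hit tables can be checked against the alignment midlines. The sketch below is an illustration only, not the database's own method: it assumes that % identity is the number of midline letters (identical residues) divided by the number of alignment columns. The function name `percent_identity` is hypothetical, and the two strings are copied from the P0CV72 hit above.

```python
def percent_identity(query, midline):
    """Return (identical columns, alignment length, percent identity).

    Assumes a letter in the midline marks an identical residue, while
    '+' (similar) and ' ' (mismatch or gap) do not count.
    """
    # Pad in case trailing spaces were trimmed from the midline.
    midline = midline.ljust(len(query))
    identical = sum(1 for ch in midline if ch.isalpha())
    total = len(query)
    return identical, total, 100.0 * identical / total


# Query and midline rows of the P0CV72 alignment, copied from the record.
query = "SSPTVDHWAAVEQILCYLKVAPRRGILYRDHGHTRIECFSNADWTGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWI"
midline = "S P   HW A++++L YL+     G+ +   G  ++  +S+ADW G  E RRSTSGY   + G  VSW+SKKQ  V+ SS E EY A++++  E VW+"

identical, total, pct = percent_identity(query, midline)
print(identical, total, round(pct, 2))
```

For this hit the count is 41 identical columns out of 98, which rounds to the 41.84 listed in the table.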


Sequences
CDS sequence
ATGTCTTCCCCTACAGTGGATCATTGGGCTGCAGTAGAGCAGATTTTGTGTTATCTAAAAGTTGCTCCTCGACGTGGGATCCTATACAGAGATCATGGACATACGAGAAT
TGAATGTTTTTCTAATGCTGATTGGACGGGGTCTCGTGAGGATAGGAGATCAACTTCTGGATATTGTGTTTTTGTAGGTGGAAACTTAGTTTCATGGAAGAGTAAGAAAC
AAAATGTTGTTTCTCGTTCGAGTGCTGAGTCAGAATATAGAGCTATGGCACAATCTGTGTGTGAAATAGTATGGATTCACCAACTATTATCTGAGATAGGCTTCAGTATT
ACCGTGCCAGCTAAATTATGGTGTGATAATCAAGTTGCACTTCATATTGCATCTAATCCAGTATTTCATGAACGAACTAAACATATTGAGGTGGATTGTCACTTCATTCG
TGAGAAAATCCAAGATGGGTTGGTGTCCACAGGATATGTGAAGACCGGAGAACAATTGGGAGATATTCTAACTAAAGCTTTAAATGGAACAAGGATAAGCTATCTGTGCA
ACAAGCTGGGCATGATCGACATATTTGCTCCAGCTTGA
mRNA sequence
ATGTCTTCCCCTACAGTGGATCATTGGGCTGCAGTAGAGCAGATTTTGTGTTATCTAAAAGTTGCTCCTCGACGTGGGATCCTATACAGAGATCATGGACATACGAGAAT
TGAATGTTTTTCTAATGCTGATTGGACGGGGTCTCGTGAGGATAGGAGATCAACTTCTGGATATTGTGTTTTTGTAGGTGGAAACTTAGTTTCATGGAAGAGTAAGAAAC
AAAATGTTGTTTCTCGTTCGAGTGCTGAGTCAGAATATAGAGCTATGGCACAATCTGTGTGTGAAATAGTATGGATTCACCAACTATTATCTGAGATAGGCTTCAGTATT
ACCGTGCCAGCTAAATTATGGTGTGATAATCAAGTTGCACTTCATATTGCATCTAATCCAGTATTTCATGAACGAACTAAACATATTGAGGTGGATTGTCACTTCATTCG
TGAGAAAATCCAAGATGGGTTGGTGTCCACAGGATATGTGAAGACCGGAGAACAATTGGGAGATATTCTAACTAAAGCTTTAAATGGAACAAGGATAAGCTATCTGTGCA
ACAAGCTGGGCATGATCGACATATTTGCTCCAGCTTGA
Protein sequence
MSSPTVDHWAAVEQILCYLKVAPRRGILYRDHGHTRIECFSNADWTGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSI
TVPAKLWCDNQVALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA
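
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch that assumes the standard genetic code; the `translate` helper is hypothetical, and only the first 30 and last 12 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a nucleotide string in frame 0, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 12 nucleotides of the CDS shown above.
cds_start = "ATGTCTTCCCCTACAGTGGATCATTGGGCT"
cds_end = "GCTCCAGCTTGA"

print(translate(cds_start))  # MSSPTVDHWA
print(translate(cds_end))    # APA
```

Both fragments agree with the start (MSSPTVDHWA) and end (APA) of the listed protein sequence. Running `translate` on the full CDS gives the same kind of check for the whole record.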