; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI04G12300 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI04G12300
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationChr4:10639239..10639826
RNA-Seq ExpressionCSPI04G12300
SyntenyCSPI04G12300
Gene Ontology termsGO:0007165 - signal transduction (biological process)
GO:0015074 - DNA integration (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003953 - NAD+ nucleosidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_031744753.1 uncharacterized protein LOC101212255 isoform X1 [Cucumis sativus]4.1e-10998.97Show/hide
Query:  MSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH
        MSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH
Subjt:  MSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH

Query:  QLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHIICEKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA
        QLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCH I EKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA
Subjt:  QLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHIICEKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA

XP_031744754.1 uncharacterized protein LOC101212255 isoform X2 [Cucumis sativus]4.1e-10998.97Show/hide
Query:  MSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH
        MSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH
Subjt:  MSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH

Query:  QLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHIICEKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA
        QLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCH I EKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA
Subjt:  QLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHIICEKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA

XP_031744755.1 uncharacterized protein LOC101212255 isoform X3 [Cucumis sativus]4.1e-10998.97Show/hide
Query:  MSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH
        MSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH
Subjt:  MSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH

Query:  QLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHIICEKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA
        QLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCH I EKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA
Subjt:  QLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHIICEKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA

XP_031744756.1 uncharacterized protein LOC101212255 isoform X4 [Cucumis sativus]4.1e-10998.97Show/hide
Query:  MSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH
        MSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH
Subjt:  MSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH

Query:  QLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHIICEKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA
        QLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCH I EKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA
Subjt:  QLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHIICEKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA

XP_031744758.1 uncharacterized protein LOC101212255 isoform X5 [Cucumis sativus]4.1e-10998.97Show/hide
Query:  MSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH
        MSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH
Subjt:  MSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH

Query:  QLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHIICEKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA
        QLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCH I EKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA
Subjt:  QLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHIICEKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA

TrEMBL top hitse value%identityAlignment
A0A5A7UHS1 Retrovirus-related Pol polyprotein from transposon TNT 1-942.7e-9889.74Show/hide
Query:  MSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH
        MS PTVDHWAAVEQILCYLKAA GRGILYKDHGHT+V+CFSDADW GSREDRRS SGYCVFVGGNLVSWKSKKQNVVS SSA+SEYRAMAQSVCEIVWIH
Subjt:  MSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH

Query:  QLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHIICEKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA
        QLLSEIGFSITVP KLWCDNQ ALHIASNPVFHE+TKHIEVDCH I EKIQDGL+STGYVKTGEQLGDILTK +NG RISYL  KL MIDIFAPA
Subjt:  QLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHIICEKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA

A0A5D3DZU1 Retrovirus-related Pol polyprotein from transposon TNT 1-942.7e-9889.74Show/hide
Query:  MSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH
        MS PTVDHWAAVEQILCYLKAA GRGILYKDHGHT+V+CFSDADW GSREDRRS SGYCVFVGGNLVSWKSKKQNVVS SSA+SEYRAMAQSVCEIVWIH
Subjt:  MSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH

Query:  QLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHIICEKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA
        QLLSEIGFSITVP KLWCDNQ ALHIASNPVFHE+TKHIEVDCH I EKIQDGL+STGYVKTGEQLGDILTK +NG RISYL  KL MIDIFAPA
Subjt:  QLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHIICEKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA

A0A5D3E5M8 Copia protein2.8e-9588.42Show/hide
Query:  VDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSE
        VDHWA VEQILCY KAAPGRGILYKDHG+TRVECFSDADWAGSR+D+RSTSGYCVFVG NLV WKSKKQNV+SRSSAESEYRAM QSVC IVWIHQLLSE
Subjt:  VDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSE

Query:  IGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHIICEKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA
          FSITVPAKLWCDNQ ALHIASNPVFHE+TK++EVDCH I EKIQDGLVSTGYVKTGE+LGDILTKA+NG RISYLCNKL MIDIFAPA
Subjt:  IGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHIICEKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA

A0A6J1DFP5 uncharacterized protein LOC1110203913.7e-9584.54Show/hide
Query:  MSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH
        MSSPT DHW A+E ILCYLK APGRG+LYKDHGH  +ECFSDADWAGS+E+RRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCE+VWIH
Subjt:  MSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH

Query:  QLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHIICEKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAP
        QLL+E+GF IT P KLWCDNQAALHIASN VFHERTKHIEVDCH +CEKI  GLV TGYVKTG+QLGDI TKALNG RI YL NKLGMI+I+AP
Subjt:  QLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHIICEKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAP

A0A6P6VCZ7 uncharacterized protein LOC1137194881.5e-9683.08Show/hide
Query:  MSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH
        MSSPTVDHW AVEQ+L YLK APGRGILY +HGHTR+ECFSD+DWAG +EDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAE+EYRAMA+SVCE++W++
Subjt:  MSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH

Query:  QLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHIICEKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA
        QLLSE+G  ++VPAKLWCDNQAALHIASNPVFHERTKHIE+DCH + EKIQ GL++TGYVKTGEQLGDI TKALNG RI YLCNKLGMI+I+APA
Subjt:  QLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHIICEKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA

SwissProt top hitse value%identityAlignment
P04146 Copia protein9.5e-3236.96Show/hide
Query:  WAAVEQILCYLKAAPGRGILYKDH--GHTRVECFSDADWAGSREDRRSTSGYCV-FVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSE
        W  ++++L YLK      +++K +     ++  + D+DWAGS  DR+ST+GY       NL+ W +K+QN V+ SS E+EY A+ ++V E +W+  LL+ 
Subjt:  WAAVEQILCYLKAAPGRGILYKDH--GHTRVECFSDADWAGSREDRRSTSGYCV-FVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSE

Query:  IGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHIICEKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMI
        I   +  P K++ DNQ  + IA+NP  H+R KHI++  H   E++Q+ ++   Y+ T  QL DI TK L   R   L +KLG++
Subjt:  IGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHIICEKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-947.8e-2632.8Show/hide
Query:  MSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH
        + +P  +HW AV+ IL YL+   G  + +       ++ ++DAD AG  ++R+S++GY     G  +SW+SK Q  V+ S+ E+EY A  ++  E++W+ 
Subjt:  MSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH

Query:  QLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHIICEKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKL
        + L E+G        ++CD+Q+A+ ++ N ++H RTKHI+V  H I E + D  +    + T E   D+LTK +   +   LC +L
Subjt:  QLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHIICEKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKL

P92519 Uncharacterized mitochondrial protein AtMg008102.7e-1840.82Show/hide
Query:  MSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVW
        M  PT+  +  ++++L Y+K     G+    +    V+ F D+DWAG    RRST+G+C F+G N++SW +K+Q  VSRSS E+EYRA+A +  E+ W
Subjt:  MSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.0e-4243.09Show/hide
Query:  MSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH
        M  PT +H  A+++IL YL   P  GI  K      +  +SDADWAG ++D  ST+GY V++G + +SW SKKQ  V RSS E+EYR++A +  E+ WI 
Subjt:  MSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH

Query:  QLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHIICEKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGM
         LL+E+G  +T P  ++CDN  A ++ +NPVFH R KHI +D H I  ++Q G +   +V T +QL D LTK L+ T      +K+G+
Subjt:  QLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHIICEKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGM

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE25.4e-4342.41Show/hide
Query:  MSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH
        M  PT DHW A++++L YL   P  GI  K      +  +SDADWAG  +D  ST+GY V++G + +SW SKKQ  V RSS E+EYR++A +  E+ WI 
Subjt:  MSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIH

Query:  QLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHIICEKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDI
         LL+E+G  ++ P  ++CDN  A ++ +NPVFH R KHI +D H I  ++Q G +   +V T +QL D LTK L+         K+G+I +
Subjt:  QLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHIICEKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDI

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.2e-4243.3Show/hide
Query:  SPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQL
        +P + H  AV +IL Y+K   G+G+ Y      +++ FSDA +   ++ RRST+GYC+F+G +L+SWKSKKQ VVS+SSAE+EYRA++ +  E++W+ Q 
Subjt:  SPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQL

Query:  LSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHIICEK-IQDGLVSTGYVKTGEQLG--DILTKALNGTRISYLCNKLGMIDIFA
          E+   ++ P  L+CDN AA+HIA+N VFHERTKHIE DCH + E+ +    +S  +    EQ G  + L+  L GT I Y+ +  G+  + A
Subjt:  LSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHIICEK-IQDGLVSTGYVKTGEQLG--DILTKALNGTRISYLCNKLGMIDIFA

ATMG00240.1 Gag-Pol-related retrotransposon family protein5.4e-0636.54Show/hide
Query:  AVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFV
        AV ++L Y+K   G+G+ Y      +++ F+D+DWA   + RRS +G+C  V
Subjt:  AVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFV

ATMG00810.1 DNA/RNA polymerases superfamily protein1.9e-1940.82Show/hide
Query:  MSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVW
        M  PT+  +  ++++L Y+K     G+    +    V+ F D+DWAG    RRST+G+C F+G N++SW +K+Q  VSRSS E+EYRA+A +  E+ W
Subjt:  MSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTTCCCCTACAGTGGATCATTGGGCTGCAGTAGAGCAGATTTTGTGTTATCTAAAAGCTGCTCCTGGACGTGGGATCCTATACAAAGATCATGGACATACGAGAGT
TGAATGTTTTTCTGATGCTGATTGGGCGGGGTCTCGTGAGGATAGGAGATCGACTTCTGGATATTGTGTTTTTGTAGGTGGAAACTTAGTTTCATGGAAGAGTAAGAAAC
AAAATGTTGTTTCTCGTTCGAGTGCTGAGTCAGAATATAGAGCTATGGCACAATCTGTGTGTGAAATAGTATGGATTCACCAACTATTATCTGAGATAGGCTTCAGTATT
ACCGTGCCAGCTAAATTATGGTGTGATAATCAAGCTGCACTTCATATTGCATCTAATCCAGTATTTCATGAACGAACTAAACATATTGAGGTGGATTGTCACATCATTTG
TGAGAAAATCCAAGATGGGTTGGTGTCCACAGGATATGTGAAGACCGGAGAACAATTGGGAGATATTCTAACTAAAGCTTTAAATGGAACAAGGATAAGCTATCTGTGCA
ACAAGCTGGGCATGATCGACATATTTGCTCCAGCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCTTCCCCTACAGTGGATCATTGGGCTGCAGTAGAGCAGATTTTGTGTTATCTAAAAGCTGCTCCTGGACGTGGGATCCTATACAAAGATCATGGACATACGAGAGT
TGAATGTTTTTCTGATGCTGATTGGGCGGGGTCTCGTGAGGATAGGAGATCGACTTCTGGATATTGTGTTTTTGTAGGTGGAAACTTAGTTTCATGGAAGAGTAAGAAAC
AAAATGTTGTTTCTCGTTCGAGTGCTGAGTCAGAATATAGAGCTATGGCACAATCTGTGTGTGAAATAGTATGGATTCACCAACTATTATCTGAGATAGGCTTCAGTATT
ACCGTGCCAGCTAAATTATGGTGTGATAATCAAGCTGCACTTCATATTGCATCTAATCCAGTATTTCATGAACGAACTAAACATATTGAGGTGGATTGTCACATCATTTG
TGAGAAAATCCAAGATGGGTTGGTGTCCACAGGATATGTGAAGACCGGAGAACAATTGGGAGATATTCTAACTAAAGCTTTAAATGGAACAAGGATAAGCTATCTGTGCA
ACAAGCTGGGCATGATCGACATATTTGCTCCAGCTTGA
Protein sequenceShow/hide protein sequence
MSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSI
TVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHIICEKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA