; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0004446 (gene) of Snake gourd v1 genome

Gene IDTan0004446
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionglutamic acid-rich protein-like
Genome locationLG05:2374217..2375023
RNA-Seq ExpressionTan0004446
SyntenyTan0004446
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6605505.1 hypothetical protein SDJN03_02822, partial [Cucurbita argyrosperma subsp. sororia]1.3e-7358.89Show/hide
Query:  MKTVTGSVVSSKPISLSKAASTLSSFLSADNGASQALCAYLRRASASFNELKQLHKDLKSSRSDRKHRHLGSEVSNDLEAAVD-----------------
        MKT+TG+VVSSKPISLSKAASTLSSFLS DNGAS+ALCAYLRRASASFNELKQLHKDLKSSRSDR  RH G EVS+ LEAAVD                 
Subjt:  MKTVTGSVVSSKPISLSKAASTLSSFLSADNGASQALCAYLRRASASFNELKQLHKDLKSSRSDRKHRHLGSEVSNDLEAAVD-----------------

Query:  -------------------------------------------------------SRRVEIDVESSDRGKSVVEIEKKKKKHKKKSEDGHGRVGEDERDA
                                                               SRRVE+DV+SSDR +SVV IEKKKKKHKKKS+D     GEDERD 
Subjt:  -------------------------------------------------------SRRVEIDVESSDRGKSVVEIEKKKKKHKKKSEDGHGRVGEDERDA

Query:  AEARKSYGKSRISANNGEIEATGDLVDNNVAMGKDRKRHEDNKSMGDVKDEKSADKDNGDRKDLVEMST------KKKKKKKKREEDADELQNNSGVAIE
        AE  +SYGKS+ SANNGE EAT D ++NNV  GKDRK+H+D  ++G   DE    KDN D   LVE+ST      KKKKKKK REED D+LQNNSG AIE
Subjt:  AEARKSYGKSRISANNGEIEATGDLVDNNVAMGKDRKRHEDNKSMGDVKDEKSADKDNGDRKDLVEMST------KKKKKKKKREEDADELQNNSGVAIE

Query:  KEERPVSDSKELKRKERKKRKNRDLEEGGDDGSEEQQGAKRRK
        KE+ PVSDS+ELKRKE KKRKN DLEEG DDGSEEQQG KRRK
Subjt:  KEERPVSDSKELKRKERKKRKNRDLEEGGDDGSEEQQGAKRRK

KAG7010591.1 hypothetical protein SDJN02_27385, partial [Cucurbita argyrosperma subsp. argyrosperma]2.0e-7455.77Show/hide
Query:  MKTVTGSVVSSKPISLSKAASTLSSFLSADNGASQALCAYLRRASASFNELKQLHKDLKSSRSDRKHRHLGSEVSNDLEAAVD-----------------
        MKTVTGS+VSSKPIS+SKAASTLSSFLS DNGAS+A+CAYLRRASASFNELKQLHK+LKSSRSDRKHRH GSE SND EA+ D                 
Subjt:  MKTVTGSVVSSKPISLSKAASTLSSFLSADNGASQALCAYLRRASASFNELKQLHKDLKSSRSDRKHRHLGSEVSNDLEAAVD-----------------

Query:  --------------------------------------------------------SRRVEIDVESSDRGKSVVEIEKKKKKHKKKSEDGHGRVGEDERD
                                                                +R+VE+DVESSD+ KSVV +E K+KKHKKKSED H ++ +DER+
Subjt:  --------------------------------------------------------SRRVEIDVESSDRGKSVVEIEKKKKKHKKKSEDGHGRVGEDERD

Query:  AAEARKSYGKSRISANNGEIEATGDLVDNNVAMGKDRKRHEDNKSMGDVKD--------------EKSADKDNGDRKDLVEMSTKKKKKKKKREEDADEL
           AR+SY KSRIS NNGEIEA+G  V+NN+A GKDRK+HED KS+ D KD              EKS +KDN D  +  +   KKKKKKK REE+ D+ 
Subjt:  AAEARKSYGKSRISANNGEIEATGDLVDNNVAMGKDRKRHEDNKSMGDVKD--------------EKSADKDNGDRKDLVEMSTKKKKKKKKREEDADEL

Query:  QNNSGVAIEKEERPVSDSKELKRKERKKRKNRDLEEGGDDGSEEQQGAKRRKGNL
        QNNSG A+ KEE PVSD KELKRKE+KKRKNR LEEGGDDGSEEQQ  KRRKGNL
Subjt:  QNNSGVAIEKEERPVSDSKELKRKERKKRKNRDLEEGGDDGSEEQQGAKRRKGNL

KAG7035443.1 hypothetical protein SDJN02_02239, partial [Cucurbita argyrosperma subsp. argyrosperma]4.5e-7459.01Show/hide
Query:  MKTVTGSVVSSKPISLSKAASTLSSFLSADNGASQALCAYLRRASASFNELKQLHKDLKSSRSDRKHRHLGSEVSNDLEAAVD-----------------
        MKT+TG+VVSSKPISLSKAASTLSSFLS DNGAS+ALCAYLRRASASFNELKQLHKDLKSSRSDR  RH G EVS+ LEAAVD                 
Subjt:  MKTVTGSVVSSKPISLSKAASTLSSFLSADNGASQALCAYLRRASASFNELKQLHKDLKSSRSDRKHRHLGSEVSNDLEAAVD-----------------

Query:  -------------------------------------------------------SRRVEIDVESSDRGKSVVEIEKKKKKHKKKSEDGHGRVGEDERDA
                                                               SRRVE+DV+SSDR +SVV IEKKKKKHKKKS+D     GEDERD 
Subjt:  -------------------------------------------------------SRRVEIDVESSDRGKSVVEIEKKKKKHKKKSEDGHGRVGEDERDA

Query:  AEARKSYGKSRISANNGEIEATGDLVDNNVAMGKDRKRHEDNKSMGDVKDEKSADKDNGDRKDLVEMST-------KKKKKKKKREEDADELQNNSGVAI
        AE  +SYGKS+ SANNGE EAT D ++NNV  GKDRK+H+D  ++G   DEK   KDN D   LVE+ST       KKKKKKK REED D+LQNNSG AI
Subjt:  AEARKSYGKSRISANNGEIEATGDLVDNNVAMGKDRKRHEDNKSMGDVKDEKSADKDNGDRKDLVEMST-------KKKKKKKKREEDADELQNNSGVAI

Query:  EKEERPVSDSKELKRKERKKRKNRDLEEGGDDGSEEQQGAKRRK
        EKE+ PVSDS+ELKRKE KKRKN DLEEG DDGSEEQQG KRRK
Subjt:  EKEERPVSDSKELKRKERKKRKNRDLEEGGDDGSEEQQGAKRRK

XP_022943393.1 glutamic acid-rich protein-like [Cucurbita moschata]3.5e-7455.77Show/hide
Query:  MKTVTGSVVSSKPISLSKAASTLSSFLSADNGASQALCAYLRRASASFNELKQLHKDLKSSRSDRKHRHLGSEVSNDLEAA-------------------
        MKTVTGS+VSSKPIS+SKAASTLSSFLS DNGAS+A+CAYLRRASASFNELKQLHK+LKSSRSDRKHRH GSE SND EA+                   
Subjt:  MKTVTGSVVSSKPISLSKAASTLSSFLSADNGASQALCAYLRRASASFNELKQLHKDLKSSRSDRKHRHLGSEVSNDLEAA-------------------

Query:  ------------------------------------------------------VDSRRVEIDVESSDRGKSVVEIEKKKKKHKKKSEDGHGRVGEDERD
                                                                +R+VE+DVESSD+ KSVV +EKK KKHKKKSED H ++ +DER+
Subjt:  ------------------------------------------------------VDSRRVEIDVESSDRGKSVVEIEKKKKKHKKKSEDGHGRVGEDERD

Query:  AAEARKSYGKSRISANNGEIEATGDLVDNNVAMGKDRKRHEDNKSMGDVKD--------------EKSADKDNGDRKDLVEMSTKKKKKKKKREEDADEL
           AR+SY KSR S NNGEIEA+G  V+NN+A GKDRK+HED KS+GD KD              EKS +KDN D  +  +   KKKKKKK REE+ D+ 
Subjt:  AAEARKSYGKSRISANNGEIEATGDLVDNNVAMGKDRKRHEDNKSMGDVKD--------------EKSADKDNGDRKDLVEMSTKKKKKKKKREEDADEL

Query:  QNNSGVAIEKEERPVSDSKELKRKERKKRKNRDLEEGGDDGSEEQQGAKRRKGNL
        QNNSG A+ KEE PVSD KELKRKE+KKRKNR LEEGGDDGSEEQQ  KRRKGNL
Subjt:  QNNSGVAIEKEERPVSDSKELKRKERKKRKNRDLEEGGDDGSEEQQGAKRRKGNL

XP_038902882.1 probable xyloglucan galactosyltransferase GT11 [Benincasa hispida]1.3e-7355.56Show/hide
Query:  MKTVTGSVVSSKPISLSKAASTLSSFLSADNGASQALCAYLRRASASFNELKQLHKDLKSSRSDRKHRHLGSEVSNDLEAAVD-----------------
        MKTVTGSVVSSKPIS+SKAASTLSSFLS DNGASQALCAYLRRASASFNELKQLHK+LKSSRS RKH H GSEVSN+LEAA+D                 
Subjt:  MKTVTGSVVSSKPISLSKAASTLSSFLSADNGASQALCAYLRRASASFNELKQLHKDLKSSRSDRKHRHLGSEVSNDLEAAVD-----------------

Query:  ----------------------------------------------------------SRRVEIDVESSDRGKSVVEIEKKKKKHKKKSEDGHGRVGEDE
                                                                  +R+VE+DVESSDR K VV +EKK+KKHKKK+ED HG + +DE
Subjt:  ----------------------------------------------------------SRRVEIDVESSDRGKSVVEIEKKKKKHKKKSEDGHGRVGEDE

Query:  RDAAEARKSYGKSRISANNGEIEATGDLVDNNVAMGKDRKRHEDNKSMGDVKD--------------EKSADKDNGDRKDLVEMST--KKKKKKKKREED
        RD+  AR S+ KS+ S NNG IEA+G+ V+NNVA  K  K+HED KS+GD KD              EK  +KDN D  D+V++ST  KKKKKKKKREED
Subjt:  RDAAEARKSYGKSRISANNGEIEATGDLVDNNVAMGKDRKRHEDNKSMGDVKD--------------EKSADKDNGDRKDLVEMST--KKKKKKKKREED

Query:  ADELQNNSGVAIEKEERPVSDSKELKRKERKKRKNRDL-EEGGDDGSEEQQGAKRRKGNL
         D+ QNNSG A+  +E PVS+SKELKRK+RKKRKNR+L EEGGDD SEE+QG KRRKGNL
Subjt:  ADELQNNSGVAIEKEERPVSDSKELKRKERKKRKNRDL-EEGGDDGSEEQQGAKRRKGNL

TrEMBL top hitse value%identityAlignment
A0A6J1CXN0 nuclear speckle splicing regulatory protein 11.1e-7053.02Show/hide
Query:  MKTVTGSVVSSKPISLSKAASTLSSFLSADNGASQALCAYLRRASASFNELKQLHKDLKSSRSDRKHRHLGSEVSNDLEAAVD-----------------
        MKTVTG V+SSKPISLSKAASTLSSFLS DNGAS A CAYLRRASASFNELKQLHK+LKSSRSDRKHRH  SEV + LE AVD                 
Subjt:  MKTVTGSVVSSKPISLSKAASTLSSFLSADNGASQALCAYLRRASASFNELKQLHKDLKSSRSDRKHRHLGSEVSNDLEAAVD-----------------

Query:  ---------------------------------------------------------------------------------SRRVEIDVESSDRGKSVVE
                                                                                         SRRVE+DVESSD  KS V 
Subjt:  ---------------------------------------------------------------------------------SRRVEIDVESSDRGKSVVE

Query:  IEKKKKKHKKKSEDGHGRVGEDERDAAEARKSYGKSRISANNGEIE-ATGDLVDNNVAMGKDRKRHEDNKSMGDVKD--------------EKSADKDNG
        +E ++KKHKKKSE+ HG+ G+DERDA  AR+SY KSRIS NNGEIE A GDLV+NNVA GKDRK+  D K++GD +D              E++ADK+NG
Subjt:  IEKKKKKHKKKSEDGHGRVGEDERDAAEARKSYGKSRISANNGEIE-ATGDLVDNNVAMGKDRKRHEDNKSMGDVKD--------------EKSADKDNG

Query:  DRKDLVEMSTKKKKKKKKREEDADELQNNSGVAIEKEERPVSDSKELKRKERKKRKNRDLEEGGDDGSEEQQGAKRRKGNL
        D KDLVE+ T  KK KKKREEDA +LQNNSG A+E++  PV DSK+LKRKE+KKRKNRDLE GG  GSEEQQG KRRKGNL
Subjt:  DRKDLVEMSTKKKKKKKKREEDADELQNNSGVAIEKEERPVSDSKELKRKERKKRKNRDLEEGGDDGSEEQQGAKRRKGNL

A0A6J1FSX8 glutamic acid-rich protein-like1.7e-7455.77Show/hide
Query:  MKTVTGSVVSSKPISLSKAASTLSSFLSADNGASQALCAYLRRASASFNELKQLHKDLKSSRSDRKHRHLGSEVSNDLEAA-------------------
        MKTVTGS+VSSKPIS+SKAASTLSSFLS DNGAS+A+CAYLRRASASFNELKQLHK+LKSSRSDRKHRH GSE SND EA+                   
Subjt:  MKTVTGSVVSSKPISLSKAASTLSSFLSADNGASQALCAYLRRASASFNELKQLHKDLKSSRSDRKHRHLGSEVSNDLEAA-------------------

Query:  ------------------------------------------------------VDSRRVEIDVESSDRGKSVVEIEKKKKKHKKKSEDGHGRVGEDERD
                                                                +R+VE+DVESSD+ KSVV +EKK KKHKKKSED H ++ +DER+
Subjt:  ------------------------------------------------------VDSRRVEIDVESSDRGKSVVEIEKKKKKHKKKSEDGHGRVGEDERD

Query:  AAEARKSYGKSRISANNGEIEATGDLVDNNVAMGKDRKRHEDNKSMGDVKD--------------EKSADKDNGDRKDLVEMSTKKKKKKKKREEDADEL
           AR+SY KSR S NNGEIEA+G  V+NN+A GKDRK+HED KS+GD KD              EKS +KDN D  +  +   KKKKKKK REE+ D+ 
Subjt:  AAEARKSYGKSRISANNGEIEATGDLVDNNVAMGKDRKRHEDNKSMGDVKD--------------EKSADKDNGDRKDLVEMSTKKKKKKKKREEDADEL

Query:  QNNSGVAIEKEERPVSDSKELKRKERKKRKNRDLEEGGDDGSEEQQGAKRRKGNL
        QNNSG A+ KEE PVSD KELKRKE+KKRKNR LEEGGDDGSEEQQ  KRRKGNL
Subjt:  QNNSGVAIEKEERPVSDSKELKRKERKKRKNRDLEEGGDDGSEEQQGAKRRKGNL

A0A6J1G5Q5 DEAD-box ATP-dependent RNA helicase 42-like isoform X11.1e-7358.94Show/hide
Query:  MKTVTGSVVSSKPISLSKAASTLSSFLSADNGASQALCAYLRRASASFNELKQLHKDLKSSRSDRKHRHLGSEVSNDLEAAVD-----------------
        MKT+TG+VVSSKPISLSKAASTLSSFLS DNGAS+ALCAYLRRASASFNELKQLHKDLKSSRSDR  RH G EVS+ LEAAVD                 
Subjt:  MKTVTGSVVSSKPISLSKAASTLSSFLSADNGASQALCAYLRRASASFNELKQLHKDLKSSRSDRKHRHLGSEVSNDLEAAVD-----------------

Query:  -------------------------------------------------------SRRVEIDVESSDRGKSVVEIEKKKKKHKKKSEDGHGRVGEDERDA
                                                               SRRVE+DV+SSDR +SVV IEKKKKKHKKKS+D     GEDERD 
Subjt:  -------------------------------------------------------SRRVEIDVESSDRGKSVVEIEKKKKKHKKKSEDGHGRVGEDERDA

Query:  AEARKSYGKSRISANNGEIEATGDLVDNNVAMGKDRKRHEDNKSMGDVKDEKSADKDNGDRKDLVEMST----KKKKKKKKREEDADELQNNSGVAIEKE
        AE  +SYGKS+ SANNGE EAT D ++NNV  GKDRK+H+D  ++G   DEK   KDN D   LVE+ST    KKKKKKK REED D+LQNN G AIEKE
Subjt:  AEARKSYGKSRISANNGEIEATGDLVDNNVAMGKDRKRHEDNKSMGDVKDEKSADKDNGDRKDLVEMST----KKKKKKKKREEDADELQNNSGVAIEKE

Query:  ERPVSDSKELKRKERKKRKNRDLEEGGDDGSEEQQGAKRRK
        + PVSD +ELKRKE KKRKN DLEEG DDGSEEQQG KRRK
Subjt:  ERPVSDSKELKRKERKKRKNRDLEEGGDDGSEEQQGAKRRK

A0A6J1G5Y0 sarcoplasmic reticulum histidine-rich calcium-binding protein-like isoform X23.8e-6656.01Show/hide
Query:  MKTVTGSVVSSKPISLSKAASTLSSFLSADNGASQALCAYLRRASASFNELKQLHKDLKSSRSDRKHRHLGSEVSNDLEAAVD-----------------
        MKT+TG+VVSSKPISLSKAASTLSSFLS DNGAS+ALCAYLRRASASFNELKQLHKDLKSSRSDR  RH G EVS+ LEAAVD                 
Subjt:  MKTVTGSVVSSKPISLSKAASTLSSFLSADNGASQALCAYLRRASASFNELKQLHKDLKSSRSDRKHRHLGSEVSNDLEAAVD-----------------

Query:  -------------------------------------------------------SRRVEIDVESSDRGKSVVEIEKKKKKHKKKSEDGHGRVGEDERDA
                                                               SRRVE+DV+SSDR +SVV IEKKKKKHKKKS+D     GEDERD 
Subjt:  -------------------------------------------------------SRRVEIDVESSDRGKSVVEIEKKKKKHKKKSEDGHGRVGEDERDA

Query:  AEARKSYGKSRISANNGEIEATGDLVDNNVAMGKDRKRHEDNKSMGDVKDEKSADKDNGDRKDLVEMST----KKKKKKKKREEDADELQNNSGVAIEKE
        AE  +SYGKS+ SANNGE EAT D ++NNV  GKDRK+H+D  ++G   DEK   KDN D   LVE+ST    KKKKKKK REED D+LQNN G AIEKE
Subjt:  AEARKSYGKSRISANNGEIEATGDLVDNNVAMGKDRKRHEDNKSMGDVKDEKSADKDNGDRKDLVEMST----KKKKKKKKREEDADELQNNSGVAIEKE

Query:  ERPVSDSKELKRKERKKRKNRDLEEGGDDGSEEQQGAKRRK
                       KKRKN DLEEG DDGSEEQQG KRRK
Subjt:  ERPVSDSKELKRKERKKRKNRDLEEGGDDGSEEQQGAKRRK

A0A6J1JCJ7 cylicin-1-like2.4e-7355.4Show/hide
Query:  MKTVTGSVVSSKPISLSKAASTLSSFLSADNGASQALCAYLRRASASFNELKQLHKDLKSSRSDRKHRHLGSEVSNDLEAAVD-----------------
        MKTVTGS+VSSKPIS+SKAASTLSSFLS DNGAS+A+CAYLRRASASFNELKQLHK+LKSSRSDRKHRH GSE SND EAA D                 
Subjt:  MKTVTGSVVSSKPISLSKAASTLSSFLSADNGASQALCAYLRRASASFNELKQLHKDLKSSRSDRKHRHLGSEVSNDLEAAVD-----------------

Query:  -----------------------------------------------------SRRVEIDVESSDRGKSVVEIEKKKKKHKKKSEDGHGRVGEDERDAAE
                                                             +R+VE+DVESSD+ KSVV +EKK KKH+KKSED + ++ +DE  A  
Subjt:  -----------------------------------------------------SRRVEIDVESSDRGKSVVEIEKKKKKHKKKSEDGHGRVGEDERDAAE

Query:  ARKSYGKSRISANNGEIEATGDLVDNNVAMGKDRKRHEDNKSMGDVKD--------------EKSADKDNGDRKDLVEMSTKKKKKKKKREEDADELQNN
        AR+S  KSR S NNGEIEA+   V+NN+A GKDRK+H D KS+GD KD              EKS +KDN D  +  +   KKKKKKK REE+ D+ QNN
Subjt:  ARKSYGKSRISANNGEIEATGDLVDNNVAMGKDRKRHEDNKSMGDVKD--------------EKSADKDNGDRKDLVEMSTKKKKKKKKREEDADELQNN

Query:  SGVAIEKEERPVSDSKELKRKERKKRKNRDLEEGGDDGSEEQQGAKRRKGNL
        SG A+ KEE PV D KELKRKE+KKRKNRDLEEGGDDGSEEQQ  KRRKGNL
Subjt:  SGVAIEKEERPVSDSKELKRKERKKRKNRDLEEGGDDGSEEQQGAKRRKGNL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G75335.1 unknown protein2.5e-1460Show/hide
Query:  MKTVTGSVVSSKPISLSKAASTLSSFLSADNGASQALCAYLRRASASFNELKQLHKDLK------SSRSDRK-HRHLGSE
        MKTVTG V S+KPISLSKAA+ LS F+S++NGASQ + AYLRRAS +F ELK +H+++K      SS+  RK HR +GSE
Subjt:  MKTVTGSVVSSKPISLSKAASTLSSFLSADNGASQALCAYLRRASASFNELKQLHKDLK------SSRSDRK-HRHLGSE

AT5G60030.1 unknown protein4.2e-1735.4Show/hide
Query:  MKTVTGSVVSSKPISLSKAASTLSSFLSADNGASQALCAYLRRASASFNELKQLHKDLKSSR----SDRKHRHLGSEVSNDLEA------AVDSRRVEID
        MKTVTG VVS++PISLSKAA  LS F S+DNGASQ + AYLRRASA+F ELK  H+++KS      SDR+ +   ++ S+D ++        D R++   
Subjt:  MKTVTGSVVSSKPISLSKAASTLSSFLSADNGASQALCAYLRRASASFNELKQLHKDLKSSR----SDRKHRHLGSEVSNDLEA------AVDSRRVEID

Query:  VESSDRGKSVV--EIEKKKKKHKKKSEDGHGRVGE--DERDAAEARKSYGKSRISANNGEIEATGDLVDNNVAMG-KDRKRHEDNKSMGDVKDEKSADKD
           +   +SV   E ++KK K  K ++    +V E  +    +E R+   K +    N + E   D+VD  V    +D ++  D K     K +K+ D+D
Subjt:  VESSDRGKSVV--EIEKKKKKHKKKSEDGHGRVGE--DERDAAEARKSYGKSRISANNGEIEATGDLVDNNVAMG-KDRKRHEDNKSMGDVKDEKSADKD

Query:  NGDRKDLVEMSTKKKK-KKKKREEDADELQNNSGVAIEKEERPVSDSKELKRK----------ERKKRKNRDLEEGGDDGSEEQQGAKRRK
          D K+ +E   K  + K+KK+ +D D +       +E E+R     KE K+K          ERK +K R  +E  + GSEE++  K+RK
Subjt:  NGDRKDLVEMSTKKKK-KKKKREEDADELQNNSGVAIEKEERPVSDSKELKRK----------ERKKRKNRDLEEGGDDGSEEQQGAKRRK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGACAGTCACCGGGAGCGTGGTTTCTTCGAAGCCAATTTCTCTCTCCAAGGCGGCGTCCACCCTCTCCTCCTTTCTTTCCGCCGATAATGGCGCTTCACAAGCGCT
CTGTGCCTACCTGAGACGCGCCTCCGCCTCCTTCAACGAGTTAAAGCAGCTTCACAAGGACCTCAAGTCTTCGCGGTCCGATCGAAAGCACCGGCATCTCGGATCCGAGG
TTTCAAATGATTTAGAGGCTGCCGTAGACAGTCGAAGAGTTGAGATCGATGTGGAGTCGAGCGATAGAGGTAAGAGCGTTGTAGAAATTGAGAAAAAGAAAAAAAAGCAC
AAGAAAAAGAGCGAGGATGGACATGGTAGAGTTGGAGAAGATGAACGTGATGCTGCTGAAGCCAGGAAAAGTTATGGTAAATCTCGAATTAGTGCTAATAATGGTGAGAT
TGAAGCTACTGGGGATCTCGTTGATAACAATGTAGCAATGGGAAAAGATAGAAAGAGGCATGAGGACAATAAGAGTATGGGCGATGTGAAGGATGAAAAGAGCGCGGATA
AGGACAATGGTGATCGAAAGGATCTTGTGGAGATGTCGACTAAGAAGAAGAAGAAGAAGAAGAAAAGGGAAGAAGATGCTGACGAGCTTCAAAATAACAGTGGAGTAGCT
ATAGAGAAAGAGGAAAGGCCAGTTTCGGATAGCAAAGAGTTGAAAAGGAAAGAAAGGAAAAAGAGGAAGAATCGAGACTTAGAAGAAGGCGGTGATGATGGTTCAGAAGA
GCAGCAGGGTGCAAAGAGAAGGAAAGGAAATTTATGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGACAGTCACCGGGAGCGTGGTTTCTTCGAAGCCAATTTCTCTCTCCAAGGCGGCGTCCACCCTCTCCTCCTTTCTTTCCGCCGATAATGGCGCTTCACAAGCGCT
CTGTGCCTACCTGAGACGCGCCTCCGCCTCCTTCAACGAGTTAAAGCAGCTTCACAAGGACCTCAAGTCTTCGCGGTCCGATCGAAAGCACCGGCATCTCGGATCCGAGG
TTTCAAATGATTTAGAGGCTGCCGTAGACAGTCGAAGAGTTGAGATCGATGTGGAGTCGAGCGATAGAGGTAAGAGCGTTGTAGAAATTGAGAAAAAGAAAAAAAAGCAC
AAGAAAAAGAGCGAGGATGGACATGGTAGAGTTGGAGAAGATGAACGTGATGCTGCTGAAGCCAGGAAAAGTTATGGTAAATCTCGAATTAGTGCTAATAATGGTGAGAT
TGAAGCTACTGGGGATCTCGTTGATAACAATGTAGCAATGGGAAAAGATAGAAAGAGGCATGAGGACAATAAGAGTATGGGCGATGTGAAGGATGAAAAGAGCGCGGATA
AGGACAATGGTGATCGAAAGGATCTTGTGGAGATGTCGACTAAGAAGAAGAAGAAGAAGAAGAAAAGGGAAGAAGATGCTGACGAGCTTCAAAATAACAGTGGAGTAGCT
ATAGAGAAAGAGGAAAGGCCAGTTTCGGATAGCAAAGAGTTGAAAAGGAAAGAAAGGAAAAAGAGGAAGAATCGAGACTTAGAAGAAGGCGGTGATGATGGTTCAGAAGA
GCAGCAGGGTGCAAAGAGAAGGAAAGGAAATTTATGA
Protein sequenceShow/hide protein sequence
MKTVTGSVVSSKPISLSKAASTLSSFLSADNGASQALCAYLRRASASFNELKQLHKDLKSSRSDRKHRHLGSEVSNDLEAAVDSRRVEIDVESSDRGKSVVEIEKKKKKH
KKKSEDGHGRVGEDERDAAEARKSYGKSRISANNGEIEATGDLVDNNVAMGKDRKRHEDNKSMGDVKDEKSADKDNGDRKDLVEMSTKKKKKKKKREEDADELQNNSGVA
IEKEERPVSDSKELKRKERKKRKNRDLEEGGDDGSEEQQGAKRRKGNL