CuGenDBv2

Spg025011 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg025011
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Retrotransposon protein
Genome location: scaffold12:12489783..12492089
RNA-Seq Expression: Spg025011
Synteny: Spg025011
Gene Ontology terms: NA
InterPro domains: IPR024752 - Myb/SANT-like domain
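
The genome location above is written in a `scaffold:start..end` form. As a minimal sketch, the string can be split into its parts and the span length computed in Python. The function names `parse_location` and `span_length` are hypothetical, and treating the coordinates as 1-based and inclusive is an assumption that this page does not state.

```python
import re


def parse_location(location):
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    scaffold, start, end = match.groups()
    return scaffold, int(start), int(end)


def span_length(start, end):
    """Length of the span, assuming 1-based inclusive coordinates
    (an assumption; the page does not state the convention)."""
    return abs(end - start) + 1


scaffold, start, end = parse_location("scaffold12:12489783..12492089")
print(scaffold, start, end, span_length(start, end))
```

If the coordinates turn out to be 0-based or half-open, only `span_length` would need to change.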


Homology
GenBank top hits (e value, % identity, alignment)
KAA0033487.1 uncharacterized protein E6C27_scaffold261G00210 [Cucumis melo var. makuwa]; e value: 4.4e-61; % identity: 44.48
Query:  LLRTTSGLVGTEIVDVEEMVAMFLHIIAHDVKNRVIRRQFARSGETVSRHFNATLSAVLRLYDVLLKKPEPITTSCQDGR-----LMAGADKQQKHIWTR
        +LRT  GL  T+ VDVEEMV +FLHI+AHDVKNRV RR FARSGETVSRHFN  L+ VLRL+++LLK+P+ +T SC   +     + +   K  KH WT 
Subjt:  LLRTTSGLVGTEIVDVEEMVAMFLHIIAHDVKNRVIRRQFARSGETVSRHFNATLSAVLRLYDVLLKKPEPITTSCQDGR-----LMAGADKQQKHIWTR

Query:  QEEARLVESLVELVHEGGWRGDNGTFRAGYLARLKRMIKDKMPSCTIESTSVIDCKVRSLKRQYSAISEMLGPGCSGFGWNDEFKCIQAEREVYDAWVK-
         E+  LVE L++LV +G WR DNGTF+ GYL ++++++K+K+    I+ T  ++  V+ LK+QY+ I+EM+GP CSGF WN E KCI+AE+ V + WVK 
Subjt:  QEEARLVESLVELVHEGGWRGDNGTFRAGYLARLKRMIKDKMPSCTIESTSVIDCKVRSLKRQYSAISEMLGPGCSGFGWNDEFKCIQAEREVYDAWVK-

Query:  ----------------DLAFVFDKDWASGGACNVPAKQAENTHGDDGGDQNVQAEQDCYVPAPPDINLAADMDFEDVPITPTSRPSTAGSSQSRKRSRA
                        DL  VF +D A+GG C  P +    T  D   D  +   +D  +P P  +   +    ED+P TPTS    AGS +  K+ R+
Subjt:  ----------------DLAFVFDKDWASGGACNVPAKQAENTHGDDGGDQNVQAEQDCYVPAPPDINLAADMDFEDVPITPTSRPSTAGSSQSRKRSRA

KAA0034843.1 retrotransposon protein [Cucumis melo var. makuwa]; e value: 3.7e-60; % identity: 32.55
Query:  SQRQLFNLINSFMNNHPRIENQTPYLRHQIRQLACFRLIHESDLCCRESTRMDRRCFSILCSLLRTTSGLVGTEIVDVEEMVAMFLHIIAHDVKNRVIRR
        SQRQL  ++    N+  RI +     RH+IRQLA FR+IH SDL CR+STRMDRRCF+ILC LLRT +GL  TE+VDVEEMVAMFLHI+AHDVKNRVI+R
Subjt:  SQRQLFNLINSFMNNHPRIENQTPYLRHQIRQLACFRLIHESDLCCRESTRMDRRCFSILCSLLRTTSGLVGTEIVDVEEMVAMFLHIIAHDVKNRVIRR

Query:  QFARSGETVSRHFNATLSAVLRLYDVLLKKPEPITTSCQDGR----------------------------------------------------------
        +F RSGET+SRHFN  L AV+RL+D LLKKP+P+   C D R                                                          
Subjt:  QFARSGETVSRHFNATLSAVLRLYDVLLKKPEPITTSCQDGR----------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------LMAGADKQQKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRAGYLARLKRMIKDKMPSCTIESTSVIDC
                                       M  + +  KH WT++EEA     LVELV+ GGWR DNGTFR GYL +L RM+  K+P C I + S ID 
Subjt:  ------------------------------LMAGADKQQKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRAGYLARLKRMIKDKMPSCTIESTSVIDC

Query:  KVRSLKRQYSAISEMLGPGCSGFGWNDEFKCIQAEREVYDAW---------------VKDLAFVFDKDWASGGACNVPAK-QAENTHGDDGGDQNVQAEQ
        +++ +KR + A++EM GP CSGFGWNDE KCI AE+EV+D W                 +L++VF KD A+GG     A   + N  G D    +   + 
Subjt:  KVRSLKRQYSAISEMLGPGCSGFGWNDEFKCIQAEREVYDAW---------------VKDLAFVFDKDWASGGACNVPAK-QAENTHGDDGGDQNVQAEQ

Query:  DCYVPAPPDINLAADMDFEDVPITPTSRPSTAGS-SQSRKRSRASYEAEA
        D      P  +L  +M  +D+  T T+R S   + S   KR R  +  ++
Subjt:  DCYVPAPPDINLAADMDFEDVPITPTSRPSTAGS-SQSRKRSRASYEAEA

KAA0036474.1 retrotransposon protein [Cucumis melo var. makuwa]; e value: 6.8e-62; % identity: 34.38
Query:  MNNHPRIENQTPYLRHQIRQLACFRLIHESDLCCRESTRMDRRCFSILCSLLRTTSGLVGTEIVDVEEMVAMFLHIIAHDVKNRVIRRQFARSGETVSRH
        MNN+ R+ +  P  RH+IR+LA FR+IHESDL CR+STRMDRR F+ILC LLR  +GL  TEIVDVEEMVAMFLHI AHDVKNRVI+R+F RSGETVSRH
Subjt:  MNNHPRIENQTPYLRHQIRQLACFRLIHESDLCCRESTRMDRRCFSILCSLLRTTSGLVGTEIVDVEEMVAMFLHIIAHDVKNRVIRRQFARSGETVSRH

Query:  FNATLSAVLRLYDVLLKKPEPITTSCQDGRL---------------------------------------------------------------------
        FN  L AVLRLY+ L+K+P P+T++C D R                                                                      
Subjt:  FNATLSAVLRLYDVLLKKPEPITTSCQDGRL---------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------MAGADKQQKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRAGYLARLKRMIKDKMPSCTIESTS
                                           M+ +++  +H+WTR+EE  LVE L+ELV  GGW+ DNGTFR+GYLA+L RM+ +K+  C + +T+
Subjt:  -----------------------------------MAGADKQQKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRAGYLARLKRMIKDKMPSCTIESTS

Query:  VIDCKVRSLKRQYSAISEMLGPGCSGFGWNDEFKCIQAEREVYDAWVK
        VIDC++++LKR + AI+EM GP CSGFGWNDE KCI AE+E++D WV+
Subjt:  VIDCKVRSLKRQYSAISEMLGPGCSGFGWNDEFKCIQAEREVYDAWVK

TYK07921.1 hypothetical protein E5676_scaffold265G00330 [Cucumis melo var. makuwa]; e value: 1.0e-57; % identity: 40.22
Query:  MDRRCFSILCSLLRTTSGLVGTEIVDVEEMVAMFLHIIAHDVKNRVIRRQFARSGETVSRHFNATLSAVLRLYDVLLKKPEPITTSCQ-DG---------
        MDRRCF+ILC++LRT  GL  T+ VDV+EMV +FLHI+AHDVKNRV RR  ARSGETVSRHFNA L+AVLRL+++LLK+P+P+T SC  DG         
Subjt:  MDRRCFSILCSLLRTTSGLVGTEIVDVEEMVAMFLHIIAHDVKNRVIRRQFARSGETVSRHFNATLSAVLRLYDVLLKKPEPITTSCQ-DG---------

Query:  ------------------------------------RLMAGADKQQKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRAGYLARLKRMIKDKMPSCTIE
                                             + +   K  KH WT  E+  LVE L++LV EGGWR DNGTF+ GYL                 
Subjt:  ------------------------------------RLMAGADKQQKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRAGYLARLKRMIKDKMPSCTIE

Query:  STSVIDCKVRSLKRQYSAISEMLGPGCSGFGWNDEFKCIQAEREVYDAWVK-----------------DLAFVFDKDWASGGACNVPAKQAENTHGDDGG
                     +QY+AI+EM+GP CSGFGWN+  KCI+ E+ V+D WVK                 DL  VF +D A+GG C  P + +  T  D   
Subjt:  STSVIDCKVRSLKRQYSAISEMLGPGCSGFGWNDEFKCIQAEREVYDAWVK-----------------DLAFVFDKDWASGGACNVPAKQAENTHGDDGG

Query:  DQNVQAEQDCYVPAPPDINLAADMDFEDVPITPTSRPSTAGSSQSRKRSRASYEAEAL
        D      +D  +P P  +   +    ED+P TPTS    AGSS+  K+ R SY  + +
Subjt:  DQNVQAEQDCYVPAPPDINLAADMDFEDVPITPTSRPSTAGSSQSRKRSRASYEAEAL

TYK26842.1 uncharacterized protein E5676_scaffold260G00340 [Cucumis melo var. makuwa]; e value: 8.9e-62; % identity: 44.82
Query:  LLRTTSGLVGTEIVDVEEMVAMFLHIIAHDVKNRVIRRQFARSGETVSRHFNATLSAVLRLYDVLLKKPEPITTSCQDGR-----LMAGADKQQKHIWTR
        +LRT  GL  T+ VDVEEMV +FLHI+AHDVKNRV RR FARSGETVSRHFN  L+ VLRL+++LLK+P+ +T SC   +     + +   K  KH WT 
Subjt:  LLRTTSGLVGTEIVDVEEMVAMFLHIIAHDVKNRVIRRQFARSGETVSRHFNATLSAVLRLYDVLLKKPEPITTSCQDGR-----LMAGADKQQKHIWTR

Query:  QEEARLVESLVELVHEGGWRGDNGTFRAGYLARLKRMIKDKMPSCTIESTSVIDCKVRSLKRQYSAISEMLGPGCSGFGWNDEFKCIQAEREVYDAWVK-
         E+  LVE L++LV +G WR DNGTF+ GYL ++++++K+K+    I+ T  ++  V+ LK+QY+ I+EM+GP CSGF WN E KCI+AE+ V + WVK 
Subjt:  QEEARLVESLVELVHEGGWRGDNGTFRAGYLARLKRMIKDKMPSCTIESTSVIDCKVRSLKRQYSAISEMLGPGCSGFGWNDEFKCIQAEREVYDAWVK-

Query:  ----------------DLAFVFDKDWASGGACNVPAKQAENTHGDDGGDQNVQAEQDCYVPAPPDINLAADMDFEDVPITPTSRPSTAGSSQSRKRSRA
                        DL  VF +D A+GG C  P +    T  D   D  +   +D  +P P  +   +    ED+P TPTS    AGSS+  K+ R+
Subjt:  ----------------DLAFVFDKDWASGGACNVPAKQAENTHGDDGGDQNVQAEQDCYVPAPPDINLAADMDFEDVPITPTSRPSTAGSSQSRKRSRA

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SW62 Myb_DNA-bind_3 domain-containing protein; e value: 2.1e-61; % identity: 44.48
Query:  LLRTTSGLVGTEIVDVEEMVAMFLHIIAHDVKNRVIRRQFARSGETVSRHFNATLSAVLRLYDVLLKKPEPITTSCQDGR-----LMAGADKQQKHIWTR
        +LRT  GL  T+ VDVEEMV +FLHI+AHDVKNRV RR FARSGETVSRHFN  L+ VLRL+++LLK+P+ +T SC   +     + +   K  KH WT 
Subjt:  LLRTTSGLVGTEIVDVEEMVAMFLHIIAHDVKNRVIRRQFARSGETVSRHFNATLSAVLRLYDVLLKKPEPITTSCQDGR-----LMAGADKQQKHIWTR

Query:  QEEARLVESLVELVHEGGWRGDNGTFRAGYLARLKRMIKDKMPSCTIESTSVIDCKVRSLKRQYSAISEMLGPGCSGFGWNDEFKCIQAEREVYDAWVK-
         E+  LVE L++LV +G WR DNGTF+ GYL ++++++K+K+    I+ T  ++  V+ LK+QY+ I+EM+GP CSGF WN E KCI+AE+ V + WVK 
Subjt:  QEEARLVESLVELVHEGGWRGDNGTFRAGYLARLKRMIKDKMPSCTIESTSVIDCKVRSLKRQYSAISEMLGPGCSGFGWNDEFKCIQAEREVYDAWVK-

Query:  ----------------DLAFVFDKDWASGGACNVPAKQAENTHGDDGGDQNVQAEQDCYVPAPPDINLAADMDFEDVPITPTSRPSTAGSSQSRKRSRA
                        DL  VF +D A+GG C  P +    T  D   D  +   +D  +P P  +   +    ED+P TPTS    AGS +  K+ R+
Subjt:  ----------------DLAFVFDKDWASGGACNVPAKQAENTHGDDGGDQNVQAEQDCYVPAPPDINLAADMDFEDVPITPTSRPSTAGSSQSRKRSRA

A0A5A7SWD8 Retrotransposon protein; e value: 1.8e-60; % identity: 32.55
Query:  SQRQLFNLINSFMNNHPRIENQTPYLRHQIRQLACFRLIHESDLCCRESTRMDRRCFSILCSLLRTTSGLVGTEIVDVEEMVAMFLHIIAHDVKNRVIRR
        SQRQL  ++    N+  RI +     RH+IRQLA FR+IH SDL CR+STRMDRRCF+ILC LLRT +GL  TE+VDVEEMVAMFLHI+AHDVKNRVI+R
Subjt:  SQRQLFNLINSFMNNHPRIENQTPYLRHQIRQLACFRLIHESDLCCRESTRMDRRCFSILCSLLRTTSGLVGTEIVDVEEMVAMFLHIIAHDVKNRVIRR

Query:  QFARSGETVSRHFNATLSAVLRLYDVLLKKPEPITTSCQDGR----------------------------------------------------------
        +F RSGET+SRHFN  L AV+RL+D LLKKP+P+   C D R                                                          
Subjt:  QFARSGETVSRHFNATLSAVLRLYDVLLKKPEPITTSCQDGR----------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------LMAGADKQQKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRAGYLARLKRMIKDKMPSCTIESTSVIDC
                                       M  + +  KH WT++EEA     LVELV+ GGWR DNGTFR GYL +L RM+  K+P C I + S ID 
Subjt:  ------------------------------LMAGADKQQKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRAGYLARLKRMIKDKMPSCTIESTSVIDC

Query:  KVRSLKRQYSAISEMLGPGCSGFGWNDEFKCIQAEREVYDAW---------------VKDLAFVFDKDWASGGACNVPAK-QAENTHGDDGGDQNVQAEQ
        +++ +KR + A++EM GP CSGFGWNDE KCI AE+EV+D W                 +L++VF KD A+GG     A   + N  G D    +   + 
Subjt:  KVRSLKRQYSAISEMLGPGCSGFGWNDEFKCIQAEREVYDAW---------------VKDLAFVFDKDWASGGACNVPAK-QAENTHGDDGGDQNVQAEQ

Query:  DCYVPAPPDINLAADMDFEDVPITPTSRPSTAGS-SQSRKRSRASYEAEA
        D      P  +L  +M  +D+  T T+R S   + S   KR R  +  ++
Subjt:  DCYVPAPPDINLAADMDFEDVPITPTSRPSTAGS-SQSRKRSRASYEAEA

A0A5A7SYW1 Retrotransposon protein; e value: 3.3e-62; % identity: 34.38
Query:  MNNHPRIENQTPYLRHQIRQLACFRLIHESDLCCRESTRMDRRCFSILCSLLRTTSGLVGTEIVDVEEMVAMFLHIIAHDVKNRVIRRQFARSGETVSRH
        MNN+ R+ +  P  RH+IR+LA FR+IHESDL CR+STRMDRR F+ILC LLR  +GL  TEIVDVEEMVAMFLHI AHDVKNRVI+R+F RSGETVSRH
Subjt:  MNNHPRIENQTPYLRHQIRQLACFRLIHESDLCCRESTRMDRRCFSILCSLLRTTSGLVGTEIVDVEEMVAMFLHIIAHDVKNRVIRRQFARSGETVSRH

Query:  FNATLSAVLRLYDVLLKKPEPITTSCQDGRL---------------------------------------------------------------------
        FN  L AVLRLY+ L+K+P P+T++C D R                                                                      
Subjt:  FNATLSAVLRLYDVLLKKPEPITTSCQDGRL---------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------MAGADKQQKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRAGYLARLKRMIKDKMPSCTIESTS
                                           M+ +++  +H+WTR+EE  LVE L+ELV  GGW+ DNGTFR+GYLA+L RM+ +K+  C + +T+
Subjt:  -----------------------------------MAGADKQQKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRAGYLARLKRMIKDKMPSCTIESTS

Query:  VIDCKVRSLKRQYSAISEMLGPGCSGFGWNDEFKCIQAEREVYDAWVK
        VIDC++++LKR + AI+EM GP CSGFGWNDE KCI AE+E++D WV+
Subjt:  VIDCKVRSLKRQYSAISEMLGPGCSGFGWNDEFKCIQAEREVYDAWVK

A0A5D3C7T4 Uncharacterized protein; e value: 4.9e-58; % identity: 40.22
Query:  MDRRCFSILCSLLRTTSGLVGTEIVDVEEMVAMFLHIIAHDVKNRVIRRQFARSGETVSRHFNATLSAVLRLYDVLLKKPEPITTSCQ-DG---------
        MDRRCF+ILC++LRT  GL  T+ VDV+EMV +FLHI+AHDVKNRV RR  ARSGETVSRHFNA L+AVLRL+++LLK+P+P+T SC  DG         
Subjt:  MDRRCFSILCSLLRTTSGLVGTEIVDVEEMVAMFLHIIAHDVKNRVIRRQFARSGETVSRHFNATLSAVLRLYDVLLKKPEPITTSCQ-DG---------

Query:  ------------------------------------RLMAGADKQQKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRAGYLARLKRMIKDKMPSCTIE
                                             + +   K  KH WT  E+  LVE L++LV EGGWR DNGTF+ GYL                 
Subjt:  ------------------------------------RLMAGADKQQKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRAGYLARLKRMIKDKMPSCTIE

Query:  STSVIDCKVRSLKRQYSAISEMLGPGCSGFGWNDEFKCIQAEREVYDAWVK-----------------DLAFVFDKDWASGGACNVPAKQAENTHGDDGG
                     +QY+AI+EM+GP CSGFGWN+  KCI+ E+ V+D WVK                 DL  VF +D A+GG C  P + +  T  D   
Subjt:  STSVIDCKVRSLKRQYSAISEMLGPGCSGFGWNDEFKCIQAEREVYDAWVK-----------------DLAFVFDKDWASGGACNVPAKQAENTHGDDGG

Query:  DQNVQAEQDCYVPAPPDINLAADMDFEDVPITPTSRPSTAGSSQSRKRSRASYEAEAL
        D      +D  +P P  +   +    ED+P TPTS    AGSS+  K+ R SY  + +
Subjt:  DQNVQAEQDCYVPAPPDINLAADMDFEDVPITPTSRPSTAGSSQSRKRSRASYEAEAL

A0A5D3DTL0 Myb_DNA-bind_3 domain-containing protein; e value: 4.3e-62; % identity: 44.82
Query:  LLRTTSGLVGTEIVDVEEMVAMFLHIIAHDVKNRVIRRQFARSGETVSRHFNATLSAVLRLYDVLLKKPEPITTSCQDGR-----LMAGADKQQKHIWTR
        +LRT  GL  T+ VDVEEMV +FLHI+AHDVKNRV RR FARSGETVSRHFN  L+ VLRL+++LLK+P+ +T SC   +     + +   K  KH WT 
Subjt:  LLRTTSGLVGTEIVDVEEMVAMFLHIIAHDVKNRVIRRQFARSGETVSRHFNATLSAVLRLYDVLLKKPEPITTSCQDGR-----LMAGADKQQKHIWTR

Query:  QEEARLVESLVELVHEGGWRGDNGTFRAGYLARLKRMIKDKMPSCTIESTSVIDCKVRSLKRQYSAISEMLGPGCSGFGWNDEFKCIQAEREVYDAWVK-
         E+  LVE L++LV +G WR DNGTF+ GYL ++++++K+K+    I+ T  ++  V+ LK+QY+ I+EM+GP CSGF WN E KCI+AE+ V + WVK 
Subjt:  QEEARLVESLVELVHEGGWRGDNGTFRAGYLARLKRMIKDKMPSCTIESTSVIDCKVRSLKRQYSAISEMLGPGCSGFGWNDEFKCIQAEREVYDAWVK-

Query:  ----------------DLAFVFDKDWASGGACNVPAKQAENTHGDDGGDQNVQAEQDCYVPAPPDINLAADMDFEDVPITPTSRPSTAGSSQSRKRSRA
                        DL  VF +D A+GG C  P +    T  D   D  +   +D  +P P  +   +    ED+P TPTS    AGSS+  K+ R+
Subjt:  ----------------DLAFVFDKDWASGGACNVPAKQAENTHGDDGGDQNVQAEQDCYVPAPPDINLAADMDFEDVPITPTSRPSTAGSSQSRKRSRA

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT5G28730.1 unknown protein; e value: 9.6e-06; % identity: 32.94
Query:  IHESDLCCRESTRMDRRCFSILCSLLRTTSGLVGTEIVDVEEMVAMFLHIIAHDVKNRVIRRQFARSGETVSRHFNATLSAVLRL
        I+ +++ C+   RM    F+ LC +L    GL  +  + ++E VA+FL I A +   R I  +F  + ET+ R F+  L A+ RL
Subjt:  IHESDLCCRESTRMDRRCFSILCSLLRTTSGLVGTEIVDVEEMVAMFLHIIAHDVKNRVIRRQFARSGETVSRHFNATLSAVLRL

AT5G41980.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912); e value: 9.0e-12; % identity: 38.64
Query:  FRLIHESDLCCRESTRMDRRCFSILCSLLRTTSGLVGTEIVDVEEMVAMFLHIIAHDVKNRVIRRQFARSGETVSRHFNATLSAVLRL
        +++++  +  C E+ RMD+  F  LC LL+T   L  T  + +E  +A+FL II H+++ R ++  F  SGET+SRHFN  L+AV+ +
Subjt:  FRLIHESDLCCRESTRMDRRCFSILCSLLRTTSGLVGTEIVDVEEMVAMFLHIIAHDVKNRVIRRQFARSGETVSRHFNATLSAVLRL


Sequences
CDS sequence
ATGGCTGACTCTCAGCGCCAACTATTCAACTTGATTAACTCCTTCATGAACAACCACCCTAGGATAGAAAACCAAACTCCATACCTCAGACACCAGATAAGGCAGTTAGC
CTGCTTCCGGTTGATTCATGAAAGTGACCTATGCTGTCGAGAAAGCACCAGGATGGATAGAAGATGTTTTTCCATTCTATGTAGTCTGTTGAGAACGACGTCCGGGTTGG
TAGGAACGGAAATCGTAGATGTGGAAGAGATGGTCGCGATGTTCTTGCACATCATTGCTCATGATGTTAAGAATCGAGTCATTAGAAGACAGTTTGCACGGTCGGGCGAA
ACCGTTTCTCGGCACTTCAACGCGACTTTGAGTGCCGTACTACGATTGTACGACGTTCTACTTAAGAAACCTGAACCGATCACGACTTCTTGCCAAGATGGGAGACTAAT
GGCAGGTGCAGATAAACAACAGAAGCACATCTGGACGAGGCAGGAGGAGGCAAGGTTGGTGGAATCCCTCGTGGAGCTCGTCCACGAAGGTGGATGGAGAGGGGACAACG
GGACCTTCAGGGCCGGATACCTAGCCCGACTGAAGCGGATGATAAAAGATAAAATGCCTTCCTGCACCATAGAGTCAACGTCCGTAATAGACTGCAAGGTGCGGTCCTTG
AAACGGCAATACAGTGCCATCTCGGAGATGCTGGGTCCAGGCTGCAGTGGATTTGGTTGGAATGATGAGTTTAAATGCATCCAGGCTGAGAGGGAGGTATATGATGCATG
GGTGAAGGATCTTGCTTTCGTGTTCGACAAAGACTGGGCGAGTGGCGGCGCGTGTAATGTTCCAGCGAAACAGGCAGAAAACACCCACGGGGACGACGGGGGTGATCAGA
ATGTCCAGGCGGAACAGGATTGTTACGTCCCCGCCCCTCCAGACATTAATCTGGCCGCGGACATGGACTTCGAGGACGTCCCCATCACACCGACAAGCCGACCAAGCACT
GCAGGGTCCTCCCAGAGTCGAAAGCGGAGCAGAGCTTCATATGAAGCTGAAGCCTTGATATTATGA
mRNA sequence
ATGGCTGACTCTCAGCGCCAACTATTCAACTTGATTAACTCCTTCATGAACAACCACCCTAGGATAGAAAACCAAACTCCATACCTCAGACACCAGATAAGGCAGTTAGC
CTGCTTCCGGTTGATTCATGAAAGTGACCTATGCTGTCGAGAAAGCACCAGGATGGATAGAAGATGTTTTTCCATTCTATGTAGTCTGTTGAGAACGACGTCCGGGTTGG
TAGGAACGGAAATCGTAGATGTGGAAGAGATGGTCGCGATGTTCTTGCACATCATTGCTCATGATGTTAAGAATCGAGTCATTAGAAGACAGTTTGCACGGTCGGGCGAA
ACCGTTTCTCGGCACTTCAACGCGACTTTGAGTGCCGTACTACGATTGTACGACGTTCTACTTAAGAAACCTGAACCGATCACGACTTCTTGCCAAGATGGGAGACTAAT
GGCAGGTGCAGATAAACAACAGAAGCACATCTGGACGAGGCAGGAGGAGGCAAGGTTGGTGGAATCCCTCGTGGAGCTCGTCCACGAAGGTGGATGGAGAGGGGACAACG
GGACCTTCAGGGCCGGATACCTAGCCCGACTGAAGCGGATGATAAAAGATAAAATGCCTTCCTGCACCATAGAGTCAACGTCCGTAATAGACTGCAAGGTGCGGTCCTTG
AAACGGCAATACAGTGCCATCTCGGAGATGCTGGGTCCAGGCTGCAGTGGATTTGGTTGGAATGATGAGTTTAAATGCATCCAGGCTGAGAGGGAGGTATATGATGCATG
GGTGAAGGATCTTGCTTTCGTGTTCGACAAAGACTGGGCGAGTGGCGGCGCGTGTAATGTTCCAGCGAAACAGGCAGAAAACACCCACGGGGACGACGGGGGTGATCAGA
ATGTCCAGGCGGAACAGGATTGTTACGTCCCCGCCCCTCCAGACATTAATCTGGCCGCGGACATGGACTTCGAGGACGTCCCCATCACACCGACAAGCCGACCAAGCACT
GCAGGGTCCTCCCAGAGTCGAAAGCGGAGCAGAGCTTCATATGAAGCTGAAGCCTTGATATTATGA
Protein sequence
MADSQRQLFNLINSFMNNHPRIENQTPYLRHQIRQLACFRLIHESDLCCRESTRMDRRCFSILCSLLRTTSGLVGTEIVDVEEMVAMFLHIIAHDVKNRVIRRQFARSGE
TVSRHFNATLSAVLRLYDVLLKKPEPITTSCQDGRLMAGADKQQKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRAGYLARLKRMIKDKMPSCTIESTSVIDCKVRSL
KRQYSAISEMLGPGCSGFGWNDEFKCIQAEREVYDAWVKDLAFVFDKDWASGGACNVPAKQAENTHGDDGGDQNVQAEQDCYVPAPPDINLAADMDFEDVPITPTSRPST
AGSSQSRKRSRASYEAEALIL
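
The CDS and protein sequences above can be cross-checked by translating the CDS. The Python sketch below assumes the standard genetic code (NCBI translation table 1), which this page does not state, and `translate` is an illustrative helper rather than anything provided by CuGenDB. To keep it short, it is applied only to the first 15 and the last 12 nucleotides of the CDS.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1),
# with codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a nucleotide sequence codon by codon; '*' marks a stop."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 15 nt of the CDS above: matches the protein's first residues, MADSQ.
print(translate("ATGGCTGACTCTCAG"))
# Last 12 nt of the CDS above: the final residues LIL followed by a stop.
print(translate("TTGATATTATGA"))
```

Running `translate` on the full CDS, with line breaks removed, should reproduce the protein sequence followed by a trailing `*`, provided the standard code is the right one for this annotation.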