; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg020129 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg020129
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRetrotransposon protein
Genome locationscaffold1:23813590..23814634
RNA-Seq ExpressionSpg020129
SyntenySpg020129
Gene Ontology termsNA
InterPro domainsIPR024752 - Myb/SANT-like domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044610.1 retrotransposon protein [Cucumis melo var. makuwa]3.7e-5340.92Show/hide
Query:  MAGADKHPKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRPGYLARLKRMIKEKLPTCTIDSTSIIDCKVRSLKRQYSAISEMLGPGCSGFGWNDEFKC
        M+ +++ P+H+WTR+EE  LVE L+ELV  GGW+ DNGTFRPGYLA+L RM+ EKLP C + +T++IDC++++LKR + AI+EM GP CSGFGWNDE KC
Subjt:  MAGADKHPKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRPGYLARLKRMIKEKLPTCTIDSTSIIDCKVRSLKRQYSAISEMLGPGCSGFGWNDEFKC

Query:  IQAEKEVYDAWVKSHSSAKGLLNKPFLHYEDLAFVFGKDRASGGACNVPAKKADSNHGD-----DEGD----------QNVQADQD--------------
        I AEKE++D WV+SH +AKGLLNKPF +Y++L +VFG+DRA+G      A    +  G      D GD          Q V   QD              
Subjt:  IQAEKEVYDAWVKSHSSAKGLLNKPFLHYEDLAFVFGKDRASGGACNVPAKKADSNHGD-----DEGD----------QNVQADQD--------------

Query:  WSSQSRKRNRASYEAKALDIMRQSVAMQETQFIKIADWSDAQDAREFKRRDTVAEMLLAQQELSDDERVALMRILFADLKMTNMMLSVPPSLRLRFLRGL
          S   KR R S     L+ +  ++     Q  +IA+W     A +   R     +L    EL+  +R  L R LF+ +      + +P   R  F R L
Subjt:  WSSQSRKRNRASYEAKALDIMRQSVAMQETQFIKIADWSDAQDAREFKRRDTVAEMLLAQQELSDDERVALMRILFADLKMTNMMLSVPPSLRLRFLRGL

Query:  LNE
        L +
Subjt:  LNE

KAA0065929.1 retrotransposon protein [Cucumis melo var. makuwa]2.8e-5341.94Show/hide
Query:  MAGADKHPKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRPGYLARLKRMIKEKLPTCTIDSTSIIDCKVRSLKRQYSAISEMLGPGCSGFGWNDEFKC
        M+ +++ P+H+WT++EE  LVE L+ELV  GGW+ DNGTFRPGYLA+L RM+ EKLP C + +T++IDC++++LKR + AI+EM GP CSGFGWNDE KC
Subjt:  MAGADKHPKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRPGYLARLKRMIKEKLPTCTIDSTSIIDCKVRSLKRQYSAISEMLGPGCSGFGWNDEFKC

Query:  IQAEKEVYDAWVKSHSSAKGLLNKPFLHYEDLAFVFGKDRASGGACNVPAKKADSNHGD-----DEGDQNVQADQDWSSQSRKRNRASYEAKALDIMRQS
        I AEKE++D WV+SH +AKGLLNKPF +Y++L +VFG+DRA+G      A    +  G      D GD N        S   KR R S     ++ +  +
Subjt:  IQAEKEVYDAWVKSHSSAKGLLNKPFLHYEDLAFVFGKDRASGGACNVPAKKADSNHGD-----DEGDQNVQADQDWSSQSRKRNRASYEAKALDIMRQS

Query:  VAMQETQFIKIADWSDAQDAREFKRRDTVAEMLLAQQELSDDERVALMRILFADLKMTNMMLSVPPSLRLRFLRGLLNE
        +     Q  +IA+W     A +   R     +L    EL+  +R  L   L + +      + +P   R  F R LL +
Subjt:  VAMQETQFIKIADWSDAQDAREFKRRDTVAEMLLAQQELSDDERVALMRILFADLKMTNMMLSVPPSLRLRFLRGLLNE

XP_008441954.1 PREDICTED: uncharacterized protein LOC103485953 [Cucumis melo]7.9e-5642.05Show/hide
Query:  MAGADKHPKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRPGYLARLKRMIKEKLPTCTIDSTSIIDCKVRSLKRQYSAISEMLGPGCSGFGWNDEFKC
        MA   + PKH WT++EE + VE LVELV  GGWR DNGTF+PGYLA+L+RM+ EKLP   I  +S IDC V+SLK+ Y AI+EM GP CSGFGWN+EF+C
Subjt:  MAGADKHPKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRPGYLARLKRMIKEKLPTCTIDSTSIIDCKVRSLKRQYSAISEMLGPGCSGFGWNDEFKC

Query:  IQAEKEVYDAWVKSHSSAKGLLNKPFLHYEDLAFVFGKDRASGG--------ACNVPAKKADSNHGDDEGDQNV---------------------QADQD
        I AE++++D+W+KSH +AKGLL+K F +Y+DL++VFGKDRA+G           NV     D+    D  D+++                     QA + 
Subjt:  IQAEKEVYDAWVKSHSSAKGLLNKPFLHYEDLAFVFGKDRASGG--------ACNVPAKKADSNHGDDEGDQNV---------------------QADQD

Query:  WSSQS-RKRNRASYEAKALDIMRQSVAMQETQFIKIADWSDAQDAREFKRRDTVAEMLLAQQELSDDERVALMRILFADLKMTNMMLSVPPSLRLRFLRG
         +  S  KR R S   + ++++R  +     Q   IADW   + A E + R  V + L    +L   +R  LM+ILF  L+     LS+P  L+L +   
Subjt:  WSSQS-RKRNRASYEAKALDIMRQSVAMQETQFIKIADWSDAQDAREFKRRDTVAEMLLAQQELSDDERVALMRILFADLKMTNMMLSVPPSLRLRFLRG

Query:  LL
        LL
Subjt:  LL

XP_038887234.1 uncharacterized protein LOC120077425 [Benincasa hispida]4.3e-5442.7Show/hide
Query:  MAGADKHPKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRPGYLARLKRMIKEKLPTCTIDSTSIIDCKVRSLKRQYSAISEMLGPGCSGFGWNDEFKC
        M G  K  KH+W++ E+ARLVE+L+ LV E GWR DNGTFRPGYL  L++++ EK+P C ++  + I+CKVRSLK+QY+A+SEML    SGF WN+EFKC
Subjt:  MAGADKHPKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRPGYLARLKRMIKEKLPTCTIDSTSIIDCKVRSLKRQYSAISEMLGPGCSGFGWNDEFKC

Query:  IQAEKEVYDAWVKSHSSAKGLLNKPFLHYEDLAFVFGKDRASGGACNVP-AKKADSNHGDDEGDQNVQADQDW--------SSQSRKRNRASYEAKALDI
        +Q E+E++D WV+SH +AKG+  KPF HY+DL+ VFGKDRA    C+ P  ++ +S    DE D+   A+Q          SS+  KR R+S++ + +DI
Subjt:  IQAEKEVYDAWVKSHSSAKGLLNKPFLHYEDLAFVFGKDRASGGACNVP-AKKADSNHGDDEGDQNVQADQDW--------SSQSRKRNRASYEAKALDI

Query:  MRQSVAMQETQFIKIADWSDAQDAREFKRRDTVAEMLLAQQELSDDERVALMRILFADLKMTNMMLSVPPSLRLRFLRGLL
        ++ +V MQ T   ++A W + +   E K    V   +    +L ++++V L+ ++  D++ T+  L+VP   R R+   LL
Subjt:  MRQSVAMQETQFIKIADWSDAQDAREFKRRDTVAEMLLAQQELSDDERVALMRILFADLKMTNMMLSVPPSLRLRFLRGLL

XP_038896380.1 uncharacterized protein LOC120084641 [Benincasa hispida]6.5e-5845.91Show/hide
Query:  MAGADKHPKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRPGYLARLKRMIKEKLPTCTIDSTSIIDCKVRSLKRQYSAISEMLGPGCSGFGWNDEFKC
        MAG+ K  KH+W++ E+ +LVE+L+ LV E GWR DNGTFR GYL  L+R++ EK+P C ++  + I+CKVRSLK+QY+A+SEML    SGFGWN+EFKC
Subjt:  MAGADKHPKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRPGYLARLKRMIKEKLPTCTIDSTSIIDCKVRSLKRQYSAISEMLGPGCSGFGWNDEFKC

Query:  IQAEKEVYDAWVKSHSSAKGLLNKPFLHYEDLAFVFGKDRASGGACNVP-AKKADSNHGDDEGDQNVQADQDW--------SSQSRKRNRASYEAKALDI
        +Q EKE++D WV+SH +AKG+ NK FLHY+DL+ VFGKDRA+   C+ P   +A+S    DE D+   A+Q          SS+  KR R S++A+ +DI
Subjt:  IQAEKEVYDAWVKSHSSAKGLLNKPFLHYEDLAFVFGKDRASGGACNVP-AKKADSNHGDDEGDQNVQADQDW--------SSQSRKRNRASYEAKALDI

Query:  MRQSVAMQETQFIKIADWSDAQDAREFKRRDTVAEMLLAQQELSDDERVALMRILFADLKMTNMMLSVPPSLRLRFLRGLL
        MR +V MQ T   ++A W   +   EF RR  V   + +   L +D++V  + +L  D++ T+  L+VP   R R+   LL
Subjt:  MRQSVAMQETQFIKIADWSDAQDAREFKRRDTVAEMLLAQQELSDDERVALMRILFADLKMTNMMLSVPPSLRLRFLRGLL

TrEMBL top hitse value%identityAlignment
A0A1S3B4L3 uncharacterized protein LOC1034859533.8e-5642.05Show/hide
Query:  MAGADKHPKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRPGYLARLKRMIKEKLPTCTIDSTSIIDCKVRSLKRQYSAISEMLGPGCSGFGWNDEFKC
        MA   + PKH WT++EE + VE LVELV  GGWR DNGTF+PGYLA+L+RM+ EKLP   I  +S IDC V+SLK+ Y AI+EM GP CSGFGWN+EF+C
Subjt:  MAGADKHPKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRPGYLARLKRMIKEKLPTCTIDSTSIIDCKVRSLKRQYSAISEMLGPGCSGFGWNDEFKC

Query:  IQAEKEVYDAWVKSHSSAKGLLNKPFLHYEDLAFVFGKDRASGG--------ACNVPAKKADSNHGDDEGDQNV---------------------QADQD
        I AE++++D+W+KSH +AKGLL+K F +Y+DL++VFGKDRA+G           NV     D+    D  D+++                     QA + 
Subjt:  IQAEKEVYDAWVKSHSSAKGLLNKPFLHYEDLAFVFGKDRASGG--------ACNVPAKKADSNHGDDEGDQNV---------------------QADQD

Query:  WSSQS-RKRNRASYEAKALDIMRQSVAMQETQFIKIADWSDAQDAREFKRRDTVAEMLLAQQELSDDERVALMRILFADLKMTNMMLSVPPSLRLRFLRG
         +  S  KR R S   + ++++R  +     Q   IADW   + A E + R  V + L    +L   +R  LM+ILF  L+     LS+P  L+L +   
Subjt:  WSSQS-RKRNRASYEAKALDIMRQSVAMQETQFIKIADWSDAQDAREFKRRDTVAEMLLAQQELSDDERVALMRILFADLKMTNMMLSVPPSLRLRFLRG

Query:  LL
        LL
Subjt:  LL

A0A5A7TN44 Retrotransposon protein1.8e-5340.92Show/hide
Query:  MAGADKHPKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRPGYLARLKRMIKEKLPTCTIDSTSIIDCKVRSLKRQYSAISEMLGPGCSGFGWNDEFKC
        M+ +++ P+H+WTR+EE  LVE L+ELV  GGW+ DNGTFRPGYLA+L RM+ EKLP C + +T++IDC++++LKR + AI+EM GP CSGFGWNDE KC
Subjt:  MAGADKHPKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRPGYLARLKRMIKEKLPTCTIDSTSIIDCKVRSLKRQYSAISEMLGPGCSGFGWNDEFKC

Query:  IQAEKEVYDAWVKSHSSAKGLLNKPFLHYEDLAFVFGKDRASGGACNVPAKKADSNHGD-----DEGD----------QNVQADQD--------------
        I AEKE++D WV+SH +AKGLLNKPF +Y++L +VFG+DRA+G      A    +  G      D GD          Q V   QD              
Subjt:  IQAEKEVYDAWVKSHSSAKGLLNKPFLHYEDLAFVFGKDRASGGACNVPAKKADSNHGD-----DEGD----------QNVQADQD--------------

Query:  WSSQSRKRNRASYEAKALDIMRQSVAMQETQFIKIADWSDAQDAREFKRRDTVAEMLLAQQELSDDERVALMRILFADLKMTNMMLSVPPSLRLRFLRGL
          S   KR R S     L+ +  ++     Q  +IA+W     A +   R     +L    EL+  +R  L R LF+ +      + +P   R  F R L
Subjt:  WSSQSRKRNRASYEAKALDIMRQSVAMQETQFIKIADWSDAQDAREFKRRDTVAEMLLAQQELSDDERVALMRILFADLKMTNMMLSVPPSLRLRFLRGL

Query:  LNE
        L +
Subjt:  LNE

A0A5A7TRV1 Retrotransposon protein8.8e-5340.59Show/hide
Query:  MAGADKHPKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRPGYLARLKRMIKEKLPTCTIDSTSIIDCKVRSLKRQYSAISEMLGPGCSGFGWNDEFKC
        M+ +++ P+H+WTR+EE  LVE L+ELV  GGW+ DNGTFRPGYLA+L RM+ EKLP C + +T++IDC++++LKR + AI+EM GP CSGFGWNDE KC
Subjt:  MAGADKHPKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRPGYLARLKRMIKEKLPTCTIDSTSIIDCKVRSLKRQYSAISEMLGPGCSGFGWNDEFKC

Query:  IQAEKEVYDAWVKSHSSAKGLLNKPFLHYEDLAFVFGKDRASGGACNVPAKKADSNHGD-----DEGD----------QNVQADQD--------------
        I AEKE++D WV+SH +AKGLLNKPF +Y++L +VFG+DRA+G      A    +  G      D GD          Q V   QD              
Subjt:  IQAEKEVYDAWVKSHSSAKGLLNKPFLHYEDLAFVFGKDRASGGACNVPAKKADSNHGD-----DEGD----------QNVQADQD--------------

Query:  WSSQSRKRNRASYEAKALDIMRQSVAMQETQFIKIADWSDAQDAREFKRRDTVAEMLLAQQELSDDERVALMRILFADLKMTNMMLSVPPSLRLRFLRGL
          S   KR R S     L+ +  ++     Q  +IA+W     A +   R     +L    EL+  +R  L R L + +      + +P   R  F R L
Subjt:  WSSQSRKRNRASYEAKALDIMRQSVAMQETQFIKIADWSDAQDAREFKRRDTVAEMLLAQQELSDDERVALMRILFADLKMTNMMLSVPPSLRLRFLRGL

Query:  LNE
        L +
Subjt:  LNE

A0A5A7U0H7 Retrotransposon protein3.8e-5642.05Show/hide
Query:  MAGADKHPKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRPGYLARLKRMIKEKLPTCTIDSTSIIDCKVRSLKRQYSAISEMLGPGCSGFGWNDEFKC
        MA   + PKH WT++EE + VE LVELV  GGWR DNGTF+PGYLA+L+RM+ EKLP   I  +S IDC V+SLK+ Y AI+EM GP CSGFGWN+EF+C
Subjt:  MAGADKHPKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRPGYLARLKRMIKEKLPTCTIDSTSIIDCKVRSLKRQYSAISEMLGPGCSGFGWNDEFKC

Query:  IQAEKEVYDAWVKSHSSAKGLLNKPFLHYEDLAFVFGKDRASGG--------ACNVPAKKADSNHGDDEGDQNV---------------------QADQD
        I AE++++D+W+KSH +AKGLL+K F +Y+DL++VFGKDRA+G           NV     D+    D  D+++                     QA + 
Subjt:  IQAEKEVYDAWVKSHSSAKGLLNKPFLHYEDLAFVFGKDRASGG--------ACNVPAKKADSNHGDDEGDQNV---------------------QADQD

Query:  WSSQS-RKRNRASYEAKALDIMRQSVAMQETQFIKIADWSDAQDAREFKRRDTVAEMLLAQQELSDDERVALMRILFADLKMTNMMLSVPPSLRLRFLRG
         +  S  KR R S   + ++++R  +     Q   IADW   + A E + R  V + L    +L   +R  LM+ILF  L+     LS+P  L+L +   
Subjt:  WSSQS-RKRNRASYEAKALDIMRQSVAMQETQFIKIADWSDAQDAREFKRRDTVAEMLLAQQELSDDERVALMRILFADLKMTNMMLSVPPSLRLRFLRG

Query:  LL
        LL
Subjt:  LL

A0A5A7VKT2 Retrotransposon protein1.4e-5341.94Show/hide
Query:  MAGADKHPKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRPGYLARLKRMIKEKLPTCTIDSTSIIDCKVRSLKRQYSAISEMLGPGCSGFGWNDEFKC
        M+ +++ P+H+WT++EE  LVE L+ELV  GGW+ DNGTFRPGYLA+L RM+ EKLP C + +T++IDC++++LKR + AI+EM GP CSGFGWNDE KC
Subjt:  MAGADKHPKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRPGYLARLKRMIKEKLPTCTIDSTSIIDCKVRSLKRQYSAISEMLGPGCSGFGWNDEFKC

Query:  IQAEKEVYDAWVKSHSSAKGLLNKPFLHYEDLAFVFGKDRASGGACNVPAKKADSNHGD-----DEGDQNVQADQDWSSQSRKRNRASYEAKALDIMRQS
        I AEKE++D WV+SH +AKGLLNKPF +Y++L +VFG+DRA+G      A    +  G      D GD N        S   KR R S     ++ +  +
Subjt:  IQAEKEVYDAWVKSHSSAKGLLNKPFLHYEDLAFVFGKDRASGGACNVPAKKADSNHGD-----DEGDQNVQADQDWSSQSRKRNRASYEAKALDIMRQS

Query:  VAMQETQFIKIADWSDAQDAREFKRRDTVAEMLLAQQELSDDERVALMRILFADLKMTNMMLSVPPSLRLRFLRGLLNE
        +     Q  +IA+W     A +   R     +L    EL+  +R  L   L + +      + +P   R  F R LL +
Subjt:  VAMQETQFIKIADWSDAQDAREFKRRDTVAEMLLAQQELSDDERVALMRILFADLKMTNMMLSVPPSLRLRFLRGLLNE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G30140.1 unknown protein7.7e-0928.67Show/hide
Query:  GADKHPKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRPGYLARLKRMIK--EKLPTCTIDSTSIIDCKVRSLKRQYSAISEMLGPGCSGFGWNDEFKC
        G +K P + WT  E     + L+EL+ +  WR  +G    G L    +++    K   C  +  + +  +++ LK  Y +  + L    SGFGW+ E K 
Subjt:  GADKHPKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRPGYLARLKRMIK--EKLPTCTIDSTSIIDCKVRSLKRQYSAISEMLGPGCSGFGWNDEFKC

Query:  IQAEKEVYDAWVKSHSSAKGLLNKPFLHYEDLAFVFGKDRASG
          A  EV+  ++K+H + K +  +   H+EDL  +FG   A+G
Subjt:  IQAEKEVYDAWVKSHSSAKGLLNKPFLHYEDLAFVFGKDRASG

AT4G02210.1 unknown protein4.7e-0621.63Show/hide
Query:  VELVHEGGWRGD--NGTFRPGYLARLKRMIKEKLPTCTIDSTSIIDCKVRSLKRQYSAISEMLGPGCSGFGWNDEFKCIQAEKEVYDAWVKSHSSAKGLL
        ++L+ +   RG+   G FR      +  +   K  +       ++  + +SL+RQ++AI  +L     GF W++E + + A+  V+  ++K+H  A+  +
Subjt:  VELVHEGGWRGD--NGTFRPGYLARLKRMIKEKLPTCTIDSTSIIDCKVRSLKRQYSAISEMLGPGCSGFGWNDEFKCIQAEKEVYDAWVKSHSSAKGLL

Query:  NKPFLHYEDLAFVFGKDRASGGACNVPAKKADSNHGDDEGDQNVQADQDWSSQSRKRNRASYEAK----------ALDIMRQSVAMQETQFIKIADWSDA
         +P  +Y+DL  + G        C V     D      E   +   D   S++    N   ++ K             I  +   + ETQ + I D  +A
Subjt:  NKPFLHYEDLAFVFGKDRASGGACNVPAKKADSNHGDDEGDQNVQADQDWSSQSRKRNRASYEAK----------ALDIMRQSVAMQETQFIKIADWSDA

Query:  QDAREFKRRDTVAEMLLAQQELSDDERVALMRILFADLKMTNMML
          A      D   E++L   +L +D ++     L  D+K+    L
Subjt:  QDAREFKRRDTVAEMLLAQQELSDDERVALMRILFADLKMTNMML

AT4G02210.2 unknown protein4.7e-0621.63Show/hide
Query:  VELVHEGGWRGD--NGTFRPGYLARLKRMIKEKLPTCTIDSTSIIDCKVRSLKRQYSAISEMLGPGCSGFGWNDEFKCIQAEKEVYDAWVKSHSSAKGLL
        ++L+ +   RG+   G FR      +  +   K  +       ++  + +SL+RQ++AI  +L     GF W++E + + A+  V+  ++K+H  A+  +
Subjt:  VELVHEGGWRGD--NGTFRPGYLARLKRMIKEKLPTCTIDSTSIIDCKVRSLKRQYSAISEMLGPGCSGFGWNDEFKCIQAEKEVYDAWVKSHSSAKGLL

Query:  NKPFLHYEDLAFVFGKDRASGGACNVPAKKADSNHGDDEGDQNVQADQDWSSQSRKRNRASYEAK----------ALDIMRQSVAMQETQFIKIADWSDA
         +P  +Y+DL  + G        C V     D      E   +   D   S++    N   ++ K             I  +   + ETQ + I D  +A
Subjt:  NKPFLHYEDLAFVFGKDRASGGACNVPAKKADSNHGDDEGDQNVQADQDWSSQSRKRNRASYEAK----------ALDIMRQSVAMQETQFIKIADWSDA

Query:  QDAREFKRRDTVAEMLLAQQELSDDERVALMRILFADLKMTNMML
          A      D   E++L   +L +D ++     L  D+K+    L
Subjt:  QDAREFKRRDTVAEMLLAQQELSDDERVALMRILFADLKMTNMML


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
TACATACTTACGAAATCTAATGACACAATACTCTATCCATGTACACTAATGGCAGGTGCAGATAAACACCCGAAACACATCTGGACAAGGCAGGAGGAGGCAAGGTTGGT
CGAATCCCTCGTGGAGCTCGTCCACGAAGGTGGATGGAGAGGGGACAACGGGACCTTCAGGCCCGGATACCTCGCCCGATTGAAGCGGATGATAAAAGAGAAATTGCCGA
CATGCACCATAGATTCAACGTCCATAATAGACTGCAAGGTGCGGTCCTTGAAACGACAATACAGTGCCATCTCGGAGATGCTAGGTCCGGGCTGCAGTGGATTCGGTTGG
AATGACGAGTTTAAATGCATCCAGGCTGAGAAGGAGGTCTACGATGCATGGGTGAAGTCACACTCCAGTGCAAAGGGACTGCTGAACAAGCCATTTCTTCACTACGAGGA
TCTTGCTTTCGTGTTCGGCAAAGACAGGGCGAGTGGCGGCGCATGCAATGTTCCAGCGAAAAAGGCCGACAGTAACCACGGGGACGACGAGGGTGATCAGAATGTCCAGG
CGGACCAGGATTGGTCCTCCCAGAGTCGGAAGCGGAACAGAGCATCATACGAAGCGAAAGCCCTTGATATTATGAGGCAGTCAGTGGCTATGCAGGAGACACAGTTCATT
AAGATCGCTGACTGGTCGGACGCCCAAGACGCACGAGAGTTCAAGAGGCGAGACACGGTCGCAGAGATGCTCTTGGCGCAGCAGGAGCTATCGGACGATGAGAGAGTTGC
TCTTATGCGCATCCTATTCGCCGACCTGAAGATGACAAATATGATGCTGTCTGTGCCACCCAGCCTCAGGCTTCGCTTTCTACGAGGACTACTCAACGAACGCCGGTGA
mRNA sequenceShow/hide mRNA sequence
TACATACTTACGAAATCTAATGACACAATACTCTATCCATGTACACTAATGGCAGGTGCAGATAAACACCCGAAACACATCTGGACAAGGCAGGAGGAGGCAAGGTTGGT
CGAATCCCTCGTGGAGCTCGTCCACGAAGGTGGATGGAGAGGGGACAACGGGACCTTCAGGCCCGGATACCTCGCCCGATTGAAGCGGATGATAAAAGAGAAATTGCCGA
CATGCACCATAGATTCAACGTCCATAATAGACTGCAAGGTGCGGTCCTTGAAACGACAATACAGTGCCATCTCGGAGATGCTAGGTCCGGGCTGCAGTGGATTCGGTTGG
AATGACGAGTTTAAATGCATCCAGGCTGAGAAGGAGGTCTACGATGCATGGGTGAAGTCACACTCCAGTGCAAAGGGACTGCTGAACAAGCCATTTCTTCACTACGAGGA
TCTTGCTTTCGTGTTCGGCAAAGACAGGGCGAGTGGCGGCGCATGCAATGTTCCAGCGAAAAAGGCCGACAGTAACCACGGGGACGACGAGGGTGATCAGAATGTCCAGG
CGGACCAGGATTGGTCCTCCCAGAGTCGGAAGCGGAACAGAGCATCATACGAAGCGAAAGCCCTTGATATTATGAGGCAGTCAGTGGCTATGCAGGAGACACAGTTCATT
AAGATCGCTGACTGGTCGGACGCCCAAGACGCACGAGAGTTCAAGAGGCGAGACACGGTCGCAGAGATGCTCTTGGCGCAGCAGGAGCTATCGGACGATGAGAGAGTTGC
TCTTATGCGCATCCTATTCGCCGACCTGAAGATGACAAATATGATGCTGTCTGTGCCACCCAGCCTCAGGCTTCGCTTTCTACGAGGACTACTCAACGAACGCCGGTGA
Protein sequenceShow/hide protein sequence
YILTKSNDTILYPCTLMAGADKHPKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRPGYLARLKRMIKEKLPTCTIDSTSIIDCKVRSLKRQYSAISEMLGPGCSGFGW
NDEFKCIQAEKEVYDAWVKSHSSAKGLLNKPFLHYEDLAFVFGKDRASGGACNVPAKKADSNHGDDEGDQNVQADQDWSSQSRKRNRASYEAKALDIMRQSVAMQETQFI
KIADWSDAQDAREFKRRDTVAEMLLAQQELSDDERVALMRILFADLKMTNMMLSVPPSLRLRFLRGLLNERR