CuGenDBv2

Lsi03G022210 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi03G022210
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: VARLMGL domain-containing protein
Genome location: chr03:33239215..33240442
RNA-Seq Expression: Lsi03G022210
Synteny: Lsi03G022210
Gene Ontology terms: NA
InterPro domains: IPR032795 - DUF3741-associated sequence motif


Homology
GenBank top hits (e value, % identity, alignment)
XP_004137714.1 uncharacterized protein LOC101211240 [Cucumis sativus] (e value: 2.3e-99; % identity: 59.5)
Query:  MPNSGDHHSGCFSGMLRRLLCTGNLPTHPSEALNESRFNIPKAEAKLVARAAESGPGVVARLMGLSSLPDANWVPNHRARPGAVSRSKSVNFVDYLLDFD
        MPNS   +S CFSG+LRRLLCTGNLPTHPSEALN+S+F+IPK EAKLVA++AES PGVVARLMGLSSLPDANWVPNHRARPGAVSRSKSVNF DYLLDFD
Subjt:  MPNSGDHHSGCFSGMLRRLLCTGNLPTHPSEALNESRFNIPKAEAKLVARAAESGPGVVARLMGLSSLPDANWVPNHRARPGAVSRSKSVNFVDYLLDFD

Query:  SNQAHHRRIRTSASFREVPPLNPHSDFFVVCTKDYFDGYGIESNLKKPETHRFDERKQGKEQSMNSNDLKKKKKKKKENARNEIKISKLKDEPRR-----
        SNQ+HHRRIRTSASFREV    PH+D+FV+ TKDYFDGYGIESN KKPET RFDE KQ      +SNDL KKKKKKKEN RNE+KISKLKDEPRR     
Subjt:  SNQAHHRRIRTSASFREVPPLNPHSDFFVVCTKDYFDGYGIESNLKKPETHRFDERKQGKEQSMNSNDLKKKKKKKKENARNEIKISKLKDEPRR-----

Query:  --------------------------------------------------------------------------------------------------NR
                                                                                                          NR
Subjt:  --------------------------------------------------------------------------------------------------NR

Query:  DYDYGELVERICRLAEEDIREAKWTGKIKNVDESEALEELCMEIERHVVDVLLVHTLDELVYL
         YDYGELVERICRLAEEDIREAKWT +IKNVDESEALEE+CMEIERHVVD LLVHTL+E  YL
Subjt:  DYDYGELVERICRLAEEDIREAKWTGKIKNVDESEALEELCMEIERHVVDVLLVHTLDELVYL

XP_008442409.1 PREDICTED: uncharacterized protein LOC103486287 [Cucumis melo] (e value: 3.3e-98; % identity: 59.23)
Query:  MPNSGDHHSGCFSGMLRRLLCTGNLPTHPSEALNESRFNIPKAEAKLVARAAESGPGVVARLMGLSSLPDANWVPNHRARPGAVSRSKSVNFVDYLLDFD
        MPNS   +SGCFSG+LRRLLCTGNLPTHPSEAL+ES+F+IPK EAKL A++AES PGVVARLMGLSSLPDANWVPNH+ RPGAVSRSKSVNF DYLLDFD
Subjt:  MPNSGDHHSGCFSGMLRRLLCTGNLPTHPSEALNESRFNIPKAEAKLVARAAESGPGVVARLMGLSSLPDANWVPNHRARPGAVSRSKSVNFVDYLLDFD

Query:  SNQAHHRRIRTSASFREVPPLNPHSDFFVVCTKDYFDGYGIESNLKKPETHRFDERKQGKEQSMNSNDLKKKKKKKKENARNEIKISKLKDEPRR-----
        SNQ+HHRRIRTSASFREVPPLNPH+DFFV+ TKD F+GYGIESNLKKPET RFDE KQ      +SNDL KKKKKKKENARNE+KISKLKDEPRR     
Subjt:  SNQAHHRRIRTSASFREVPPLNPHSDFFVVCTKDYFDGYGIESNLKKPETHRFDERKQGKEQSMNSNDLKKKKKKKKENARNEIKISKLKDEPRR-----

Query:  --------------------------------------------------------------------------------------------------NR
                                                                                                          NR
Subjt:  --------------------------------------------------------------------------------------------------NR

Query:  DYDYGELVERICRLAEEDIREAKWTGKIKNVDESEALEELCMEIERHVVDVLLVHTLDELVYL
         YDYGELVERICRLAEEDI EAKWT +IKN D+SEALEE+CMEIERHVVD LLVHTL+E   L
Subjt:  DYDYGELVERICRLAEEDIREAKWTGKIKNVDESEALEELCMEIERHVVDVLLVHTLDELVYL

XP_022145653.1 uncharacterized protein LOC111015050 [Momordica charantia] (e value: 2.6e-74; % identity: 50.67)
Query:  MPNSGDHHSGCFSGMLRRLLCTGNLPTHPSEALNESRFNIPKAEAKLVARAAESGPGVVARLMGLSSLPDANWVPNHRARPGAVSRSKSVNFVDYLLDFD
        MPNSG  + GCFSGMLRRLLCTGNLPTHPS+ALNE     P+ E    A  A+ GPGVVARLMGLSSLPDANWVPNHRA PGAV RSKSVNF DYLL FD
Subjt:  MPNSGDHHSGCFSGMLRRLLCTGNLPTHPSEALNESRFNIPKAEAKLVARAAESGPGVVARLMGLSSLPDANWVPNHRARPGAVSRSKSVNFVDYLLDFD

Query:  SNQAHHRRIRTSASFREVPPLNPHSDFFVVCTKDYFDGY--GIESNLKKPETHRFDERKQGKEQSMNSNDLKKKKKKKKENARNEIKISKLKDEPRR---
        S+QAHHRR+RTSASFREVPPLNPH+DF V+ TKDYFDGY   IESNLK+PETHR  E KQGKE  + S+++KKKKK    N+RNE KISKLK+EPRR   
Subjt:  SNQAHHRRIRTSASFREVPPLNPHSDFFVVCTKDYFDGY--GIESNLKKPETHRFDERKQGKEQSMNSNDLKKKKKKKKENARNEIKISKLKDEPRR---

Query:  ------------NRD-----------------------------------------------------------------------------YDYG----
                    N+D                                                                             +D+     
Subjt:  ------------NRD-----------------------------------------------------------------------------YDYG----

Query:  -------------------ELVERICRLAEEDIREAKWTGKIKNVDESEALEELCMEIERHVVDVLLVHTLDELV
                           E V R CRLA+EDIREA W+  +KNV E EALEELCME+ERHVV+VLLV TLD+LV
Subjt:  -------------------ELVERICRLAEEDIREAKWTGKIKNVDESEALEELCMEIERHVVDVLLVHTLDELV

XP_022971608.1 uncharacterized protein LOC111470283 [Cucurbita maxima] (e value: 2.8e-68; % identity: 49.07)
Query:  MPNSGDHHSGCFSGMLRRLLCTGNLPTHPSEALNESRFNIPKAEAKLVARAAESGPGVVARLMGLSSLPDANWVPNHRARPGAVSRSKSVNFVDYLLDFD
        MPN     S CF  +LRRLLC+GNLPTHPSEALNE     P ++ KLV  A+E GPGVVARLMGLSSLPDANWVP +RA PGAVSRSKSVNF DYLL FD
Subjt:  MPNSGDHHSGCFSGMLRRLLCTGNLPTHPSEALNESRFNIPKAEAKLVARAAESGPGVVARLMGLSSLPDANWVPNHRARPGAVSRSKSVNFVDYLLDFD

Query:  SNQAHHRRIRTSASFREVPPLNPHSDFFVVCTKDYFDGYG--IESNLKKPETHRFDERKQGKEQSMNSNDLKKKKKKKKENARNEIKISKLKDEPRRNRD
        SNQAHHRRIRTSASFREVPPLNP ++FFV+ TKDYFDGY   IES+LKK ET RF E KQGKEQS  SND+K KKKK      NE KISKLKDEPRR   
Subjt:  SNQAHHRRIRTSASFREVPPLNPHSDFFVVCTKDYFDGYG--IESNLKKPETHRFDERKQGKEQSMNSNDLKKKKKKKKENARNEIKISKLKDEPRRNRD

Query:  YDY-------------------------------------------------------------------------------------------------
         ++                                                                                                 
Subjt:  YDY-------------------------------------------------------------------------------------------------

Query:  ----------------GELVERICRLAEEDIREAKWTGKIKNVDESEALEELCMEIERHVVDVLLVHTLDELVYL
                         +LVERICRLAEEDI+EA W  +IK VD    +EELCME+ERHVV+VLL  +L+ELV L
Subjt:  ----------------GELVERICRLAEEDIREAKWTGKIKNVDESEALEELCMEIERHVVDVLLVHTLDELVYL

XP_038905307.1 uncharacterized protein LOC120091377 [Benincasa hispida] (e value: 1.5e-111; % identity: 64.46)
Query:  MPNSGDHHSGCFSGMLRRLLCTGNLPTHPSEALNESRFNIPKAEAKLVARAAESGPGVVARLMGLSSLPDANWVPNHRARPGAVSRSKSVNFVDYLLDFD
        MPNSGD + GCFSGMLRRLLCTGNLPTHPSEALNES+F+IPKAEAKLVARAAESGPGVVARLMGLSSLPDANWVPNHRARPGAVSRSKSVNF DYLLDFD
Subjt:  MPNSGDHHSGCFSGMLRRLLCTGNLPTHPSEALNESRFNIPKAEAKLVARAAESGPGVVARLMGLSSLPDANWVPNHRARPGAVSRSKSVNFVDYLLDFD

Query:  SNQAHHRRIRTSASFREVPPLNPHSDFFVVCTKDYFDGYGIESNLKKPETHRFDERKQGKEQSMNSNDLKKKKKKKKENARNEIKISKLKDEPRR-----
        SNQAHHRRIRTSASFREVPPLNPH+ FFV+  KDYFD YGI+SNLKKPET R +E KQ  EQS  SNDLKK KKKKKENARN+IKISKLKDEPRR     
Subjt:  SNQAHHRRIRTSASFREVPPLNPHSDFFVVCTKDYFDGYGIESNLKKPETHRFDERKQGKEQSMNSNDLKKKKKKKKENARNEIKISKLKDEPRR-----

Query:  --------------------------------------------------------------------------------------------------NR
                                                                                                          NR
Subjt:  --------------------------------------------------------------------------------------------------NR

Query:  DYDYGELVERICRLAEEDIREAKWTGKIKNVDESEALEELCMEIERHVVDVLLVHTLDELVYL
        DYDYGELVERICRLAE+DIR+AKWT KIKNVDESEALEELCMEIERHVVDVLLVHTLDELVYL
Subjt:  DYDYGELVERICRLAEEDIREAKWTGKIKNVDESEALEELCMEIERHVVDVLLVHTLDELVYL

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L9V4 VARLMGL domain-containing protein (e value: 1.1e-99; % identity: 59.5)
Query:  MPNSGDHHSGCFSGMLRRLLCTGNLPTHPSEALNESRFNIPKAEAKLVARAAESGPGVVARLMGLSSLPDANWVPNHRARPGAVSRSKSVNFVDYLLDFD
        MPNS   +S CFSG+LRRLLCTGNLPTHPSEALN+S+F+IPK EAKLVA++AES PGVVARLMGLSSLPDANWVPNHRARPGAVSRSKSVNF DYLLDFD
Subjt:  MPNSGDHHSGCFSGMLRRLLCTGNLPTHPSEALNESRFNIPKAEAKLVARAAESGPGVVARLMGLSSLPDANWVPNHRARPGAVSRSKSVNFVDYLLDFD

Query:  SNQAHHRRIRTSASFREVPPLNPHSDFFVVCTKDYFDGYGIESNLKKPETHRFDERKQGKEQSMNSNDLKKKKKKKKENARNEIKISKLKDEPRR-----
        SNQ+HHRRIRTSASFREV    PH+D+FV+ TKDYFDGYGIESN KKPET RFDE KQ      +SNDL KKKKKKKEN RNE+KISKLKDEPRR     
Subjt:  SNQAHHRRIRTSASFREVPPLNPHSDFFVVCTKDYFDGYGIESNLKKPETHRFDERKQGKEQSMNSNDLKKKKKKKKENARNEIKISKLKDEPRR-----

Query:  --------------------------------------------------------------------------------------------------NR
                                                                                                          NR
Subjt:  --------------------------------------------------------------------------------------------------NR

Query:  DYDYGELVERICRLAEEDIREAKWTGKIKNVDESEALEELCMEIERHVVDVLLVHTLDELVYL
         YDYGELVERICRLAEEDIREAKWT +IKNVDESEALEE+CMEIERHVVD LLVHTL+E  YL
Subjt:  DYDYGELVERICRLAEEDIREAKWTGKIKNVDESEALEELCMEIERHVVDVLLVHTLDELVYL

A0A1S3B6D9 uncharacterized protein LOC103486287 (e value: 1.6e-98; % identity: 59.23)
Query:  MPNSGDHHSGCFSGMLRRLLCTGNLPTHPSEALNESRFNIPKAEAKLVARAAESGPGVVARLMGLSSLPDANWVPNHRARPGAVSRSKSVNFVDYLLDFD
        MPNS   +SGCFSG+LRRLLCTGNLPTHPSEAL+ES+F+IPK EAKL A++AES PGVVARLMGLSSLPDANWVPNH+ RPGAVSRSKSVNF DYLLDFD
Subjt:  MPNSGDHHSGCFSGMLRRLLCTGNLPTHPSEALNESRFNIPKAEAKLVARAAESGPGVVARLMGLSSLPDANWVPNHRARPGAVSRSKSVNFVDYLLDFD

Query:  SNQAHHRRIRTSASFREVPPLNPHSDFFVVCTKDYFDGYGIESNLKKPETHRFDERKQGKEQSMNSNDLKKKKKKKKENARNEIKISKLKDEPRR-----
        SNQ+HHRRIRTSASFREVPPLNPH+DFFV+ TKD F+GYGIESNLKKPET RFDE KQ      +SNDL KKKKKKKENARNE+KISKLKDEPRR     
Subjt:  SNQAHHRRIRTSASFREVPPLNPHSDFFVVCTKDYFDGYGIESNLKKPETHRFDERKQGKEQSMNSNDLKKKKKKKKENARNEIKISKLKDEPRR-----

Query:  --------------------------------------------------------------------------------------------------NR
                                                                                                          NR
Subjt:  --------------------------------------------------------------------------------------------------NR

Query:  DYDYGELVERICRLAEEDIREAKWTGKIKNVDESEALEELCMEIERHVVDVLLVHTLDELVYL
         YDYGELVERICRLAEEDI EAKWT +IKN D+SEALEE+CMEIERHVVD LLVHTL+E   L
Subjt:  DYDYGELVERICRLAEEDIREAKWTGKIKNVDESEALEELCMEIERHVVDVLLVHTLDELVYL

A0A5D3DMU1 VARLMGL domain-containing protein (e value: 1.6e-98; % identity: 59.23)
Query:  MPNSGDHHSGCFSGMLRRLLCTGNLPTHPSEALNESRFNIPKAEAKLVARAAESGPGVVARLMGLSSLPDANWVPNHRARPGAVSRSKSVNFVDYLLDFD
        MPNS   +SGCFSG+LRRLLCTGNLPTHPSEAL+ES+F+IPK EAKL A++AES PGVVARLMGLSSLPDANWVPNH+ RPGAVSRSKSVNF DYLLDFD
Subjt:  MPNSGDHHSGCFSGMLRRLLCTGNLPTHPSEALNESRFNIPKAEAKLVARAAESGPGVVARLMGLSSLPDANWVPNHRARPGAVSRSKSVNFVDYLLDFD

Query:  SNQAHHRRIRTSASFREVPPLNPHSDFFVVCTKDYFDGYGIESNLKKPETHRFDERKQGKEQSMNSNDLKKKKKKKKENARNEIKISKLKDEPRR-----
        SNQ+HHRRIRTSASFREVPPLNPH+DFFV+ TKD F+GYGIESNLKKPET RFDE KQ      +SNDL KKKKKKKENARNE+KISKLKDEPRR     
Subjt:  SNQAHHRRIRTSASFREVPPLNPHSDFFVVCTKDYFDGYGIESNLKKPETHRFDERKQGKEQSMNSNDLKKKKKKKKENARNEIKISKLKDEPRR-----

Query:  --------------------------------------------------------------------------------------------------NR
                                                                                                          NR
Subjt:  --------------------------------------------------------------------------------------------------NR

Query:  DYDYGELVERICRLAEEDIREAKWTGKIKNVDESEALEELCMEIERHVVDVLLVHTLDELVYL
         YDYGELVERICRLAEEDI EAKWT +IKN D+SEALEE+CMEIERHVVD LLVHTL+E   L
Subjt:  DYDYGELVERICRLAEEDIREAKWTGKIKNVDESEALEELCMEIERHVVDVLLVHTLDELVYL

A0A6J1CWJ6 uncharacterized protein LOC111015050 (e value: 1.2e-74; % identity: 50.67)
Query:  MPNSGDHHSGCFSGMLRRLLCTGNLPTHPSEALNESRFNIPKAEAKLVARAAESGPGVVARLMGLSSLPDANWVPNHRARPGAVSRSKSVNFVDYLLDFD
        MPNSG  + GCFSGMLRRLLCTGNLPTHPS+ALNE     P+ E    A  A+ GPGVVARLMGLSSLPDANWVPNHRA PGAV RSKSVNF DYLL FD
Subjt:  MPNSGDHHSGCFSGMLRRLLCTGNLPTHPSEALNESRFNIPKAEAKLVARAAESGPGVVARLMGLSSLPDANWVPNHRARPGAVSRSKSVNFVDYLLDFD

Query:  SNQAHHRRIRTSASFREVPPLNPHSDFFVVCTKDYFDGY--GIESNLKKPETHRFDERKQGKEQSMNSNDLKKKKKKKKENARNEIKISKLKDEPRR---
        S+QAHHRR+RTSASFREVPPLNPH+DF V+ TKDYFDGY   IESNLK+PETHR  E KQGKE  + S+++KKKKK    N+RNE KISKLK+EPRR   
Subjt:  SNQAHHRRIRTSASFREVPPLNPHSDFFVVCTKDYFDGY--GIESNLKKPETHRFDERKQGKEQSMNSNDLKKKKKKKKENARNEIKISKLKDEPRR---

Query:  ------------NRD-----------------------------------------------------------------------------YDYG----
                    N+D                                                                             +D+     
Subjt:  ------------NRD-----------------------------------------------------------------------------YDYG----

Query:  -------------------ELVERICRLAEEDIREAKWTGKIKNVDESEALEELCMEIERHVVDVLLVHTLDELV
                           E V R CRLA+EDIREA W+  +KNV E EALEELCME+ERHVV+VLLV TLD+LV
Subjt:  -------------------ELVERICRLAEEDIREAKWTGKIKNVDESEALEELCMEIERHVVDVLLVHTLDELV

E5GBE7 VARLMGL domain-containing protein (e value: 1.6e-98; % identity: 59.23)
Query:  MPNSGDHHSGCFSGMLRRLLCTGNLPTHPSEALNESRFNIPKAEAKLVARAAESGPGVVARLMGLSSLPDANWVPNHRARPGAVSRSKSVNFVDYLLDFD
        MPNS   +SGCFSG+LRRLLCTGNLPTHPSEAL+ES+F+IPK EAKL A++AES PGVVARLMGLSSLPDANWVPNH+ RPGAVSRSKSVNF DYLLDFD
Subjt:  MPNSGDHHSGCFSGMLRRLLCTGNLPTHPSEALNESRFNIPKAEAKLVARAAESGPGVVARLMGLSSLPDANWVPNHRARPGAVSRSKSVNFVDYLLDFD

Query:  SNQAHHRRIRTSASFREVPPLNPHSDFFVVCTKDYFDGYGIESNLKKPETHRFDERKQGKEQSMNSNDLKKKKKKKKENARNEIKISKLKDEPRR-----
        SNQ+HHRRIRTSASFREVPPLNPH+DFFV+ TKD F+GYGIESNLKKPET RFDE KQ      +SNDL KKKKKKKENARNE+KISKLKDEPRR     
Subjt:  SNQAHHRRIRTSASFREVPPLNPHSDFFVVCTKDYFDGYGIESNLKKPETHRFDERKQGKEQSMNSNDLKKKKKKKKENARNEIKISKLKDEPRR-----

Query:  --------------------------------------------------------------------------------------------------NR
                                                                                                          NR
Subjt:  --------------------------------------------------------------------------------------------------NR

Query:  DYDYGELVERICRLAEEDIREAKWTGKIKNVDESEALEELCMEIERHVVDVLLVHTLDELVYL
         YDYGELVERICRLAEEDI EAKWT +IKN D+SEALEE+CMEIERHVVD LLVHTL+E   L
Subjt:  DYDYGELVERICRLAEEDIREAKWTGKIKNVDESEALEELCMEIERHVVDVLLVHTLDELVYL

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT5G01370.1 ALC-interacting protein 1 (e value: 4.8e-10; % identity: 28.69)
Query:  MPNSGDHHSGCFSGMLRRLLCTGNLPTHPSEALNESRFNI-----------PKAEAKLVARAAESG-----------PGVVARLMGLS-SLPDANWVPNH
        M    +  +GC + ++RRLLC+G+  THPS+ + +S   +           PK E K                    P VVA+LMGL    P +      
Subjt:  MPNSGDHHSGCFSGMLRRLLCTGNLPTHPSEALNESRFNI-----------PKAEAKLVARAAESG-----------PGVVARLMGLS-SLPDANWVPNH

Query:  RARPGAVSRSKSVNFVDYLL----------DFDSNQAHHRRIRTSASFREVPPLNPH---------SDFFVVCTKDYFDGYGIESNLKKPETHRFDERKQ
             AV+RSKSVNF+DY+L          D    Q   RR++ S SFRE+ P +            DF ++    Y D    +  L    +    +R +
Subjt:  RARPGAVSRSKSVNFVDYLL----------DFDSNQAHHRRIRTSASFREVPPLNPH---------SDFFVVCTKDYFDGYGIESNLKKPETHRFDERKQ

Query:  GKEQSMNSNDLKKKKKKKKENARNEIKISKLKDEPRR
           +      L     KKKE   NE    K KDEPR+
Subjt:  GKEQSMNSNDLKKKKKKKKENARNEIKISKLKDEPRR

AT5G58630.1 unknown protein (e value: 1.4e-09; % identity: 30.54)
Query:  GCFSGMLRRLLCTGNLPTHPSEALNESRFN----IPKAEAKLVARAAESGPGVVARLMGLSSLPDANWVPNHRARPGAVSRSKSVNFVDYLLDFDSNQAH
        GC S +L+R LC+G   T+PS+ + E  F     IP+       R    GPG VARLMGL S+P  +       +   +SRS SVN +    D D  Q  
Subjt:  GCFSGMLRRLLCTGNLPTHPSEALNESRFN----IPKAEAKLVARAAESGPGVVARLMGLSSLPDANWVPNHRARPGAVSRSKSVNFVDYLLDFDSNQAH

Query:  HRRIRTSASFREVPPLNPHSDFFVVC-TKDYFDGYGIESNLKKPETHRFDERK-------QGKEQSMNSNDL--------------KKKKKKKKENARNE
        HRR++++  + E+       DFF++   KD  D      N KK    R ++ K       +GKE ++N+ ++              ++K+K+K    R E
Subjt:  HRRIRTSASFREVPPLNPHSDFFVVC-TKDYFDGYGIESNLKKPETHRFDERK-------QGKEQSMNSNDL--------------KKKKKKKKENARNE

Query:  IKI
         K+
Subjt:  IKI


Sequences
CDS sequence
ATGCCCAACTCCGGCGACCACCATTCCGGCTGTTTTTCCGGCATGCTGCGGCGGCTGCTTTGCACTGGCAACCTTCCCACTCACCCATCTGAAGCTCTAAACGAATCACG
ATTTAATATACCAAAAGCAGAGGCTAAACTTGTGGCTCGAGCGGCCGAGTCCGGACCGGGCGTGGTGGCTCGGCTGATGGGTTTGAGTTCGCTTCCAGATGCCAATTGGG
TCCCGAACCATCGAGCCAGACCAGGTGCGGTCTCGCGGAGCAAGTCCGTGAATTTCGTTGATTATTTGCTGGATTTCGACTCGAACCAAGCCCACCACCGCCGAATCCGA
ACCTCCGCTTCGTTTCGTGAGGTCCCACCATTAAACCCACACAGTGATTTCTTTGTTGTATGCACGAAGGATTATTTCGACGGTTACGGAATTGAATCCAATTTGAAGAA
ACCCGAAACGCACCGTTTTGACGAAAGGAAACAGGGAAAGGAACAGAGCATGAACAGTAATGATTTGAAGAAGAAGAAGAAGAAGAAGAAAGAAAATGCAAGGAATGAAA
TCAAGATTTCAAAGCTTAAAGACGAGCCAAGAAGGAACAGAGATTACGATTATGGTGAATTGGTGGAAAGGATATGCAGATTGGCAGAGGAAGATATAAGAGAGGCAAAA
TGGACGGGTAAGATTAAGAACGTGGATGAATCTGAAGCGTTGGAAGAATTATGCATGGAGATTGAACGACATGTCGTCGATGTATTGTTGGTTCACACTCTGGACGAGTT
AGTGTACTTGTAG
mRNA sequence
ATGCCCAACTCCGGCGACCACCATTCCGGCTGTTTTTCCGGCATGCTGCGGCGGCTGCTTTGCACTGGCAACCTTCCCACTCACCCATCTGAAGCTCTAAACGAATCACG
ATTTAATATACCAAAAGCAGAGGCTAAACTTGTGGCTCGAGCGGCCGAGTCCGGACCGGGCGTGGTGGCTCGGCTGATGGGTTTGAGTTCGCTTCCAGATGCCAATTGGG
TCCCGAACCATCGAGCCAGACCAGGTGCGGTCTCGCGGAGCAAGTCCGTGAATTTCGTTGATTATTTGCTGGATTTCGACTCGAACCAAGCCCACCACCGCCGAATCCGA
ACCTCCGCTTCGTTTCGTGAGGTCCCACCATTAAACCCACACAGTGATTTCTTTGTTGTATGCACGAAGGATTATTTCGACGGTTACGGAATTGAATCCAATTTGAAGAA
ACCCGAAACGCACCGTTTTGACGAAAGGAAACAGGGAAAGGAACAGAGCATGAACAGTAATGATTTGAAGAAGAAGAAGAAGAAGAAGAAAGAAAATGCAAGGAATGAAA
TCAAGATTTCAAAGCTTAAAGACGAGCCAAGAAGGAACAGAGATTACGATTATGGTGAATTGGTGGAAAGGATATGCAGATTGGCAGAGGAAGATATAAGAGAGGCAAAA
TGGACGGGTAAGATTAAGAACGTGGATGAATCTGAAGCGTTGGAAGAATTATGCATGGAGATTGAACGACATGTCGTCGATGTATTGTTGGTTCACACTCTGGACGAGTT
AGTGTACTTGTAG
Protein sequence
MPNSGDHHSGCFSGMLRRLLCTGNLPTHPSEALNESRFNIPKAEAKLVARAAESGPGVVARLMGLSSLPDANWVPNHRARPGAVSRSKSVNFVDYLLDFDSNQAHHRRIR
TSASFREVPPLNPHSDFFVVCTKDYFDGYGIESNLKKPETHRFDERKQGKEQSMNSNDLKKKKKKKKENARNEIKISKLKDEPRRNRDYDYGELVERICRLAEEDIREAK
WTGKIKNVDESEALEELCMEIERHVVDVLLVHTLDELVYL