CuGenDBv2

Lag0021997 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0021997
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Ulp1-like peptidase
Genome location: chr7:15636658..15638499
RNA-Seq Expression: Lag0021997
Synteny: Lag0021997
Gene Ontology terms: NA
InterPro domains: IPR015410 - Domain of unknown function DUF1985
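
As a minimal sketch, the genome location string can be split into a chromosome name and a start/end pair. The helper names `parse_location` and `span_length` are hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this page does not state explicitly.

```python
import re

# Hypothetical helper: split "chrom:start..end" into its three parts.
def parse_location(location: str) -> tuple[str, int, int]:
    match = re.fullmatch(r"(\S+?):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    return chrom, start, end

# Assumption: coordinates are 1-based and inclusive, so both ends count.
def span_length(start: int, end: int) -> int:
    return abs(end - start) + 1

chrom, start, end = parse_location("chr7:15636658..15638499")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the locus spans 1,842 bp on chr7.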


Homology
GenBank top hits | e value | % identity | Alignment
XP_022146372.1 uncharacterized protein LOC111015600 [Momordica charantia] | 3.5e-59 | 43.6
Query:  YFRAQLTCCSHLTKTIEILQKKLTQTQLDMFRQTIFGPIVDNNILFNGQLIHHLLLREVEDPRKDVISFDIFGNKVSFGKEEFDLITGFRHNRRIVDRHE
        +F A LT  +H+ KT   ++ +LT TQLDMFRQT FGPI+D  ++FNG LIHHLLL EVE+PR+DVISFD+F  +VSFGK EFDLITG  H    V+ H 
Subjt:  YFRAQLTCCSHLTKTIEILQKKLTQTQLDMFRQTIFGPIVDNNILFNGQLIHHLLLREVEDPRKDVISFDIFGNKVSFGKEEFDLITGFRHNRRIVDRHE

Query:  SGVSLRRMYFNDSVKDTVMDAEKRFLEIQFESNEDAVKVALAYFIELAMFGRERKQKFNWSLLGIVDDWEIFCNYDWSKVIFEMTIRSLKKALSH--ATQ
         G  LR  YF DSV+    + EK FLE  F  +ED VKV + YFIELAM G+ERKQ  +   +G+VD WE FCN DWS +IF+ TI SLK  L    +  
Subjt:  SGVSLRRMYFNDSVKDTVMDAEKRFLEIQFESNEDAVKVALAYFIELAMFGRERKQKFNWSLLGIVDDWEIFCNYDWSKVIFEMTIRSLKKALSH--ATQ

Query:  RDVVAGKASQLERYSIYGFPHAFQVWTYETISSLTNRVANRMNQNAIPRFSRWSCSHSPTYTQLSSEIFGLTKARVTVQLVPSDAELEHMRRIVLPPQLQ
        +       + +E YS+YGFP+                   RM ++ +                L+SE+F  T ++V   L+ +DAE +HM R++LPP+++
Subjt:  RDVVAGKASQLERYSIYGFPHAFQVWTYETISSLTNRVANRMNQNAIPRFSRWSCSHSPTYTQLSSEIFGLTKARVTVQLVPSDAELEHMRRIVLPPQLQ

Query:  APILPPQPEAPVSPPHPEAPVLSPQPDA
          ++P  P  P     P+  V+ P P A
Subjt:  APILPPQPEAPVSPPHPEAPVLSPQPDA

XP_022153201.1 uncharacterized protein LOC111020757 [Momordica charantia] | 6.5e-82 | 41.25
Query:  YFRAQLTCCSHLTKTIEILQKKLTQTQLDMFRQTIFGPIVDNNILFNGQLIHHLLLREVEDPRKDVISFDIFGNKVSFGKEEFDLITGFRHNRRIVDRHE
        +F A LT  +H+ KT   ++ +LT TQLDMFRQT FGPI+D +++FNG LIHHLLLREVE+PR+DVISFD+FG +VSFGK EFDLITG  H    VD H 
Subjt:  YFRAQLTCCSHLTKTIEILQKKLTQTQLDMFRQTIFGPIVDNNILFNGQLIHHLLLREVEDPRKDVISFDIFGNKVSFGKEEFDLITGFRHNRRIVDRHE

Query:  SGVSLRRMYFNDSVKDTVMDAEKRFLEIQFESNEDAVKVALAYFIELAMFGRERKQKFNWSLLGIVDDWEIFCNYDWSKVIFEMTIRSLKKALSH--ATQ
         G  LR  YF D V+    + EK FLE  F  +ED VKV + YFIELAM G+ERKQ  + +LLG+VD WE+FCNYDWS +IF+ TI SLK AL    +  
Subjt:  SGVSLRRMYFNDSVKDTVMDAEKRFLEIQFESNEDAVKVALAYFIELAMFGRERKQKFNWSLLGIVDDWEIFCNYDWSKVIFEMTIRSLKKALSH--ATQ

Query:  RDVVAGKASQLERYSIYGFPHAFQVWTYETISSLTNRVANRMNQNAIPRFSRWSCSHSPTYTQLSSEIFGLTKARVTVQLVPSDAELEHMRRIVLPPQLQ
        +       S +E YS+YGFP+AFQVW YETIS+L++        +AIPR  RWSC +S  +  L+SE+F  T+++V   L+ +DA+ +HM R++LPP+++
Subjt:  RDVVAGKASQLERYSIYGFPHAFQVWTYETISSLTNRVANRMNQNAIPRFSRWSCSHSPTYTQLSSEIFGLTKARVTVQLVPSDAELEHMRRIVLPPQLQ

Query:  A-PILPPQPEAPVSPPHPEAPVLSPQPDANLDHPVGSDRRSEEAGLDRSSPTKDVEMVRLDEQSTHDSLPEGVGKTCQCD-----CKQAYESLDRRMKVV
          P  P  P+  V P  P +P  +  PD   D  +G              P  D   V     S +D   EG+ K  + +       +  + LD  +  +
Subjt:  A-PILPPQPEAPVSPPHPEAPVLSPQPDANLDHPVGSDRRSEEAGLDRSSPTKDVEMVRLDEQSTHDSLPEGVGKTCQCD-----CKQAYESLDRRMKVV

Query:  ESDVKEMKCDLKSITKYLRRLSKGQMVVDPTKYLGPDSSAAASGDEPAAETEEGGRRRAVAAASGGRRSSDIGRRRRRKEKVEKMEKKKKKREKGKG
        E  + +    LK I  YL++L+KG+   D +KY G        G  P  +     R        GGR+S D  +R    ++ ++  + +K+   G G
Subjt:  ESDVKEMKCDLKSITKYLRRLSKGQMVVDPTKYLGPDSSAAASGDEPAAETEEGGRRRAVAAASGGRRSSDIGRRRRRKEKVEKMEKKKKKREKGKG

XP_022154995.1 uncharacterized protein LOC111022139 [Momordica charantia] | 3.1e-52 | 55.9
Query:  MVAKIRPEGYFRAQLTCCSHLTKTIEILQKKLTQTQLDMFRQTIFGPIVDNNILFNGQLIHHLLLREVEDPRKDVISFDIFGNKVSFGKEEFDLITGFRH
        M  KI  +  F A L+  +H+ KT   L+ +LT +QLDMF QT FG I+  N +FN  L+HHLLLREVE+PR D+ISF++FGN+VSFGK EFDLITG RH
Subjt:  MVAKIRPEGYFRAQLTCCSHLTKTIEILQKKLTQTQLDMFRQTIFGPIVDNNILFNGQLIHHLLLREVEDPRKDVISFDIFGNKVSFGKEEFDLITGFRH

Query:  NRRIVDRHESGVSLRRMYFNDSVKDTVMDAEKRFLEIQFESNEDAVKVALAYFIELAMFGRERKQKFNWSLLGIVDDWEIFCNYDWSKVIFEMTI
            V        LR +YF D       + EK FLE  F+++EDAVK+A+ YFIELAM G+ERKQK + SLLGIVD WE+FCNYDWS +I EMT+
Subjt:  NRRIVDRHESGVSLRRMYFNDSVKDTVMDAEKRFLEIQFESNEDAVKVALAYFIELAMFGRERKQKFNWSLLGIVDDWEIFCNYDWSKVIFEMTI

XP_022155158.1 uncharacterized protein LOC111022300 [Momordica charantia] | 8.8e-55 | 56.12
Query:  EGYFRAQLTCCSHLTKTIEILQKKLTQTQLDMFRQTIFGPIVDNNILFNGQLIHHLLLREVEDPRKDVISFDIFGNKVSFGKEEFDLITGFRHNRRIVDR
        + +F   LT  +H  KT   L+ +LT TQ+DMFRQT FGPI+D +++FNG LIHHLLLREVE+PR+D+ISFD+FG +VSFGK EFDLITG  +    VD 
Subjt:  EGYFRAQLTCCSHLTKTIEILQKKLTQTQLDMFRQTIFGPIVDNNILFNGQLIHHLLLREVEDPRKDVISFDIFGNKVSFGKEEFDLITGFRHNRRIVDR

Query:  HESGVSLRRMYFNDSVKDTVMDAEKRFLEIQFESNEDAVKVALAYFIELAMFGRERKQKFNWSLLGIVDDWEIFCNYDWSKVIFEMTIRSLKKALS
           G  LR  YF DSV+    + EK F+E  F  +EDAVKV + YF+ELAM G+ERKQ  + +LLG+VD WE+FCN+DWS +IFE T+ SLK A++
Subjt:  HESGVSLRRMYFNDSVKDTVMDAEKRFLEIQFESNEDAVKVALAYFIELAMFGRERKQKFNWSLLGIVDDWEIFCNYDWSKVIFEMTIRSLKKALS

XP_022157020.1 uncharacterized protein LOC111023847 [Momordica charantia] | 1.0e-82 | 53.9
Query:  MVAKIRPEGYFRAQLTCCSHLTKTIEILQKKLTQTQLDMFRQTIFGPIVDNNILFNGQLIHHLLLREVEDPRKDVISFDIFGNKVSFGKEEFDLITGFRH
        M  KI  + +F A L+  +H+ KT   L+ +LT +QLDMF QT FGPI+  N++FNG L+HHLLLREVE+P+ D+ISF++FGN+VSFGK EFDLITG RH
Subjt:  MVAKIRPEGYFRAQLTCCSHLTKTIEILQKKLTQTQLDMFRQTIFGPIVDNNILFNGQLIHHLLLREVEDPRKDVISFDIFGNKVSFGKEEFDLITGFRH

Query:  NRRIVDRHESGVSLRRMYFNDSVKDTVMDAEKRFLEIQFESNEDAVKVALAYFIELAMFGRERKQKFNWSLLGIVDDWEIFCNYDWSKVIFEMTIRSLKK
            VD       LR +YF D       + EK FLE  FE++EDAVK+A+ YFIELAM G+ERK K + SLLGIVD WE+FCNYDWS +IFE T+ SLK 
Subjt:  NRRIVDRHESGVSLRRMYFNDSVKDTVMDAEKRFLEIQFESNEDAVKVALAYFIELAMFGRERKQKFNWSLLGIVDDWEIFCNYDWSKVIFEMTIRSLKK

Query:  ALSHATQ--RDVVAGKASQLERYSIYGFPHAFQVWTYETISSLTNRVANRMNQNAIPRFSRWSCSHSPTYTQLSSEIFGLTKARVTVQLVPSDAE
        AL    +  +  VA  +S +E YS+Y FP+AFQVW YETIS+L+ RVA R+N +AIPR  RWSC++S  +  L  E+F   K++V V+L  +D E
Subjt:  ALSHATQ--RDVVAGKASQLERYSIYGFPHAFQVWTYETISSLTNRVANRMNQNAIPRFSRWSCSHSPTYTQLSSEIFGLTKARVTVQLVPSDAE

TrEMBL top hits | e value | % identity | Alignment
A0A6J1CZE8 uncharacterized protein LOC111015600 | 1.7e-59 | 43.6
Query:  YFRAQLTCCSHLTKTIEILQKKLTQTQLDMFRQTIFGPIVDNNILFNGQLIHHLLLREVEDPRKDVISFDIFGNKVSFGKEEFDLITGFRHNRRIVDRHE
        +F A LT  +H+ KT   ++ +LT TQLDMFRQT FGPI+D  ++FNG LIHHLLL EVE+PR+DVISFD+F  +VSFGK EFDLITG  H    V+ H 
Subjt:  YFRAQLTCCSHLTKTIEILQKKLTQTQLDMFRQTIFGPIVDNNILFNGQLIHHLLLREVEDPRKDVISFDIFGNKVSFGKEEFDLITGFRHNRRIVDRHE

Query:  SGVSLRRMYFNDSVKDTVMDAEKRFLEIQFESNEDAVKVALAYFIELAMFGRERKQKFNWSLLGIVDDWEIFCNYDWSKVIFEMTIRSLKKALSH--ATQ
         G  LR  YF DSV+    + EK FLE  F  +ED VKV + YFIELAM G+ERKQ  +   +G+VD WE FCN DWS +IF+ TI SLK  L    +  
Subjt:  SGVSLRRMYFNDSVKDTVMDAEKRFLEIQFESNEDAVKVALAYFIELAMFGRERKQKFNWSLLGIVDDWEIFCNYDWSKVIFEMTIRSLKKALSH--ATQ

Query:  RDVVAGKASQLERYSIYGFPHAFQVWTYETISSLTNRVANRMNQNAIPRFSRWSCSHSPTYTQLSSEIFGLTKARVTVQLVPSDAELEHMRRIVLPPQLQ
        +       + +E YS+YGFP+                   RM ++ +                L+SE+F  T ++V   L+ +DAE +HM R++LPP+++
Subjt:  RDVVAGKASQLERYSIYGFPHAFQVWTYETISSLTNRVANRMNQNAIPRFSRWSCSHSPTYTQLSSEIFGLTKARVTVQLVPSDAELEHMRRIVLPPQLQ

Query:  APILPPQPEAPVSPPHPEAPVLSPQPDA
          ++P  P  P     P+  V+ P P A
Subjt:  APILPPQPEAPVSPPHPEAPVLSPQPDA

A0A6J1DJX9 uncharacterized protein LOC111020757 | 3.1e-82 | 41.25
Query:  YFRAQLTCCSHLTKTIEILQKKLTQTQLDMFRQTIFGPIVDNNILFNGQLIHHLLLREVEDPRKDVISFDIFGNKVSFGKEEFDLITGFRHNRRIVDRHE
        +F A LT  +H+ KT   ++ +LT TQLDMFRQT FGPI+D +++FNG LIHHLLLREVE+PR+DVISFD+FG +VSFGK EFDLITG  H    VD H 
Subjt:  YFRAQLTCCSHLTKTIEILQKKLTQTQLDMFRQTIFGPIVDNNILFNGQLIHHLLLREVEDPRKDVISFDIFGNKVSFGKEEFDLITGFRHNRRIVDRHE

Query:  SGVSLRRMYFNDSVKDTVMDAEKRFLEIQFESNEDAVKVALAYFIELAMFGRERKQKFNWSLLGIVDDWEIFCNYDWSKVIFEMTIRSLKKALSH--ATQ
         G  LR  YF D V+    + EK FLE  F  +ED VKV + YFIELAM G+ERKQ  + +LLG+VD WE+FCNYDWS +IF+ TI SLK AL    +  
Subjt:  SGVSLRRMYFNDSVKDTVMDAEKRFLEIQFESNEDAVKVALAYFIELAMFGRERKQKFNWSLLGIVDDWEIFCNYDWSKVIFEMTIRSLKKALSH--ATQ

Query:  RDVVAGKASQLERYSIYGFPHAFQVWTYETISSLTNRVANRMNQNAIPRFSRWSCSHSPTYTQLSSEIFGLTKARVTVQLVPSDAELEHMRRIVLPPQLQ
        +       S +E YS+YGFP+AFQVW YETIS+L++        +AIPR  RWSC +S  +  L+SE+F  T+++V   L+ +DA+ +HM R++LPP+++
Subjt:  RDVVAGKASQLERYSIYGFPHAFQVWTYETISSLTNRVANRMNQNAIPRFSRWSCSHSPTYTQLSSEIFGLTKARVTVQLVPSDAELEHMRRIVLPPQLQ

Query:  A-PILPPQPEAPVSPPHPEAPVLSPQPDANLDHPVGSDRRSEEAGLDRSSPTKDVEMVRLDEQSTHDSLPEGVGKTCQCD-----CKQAYESLDRRMKVV
          P  P  P+  V P  P +P  +  PD   D  +G              P  D   V     S +D   EG+ K  + +       +  + LD  +  +
Subjt:  A-PILPPQPEAPVSPPHPEAPVLSPQPDANLDHPVGSDRRSEEAGLDRSSPTKDVEMVRLDEQSTHDSLPEGVGKTCQCD-----CKQAYESLDRRMKVV

Query:  ESDVKEMKCDLKSITKYLRRLSKGQMVVDPTKYLGPDSSAAASGDEPAAETEEGGRRRAVAAASGGRRSSDIGRRRRRKEKVEKMEKKKKKREKGKG
        E  + +    LK I  YL++L+KG+   D +KY G        G  P  +     R        GGR+S D  +R    ++ ++  + +K+   G G
Subjt:  ESDVKEMKCDLKSITKYLRRLSKGQMVVDPTKYLGPDSSAAASGDEPAAETEEGGRRRAVAAASGGRRSSDIGRRRRRKEKVEKMEKKKKKREKGKG

A0A6J1DL69 uncharacterized protein LOC111022139 | 1.5e-52 | 55.9
Query:  MVAKIRPEGYFRAQLTCCSHLTKTIEILQKKLTQTQLDMFRQTIFGPIVDNNILFNGQLIHHLLLREVEDPRKDVISFDIFGNKVSFGKEEFDLITGFRH
        M  KI  +  F A L+  +H+ KT   L+ +LT +QLDMF QT FG I+  N +FN  L+HHLLLREVE+PR D+ISF++FGN+VSFGK EFDLITG RH
Subjt:  MVAKIRPEGYFRAQLTCCSHLTKTIEILQKKLTQTQLDMFRQTIFGPIVDNNILFNGQLIHHLLLREVEDPRKDVISFDIFGNKVSFGKEEFDLITGFRH

Query:  NRRIVDRHESGVSLRRMYFNDSVKDTVMDAEKRFLEIQFESNEDAVKVALAYFIELAMFGRERKQKFNWSLLGIVDDWEIFCNYDWSKVIFEMTI
            V        LR +YF D       + EK FLE  F+++EDAVK+A+ YFIELAM G+ERKQK + SLLGIVD WE+FCNYDWS +I EMT+
Subjt:  NRRIVDRHESGVSLRRMYFNDSVKDTVMDAEKRFLEIQFESNEDAVKVALAYFIELAMFGRERKQKFNWSLLGIVDDWEIFCNYDWSKVIFEMTI

A0A6J1DM82 uncharacterized protein LOC111022300 | 4.3e-55 | 56.12
Query:  EGYFRAQLTCCSHLTKTIEILQKKLTQTQLDMFRQTIFGPIVDNNILFNGQLIHHLLLREVEDPRKDVISFDIFGNKVSFGKEEFDLITGFRHNRRIVDR
        + +F   LT  +H  KT   L+ +LT TQ+DMFRQT FGPI+D +++FNG LIHHLLLREVE+PR+D+ISFD+FG +VSFGK EFDLITG  +    VD 
Subjt:  EGYFRAQLTCCSHLTKTIEILQKKLTQTQLDMFRQTIFGPIVDNNILFNGQLIHHLLLREVEDPRKDVISFDIFGNKVSFGKEEFDLITGFRHNRRIVDR

Query:  HESGVSLRRMYFNDSVKDTVMDAEKRFLEIQFESNEDAVKVALAYFIELAMFGRERKQKFNWSLLGIVDDWEIFCNYDWSKVIFEMTIRSLKKALS
           G  LR  YF DSV+    + EK F+E  F  +EDAVKV + YF+ELAM G+ERKQ  + +LLG+VD WE+FCN+DWS +IFE T+ SLK A++
Subjt:  HESGVSLRRMYFNDSVKDTVMDAEKRFLEIQFESNEDAVKVALAYFIELAMFGRERKQKFNWSLLGIVDDWEIFCNYDWSKVIFEMTIRSLKKALS

A0A6J1DRZ7 uncharacterized protein LOC111023847 | 4.9e-83 | 53.9
Query:  MVAKIRPEGYFRAQLTCCSHLTKTIEILQKKLTQTQLDMFRQTIFGPIVDNNILFNGQLIHHLLLREVEDPRKDVISFDIFGNKVSFGKEEFDLITGFRH
        M  KI  + +F A L+  +H+ KT   L+ +LT +QLDMF QT FGPI+  N++FNG L+HHLLLREVE+P+ D+ISF++FGN+VSFGK EFDLITG RH
Subjt:  MVAKIRPEGYFRAQLTCCSHLTKTIEILQKKLTQTQLDMFRQTIFGPIVDNNILFNGQLIHHLLLREVEDPRKDVISFDIFGNKVSFGKEEFDLITGFRH

Query:  NRRIVDRHESGVSLRRMYFNDSVKDTVMDAEKRFLEIQFESNEDAVKVALAYFIELAMFGRERKQKFNWSLLGIVDDWEIFCNYDWSKVIFEMTIRSLKK
            VD       LR +YF D       + EK FLE  FE++EDAVK+A+ YFIELAM G+ERK K + SLLGIVD WE+FCNYDWS +IFE T+ SLK 
Subjt:  NRRIVDRHESGVSLRRMYFNDSVKDTVMDAEKRFLEIQFESNEDAVKVALAYFIELAMFGRERKQKFNWSLLGIVDDWEIFCNYDWSKVIFEMTIRSLKK

Query:  ALSHATQ--RDVVAGKASQLERYSIYGFPHAFQVWTYETISSLTNRVANRMNQNAIPRFSRWSCSHSPTYTQLSSEIFGLTKARVTVQLVPSDAE
        AL    +  +  VA  +S +E YS+Y FP+AFQVW YETIS+L+ RVA R+N +AIPR  RWSC++S  +  L  E+F   K++V V+L  +D E
Subjt:  ALSHATQ--RDVVAGKASQLERYSIYGFPHAFQVWTYETISSLTNRVANRMNQNAIPRFSRWSCSHSPTYTQLSSEIFGLTKARVTVQLVPSDAE

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences

CDS sequence
ATGGTAGCAAAGATTAGGCCGGAAGGGTACTTTCGTGCACAGTTGACTTGTTGTTCGCACTTGACAAAAACAATTGAAATTTTGCAAAAGAAATTGACTCAAACCCAATT
AGATATGTTTAGGCAAACCATATTTGGCCCTATAGTAGACAACAACATATTATTTAATGGTCAGTTAATCCACCATCTACTACTTAGGGAGGTTGAGGATCCCAGGAAGG
ATGTAATTAGTTTCGATATATTTGGAAATAAGGTGTCGTTTGGCAAGGAAGAATTCGATCTAATCACCGGATTTAGACACAATAGAAGGATAGTTGATAGACATGAGTCG
GGGGTTAGTTTGAGGCGTATGTACTTTAATGACAGTGTCAAAGATACCGTAATGGATGCTGAAAAAAGATTTTTAGAAATACAGTTTGAGTCAAATGAAGATGCGGTGAA
GGTAGCGCTCGCATATTTTATCGAGCTAGCAATGTTTGGGCGGGAGAGGAAACAAAAATTCAATTGGTCTTTATTGGGTATCGTGGACGATTGGGAGATATTCTGCAATT
ATGACTGGAGCAAAGTAATTTTTGAGATGACTATAAGGAGTTTGAAGAAAGCACTCAGTCATGCCACCCAAAGAGACGTCGTGGCCGGAAAGGCTAGTCAATTGGAAAGA
TATAGTATTTACGGCTTTCCACATGCTTTTCAGGTATGGACGTATGAGACTATTTCATCTCTAACGAACCGTGTTGCGAACCGGATGAACCAGAATGCGATCCCACGGTT
TTCTCGGTGGTCATGTTCCCATTCTCCTACGTACACTCAACTTAGCAGCGAGATATTTGGGTTGACGAAGGCAAGGGTGACAGTGCAATTGGTTCCAAGTGATGCAGAGC
TCGAACACATGCGTCGTATTGTTTTGCCGCCACAACTACAGGCCCCTATTTTGCCGCCACAACCAGAGGCCCCTGTTTCGCCGCCACATCCAGAGGCCCCTGTTTTGTCG
CCACAACCAGATGCAAACCTAGATCATCCTGTGGGGAGTGATAGAAGGTCAGAAGAGGCTGGTTTGGATAGGAGTTCACCGACAAAAGATGTAGAAATGGTTAGGCTCGA
TGAACAATCGACACACGACAGTCTACCTGAAGGCGTGGGCAAGACCTGCCAATGTGACTGCAAGCAAGCATACGAGTCACTAGACCGACGGATGAAGGTGGTGGAGTCCG
ATGTAAAAGAGATGAAATGTGATTTAAAGTCGATCACGAAGTATTTGCGCCGGTTATCTAAGGGTCAAATGGTGGTTGATCCTACCAAGTATTTGGGTCCCGACAGTAGT
GCAGCTGCATCAGGTGATGAACCGGCGGCGGAGACGGAGGAGGGAGGTCGCCGGCGAGCGGTGGCGGCGGCGAGCGGTGGCCGCCGGTCGTCAGACATAGGAAGAAGAAG
AAGGAGGAAGGAGAAGGTGGAGAAGATGGAAAAGAAGAAGAAGAAGAGAGAGAAAGGAAAGGGAGAAAGAATAAAAAAATAA

mRNA sequence
ATGGTAGCAAAGATTAGGCCGGAAGGGTACTTTCGTGCACAGTTGACTTGTTGTTCGCACTTGACAAAAACAATTGAAATTTTGCAAAAGAAATTGACTCAAACCCAATT
AGATATGTTTAGGCAAACCATATTTGGCCCTATAGTAGACAACAACATATTATTTAATGGTCAGTTAATCCACCATCTACTACTTAGGGAGGTTGAGGATCCCAGGAAGG
ATGTAATTAGTTTCGATATATTTGGAAATAAGGTGTCGTTTGGCAAGGAAGAATTCGATCTAATCACCGGATTTAGACACAATAGAAGGATAGTTGATAGACATGAGTCG
GGGGTTAGTTTGAGGCGTATGTACTTTAATGACAGTGTCAAAGATACCGTAATGGATGCTGAAAAAAGATTTTTAGAAATACAGTTTGAGTCAAATGAAGATGCGGTGAA
GGTAGCGCTCGCATATTTTATCGAGCTAGCAATGTTTGGGCGGGAGAGGAAACAAAAATTCAATTGGTCTTTATTGGGTATCGTGGACGATTGGGAGATATTCTGCAATT
ATGACTGGAGCAAAGTAATTTTTGAGATGACTATAAGGAGTTTGAAGAAAGCACTCAGTCATGCCACCCAAAGAGACGTCGTGGCCGGAAAGGCTAGTCAATTGGAAAGA
TATAGTATTTACGGCTTTCCACATGCTTTTCAGGTATGGACGTATGAGACTATTTCATCTCTAACGAACCGTGTTGCGAACCGGATGAACCAGAATGCGATCCCACGGTT
TTCTCGGTGGTCATGTTCCCATTCTCCTACGTACACTCAACTTAGCAGCGAGATATTTGGGTTGACGAAGGCAAGGGTGACAGTGCAATTGGTTCCAAGTGATGCAGAGC
TCGAACACATGCGTCGTATTGTTTTGCCGCCACAACTACAGGCCCCTATTTTGCCGCCACAACCAGAGGCCCCTGTTTCGCCGCCACATCCAGAGGCCCCTGTTTTGTCG
CCACAACCAGATGCAAACCTAGATCATCCTGTGGGGAGTGATAGAAGGTCAGAAGAGGCTGGTTTGGATAGGAGTTCACCGACAAAAGATGTAGAAATGGTTAGGCTCGA
TGAACAATCGACACACGACAGTCTACCTGAAGGCGTGGGCAAGACCTGCCAATGTGACTGCAAGCAAGCATACGAGTCACTAGACCGACGGATGAAGGTGGTGGAGTCCG
ATGTAAAAGAGATGAAATGTGATTTAAAGTCGATCACGAAGTATTTGCGCCGGTTATCTAAGGGTCAAATGGTGGTTGATCCTACCAAGTATTTGGGTCCCGACAGTAGT
GCAGCTGCATCAGGTGATGAACCGGCGGCGGAGACGGAGGAGGGAGGTCGCCGGCGAGCGGTGGCGGCGGCGAGCGGTGGCCGCCGGTCGTCAGACATAGGAAGAAGAAG
AAGGAGGAAGGAGAAGGTGGAGAAGATGGAAAAGAAGAAGAAGAAGAGAGAGAAAGGAAAGGGAGAAAGAATAAAAAAATAA

Protein sequence
MVAKIRPEGYFRAQLTCCSHLTKTIEILQKKLTQTQLDMFRQTIFGPIVDNNILFNGQLIHHLLLREVEDPRKDVISFDIFGNKVSFGKEEFDLITGFRHNRRIVDRHES
GVSLRRMYFNDSVKDTVMDAEKRFLEIQFESNEDAVKVALAYFIELAMFGRERKQKFNWSLLGIVDDWEIFCNYDWSKVIFEMTIRSLKKALSHATQRDVVAGKASQLER
YSIYGFPHAFQVWTYETISSLTNRVANRMNQNAIPRFSRWSCSHSPTYTQLSSEIFGLTKARVTVQLVPSDAELEHMRRIVLPPQLQAPILPPQPEAPVSPPHPEAPVLS
PQPDANLDHPVGSDRRSEEAGLDRSSPTKDVEMVRLDEQSTHDSLPEGVGKTCQCDCKQAYESLDRRMKVVESDVKEMKCDLKSITKYLRRLSKGQMVVDPTKYLGPDSS
AAASGDEPAAETEEGGRRRAVAAASGGRRSSDIGRRRRRKEKVEKMEKKKKKREKGKGERIKK
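
The protein sequence above is consistent with a direct translation of the CDS. The following is a minimal sketch of that translation, assuming the standard genetic code (NCBI translation table 1); the page does not say which table was used. The `translate` helper is hypothetical, and only the first 30 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino
    for codon, amino in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Hypothetical helper: translate a CDS, stopping at the first stop codon.
def translate(cds: str) -> str:
    cds = cds.upper().replace("\n", "")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino = CODON_TABLE[cds[i:i + 3]]
        if amino == "*":
            break
        protein.append(amino)
    return "".join(protein)

# First 30 nt of the CDS sequence listed above.
print(translate("ATGGTAGCAAAGATTAGGCCGGAAGGGTAC"))
```

Passing the full CDS (with line breaks removed) instead of the 30 nt prefix gives the complete translation, which can then be compared against the protein sequence listed above.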