; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0017732 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0017732
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionBEST Arabidopsis thaliana protein match is: embryo defective 1303 .
Genome locationchr5:8011889..8026698
RNA-Seq ExpressionLag0017732
SyntenyLag0017732
Gene Ontology termsGO:0009658 - chloroplast organization (biological process)
GO:0010027 - thylakoid membrane organization (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR040299 - Protein FERTILITY RESTORER RF2-like
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6577548.1 Protein YELLOW LEAF 1, choloroplastic, partial [Cucurbita argyrosperma subsp. sororia]3.0e-7375.73Show/hide
Query:  SPSFSRVSGCHPPSPGDAYYKFWCPVTSSTPRKLKNPQSNMMTIPAFSVGQHWGRMKAKDFHLRRDTFQLQTPNERRQPVIAKAGPLFLKPIPSTGTRGG
        S  F  +SGCH PS      K     TSS           M+T+    VGQHWG+MK KDF LRRD+ QLQTPN RRQ VIAKAGP+FLKPIPSTGT+GG
Subjt:  SPSFSRVSGCHPPSPGDAYYKFWCPVTSSTPRKLKNPQSNMMTIPAFSVGQHWGRMKAKDFHLRRDTFQLQTPNERRQPVIAKAGPLFLKPIPSTGTRGG

Query:  ILYSSRK-NNAFMCFAALNARCAAEQTQTVTREAPTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPY
        +LYS+RK NNAF+CFAALNARCAAEQTQTVTREAPTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEG Y
Subjt:  ILYSSRK-NNAFMCFAALNARCAAEQTQTVTREAPTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPY

Query:  QNDRRR
        ++DRRR
Subjt:  QNDRRR

KAG6596785.1 Protein YELLOW LEAF 1, choloroplastic, partial [Cucurbita argyrosperma subsp. sororia]8.6e-7390.45Show/hide
Query:  VGQHWGRMKAKDFHLRRDTFQLQTPNERRQPVIAKAGPLFLKPIPSTGTRGGILYSSRKNNAFMCFAALNARCAAEQTQTVTREAPTITVLPGKEKSPQL
        VGQHWGRMKA +FHL RD+FQ++T N RRQPVIAKAGPL++KPI STG RGG+LYSSRKNNAF+CFAALNARCAAEQTQTVTREAPTITVLPGKEKSPQL
Subjt:  VGQHWGRMKAKDFHLRRDTFQLQTPNERRQPVIAKAGPLFLKPIPSTGTRGGILYSSRKNNAFMCFAALNARCAAEQTQTVTREAPTITVLPGKEKSPQL

Query:  DDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR
        DDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESE PYQNDRRR
Subjt:  DDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR

XP_023005574.1 uncharacterized protein LOC111498519 isoform X1 [Cucurbita maxima]1.1e-7290.45Show/hide
Query:  VGQHWGRMKAKDFHLRRDTFQLQTPNERRQPVIAKAGPLFLKPIPSTGTRGGILYSSRKNNAFMCFAALNARCAAEQTQTVTREAPTITVLPGKEKSPQL
        VGQHWGRMKA +FHL RD+FQ++T N RRQPVIAKAGPL++KPI STGT GG+LYSSRKNNAF+CFAALNARCAAEQTQTVTREAPTITVLPGKEKSPQL
Subjt:  VGQHWGRMKAKDFHLRRDTFQLQTPNERRQPVIAKAGPLFLKPIPSTGTRGGILYSSRKNNAFMCFAALNARCAAEQTQTVTREAPTITVLPGKEKSPQL

Query:  DDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR
        DDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESE PYQNDRRR
Subjt:  DDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR

XP_023540533.1 uncharacterized protein LOC111800870 [Cucurbita pepo subsp. pepo]1.0e-7391.08Show/hide
Query:  VGQHWGRMKAKDFHLRRDTFQLQTPNERRQPVIAKAGPLFLKPIPSTGTRGGILYSSRKNNAFMCFAALNARCAAEQTQTVTREAPTITVLPGKEKSPQL
        VGQHWGRM+A +FHL RD+FQ+QT N RRQPVIAKAGPL++KPI STGTRGG+LYSSRKNNAF+CFAALNARCAAEQTQTVTREAPTITVLPGKEKSPQL
Subjt:  VGQHWGRMKAKDFHLRRDTFQLQTPNERRQPVIAKAGPLFLKPIPSTGTRGGILYSSRKNNAFMCFAALNARCAAEQTQTVTREAPTITVLPGKEKSPQL

Query:  DDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR
        DDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESE PYQNDRRR
Subjt:  DDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR

XP_023553392.1 uncharacterized protein LOC111810820 isoform X1 [Cucurbita pepo subsp. pepo]1.5e-7290.51Show/hide
Query:  VGQHWGRMKAKDFHLRRDTFQLQTPNERRQPVIAKAGPLFLKPIPSTGTRGGILYSSRK-NNAFMCFAALNARCAAEQTQTVTREAPTITVLPGKEKSPQ
        VGQHWG+MKAKDF LRRD+ QLQTPN RRQ VIAKAGP+FLKPIPSTGT+GG+LYS+RK NNAF+CFAALNARCAAEQTQTVTREAPTITVLPGKEKSPQ
Subjt:  VGQHWGRMKAKDFHLRRDTFQLQTPNERRQPVIAKAGPLFLKPIPSTGTRGGILYSSRK-NNAFMCFAALNARCAAEQTQTVTREAPTITVLPGKEKSPQ

Query:  LDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR
        LDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEG Y++DRRR
Subjt:  LDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR

TrEMBL top hitse value%identityAlignment
A0A6J1CP11 uncharacterized protein LOC1110133921.9e-7088.54Show/hide
Query:  VGQHWGRMKAKDFHLRRDTFQLQTPNERRQPVIAKAGPLFLKPIPSTGTRGGILYSSRKNNAFMCFAALNARCAAEQTQTVTREAPTITVLPGKEKSPQL
        VG+HWGR KAK+FHL RD+FQLQ  N R Q V+AK GPLFLKPIPS GTR  +LYSSRK NAF CFAALNARCAAEQTQTVTREAPTITVLPGKEKSPQL
Subjt:  VGQHWGRMKAKDFHLRRDTFQLQTPNERRQPVIAKAGPLFLKPIPSTGTRGGILYSSRKNNAFMCFAALNARCAAEQTQTVTREAPTITVLPGKEKSPQL

Query:  DDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR
        DDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR
Subjt:  DDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR

A0A6J1F258 uncharacterized protein LOC1114387812.1e-7289.87Show/hide
Query:  VGQHWGRMKAKDFHLRRDTFQLQTPNERRQPVIAKAGPLFLKPIPSTGTRGGILYSSRK-NNAFMCFAALNARCAAEQTQTVTREAPTITVLPGKEKSPQ
        VGQHWG+MK KDF LRRD+ QLQTPN RRQ VIAKAGP+FLKPIPSTGT+GG+LYS+RK NNAF+CFAALNARCAAEQTQTVTREAPTITVLPGKEKSPQ
Subjt:  VGQHWGRMKAKDFHLRRDTFQLQTPNERRQPVIAKAGPLFLKPIPSTGTRGGILYSSRK-NNAFMCFAALNARCAAEQTQTVTREAPTITVLPGKEKSPQ

Query:  LDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR
        LDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEG Y++DRRR
Subjt:  LDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR

A0A6J1GD79 uncharacterized protein LOC1114531172.7e-7289.81Show/hide
Query:  VGQHWGRMKAKDFHLRRDTFQLQTPNERRQPVIAKAGPLFLKPIPSTGTRGGILYSSRKNNAFMCFAALNARCAAEQTQTVTREAPTITVLPGKEKSPQL
        VGQHWGRMKA +FHL  D+FQ++T N RRQPVIAKAGPL++KPI STG RGG+LYSSRKNNAF+CFAALNARCAAEQTQTVTREAPTITVLPGKEKSPQL
Subjt:  VGQHWGRMKAKDFHLRRDTFQLQTPNERRQPVIAKAGPLFLKPIPSTGTRGGILYSSRKNNAFMCFAALNARCAAEQTQTVTREAPTITVLPGKEKSPQL

Query:  DDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR
        DDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESE PYQNDRRR
Subjt:  DDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR

A0A6J1HQC3 uncharacterized protein LOC1114651274.6e-7289.87Show/hide
Query:  VGQHWGRMKAKDFHLRRDTFQLQTPNERRQPVIAKAGPLFLKPIPSTGTRGGILYSSRK-NNAFMCFAALNARCAAEQTQTVTREAPTITVLPGKEKSPQ
        VGQHWG+MK KDF LRRD+ QLQTPN RRQ VIAKAGP+FLKPIPSTGT+GG LYS+RK NNAF+CFAALNARCAAEQTQTVTREAPTITVLPGKEKSPQ
Subjt:  VGQHWGRMKAKDFHLRRDTFQLQTPNERRQPVIAKAGPLFLKPIPSTGTRGGILYSSRK-NNAFMCFAALNARCAAEQTQTVTREAPTITVLPGKEKSPQ

Query:  LDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR
        LDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEG Y++DRRR
Subjt:  LDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR

A0A6J1KZM4 uncharacterized protein LOC111498519 isoform X15.5e-7390.45Show/hide
Query:  VGQHWGRMKAKDFHLRRDTFQLQTPNERRQPVIAKAGPLFLKPIPSTGTRGGILYSSRKNNAFMCFAALNARCAAEQTQTVTREAPTITVLPGKEKSPQL
        VGQHWGRMKA +FHL RD+FQ++T N RRQPVIAKAGPL++KPI STGT GG+LYSSRKNNAF+CFAALNARCAAEQTQTVTREAPTITVLPGKEKSPQL
Subjt:  VGQHWGRMKAKDFHLRRDTFQLQTPNERRQPVIAKAGPLFLKPIPSTGTRGGILYSSRKNNAFMCFAALNARCAAEQTQTVTREAPTITVLPGKEKSPQL

Query:  DDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR
        DDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESE PYQNDRRR
Subjt:  DDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR

SwissProt top hitse value%identityAlignment
F1SZ41 Protein FERTILITY RESTORER RF2, mitochondrial3.8e-0749.4Show/hide
Query:  NARCAAEQTQTVTREAPTITVL----PGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGG--NWSGGFFFFGFLAFLGFLKDKE
        + RC A QTQ+  R++ T TV      GK + P+LDDG  GFPP   G GGGGGGGGGG  N+ GGF  F  +  L +LK+ E
Subjt:  NARCAAEQTQTVTREAPTITVL----PGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGG--NWSGGFFFFGFLAFLGFLKDKE

F1SZ42 Protein FERTILITY RESTORER RF2, mitochondrial3.8e-0749.4Show/hide
Query:  NARCAAEQTQTVTREAPTITVL----PGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGG--NWSGGFFFFGFLAFLGFLKDKE
        + RC A QTQ+  R++ T TV      GK + P+LDDG  GFPP   G GGGGGGGGGG  N+ GGF  F  +  L +LK+ E
Subjt:  NARCAAEQTQTVTREAPTITVL----PGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGG--NWSGGFFFFGFLAFLGFLKDKE

F1SZ44 Protein FERTILITY RESTORER RF2, mitochondrial2.3e-0749.4Show/hide
Query:  NARCAAEQTQTVTREAPTITVL----PGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGG--NWSGGFFFFGFLAFLGFLKDKE
        + RC A QTQ+  R++ T TV      GK + P+LDDG  GFPP   G GGGGGGGGGG  N+ GGF  F  +  L +LK+ E
Subjt:  NARCAAEQTQTVTREAPTITVL----PGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGG--NWSGGFFFFGFLAFLGFLKDKE

P92555 Uncharacterized mitochondrial protein AtMg012503.8e-0748.98Show/hide
Query:  LLNGTPREVFFPNRGLRQGDPLSPYLFLMCAEGLSRILIHAESRKEVTG
        ++NG P+ +  P+RGLRQGDPLSPYLF++C E LS +   A+ +  + G
Subjt:  LLNGTPREVFFPNRGLRQGDPLSPYLFLMCAEGLSRILIHAESRKEVTG

Q0E3V2 Protein YELLOW LEAF 1, choloroplastic7.7e-0844.44Show/hide
Query:  CFAALNARCAAEQTQTVTRE------APTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR
        C A++   C A QTQT  R+      +P    +  K +SP+LDDG +GFPP   G GGGGGGGGG N +GGF  F  +  L +L  +E E   QN  RR
Subjt:  CFAALNARCAAEQTQTVTRE------APTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR

Arabidopsis top hitse value%identityAlignment
AT1G30475.1 BEST Arabidopsis thaliana protein match is: embryo defective 1303 (TAIR:AT1G56200.1)7.1e-2564.29Show/hide
Query:  ILYSSRKNNAFMCFAALNARCAAEQTQTVTREAPTITVLP--GKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESE
        ++ S  K   F C +ALN++C+  QTQTVTR++PTIT  P  GK KSP+LDDG +GFPPRDDG GGGGGGGGGG+ SGGFF FGFL F+G+LKD E E
Subjt:  ILYSSRKNNAFMCFAALNARCAAEQTQTVTREAPTITVLP--GKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESE

AT1G30475.2 FUNCTIONS IN: molecular_function unknown7.1e-2564.29Show/hide
Query:  ILYSSRKNNAFMCFAALNARCAAEQTQTVTREAPTITVLP--GKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESE
        ++ S  K   F C +ALN++C+  QTQTVTR++PTIT  P  GK KSP+LDDG +GFPPRDDG GGGGGGGGGG+ SGGFF FGFL F+G+LKD E E
Subjt:  ILYSSRKNNAFMCFAALNARCAAEQTQTVTREAPTITVLP--GKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESE

AT1G30475.3 FUNCTIONS IN: molecular_function unknown7.1e-2564.29Show/hide
Query:  ILYSSRKNNAFMCFAALNARCAAEQTQTVTREAPTITVLP--GKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESE
        ++ S  K   F C +ALN++C+  QTQTVTR++PTIT  P  GK KSP+LDDG +GFPPRDDG GGGGGGGGGG+ SGGFF FGFL F+G+LKD E E
Subjt:  ILYSSRKNNAFMCFAALNARCAAEQTQTVTREAPTITVLP--GKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESE

AT1G56200.1 embryo defective 13039.3e-3363.87Show/hide
Query:  LKPIPSTGTRGGILYSSRKNNAFMCFAALNARCAAEQTQTVTREAPTITVLP--GKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAF
        + P+ + G   G+   SR+ +  +C +A+NA+C+  QTQTVTRE+PTIT  P   KEKSP LDDG  GFPPRDDGD GGGGGGGGGNWSGGFFFFGFLAF
Subjt:  LKPIPSTGTRGGILYSSRKNNAFMCFAALNARCAAEQTQTVTREAPTITVLP--GKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAF

Query:  LGFLKDKESEGPYQNDRRR
        LG LKDKE E  Y+  RRR
Subjt:  LGFLKDKESEGPYQNDRRR

AT1G56200.2 embryo defective 13039.3e-3363.87Show/hide
Query:  LKPIPSTGTRGGILYSSRKNNAFMCFAALNARCAAEQTQTVTREAPTITVLP--GKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAF
        + P+ + G   G+   SR+ +  +C +A+NA+C+  QTQTVTRE+PTIT  P   KEKSP LDDG  GFPPRDDGD GGGGGGGGGNWSGGFFFFGFLAF
Subjt:  LKPIPSTGTRGGILYSSRKNNAFMCFAALNARCAAEQTQTVTREAPTITVLP--GKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAF

Query:  LGFLKDKESEGPYQNDRRR
        LG LKDKE E  Y+  RRR
Subjt:  LGFLKDKESEGPYQNDRRR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAATCTAATCAGGAGGTGATAGACCAAATTCTAGGGACGGTGCCGAATAGCATATCAGAAGAGCAGAATGTAAAGCTCACAACCCCTTTCACGAGAGAAGAACTCTA
TAGCGTTGTCAAACGTATGCATCCTACTAAAGCACCAGGTCTAGATGGGATGCAAGCCATCTTCTACCAGAAATATTGGGAGGTGGTGGGATCGGAGGTGTGTGACTTCT
GTCTGCAGTATCTGAATGGGACGGAGAGCTTGAAGCAAATTAACAAGACGCGATGGATTGAGCTTGTCATGGGTTGTGTTGAATCAATCACATACCAAGTTTTATTGAAT
GGAACTCCGAGAGAGGTCTTCTTTCCTAATAGGGGGCTTAGGCAGGGTGACCCCCTCTCCCCCTATCTCTTCCTGATGTGTGCAGAGGGTTTGTCTAGGATCCTGATCCA
CGCTGAATCGAGGAAGGAGGTAACAGGTTCTGGTGGGGCTCAGGATCATCAGGCAGGAAGATCCACTAGAGGAGTTGGAAGAAACTTTGCATCCATAAATCTAACGGGGC
ATGGGTTTTCGAGACCTTACCATCTTCAACCAGGCTATGTTAGCGAAGCAGAGCTGGAGGATCTTGAGGTACCCGAGAGCATTCTATCAAGGGTGCTTAAGGGGGGTATT
TCAAGAATGGGAGCTTTCTTAAGGCTGAAAAGCGGCGTCGGATCCTTGGATCCCAAGAGATGGGTCTCCCAAACCGATCCTGCGAATGGAAATGCTTCTAAGTATACTGT
GGCTCAACTCATTCATCCATCTAGAGTCTGGAGAGAAGACCTTGTCAGAGATTTGTTCCTCCAAGGGGATGCTAATGCAATTTTAAACATCCTGATCAGCTCTAGACAAA
GGATGGATGAGATTGTTTGGAACTATGATCCTAGGGGAGCTCCTGGATGGTGGCTGACTTCTGAACGACTTAATGCACAATGGTTTTCAGTCCAACAAAGACAACATCAG
ACATCAGATTCAAAGAGCGTTGAACTCGATTTTGAGAGACGAAGACTCTTACCGGCGAGAAGAAGTTGCGACAAGGGAGATAGATCCAAACGGCGGGAAATCTCCTCTGG
AATGGCTGGGACCACCGGCTGGTTTCTGCAAATTGAATACCGATGCGAGAAGACGAAAAACGGTGGAATTGGATGGGTCCTCAGGCAGTGGGATGGAACTCCGTTGAGCG
CCGACTTTAAATTCATCGACCGTCAATGGAAAATCTCTTGGCTGGAAGCCCTGGCTGTAGTGGAGGGTTTGACATCGATTCCTACATTCTCCCACAAGCTGATTCTCGAG
CTTGACTCAGTGCATGTGGTAAATCTGCTTGCTGGTAGGGAAGAAGATGCAACAGAACTCTCCAACTTCATTGATGAGGCGAAGTCCCAGATGTCCGGCTTTCAAGTTCA
CGAAGTGGTTCATGTTTCTAGAAGGAGAAATTATTTGACCCACCAATTGGCCCAAAAGGCCCATTCCAGTCAAAGGTCTGAGAGCTGGAGTGGTTGGTTCCCGGTGTGGT
TTTTGCAAATGAACAAGATTGATATTAGGGCTATTAATGATCGTAGTGGGGGTGTCTGTCCTACGATAGTAAACCCACGACCAGCGGCGGTTGTAGCCCTCTCTCTCGCG
TGCCGTCTCAACCTCGTGCCGCCGCCGCTTGCTCTGCCTTCGTCTCCCTCTCTTCGTGCCGTTGGCCAGCCGCGAGGAGACCCACAGCTTGAATCTTCTCCTTCTTTCTC
GCGTGTGTCAGGTTGTCATCCACCCTCGCCTGGAGATGCTTACTACAAATTCTGGTGCCCTGTTACCTCTTCTACCCCCAGGAAGCTTAAGAATCCCCAATCCAACATGA
TGACAATTCCAGCTTTTAGTGTGGGTCAGCATTGGGGAAGAATGAAGGCCAAAGATTTCCATCTTAGAAGGGATACTTTCCAACTTCAAACTCCTAATGAGAGGAGACAA
CCAGTGATTGCTAAAGCAGGACCACTTTTTCTGAAACCAATTCCATCTACTGGGACGAGAGGAGGCATTCTATATTCTAGTAGAAAGAACAATGCTTTCATGTGTTTTGC
TGCTTTGAATGCTAGATGTGCAGCGGAGCAGACCCAGACTGTTACGAGAGAGGCTCCTACAATCACTGTTCTTCCTGGCAAGGAAAAGTCACCGCAACTGGATGATGGTG
ATTCTGGATTCCCACCTCGTGATGATGGTGATGGTGGCGGTGGAGGCGGCGGCGGCGGAGGCAACTGGTCTGGTGGATTCTTCTTCTTTGGCTTCCTTGCCTTCCTTGGG
TTCTTAAAAGATAAAGAAAGTGAAGGGCCTTATCAGAATGATCGGAGAAGATAA
mRNA sequenceShow/hide mRNA sequence
ATGCAATCTAATCAGGAGGTGATAGACCAAATTCTAGGGACGGTGCCGAATAGCATATCAGAAGAGCAGAATGTAAAGCTCACAACCCCTTTCACGAGAGAAGAACTCTA
TAGCGTTGTCAAACGTATGCATCCTACTAAAGCACCAGGTCTAGATGGGATGCAAGCCATCTTCTACCAGAAATATTGGGAGGTGGTGGGATCGGAGGTGTGTGACTTCT
GTCTGCAGTATCTGAATGGGACGGAGAGCTTGAAGCAAATTAACAAGACGCGATGGATTGAGCTTGTCATGGGTTGTGTTGAATCAATCACATACCAAGTTTTATTGAAT
GGAACTCCGAGAGAGGTCTTCTTTCCTAATAGGGGGCTTAGGCAGGGTGACCCCCTCTCCCCCTATCTCTTCCTGATGTGTGCAGAGGGTTTGTCTAGGATCCTGATCCA
CGCTGAATCGAGGAAGGAGGTAACAGGTTCTGGTGGGGCTCAGGATCATCAGGCAGGAAGATCCACTAGAGGAGTTGGAAGAAACTTTGCATCCATAAATCTAACGGGGC
ATGGGTTTTCGAGACCTTACCATCTTCAACCAGGCTATGTTAGCGAAGCAGAGCTGGAGGATCTTGAGGTACCCGAGAGCATTCTATCAAGGGTGCTTAAGGGGGGTATT
TCAAGAATGGGAGCTTTCTTAAGGCTGAAAAGCGGCGTCGGATCCTTGGATCCCAAGAGATGGGTCTCCCAAACCGATCCTGCGAATGGAAATGCTTCTAAGTATACTGT
GGCTCAACTCATTCATCCATCTAGAGTCTGGAGAGAAGACCTTGTCAGAGATTTGTTCCTCCAAGGGGATGCTAATGCAATTTTAAACATCCTGATCAGCTCTAGACAAA
GGATGGATGAGATTGTTTGGAACTATGATCCTAGGGGAGCTCCTGGATGGTGGCTGACTTCTGAACGACTTAATGCACAATGGTTTTCAGTCCAACAAAGACAACATCAG
ACATCAGATTCAAAGAGCGTTGAACTCGATTTTGAGAGACGAAGACTCTTACCGGCGAGAAGAAGTTGCGACAAGGGAGATAGATCCAAACGGCGGGAAATCTCCTCTGG
AATGGCTGGGACCACCGGCTGGTTTCTGCAAATTGAATACCGATGCGAGAAGACGAAAAACGGTGGAATTGGATGGGTCCTCAGGCAGTGGGATGGAACTCCGTTGAGCG
CCGACTTTAAATTCATCGACCGTCAATGGAAAATCTCTTGGCTGGAAGCCCTGGCTGTAGTGGAGGGTTTGACATCGATTCCTACATTCTCCCACAAGCTGATTCTCGAG
CTTGACTCAGTGCATGTGGTAAATCTGCTTGCTGGTAGGGAAGAAGATGCAACAGAACTCTCCAACTTCATTGATGAGGCGAAGTCCCAGATGTCCGGCTTTCAAGTTCA
CGAAGTGGTTCATGTTTCTAGAAGGAGAAATTATTTGACCCACCAATTGGCCCAAAAGGCCCATTCCAGTCAAAGGTCTGAGAGCTGGAGTGGTTGGTTCCCGGTGTGGT
TTTTGCAAATGAACAAGATTGATATTAGGGCTATTAATGATCGTAGTGGGGGTGTCTGTCCTACGATAGTAAACCCACGACCAGCGGCGGTTGTAGCCCTCTCTCTCGCG
TGCCGTCTCAACCTCGTGCCGCCGCCGCTTGCTCTGCCTTCGTCTCCCTCTCTTCGTGCCGTTGGCCAGCCGCGAGGAGACCCACAGCTTGAATCTTCTCCTTCTTTCTC
GCGTGTGTCAGGTTGTCATCCACCCTCGCCTGGAGATGCTTACTACAAATTCTGGTGCCCTGTTACCTCTTCTACCCCCAGGAAGCTTAAGAATCCCCAATCCAACATGA
TGACAATTCCAGCTTTTAGTGTGGGTCAGCATTGGGGAAGAATGAAGGCCAAAGATTTCCATCTTAGAAGGGATACTTTCCAACTTCAAACTCCTAATGAGAGGAGACAA
CCAGTGATTGCTAAAGCAGGACCACTTTTTCTGAAACCAATTCCATCTACTGGGACGAGAGGAGGCATTCTATATTCTAGTAGAAAGAACAATGCTTTCATGTGTTTTGC
TGCTTTGAATGCTAGATGTGCAGCGGAGCAGACCCAGACTGTTACGAGAGAGGCTCCTACAATCACTGTTCTTCCTGGCAAGGAAAAGTCACCGCAACTGGATGATGGTG
ATTCTGGATTCCCACCTCGTGATGATGGTGATGGTGGCGGTGGAGGCGGCGGCGGCGGAGGCAACTGGTCTGGTGGATTCTTCTTCTTTGGCTTCCTTGCCTTCCTTGGG
TTCTTAAAAGATAAAGAAAGTGAAGGGCCTTATCAGAATGATCGGAGAAGATAA
Protein sequenceShow/hide protein sequence
MQSNQEVIDQILGTVPNSISEEQNVKLTTPFTREELYSVVKRMHPTKAPGLDGMQAIFYQKYWEVVGSEVCDFCLQYLNGTESLKQINKTRWIELVMGCVESITYQVLLN
GTPREVFFPNRGLRQGDPLSPYLFLMCAEGLSRILIHAESRKEVTGSGGAQDHQAGRSTRGVGRNFASINLTGHGFSRPYHLQPGYVSEAELEDLEVPESILSRVLKGGI
SRMGAFLRLKSGVGSLDPKRWVSQTDPANGNASKYTVAQLIHPSRVWREDLVRDLFLQGDANAILNILISSRQRMDEIVWNYDPRGAPGWWLTSERLNAQWFSVQQRQHQ
TSDSKSVELDFERRRLLPARRSCDKGDRSKRREISSGMAGTTGWFLQIEYRCEKTKNGGIGWVLRQWDGTPLSADFKFIDRQWKISWLEALAVVEGLTSIPTFSHKLILE
LDSVHVVNLLAGREEDATELSNFIDEAKSQMSGFQVHEVVHVSRRRNYLTHQLAQKAHSSQRSESWSGWFPVWFLQMNKIDIRAINDRSGGVCPTIVNPRPAAVVALSLA
CRLNLVPPPLALPSSPSLRAVGQPRGDPQLESSPSFSRVSGCHPPSPGDAYYKFWCPVTSSTPRKLKNPQSNMMTIPAFSVGQHWGRMKAKDFHLRRDTFQLQTPNERRQ
PVIAKAGPLFLKPIPSTGTRGGILYSSRKNNAFMCFAALNARCAAEQTQTVTREAPTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLG
FLKDKESEGPYQNDRRR