; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0041444 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0041444
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionDUF3067 domain-containing protein
Genome locationchr13:18000689..18015901
RNA-Seq ExpressionLag0041444
SyntenyLag0041444
Gene Ontology termsNA
InterPro domainsIPR021420 - Protein of unknown function DUF3067
IPR026960 - Reverse transcriptase zinc-binding domain
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6607672.1 hypothetical protein SDJN03_01014, partial [Cucurbita argyrosperma subsp. sororia]1.5e-11692.14Show/hide
Query:  IKIMTSVTFGHQSNWSPGIGIQEHLGRGKISRV--SYSEYCCPFAPSRETITHHGNQVAYCRRRSGRHVRPVLAGSGDGAMDPDESDRGINNSESPKGEN
        +KIMTSVTFGHQSNWSPGIGIQEHLGR KISRV  SYSEYCCPF PSRET THHGN VAYCRRR+GR +RPVLAGS DGAMDPDESDR IN+SESPK EN
Subjt:  IKIMTSVTFGHQSNWSPGIGIQEHLGRGKISRV--SYSEYCCPFAPSRETITHHGNQVAYCRRRSGRHVRPVLAGSGDGAMDPDESDRGINNSESPKGEN

Query:  GGMNFEMLRENLERSVGNDDSRFSGIDLATLIRNKYGRSYDVQLIKKEFMGRNLLALNVMWKYREQKSFPLSEEEYLLRLDGVANTLKCWGAVAHIRNSL
        GGMNFEMLRENLERSVG DDSRFSGIDLAT+IRNKYG+SYDVQLIKKEFMGRNLLALNVMWKYREQKSFPLSEEEYLLRLDGVANTLKCWGAVAHIRNSL
Subjt:  GGMNFEMLRENLERSVGNDDSRFSGIDLATLIRNKYGRSYDVQLIKKEFMGRNLLALNVMWKYREQKSFPLSEEEYLLRLDGVANTLKCWGAVAHIRNSL

Query:  DKTKDRPRIGKAVSIFIDMDESGGRANEY
        DKTKDRPRIGKAVSIFIDMDESGGRANE+
Subjt:  DKTKDRPRIGKAVSIFIDMDESGGRANEY

KAG7028479.1 hypothetical protein SDJN02_09660, partial [Cucurbita argyrosperma subsp. argyrosperma]1.5e-11692.14Show/hide
Query:  IKIMTSVTFGHQSNWSPGIGIQEHLGRGKISRV--SYSEYCCPFAPSRETITHHGNQVAYCRRRSGRHVRPVLAGSGDGAMDPDESDRGINNSESPKGEN
        +KIMTSVTFGHQSNWSPGIGIQEHLGR KISRV  SYSEYCCPF PSRET THHGN VAYCRRR+GR +RPVLAGS DGAMDPDESDR IN+SESPK EN
Subjt:  IKIMTSVTFGHQSNWSPGIGIQEHLGRGKISRV--SYSEYCCPFAPSRETITHHGNQVAYCRRRSGRHVRPVLAGSGDGAMDPDESDRGINNSESPKGEN

Query:  GGMNFEMLRENLERSVGNDDSRFSGIDLATLIRNKYGRSYDVQLIKKEFMGRNLLALNVMWKYREQKSFPLSEEEYLLRLDGVANTLKCWGAVAHIRNSL
        GGMNFEMLRENLERSVG DDSRFSGIDLAT+IRNKYG+SYDVQLIKKEFMGRNLLALNVMWKYREQKSFPLSEEEYLLRLDGVANTLKCWGAVAHIRNSL
Subjt:  GGMNFEMLRENLERSVGNDDSRFSGIDLATLIRNKYGRSYDVQLIKKEFMGRNLLALNVMWKYREQKSFPLSEEEYLLRLDGVANTLKCWGAVAHIRNSL

Query:  DKTKDRPRIGKAVSIFIDMDESGGRANEY
        DKTKDRPRIGKAVSIFIDMDESGGRANE+
Subjt:  DKTKDRPRIGKAVSIFIDMDESGGRANEY

XP_022926184.1 uncharacterized protein LOC111433376 [Cucurbita moschata]1.5e-11692.14Show/hide
Query:  IKIMTSVTFGHQSNWSPGIGIQEHLGRGKISRV--SYSEYCCPFAPSRETITHHGNQVAYCRRRSGRHVRPVLAGSGDGAMDPDESDRGINNSESPKGEN
        +KIMTSVTFGHQSNWSPGIGIQEHLGR KISRV  SYSEYCCPF PSRET THHGN VAYCRRR+GR +RPVLAGS DGAMDPDESDR IN+SESPK EN
Subjt:  IKIMTSVTFGHQSNWSPGIGIQEHLGRGKISRV--SYSEYCCPFAPSRETITHHGNQVAYCRRRSGRHVRPVLAGSGDGAMDPDESDRGINNSESPKGEN

Query:  GGMNFEMLRENLERSVGNDDSRFSGIDLATLIRNKYGRSYDVQLIKKEFMGRNLLALNVMWKYREQKSFPLSEEEYLLRLDGVANTLKCWGAVAHIRNSL
        GGMNFEMLRENLERSVG DDSRFSGIDLAT+IRNKYG+SYDVQLIKKEFMGRNLLALNVMWKYREQKSFPLSEEEYLLRLDGVANTLKCWGAVAHIRNSL
Subjt:  GGMNFEMLRENLERSVGNDDSRFSGIDLATLIRNKYGRSYDVQLIKKEFMGRNLLALNVMWKYREQKSFPLSEEEYLLRLDGVANTLKCWGAVAHIRNSL

Query:  DKTKDRPRIGKAVSIFIDMDESGGRANEY
        DKTKDRPRIGKAVSIFIDMDESGGRANE+
Subjt:  DKTKDRPRIGKAVSIFIDMDESGGRANEY

XP_022981573.1 uncharacterized protein LOC111480651 [Cucurbita maxima]3.4e-11691.7Show/hide
Query:  IKIMTSVTFGHQSNWSPGIGIQEHLGRGKISRV--SYSEYCCPFAPSRETITHHGNQVAYCRRRSGRHVRPVLAGSGDGAMDPDESDRGINNSESPKGEN
        +KIMTSVTFGHQSNWSPGIGIQEHLGR KISRV  SYSEYCCPF PSRET THHGN VAYCRRR+GR +RPVLAGS DGAMDPDESDR IN+SE+PK EN
Subjt:  IKIMTSVTFGHQSNWSPGIGIQEHLGRGKISRV--SYSEYCCPFAPSRETITHHGNQVAYCRRRSGRHVRPVLAGSGDGAMDPDESDRGINNSESPKGEN

Query:  GGMNFEMLRENLERSVGNDDSRFSGIDLATLIRNKYGRSYDVQLIKKEFMGRNLLALNVMWKYREQKSFPLSEEEYLLRLDGVANTLKCWGAVAHIRNSL
        GGMNFEMLRENLERSVG DDSRFSGIDLAT+IRNKYG+SYDVQLIKKEFMGRNLLALNVMWKYREQKSFPLSEEEYLLRLDGVANTLKCWGAVAHIRNSL
Subjt:  GGMNFEMLRENLERSVGNDDSRFSGIDLATLIRNKYGRSYDVQLIKKEFMGRNLLALNVMWKYREQKSFPLSEEEYLLRLDGVANTLKCWGAVAHIRNSL

Query:  DKTKDRPRIGKAVSIFIDMDESGGRANEY
        DKTKDRPRIGKAVSIFIDMDESGGRANE+
Subjt:  DKTKDRPRIGKAVSIFIDMDESGGRANEY

XP_023525612.1 uncharacterized protein LOC111789180 [Cucurbita pepo subsp. pepo]1.5e-11692.14Show/hide
Query:  IKIMTSVTFGHQSNWSPGIGIQEHLGRGKISRV--SYSEYCCPFAPSRETITHHGNQVAYCRRRSGRHVRPVLAGSGDGAMDPDESDRGINNSESPKGEN
        +KIMTSVTFGHQSNWSPGIGIQEHLGR KISRV  SYSEYCCPF PSRET THHGN VAYCRRR+GR +RPVLAGS DGAMDPDESDR IN+SESPK EN
Subjt:  IKIMTSVTFGHQSNWSPGIGIQEHLGRGKISRV--SYSEYCCPFAPSRETITHHGNQVAYCRRRSGRHVRPVLAGSGDGAMDPDESDRGINNSESPKGEN

Query:  GGMNFEMLRENLERSVGNDDSRFSGIDLATLIRNKYGRSYDVQLIKKEFMGRNLLALNVMWKYREQKSFPLSEEEYLLRLDGVANTLKCWGAVAHIRNSL
        GGMNFEMLRENLERSVG DDSRFSGIDLAT+IRNKYG+SYDVQLIKKEFMGRNLLALNVMWKYREQKSFPLSEEEYLLRLDGVANTLKCWGAVAHIRNSL
Subjt:  GGMNFEMLRENLERSVGNDDSRFSGIDLATLIRNKYGRSYDVQLIKKEFMGRNLLALNVMWKYREQKSFPLSEEEYLLRLDGVANTLKCWGAVAHIRNSL

Query:  DKTKDRPRIGKAVSIFIDMDESGGRANEY
        DKTKDRPRIGKAVSIFIDMDESGGRANE+
Subjt:  DKTKDRPRIGKAVSIFIDMDESGGRANEY

TrEMBL top hitse value%identityAlignment
A0A1S3C5E8 uncharacterized protein LOC1034966697.0e-10788.84Show/hide
Query:  MTSVTFGHQSNWSPGIGIQEHLGRGKISRVSYSEYCCPFAPSRETITHHGNQVAYCRRRSGRHVRPVLAGSGDGAMDPDESDRGINNSESPKGENGGMNF
        MTSVTFGHQSNWSPGIGIQEH GR KISRV     CCPF PS    THHGN VAYCRRR+GR +RPVLAGS DGAMDPD+ DR IN+SESPK ENGGMNF
Subjt:  MTSVTFGHQSNWSPGIGIQEHLGRGKISRVSYSEYCCPFAPSRETITHHGNQVAYCRRRSGRHVRPVLAGSGDGAMDPDESDRGINNSESPKGENGGMNF

Query:  EMLRENLERSVGNDDSRFSGIDLATLIRNKYGRSYDVQLIKKEFMGRNLLALNVMWKYREQKSFPLSEEEYLLRLDGVANTLKCWGAVAHIRNSLDKTKD
        EMLRENLERSVG DDSRFSGIDLATLIRNKYG+SYDVQLIKKEFMGRNLLALNVMWKYREQKSFPLSEEEYLLRLDGVANTLKCWGAVAHIRNSLDKTKD
Subjt:  EMLRENLERSVGNDDSRFSGIDLATLIRNKYGRSYDVQLIKKEFMGRNLLALNVMWKYREQKSFPLSEEEYLLRLDGVANTLKCWGAVAHIRNSLDKTKD

Query:  RPRIGKAVSIFIDMDESGGRANEY
        RPRIGKAVSIFIDMDESGGRANE+
Subjt:  RPRIGKAVSIFIDMDESGGRANEY

A0A5A7SMD4 DUF3067 domain-containing protein7.0e-10788.84Show/hide
Query:  MTSVTFGHQSNWSPGIGIQEHLGRGKISRVSYSEYCCPFAPSRETITHHGNQVAYCRRRSGRHVRPVLAGSGDGAMDPDESDRGINNSESPKGENGGMNF
        MTSVTFGHQSNWSPGIGIQEH GR KISRV     CCPF PS    THHGN VAYCRRR+GR +RPVLAGS DGAMDPD+ DR IN+SESPK ENGGMNF
Subjt:  MTSVTFGHQSNWSPGIGIQEHLGRGKISRVSYSEYCCPFAPSRETITHHGNQVAYCRRRSGRHVRPVLAGSGDGAMDPDESDRGINNSESPKGENGGMNF

Query:  EMLRENLERSVGNDDSRFSGIDLATLIRNKYGRSYDVQLIKKEFMGRNLLALNVMWKYREQKSFPLSEEEYLLRLDGVANTLKCWGAVAHIRNSLDKTKD
        EMLRENLERSVG DDSRFSGIDLATLIRNKYG+SYDVQLIKKEFMGRNLLALNVMWKYREQKSFPLSEEEYLLRLDGVANTLKCWGAVAHIRNSLDKTKD
Subjt:  EMLRENLERSVGNDDSRFSGIDLATLIRNKYGRSYDVQLIKKEFMGRNLLALNVMWKYREQKSFPLSEEEYLLRLDGVANTLKCWGAVAHIRNSLDKTKD

Query:  RPRIGKAVSIFIDMDESGGRANEY
        RPRIGKAVSIFIDMDESGGRANE+
Subjt:  RPRIGKAVSIFIDMDESGGRANEY

A0A6J1DNM9 uncharacterized protein LOC1110223111.7e-10084.35Show/hide
Query:  MTSVTFGHQSNWSPGIGIQEHLGRGKISRVSYSEYCCPFAPSRET--ITHHGNQVAYC-RRRSGRHVRPVLA-GSGDGAMDPDESDRGINNSESPKGENG
        MTSVT GH SNWSPGIGI+EHLGR  ISRVSYSEYC PF PSRET   T HGN++AYC RRRSGR  RPV A GS DGA DPD+SD G+N S+SP+ EN 
Subjt:  MTSVTFGHQSNWSPGIGIQEHLGRGKISRVSYSEYCCPFAPSRET--ITHHGNQVAYC-RRRSGRHVRPVLA-GSGDGAMDPDESDRGINNSESPKGENG

Query:  --GMNFEMLRENLERSVGNDDSRFSGIDLATLIRNKYGRSYDVQLIKKEFMGRNLLALNVMWKYREQKSFPLSEEEYLLRLDGVANTLKCWGAVAHIRNS
           MNF+MLRENLERSVG DDSRFSGIDLATLIRNKYGRSYDVQLIKKEFMGRNLLALNVMWKYREQKSFPLSEEEYLLRLDGVANTLKCWGAVAHIRNS
Subjt:  --GMNFEMLRENLERSVGNDDSRFSGIDLATLIRNKYGRSYDVQLIKKEFMGRNLLALNVMWKYREQKSFPLSEEEYLLRLDGVANTLKCWGAVAHIRNS

Query:  LDKTKDRPRIGKAVSIFIDMDESGGRANEY
        L KTKDRPRIGKAVSIFIDMDESGGRA E+
Subjt:  LDKTKDRPRIGKAVSIFIDMDESGGRANEY

A0A6J1EEF2 uncharacterized protein LOC1114333767.5e-11792.14Show/hide
Query:  IKIMTSVTFGHQSNWSPGIGIQEHLGRGKISRV--SYSEYCCPFAPSRETITHHGNQVAYCRRRSGRHVRPVLAGSGDGAMDPDESDRGINNSESPKGEN
        +KIMTSVTFGHQSNWSPGIGIQEHLGR KISRV  SYSEYCCPF PSRET THHGN VAYCRRR+GR +RPVLAGS DGAMDPDESDR IN+SESPK EN
Subjt:  IKIMTSVTFGHQSNWSPGIGIQEHLGRGKISRV--SYSEYCCPFAPSRETITHHGNQVAYCRRRSGRHVRPVLAGSGDGAMDPDESDRGINNSESPKGEN

Query:  GGMNFEMLRENLERSVGNDDSRFSGIDLATLIRNKYGRSYDVQLIKKEFMGRNLLALNVMWKYREQKSFPLSEEEYLLRLDGVANTLKCWGAVAHIRNSL
        GGMNFEMLRENLERSVG DDSRFSGIDLAT+IRNKYG+SYDVQLIKKEFMGRNLLALNVMWKYREQKSFPLSEEEYLLRLDGVANTLKCWGAVAHIRNSL
Subjt:  GGMNFEMLRENLERSVGNDDSRFSGIDLATLIRNKYGRSYDVQLIKKEFMGRNLLALNVMWKYREQKSFPLSEEEYLLRLDGVANTLKCWGAVAHIRNSL

Query:  DKTKDRPRIGKAVSIFIDMDESGGRANEY
        DKTKDRPRIGKAVSIFIDMDESGGRANE+
Subjt:  DKTKDRPRIGKAVSIFIDMDESGGRANEY

A0A6J1J2G4 uncharacterized protein LOC1114806511.7e-11691.7Show/hide
Query:  IKIMTSVTFGHQSNWSPGIGIQEHLGRGKISRV--SYSEYCCPFAPSRETITHHGNQVAYCRRRSGRHVRPVLAGSGDGAMDPDESDRGINNSESPKGEN
        +KIMTSVTFGHQSNWSPGIGIQEHLGR KISRV  SYSEYCCPF PSRET THHGN VAYCRRR+GR +RPVLAGS DGAMDPDESDR IN+SE+PK EN
Subjt:  IKIMTSVTFGHQSNWSPGIGIQEHLGRGKISRV--SYSEYCCPFAPSRETITHHGNQVAYCRRRSGRHVRPVLAGSGDGAMDPDESDRGINNSESPKGEN

Query:  GGMNFEMLRENLERSVGNDDSRFSGIDLATLIRNKYGRSYDVQLIKKEFMGRNLLALNVMWKYREQKSFPLSEEEYLLRLDGVANTLKCWGAVAHIRNSL
        GGMNFEMLRENLERSVG DDSRFSGIDLAT+IRNKYG+SYDVQLIKKEFMGRNLLALNVMWKYREQKSFPLSEEEYLLRLDGVANTLKCWGAVAHIRNSL
Subjt:  GGMNFEMLRENLERSVGNDDSRFSGIDLATLIRNKYGRSYDVQLIKKEFMGRNLLALNVMWKYREQKSFPLSEEEYLLRLDGVANTLKCWGAVAHIRNSL

Query:  DKTKDRPRIGKAVSIFIDMDESGGRANEY
        DKTKDRPRIGKAVSIFIDMDESGGRANE+
Subjt:  DKTKDRPRIGKAVSIFIDMDESGGRANEY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G43730.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein4.9e-0428.1Show/hide
Query:  LSLLAVQSFHLERRDSWFWTP---KSSDGFSCASFFHSLSVSPSSRSL-HFSSIW-KVKIPKKVKFFTSEVLHGRVSTQDRIQRFSFVLRPQWCVLYRSQ
        L + AV     +  DS+ W       S+ FS A    SL++ P +  +  + ++W K  +PK   F    V   R+ T+DR++ +   + P  C+L  S 
Subjt:  LSLLAVQSFHLERRDSWFWTP---KSSDGFSCASFFHSLSVSPSSRSL-HFSSIW-KVKIPKKVKFFTSEVLHGRVSTQDRIQRFSFVLRPQWCVLYRSQ

Query:  EEDLDHLLWDCPYVSSIWSQF
        +E   HL ++CP+  ++W  F
Subjt:  EEDLDHLLWDCPYVSSIWSQF

AT2G02650.1 Ribonuclease H-like superfamily protein2.9e-0429.85Show/hide
Query:  SIWKVKIPKKVKFFTSEVLHGRVSTQDRIQRFSFVLRP--QWCVLYRSQEEDLDHLLWDCPYVSSIW
        +IWK+ +  K+K F    + G ++T  R++  +    P  Q C +   +EE + H++++CPY  S+W
Subjt:  SIWKVKIPKKVKFFTSEVLHGRVSTQDRIQRFSFVLRP--QWCVLYRSQEEDLDHLLWDCPYVSSIW

AT3G19900.1 unknown protein5.9e-5857.21Show/hide
Query:  IQEHLGRGKISRVSYSEYCCPFAPSRETITHH-GNQVAYCRRRSGRHVRPVLAGSGDGAMDPDESDRGINNSESPKGENGGMNFEMLRENLERSVGNDDS
        I E  G G IS+VS S YC P    R   +H    +  Y +    +  R       +G +D  + +   +  +  K +   MN    R NLER VG+DDS
Subjt:  IQEHLGRGKISRVSYSEYCCPFAPSRETITHH-GNQVAYCRRRSGRHVRPVLAGSGDGAMDPDESDRGINNSESPKGENGGMNFEMLRENLERSVGNDDS

Query:  RFSGIDLATLIRNKYGRSYDVQLIKKEFMGRNLLALNVMWKYREQKSFPLSEEEYLLRLDGVANTLKCWGAVAHIRNSLDKTKDRPRIGKAVSIFIDMDE
         F+G+DLATLIR KYG+SYDVQLIKKEFMG+NLLA+NVMWKYREQ+SFPL+EEEY+LRLD VAN LKCWGAV+HIR+SL K+K+RPRIGKAVSIFIDMD 
Subjt:  RFSGIDLATLIRNKYGRSYDVQLIKKEFMGRNLLALNVMWKYREQKSFPLSEEEYLLRLDGVANTLKCWGAVAHIRNSLDKTKDRPRIGKAVSIFIDMDE

Query:  SGGRANEY
        +GGRANE+
Subjt:  SGGRANEY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GAATTCAAAGGAATTTTTCCCGAGGAGTTGCCTAAGGAGTTACCACCCTTGAGAGACATAGAGCATAAGATTGATTTCATTCCAGGAGCTCAGATTCCAAATCGACCAGC
CTATCGAACCAATCCACAGGAGGCAAAGAGATTCAAAGGCAGTGAGAAGCTGACTGGAGCAACCTTGAAGTATCCTACATATGACAAGGAACTTTATGCGTTGGTAAGAG
CGCTACAAACATGGCAGCATTACTTGTGGCCAAAAGAATTCGTCATTCACACAGATCATCAGAATGCAGAATTTGATTCGAGGACGAATCCTTCAAAAGGGGGAGAATAT
GATATGAACCAAGACATCATTGCTTTACCTCCAGGACCAATCACACGCTCAAGGGCTAAGCAACTACAATTGGCCTTGGATTCCCACATACAAACAATGGTGAACTCAAT
TAAGGAAGAAGGAGCCTCATCACTCTCTTTGCAATTCTTGAAACCAGGGTTGCAAGCTTGGACATTCAGAACGTCTTTCTTGATTGCTCTGCAACAACGAAGAACACCAC
CATTTGATCTTCCCCGCTATAACTCTGAGATTAGGAACGTACGCTCCTCTAATGAACAACCTGTTTATGGTCCAACCATTAAACAGAAAGTTCTTCTCGGGCCAGATACC
CCCGCTCGCATGTCTCCTACATGGACGCCTTGGATTAATACGTCTGTATCGAATACAAAGTGGACCGTATCACATAGTGTTACCAGGATAAGATATGAAGCTCATGTAGT
GGACTTTGTAATGGATCCTGGAATGGATTTTGTAATGGAAAAGTCCACATCAGAAACTGGGAAGAGAAACTCTGGTGAAGAGAAATCTGAAATGCAAACTGTTGAAGAAA
CTTCAACCGAGAATATGAAGGATGAGAGAATCCAAGCAATTAAGATTATGACGAGCGTAACGTTCGGTCACCAGAGTAATTGGTCGCCGGGGATTGGGATTCAAGAGCAT
CTAGGTAGAGGAAAGATTTCTAGGGTTTCGTATTCAGAGTATTGTTGTCCATTTGCTCCTTCCAGAGAAACAATCACTCACCATGGGAATCAGGTTGCTTACTGTCGCCG
GCGAAGTGGACGGCATGTGAGACCTGTTCTTGCCGGTTCCGGCGACGGTGCGATGGATCCAGACGAATCGGATCGTGGGATTAACAACAGTGAGAGTCCCAAAGGCGAAA
ATGGAGGGATGAACTTCGAAATGCTTAGGGAAAACTTGGAGAGATCGGTTGGGAATGATGATTCCAGATTTAGTGGGATTGACCTTGCTACTCTTATTAGGAACAAGTAT
GGCAGGTCTTATGACGTTCAACTAATAAAGAAGGAGTTCATGGGACGAAATCTTCTTGCTTTAAATGTCATGTGGAAATACAGAGAGCAGAAATCATTTCCATTGTCCGA
GGAAGAGTATCTATTGCGACTCGATGGTGTGGCCAATACATTAAAGTGCTGGGGAGCCGTTGCTCACATTCGAAACAGCTTAGACAAAACTAAAGACAGGCCTCGGATAG
GAAAGGCAGTCAGTATTTTTATCGACATGGACGAGTCTGGTGGTCGTGCCAATGAGTATCATTCGGTAGCCTCGGTCTTCCTCGATTCCGACTCCTCTCCTGCTCTTGGC
TTTCGTCGTTCCCTTACTGATAGAGAGGTGGCAGAGGTCGTCAGTTTCCTTTCCCTATTGGCTGTCCAATCTTTTCATCTTGAGAGAAGGGATTCTTGGTTCTGGACCCC
AAAGTCTTCTGATGGTTTCTCTTGTGCTTCTTTTTTCCATTCTCTTTCTGTCTCCCCCTCCTCCAGGAGCCTCCATTTTTCCTCGATATGGAAGGTCAAAATTCCCAAGA
AAGTGAAGTTCTTTACCTCGGAAGTTTTACATGGTCGAGTGAGTACTCAGGATCGGATCCAAAGGTTCTCCTTCGTGTTGCGACCGCAGTGGTGTGTCCTCTATAGGAGT
CAGGAGGAAGATCTAGATCATTTGCTTTGGGATTGTCCTTACGTCAGCTCTATTTGGAGTCAGTTCTTCAGGATGTTTGGGGTTGCTTCGACTTGTAACATTGACTGCTT
CTCTATGTTTGAGGAAATCCTGTGGCATCATCTTTTTGTGACAAAAGGAGAGTTCTTTGGCAAACGTGCTTCTTTGCTATTTGTGGTATATTTGGCTTGA
mRNA sequenceShow/hide mRNA sequence
GAATTCAAAGGAATTTTTCCCGAGGAGTTGCCTAAGGAGTTACCACCCTTGAGAGACATAGAGCATAAGATTGATTTCATTCCAGGAGCTCAGATTCCAAATCGACCAGC
CTATCGAACCAATCCACAGGAGGCAAAGAGATTCAAAGGCAGTGAGAAGCTGACTGGAGCAACCTTGAAGTATCCTACATATGACAAGGAACTTTATGCGTTGGTAAGAG
CGCTACAAACATGGCAGCATTACTTGTGGCCAAAAGAATTCGTCATTCACACAGATCATCAGAATGCAGAATTTGATTCGAGGACGAATCCTTCAAAAGGGGGAGAATAT
GATATGAACCAAGACATCATTGCTTTACCTCCAGGACCAATCACACGCTCAAGGGCTAAGCAACTACAATTGGCCTTGGATTCCCACATACAAACAATGGTGAACTCAAT
TAAGGAAGAAGGAGCCTCATCACTCTCTTTGCAATTCTTGAAACCAGGGTTGCAAGCTTGGACATTCAGAACGTCTTTCTTGATTGCTCTGCAACAACGAAGAACACCAC
CATTTGATCTTCCCCGCTATAACTCTGAGATTAGGAACGTACGCTCCTCTAATGAACAACCTGTTTATGGTCCAACCATTAAACAGAAAGTTCTTCTCGGGCCAGATACC
CCCGCTCGCATGTCTCCTACATGGACGCCTTGGATTAATACGTCTGTATCGAATACAAAGTGGACCGTATCACATAGTGTTACCAGGATAAGATATGAAGCTCATGTAGT
GGACTTTGTAATGGATCCTGGAATGGATTTTGTAATGGAAAAGTCCACATCAGAAACTGGGAAGAGAAACTCTGGTGAAGAGAAATCTGAAATGCAAACTGTTGAAGAAA
CTTCAACCGAGAATATGAAGGATGAGAGAATCCAAGCAATTAAGATTATGACGAGCGTAACGTTCGGTCACCAGAGTAATTGGTCGCCGGGGATTGGGATTCAAGAGCAT
CTAGGTAGAGGAAAGATTTCTAGGGTTTCGTATTCAGAGTATTGTTGTCCATTTGCTCCTTCCAGAGAAACAATCACTCACCATGGGAATCAGGTTGCTTACTGTCGCCG
GCGAAGTGGACGGCATGTGAGACCTGTTCTTGCCGGTTCCGGCGACGGTGCGATGGATCCAGACGAATCGGATCGTGGGATTAACAACAGTGAGAGTCCCAAAGGCGAAA
ATGGAGGGATGAACTTCGAAATGCTTAGGGAAAACTTGGAGAGATCGGTTGGGAATGATGATTCCAGATTTAGTGGGATTGACCTTGCTACTCTTATTAGGAACAAGTAT
GGCAGGTCTTATGACGTTCAACTAATAAAGAAGGAGTTCATGGGACGAAATCTTCTTGCTTTAAATGTCATGTGGAAATACAGAGAGCAGAAATCATTTCCATTGTCCGA
GGAAGAGTATCTATTGCGACTCGATGGTGTGGCCAATACATTAAAGTGCTGGGGAGCCGTTGCTCACATTCGAAACAGCTTAGACAAAACTAAAGACAGGCCTCGGATAG
GAAAGGCAGTCAGTATTTTTATCGACATGGACGAGTCTGGTGGTCGTGCCAATGAGTATCATTCGGTAGCCTCGGTCTTCCTCGATTCCGACTCCTCTCCTGCTCTTGGC
TTTCGTCGTTCCCTTACTGATAGAGAGGTGGCAGAGGTCGTCAGTTTCCTTTCCCTATTGGCTGTCCAATCTTTTCATCTTGAGAGAAGGGATTCTTGGTTCTGGACCCC
AAAGTCTTCTGATGGTTTCTCTTGTGCTTCTTTTTTCCATTCTCTTTCTGTCTCCCCCTCCTCCAGGAGCCTCCATTTTTCCTCGATATGGAAGGTCAAAATTCCCAAGA
AAGTGAAGTTCTTTACCTCGGAAGTTTTACATGGTCGAGTGAGTACTCAGGATCGGATCCAAAGGTTCTCCTTCGTGTTGCGACCGCAGTGGTGTGTCCTCTATAGGAGT
CAGGAGGAAGATCTAGATCATTTGCTTTGGGATTGTCCTTACGTCAGCTCTATTTGGAGTCAGTTCTTCAGGATGTTTGGGGTTGCTTCGACTTGTAACATTGACTGCTT
CTCTATGTTTGAGGAAATCCTGTGGCATCATCTTTTTGTGACAAAAGGAGAGTTCTTTGGCAAACGTGCTTCTTTGCTATTTGTGGTATATTTGGCTTGA
Protein sequenceShow/hide protein sequence
EFKGIFPEELPKELPPLRDIEHKIDFIPGAQIPNRPAYRTNPQEAKRFKGSEKLTGATLKYPTYDKELYALVRALQTWQHYLWPKEFVIHTDHQNAEFDSRTNPSKGGEY
DMNQDIIALPPGPITRSRAKQLQLALDSHIQTMVNSIKEEGASSLSLQFLKPGLQAWTFRTSFLIALQQRRTPPFDLPRYNSEIRNVRSSNEQPVYGPTIKQKVLLGPDT
PARMSPTWTPWINTSVSNTKWTVSHSVTRIRYEAHVVDFVMDPGMDFVMEKSTSETGKRNSGEEKSEMQTVEETSTENMKDERIQAIKIMTSVTFGHQSNWSPGIGIQEH
LGRGKISRVSYSEYCCPFAPSRETITHHGNQVAYCRRRSGRHVRPVLAGSGDGAMDPDESDRGINNSESPKGENGGMNFEMLRENLERSVGNDDSRFSGIDLATLIRNKY
GRSYDVQLIKKEFMGRNLLALNVMWKYREQKSFPLSEEEYLLRLDGVANTLKCWGAVAHIRNSLDKTKDRPRIGKAVSIFIDMDESGGRANEYHSVASVFLDSDSSPALG
FRRSLTDREVAEVVSFLSLLAVQSFHLERRDSWFWTPKSSDGFSCASFFHSLSVSPSSRSLHFSSIWKVKIPKKVKFFTSEVLHGRVSTQDRIQRFSFVLRPQWCVLYRS
QEEDLDHLLWDCPYVSSIWSQFFRMFGVASTCNIDCFSMFEEILWHHLFVTKGEFFGKRASLLFVVYLA