; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MELO3C034471 (gene) of Melon (DHL92) v4 genome

Gene IDMELO3C034471
OrganismCucumis melo DHL92 (Melon (DHL92) v4)
DescriptionReverse transcriptase
Genome locationchr10:19410269..19422028
RNA-Seq ExpressionMELO3C034471
SyntenyMELO3C034471
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0004519 - endonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035455.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]1.7e-0392.86Show/hide
Query:  MKVLSRVDRYRYRKALFVDFTPSFGLRS
        MKVLSRVDRYRYRKALFV FTPSFGLR+
Subjt:  MKVLSRVDRYRYRKALFVDFTPSFGLRS

KAA0035455.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]1.1e-10373.01Show/hide
Query:  WQTTPIESGRDARSYPSTYCEAKRDEFLGLKQGSLSVAEYERKYTKLSRYADVIVASESDRCRRFEKGF----------VLKY-----LVETALRVELSI
        WQT       + + YPSTYCEAKRDEFLGLKQGSLSVAEYERKYT+LSRYADVI+ASESDRCRRFE+G           + K+     LVETALRVE SI
Subjt:  WQTTPIESGRDARSYPSTYCEAKRDEFLGLKQGSLSVAEYERKYTKLSRYADVIVASESDRCRRFEKGF----------VLKY-----LVETALRVELSI

Query:  TEEKSAVELSRGTSTASGFRGREQRRFTPGINISSRLDFKNRSRSQASRNMSYGSVFHRQSQRIPSQPIRSTVRSQPGQESVATTVRLTPCTSCGRNHRG
        TEEKSAVELSRGTSTASGFRGREQRRFTPGINISSR DFKNRS  QASRN+SYGSVF RQSQRIPSQPIRSTVRSQPGQES+A+TVR  PCTSCGRNHRG
Subjt:  TEEKSAVELSRGTSTASGFRGREQRRFTPGINISSRLDFKNRSRSQASRNMSYGSVFHRQSQRIPSQPIRSTVRSQPGQESVATTVRLTPCTSCGRNHRG

Query:  QCLVGAGVCYQCGQQGHFKKDCPQLNMTVQRDQGFGSQTVKQLRVSVVPTEGAECTGGVRISILYG--WKAYSIYYSFHEEIVQVKEVV
        QCLVGAGVCYQCGQ GHFKKDCPQLNMTVQRDQG GSQT++Q RVSVVPTEG   T G R   + G   +   +Y    +E+    +V+
Subjt:  QCLVGAGVCYQCGQQGHFKKDCPQLNMTVQRDQGFGSQTVKQLRVSVVPTEGAECTGGVRISILYG--WKAYSIYYSFHEEIVQVKEVV

KAA0035808.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]6.5e-10459.34Show/hide
Query:  MKVLSRVDRYRYRKALFVDFTPSFGLR------------------------------------SHATTYWQTTPIESGRDARS-----------------
        MKVLSRVDRYRYRKALFV FTPSFGLR                                          +  T  E GR  R+                 
Subjt:  MKVLSRVDRYRYRKALFVDFTPSFGLR------------------------------------SHATTYWQTTPIESGRDARS-----------------

Query:  ---------------------------------------------YPSTYCEAKRDEFLGLKQGSLSVAEYERKYTKLSRYADVIVASESDRCRRFEKGF
                                                     YPSTYCEAKRDEFLGLKQGSL+VAEYERKYT+LSRYADVI+ASESDRCRRFE+G 
Subjt:  ---------------------------------------------YPSTYCEAKRDEFLGLKQGSLSVAEYERKYTKLSRYADVIVASESDRCRRFEKGF

Query:  ----------VLKY-----LVETALRVELSITEEKSAVELSRGTSTASGFRGREQRRFTPGINISSRLDFKNRSRSQASRNMSYGSVFHRQSQRIPSQPI
                  + K+     LVETAL VE SIT+EKSAVELSRGTSTASGFRGREQRRFTPGINISSR DFKNRS  QASRN+SYGSVF RQSQRIPSQPI
Subjt:  ----------VLKY-----LVETALRVELSITEEKSAVELSRGTSTASGFRGREQRRFTPGINISSRLDFKNRSRSQASRNMSYGSVFHRQSQRIPSQPI

Query:  RSTVRSQPGQESVATTVRLTPCTSCGRNHRGQCLVGAGVCYQCGQQGHFKKDCPQLNMTVQRDQGFGSQTVKQLRVSVVPTEGAECTGGVR
         STVRSQPGQES+A+TVR  PCTSCGRNHRGQCLVGAGVCYQCGQ GHFKKDCPQLNMTVQRDQG GSQTV+Q RVSVVPTEG   T G R
Subjt:  RSTVRSQPGQESVATTVRLTPCTSCGRNHRGQCLVGAGVCYQCGQQGHFKKDCPQLNMTVQRDQGFGSQTVKQLRVSVVPTEGAECTGGVR

KAA0066849.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]1.1e-10373.01Show/hide
Query:  WQTTPIESGRDARSYPSTYCEAKRDEFLGLKQGSLSVAEYERKYTKLSRYADVIVASESDRCRRFEKGF----------VLKY-----LVETALRVELSI
        WQT       + + YPSTYCEAKRDEFLGLKQGSLSVAEYERKYT+LSRYADVI+ASESDRCRRFE+G           + K+     LVETALRVE SI
Subjt:  WQTTPIESGRDARSYPSTYCEAKRDEFLGLKQGSLSVAEYERKYTKLSRYADVIVASESDRCRRFEKGF----------VLKY-----LVETALRVELSI

Query:  TEEKSAVELSRGTSTASGFRGREQRRFTPGINISSRLDFKNRSRSQASRNMSYGSVFHRQSQRIPSQPIRSTVRSQPGQESVATTVRLTPCTSCGRNHRG
        TEEKSAVELSRGTSTASGFRGREQRRFTPGINISSR DFKNRS  QASRN+SYGSVF RQSQRIPSQPIRSTVRSQPGQES+A+TVR  PCTSCGRNHRG
Subjt:  TEEKSAVELSRGTSTASGFRGREQRRFTPGINISSRLDFKNRSRSQASRNMSYGSVFHRQSQRIPSQPIRSTVRSQPGQESVATTVRLTPCTSCGRNHRG

Query:  QCLVGAGVCYQCGQQGHFKKDCPQLNMTVQRDQGFGSQTVKQLRVSVVPTEGAECTGGVRISILYG--WKAYSIYYSFHEEIVQVKEVV
        QCLVGAGVCYQCGQ GHFKKDCPQLNMTVQRDQG GSQT++Q RVSVVPTEG   T G R   + G   +   +Y    +E+    +V+
Subjt:  QCLVGAGVCYQCGQQGHFKKDCPQLNMTVQRDQGFGSQTVKQLRVSVVPTEGAECTGGVRISILYG--WKAYSIYYSFHEEIVQVKEVV

KAA0066849.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]1.7e-0392.86Show/hide
Query:  MKVLSRVDRYRYRKALFVDFTPSFGLRS
        MKVLSRVDRYRYRKALFV FTPSFGLR+
Subjt:  MKVLSRVDRYRYRKALFVDFTPSFGLRS

KAA0066849.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]1.1e-10373.01Show/hide
Query:  WQTTPIESGRDARSYPSTYCEAKRDEFLGLKQGSLSVAEYERKYTKLSRYADVIVASESDRCRRFEKGF----------VLKY-----LVETALRVELSI
        WQT       + + YPSTYCEAKRDEFLGLKQGSLSVAEYERKYT+LSRYADVI+ASESDRCRRFE+G           + K+     LVETALRVE SI
Subjt:  WQTTPIESGRDARSYPSTYCEAKRDEFLGLKQGSLSVAEYERKYTKLSRYADVIVASESDRCRRFEKGF----------VLKY-----LVETALRVELSI

Query:  TEEKSAVELSRGTSTASGFRGREQRRFTPGINISSRLDFKNRSRSQASRNMSYGSVFHRQSQRIPSQPIRSTVRSQPGQESVATTVRLTPCTSCGRNHRG
        TEEKSAVELSRGTSTASGFRGREQRRFTPGINISSR DFKNRS  QASRN+SYGSVF RQSQRIPSQPIRSTVRSQPGQES+A+TVR  PCTSCGRNHRG
Subjt:  TEEKSAVELSRGTSTASGFRGREQRRFTPGINISSRLDFKNRSRSQASRNMSYGSVFHRQSQRIPSQPIRSTVRSQPGQESVATTVRLTPCTSCGRNHRG

Query:  QCLVGAGVCYQCGQQGHFKKDCPQLNMTVQRDQGFGSQTVKQLRVSVVPTEGAECTGGVRISILYG--WKAYSIYYSFHEEIVQVKEVV
        QCLVGAGVCYQCGQ GHFKKDCPQLNMTVQRDQG GSQT++Q RVSVVPTEG   T G R   + G   +   +Y    +E+    +V+
Subjt:  QCLVGAGVCYQCGQQGHFKKDCPQLNMTVQRDQGFGSQTVKQLRVSVVPTEGAECTGGVRISILYG--WKAYSIYYSFHEEIVQVKEVV

TYK27507.1 reverse transcriptase [Cucumis melo var. makuwa]1.1e-10373.01Show/hide
Query:  WQTTPIESGRDARSYPSTYCEAKRDEFLGLKQGSLSVAEYERKYTKLSRYADVIVASESDRCRRFEKGF----------VLKY-----LVETALRVELSI
        WQT       + + YPSTYCEAKRDEFLGLKQGSLSVAEYERKYT+LSRYADVI+ASESDRCRRFE+G           + K+     LVETALRVE SI
Subjt:  WQTTPIESGRDARSYPSTYCEAKRDEFLGLKQGSLSVAEYERKYTKLSRYADVIVASESDRCRRFEKGF----------VLKY-----LVETALRVELSI

Query:  TEEKSAVELSRGTSTASGFRGREQRRFTPGINISSRLDFKNRSRSQASRNMSYGSVFHRQSQRIPSQPIRSTVRSQPGQESVATTVRLTPCTSCGRNHRG
        TEEKSAVELSRGTSTASGFRGREQRRFTPGINISSR DFKNRS  QASRN+SYGSVF RQSQRIPSQPIRSTVRSQPGQES+A+TVR  PCTSCGRNHRG
Subjt:  TEEKSAVELSRGTSTASGFRGREQRRFTPGINISSRLDFKNRSRSQASRNMSYGSVFHRQSQRIPSQPIRSTVRSQPGQESVATTVRLTPCTSCGRNHRG

Query:  QCLVGAGVCYQCGQQGHFKKDCPQLNMTVQRDQGFGSQTVKQLRVSVVPTEGAECTGGVRISILYG--WKAYSIYYSFHEEIVQVKEVV
        QCLVGAGVCYQCGQ GHFKKDCPQLNMTVQRDQG GSQT++Q RVSVVPTEG   T G R   + G   +   +Y    +E+    +V+
Subjt:  QCLVGAGVCYQCGQQGHFKKDCPQLNMTVQRDQGFGSQTVKQLRVSVVPTEGAECTGGVRISILYG--WKAYSIYYSFHEEIVQVKEVV

TrEMBL top hitse value%identityAlignment
A0A5A7SZ16 Reverse transcriptase3.2e-10459.34Show/hide
Query:  MKVLSRVDRYRYRKALFVDFTPSFGLR------------------------------------SHATTYWQTTPIESGRDARS-----------------
        MKVLSRVDRYRYRKALFV FTPSFGLR                                          +  T  E GR  R+                 
Subjt:  MKVLSRVDRYRYRKALFVDFTPSFGLR------------------------------------SHATTYWQTTPIESGRDARS-----------------

Query:  ---------------------------------------------YPSTYCEAKRDEFLGLKQGSLSVAEYERKYTKLSRYADVIVASESDRCRRFEKGF
                                                     YPSTYCEAKRDEFLGLKQGSL+VAEYERKYT+LSRYADVI+ASESDRCRRFE+G 
Subjt:  ---------------------------------------------YPSTYCEAKRDEFLGLKQGSLSVAEYERKYTKLSRYADVIVASESDRCRRFEKGF

Query:  ----------VLKY-----LVETALRVELSITEEKSAVELSRGTSTASGFRGREQRRFTPGINISSRLDFKNRSRSQASRNMSYGSVFHRQSQRIPSQPI
                  + K+     LVETAL VE SIT+EKSAVELSRGTSTASGFRGREQRRFTPGINISSR DFKNRS  QASRN+SYGSVF RQSQRIPSQPI
Subjt:  ----------VLKY-----LVETALRVELSITEEKSAVELSRGTSTASGFRGREQRRFTPGINISSRLDFKNRSRSQASRNMSYGSVFHRQSQRIPSQPI

Query:  RSTVRSQPGQESVATTVRLTPCTSCGRNHRGQCLVGAGVCYQCGQQGHFKKDCPQLNMTVQRDQGFGSQTVKQLRVSVVPTEGAECTGGVR
         STVRSQPGQES+A+TVR  PCTSCGRNHRGQCLVGAGVCYQCGQ GHFKKDCPQLNMTVQRDQG GSQTV+Q RVSVVPTEG   T G R
Subjt:  RSTVRSQPGQESVATTVRLTPCTSCGRNHRGQCLVGAGVCYQCGQQGHFKKDCPQLNMTVQRDQGFGSQTVKQLRVSVVPTEGAECTGGVR

A0A5A7T1Y5 Reverse transcriptase8.3e-0492.86Show/hide
Query:  MKVLSRVDRYRYRKALFVDFTPSFGLRS
        MKVLSRVDRYRYRKALFV FTPSFGLR+
Subjt:  MKVLSRVDRYRYRKALFVDFTPSFGLRS

A0A5A7U2V7 Reverse transcriptase5.4e-10473.01Show/hide
Query:  WQTTPIESGRDARSYPSTYCEAKRDEFLGLKQGSLSVAEYERKYTKLSRYADVIVASESDRCRRFEKGF----------VLKY-----LVETALRVELSI
        WQT       + + YPSTYCEAKRDEFLGLKQGSLSVAEYERKYT+LSRYADVI+ASESDRCRRFE+G           + K+     LVETALRVE SI
Subjt:  WQTTPIESGRDARSYPSTYCEAKRDEFLGLKQGSLSVAEYERKYTKLSRYADVIVASESDRCRRFEKGF----------VLKY-----LVETALRVELSI

Query:  TEEKSAVELSRGTSTASGFRGREQRRFTPGINISSRLDFKNRSRSQASRNMSYGSVFHRQSQRIPSQPIRSTVRSQPGQESVATTVRLTPCTSCGRNHRG
        TEEKSAVELSRGTSTASGFRGREQRRFTPGINISSR DFKNRS  QASRN+SYGSVF RQSQRIPSQPIRSTVRSQPGQES+A+TVR  PCTSCGRNHRG
Subjt:  TEEKSAVELSRGTSTASGFRGREQRRFTPGINISSRLDFKNRSRSQASRNMSYGSVFHRQSQRIPSQPIRSTVRSQPGQESVATTVRLTPCTSCGRNHRG

Query:  QCLVGAGVCYQCGQQGHFKKDCPQLNMTVQRDQGFGSQTVKQLRVSVVPTEGAECTGGVRISILYG--WKAYSIYYSFHEEIVQVKEVV
        QCLVGAGVCYQCGQ GHFKKDCPQLNMTVQRDQG GSQT++Q RVSVVPTEG   T G R   + G   +   +Y    +E+    +V+
Subjt:  QCLVGAGVCYQCGQQGHFKKDCPQLNMTVQRDQGFGSQTVKQLRVSVVPTEGAECTGGVRISILYG--WKAYSIYYSFHEEIVQVKEVV

A0A5D3BHI1 Reverse transcriptase8.3e-0492.86Show/hide
Query:  MKVLSRVDRYRYRKALFVDFTPSFGLRS
        MKVLSRVDRYRYRKALFV FTPSFGLR+
Subjt:  MKVLSRVDRYRYRKALFVDFTPSFGLRS

A0A5D3BHI1 Reverse transcriptase5.4e-10473.01Show/hide
Query:  WQTTPIESGRDARSYPSTYCEAKRDEFLGLKQGSLSVAEYERKYTKLSRYADVIVASESDRCRRFEKGF----------VLKY-----LVETALRVELSI
        WQT       + + YPSTYCEAKRDEFLGLKQGSLSVAEYERKYT+LSRYADVI+ASESDRCRRFE+G           + K+     LVETALRVE SI
Subjt:  WQTTPIESGRDARSYPSTYCEAKRDEFLGLKQGSLSVAEYERKYTKLSRYADVIVASESDRCRRFEKGF----------VLKY-----LVETALRVELSI

Query:  TEEKSAVELSRGTSTASGFRGREQRRFTPGINISSRLDFKNRSRSQASRNMSYGSVFHRQSQRIPSQPIRSTVRSQPGQESVATTVRLTPCTSCGRNHRG
        TEEKSAVELSRGTSTASGFRGREQRRFTPGINISSR DFKNRS  QASRN+SYGSVF RQSQRIPSQPIRSTVRSQPGQES+A+TVR  PCTSCGRNHRG
Subjt:  TEEKSAVELSRGTSTASGFRGREQRRFTPGINISSRLDFKNRSRSQASRNMSYGSVFHRQSQRIPSQPIRSTVRSQPGQESVATTVRLTPCTSCGRNHRG

Query:  QCLVGAGVCYQCGQQGHFKKDCPQLNMTVQRDQGFGSQTVKQLRVSVVPTEGAECTGGVRISILYG--WKAYSIYYSFHEEIVQVKEVV
        QCLVGAGVCYQCGQ GHFKKDCPQLNMTVQRDQG GSQT++Q RVSVVPTEG   T G R   + G   +   +Y    +E+    +V+
Subjt:  QCLVGAGVCYQCGQQGHFKKDCPQLNMTVQRDQGFGSQTVKQLRVSVVPTEGAECTGGVRISILYG--WKAYSIYYSFHEEIVQVKEVV

A0A5D3BS67 Reverse transcriptase5.4e-10473.01Show/hide
Query:  WQTTPIESGRDARSYPSTYCEAKRDEFLGLKQGSLSVAEYERKYTKLSRYADVIVASESDRCRRFEKGF----------VLKY-----LVETALRVELSI
        WQT       + + YPSTYCEAKRDEFLGLKQGSLSVAEYERKYT+LSRYADVI+ASESDRCRRFE+G           + K+     LVETALRVE SI
Subjt:  WQTTPIESGRDARSYPSTYCEAKRDEFLGLKQGSLSVAEYERKYTKLSRYADVIVASESDRCRRFEKGF----------VLKY-----LVETALRVELSI

Query:  TEEKSAVELSRGTSTASGFRGREQRRFTPGINISSRLDFKNRSRSQASRNMSYGSVFHRQSQRIPSQPIRSTVRSQPGQESVATTVRLTPCTSCGRNHRG
        TEEKSAVELSRGTSTASGFRGREQRRFTPGINISSR DFKNRS  QASRN+SYGSVF RQSQRIPSQPIRSTVRSQPGQES+A+TVR  PCTSCGRNHRG
Subjt:  TEEKSAVELSRGTSTASGFRGREQRRFTPGINISSRLDFKNRSRSQASRNMSYGSVFHRQSQRIPSQPIRSTVRSQPGQESVATTVRLTPCTSCGRNHRG

Query:  QCLVGAGVCYQCGQQGHFKKDCPQLNMTVQRDQGFGSQTVKQLRVSVVPTEGAECTGGVRISILYG--WKAYSIYYSFHEEIVQVKEVV
        QCLVGAGVCYQCGQ GHFKKDCPQLNMTVQRDQG GSQT++Q RVSVVPTEG   T G R   + G   +   +Y    +E+    +V+
Subjt:  QCLVGAGVCYQCGQQGHFKKDCPQLNMTVQRDQGFGSQTVKQLRVSVVPTEGAECTGGVRISILYG--WKAYSIYYSFHEEIVQVKEVV

A0A5D3BS67 Reverse transcriptase8.3e-0492.86Show/hide
Query:  MKVLSRVDRYRYRKALFVDFTPSFGLRS
        MKVLSRVDRYRYRKALFV FTPSFGLR+
Subjt:  MKVLSRVDRYRYRKALFVDFTPSFGLRS

A0A5D3BS67 Reverse transcriptase5.4e-10473.01Show/hide
Query:  WQTTPIESGRDARSYPSTYCEAKRDEFLGLKQGSLSVAEYERKYTKLSRYADVIVASESDRCRRFEKGF----------VLKY-----LVETALRVELSI
        WQT       + + YPSTYCEAKRDEFLGLKQGSLSVAEYERKYT+LSRYADVI+ASESDRCRRFE+G           + K+     LVETALRVE SI
Subjt:  WQTTPIESGRDARSYPSTYCEAKRDEFLGLKQGSLSVAEYERKYTKLSRYADVIVASESDRCRRFEKGF----------VLKY-----LVETALRVELSI

Query:  TEEKSAVELSRGTSTASGFRGREQRRFTPGINISSRLDFKNRSRSQASRNMSYGSVFHRQSQRIPSQPIRSTVRSQPGQESVATTVRLTPCTSCGRNHRG
        TEEKSAVELSRGTSTASGFRGREQRRFTPGINISSR DFKNRS  QASRN+SYGSVF RQSQRIPSQPIRSTVRSQPGQES+A+TVR  PCTSCGRNHRG
Subjt:  TEEKSAVELSRGTSTASGFRGREQRRFTPGINISSRLDFKNRSRSQASRNMSYGSVFHRQSQRIPSQPIRSTVRSQPGQESVATTVRLTPCTSCGRNHRG

Query:  QCLVGAGVCYQCGQQGHFKKDCPQLNMTVQRDQGFGSQTVKQLRVSVVPTEGAECTGGVRISILYG--WKAYSIYYSFHEEIVQVKEVV
        QCLVGAGVCYQCGQ GHFKKDCPQLNMTVQRDQG GSQT++Q RVSVVPTEG   T G R   + G   +   +Y    +E+    +V+
Subjt:  QCLVGAGVCYQCGQQGHFKKDCPQLNMTVQRDQGFGSQTVKQLRVSVVPTEGAECTGGVRISILYG--WKAYSIYYSFHEEIVQVKEVV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G44950.1 histone mono-ubiquitination 12.8e-0452.94Show/hide
Query:  SKVADVLKSKGNKNVAYLSEIE----PYNDMQTQN-HLLQQITERDDYNIK
        +K +D+LKSK  ++  YLSEI+     Y D+  QN  LL Q+TERDDYNIK
Subjt:  SKVADVLKSKGNKNVAYLSEIE----PYNDMQTQN-HLLQQITERDDYNIK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTGCTGGGCGGGCCCCACTACATCGTAGAGCTTGTAAACGTTGTTGTACTGGGTGTACTCTACACAACGTAGATTTGTCATGTGTCTTTAGGTTTACTCGT
GAAATCGATTGGCAAGTTCATCACATGAAAGTTTTAAGCAGAGTCGACAGATACAGGTACAGAAAAGCGTTGTTCGTCGACTTTACGCCATCTTTCGGTCTAAGG
AGTCATGCCACCACGTACTGGCAGACGACGCCGATAGAATCAGGACGGGATGCAAGGTCCTACCCAAGCACATACTGCGAAGCCAAGAGGGATGAATTTCTGGGG
TTGAAACAAGGATCACTTTCAGTGGCTGAGTATGAGAGGAAGTATACCAAGCTTTCACGGTATGCTGACGTTATTGTAGCTTCTGAGAGTGACAGGTGCCGAAGG
TTTGAAAAGGGTTTCGTTTTGAAATACTTAGTGGAGACTGCCCTTCGTGTGGAGCTGAGTATAACAGAAGAGAAATCAGCAGTGGAGCTTAGTCGTGGGACTTCA
ACAGCTAGTGGATTTAGAGGCCGTGAGCAACGAAGGTTCACGCCTGGGATAAATATTTCAAGCCGTCTAGATTTTAAGAATCGCTCTAGAAGCCAAGCATCGAGA
AACATGAGTTATGGTAGTGTTTTTCACAGACAGAGCCAGAGAATACCGAGTCAACCCATTAGATCAACAGTAAGATCTCAACCAGGTCAGGAGTCTGTTGCTACT
ACCGTTAGGCTAACACCATGCACGAGTTGTGGCAGAAACCATCGGGGTCAGTGTTTGGTAGGTGCCGGTGTATGTTACCAGTGCGGACAGCAAGGACATTTCAAG
AAAGATTGTCCGCAGTTAAACATGACAGTTCAGAGAGATCAGGGATTTGGGTCCCAGACAGTTAAGCAATTGAGAGTTTCAGTGGTTCCAACAGAGGGCGCCGAG
TGTACCGGTGGAGTTAGAATCTCTATTTTGTATGGTTGGAAAGCTTATTCAATTTACTACAGCTTCCATGAAGAAATTGTACAAGTTAAAGAGGTTGTGTGTGCG
CGATTCTGGAAGTGGTTCTATTTCGAGGGTTATCTGAGTTTATGCACGGGGAATCAGATTTATAGGTCTTCAGAACTACAGTTTTCCATGAAAGAGCTGAGTAAA
ATGATTTATGAAATTTTCATTAACTACATGACTAAGATCTTGAGCCTGAGGCTAGAGATTACCGTGTGCACACTGGTGTCTTTAGATTTACTCGTGAAATCGGTA
AATTTTGAGGTACACTTCATATTGGCAAGTTCATCACATGAAAGTTTTAAGCAAAGTCGACAGATACAGGTACAGAAAAGCGTTGTTCGTCGGCTTCACGTCATC
TTTCGGTCTAAGGTAGCAGATGTTTTAAAATCTAAAGGCAACAAAAATGTGGCATATTTGTCTGAAATTGAGCCATATAACGATATGCAAACTCAAAATCATCTG
TTGCAGCAAATAACTGAGAGAGATGACTATAACATTAAGAAACAAGCAACCAGAGTTCAATGCATGGTTATGGAATCCAAGTTTGTATTTGAAGCCAAGTTCATA
CAAGAATATGAAAGAAGACTCCGATATGAACTCAACACAATGAAAATCACTTAA
mRNA sequenceShow/hide mRNA sequence
ATGCATCTATGGAGGAAAAGGGAAAAGACTAAAAATCAGTTAAATTGTAGGGACAAGTTAAAGGGGTCGTGTGTGCACGATTCTGGAAGTGGTTCTGTTTCGAGG
ATGATTTATGAAATTTTCATTAACTACATGACTGAGATCTTAAGCCTGAGGCTAGAGATTACCGTGTGCACACTGGTTAGATTCTGTTGTCGACGTTGAGCGTAC
TCCGTAACAACGATGCTGTCATGAGTGCTGGGCGGGCCCCACTACATCGTAGAGCTTGTAAACGTTGTTGTACTGGGTGTACTCTACACAACGTAGATTTGTCAT
GTGTCTTTAGGTTTACTCGTGAAATCGATTGGCAAGTTCATCACATGAAAGTTTTAAGCAGAGTCGACAGATACAGGTACAGAAAAGCGTTGTTCGTCGACTTTA
CGCCATCTTTCGGTCTAAGGAGTCATGCCACCACGTACTGGCAGACGACGCCGATAGAATCAGGACGGGATGCAAGGTCCTACCCAAGCACATACTGCGAAGCCA
AGAGGGATGAATTTCTGGGGTTGAAACAAGGATCACTTTCAGTGGCTGAGTATGAGAGGAAGTATACCAAGCTTTCACGGTATGCTGACGTTATTGTAGCTTCTG
AGAGTGACAGGTGCCGAAGGTTTGAAAAGGGTTTCGTTTTGAAATACTTAGTGGAGACTGCCCTTCGTGTGGAGCTGAGTATAACAGAAGAGAAATCAGCAGTGG
AGCTTAGTCGTGGGACTTCAACAGCTAGTGGATTTAGAGGCCGTGAGCAACGAAGGTTCACGCCTGGGATAAATATTTCAAGCCGTCTAGATTTTAAGAATCGCT
CTAGAAGCCAAGCATCGAGAAACATGAGTTATGGTAGTGTTTTTCACAGACAGAGCCAGAGAATACCGAGTCAACCCATTAGATCAACAGTAAGATCTCAACCAG
GTCAGGAGTCTGTTGCTACTACCGTTAGGCTAACACCATGCACGAGTTGTGGCAGAAACCATCGGGGTCAGTGTTTGGTAGGTGCCGGTGTATGTTACCAGTGCG
GACAGCAAGGACATTTCAAGAAAGATTGTCCGCAGTTAAACATGACAGTTCAGAGAGATCAGGGATTTGGGTCCCAGACAGTTAAGCAATTGAGAGTTTCAGTGG
TTCCAACAGAGGGCGCCGAGTGTACCGGTGGAGTTAGAATCTCTATTTTGTATGGTTGGAAAGCTTATTCAATTTACTACAGCTTCCATGAAGAAATTGTACAAG
TTAAAGAGGTTGTGTGTGCGCGATTCTGGAAGTGGTTCTATTTCGAGGGTTATCTGAGTTTATGCACGGGGAATCAGATTTATAGGTCTTCAGAACTACAGTTTT
CCATGAAAGAGCTGAGTAAAATGATTTATGAAATTTTCATTAACTACATGACTAAGATCTTGAGCCTGAGGCTAGAGATTACCGTGTGCACACTGGTGTCTTTAG
ATTTACTCGTGAAATCGGTAAATTTTGAGGTACACTTCATATTGGCAAGTTCATCACATGAAAGTTTTAAGCAAAGTCGACAGATACAGGTACAGAAAAGCGTTG
TTCGTCGGCTTCACGTCATCTTTCGGTCTAAGGTAGCAGATGTTTTAAAATCTAAAGGCAACAAAAATGTGGCATATTTGTCTGAAATTGAGCCATATAACGATA
TGCAAACTCAAAATCATCTGTTGCAGCAAATAACTGAGAGAGATGACTATAACATTAAGAAACAAGCAACCAGAGTTCAATGCATGGTTATGGAATCCAAGTTTG
TATTTGAAGCCAAGTTCATACAAGAATATGAAAGAAGACTCCGATATGAACTCAACACAATGAAAATCACTTAA
Protein sequenceShow/hide protein sequence
MSAGRAPLHRRACKRCCTGCTLHNVDLSCVFRFTREIDWQVHHMKVLSRVDRYRYRKALFVDFTPSFGLRSHATTYWQTTPIESGRDARSYPSTYCEAKRDEFLG
LKQGSLSVAEYERKYTKLSRYADVIVASESDRCRRFEKGFVLKYLVETALRVELSITEEKSAVELSRGTSTASGFRGREQRRFTPGINISSRLDFKNRSRSQASR
NMSYGSVFHRQSQRIPSQPIRSTVRSQPGQESVATTVRLTPCTSCGRNHRGQCLVGAGVCYQCGQQGHFKKDCPQLNMTVQRDQGFGSQTVKQLRVSVVPTEGAE
CTGGVRISILYGWKAYSIYYSFHEEIVQVKEVVCARFWKWFYFEGYLSLCTGNQIYRSSELQFSMKELSKMIYEIFINYMTKILSLRLEITVCTLVSLDLLVKSV
NFEVHFILASSSHESFKQSRQIQVQKSVVRRLHVIFRSKVADVLKSKGNKNVAYLSEIEPYNDMQTQNHLLQQITERDDYNIKKQATRVQCMVMESKFVFEAKFI
QEYERRLRYELNTMKIT