; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0031943 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0031943
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr11:20022831..20023911
RNA-Seq ExpressionLag0031943
SyntenyLag0031943
Gene Ontology termsGO:0090304 - nucleic acid metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016740 - transferase activity (molecular function)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR005162 - Retrotransposon gag domain
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035455.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]5.3e-11968.68Show/hide
Query:  YGIERLKKLGATVFEGSTDPADTE-----------------------------KKAEGWWKSILARRSNARTLDWQTFRGIFEDKYCPNTYCEAKSYELL
        YGIERLKKLGATVFEGSTDPAD E                             K+AEGWWKSILARRS+AR LDWQTFRGIFEDKY P+TYCEAK  E L
Subjt:  YGIERLKKLGATVFEGSTDPADTE-----------------------------KKAEGWWKSILARRSNARTLDWQTFRGIFEDKYCPNTYCEAKSYELL

Query:  GPKQGSLSVDEYEREYIELSRYANVIVAYESDRCRRFERGLCFEIRTLVTDIAKWTDFSQLVEIALLVEQSITEEKSVVEPSRGASTASSFRGREQRRFT
        G KQGSLSV EYER+Y ELSRYA+VI+A ESDRCRRFERGL FEIRT VT IAKWT+FSQLVE AL VEQSITEEKS VE SRG STAS FRGREQRRFT
Subjt:  GPKQGSLSVDEYEREYIELSRYANVIVAYESDRCRRFERGLCFEIRTLVTDIAKWTDFSQLVEIALLVEQSITEEKSVVEPSRGASTASSFRGREQRRFT

Query:  PGV--------KRRSGGQSSRQMS-----RPQSQRTPSQSASSVARPRTGRESLASQTRRTPFASCGKNHRGQCLIGAGVCYQCGQQGHFKRDCPQLRAS
        PG+        K RSGGQ+SR +S     + QSQR PSQ   S  R + G+ES+AS  RR P  SCG+NHRGQCL+GAGVCYQCGQ GHFK+DCPQL  +
Subjt:  PGV--------KRRSGGQSSRQMS-----RPQSQRTPSQSASSVARPRTGRESLASQTRRTPFASCGKNHRGQCLIGAGVCYQCGQQGHFKRDCPQLRAS

Query:  VERDQGVESHTVEQPRVSVAAGEGTCGARQKGVVGRPRQQGKVYAMNE
        V+RDQGV S T+EQ RVSV   EGT GARQKGVVGRPRQQGKVYAM +
Subjt:  VERDQGVESHTVEQPRVSVAAGEGTCGARQKGVVGRPRQQGKVYAMNE

KAA0056684.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]5.3e-11968.68Show/hide
Query:  YGIERLKKLGATVFEGSTDPADTE-----------------------------KKAEGWWKSILARRSNARTLDWQTFRGIFEDKYCPNTYCEAKSYELL
        YGIERLKKLGATVFEGSTDPAD E                             K+AEGWWKSILARRS+AR LDWQTFRGIFEDKY P+TYCEAK  E L
Subjt:  YGIERLKKLGATVFEGSTDPADTE-----------------------------KKAEGWWKSILARRSNARTLDWQTFRGIFEDKYCPNTYCEAKSYELL

Query:  GPKQGSLSVDEYEREYIELSRYANVIVAYESDRCRRFERGLCFEIRTLVTDIAKWTDFSQLVEIALLVEQSITEEKSVVEPSRGASTASSFRGREQRRFT
        G KQGSLSV EYER+Y ELSRYA+VI+A ESDRCRRFERGL FEIRT VT IAKWT+FSQLVE AL VEQSITEEKS VE SRG STAS FRGREQRRFT
Subjt:  GPKQGSLSVDEYEREYIELSRYANVIVAYESDRCRRFERGLCFEIRTLVTDIAKWTDFSQLVEIALLVEQSITEEKSVVEPSRGASTASSFRGREQRRFT

Query:  PGV--------KRRSGGQSSRQMS-----RPQSQRTPSQSASSVARPRTGRESLASQTRRTPFASCGKNHRGQCLIGAGVCYQCGQQGHFKRDCPQLRAS
        PG+        K RSGGQ+SR +S     + QSQR PSQ   S  R + G+ES+AS  RR P  SCG+NHRGQCL+GAGVCYQCGQ GHFK+DCPQL  +
Subjt:  PGV--------KRRSGGQSSRQMS-----RPQSQRTPSQSASSVARPRTGRESLASQTRRTPFASCGKNHRGQCLIGAGVCYQCGQQGHFKRDCPQLRAS

Query:  VERDQGVESHTVEQPRVSVAAGEGTCGARQKGVVGRPRQQGKVYAMNE
        V+RDQGV S T+EQ RVSV   EGT GARQKGVVGRPRQQGKVYAM +
Subjt:  VERDQGVESHTVEQPRVSVAAGEGTCGARQKGVVGRPRQQGKVYAMNE

KAA0066849.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]5.3e-11968.68Show/hide
Query:  YGIERLKKLGATVFEGSTDPADTE-----------------------------KKAEGWWKSILARRSNARTLDWQTFRGIFEDKYCPNTYCEAKSYELL
        YGIERLKKLGATVFEGSTDPAD E                             K+AEGWWKSILARRS+AR LDWQTFRGIFEDKY P+TYCEAK  E L
Subjt:  YGIERLKKLGATVFEGSTDPADTE-----------------------------KKAEGWWKSILARRSNARTLDWQTFRGIFEDKYCPNTYCEAKSYELL

Query:  GPKQGSLSVDEYEREYIELSRYANVIVAYESDRCRRFERGLCFEIRTLVTDIAKWTDFSQLVEIALLVEQSITEEKSVVEPSRGASTASSFRGREQRRFT
        G KQGSLSV EYER+Y ELSRYA+VI+A ESDRCRRFERGL FEIRT VT IAKWT+FSQLVE AL VEQSITEEKS VE SRG STAS FRGREQRRFT
Subjt:  GPKQGSLSVDEYEREYIELSRYANVIVAYESDRCRRFERGLCFEIRTLVTDIAKWTDFSQLVEIALLVEQSITEEKSVVEPSRGASTASSFRGREQRRFT

Query:  PGV--------KRRSGGQSSRQMS-----RPQSQRTPSQSASSVARPRTGRESLASQTRRTPFASCGKNHRGQCLIGAGVCYQCGQQGHFKRDCPQLRAS
        PG+        K RSGGQ+SR +S     + QSQR PSQ   S  R + G+ES+AS  RR P  SCG+NHRGQCL+GAGVCYQCGQ GHFK+DCPQL  +
Subjt:  PGV--------KRRSGGQSSRQMS-----RPQSQRTPSQSASSVARPRTGRESLASQTRRTPFASCGKNHRGQCLIGAGVCYQCGQQGHFKRDCPQLRAS

Query:  VERDQGVESHTVEQPRVSVAAGEGTCGARQKGVVGRPRQQGKVYAMNE
        V+RDQGV S T+EQ RVSV   EGT GARQKGVVGRPRQQGKVYAM +
Subjt:  VERDQGVESHTVEQPRVSVAAGEGTCGARQKGVVGRPRQQGKVYAMNE

TYK06334.1 putative polyprotein [Cucumis melo var. makuwa]5.3e-11968.68Show/hide
Query:  YGIERLKKLGATVFEGSTDPADTE-----------------------------KKAEGWWKSILARRSNARTLDWQTFRGIFEDKYCPNTYCEAKSYELL
        YGIERLKKLGATVFEGSTDPAD E                             K+AEGWWKSILARRS+AR LDWQTFRGIFEDKY P+TYCEAK  E L
Subjt:  YGIERLKKLGATVFEGSTDPADTE-----------------------------KKAEGWWKSILARRSNARTLDWQTFRGIFEDKYCPNTYCEAKSYELL

Query:  GPKQGSLSVDEYEREYIELSRYANVIVAYESDRCRRFERGLCFEIRTLVTDIAKWTDFSQLVEIALLVEQSITEEKSVVEPSRGASTASSFRGREQRRFT
        G KQGSLSV EYER+Y ELSRYA+VI+A ESDRCRRFERGL FEIRT VT IAKWT+FSQLVE AL VEQSITEEKS VE SRG STAS FRGREQRRFT
Subjt:  GPKQGSLSVDEYEREYIELSRYANVIVAYESDRCRRFERGLCFEIRTLVTDIAKWTDFSQLVEIALLVEQSITEEKSVVEPSRGASTASSFRGREQRRFT

Query:  PGV--------KRRSGGQSSRQMS-----RPQSQRTPSQSASSVARPRTGRESLASQTRRTPFASCGKNHRGQCLIGAGVCYQCGQQGHFKRDCPQLRAS
        PG+        K RSGGQ+SR +S     + QSQR PSQ   S  R + G+ES+AS  RR P  SCG+NHRGQCL+GAGVCYQCGQ GHFK+DCPQL  +
Subjt:  PGV--------KRRSGGQSSRQMS-----RPQSQRTPSQSASSVARPRTGRESLASQTRRTPFASCGKNHRGQCLIGAGVCYQCGQQGHFKRDCPQLRAS

Query:  VERDQGVESHTVEQPRVSVAAGEGTCGARQKGVVGRPRQQGKVYAMNE
        V+RDQGV S T+EQ RVSV   EGT GARQKGVVGRPRQQGKVYAM +
Subjt:  VERDQGVESHTVEQPRVSVAAGEGTCGARQKGVVGRPRQQGKVYAMNE

TYK27507.1 reverse transcriptase [Cucumis melo var. makuwa]5.3e-11968.68Show/hide
Query:  YGIERLKKLGATVFEGSTDPADTE-----------------------------KKAEGWWKSILARRSNARTLDWQTFRGIFEDKYCPNTYCEAKSYELL
        YGIERLKKLGATVFEGSTDPAD E                             K+AEGWWKSILARRS+AR LDWQTFRGIFEDKY P+TYCEAK  E L
Subjt:  YGIERLKKLGATVFEGSTDPADTE-----------------------------KKAEGWWKSILARRSNARTLDWQTFRGIFEDKYCPNTYCEAKSYELL

Query:  GPKQGSLSVDEYEREYIELSRYANVIVAYESDRCRRFERGLCFEIRTLVTDIAKWTDFSQLVEIALLVEQSITEEKSVVEPSRGASTASSFRGREQRRFT
        G KQGSLSV EYER+Y ELSRYA+VI+A ESDRCRRFERGL FEIRT VT IAKWT+FSQLVE AL VEQSITEEKS VE SRG STAS FRGREQRRFT
Subjt:  GPKQGSLSVDEYEREYIELSRYANVIVAYESDRCRRFERGLCFEIRTLVTDIAKWTDFSQLVEIALLVEQSITEEKSVVEPSRGASTASSFRGREQRRFT

Query:  PGV--------KRRSGGQSSRQMS-----RPQSQRTPSQSASSVARPRTGRESLASQTRRTPFASCGKNHRGQCLIGAGVCYQCGQQGHFKRDCPQLRAS
        PG+        K RSGGQ+SR +S     + QSQR PSQ   S  R + G+ES+AS  RR P  SCG+NHRGQCL+GAGVCYQCGQ GHFK+DCPQL  +
Subjt:  PGV--------KRRSGGQSSRQMS-----RPQSQRTPSQSASSVARPRTGRESLASQTRRTPFASCGKNHRGQCLIGAGVCYQCGQQGHFKRDCPQLRAS

Query:  VERDQGVESHTVEQPRVSVAAGEGTCGARQKGVVGRPRQQGKVYAMNE
        V+RDQGV S T+EQ RVSV   EGT GARQKGVVGRPRQQGKVYAM +
Subjt:  VERDQGVESHTVEQPRVSVAAGEGTCGARQKGVVGRPRQQGKVYAMNE

TrEMBL top hitse value%identityAlignment
A0A5A7T1Y5 Reverse transcriptase2.6e-11968.68Show/hide
Query:  YGIERLKKLGATVFEGSTDPADTE-----------------------------KKAEGWWKSILARRSNARTLDWQTFRGIFEDKYCPNTYCEAKSYELL
        YGIERLKKLGATVFEGSTDPAD E                             K+AEGWWKSILARRS+AR LDWQTFRGIFEDKY P+TYCEAK  E L
Subjt:  YGIERLKKLGATVFEGSTDPADTE-----------------------------KKAEGWWKSILARRSNARTLDWQTFRGIFEDKYCPNTYCEAKSYELL

Query:  GPKQGSLSVDEYEREYIELSRYANVIVAYESDRCRRFERGLCFEIRTLVTDIAKWTDFSQLVEIALLVEQSITEEKSVVEPSRGASTASSFRGREQRRFT
        G KQGSLSV EYER+Y ELSRYA+VI+A ESDRCRRFERGL FEIRT VT IAKWT+FSQLVE AL VEQSITEEKS VE SRG STAS FRGREQRRFT
Subjt:  GPKQGSLSVDEYEREYIELSRYANVIVAYESDRCRRFERGLCFEIRTLVTDIAKWTDFSQLVEIALLVEQSITEEKSVVEPSRGASTASSFRGREQRRFT

Query:  PGV--------KRRSGGQSSRQMS-----RPQSQRTPSQSASSVARPRTGRESLASQTRRTPFASCGKNHRGQCLIGAGVCYQCGQQGHFKRDCPQLRAS
        PG+        K RSGGQ+SR +S     + QSQR PSQ   S  R + G+ES+AS  RR P  SCG+NHRGQCL+GAGVCYQCGQ GHFK+DCPQL  +
Subjt:  PGV--------KRRSGGQSSRQMS-----RPQSQRTPSQSASSVARPRTGRESLASQTRRTPFASCGKNHRGQCLIGAGVCYQCGQQGHFKRDCPQLRAS

Query:  VERDQGVESHTVEQPRVSVAAGEGTCGARQKGVVGRPRQQGKVYAMNE
        V+RDQGV S T+EQ RVSV   EGT GARQKGVVGRPRQQGKVYAM +
Subjt:  VERDQGVESHTVEQPRVSVAAGEGTCGARQKGVVGRPRQQGKVYAMNE

A0A5A7U2V7 Reverse transcriptase2.6e-11968.68Show/hide
Query:  YGIERLKKLGATVFEGSTDPADTE-----------------------------KKAEGWWKSILARRSNARTLDWQTFRGIFEDKYCPNTYCEAKSYELL
        YGIERLKKLGATVFEGSTDPAD E                             K+AEGWWKSILARRS+AR LDWQTFRGIFEDKY P+TYCEAK  E L
Subjt:  YGIERLKKLGATVFEGSTDPADTE-----------------------------KKAEGWWKSILARRSNARTLDWQTFRGIFEDKYCPNTYCEAKSYELL

Query:  GPKQGSLSVDEYEREYIELSRYANVIVAYESDRCRRFERGLCFEIRTLVTDIAKWTDFSQLVEIALLVEQSITEEKSVVEPSRGASTASSFRGREQRRFT
        G KQGSLSV EYER+Y ELSRYA+VI+A ESDRCRRFERGL FEIRT VT IAKWT+FSQLVE AL VEQSITEEKS VE SRG STAS FRGREQRRFT
Subjt:  GPKQGSLSVDEYEREYIELSRYANVIVAYESDRCRRFERGLCFEIRTLVTDIAKWTDFSQLVEIALLVEQSITEEKSVVEPSRGASTASSFRGREQRRFT

Query:  PGV--------KRRSGGQSSRQMS-----RPQSQRTPSQSASSVARPRTGRESLASQTRRTPFASCGKNHRGQCLIGAGVCYQCGQQGHFKRDCPQLRAS
        PG+        K RSGGQ+SR +S     + QSQR PSQ   S  R + G+ES+AS  RR P  SCG+NHRGQCL+GAGVCYQCGQ GHFK+DCPQL  +
Subjt:  PGV--------KRRSGGQSSRQMS-----RPQSQRTPSQSASSVARPRTGRESLASQTRRTPFASCGKNHRGQCLIGAGVCYQCGQQGHFKRDCPQLRAS

Query:  VERDQGVESHTVEQPRVSVAAGEGTCGARQKGVVGRPRQQGKVYAMNE
        V+RDQGV S T+EQ RVSV   EGT GARQKGVVGRPRQQGKVYAM +
Subjt:  VERDQGVESHTVEQPRVSVAAGEGTCGARQKGVVGRPRQQGKVYAMNE

A0A5A7VQD2 Putative Gag-pol protein2.6e-11968.39Show/hide
Query:  YGIERLKKLGATVFEGSTDPADTE-----------------------------KKAEGWWKSILARRSNARTLDWQTFRGIFEDKYCPNTYCEAKSYELL
        YGIERLKKLGATVFEGSTDPAD E                             K+AEGWWKSILARRS+AR LDWQTFRGIFEDKY P+TYCEAK  E L
Subjt:  YGIERLKKLGATVFEGSTDPADTE-----------------------------KKAEGWWKSILARRSNARTLDWQTFRGIFEDKYCPNTYCEAKSYELL

Query:  GPKQGSLSVDEYEREYIELSRYANVIVAYESDRCRRFERGLCFEIRTLVTDIAKWTDFSQLVEIALLVEQSITEEKSVVEPSRGASTASSFRGREQRRFT
        G KQGSLSV EYER+Y ELSRYA+VI+A ESDRCRRFERGL FEIRT VT IAKWT+FSQLVE A+ VEQSITEEKS VE SRG STAS FRGREQRRFT
Subjt:  GPKQGSLSVDEYEREYIELSRYANVIVAYESDRCRRFERGLCFEIRTLVTDIAKWTDFSQLVEIALLVEQSITEEKSVVEPSRGASTASSFRGREQRRFT

Query:  PGV--------KRRSGGQSSRQMS-----RPQSQRTPSQSASSVARPRTGRESLASQTRRTPFASCGKNHRGQCLIGAGVCYQCGQQGHFKRDCPQLRAS
        PG+        K RSGGQ+SR +S     + QSQR PSQ   S  R + G+ES+AS  RR P  SCG+NHRGQCL+GAGVCYQCGQ GHFK+DCPQL+ +
Subjt:  PGV--------KRRSGGQSSRQMS-----RPQSQRTPSQSASSVARPRTGRESLASQTRRTPFASCGKNHRGQCLIGAGVCYQCGQQGHFKRDCPQLRAS

Query:  VERDQGVESHTVEQPRVSVAAGEGTCGARQKGVVGRPRQQGKVYAMNE
        V+RDQGV S T+EQ RVSV   EGT GARQKGVVGRPRQQGKVYAM +
Subjt:  VERDQGVESHTVEQPRVSVAAGEGTCGARQKGVVGRPRQQGKVYAMNE

A0A5D3BHI1 Reverse transcriptase2.6e-11968.68Show/hide
Query:  YGIERLKKLGATVFEGSTDPADTE-----------------------------KKAEGWWKSILARRSNARTLDWQTFRGIFEDKYCPNTYCEAKSYELL
        YGIERLKKLGATVFEGSTDPAD E                             K+AEGWWKSILARRS+AR LDWQTFRGIFEDKY P+TYCEAK  E L
Subjt:  YGIERLKKLGATVFEGSTDPADTE-----------------------------KKAEGWWKSILARRSNARTLDWQTFRGIFEDKYCPNTYCEAKSYELL

Query:  GPKQGSLSVDEYEREYIELSRYANVIVAYESDRCRRFERGLCFEIRTLVTDIAKWTDFSQLVEIALLVEQSITEEKSVVEPSRGASTASSFRGREQRRFT
        G KQGSLSV EYER+Y ELSRYA+VI+A ESDRCRRFERGL FEIRT VT IAKWT+FSQLVE AL VEQSITEEKS VE SRG STAS FRGREQRRFT
Subjt:  GPKQGSLSVDEYEREYIELSRYANVIVAYESDRCRRFERGLCFEIRTLVTDIAKWTDFSQLVEIALLVEQSITEEKSVVEPSRGASTASSFRGREQRRFT

Query:  PGV--------KRRSGGQSSRQMS-----RPQSQRTPSQSASSVARPRTGRESLASQTRRTPFASCGKNHRGQCLIGAGVCYQCGQQGHFKRDCPQLRAS
        PG+        K RSGGQ+SR +S     + QSQR PSQ   S  R + G+ES+AS  RR P  SCG+NHRGQCL+GAGVCYQCGQ GHFK+DCPQL  +
Subjt:  PGV--------KRRSGGQSSRQMS-----RPQSQRTPSQSASSVARPRTGRESLASQTRRTPFASCGKNHRGQCLIGAGVCYQCGQQGHFKRDCPQLRAS

Query:  VERDQGVESHTVEQPRVSVAAGEGTCGARQKGVVGRPRQQGKVYAMNE
        V+RDQGV S T+EQ RVSV   EGT GARQKGVVGRPRQQGKVYAM +
Subjt:  VERDQGVESHTVEQPRVSVAAGEGTCGARQKGVVGRPRQQGKVYAMNE

A0A5D3BS67 Reverse transcriptase2.6e-11968.68Show/hide
Query:  YGIERLKKLGATVFEGSTDPADTE-----------------------------KKAEGWWKSILARRSNARTLDWQTFRGIFEDKYCPNTYCEAKSYELL
        YGIERLKKLGATVFEGSTDPAD E                             K+AEGWWKSILARRS+AR LDWQTFRGIFEDKY P+TYCEAK  E L
Subjt:  YGIERLKKLGATVFEGSTDPADTE-----------------------------KKAEGWWKSILARRSNARTLDWQTFRGIFEDKYCPNTYCEAKSYELL

Query:  GPKQGSLSVDEYEREYIELSRYANVIVAYESDRCRRFERGLCFEIRTLVTDIAKWTDFSQLVEIALLVEQSITEEKSVVEPSRGASTASSFRGREQRRFT
        G KQGSLSV EYER+Y ELSRYA+VI+A ESDRCRRFERGL FEIRT VT IAKWT+FSQLVE AL VEQSITEEKS VE SRG STAS FRGREQRRFT
Subjt:  GPKQGSLSVDEYEREYIELSRYANVIVAYESDRCRRFERGLCFEIRTLVTDIAKWTDFSQLVEIALLVEQSITEEKSVVEPSRGASTASSFRGREQRRFT

Query:  PGV--------KRRSGGQSSRQMS-----RPQSQRTPSQSASSVARPRTGRESLASQTRRTPFASCGKNHRGQCLIGAGVCYQCGQQGHFKRDCPQLRAS
        PG+        K RSGGQ+SR +S     + QSQR PSQ   S  R + G+ES+AS  RR P  SCG+NHRGQCL+GAGVCYQCGQ GHFK+DCPQL  +
Subjt:  PGV--------KRRSGGQSSRQMS-----RPQSQRTPSQSASSVARPRTGRESLASQTRRTPFASCGKNHRGQCLIGAGVCYQCGQQGHFKRDCPQLRAS

Query:  VERDQGVESHTVEQPRVSVAAGEGTCGARQKGVVGRPRQQGKVYAMNE
        V+RDQGV S T+EQ RVSV   EGT GARQKGVVGRPRQQGKVYAM +
Subjt:  VERDQGVESHTVEQPRVSVAAGEGTCGARQKGVVGRPRQQGKVYAMNE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATGGAATAGAACGACTGAAGAAATTAGGAGCCACAGTGTTTGAGGGTTCCACAGATCCTGCTGACACCGAGAAGAAGGCAGAGGGATGGTGGAAATCCATTTTAGC
CAGGCGTAGTAATGCACGTACGTTAGACTGGCAGACTTTCAGAGGCATATTCGAAGACAAGTATTGTCCCAACACATACTGTGAGGCAAAGAGTTATGAGTTACTGGGGC
CGAAGCAAGGGTCACTTTCAGTGGACGAGTACGAGAGAGAGTATATTGAGCTTTCACGGTATGCTAATGTGATTGTGGCATATGAGAGTGACAGGTGTCGAAGGTTTGAA
AGAGGGTTGTGTTTTGAAATACGTACCCTAGTTACAGATATTGCTAAGTGGACGGATTTTTCCCAGTTAGTAGAGATTGCCTTACTTGTGGAGCAGAGTATAACAGAGGA
AAAGTCGGTAGTGGAGCCTAGTCGTGGGGCTTCAACAGCTAGTAGTTTCCGAGGTCGTGAGCAGCGGAGGTTCACACCTGGAGTTAAGCGTCGGTCTGGTGGCCAGTCAT
CAAGGCAGATGAGTAGACCGCAGAGTCAGAGAACCCCCAGTCAGTCTGCGAGTTCAGTAGCAAGACCACGGACGGGTCGGGAGTCTCTTGCTAGTCAAACCAGGAGAACC
CCATTTGCGAGTTGTGGCAAGAATCATCGGGGTCAGTGTCTTATTGGCGCCGGTGTGTGTTACCAGTGTGGACAACAAGGGCATTTTAAGAGAGATTGTCCACAACTGAG
AGCATCAGTTGAGAGGGACCAGGGAGTTGAGTCTCACACAGTTGAGCAGCCGAGAGTCTCAGTAGCCGCAGGAGAGGGCACTTGTGGTGCAAGGCAGAAGGGAGTTGTGG
GGAGACCTAGGCAGCAAGGAAAAGTCTACGCCATGAATGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGTATGGAATAGAACGACTGAAGAAATTAGGAGCCACAGTGTTTGAGGGTTCCACAGATCCTGCTGACACCGAGAAGAAGGCAGAGGGATGGTGGAAATCCATTTTAGC
CAGGCGTAGTAATGCACGTACGTTAGACTGGCAGACTTTCAGAGGCATATTCGAAGACAAGTATTGTCCCAACACATACTGTGAGGCAAAGAGTTATGAGTTACTGGGGC
CGAAGCAAGGGTCACTTTCAGTGGACGAGTACGAGAGAGAGTATATTGAGCTTTCACGGTATGCTAATGTGATTGTGGCATATGAGAGTGACAGGTGTCGAAGGTTTGAA
AGAGGGTTGTGTTTTGAAATACGTACCCTAGTTACAGATATTGCTAAGTGGACGGATTTTTCCCAGTTAGTAGAGATTGCCTTACTTGTGGAGCAGAGTATAACAGAGGA
AAAGTCGGTAGTGGAGCCTAGTCGTGGGGCTTCAACAGCTAGTAGTTTCCGAGGTCGTGAGCAGCGGAGGTTCACACCTGGAGTTAAGCGTCGGTCTGGTGGCCAGTCAT
CAAGGCAGATGAGTAGACCGCAGAGTCAGAGAACCCCCAGTCAGTCTGCGAGTTCAGTAGCAAGACCACGGACGGGTCGGGAGTCTCTTGCTAGTCAAACCAGGAGAACC
CCATTTGCGAGTTGTGGCAAGAATCATCGGGGTCAGTGTCTTATTGGCGCCGGTGTGTGTTACCAGTGTGGACAACAAGGGCATTTTAAGAGAGATTGTCCACAACTGAG
AGCATCAGTTGAGAGGGACCAGGGAGTTGAGTCTCACACAGTTGAGCAGCCGAGAGTCTCAGTAGCCGCAGGAGAGGGCACTTGTGGTGCAAGGCAGAAGGGAGTTGTGG
GGAGACCTAGGCAGCAAGGAAAAGTCTACGCCATGAATGAATAG
Protein sequenceShow/hide protein sequence
MYGIERLKKLGATVFEGSTDPADTEKKAEGWWKSILARRSNARTLDWQTFRGIFEDKYCPNTYCEAKSYELLGPKQGSLSVDEYEREYIELSRYANVIVAYESDRCRRFE
RGLCFEIRTLVTDIAKWTDFSQLVEIALLVEQSITEEKSVVEPSRGASTASSFRGREQRRFTPGVKRRSGGQSSRQMSRPQSQRTPSQSASSVARPRTGRESLASQTRRT
PFASCGKNHRGQCLIGAGVCYQCGQQGHFKRDCPQLRASVERDQGVESHTVEQPRVSVAAGEGTCGARQKGVVGRPRQQGKVYAMNE