; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0007322 (gene) of Snake gourd v1 genome

Gene IDTan0007322
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionReverse transcriptase
Genome locationLG04:40862839..40882404
RNA-Seq ExpressionTan0007322
SyntenyTan0007322
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0004518 - nuclease activity (molecular function)
GO:0005488 - binding (molecular function)
GO:0008233 - peptidase activity (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035073.1 uncharacterized protein E6C27_scaffold57G001380 [Cucumis melo var. makuwa]1.6e-8859.63Show/hide
Query:  LHVVFRAKLQVIELVQIEQVEYGPRKDKS-KQDQGGTQDPTQSQSERGSSAPRIQTEARYEQHASPSQEVGRPERAGPSDPEKTYGIERLKKLGATVFEG
        LH +FR+K+  +           PR  +  +Q+Q G Q PTQ  S   SS   ++  A  EQ A  +QE+GR +RA PSDPEK YGIERLKKLGATVFEG
Subjt:  LHVVFRAKLQVIELVQIEQVEYGPRKDKS-KQDQGGTQDPTQSQSERGSSAPRIQTEARYEQHASPSQEVGRPERAGPSDPEKTYGIERLKKLGATVFEG

Query:  YINLADAEVWLNKLEKCFDLMSFPEEQKEQLATFQLEKEAEGWWKSILARRSDARTLDWQTFRSIFEDKYYPSTYREAKRDEFVELKQ------------
          +LADAE WLN LEKCFD+M+ PEE+K +LATF L+KEA+GWWKSILARRSDAR LDWQTFR IFEDKYYPSTY EAKRDEF+ LKQ            
Subjt:  YINLADAEVWLNKLEKCFDLMSFPEEQKEQLATFQLEKEAEGWWKSILARRSDARTLDWQTFRSIFEDKYYPSTYREAKRDEFVELKQ------------

Query:  ----------------EWRKFERGLRPEIRTPVTVIAKWTDFSQVVETALRVEQTITEAQPVVEPTRGPSTTSSFWGREQRRFRPGVNVSSRQDFKNRAG
                          R+FERGLR EIRTPVT IAKWT+FSQ+VETALRVEQ+ITE +  VE +RG ST S F GREQRRF PG+N SSRQDFKNR+G
Subjt:  ----------------EWRKFERGLRPEIRTPVTVIAKWTDFSQVVETALRVEQTITEAQPVVEPTRGPSTTSSFWGREQRRFRPGVNVSSRQDFKNRAG

Query:  GQALRQMNVGGAYQRQNQRAPS
        GQA R ++ G  +QRQ+QR PS
Subjt:  GQALRQMNVGGAYQRQNQRAPS

KAA0037805.1 uncharacterized protein E6C27_scaffold918G00190 [Cucumis melo var. makuwa]3.5e-8863.27Show/hide
Query:  KSKQDQGGTQDPTQSQSERGSSAPRIQTEARYEQHASPSQEVGRPERAGPSDPEKTYGIERLKKLGATVFEGYINLADAEVWLNKLEKCFDLMSFPEEQK
        + +Q+Q G Q PTQ  S   SS   ++  A  EQ A  +QE+GR +RA PSDPEK YGIERLKKLGATVFEG  + ADAE WLN LEKCFD+M+ PEE+K
Subjt:  KSKQDQGGTQDPTQSQSERGSSAPRIQTEARYEQHASPSQEVGRPERAGPSDPEKTYGIERLKKLGATVFEGYINLADAEVWLNKLEKCFDLMSFPEEQK

Query:  EQLATFQLEKEAEGWWKSILARRSDARTLDWQTFRSIFEDKYYPSTYREAKRDEFVELKQ----------------------------EWRKFERGLRPE
         +LATF L+KEAEGWWKSILARRSDAR LDWQTFR IFEDKYYPSTY EAKRDEF+ LKQ                              R+FERGLR E
Subjt:  EQLATFQLEKEAEGWWKSILARRSDARTLDWQTFRSIFEDKYYPSTYREAKRDEFVELKQ----------------------------EWRKFERGLRPE

Query:  IRTPVTVIAKWTDFSQVVETALRVEQTITEAQPVVEPTRGPSTTSSFWGREQRRFRPGVNVSSRQDFKNRAGGQALRQMNVGGAYQRQNQRAPS
        IRTPVT IAKWT+FSQ+VETALRVEQ+ITE + VVE +RG ST S F GREQRRF PG+N+SSRQDFKNR+GGQA R ++ G  +QRQ+QR PS
Subjt:  IRTPVTVIAKWTDFSQVVETALRVEQTITEAQPVVEPTRGPSTTSSFWGREQRRFRPGVNVSSRQDFKNRAGGQALRQMNVGGAYQRQNQRAPS

KAA0056684.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]3.1e-8954.67Show/hide
Query:  VEIELLVPDILPY--------FYRETY--------------EVFLLTGNNVCWLHVVFRAKLQVIELVQIEQVEYGPRKD-KSKQDQGGTQDPTQSQSER
        VEI+  VPD LPY         Y   Y                  +  N V  LH +FR+K+  +           PR   + +Q+Q G Q PTQ  S  
Subjt:  VEIELLVPDILPY--------FYRETY--------------EVFLLTGNNVCWLHVVFRAKLQVIELVQIEQVEYGPRKD-KSKQDQGGTQDPTQSQSER

Query:  GSSAPRIQTEARYEQHASPSQEVGRPERAGPSDPEKTYGIERLKKLGATVFEGYINLADAEVWLNKLEKCFDLMSFPEEQKEQLATFQLEKEAEGWWKSI
         SS   ++  A  EQ A  +QE+GR +RA PSDPEK YGIERLKKLGATVFEG  + ADAE WLN LEKCFD+M+ PEE+K +LATF L+KEAEGWWKSI
Subjt:  GSSAPRIQTEARYEQHASPSQEVGRPERAGPSDPEKTYGIERLKKLGATVFEGYINLADAEVWLNKLEKCFDLMSFPEEQKEQLATFQLEKEAEGWWKSI

Query:  LARRSDARTLDWQTFRSIFEDKYYPSTYREAKRDEFVELKQ----------------------------EWRKFERGLRPEIRTPVTVIAKWTDFSQVVE
        LARRSDAR LDWQTFR IFEDKYYPSTY EAKRDEF+ LKQ                              R+FERGLR EIRTPVT IAKWT+FSQ+VE
Subjt:  LARRSDARTLDWQTFRSIFEDKYYPSTYREAKRDEFVELKQ----------------------------EWRKFERGLRPEIRTPVTVIAKWTDFSQVVE

Query:  TALRVEQTITEAQPVVEPTRGPSTTSSFWGREQRRFRPGVNVSSRQDFKNRAGGQALRQMNVGGAYQRQNQRAPS
        TALRVEQ+ITE +  VE +RG ST S F GREQRRF PG+N+SSRQDFKNR+GGQA R ++ G  +QRQ+QR PS
Subjt:  TALRVEQTITEAQPVVEPTRGPSTTSSFWGREQRRFRPGVNVSSRQDFKNRAGGQALRQMNVGGAYQRQNQRAPS

KAA0066849.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]5.9e-8862.93Show/hide
Query:  KSKQDQGGTQDPTQSQSERGSSAPRIQTEARYEQHASPSQEVGRPERAGPSDPEKTYGIERLKKLGATVFEGYINLADAEVWLNKLEKCFDLMSFPEEQK
        + +Q+Q G Q PTQ  S   SS   ++  A  EQ A  +QE+GR +RA PSDPEK YGIERLKKLGATVFEG  + ADAE WLN LEKCFD+M+ PEE+K
Subjt:  KSKQDQGGTQDPTQSQSERGSSAPRIQTEARYEQHASPSQEVGRPERAGPSDPEKTYGIERLKKLGATVFEGYINLADAEVWLNKLEKCFDLMSFPEEQK

Query:  EQLATFQLEKEAEGWWKSILARRSDARTLDWQTFRSIFEDKYYPSTYREAKRDEFVELKQ----------------------------EWRKFERGLRPE
         +LATF L+KEAEGWWKSILARRSDAR LDWQTFR IFEDKYYPSTY EAKRDEF+ LKQ                              R+FERGLR E
Subjt:  EQLATFQLEKEAEGWWKSILARRSDARTLDWQTFRSIFEDKYYPSTYREAKRDEFVELKQ----------------------------EWRKFERGLRPE

Query:  IRTPVTVIAKWTDFSQVVETALRVEQTITEAQPVVEPTRGPSTTSSFWGREQRRFRPGVNVSSRQDFKNRAGGQALRQMNVGGAYQRQNQRAPS
        IRTPVT IAKWT+FSQ+VETALRVEQ+ITE +  VE +RG ST S F GREQRRF PG+N+SSRQDFKNR+GGQA R ++ G  +QRQ+QR PS
Subjt:  IRTPVTVIAKWTDFSQVVETALRVEQTITEAQPVVEPTRGPSTTSSFWGREQRRFRPGVNVSSRQDFKNRAGGQALRQMNVGGAYQRQNQRAPS

TYK14494.1 uncharacterized protein E5676_scaffold15G00050 [Cucumis melo var. makuwa]1.8e-9756.85Show/hide
Query:  MTEILSPRLYGTVCTQ-----------------VEIELLVPDILPYFYR----ETYEVFLLTGNNVCWLHVVFRAKLQVIE---LVQIEQVEYGPRKD--
        MTEILS RLYGTVCTQ                 VEIEL V D LP         +    ++  ++VCWLH VFRAK        +    Q    PR+   
Subjt:  MTEILSPRLYGTVCTQ-----------------VEIELLVPDILPYFYR----ETYEVFLLTGNNVCWLHVVFRAKLQVIE---LVQIEQVEYGPRKD--

Query:  KSKQDQGGTQDPTQSQSERGSSAPRIQTEARYEQHASPSQEVGRPERAGPSDPEKTYGIERLKKLGATVFEGYINLADAEVWLNKLEKCFDLMSFPEEQK
        + +Q+Q G Q PTQ QSERGSSAPR Q E   E+ A  +QE+G PER  PSDPEK Y IERLKKLGATVFEG  +LADAEVWLN LEKCFD+MS P+E+K
Subjt:  KSKQDQGGTQDPTQSQSERGSSAPRIQTEARYEQHASPSQEVGRPERAGPSDPEKTYGIERLKKLGATVFEGYINLADAEVWLNKLEKCFDLMSFPEEQK

Query:  EQLATFQLEKEAEGWWKSILARRSDARTLDWQTFRSIFEDKYYPSTYREAKRDEFVELKQ----------------------------EWRKFERGLRPE
         +LATF L+KEAEGWWKSI+ARR+DA TLD QTFR IF +KYYP+TY EAKRDEF+ELKQ                              R+FERGLR E
Subjt:  EQLATFQLEKEAEGWWKSILARRSDARTLDWQTFRSIFEDKYYPSTYREAKRDEFVELKQ----------------------------EWRKFERGLRPE

Query:  IRTPVTVIAKWTDFSQVVETALRVEQTITEAQPVVEPTRGPSTTSSFWGREQRRFRPGVNVSSRQDFKNRAGGQALRQMNVGGAYQRQNQRAPS
        I TPVT IAKWT+FSQ+VETALRVEQ+I E + V+E +RG STTS   GREQ RF PGVNVS  QDFK R+GG+ LRQM+ G AYQRQ+QRA S
Subjt:  IRTPVTVIAKWTDFSQVVETALRVEQTITEAQPVVEPTRGPSTTSSFWGREQRRFRPGVNVSSRQDFKNRAGGQALRQMNVGGAYQRQNQRAPS

TrEMBL top hitse value%identityAlignment
A0A5A7SX02 CCHC-type domain-containing protein7.5e-8959.63Show/hide
Query:  LHVVFRAKLQVIELVQIEQVEYGPRKDKS-KQDQGGTQDPTQSQSERGSSAPRIQTEARYEQHASPSQEVGRPERAGPSDPEKTYGIERLKKLGATVFEG
        LH +FR+K+  +           PR  +  +Q+Q G Q PTQ  S   SS   ++  A  EQ A  +QE+GR +RA PSDPEK YGIERLKKLGATVFEG
Subjt:  LHVVFRAKLQVIELVQIEQVEYGPRKDKS-KQDQGGTQDPTQSQSERGSSAPRIQTEARYEQHASPSQEVGRPERAGPSDPEKTYGIERLKKLGATVFEG

Query:  YINLADAEVWLNKLEKCFDLMSFPEEQKEQLATFQLEKEAEGWWKSILARRSDARTLDWQTFRSIFEDKYYPSTYREAKRDEFVELKQ------------
          +LADAE WLN LEKCFD+M+ PEE+K +LATF L+KEA+GWWKSILARRSDAR LDWQTFR IFEDKYYPSTY EAKRDEF+ LKQ            
Subjt:  YINLADAEVWLNKLEKCFDLMSFPEEQKEQLATFQLEKEAEGWWKSILARRSDARTLDWQTFRSIFEDKYYPSTYREAKRDEFVELKQ------------

Query:  ----------------EWRKFERGLRPEIRTPVTVIAKWTDFSQVVETALRVEQTITEAQPVVEPTRGPSTTSSFWGREQRRFRPGVNVSSRQDFKNRAG
                          R+FERGLR EIRTPVT IAKWT+FSQ+VETALRVEQ+ITE +  VE +RG ST S F GREQRRF PG+N SSRQDFKNR+G
Subjt:  ----------------EWRKFERGLRPEIRTPVTVIAKWTDFSQVVETALRVEQTITEAQPVVEPTRGPSTTSSFWGREQRRFRPGVNVSSRQDFKNRAG

Query:  GQALRQMNVGGAYQRQNQRAPS
        GQA R ++ G  +QRQ+QR PS
Subjt:  GQALRQMNVGGAYQRQNQRAPS

A0A5A7T8K4 Retrotrans_gag domain-containing protein1.7e-8863.27Show/hide
Query:  KSKQDQGGTQDPTQSQSERGSSAPRIQTEARYEQHASPSQEVGRPERAGPSDPEKTYGIERLKKLGATVFEGYINLADAEVWLNKLEKCFDLMSFPEEQK
        + +Q+Q G Q PTQ  S   SS   ++  A  EQ A  +QE+GR +RA PSDPEK YGIERLKKLGATVFEG  + ADAE WLN LEKCFD+M+ PEE+K
Subjt:  KSKQDQGGTQDPTQSQSERGSSAPRIQTEARYEQHASPSQEVGRPERAGPSDPEKTYGIERLKKLGATVFEGYINLADAEVWLNKLEKCFDLMSFPEEQK

Query:  EQLATFQLEKEAEGWWKSILARRSDARTLDWQTFRSIFEDKYYPSTYREAKRDEFVELKQ----------------------------EWRKFERGLRPE
         +LATF L+KEAEGWWKSILARRSDAR LDWQTFR IFEDKYYPSTY EAKRDEF+ LKQ                              R+FERGLR E
Subjt:  EQLATFQLEKEAEGWWKSILARRSDARTLDWQTFRSIFEDKYYPSTYREAKRDEFVELKQ----------------------------EWRKFERGLRPE

Query:  IRTPVTVIAKWTDFSQVVETALRVEQTITEAQPVVEPTRGPSTTSSFWGREQRRFRPGVNVSSRQDFKNRAGGQALRQMNVGGAYQRQNQRAPS
        IRTPVT IAKWT+FSQ+VETALRVEQ+ITE + VVE +RG ST S F GREQRRF PG+N+SSRQDFKNR+GGQA R ++ G  +QRQ+QR PS
Subjt:  IRTPVTVIAKWTDFSQVVETALRVEQTITEAQPVVEPTRGPSTTSSFWGREQRRFRPGVNVSSRQDFKNRAGGQALRQMNVGGAYQRQNQRAPS

A0A5A7U2V7 Reverse transcriptase2.9e-8862.93Show/hide
Query:  KSKQDQGGTQDPTQSQSERGSSAPRIQTEARYEQHASPSQEVGRPERAGPSDPEKTYGIERLKKLGATVFEGYINLADAEVWLNKLEKCFDLMSFPEEQK
        + +Q+Q G Q PTQ  S   SS   ++  A  EQ A  +QE+GR +RA PSDPEK YGIERLKKLGATVFEG  + ADAE WLN LEKCFD+M+ PEE+K
Subjt:  KSKQDQGGTQDPTQSQSERGSSAPRIQTEARYEQHASPSQEVGRPERAGPSDPEKTYGIERLKKLGATVFEGYINLADAEVWLNKLEKCFDLMSFPEEQK

Query:  EQLATFQLEKEAEGWWKSILARRSDARTLDWQTFRSIFEDKYYPSTYREAKRDEFVELKQ----------------------------EWRKFERGLRPE
         +LATF L+KEAEGWWKSILARRSDAR LDWQTFR IFEDKYYPSTY EAKRDEF+ LKQ                              R+FERGLR E
Subjt:  EQLATFQLEKEAEGWWKSILARRSDARTLDWQTFRSIFEDKYYPSTYREAKRDEFVELKQ----------------------------EWRKFERGLRPE

Query:  IRTPVTVIAKWTDFSQVVETALRVEQTITEAQPVVEPTRGPSTTSSFWGREQRRFRPGVNVSSRQDFKNRAGGQALRQMNVGGAYQRQNQRAPS
        IRTPVT IAKWT+FSQ+VETALRVEQ+ITE +  VE +RG ST S F GREQRRF PG+N+SSRQDFKNR+GGQA R ++ G  +QRQ+QR PS
Subjt:  IRTPVTVIAKWTDFSQVVETALRVEQTITEAQPVVEPTRGPSTTSSFWGREQRRFRPGVNVSSRQDFKNRAGGQALRQMNVGGAYQRQNQRAPS

A0A5A7UNA3 Reverse transcriptase1.5e-8954.67Show/hide
Query:  VEIELLVPDILPY--------FYRETY--------------EVFLLTGNNVCWLHVVFRAKLQVIELVQIEQVEYGPRKD-KSKQDQGGTQDPTQSQSER
        VEI+  VPD LPY         Y   Y                  +  N V  LH +FR+K+  +           PR   + +Q+Q G Q PTQ  S  
Subjt:  VEIELLVPDILPY--------FYRETY--------------EVFLLTGNNVCWLHVVFRAKLQVIELVQIEQVEYGPRKD-KSKQDQGGTQDPTQSQSER

Query:  GSSAPRIQTEARYEQHASPSQEVGRPERAGPSDPEKTYGIERLKKLGATVFEGYINLADAEVWLNKLEKCFDLMSFPEEQKEQLATFQLEKEAEGWWKSI
         SS   ++  A  EQ A  +QE+GR +RA PSDPEK YGIERLKKLGATVFEG  + ADAE WLN LEKCFD+M+ PEE+K +LATF L+KEAEGWWKSI
Subjt:  GSSAPRIQTEARYEQHASPSQEVGRPERAGPSDPEKTYGIERLKKLGATVFEGYINLADAEVWLNKLEKCFDLMSFPEEQKEQLATFQLEKEAEGWWKSI

Query:  LARRSDARTLDWQTFRSIFEDKYYPSTYREAKRDEFVELKQ----------------------------EWRKFERGLRPEIRTPVTVIAKWTDFSQVVE
        LARRSDAR LDWQTFR IFEDKYYPSTY EAKRDEF+ LKQ                              R+FERGLR EIRTPVT IAKWT+FSQ+VE
Subjt:  LARRSDARTLDWQTFRSIFEDKYYPSTYREAKRDEFVELKQ----------------------------EWRKFERGLRPEIRTPVTVIAKWTDFSQVVE

Query:  TALRVEQTITEAQPVVEPTRGPSTTSSFWGREQRRFRPGVNVSSRQDFKNRAGGQALRQMNVGGAYQRQNQRAPS
        TALRVEQ+ITE +  VE +RG ST S F GREQRRF PG+N+SSRQDFKNR+GGQA R ++ G  +QRQ+QR PS
Subjt:  TALRVEQTITEAQPVVEPTRGPSTTSSFWGREQRRFRPGVNVSSRQDFKNRAGGQALRQMNVGGAYQRQNQRAPS

A0A5D3CU23 Retrotrans_gag domain-containing protein8.8e-9856.85Show/hide
Query:  MTEILSPRLYGTVCTQ-----------------VEIELLVPDILPYFYR----ETYEVFLLTGNNVCWLHVVFRAKLQVIE---LVQIEQVEYGPRKD--
        MTEILS RLYGTVCTQ                 VEIEL V D LP         +    ++  ++VCWLH VFRAK        +    Q    PR+   
Subjt:  MTEILSPRLYGTVCTQ-----------------VEIELLVPDILPYFYR----ETYEVFLLTGNNVCWLHVVFRAKLQVIE---LVQIEQVEYGPRKD--

Query:  KSKQDQGGTQDPTQSQSERGSSAPRIQTEARYEQHASPSQEVGRPERAGPSDPEKTYGIERLKKLGATVFEGYINLADAEVWLNKLEKCFDLMSFPEEQK
        + +Q+Q G Q PTQ QSERGSSAPR Q E   E+ A  +QE+G PER  PSDPEK Y IERLKKLGATVFEG  +LADAEVWLN LEKCFD+MS P+E+K
Subjt:  KSKQDQGGTQDPTQSQSERGSSAPRIQTEARYEQHASPSQEVGRPERAGPSDPEKTYGIERLKKLGATVFEGYINLADAEVWLNKLEKCFDLMSFPEEQK

Query:  EQLATFQLEKEAEGWWKSILARRSDARTLDWQTFRSIFEDKYYPSTYREAKRDEFVELKQ----------------------------EWRKFERGLRPE
         +LATF L+KEAEGWWKSI+ARR+DA TLD QTFR IF +KYYP+TY EAKRDEF+ELKQ                              R+FERGLR E
Subjt:  EQLATFQLEKEAEGWWKSILARRSDARTLDWQTFRSIFEDKYYPSTYREAKRDEFVELKQ----------------------------EWRKFERGLRPE

Query:  IRTPVTVIAKWTDFSQVVETALRVEQTITEAQPVVEPTRGPSTTSSFWGREQRRFRPGVNVSSRQDFKNRAGGQALRQMNVGGAYQRQNQRAPS
        I TPVT IAKWT+FSQ+VETALRVEQ+I E + V+E +RG STTS   GREQ RF PGVNVS  QDFK R+GG+ LRQM+ G AYQRQ+QRA S
Subjt:  IRTPVTVIAKWTDFSQVVETALRVEQTITEAQPVVEPTRGPSTTSSFWGREQRRFRPGVNVSSRQDFKNRAGGQALRQMNVGGAYQRQNQRAPS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTGAGATATTAAGCCCGAGGCTTTATGGTACCGTGTGCACACAGGTAGAGATCGAGCTCCTGGTACCTGACATCCTGCCATATTTTTATAGAGAGACTTATGAAGT
GTTTCTCCTTACAGGTAACAATGTCTGTTGGCTTCATGTCGTCTTCCGGGCTAAGTTACAAGTTATTGAGTTAGTTCAGATCGAACAAGTAGAGTATGGTCCAAGAAAGG
ATAAGAGTAAGCAGGATCAGGGCGGGACGCAAGATCCTACCCAAAGCCAATCTGAGAGGGGATCTAGTGCCCCGAGAATCCAGACTGAGGCTAGATATGAGCAACATGCT
AGCCCCTCACAGGAGGTAGGCAGACCAGAGAGAGCAGGCCCTAGTGATCCAGAGAAGACCTATGGAATAGAACGTCTGAAGAAATTAGGAGCCACAGTGTTTGAGGGTTA
CATAAATCTAGCTGACGCCGAGGTTTGGTTGAATAAGCTTGAGAAATGTTTTGACTTGATGAGTTTCCCTGAAGAGCAAAAAGAACAGTTGGCCACATTCCAGCTAGAGA
AGGAAGCAGAGGGTTGGTGGAAGTCAATACTAGCCAGGCGCAGTGATGCACGCACTCTAGATTGGCAAACTTTTAGAAGCATCTTCGAAGATAAATATTACCCCAGCACG
TACCGTGAGGCGAAGAGGGATGAATTTGTAGAACTGAAGCAAGAGTGGCGAAAATTTGAAAGAGGGTTACGACCTGAGATACGTACCCCAGTCACAGTCATTGCTAAGTG
GACTGATTTTTCCCAAGTGGTAGAGACTGCTCTACGTGTTGAGCAGACTATAACGGAGGCGCAACCAGTAGTGGAGCCTACTCGAGGGCCTTCAACAACGAGCAGTTTTT
GGGGTCGTGAACAGCGAAGATTTAGACCTGGAGTGAATGTTTCAAGTCGCCAGGACTTTAAGAATCGAGCTGGTGGCCAGGCATTGAGGCAGATGAATGTGGGTGGTGCC
TATCAGAGGCAGAATCAGAGAGCACCTAGCCTTCGTGGGGGAGAGCTGAAGGAAGAAGAAGAGAGTGAATTTCTGGTTTTGTGTTTGAAGAAGGAAGGAGAAAGGGAAGA
AGCTTTCGAGGTATTTTCGTCGTTGGGAATCTCAATTTCATTTCTTCATATTCTTTTTAGAGGGTTCAAAAAGGAAGAGCCAAGAAGAAATAGGTCGAGTATACGGGCCA
GGGTTCTAGTAGAGATCGAGCTCCTGGTACCTGACATCCTAGGCAACAATGTCTGTTGGCTTCATGCCGTCTTCCGGGCTAAGGAACCCACCTTCCTCCCTCACCTAGAG
AACAACGGAGAAGATTCGGAGGTAGTGTCCGTGATCGCTTGGGTTCGTGTTGCTAAGATCAGGATCGTGGGAGAAGGAGCACTCGAAGAAAAGTTCTTCAAAGGTATTAT
TCTTGCTATCCATCTTGTTTGTATATGCTTAATCTAG
mRNA sequenceShow/hide mRNA sequence
ATGACTGAGATATTAAGCCCGAGGCTTTATGGTACCGTGTGCACACAGGTAGAGATCGAGCTCCTGGTACCTGACATCCTGCCATATTTTTATAGAGAGACTTATGAAGT
GTTTCTCCTTACAGGTAACAATGTCTGTTGGCTTCATGTCGTCTTCCGGGCTAAGTTACAAGTTATTGAGTTAGTTCAGATCGAACAAGTAGAGTATGGTCCAAGAAAGG
ATAAGAGTAAGCAGGATCAGGGCGGGACGCAAGATCCTACCCAAAGCCAATCTGAGAGGGGATCTAGTGCCCCGAGAATCCAGACTGAGGCTAGATATGAGCAACATGCT
AGCCCCTCACAGGAGGTAGGCAGACCAGAGAGAGCAGGCCCTAGTGATCCAGAGAAGACCTATGGAATAGAACGTCTGAAGAAATTAGGAGCCACAGTGTTTGAGGGTTA
CATAAATCTAGCTGACGCCGAGGTTTGGTTGAATAAGCTTGAGAAATGTTTTGACTTGATGAGTTTCCCTGAAGAGCAAAAAGAACAGTTGGCCACATTCCAGCTAGAGA
AGGAAGCAGAGGGTTGGTGGAAGTCAATACTAGCCAGGCGCAGTGATGCACGCACTCTAGATTGGCAAACTTTTAGAAGCATCTTCGAAGATAAATATTACCCCAGCACG
TACCGTGAGGCGAAGAGGGATGAATTTGTAGAACTGAAGCAAGAGTGGCGAAAATTTGAAAGAGGGTTACGACCTGAGATACGTACCCCAGTCACAGTCATTGCTAAGTG
GACTGATTTTTCCCAAGTGGTAGAGACTGCTCTACGTGTTGAGCAGACTATAACGGAGGCGCAACCAGTAGTGGAGCCTACTCGAGGGCCTTCAACAACGAGCAGTTTTT
GGGGTCGTGAACAGCGAAGATTTAGACCTGGAGTGAATGTTTCAAGTCGCCAGGACTTTAAGAATCGAGCTGGTGGCCAGGCATTGAGGCAGATGAATGTGGGTGGTGCC
TATCAGAGGCAGAATCAGAGAGCACCTAGCCTTCGTGGGGGAGAGCTGAAGGAAGAAGAAGAGAGTGAATTTCTGGTTTTGTGTTTGAAGAAGGAAGGAGAAAGGGAAGA
AGCTTTCGAGGTATTTTCGTCGTTGGGAATCTCAATTTCATTTCTTCATATTCTTTTTAGAGGGTTCAAAAAGGAAGAGCCAAGAAGAAATAGGTCGAGTATACGGGCCA
GGGTTCTAGTAGAGATCGAGCTCCTGGTACCTGACATCCTAGGCAACAATGTCTGTTGGCTTCATGCCGTCTTCCGGGCTAAGGAACCCACCTTCCTCCCTCACCTAGAG
AACAACGGAGAAGATTCGGAGGTAGTGTCCGTGATCGCTTGGGTTCGTGTTGCTAAGATCAGGATCGTGGGAGAAGGAGCACTCGAAGAAAAGTTCTTCAAAGGTATTAT
TCTTGCTATCCATCTTGTTTGTATATGCTTAATCTAG
Protein sequenceShow/hide protein sequence
MTEILSPRLYGTVCTQVEIELLVPDILPYFYRETYEVFLLTGNNVCWLHVVFRAKLQVIELVQIEQVEYGPRKDKSKQDQGGTQDPTQSQSERGSSAPRIQTEARYEQHA
SPSQEVGRPERAGPSDPEKTYGIERLKKLGATVFEGYINLADAEVWLNKLEKCFDLMSFPEEQKEQLATFQLEKEAEGWWKSILARRSDARTLDWQTFRSIFEDKYYPST
YREAKRDEFVELKQEWRKFERGLRPEIRTPVTVIAKWTDFSQVVETALRVEQTITEAQPVVEPTRGPSTTSSFWGREQRRFRPGVNVSSRQDFKNRAGGQALRQMNVGGA
YQRQNQRAPSLRGGELKEEEESEFLVLCLKKEGEREEAFEVFSSLGISISFLHILFRGFKKEEPRRNRSSIRARVLVEIELLVPDILGNNVCWLHAVFRAKEPTFLPHLE
NNGEDSEVVSVIAWVRVAKIRIVGEGALEEKFFKGIILAIHLVCICLI