; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0018170 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0018170
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr5:17964251..17965336
RNA-Seq ExpressionLag0018170
SyntenyLag0018170
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035455.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]2.0e-6142.01Show/hide
Query:  DPHRNFKIERLKALGATIFAGTTNPADAKIWMDMLEKCFDVMRCPEDRKVRLATFLLQEGASVWWNSVRSKRLGSEITNWNEFKKAFYEEYYPQSYKDEK
        DP + + IERLK LGAT+F G+T+PADA+ W++MLEKCFDVM CPE+RKVRLATFLLQ+ A  WW S+ ++R  +   +W  F+  F ++YYP +Y + K
Subjt:  DPHRNFKIERLKALGATIFAGTTNPADAKIWMDMLEKCFDVMRCPEDRKVRLATFLLQEGASVWWNSVRSKRLGSEITNWNEFKKAFYEEYYPQSYKDEK

Query:  RSEFLKLVQGSTTVAEYEKKYIELSKYASDLIEDEKDRCRRFEVGLREEIRTSTTAM-EWREFRKLVEAALRVERSISEGKSRKVPS---NTSSSKQLKD
        R EFL L QGS +VAEYE+KY ELS+YA  +I  E DRCRRFE GLR EIRT  TA+ +W  F +LVE ALRVE+SI+E KS    S   +T+S  + ++
Subjt:  RSEFLKLVQGSTTVAEYEKKYIELSKYASDLIEDEKDRCRRFEVGLREEIRTSTTAM-EWREFRKLVEAALRVERSISEGKSRKVPS---NTSSSKQLKD

Query:  DKEFASGVAKHGTSKSEFYGKSSSKAGSS---------NTKQVTGSPVSIVGSTSEREESTFSHDLYSQCQNYGKLHRGRCLKGTDISYNCKQSNHMRKE
         + F  G+  + +S+ +F  +S  +A  +          ++++   P+     +   +ES  S      C + G+ HRG+CL G  + Y C Q  H +K+
Subjt:  DKEFASGVAKHGTSKSEFYGKSSSKAGSS---------NTKQVTGSPVSIVGSTSEREESTFSHDLYSQCQNYGKLHRGRCLKGTDISYNCKQSNHMRKE

Query:  YPLL---MQRDNDLQGSFSQ-LEKSKVQLGRGKYMMQARPTITCSGQSKMG-ASEPRSKRQVGTPVRKE
         P L   +QRD   QG  SQ +E+S+V +          PT   SG  + G    PR + +V    ++E
Subjt:  YPLL---MQRDNDLQGSFSQ-LEKSKVQLGRGKYMMQARPTITCSGQSKMG-ASEPRSKRQVGTPVRKE

KAA0039476.1 uncharacterized protein E6C27_scaffold64G002900 [Cucumis melo var. makuwa]1.2e-6147.8Show/hide
Query:  DPHRNFKIERLKALGATIFAGTTNPADAKIWMDMLEKCFDVMRCPEDRKVRLATFLLQEGASVWWNSVRSKRLGSEITNWNEFKKAFYEEYYPQSYKDEK
        DP + + IERLKALGAT FAGTTNP D + W+ ++EKCF V RCPEDRKV LA FLLQ GA  WW    S+R  +   +W+EFKKAF++++YP+S++D K
Subjt:  DPHRNFKIERLKALGATIFAGTTNPADAKIWMDMLEKCFDVMRCPEDRKVRLATFLLQEGASVWWNSVRSKRLGSEITNWNEFKKAFYEEYYPQSYKDEK

Query:  RSEFLKLVQGSTTVAEYEKKYIELSKYASDLIEDEKDRCRRFEVGLREEIRTSTTA-MEWREFRKLVEAALRVERSISEGK-----SRKVPSNTSSSKQL
        R+EFL+L QGS TVAEYEKKY ELSKYA+ +IEDE +R +RFE GLREEIRTS TA  +W +F KLVEAALRV +S++E K     S+ V + +SS  + 
Subjt:  RSEFLKLVQGSTTVAEYEKKYIELSKYASDLIEDEKDRCRRFEVGLREEIRTSTTA-MEWREFRKLVEAALRVERSISEGK-----SRKVPSNTSSSKQL

Query:  KDDKE----FASGVAKHGTSKSEFYGKSSSKAGSSNTKQ-VTGS--PV-SIVGSTSEREESTFSHDLYSQCQNYGKLHRGRCLKGTDISYNCKQSNHMRK
        +  KE    F  GV   G  KS++ G   S +GS    Q  +GS  P+ SI GS   R +   S    S                  + YNC Q  H R+
Subjt:  KDDKE----FASGVAKHGTSKSEFYGKSSSKAGSSNTKQ-VTGS--PV-SIVGSTSEREESTFSHDLYSQCQNYGKLHRGRCLKGTDISYNCKQSNHMRK

Query:  EYPLLMQRDNDLQGSFSQ
        + P L+   N +  + SQ
Subjt:  EYPLLMQRDNDLQGSFSQ

KAA0060484.1 Gag protease polyprotein-like protein [Cucumis melo var. makuwa]1.0e-6546.44Show/hide
Query:  DPHRNFKIERLKALGATIFAGTTNPADAKIWMDMLEKCFDVMRCPEDRKVRLATFLLQEGASVWWNSVRSKRLGSEITNWNEFKKAFYEEYYPQSYKDEK
        DP + + IERLKALGAT FAGTTNPADA+ W+ ++EKCF V RCPEDRKV LA FLLQ GA  WW    S+R  +   +WNEFKKAF++++YP+S++D K
Subjt:  DPHRNFKIERLKALGATIFAGTTNPADAKIWMDMLEKCFDVMRCPEDRKVRLATFLLQEGASVWWNSVRSKRLGSEITNWNEFKKAFYEEYYPQSYKDEK

Query:  RSEFLKLVQGSTTVAEYEKKYIELSKYASDLIEDEKDRCRRFEVGLREEIRTSTTA-MEWREFRKLVEAALRVERSISEGK-----SRKVPSNTSSSKQL
        R+EFL+L QGS T+AEYEKKY ELS YA+ +IEDE +RC+RFE GLREEIRT  TA  +W +F KLVEAALRVE+S++E K     S+ V + +SS  + 
Subjt:  RSEFLKLVQGSTTVAEYEKKYIELSKYASDLIEDEKDRCRRFEVGLREEIRTSTTA-MEWREFRKLVEAALRVERSISEGK-----SRKVPSNTSSSKQL

Query:  KDDKE----FASGVAKHGTSKSEFYGKSSSKAGSS-NTKQVTGS--PVSIV-GSTSEREESTFSHDLYSQCQNYGKLHRGRCLKGTDISYNCKQSNHMRK
        +  KE    F  GV+  G  KS++ G S SK+GSS   ++ +GS  P+S   GS   R +   S    S                  + YNC Q  H R+
Subjt:  KDDKE----FASGVAKHGTSKSEFYGKSSSKAGSS-NTKQVTGS--PVSIV-GSTSEREESTFSHDLYSQCQNYGKLHRGRCLKGTDISYNCKQSNHMRK

Query:  EYPLLMQRDNDLQGSFSQLEKSKVQLGRGKYMMQARPTITCSGQSKMGASE
        + P L+   N +  + SQ               Q R TIT SG+   G  +
Subjt:  EYPLLMQRDNDLQGSFSQLEKSKVQLGRGKYMMQARPTITCSGQSKMGASE

KAA0066849.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]2.0e-6142.01Show/hide
Query:  DPHRNFKIERLKALGATIFAGTTNPADAKIWMDMLEKCFDVMRCPEDRKVRLATFLLQEGASVWWNSVRSKRLGSEITNWNEFKKAFYEEYYPQSYKDEK
        DP + + IERLK LGAT+F G+T+PADA+ W++MLEKCFDVM CPE+RKVRLATFLLQ+ A  WW S+ ++R  +   +W  F+  F ++YYP +Y + K
Subjt:  DPHRNFKIERLKALGATIFAGTTNPADAKIWMDMLEKCFDVMRCPEDRKVRLATFLLQEGASVWWNSVRSKRLGSEITNWNEFKKAFYEEYYPQSYKDEK

Query:  RSEFLKLVQGSTTVAEYEKKYIELSKYASDLIEDEKDRCRRFEVGLREEIRTSTTAM-EWREFRKLVEAALRVERSISEGKSRKVPS---NTSSSKQLKD
        R EFL L QGS +VAEYE+KY ELS+YA  +I  E DRCRRFE GLR EIRT  TA+ +W  F +LVE ALRVE+SI+E KS    S   +T+S  + ++
Subjt:  RSEFLKLVQGSTTVAEYEKKYIELSKYASDLIEDEKDRCRRFEVGLREEIRTSTTAM-EWREFRKLVEAALRVERSISEGKSRKVPS---NTSSSKQLKD

Query:  DKEFASGVAKHGTSKSEFYGKSSSKAGSS---------NTKQVTGSPVSIVGSTSEREESTFSHDLYSQCQNYGKLHRGRCLKGTDISYNCKQSNHMRKE
         + F  G+  + +S+ +F  +S  +A  +          ++++   P+     +   +ES  S      C + G+ HRG+CL G  + Y C Q  H +K+
Subjt:  DKEFASGVAKHGTSKSEFYGKSSSKAGSS---------NTKQVTGSPVSIVGSTSEREESTFSHDLYSQCQNYGKLHRGRCLKGTDISYNCKQSNHMRKE

Query:  YPLL---MQRDNDLQGSFSQ-LEKSKVQLGRGKYMMQARPTITCSGQSKMG-ASEPRSKRQVGTPVRKE
         P L   +QRD   QG  SQ +E+S+V +          PT   SG  + G    PR + +V    ++E
Subjt:  YPLL---MQRDNDLQGSFSQ-LEKSKVQLGRGKYMMQARPTITCSGQSKMG-ASEPRSKRQVGTPVRKE

TYK15233.1 uncharacterized protein E5676_scaffold892G00030 [Cucumis melo var. makuwa]1.2e-6147.8Show/hide
Query:  DPHRNFKIERLKALGATIFAGTTNPADAKIWMDMLEKCFDVMRCPEDRKVRLATFLLQEGASVWWNSVRSKRLGSEITNWNEFKKAFYEEYYPQSYKDEK
        DP + + IERLKALGAT FAGTTNP D + W+ ++EKCF V RCPEDRKV LA FLLQ GA  WW    S+R  +   +W+EFKKAF++++YP+S++D K
Subjt:  DPHRNFKIERLKALGATIFAGTTNPADAKIWMDMLEKCFDVMRCPEDRKVRLATFLLQEGASVWWNSVRSKRLGSEITNWNEFKKAFYEEYYPQSYKDEK

Query:  RSEFLKLVQGSTTVAEYEKKYIELSKYASDLIEDEKDRCRRFEVGLREEIRTSTTA-MEWREFRKLVEAALRVERSISEGK-----SRKVPSNTSSSKQL
        R+EFL+L QGS TVAEYEKKY ELSKYA+ +IEDE +R +RFE GLREEIRTS TA  +W +F KLVEAALRV +S++E K     S+ V + +SS  + 
Subjt:  RSEFLKLVQGSTTVAEYEKKYIELSKYASDLIEDEKDRCRRFEVGLREEIRTSTTA-MEWREFRKLVEAALRVERSISEGK-----SRKVPSNTSSSKQL

Query:  KDDKE----FASGVAKHGTSKSEFYGKSSSKAGSSNTKQ-VTGS--PV-SIVGSTSEREESTFSHDLYSQCQNYGKLHRGRCLKGTDISYNCKQSNHMRK
        +  KE    F  GV   G  KS++ G   S +GS    Q  +GS  P+ SI GS   R +   S    S                  + YNC Q  H R+
Subjt:  KDDKE----FASGVAKHGTSKSEFYGKSSSKAGSSNTKQ-VTGS--PV-SIVGSTSEREESTFSHDLYSQCQNYGKLHRGRCLKGTDISYNCKQSNHMRK

Query:  EYPLLMQRDNDLQGSFSQ
        + P L+   N +  + SQ
Subjt:  EYPLLMQRDNDLQGSFSQ

TrEMBL top hitse value%identityAlignment
A0A5A7TBS0 CCHC-type domain-containing protein5.8e-6247.8Show/hide
Query:  DPHRNFKIERLKALGATIFAGTTNPADAKIWMDMLEKCFDVMRCPEDRKVRLATFLLQEGASVWWNSVRSKRLGSEITNWNEFKKAFYEEYYPQSYKDEK
        DP + + IERLKALGAT FAGTTNP D + W+ ++EKCF V RCPEDRKV LA FLLQ GA  WW    S+R  +   +W+EFKKAF++++YP+S++D K
Subjt:  DPHRNFKIERLKALGATIFAGTTNPADAKIWMDMLEKCFDVMRCPEDRKVRLATFLLQEGASVWWNSVRSKRLGSEITNWNEFKKAFYEEYYPQSYKDEK

Query:  RSEFLKLVQGSTTVAEYEKKYIELSKYASDLIEDEKDRCRRFEVGLREEIRTSTTA-MEWREFRKLVEAALRVERSISEGK-----SRKVPSNTSSSKQL
        R+EFL+L QGS TVAEYEKKY ELSKYA+ +IEDE +R +RFE GLREEIRTS TA  +W +F KLVEAALRV +S++E K     S+ V + +SS  + 
Subjt:  RSEFLKLVQGSTTVAEYEKKYIELSKYASDLIEDEKDRCRRFEVGLREEIRTSTTA-MEWREFRKLVEAALRVERSISEGK-----SRKVPSNTSSSKQL

Query:  KDDKE----FASGVAKHGTSKSEFYGKSSSKAGSSNTKQ-VTGS--PV-SIVGSTSEREESTFSHDLYSQCQNYGKLHRGRCLKGTDISYNCKQSNHMRK
        +  KE    F  GV   G  KS++ G   S +GS    Q  +GS  P+ SI GS   R +   S    S                  + YNC Q  H R+
Subjt:  KDDKE----FASGVAKHGTSKSEFYGKSSSKAGSSNTKQ-VTGS--PV-SIVGSTSEREESTFSHDLYSQCQNYGKLHRGRCLKGTDISYNCKQSNHMRK

Query:  EYPLLMQRDNDLQGSFSQ
        + P L+   N +  + SQ
Subjt:  EYPLLMQRDNDLQGSFSQ

A0A5A7U2V7 Reverse transcriptase9.9e-6242.01Show/hide
Query:  DPHRNFKIERLKALGATIFAGTTNPADAKIWMDMLEKCFDVMRCPEDRKVRLATFLLQEGASVWWNSVRSKRLGSEITNWNEFKKAFYEEYYPQSYKDEK
        DP + + IERLK LGAT+F G+T+PADA+ W++MLEKCFDVM CPE+RKVRLATFLLQ+ A  WW S+ ++R  +   +W  F+  F ++YYP +Y + K
Subjt:  DPHRNFKIERLKALGATIFAGTTNPADAKIWMDMLEKCFDVMRCPEDRKVRLATFLLQEGASVWWNSVRSKRLGSEITNWNEFKKAFYEEYYPQSYKDEK

Query:  RSEFLKLVQGSTTVAEYEKKYIELSKYASDLIEDEKDRCRRFEVGLREEIRTSTTAM-EWREFRKLVEAALRVERSISEGKSRKVPS---NTSSSKQLKD
        R EFL L QGS +VAEYE+KY ELS+YA  +I  E DRCRRFE GLR EIRT  TA+ +W  F +LVE ALRVE+SI+E KS    S   +T+S  + ++
Subjt:  RSEFLKLVQGSTTVAEYEKKYIELSKYASDLIEDEKDRCRRFEVGLREEIRTSTTAM-EWREFRKLVEAALRVERSISEGKSRKVPS---NTSSSKQLKD

Query:  DKEFASGVAKHGTSKSEFYGKSSSKAGSS---------NTKQVTGSPVSIVGSTSEREESTFSHDLYSQCQNYGKLHRGRCLKGTDISYNCKQSNHMRKE
         + F  G+  + +S+ +F  +S  +A  +          ++++   P+     +   +ES  S      C + G+ HRG+CL G  + Y C Q  H +K+
Subjt:  DKEFASGVAKHGTSKSEFYGKSSSKAGSS---------NTKQVTGSPVSIVGSTSEREESTFSHDLYSQCQNYGKLHRGRCLKGTDISYNCKQSNHMRKE

Query:  YPLL---MQRDNDLQGSFSQ-LEKSKVQLGRGKYMMQARPTITCSGQSKMG-ASEPRSKRQVGTPVRKE
         P L   +QRD   QG  SQ +E+S+V +          PT   SG  + G    PR + +V    ++E
Subjt:  YPLL---MQRDNDLQGSFSQ-LEKSKVQLGRGKYMMQARPTITCSGQSKMG-ASEPRSKRQVGTPVRKE

A0A5A7UZM6 Gag protease polyprotein-like protein5.1e-6646.44Show/hide
Query:  DPHRNFKIERLKALGATIFAGTTNPADAKIWMDMLEKCFDVMRCPEDRKVRLATFLLQEGASVWWNSVRSKRLGSEITNWNEFKKAFYEEYYPQSYKDEK
        DP + + IERLKALGAT FAGTTNPADA+ W+ ++EKCF V RCPEDRKV LA FLLQ GA  WW    S+R  +   +WNEFKKAF++++YP+S++D K
Subjt:  DPHRNFKIERLKALGATIFAGTTNPADAKIWMDMLEKCFDVMRCPEDRKVRLATFLLQEGASVWWNSVRSKRLGSEITNWNEFKKAFYEEYYPQSYKDEK

Query:  RSEFLKLVQGSTTVAEYEKKYIELSKYASDLIEDEKDRCRRFEVGLREEIRTSTTA-MEWREFRKLVEAALRVERSISEGK-----SRKVPSNTSSSKQL
        R+EFL+L QGS T+AEYEKKY ELS YA+ +IEDE +RC+RFE GLREEIRT  TA  +W +F KLVEAALRVE+S++E K     S+ V + +SS  + 
Subjt:  RSEFLKLVQGSTTVAEYEKKYIELSKYASDLIEDEKDRCRRFEVGLREEIRTSTTA-MEWREFRKLVEAALRVERSISEGK-----SRKVPSNTSSSKQL

Query:  KDDKE----FASGVAKHGTSKSEFYGKSSSKAGSS-NTKQVTGS--PVSIV-GSTSEREESTFSHDLYSQCQNYGKLHRGRCLKGTDISYNCKQSNHMRK
        +  KE    F  GV+  G  KS++ G S SK+GSS   ++ +GS  P+S   GS   R +   S    S                  + YNC Q  H R+
Subjt:  KDDKE----FASGVAKHGTSKSEFYGKSSSKAGSS-NTKQVTGS--PVSIV-GSTSEREESTFSHDLYSQCQNYGKLHRGRCLKGTDISYNCKQSNHMRK

Query:  EYPLLMQRDNDLQGSFSQLEKSKVQLGRGKYMMQARPTITCSGQSKMGASE
        + P L+   N +  + SQ               Q R TIT SG+   G  +
Subjt:  EYPLLMQRDNDLQGSFSQLEKSKVQLGRGKYMMQARPTITCSGQSKMGASE

A0A5D3BS67 Reverse transcriptase9.9e-6242.01Show/hide
Query:  DPHRNFKIERLKALGATIFAGTTNPADAKIWMDMLEKCFDVMRCPEDRKVRLATFLLQEGASVWWNSVRSKRLGSEITNWNEFKKAFYEEYYPQSYKDEK
        DP + + IERLK LGAT+F G+T+PADA+ W++MLEKCFDVM CPE+RKVRLATFLLQ+ A  WW S+ ++R  +   +W  F+  F ++YYP +Y + K
Subjt:  DPHRNFKIERLKALGATIFAGTTNPADAKIWMDMLEKCFDVMRCPEDRKVRLATFLLQEGASVWWNSVRSKRLGSEITNWNEFKKAFYEEYYPQSYKDEK

Query:  RSEFLKLVQGSTTVAEYEKKYIELSKYASDLIEDEKDRCRRFEVGLREEIRTSTTAM-EWREFRKLVEAALRVERSISEGKSRKVPS---NTSSSKQLKD
        R EFL L QGS +VAEYE+KY ELS+YA  +I  E DRCRRFE GLR EIRT  TA+ +W  F +LVE ALRVE+SI+E KS    S   +T+S  + ++
Subjt:  RSEFLKLVQGSTTVAEYEKKYIELSKYASDLIEDEKDRCRRFEVGLREEIRTSTTAM-EWREFRKLVEAALRVERSISEGKSRKVPS---NTSSSKQLKD

Query:  DKEFASGVAKHGTSKSEFYGKSSSKAGSS---------NTKQVTGSPVSIVGSTSEREESTFSHDLYSQCQNYGKLHRGRCLKGTDISYNCKQSNHMRKE
         + F  G+  + +S+ +F  +S  +A  +          ++++   P+     +   +ES  S      C + G+ HRG+CL G  + Y C Q  H +K+
Subjt:  DKEFASGVAKHGTSKSEFYGKSSSKAGSS---------NTKQVTGSPVSIVGSTSEREESTFSHDLYSQCQNYGKLHRGRCLKGTDISYNCKQSNHMRKE

Query:  YPLL---MQRDNDLQGSFSQ-LEKSKVQLGRGKYMMQARPTITCSGQSKMG-ASEPRSKRQVGTPVRKE
         P L   +QRD   QG  SQ +E+S+V +          PT   SG  + G    PR + +V    ++E
Subjt:  YPLL---MQRDNDLQGSFSQ-LEKSKVQLGRGKYMMQARPTITCSGQSKMG-ASEPRSKRQVGTPVRKE

A0A5D3CTK6 CCHC-type domain-containing protein5.8e-6247.8Show/hide
Query:  DPHRNFKIERLKALGATIFAGTTNPADAKIWMDMLEKCFDVMRCPEDRKVRLATFLLQEGASVWWNSVRSKRLGSEITNWNEFKKAFYEEYYPQSYKDEK
        DP + + IERLKALGAT FAGTTNP D + W+ ++EKCF V RCPEDRKV LA FLLQ GA  WW    S+R  +   +W+EFKKAF++++YP+S++D K
Subjt:  DPHRNFKIERLKALGATIFAGTTNPADAKIWMDMLEKCFDVMRCPEDRKVRLATFLLQEGASVWWNSVRSKRLGSEITNWNEFKKAFYEEYYPQSYKDEK

Query:  RSEFLKLVQGSTTVAEYEKKYIELSKYASDLIEDEKDRCRRFEVGLREEIRTSTTA-MEWREFRKLVEAALRVERSISEGK-----SRKVPSNTSSSKQL
        R+EFL+L QGS TVAEYEKKY ELSKYA+ +IEDE +R +RFE GLREEIRTS TA  +W +F KLVEAALRV +S++E K     S+ V + +SS  + 
Subjt:  RSEFLKLVQGSTTVAEYEKKYIELSKYASDLIEDEKDRCRRFEVGLREEIRTSTTA-MEWREFRKLVEAALRVERSISEGK-----SRKVPSNTSSSKQL

Query:  KDDKE----FASGVAKHGTSKSEFYGKSSSKAGSSNTKQ-VTGS--PV-SIVGSTSEREESTFSHDLYSQCQNYGKLHRGRCLKGTDISYNCKQSNHMRK
        +  KE    F  GV   G  KS++ G   S +GS    Q  +GS  P+ SI GS   R +   S    S                  + YNC Q  H R+
Subjt:  KDDKE----FASGVAKHGTSKSEFYGKSSSKAGSSNTKQ-VTGS--PV-SIVGSTSEREESTFSHDLYSQCQNYGKLHRGRCLKGTDISYNCKQSNHMRK

Query:  EYPLLMQRDNDLQGSFSQ
        + P L+   N +  + SQ
Subjt:  EYPLLMQRDNDLQGSFSQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGATGCTCATAAAAGACCCGCATAGAAATTTTAAAATTGAGCGATTAAAAGCTTTAGGAGCAACTATATTTGCTGGGACAACAAACCCAGCTGATGCTAAGATATG
GATGGACATGCTAGAAAAATGTTTCGATGTGATGAGATGCCCAGAAGATAGAAAAGTGAGACTTGCTACATTTCTACTTCAAGAAGGAGCGAGTGTTTGGTGGAATTCAG
TGAGAAGCAAACGATTAGGGTCTGAAATAACCAATTGGAACGAGTTCAAGAAAGCATTTTATGAAGAATATTATCCACAATCCTATAAGGATGAAAAGCGAAGTGAATTC
TTGAAATTGGTGCAAGGATCAACAACTGTTGCAGAGTACGAGAAGAAGTATATTGAACTTTCAAAGTATGCCTCAGACCTCATCGAAGACGAGAAAGATAGATGTAGAAG
GTTTGAAGTTGGCCTACGAGAGGAGATTCGAACTTCAACCACCGCAATGGAATGGAGGGAATTTAGAAAGTTGGTAGAAGCGGCTTTGAGGGTCGAGAGAAGCATATCAG
AAGGGAAGAGTCGAAAAGTGCCCTCAAATACTTCAAGTAGCAAACAATTGAAGGATGATAAGGAATTTGCGTCAGGAGTAGCTAAACATGGAACCTCCAAATCAGAGTTT
TATGGAAAATCAAGTTCTAAAGCTGGTTCAAGCAACACGAAGCAAGTGACAGGTAGCCCAGTTTCTATTGTAGGATCCACAAGTGAGCGGGAAGAATCTACTTTCAGCCA
TGATTTGTACTCTCAATGCCAAAATTATGGTAAGTTACATCGAGGTCGATGTTTGAAAGGAACAGATATTAGTTATAATTGCAAGCAATCAAACCACATGAGGAAAGAGT
ATCCATTGTTAATGCAAAGGGACAACGATTTGCAAGGATCGTTTTCACAGCTAGAGAAATCAAAAGTACAACTTGGTAGAGGGAAATATATGATGCAAGCAAGACCAACT
ATAACATGTAGTGGACAGTCTAAGATGGGTGCTAGTGAGCCAAGATCAAAAAGACAAGTTGGAACGCCCGTTCGCAAGGAAAAGTTTCCATTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGATGCTCATAAAAGACCCGCATAGAAATTTTAAAATTGAGCGATTAAAAGCTTTAGGAGCAACTATATTTGCTGGGACAACAAACCCAGCTGATGCTAAGATATG
GATGGACATGCTAGAAAAATGTTTCGATGTGATGAGATGCCCAGAAGATAGAAAAGTGAGACTTGCTACATTTCTACTTCAAGAAGGAGCGAGTGTTTGGTGGAATTCAG
TGAGAAGCAAACGATTAGGGTCTGAAATAACCAATTGGAACGAGTTCAAGAAAGCATTTTATGAAGAATATTATCCACAATCCTATAAGGATGAAAAGCGAAGTGAATTC
TTGAAATTGGTGCAAGGATCAACAACTGTTGCAGAGTACGAGAAGAAGTATATTGAACTTTCAAAGTATGCCTCAGACCTCATCGAAGACGAGAAAGATAGATGTAGAAG
GTTTGAAGTTGGCCTACGAGAGGAGATTCGAACTTCAACCACCGCAATGGAATGGAGGGAATTTAGAAAGTTGGTAGAAGCGGCTTTGAGGGTCGAGAGAAGCATATCAG
AAGGGAAGAGTCGAAAAGTGCCCTCAAATACTTCAAGTAGCAAACAATTGAAGGATGATAAGGAATTTGCGTCAGGAGTAGCTAAACATGGAACCTCCAAATCAGAGTTT
TATGGAAAATCAAGTTCTAAAGCTGGTTCAAGCAACACGAAGCAAGTGACAGGTAGCCCAGTTTCTATTGTAGGATCCACAAGTGAGCGGGAAGAATCTACTTTCAGCCA
TGATTTGTACTCTCAATGCCAAAATTATGGTAAGTTACATCGAGGTCGATGTTTGAAAGGAACAGATATTAGTTATAATTGCAAGCAATCAAACCACATGAGGAAAGAGT
ATCCATTGTTAATGCAAAGGGACAACGATTTGCAAGGATCGTTTTCACAGCTAGAGAAATCAAAAGTACAACTTGGTAGAGGGAAATATATGATGCAAGCAAGACCAACT
ATAACATGTAGTGGACAGTCTAAGATGGGTGCTAGTGAGCCAAGATCAAAAAGACAAGTTGGAACGCCCGTTCGCAAGGAAAAGTTTCCATTCTGA
Protein sequenceShow/hide protein sequence
MKMLIKDPHRNFKIERLKALGATIFAGTTNPADAKIWMDMLEKCFDVMRCPEDRKVRLATFLLQEGASVWWNSVRSKRLGSEITNWNEFKKAFYEEYYPQSYKDEKRSEF
LKLVQGSTTVAEYEKKYIELSKYASDLIEDEKDRCRRFEVGLREEIRTSTTAMEWREFRKLVEAALRVERSISEGKSRKVPSNTSSSKQLKDDKEFASGVAKHGTSKSEF
YGKSSSKAGSSNTKQVTGSPVSIVGSTSEREESTFSHDLYSQCQNYGKLHRGRCLKGTDISYNCKQSNHMRKEYPLLMQRDNDLQGSFSQLEKSKVQLGRGKYMMQARPT
ITCSGQSKMGASEPRSKRQVGTPVRKEKFPF