CuGenDBv2

Sgr011865 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr011865
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: tig00153107:35974..44230
RNA-Seq Expression: Sgr011865
Synteny: Sgr011865
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
InterPro domains:
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
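
The genome location field can be split into its contig name and coordinates with a few lines of Python. This is a minimal sketch, not part of the database record: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(location):
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return abs(end - start) + 1

contig, start, end = parse_location("tig00153107:35974..44230")
print(contig, start, end, span_length(start, end))
```

Under that assumption the gene spans 8257 bases on contig tig00153107.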


Homology
GenBank top hits (e value, % identity, alignment)
KAE8691458.1 hypothetical protein F3Y22_tig00110890pilonHSYRG01487 [Hibiscus syriacus] | e value: 4.1e-76 | identity: 59.4%
Query:  ICTWKTRKEYFSKEGKISCKEDSLVDTHTSTFAVPS---HVFRKFKVLVEKMTGLYIKALWLDRGKEYMSTAFMSYCGEQGIKRFLTTPYAPQQNCVVER
        IC   T + + SK   IS  +D    T        S    VF+KFKV+VEK TG +IKA+  DRG EY ST FM YC EQGI+RFLT PY+PQQN V ER
Subjt:  ICTWKTRKEYFSKEGKISCKEDSLVDTHTSTFAVPS---HVFRKFKVLVEKMTGLYIKALWLDRGKEYMSTAFMSYCGEQGIKRFLTTPYAPQQNCVVER

Query:  KNQTILNMVRLMLKSKDMPKVFWAEAMQCAVYVQNRCPHAKLTDQTPQEAWSGQKPTLSYFK---------------DKLDDKSKKYVFIGYDEKMKAYK
        KN+TIL+MVR MLK+  MPK FWAEA+QCA+YVQNRCPH KL DQTPQE WSGQKPT+S+ K                KL+DKSKKY+FIGYDEK K YK
Subjt:  KNQTILNMVRLMLKSKDMPKVFWAEAMQCAVYVQNRCPHAKLTDQTPQEAWSGQKPTLSYFK---------------DKLDDKSKKYVFIGYDEKMKAYK

Query:  PFDPTEKKVVVSRDMQVNEESSWDWNNQKEATKKKEKGESSIIAPIITLTNSQTFDDEDEPRQPRI
         FDP  KKV+VSRD+++NE S WDWNN  EA    E GESSI++P    TNS+T DDEDEPRQP+I
Subjt:  PFDPTEKKVVVSRDMQVNEESSWDWNNQKEATKKKEKGESSIIAPIITLTNSQTFDDEDEPRQPRI

KAE8702390.1 hypothetical protein F3Y22_tig00110483pilonHSYRG00411 [Hibiscus syriacus] | e value: 1.1e-76 | identity: 66.37%
Query:  VFRKFKVLVEKMTGLYIKALWLDRGKEYMSTAFMSYCGEQGIKRFLTTPYAPQQNCVVERKNQTILNMVRLMLKSKDMPKVFWAEAMQCAVYVQNRCPHA
        VF+KFKV+VEK TG +IKA+  DRG EY STAFM YC EQGI+RFLT PY+PQQN V ERKN+TIL+MVR MLKSK MPK FW EA+QCA+YVQNRCPH 
Subjt:  VFRKFKVLVEKMTGLYIKALWLDRGKEYMSTAFMSYCGEQGIKRFLTTPYAPQQNCVVERKNQTILNMVRLMLKSKDMPKVFWAEAMQCAVYVQNRCPHA

Query:  KLTDQTPQEAWSGQKPTLSYFK---------------DKLDDKSKKYVFIGYDEKMKAYKPFDPTEKKVVVSRDMQVNEESSWDWNNQKEATKKKEKGES
        KL DQTPQE WSGQKPT+S+ K                KL+DKSKKY+FIGYDEK+K YK FDP  KKV+VSRD+++NE S WDWNN  EA    E GES
Subjt:  KLTDQTPQEAWSGQKPTLSYFK---------------DKLDDKSKKYVFIGYDEKMKAYKPFDPTEKKVVVSRDMQVNEESSWDWNNQKEATKKKEKGES

Query:  SIIAPIITLTNSQTFDDEDEPRQPRI
        SI++P    TNS+T DDEDEPRQP+I
Subjt:  SIIAPIITLTNSQTFDDEDEPRQPRI

KAE8719102.1 hypothetical protein F3Y22_tig00109972pilonHSYRG00011 [Hibiscus syriacus] | e value: 8.3e-77 | identity: 60.15%
Query:  ICTWKTRKEYFSKEGKISCKEDSLVDTHTSTFAVPS---HVFRKFKVLVEKMTGLYIKALWLDRGKEYMSTAFMSYCGEQGIKRFLTTPYAPQQNCVVER
        IC   T + + SK   IS  +D    T        S    VF+KFKV+VEK TG +IKA+  DRG +Y STAFM YC EQGI+RFLT PY+PQQN V ER
Subjt:  ICTWKTRKEYFSKEGKISCKEDSLVDTHTSTFAVPS---HVFRKFKVLVEKMTGLYIKALWLDRGKEYMSTAFMSYCGEQGIKRFLTTPYAPQQNCVVER

Query:  KNQTILNMVRLMLKSKDMPKVFWAEAMQCAVYVQNRCPHAKLTDQTPQEAWSGQKPTLSYFK---------------DKLDDKSKKYVFIGYDEKMKAYK
        KN+TIL+MVR MLKSK MPK FWAEAMQCA+YVQNRCPH KL DQTPQE WSGQKPT+S+ K                KL++KSKKY+FIGYDEK K YK
Subjt:  KNQTILNMVRLMLKSKDMPKVFWAEAMQCAVYVQNRCPHAKLTDQTPQEAWSGQKPTLSYFK---------------DKLDDKSKKYVFIGYDEKMKAYK

Query:  PFDPTEKKVVVSRDMQVNEESSWDWNNQKEATKKKEKGESSIIAPIITLTNSQTFDDEDEPRQPRI
         FDP  KKV+VSRD+++NE S WDWNN  EA    E GESSI+ P    TNS+T DDEDEPRQP+I
Subjt:  PFDPTEKKVVVSRDMQVNEESSWDWNNQKEATKKKEKGESSIIAPIITLTNSQTFDDEDEPRQPRI

KAE8721174.1 hypothetical protein F3Y22_tig00016637pilonHSYRG00095 [Hibiscus syriacus] | e value: 1.8e-76 | identity: 60.15%
Query:  ICTWKTRKEYFSKEGKISCKEDSLVDTHTSTFAVPS---HVFRKFKVLVEKMTGLYIKALWLDRGKEYMSTAFMSYCGEQGIKRFLTTPYAPQQNCVVER
        IC   T + + SK   IS  +D    T        S    VF+KFKV+VEK TG +IKA+  DRG EY STAFM YC EQGI+RFLT PY+PQQN V ER
Subjt:  ICTWKTRKEYFSKEGKISCKEDSLVDTHTSTFAVPS---HVFRKFKVLVEKMTGLYIKALWLDRGKEYMSTAFMSYCGEQGIKRFLTTPYAPQQNCVVER

Query:  KNQTILNMVRLMLKSKDMPKVFWAEAMQCAVYVQNRCPHAKLTDQTPQEAWSGQKPTLSYFK---------------DKLDDKSKKYVFIGYDEKMKAYK
        KN+TIL+MVR MLKSK MPK FWAEA+QCA+YVQNRCPH KL DQTPQE WSGQKPT+S+ K                KL+DKSKKY+FIGYDEK K YK
Subjt:  KNQTILNMVRLMLKSKDMPKVFWAEAMQCAVYVQNRCPHAKLTDQTPQEAWSGQKPTLSYFK---------------DKLDDKSKKYVFIGYDEKMKAYK

Query:  PFDPTEKKVVVSRDMQVNEESSWDWNNQKEATKKKEKGESSIIAPIITLTNSQTFDDEDEPRQPRI
         FDP  KKV+VSRD+++NE S WDWNN  EA    E GESSI++P    TNS+T D EDEPRQP+I
Subjt:  PFDPTEKKVVVSRDMQVNEESSWDWNNQKEATKKKEKGESSIIAPIITLTNSQTFDDEDEPRQPRI

KAE8732752.1 hypothetical protein F3Y22_tig00001732pilonHSYRG00018 [Hibiscus syriacus] | e value: 1.2e-75 | identity: 65.93%
Query:  VFRKFKVLVEKMTGLYIKALWLDRGKEYMSTAFMSYCGEQGIKRFLTTPYAPQQNCVVERKNQTILNMVRLMLKSKDMPKVFWAEAMQCAVYVQNRCPHA
        VF+KFKV+VEK TG +IKA+  DRG EY STAF+ YC EQGI+RFLT PY+PQQN V ERKN+TIL+MVR MLKSK M K FWAEA+QCA+YVQNRCPH 
Subjt:  VFRKFKVLVEKMTGLYIKALWLDRGKEYMSTAFMSYCGEQGIKRFLTTPYAPQQNCVVERKNQTILNMVRLMLKSKDMPKVFWAEAMQCAVYVQNRCPHA

Query:  KLTDQTPQEAWSGQKPTLSYFK---------------DKLDDKSKKYVFIGYDEKMKAYKPFDPTEKKVVVSRDMQVNEESSWDWNNQKEATKKKEKGES
        KL DQTPQEAWSGQKPT+S+ K                KL+DKSKKY+FIGYDEK K YK FDP  KKV+VSRD+++NE S WDWNN  EA    E GES
Subjt:  KLTDQTPQEAWSGQKPTLSYFK---------------DKLDDKSKKYVFIGYDEKMKAYKPFDPTEKKVVVSRDMQVNEESSWDWNNQKEATKKKEKGES

Query:  SIIAPIITLTNSQTFDDEDEPRQPRI
        SI++P     NS+T DDEDEPRQP+I
Subjt:  SIIAPIITLTNSQTFDDEDEPRQPRI

TrEMBL top hits (e value, % identity, alignment)
A0A6A2ZHU4 Uncharacterized protein | e value: 2.0e-76 | identity: 59.4%
Query:  ICTWKTRKEYFSKEGKISCKEDSLVDTHTSTFAVPS---HVFRKFKVLVEKMTGLYIKALWLDRGKEYMSTAFMSYCGEQGIKRFLTTPYAPQQNCVVER
        IC   T + + SK   IS  +D    T        S    VF+KFKV+VEK TG +IKA+  DRG EY ST FM YC EQGI+RFLT PY+PQQN V ER
Subjt:  ICTWKTRKEYFSKEGKISCKEDSLVDTHTSTFAVPS---HVFRKFKVLVEKMTGLYIKALWLDRGKEYMSTAFMSYCGEQGIKRFLTTPYAPQQNCVVER

Query:  KNQTILNMVRLMLKSKDMPKVFWAEAMQCAVYVQNRCPHAKLTDQTPQEAWSGQKPTLSYFK---------------DKLDDKSKKYVFIGYDEKMKAYK
        KN+TIL+MVR MLK+  MPK FWAEA+QCA+YVQNRCPH KL DQTPQE WSGQKPT+S+ K                KL+DKSKKY+FIGYDEK K YK
Subjt:  KNQTILNMVRLMLKSKDMPKVFWAEAMQCAVYVQNRCPHAKLTDQTPQEAWSGQKPTLSYFK---------------DKLDDKSKKYVFIGYDEKMKAYK

Query:  PFDPTEKKVVVSRDMQVNEESSWDWNNQKEATKKKEKGESSIIAPIITLTNSQTFDDEDEPRQPRI
         FDP  KKV+VSRD+++NE S WDWNN  EA    E GESSI++P    TNS+T DDEDEPRQP+I
Subjt:  PFDPTEKKVVVSRDMQVNEESSWDWNNQKEATKKKEKGESSIIAPIITLTNSQTFDDEDEPRQPRI

A0A6A3AD07 Uncharacterized protein | e value: 5.2e-77 | identity: 66.37%
Query:  VFRKFKVLVEKMTGLYIKALWLDRGKEYMSTAFMSYCGEQGIKRFLTTPYAPQQNCVVERKNQTILNMVRLMLKSKDMPKVFWAEAMQCAVYVQNRCPHA
        VF+KFKV+VEK TG +IKA+  DRG EY STAFM YC EQGI+RFLT PY+PQQN V ERKN+TIL+MVR MLKSK MPK FW EA+QCA+YVQNRCPH 
Subjt:  VFRKFKVLVEKMTGLYIKALWLDRGKEYMSTAFMSYCGEQGIKRFLTTPYAPQQNCVVERKNQTILNMVRLMLKSKDMPKVFWAEAMQCAVYVQNRCPHA

Query:  KLTDQTPQEAWSGQKPTLSYFK---------------DKLDDKSKKYVFIGYDEKMKAYKPFDPTEKKVVVSRDMQVNEESSWDWNNQKEATKKKEKGES
        KL DQTPQE WSGQKPT+S+ K                KL+DKSKKY+FIGYDEK+K YK FDP  KKV+VSRD+++NE S WDWNN  EA    E GES
Subjt:  KLTDQTPQEAWSGQKPTLSYFK---------------DKLDDKSKKYVFIGYDEKMKAYKPFDPTEKKVVVSRDMQVNEESSWDWNNQKEATKKKEKGES

Query:  SIIAPIITLTNSQTFDDEDEPRQPRI
        SI++P    TNS+T DDEDEPRQP+I
Subjt:  SIIAPIITLTNSQTFDDEDEPRQPRI

A0A6A3BVR6 Uncharacterized protein | e value: 4.0e-77 | identity: 60.15%
Query:  ICTWKTRKEYFSKEGKISCKEDSLVDTHTSTFAVPS---HVFRKFKVLVEKMTGLYIKALWLDRGKEYMSTAFMSYCGEQGIKRFLTTPYAPQQNCVVER
        IC   T + + SK   IS  +D    T        S    VF+KFKV+VEK TG +IKA+  DRG +Y STAFM YC EQGI+RFLT PY+PQQN V ER
Subjt:  ICTWKTRKEYFSKEGKISCKEDSLVDTHTSTFAVPS---HVFRKFKVLVEKMTGLYIKALWLDRGKEYMSTAFMSYCGEQGIKRFLTTPYAPQQNCVVER

Query:  KNQTILNMVRLMLKSKDMPKVFWAEAMQCAVYVQNRCPHAKLTDQTPQEAWSGQKPTLSYFK---------------DKLDDKSKKYVFIGYDEKMKAYK
        KN+TIL+MVR MLKSK MPK FWAEAMQCA+YVQNRCPH KL DQTPQE WSGQKPT+S+ K                KL++KSKKY+FIGYDEK K YK
Subjt:  KNQTILNMVRLMLKSKDMPKVFWAEAMQCAVYVQNRCPHAKLTDQTPQEAWSGQKPTLSYFK---------------DKLDDKSKKYVFIGYDEKMKAYK

Query:  PFDPTEKKVVVSRDMQVNEESSWDWNNQKEATKKKEKGESSIIAPIITLTNSQTFDDEDEPRQPRI
         FDP  KKV+VSRD+++NE S WDWNN  EA    E GESSI+ P    TNS+T DDEDEPRQP+I
Subjt:  PFDPTEKKVVVSRDMQVNEESSWDWNNQKEATKKKEKGESSIIAPIITLTNSQTFDDEDEPRQPRI

A0A6A3BX58 Uncharacterized protein | e value: 8.9e-77 | identity: 60.15%
Query:  ICTWKTRKEYFSKEGKISCKEDSLVDTHTSTFAVPS---HVFRKFKVLVEKMTGLYIKALWLDRGKEYMSTAFMSYCGEQGIKRFLTTPYAPQQNCVVER
        IC   T + + SK   IS  +D    T        S    VF+KFKV+VEK TG +IKA+  DRG EY STAFM YC EQGI+RFLT PY+PQQN V ER
Subjt:  ICTWKTRKEYFSKEGKISCKEDSLVDTHTSTFAVPS---HVFRKFKVLVEKMTGLYIKALWLDRGKEYMSTAFMSYCGEQGIKRFLTTPYAPQQNCVVER

Query:  KNQTILNMVRLMLKSKDMPKVFWAEAMQCAVYVQNRCPHAKLTDQTPQEAWSGQKPTLSYFK---------------DKLDDKSKKYVFIGYDEKMKAYK
        KN+TIL+MVR MLKSK MPK FWAEA+QCA+YVQNRCPH KL DQTPQE WSGQKPT+S+ K                KL+DKSKKY+FIGYDEK K YK
Subjt:  KNQTILNMVRLMLKSKDMPKVFWAEAMQCAVYVQNRCPHAKLTDQTPQEAWSGQKPTLSYFK---------------DKLDDKSKKYVFIGYDEKMKAYK

Query:  PFDPTEKKVVVSRDMQVNEESSWDWNNQKEATKKKEKGESSIIAPIITLTNSQTFDDEDEPRQPRI
         FDP  KKV+VSRD+++NE S WDWNN  EA    E GESSI++P    TNS+T D EDEPRQP+I
Subjt:  PFDPTEKKVVVSRDMQVNEESSWDWNNQKEATKKKEKGESSIIAPIITLTNSQTFDDEDEPRQPRI

A0A6A3CZT6 Uncharacterized protein | e value: 5.8e-76 | identity: 65.93%
Query:  VFRKFKVLVEKMTGLYIKALWLDRGKEYMSTAFMSYCGEQGIKRFLTTPYAPQQNCVVERKNQTILNMVRLMLKSKDMPKVFWAEAMQCAVYVQNRCPHA
        VF+KFKV+VEK TG +IKA+  DRG EY STAF+ YC EQGI+RFLT PY+PQQN V ERKN+TIL+MVR MLKSK M K FWAEA+QCA+YVQNRCPH 
Subjt:  VFRKFKVLVEKMTGLYIKALWLDRGKEYMSTAFMSYCGEQGIKRFLTTPYAPQQNCVVERKNQTILNMVRLMLKSKDMPKVFWAEAMQCAVYVQNRCPHA

Query:  KLTDQTPQEAWSGQKPTLSYFK---------------DKLDDKSKKYVFIGYDEKMKAYKPFDPTEKKVVVSRDMQVNEESSWDWNNQKEATKKKEKGES
        KL DQTPQEAWSGQKPT+S+ K                KL+DKSKKY+FIGYDEK K YK FDP  KKV+VSRD+++NE S WDWNN  EA    E GES
Subjt:  KLTDQTPQEAWSGQKPTLSYFK---------------DKLDDKSKKYVFIGYDEKMKAYKPFDPTEKKVVVSRDMQVNEESSWDWNNQKEATKKKEKGES

Query:  SIIAPIITLTNSQTFDDEDEPRQPRI
        SI++P     NS+T DDEDEPRQP+I
Subjt:  SIIAPIITLTNSQTFDDEDEPRQPRI

SwissProt top hits (e value, % identity, alignment)
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 2.1e-30 | identity: 37.26%
Query:  VFRKFKVLVEKMTGLYIKALWLDRGKEYMSTAFMSYCGEQGIKRFLTTPYAPQQNCVVERKNQTILNMVRLMLKSKDMPKVFWAEAMQCAVYVQNRCPHA
        VF+KF  LVE+ TG  +K L  D G EY S  F  YC   GI+   T P  PQ N V ER N+TI+  VR ML+   +PK FW EA+Q A Y+ NR P  
Subjt:  VFRKFKVLVEKMTGLYIKALWLDRGKEYMSTAFMSYCGEQGIKRFLTTPYAPQQNCVVERKNQTILNMVRLMLKSKDMPKVFWAEAMQCAVYVQNRCPHA

Query:  KLTDQTPQEAWSGQKPTLSYFK---------------DKLDDKSKKYVFIGYDEKMKAYKPFDPTEKKVVVSRDMQVNEESSWDWNNQKEATKKKEKGES
         L  + P+  W+ ++ + S+ K                KLDDKS   +FIGY ++   Y+ +DP +KKV+ SRD+   E      +  + A    EK ++
Subjt:  KLTDQTPQEAWSGQKPTLSYFK---------------DKLDDKSKKYVFIGYDEKMKAYKPFDPTEKKVVVSRDMQVNEESSWDWNNQKEATKKKEKGES

Query:  SIIAPIITLTNS
         II   +T+ ++
Subjt:  SIIAPIITLTNS

P92512 Uncharacterized mitochondrial protein AtMg00710 | e value: 4.4e-04 | identity: 37.7%
Query:  NQTILNMVRLMLKSKDMPKVFWAEAMQCAVYVQNRCPHAKLTDQTPQEAWSGQKPTLSYFK
        N+TI+  VR ML    +PK F A+A   AV++ N+ P   +    P E W    PT SY +
Subjt:  NQTILNMVRLMLKSKDMPKVFWAEAMQCAVYVQNRCPHAKLTDQTPQEAWSGQKPTLSYFK

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 1.1e-12 | identity: 28.65%
Query:  FRKFKVLVEKMTGLYIKALWLDRGKEYMSTAFMSYCGEQGIKRFLTTPYAPQQNCVVERKNQTILNMVRLMLKSKDMPKVFWAEAMQCAVYVQNRCPHAK
        F  FK L+E      I   + D G E++  A   Y  + GI    + P+ P+ N + ERK++ I+     +L    +PK +W  A   AVY+ NR P   
Subjt:  FRKFKVLVEKMTGLYIKALWLDRGKEYMSTAFMSYCGEQGIKRFLTTPYAPQQNCVVERKNQTILNMVRLMLKSKDMPKVFWAEAMQCAVYVQNRCPHAK

Query:  LTDQTPQEAWSGQKPTLS---------------YFKDKLDDKSKKYVFIGYDEKMKAYKPFDPTEKKVVVSRDMQVNE
        L  ++P +   G  P                  Y + KLDDKS++ VF+GY     AY        ++ +SR ++ +E
Subjt:  LTDQTPQEAWSGQKPTLS---------------YFKDKLDDKSKKYVFIGYDEKMKAYKPFDPTEKKVVVSRDMQVNE

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | e value: 3.6e-14 | identity: 29.29%
Query:  FKVLVEKMTGLYIKALWLDRGKEYMSTAFMSYCGEQGIKRFLTTPYAPQQNCVVERKNQTILNMVRLMLKSKDMPKVFWAEAMQCAVYVQNRCPHAKLTD
        FK LVE      I  L+ D G E++      Y  + GI  F + P+ P+ N + ERK++ I+ M   +L    +PK +W  A   AVY+ NR P   L  
Subjt:  FKVLVEKMTGLYIKALWLDRGKEYMSTAFMSYCGEQGIKRFLTTPYAPQQNCVVERKNQTILNMVRLMLKSKDMPKVFWAEAMQCAVYVQNRCPHAKLTD

Query:  QTPQEAWSGQKPTLS---------------YFKDKLDDKSKKYVFIGYDEKMKAYKPFDPTEKKVVVSRDMQVNEES-SWDWNNQKEATKKKEKGESS
        Q+P +   GQ P                  Y + KL+DKSK+  F+GY     AY        ++  SR +Q +E    +   N   +T ++++ +S+
Subjt:  QTPQEAWSGQKPTLS---------------YFKDKLDDKSKKYVFIGYDEKMKAYKPFDPTEKKVVVSRDMQVNEES-SWDWNNQKEATKKKEKGESS

Arabidopsis top hits (e value, % identity, alignment)
ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein | e value: 3.1e-05 | identity: 37.7%
Query:  NQTILNMVRLMLKSKDMPKVFWAEAMQCAVYVQNRCPHAKLTDQTPQEAWSGQKPTLSYFK
        N+TI+  VR ML    +PK F A+A   AV++ N+ P   +    P E W    PT SY +
Subjt:  NQTILNMVRLMLKSKDMPKVFWAEAMQCAVYVQNRCPHAKLTDQTPQEAWSGQKPTLSYFK


Sequences
CDS sequence
TTCGAACACTTACATTACCATGGCCTAAAAGAATTAGTAAAGAAGGATATGATTCATGGGCTACCAGACATGGATTACACCAAGAAATTTTGCGAGGGATATGTACTTGG
AAAACAAGGAAGGAATACTTTTCAAAAGAAGGCAAAATATCGTGCAAGGAGGACTCTCTAGTTGATACACACACATCGACATTTGCAGTCCCCTCACACGTGTTCAGGAA
ATTCAAGGTGTTAGTGGAGAAAATGACCGGTCTTTATATCAAGGCCTTATGGTTAGATAGAGGCAAAGAGTACATGTCAACCGCCTTCATGAGTTATTGCGGGGAACAAG
GGATCAAAAGATTCTTAACTACACCCTATGCACCTCAGCAAAACTGTGTGGTCGAGAGGAAAAATCAGACTATCCTTAACATGGTTCGATTGATGTTAAAAAGCAAAGAC
ATGCCAAAAGTTTTTTGGGCGGAAGCAATGCAATGTGCTGTGTATGTGCAAAATCGGTGCCCACATGCGAAATTGACAGACCAAACACCACAGGAAGCTTGGAGCGGACA
GAAACCTACCTTATCGTATTTCAAAGACAAACTAGATGATAAAAGCAAGAAGTACGTGTTCATCGGGTATGACGAGAAAATGAAAGCTTACAAGCCATTTGACCCAACTG
AAAAAAAGGTGGTTGTAAGTCGAGATATGCAAGTAAATGAAGAAAGCTCATGGGATTGGAACAATCAAAAGGAGGCGACGAAAAAGAAAGAAAAAGGAGAATCATCAATC
ATTGCACCGATAATCACACTGACAAACTCTCAAACATTTGATGATGAAGACGAGCCAAGACAACCAAGGATATTTCCATTCATGGGAGAGAATCGCTTGCTTCACGGACG
TGGGCTCACTGAGATAAGAGGAAGGGCAAACTTGGGCTTGGGAGGAGAATCTGTAGTAGAAATAGGAGTAGAAGAAGAAAGGACTAGCCTCAGCATGTCATTGGGTAAAA
AGAACGTGGCAACGAGAGACGGTGAAGCCTGGATGGCGTATCAGCGTTGGATGCAACAGCTGAAGGGGCTACAAGCACTGGATTATAAAGCGGTAATTTGTCCGTGTCCC
ATAATTCCCCGGCTAGCTGGTGACTCTCCACCAGTCTCTGGTCTTCGGCTTCGGCCAGACC
mRNA sequence
TTCGAACACTTACATTACCATGGCCTAAAAGAATTAGTAAAGAAGGATATGATTCATGGGCTACCAGACATGGATTACACCAAGAAATTTTGCGAGGGATATGTACTTGG
AAAACAAGGAAGGAATACTTTTCAAAAGAAGGCAAAATATCGTGCAAGGAGGACTCTCTAGTTGATACACACACATCGACATTTGCAGTCCCCTCACACGTGTTCAGGAA
ATTCAAGGTGTTAGTGGAGAAAATGACCGGTCTTTATATCAAGGCCTTATGGTTAGATAGAGGCAAAGAGTACATGTCAACCGCCTTCATGAGTTATTGCGGGGAACAAG
GGATCAAAAGATTCTTAACTACACCCTATGCACCTCAGCAAAACTGTGTGGTCGAGAGGAAAAATCAGACTATCCTTAACATGGTTCGATTGATGTTAAAAAGCAAAGAC
ATGCCAAAAGTTTTTTGGGCGGAAGCAATGCAATGTGCTGTGTATGTGCAAAATCGGTGCCCACATGCGAAATTGACAGACCAAACACCACAGGAAGCTTGGAGCGGACA
GAAACCTACCTTATCGTATTTCAAAGACAAACTAGATGATAAAAGCAAGAAGTACGTGTTCATCGGGTATGACGAGAAAATGAAAGCTTACAAGCCATTTGACCCAACTG
AAAAAAAGGTGGTTGTAAGTCGAGATATGCAAGTAAATGAAGAAAGCTCATGGGATTGGAACAATCAAAAGGAGGCGACGAAAAAGAAAGAAAAAGGAGAATCATCAATC
ATTGCACCGATAATCACACTGACAAACTCTCAAACATTTGATGATGAAGACGAGCCAAGACAACCAAGGATATTTCCATTCATGGGAGAGAATCGCTTGCTTCACGGACG
TGGGCTCACTGAGATAAGAGGAAGGGCAAACTTGGGCTTGGGAGGAGAATCTGTAGTAGAAATAGGAGTAGAAGAAGAAAGGACTAGCCTCAGCATGTCATTGGGTAAAA
AGAACGTGGCAACGAGAGACGGTGAAGCCTGGATGGCGTATCAGCGTTGGATGCAACAGCTGAAGGGGCTACAAGCACTGGATTATAAAGCGGTAATTTGTCCGTGTCCC
ATAATTCCCCGGCTAGCTGGTGACTCTCCACCAGTCTCTGGTCTTCGGCTTCGGCCAGACC
Protein sequence
RTLTLPWPKRISKEGYDSWATRHGLHQEILRGICTWKTRKEYFSKEGKISCKEDSLVDTHTSTFAVPSHVFRKFKVLVEKMTGLYIKALWLDRGKEYMSTAFMSYCGEQG
IKRFLTTPYAPQQNCVVERKNQTILNMVRLMLKSKDMPKVFWAEAMQCAVYVQNRCPHAKLTDQTPQEAWSGQKPTLSYFKDKLDDKSKKYVFIGYDEKMKAYKPFDPTE
KKVVVSRDMQVNEESSWDWNNQKEATKKKEKGESSIIAPIITLTNSQTFDDEDEPRQPRIFPFMGENRLLHGRGLTEIRGRANLGLGGESVVEIGVEEERTSLSMSLGKK
NVATRDGEAWMAYQRWMQQLKGLQALDYKAVICPCPIIPRLAGDSPPVSGLRLRPDX
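
The relationship between the CDS and protein sequences listed above can be checked with a small translation routine. The sketch below is illustrative only: `translate` and `CODON_TABLE` are hypothetical names, the standard genetic code is assumed, and the reading-frame offset of 2 is an observation from comparing the first residues of the listed protein with the start of the CDS, not something the record states. Codons the table does not recognise are written as `X`, and a trailing partial codon is ignored.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(nucleotides, offset=0):
    """Translate a DNA string codon by codon, starting at `offset`.

    Unknown codons become 'X'; a trailing partial codon is skipped.
    """
    seq = nucleotides.upper()
    residues = []
    for i in range(offset, len(seq) - 2, 3):
        residues.append(CODON_TABLE.get(seq[i:i + 3], "X"))
    return "".join(residues)

# First 35 bases of the CDS listed above; offset=2 is an assumption
# (reading starts at the third base).
cds_prefix = "TTCGAACACTTACATTACCATGGCCTAAAAGAATT"
print(translate(cds_prefix, offset=2))
```

With that offset, the translated prefix agrees with the first residues of the listed protein sequence (RTLTLPWPKRI).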