CuGenDBv2

Tan0020765 (gene) of Snake gourd v1 genome

Gene ID: Tan0020765
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Reverse transcriptase
Genome location: LG08:35859211..35865578
RNA-Seq Expression: Tan0020765
Synteny: Tan0020765
Gene Ontology terms:
    GO:0015074 - DNA integration (biological process)
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
    IPR001878 - Zinc finger, CCHC-type
    IPR021109 - Aspartic peptidase domain superfamily
    IPR036875 - Zinc finger, CCHC-type superfamily
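
The genome location field follows a `seqid:start..end` pattern. As a minimal sketch, the snippet below parses that string and reports the span length. It assumes the coordinates are 1-based and inclusive, which the page does not state; `Location` and `parse_location` are hypothetical names introduced only for this illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates assumed 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so both endpoints count.
        return self.end - self.start + 1


# Matches strings such as "LG08:35859211..35865578".
_LOC_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into a Location."""
    match = _LOC_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start exceeds end in {text!r}")
    return Location(match["seqid"], start, end)


loc = parse_location("LG08:35859211..35865578")
print(loc.seqid, loc.start, loc.end, loc.length)
# prints: LG08 35859211 35865578 6368
```

Under that assumption the gene spans 6368 bp on LG08, which is longer than the 1350-nt mRNA sequence listed for this gene.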


Homology
GenBank top hits
XP_023520282.1 uncharacterized protein LOC111783592 [Cucurbita pepo subsp. pepo]; e value: 8.3e-129; % identity: 80.6
Query:  KNQEIYEKPKANVEKGESSKKGKEKIDESNVRNRDLKCWKCQGVGHYSRDCPNRRIMTIREGEIVTDDEEEDEVKEENDESENEELSEEDPANLSLVARR
        +NQEI EKP+A  EKGESS+ GKEK+++SNVRNRDLKCW+CQGVGHYSRDCPN RIMTI+EGEIVTDDE  D++ EE DES  EE SEEDP ++SLV RR
Subjt:  KNQEIYEKPKANVEKGESSKKGKEKIDESNVRNRDLKCWKCQGVGHYSRDCPNRRIMTIREGEIVTDDEEEDEVKEENDESENEELSEEDPANLSLVARR

Query:  ALSTQIKEDSLDQRENLFHTRCLIQSMPCSVVIDSGSCTNVVSTILVKRLNLETKPHPRPYKLQWLNDCAQVRVSKQALVSFTIGKYNDDVLCDVVSMHA
        AL+T IKED LDQRENLF TRCL+QS+PCSVVIDSGSCTNVVS+ILVKRLNL+T+PHPRPYKLQWLNDC +VRV++Q LVSFTIGKY DDVLCDVVSMH 
Subjt:  ALSTQIKEDSLDQRENLFHTRCLIQSMPCSVVIDSGSCTNVVSTILVKRLNLETKPHPRPYKLQWLNDCAQVRVSKQALVSFTIGKYNDDVLCDVVSMHA

Query:  GDLLLGRPWQFDRRVVYDGYANRYSFTYNGRKTTLVPLSPKDVFIDQCKLEKKRQEADAKAKSENEIIEKESREKKSLSEKQESSNQPRGKNERKAKIV
        GDLLLGRPWQFDRRV+YDGYANRYSFT+NGRKTTLVPLSPKDVFID CKLEKKRQEADAKA+     IEKES EK SLSEKQES+ QPR K ERKAK V
Subjt:  GDLLLGRPWQFDRRVVYDGYANRYSFTYNGRKTTLVPLSPKDVFIDQCKLEKKRQEADAKAKSENEIIEKESREKKSLSEKQESSNQPRGKNERKAKIV

XP_023520835.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111784339 [Cucurbita pepo subsp. pepo]; e value: 5.4e-128; % identity: 80.27
Query:  KNQEIYEKPKANVEKGESSKKGKEKIDESNVRNRDLKCWKCQGVGHYSRDCPNRRIMTIREGEIVTDDEEEDEVKEENDESENEELSEEDPANLSLVARR
        +N EI EKP+A  EKGESS+ GKEK+++SNVRNRDLKCW+CQGVGHYSRDCPN RIMTI+EGEIVTDDE  D++ EE DES  EE SEEDP ++SLV RR
Subjt:  KNQEIYEKPKANVEKGESSKKGKEKIDESNVRNRDLKCWKCQGVGHYSRDCPNRRIMTIREGEIVTDDEEEDEVKEENDESENEELSEEDPANLSLVARR

Query:  ALSTQIKEDSLDQRENLFHTRCLIQSMPCSVVIDSGSCTNVVSTILVKRLNLETKPHPRPYKLQWLNDCAQVRVSKQALVSFTIGKYNDDVLCDVVSMHA
        AL+T IKED LDQRENLF TRCL+QS+PCSVVIDSGSCTNVVS+ILVKRLNL+T+PHPRPYKLQWLNDC +VRV++Q LVSFTIGKY DDVLCDVVSMH 
Subjt:  ALSTQIKEDSLDQRENLFHTRCLIQSMPCSVVIDSGSCTNVVSTILVKRLNLETKPHPRPYKLQWLNDCAQVRVSKQALVSFTIGKYNDDVLCDVVSMHA

Query:  GDLLLGRPWQFDRRVVYDGYANRYSFTYNGRKTTLVPLSPKDVFIDQCKLEKKRQEADAKAKSENEIIEKESREKKSLSEKQESSNQPRGKNERKAKIV
        GDLLLGRPWQFDRRV+YDGYANRYSFT+NGRKTTLVPLSPKDVFID CKLEKKRQEADAKA+     IEKES EK SLSEKQES+ QPR K ERKAK V
Subjt:  GDLLLGRPWQFDRRVVYDGYANRYSFTYNGRKTTLVPLSPKDVFIDQCKLEKKRQEADAKAKSENEIIEKESREKKSLSEKQESSNQPRGKNERKAKIV

XP_023521183.1 uncharacterized protein LOC111784872 [Cucurbita pepo subsp. pepo]; e value: 5.4e-128; % identity: 80.27
Query:  KNQEIYEKPKANVEKGESSKKGKEKIDESNVRNRDLKCWKCQGVGHYSRDCPNRRIMTIREGEIVTDDEEEDEVKEENDESENEELSEEDPANLSLVARR
        +N EI EKP+A  EKGESS+ GKEK+++SNVRNRDLKCW+CQGVGHYSRDCPN RIMTI+EGEIVTDDE  D++ EE DES  EE SEEDP ++SLV RR
Subjt:  KNQEIYEKPKANVEKGESSKKGKEKIDESNVRNRDLKCWKCQGVGHYSRDCPNRRIMTIREGEIVTDDEEEDEVKEENDESENEELSEEDPANLSLVARR

Query:  ALSTQIKEDSLDQRENLFHTRCLIQSMPCSVVIDSGSCTNVVSTILVKRLNLETKPHPRPYKLQWLNDCAQVRVSKQALVSFTIGKYNDDVLCDVVSMHA
        AL+T IKED LDQRENLF TRCL+QS+PCSVVIDSGSCTNVVS+ILVKRLNL+T+PHPRPYKLQWLNDC +VRV++Q LVSFTIGKY DDVLCDVVSMH 
Subjt:  ALSTQIKEDSLDQRENLFHTRCLIQSMPCSVVIDSGSCTNVVSTILVKRLNLETKPHPRPYKLQWLNDCAQVRVSKQALVSFTIGKYNDDVLCDVVSMHA

Query:  GDLLLGRPWQFDRRVVYDGYANRYSFTYNGRKTTLVPLSPKDVFIDQCKLEKKRQEADAKAKSENEIIEKESREKKSLSEKQESSNQPRGKNERKAKIV
        GDLLLGRPWQFDRRV+YDGYANRYSFT+NGRKTTLVPLSPKDVFID CKLEKKRQEADAKA+     IEKES EK SLSEKQES+ QPR K ERKAK V
Subjt:  GDLLLGRPWQFDRRVVYDGYANRYSFTYNGRKTTLVPLSPKDVFIDQCKLEKKRQEADAKAKSENEIIEKESREKKSLSEKQESSNQPRGKNERKAKIV

XP_023530046.1 uncharacterized protein LOC111792716 [Cucurbita pepo subsp. pepo]; e value: 8.9e-123; % identity: 80.92
Query:  KNQEIYEKPKANVEKGESSKKGKEKIDESNVRNRDLKCWKCQGVGHYSRDCPNRRIMTIREGEIVTDDEEEDEVKEENDESENEELSEEDPANLSLVARR
        +N EI EKP+A  EKGESS+ GKEK+++SNVRNRDLKCW+CQGVGHYSRDCPN RIMTI+EGEIVTDDE  D++ EE DES  EE SEEDP ++SLV RR
Subjt:  KNQEIYEKPKANVEKGESSKKGKEKIDESNVRNRDLKCWKCQGVGHYSRDCPNRRIMTIREGEIVTDDEEEDEVKEENDESENEELSEEDPANLSLVARR

Query:  ALSTQIKEDSLDQRENLFHTRCLIQSMPCSVVIDSGSCTNVVSTILVKRLNLETKPHPRPYKLQWLNDCAQVRVSKQALVSFTIGKYNDDVLCDVVSMHA
        AL+T IKED LDQRENLF TRCL+QS+PCSVVIDSGSCTNVVS+ILVKRLNL+T+PHPRPYKLQWLNDC +VRV++Q LVSFTIGKY DDVLCDVVSMH 
Subjt:  ALSTQIKEDSLDQRENLFHTRCLIQSMPCSVVIDSGSCTNVVSTILVKRLNLETKPHPRPYKLQWLNDCAQVRVSKQALVSFTIGKYNDDVLCDVVSMHA

Query:  GDLLLGRPWQFDRRVVYDGYANRYSFTYNGRKTTLVPLSPKDVFIDQCKLEKKRQEADAKAKSENEIIEKESREKKSLSEKQE
        GDLLLGRPWQFDRRV+YDGYANRYSFT+NGRKTTLVPLSPKDVFID CKLEKKRQEADAKA+     IEKES EK SLSEKQE
Subjt:  GDLLLGRPWQFDRRVVYDGYANRYSFTYNGRKTTLVPLSPKDVFIDQCKLEKKRQEADAKAKSENEIIEKESREKKSLSEKQE

XP_023553652.1 uncharacterized protein LOC111811140 [Cucurbita pepo subsp. pepo]; e value: 5.4e-128; % identity: 80.27
Query:  KNQEIYEKPKANVEKGESSKKGKEKIDESNVRNRDLKCWKCQGVGHYSRDCPNRRIMTIREGEIVTDDEEEDEVKEENDESENEELSEEDPANLSLVARR
        +N EI EKP+A  EKGESS+ GKEK+++SNVRNRDLKCW+CQGVGHYSRDCPN RIMTI+EGEIVTDDE  D++ EE DES  EE SEEDP ++SLV RR
Subjt:  KNQEIYEKPKANVEKGESSKKGKEKIDESNVRNRDLKCWKCQGVGHYSRDCPNRRIMTIREGEIVTDDEEEDEVKEENDESENEELSEEDPANLSLVARR

Query:  ALSTQIKEDSLDQRENLFHTRCLIQSMPCSVVIDSGSCTNVVSTILVKRLNLETKPHPRPYKLQWLNDCAQVRVSKQALVSFTIGKYNDDVLCDVVSMHA
        AL+T IKED LDQRENLF TRCL+QS+PCSVVIDSGSCTNVVS+ILVKRLNL+T+PHPRPYKLQWLNDC +VRV++Q LVSFTIGKY DDVLCDVVSMH 
Subjt:  ALSTQIKEDSLDQRENLFHTRCLIQSMPCSVVIDSGSCTNVVSTILVKRLNLETKPHPRPYKLQWLNDCAQVRVSKQALVSFTIGKYNDDVLCDVVSMHA

Query:  GDLLLGRPWQFDRRVVYDGYANRYSFTYNGRKTTLVPLSPKDVFIDQCKLEKKRQEADAKAKSENEIIEKESREKKSLSEKQESSNQPRGKNERKAKIV
        GDLLLGRPWQFDRRV+YDGYANRYSFT+NGRKTTLVPLSPKDVFID CKLEKKRQEADAKA+     IEKES EK SLSEKQES+ QPR K ERKAK V
Subjt:  GDLLLGRPWQFDRRVVYDGYANRYSFTYNGRKTTLVPLSPKDVFIDQCKLEKKRQEADAKAKSENEIIEKESREKKSLSEKQESSNQPRGKNERKAKIV

TrEMBL top hits
A0A6J1EQJ1 uncharacterized protein LOC111436530; e value: 1.6e-93; % identity: 80.37
Query:  KNQEIYEKPKANVEKGESSKKGKEKIDESNVRNRDLKCWKCQGVGHYSRDCPNRRIMTIREGEIVTDDEEEDEVKEENDESENEELSEEDPANLSLVARR
        +NQEI EKP+A  EKGESS+ GKEK+++SNVRNRDLKCW+CQGVGHYSRDCPN RIMTI+EGEIVTDDE  D++ EE DES  EE SEEDP ++SLV RR
Subjt:  KNQEIYEKPKANVEKGESSKKGKEKIDESNVRNRDLKCWKCQGVGHYSRDCPNRRIMTIREGEIVTDDEEEDEVKEENDESENEELSEEDPANLSLVARR

Query:  ALSTQIKEDSLDQRENLFHTRCLIQSMPCSVVIDSGSCTNVVSTILVKRLNLETKPHPRPYKLQWLNDCAQVRVSKQALVSFTIGKYNDDVLCDVVSMHA
        AL+T IKED LDQRENLF TRCL+QS+PCSVVIDSGSCTNVVS+ILVKRLNL+T+PHPRPYKLQWLNDC +VRV++Q LVSFTIGKY DDVLCDVVSMH 
Subjt:  ALSTQIKEDSLDQRENLFHTRCLIQSMPCSVVIDSGSCTNVVSTILVKRLNLETKPHPRPYKLQWLNDCAQVRVSKQALVSFTIGKYNDDVLCDVVSMHA

Query:  GDLLLGRPWQFDRR
        GDLLLGRPWQFDRR
Subjt:  GDLLLGRPWQFDRR

A0A6J1EVV9 uncharacterized protein LOC111436463; e value: 3.0e-92; % identity: 79.91
Query:  KNQEIYEKPKANVEKGESSKKGKEKIDESNVRNRDLKCWKCQGVGHYSRDCPNRRIMTIREGEIVTDDEEEDEVKEENDESENEELSEEDPANLSLVARR
        +NQEI EKP+A  EKGESS+ GKEK+++SNVRNRDLKCW+ QGVGHYSRDCPN RIMTI+EGEIVTDDE  D++ EE DES  EE SEEDP ++SLV RR
Subjt:  KNQEIYEKPKANVEKGESSKKGKEKIDESNVRNRDLKCWKCQGVGHYSRDCPNRRIMTIREGEIVTDDEEEDEVKEENDESENEELSEEDPANLSLVARR

Query:  ALSTQIKEDSLDQRENLFHTRCLIQSMPCSVVIDSGSCTNVVSTILVKRLNLETKPHPRPYKLQWLNDCAQVRVSKQALVSFTIGKYNDDVLCDVVSMHA
        AL+T IKED LDQRENLF TRCL+QS+PCSVVIDSGSCTNVVS+ILVKRLNL+T+PHPRPYKLQWLNDC +VRV++Q LVSFTIGKY DDVLCDVVSMH 
Subjt:  ALSTQIKEDSLDQRENLFHTRCLIQSMPCSVVIDSGSCTNVVSTILVKRLNLETKPHPRPYKLQWLNDCAQVRVSKQALVSFTIGKYNDDVLCDVVSMHA

Query:  GDLLLGRPWQFDRR
        GDLLLGRPWQFDRR
Subjt:  GDLLLGRPWQFDRR

A0A6J1G2Q3 uncharacterized protein LOC111450286; e value: 1.6e-93; % identity: 80.37
Query:  KNQEIYEKPKANVEKGESSKKGKEKIDESNVRNRDLKCWKCQGVGHYSRDCPNRRIMTIREGEIVTDDEEEDEVKEENDESENEELSEEDPANLSLVARR
        +NQEI EKP+A  EKGESS+ GKEK+++SNVRNRDLKCW+CQGVGHYSRDCPN RIMTI+EGEIVTDDE  D++ EE DES  EE SEEDP ++SLV RR
Subjt:  KNQEIYEKPKANVEKGESSKKGKEKIDESNVRNRDLKCWKCQGVGHYSRDCPNRRIMTIREGEIVTDDEEEDEVKEENDESENEELSEEDPANLSLVARR

Query:  ALSTQIKEDSLDQRENLFHTRCLIQSMPCSVVIDSGSCTNVVSTILVKRLNLETKPHPRPYKLQWLNDCAQVRVSKQALVSFTIGKYNDDVLCDVVSMHA
        AL+T IKED LDQRENLF TRCL+QS+PCSVVIDSGSCTNVVS+ILVKRLNL+T+PHPRPYKLQWLNDC +VRV++Q LVSFTIGKY DDVLCDVVSMH 
Subjt:  ALSTQIKEDSLDQRENLFHTRCLIQSMPCSVVIDSGSCTNVVSTILVKRLNLETKPHPRPYKLQWLNDCAQVRVSKQALVSFTIGKYNDDVLCDVVSMHA

Query:  GDLLLGRPWQFDRR
        GDLLLGRPWQFDRR
Subjt:  GDLLLGRPWQFDRR

A0A6J1I622 LOW QUALITY PROTEIN: uncharacterized protein LOC111469947; e value: 1.0e-92; % identity: 79.91
Query:  KNQEIYEKPKANVEKGESSKKGKEKIDESNVRNRDLKCWKCQGVGHYSRDCPNRRIMTIREGEIVTDDEEEDEVKEENDESENEELSEEDPANLSLVARR
        +NQEI EKP+A  EKGESS+ GKEK+++SNVRNRDLKCW+CQGVGHYSRDCPN RIMTI+EGEIVTDDE  D++ EE DES  EE SEEDP ++SLV R 
Subjt:  KNQEIYEKPKANVEKGESSKKGKEKIDESNVRNRDLKCWKCQGVGHYSRDCPNRRIMTIREGEIVTDDEEEDEVKEENDESENEELSEEDPANLSLVARR

Query:  ALSTQIKEDSLDQRENLFHTRCLIQSMPCSVVIDSGSCTNVVSTILVKRLNLETKPHPRPYKLQWLNDCAQVRVSKQALVSFTIGKYNDDVLCDVVSMHA
        AL+T IKED LDQRENLF TRCL+QS+PCSVVIDSGSCTNVVS+ILVKRLNL+T+PHPRPYKLQWLNDC +VRV++Q LVSFTIGKY DDVLCDVVSMH 
Subjt:  ALSTQIKEDSLDQRENLFHTRCLIQSMPCSVVIDSGSCTNVVSTILVKRLNLETKPHPRPYKLQWLNDCAQVRVSKQALVSFTIGKYNDDVLCDVVSMHA

Query:  GDLLLGRPWQFDRR
        GDLLLGRPWQFDRR
Subjt:  GDLLLGRPWQFDRR

A0A6J1I8S0 uncharacterized protein LOC111472489; e value: 1.0e-92; % identity: 79.91
Query:  KNQEIYEKPKANVEKGESSKKGKEKIDESNVRNRDLKCWKCQGVGHYSRDCPNRRIMTIREGEIVTDDEEEDEVKEENDESENEELSEEDPANLSLVARR
        +NQEI  KP+A  EKGESS+ GKEK+++SNVRNRDLKCW+CQGVGHYSRDCPN RIMTI+EGEIVTDDE  D++ EE DES  EE SEEDP ++SLV RR
Subjt:  KNQEIYEKPKANVEKGESSKKGKEKIDESNVRNRDLKCWKCQGVGHYSRDCPNRRIMTIREGEIVTDDEEEDEVKEENDESENEELSEEDPANLSLVARR

Query:  ALSTQIKEDSLDQRENLFHTRCLIQSMPCSVVIDSGSCTNVVSTILVKRLNLETKPHPRPYKLQWLNDCAQVRVSKQALVSFTIGKYNDDVLCDVVSMHA
        AL+T IKED LDQRENLF TRCL+QS+PCSVVIDSGSCTNVVS+ILVKRLNL+T+PHPRPYKLQWLNDC +VRV++Q LVSFTIGKY DDVLCDVVSMH 
Subjt:  ALSTQIKEDSLDQRENLFHTRCLIQSMPCSVVIDSGSCTNVVSTILVKRLNLETKPHPRPYKLQWLNDCAQVRVSKQALVSFTIGKYNDDVLCDVVSMHA

Query:  GDLLLGRPWQFDRR
        GDLLLGRPWQFDRR
Subjt:  GDLLLGRPWQFDRR

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGGAGAGAATGATTCGTGGAATAGAAGAGTTATCGGAGAGGAGAATTCCACCCCCTCCACAACAACGTGAGGATGACCATGACAACGAATATGAGGGAGGAAGTTACGA
TCAACTAGAAGATGACCAAGTTACATTAATAGCAAAAAACCAAGAGATATATGAGAAACCTAAAGCAAATGTAGAGAAAGGGGAGAGTTCTAAAAAGGGGAAAGAGAAGA
TAGATGAATCTAATGTGCGAAATAGGGATTTGAAATGTTGGAAATGTCAAGGGGTAGGTCACTATAGTAGAGATTGCCCTAATAGGAGAATTATGACCATTAGAGAGGGA
GAGATTGTGACTGATGATGAAGAGGAAGATGAGGTTAAGGAAGAAAATGATGAGAGTGAGAATGAGGAGTTAAGCGAAGAGGATCCCGCAAACTTGTCCTTAGTTGCTAG
GAGAGCTTTAAGCACCCAAATTAAGGAGGATAGTCTAGACCAAAGAGAGAACTTGTTTCACACTAGGTGCCTTATTCAATCTATGCCTTGTAGTGTGGTCATTGATAGTG
GTAGTTGCACCAATGTTGTGAGTACAATTCTGGTCAAGAGGCTTAATTTAGAGACCAAACCACATCCTAGACCATATAAACTTCAATGGTTGAATGATTGTGCGCAAGTA
AGGGTGAGTAAGCAAGCTCTTGTTTCTTTTACCATTGGAAAGTATAATGATGATGTTTTGTGTGATGTTGTATCCATGCATGCTGGAGATTTATTGTTGGGGAGGCCTTG
GCAATTTGATCGTCGGGTAGTATATGATGGGTATGCAAATCGTTACTCTTTTACTTATAATGGTAGAAAAACTACTCTTGTTCCATTGTCTCCAAAAGATGTATTTATTG
ATCAATGCAAACTTGAAAAAAAAAGGCAAGAGGCTGATGCAAAAGCAAAAAGTGAAAATGAAATAATAGAAAAAGAATCGAGAGAAAAAAAGAGTTTGAGTGAAAAGCAA
GAGAGTAGCAATCAGCCTAGAGGAAAAAATGAGAGAAAAGCCAAAATAGTTAAGGGTCTCAAAGGCTATATAAAGCCCTCTCTTCTTCCATTTTCTTTTCTCTTTAGCTT
CAGTTTTTGCAGATCCGAGCATGTTCTTATTCTGAATAAAAATACAAAAATATCCTTTCCGTTCCGTATTTCATTATATCGTCTTCTACAAGATTTGAGATCTAGATCTG
ATCTACAATTTAAGAGTTATCGGAGCGTATACGGAGGTTGGAGGTTCAAAACCACGCTAGGAGGAGAATTCCACCCCCTCCACAACAACGTGAGGATGACCATGACAACG
AATATGAGGGAGGAAGTTACGATCAACTAG
mRNA sequence
ATGGAGAGAATGATTCGTGGAATAGAAGAGTTATCGGAGAGGAGAATTCCACCCCCTCCACAACAACGTGAGGATGACCATGACAACGAATATGAGGGAGGAAGTTACGA
TCAACTAGAAGATGACCAAGTTACATTAATAGCAAAAAACCAAGAGATATATGAGAAACCTAAAGCAAATGTAGAGAAAGGGGAGAGTTCTAAAAAGGGGAAAGAGAAGA
TAGATGAATCTAATGTGCGAAATAGGGATTTGAAATGTTGGAAATGTCAAGGGGTAGGTCACTATAGTAGAGATTGCCCTAATAGGAGAATTATGACCATTAGAGAGGGA
GAGATTGTGACTGATGATGAAGAGGAAGATGAGGTTAAGGAAGAAAATGATGAGAGTGAGAATGAGGAGTTAAGCGAAGAGGATCCCGCAAACTTGTCCTTAGTTGCTAG
GAGAGCTTTAAGCACCCAAATTAAGGAGGATAGTCTAGACCAAAGAGAGAACTTGTTTCACACTAGGTGCCTTATTCAATCTATGCCTTGTAGTGTGGTCATTGATAGTG
GTAGTTGCACCAATGTTGTGAGTACAATTCTGGTCAAGAGGCTTAATTTAGAGACCAAACCACATCCTAGACCATATAAACTTCAATGGTTGAATGATTGTGCGCAAGTA
AGGGTGAGTAAGCAAGCTCTTGTTTCTTTTACCATTGGAAAGTATAATGATGATGTTTTGTGTGATGTTGTATCCATGCATGCTGGAGATTTATTGTTGGGGAGGCCTTG
GCAATTTGATCGTCGGGTAGTATATGATGGGTATGCAAATCGTTACTCTTTTACTTATAATGGTAGAAAAACTACTCTTGTTCCATTGTCTCCAAAAGATGTATTTATTG
ATCAATGCAAACTTGAAAAAAAAAGGCAAGAGGCTGATGCAAAAGCAAAAAGTGAAAATGAAATAATAGAAAAAGAATCGAGAGAAAAAAAGAGTTTGAGTGAAAAGCAA
GAGAGTAGCAATCAGCCTAGAGGAAAAAATGAGAGAAAAGCCAAAATAGTTAAGGGTCTCAAAGGCTATATAAAGCCCTCTCTTCTTCCATTTTCTTTTCTCTTTAGCTT
CAGTTTTTGCAGATCCGAGCATGTTCTTATTCTGAATAAAAATACAAAAATATCCTTTCCGTTCCGTATTTCATTATATCGTCTTCTACAAGATTTGAGATCTAGATCTG
ATCTACAATTTAAGAGTTATCGGAGCGTATACGGAGGTTGGAGGTTCAAAACCACGCTAGGAGGAGAATTCCACCCCCTCCACAACAACGTGAGGATGACCATGACAACG
AATATGAGGGAGGAAGTTACGATCAACTAG
Protein sequence
MERMIRGIEELSERRIPPPPQQREDDHDNEYEGGSYDQLEDDQVTLIAKNQEIYEKPKANVEKGESSKKGKEKIDESNVRNRDLKCWKCQGVGHYSRDCPNRRIMTIREG
EIVTDDEEEDEVKEENDESENEELSEEDPANLSLVARRALSTQIKEDSLDQRENLFHTRCLIQSMPCSVVIDSGSCTNVVSTILVKRLNLETKPHPRPYKLQWLNDCAQV
RVSKQALVSFTIGKYNDDVLCDVVSMHAGDLLLGRPWQFDRRVVYDGYANRYSFTYNGRKTTLVPLSPKDVFIDQCKLEKKRQEADAKAKSENEIIEKESREKKSLSEKQ
ESSNQPRGKNERKAKIVKGLKGYIKPSLLPFSFLFSFSFCRSEHVLILNKNTKISFPFRISLYRLLQDLRSRSDLQFKSYRSVYGGWRFKTTLGGEFHPLHNNVRMTMTT
NMREEVTIN
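
The CDS and protein sequences above can be cross-checked by translating the nucleotides with the standard genetic code. The following is a minimal sketch of that check, assuming the standard code applies to this nuclear gene; `translate` and `CODON_TABLE` are hypothetical helper names, not part of CuGenDB or any library. Only the first and last 30 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, laid out in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a nucleotide sequence in frame 0.

    Whitespace is ignored, U is treated as T, trailing bases that do not
    form a full codon are dropped, and unknown codons become 'X'.
    """
    seq = "".join(cds.split()).upper().replace("U", "T")
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*" and to_stop:
            break  # stop codon reached; do not emit it
        protein.append(aa)
    return "".join(protein)


# First and last 30 nucleotides of the CDS listed above.
print(translate("ATGGAGAGAATGATTCGTGGAATAGAAGAG"))  # prints: MERMIRGIEE
print(translate("AATATGAGGGAGGAAGTTACGATCAACTAG"))  # prints: NMREEVTIN
```

Both fragments agree with the start (`MERMIRGIEE...`) and end (`...NMREEVTIN`) of the listed protein sequence. Passing the full CDS with its line breaks should work the same way, since whitespace is stripped before translation.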