CuGenDBv2

Cmc09g0249461 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc09g0249461
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Reverse transcriptase
Genome location: CMiso1.1chr09:15356084..15356734
RNA-Seq Expression: Cmc09g0249461
Synteny: Cmc09g0249461
Gene Ontology terms:
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008194 - UDP-glycosyltransferase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR021109 - Aspartic peptidase domain superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily
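
The genome location field in this record has the form `sequence:start..end`. The following is a minimal sketch of parsing that string and computing the span, assuming the coordinates are 1-based and inclusive (a common convention, but not stated in this record). `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re

def parse_location(text):
    """Split 'seqid:start..end' into (seqid, start, end) with integer coordinates."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

seqid, start, end = parse_location("CMiso1.1chr09:15356084..15356734")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the span is 651 bp, the same length as the CDS sequence listed in this record.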


Homology
GenBank top hits | e value | % identity
KAA0025998.1 pol protein [Cucumis melo var. makuwa] | 1.7e-100 | 86.43
Query:  MLDVILLVLDMQDFNVILGMNWLSANPASIDCFRKKVVL-----TSFKFKGAGIVCIPKVISAMKASKLLSQGTWSILASVVDTREPEVSLSFEPVVREY
        MLDV LLVLDMQDF+VILGM+WLSAN A+IDCF K+VV       SFKF+GAG+VCIPKVISAMKASKLLSQGTW ILASVVD REPEVSLS EPVVREY
Subjt:  MLDVILLVLDMQDFNVILGMNWLSANPASIDCFRKKVVL-----TSFKFKGAGIVCIPKVISAMKASKLLSQGTWSILASVVDTREPEVSLSFEPVVREY

Query:  PDVFPNELLRLSPPRKIDFAIELEPGTASISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKNDGSMRLCINYRELNKVTVKNRYPLP
        PDVFP+EL  L PPR++DFAIELEPGTA ISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKK DGSMRLCI+YRELNKVTVKNRYPLP
Subjt:  PDVFPNELLRLSPPRKIDFAIELEPGTASISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKNDGSMRLCINYRELNKVTVKNRYPLP

Query:  RIDDLFDQLQGATIFSKIDLQ
        RIDDLFDQLQGAT+FSKIDL+
Subjt:  RIDDLFDQLQGATIFSKIDLQ

KAA0037369.1 pol protein [Cucumis melo var. makuwa] | 3.0e-100 | 86.43
Query:  MLDVILLVLDMQDFNVILGMNWLSANPASIDCFRKKVVL-----TSFKFKGAGIVCIPKVISAMKASKLLSQGTWSILASVVDTREPEVSLSFEPVVREY
        MLDV LLVLDMQDF+VILGM+WLSAN A+IDCF K+VV       SFKF+G GIVCIPKVISAMKASKLLSQGTW ILASVVD REPEVSLS EPVVREY
Subjt:  MLDVILLVLDMQDFNVILGMNWLSANPASIDCFRKKVVL-----TSFKFKGAGIVCIPKVISAMKASKLLSQGTWSILASVVDTREPEVSLSFEPVVREY

Query:  PDVFPNELLRLSPPRKIDFAIELEPGTASISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKNDGSMRLCINYRELNKVTVKNRYPLP
        PDVFP+EL  L PPR++DFAIELEPGTA ISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKK DGSMRLCI+YRELNKVTVKNRYPLP
Subjt:  PDVFPNELLRLSPPRKIDFAIELEPGTASISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKNDGSMRLCINYRELNKVTVKNRYPLP

Query:  RIDDLFDQLQGATIFSKIDLQ
        RIDDLFDQLQGAT+FSKIDL+
Subjt:  RIDDLFDQLQGATIFSKIDLQ

KAA0058812.1 pol protein [Cucumis melo var. makuwa] | 1.3e-100 | 86.43
Query:  MLDVILLVLDMQDFNVILGMNWLSANPASIDCFRKKVVL-----TSFKFKGAGIVCIPKVISAMKASKLLSQGTWSILASVVDTREPEVSLSFEPVVREY
        MLDV LLVLDMQDF+VILGM+WLSAN A+IDCF K+VV       SFKF+GAGIVCIPKVISAMKASKLLSQGTW ILASVVD REPEVSLS EPVVREY
Subjt:  MLDVILLVLDMQDFNVILGMNWLSANPASIDCFRKKVVL-----TSFKFKGAGIVCIPKVISAMKASKLLSQGTWSILASVVDTREPEVSLSFEPVVREY

Query:  PDVFPNELLRLSPPRKIDFAIELEPGTASISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKNDGSMRLCINYRELNKVTVKNRYPLP
        PDVFP+EL  L PPR++DFAIELEPGTA ISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKK DGSMRLCI+YRELNKVT+KNRYPLP
Subjt:  PDVFPNELLRLSPPRKIDFAIELEPGTASISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKNDGSMRLCINYRELNKVTVKNRYPLP

Query:  RIDDLFDQLQGATIFSKIDLQ
        RIDDLFDQLQGAT+FSKIDL+
Subjt:  RIDDLFDQLQGATIFSKIDLQ

KAA0059723.1 pol protein [Cucumis melo var. makuwa] | 2.3e-100 | 86.43
Query:  MLDVILLVLDMQDFNVILGMNWLSANPASIDCFRKKVVL-----TSFKFKGAGIVCIPKVISAMKASKLLSQGTWSILASVVDTREPEVSLSFEPVVREY
        MLDV LLVLDMQDF+VILGM+WLSAN A+IDCF K+VV       SFKF+GAG+VCIPKVISAMKASKLLSQGTW ILASVVD REPEVSLS EPVVREY
Subjt:  MLDVILLVLDMQDFNVILGMNWLSANPASIDCFRKKVVL-----TSFKFKGAGIVCIPKVISAMKASKLLSQGTWSILASVVDTREPEVSLSFEPVVREY

Query:  PDVFPNELLRLSPPRKIDFAIELEPGTASISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKNDGSMRLCINYRELNKVTVKNRYPLP
        PDVFP+EL  L PPR++DFAIELEPGTA ISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKK DGSMRLCI+YRELNKVTVKNRYPLP
Subjt:  PDVFPNELLRLSPPRKIDFAIELEPGTASISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKNDGSMRLCINYRELNKVTVKNRYPLP

Query:  RIDDLFDQLQGATIFSKIDLQ
        RIDDLFDQLQGAT+FSKIDL+
Subjt:  RIDDLFDQLQGATIFSKIDLQ

TYK19093.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa] | 2.3e-100 | 87.78
Query:  MLDVILLVLDMQDFNVILGMNWLSANPASIDCFRKKVVL-----TSFKFKGAGIVCIPKVISAMKASKLLSQGTWSILASVVDTREPEVSLSFEPVVREY
        MLDV LLVLDM+DF+VILGM+WLSAN ASIDCFRK+VV      TSFKFKGAGIVC+PKVISAMKASKLLSQGTW+ILASVVDTRE EVSLS EPVVREY
Subjt:  MLDVILLVLDMQDFNVILGMNWLSANPASIDCFRKKVVL-----TSFKFKGAGIVCIPKVISAMKASKLLSQGTWSILASVVDTREPEVSLSFEPVVREY

Query:  PDVFPNELLRLSPPRKIDFAIELEPGTASISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKNDGSMRLCINYRELNKVTVKNRYPLP
        PDVFPNELL L PPR+IDFAIELEPGTA ISRAPYRMAPAELKELKVQLQELLD+GFIRPSVSPWGAPVLFVKK DGSMRLCI+YRELNKVTVKNRY LP
Subjt:  PDVFPNELLRLSPPRKIDFAIELEPGTASISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKNDGSMRLCINYRELNKVTVKNRYPLP

Query:  RIDDLFDQLQGATIFSKIDLQ
        RIDDLFDQLQ AT+FSKIDL+
Subjt:  RIDDLFDQLQGATIFSKIDLQ
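
The alignments above use a BLAST-style layout in which the unlabeled middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a blank for a mismatch or gap. The sketch below estimates percent identity from one alignment block as identical columns divided by block length. `percent_identity` is a hypothetical helper, and the assumption that the reported % identity is computed this way over the whole alignment is not confirmed by this page.

```python
def percent_identity(query_line, midline):
    """Percent of alignment columns whose midline character is a letter (identical residue)."""
    length = len(query_line)
    # The midline may have lost trailing blanks; pad it to the block length.
    padded = midline.ljust(length)[:length]
    identical = sum(1 for ch in padded if ch.isalpha())
    return 100.0 * identical / length

# Final block of the alignments above (sequence text only, labels stripped).
query = "RIDDLFDQLQGATIFSKIDLQ"
midline = "RIDDLFDQLQGAT+FSKIDL+"
print(round(percent_identity(query, midline), 2))
```

This covers a single 21-column block, so the value differs from the whole-alignment figure in the table; concatenating all blocks of one hit before calling the function would give the full-length estimate.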

TrEMBL top hits | e value | % identity
A0A5A7SIJ5 Reverse transcriptase | 8.4e-101 | 86.43
Query:  MLDVILLVLDMQDFNVILGMNWLSANPASIDCFRKKVVL-----TSFKFKGAGIVCIPKVISAMKASKLLSQGTWSILASVVDTREPEVSLSFEPVVREY
        MLDV LLVLDMQDF+VILGM+WLSAN A+IDCF K+VV       SFKF+GAG+VCIPKVISAMKASKLLSQGTW ILASVVD REPEVSLS EPVVREY
Subjt:  MLDVILLVLDMQDFNVILGMNWLSANPASIDCFRKKVVL-----TSFKFKGAGIVCIPKVISAMKASKLLSQGTWSILASVVDTREPEVSLSFEPVVREY

Query:  PDVFPNELLRLSPPRKIDFAIELEPGTASISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKNDGSMRLCINYRELNKVTVKNRYPLP
        PDVFP+EL  L PPR++DFAIELEPGTA ISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKK DGSMRLCI+YRELNKVTVKNRYPLP
Subjt:  PDVFPNELLRLSPPRKIDFAIELEPGTASISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKNDGSMRLCINYRELNKVTVKNRYPLP

Query:  RIDDLFDQLQGATIFSKIDLQ
        RIDDLFDQLQGAT+FSKIDL+
Subjt:  RIDDLFDQLQGATIFSKIDLQ

A0A5A7T7M0 Reverse transcriptase | 1.4e-100 | 86.43
Query:  MLDVILLVLDMQDFNVILGMNWLSANPASIDCFRKKVVL-----TSFKFKGAGIVCIPKVISAMKASKLLSQGTWSILASVVDTREPEVSLSFEPVVREY
        MLDV LLVLDMQDF+VILGM+WLSAN A+IDCF K+VV       SFKF+G GIVCIPKVISAMKASKLLSQGTW ILASVVD REPEVSLS EPVVREY
Subjt:  MLDVILLVLDMQDFNVILGMNWLSANPASIDCFRKKVVL-----TSFKFKGAGIVCIPKVISAMKASKLLSQGTWSILASVVDTREPEVSLSFEPVVREY

Query:  PDVFPNELLRLSPPRKIDFAIELEPGTASISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKNDGSMRLCINYRELNKVTVKNRYPLP
        PDVFP+EL  L PPR++DFAIELEPGTA ISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKK DGSMRLCI+YRELNKVTVKNRYPLP
Subjt:  PDVFPNELLRLSPPRKIDFAIELEPGTASISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKNDGSMRLCINYRELNKVTVKNRYPLP

Query:  RIDDLFDQLQGATIFSKIDLQ
        RIDDLFDQLQGAT+FSKIDL+
Subjt:  RIDDLFDQLQGATIFSKIDLQ

A0A5A7USG7 Reverse transcriptase | 6.5e-101 | 86.43
Query:  MLDVILLVLDMQDFNVILGMNWLSANPASIDCFRKKVVL-----TSFKFKGAGIVCIPKVISAMKASKLLSQGTWSILASVVDTREPEVSLSFEPVVREY
        MLDV LLVLDMQDF+VILGM+WLSAN A+IDCF K+VV       SFKF+GAGIVCIPKVISAMKASKLLSQGTW ILASVVD REPEVSLS EPVVREY
Subjt:  MLDVILLVLDMQDFNVILGMNWLSANPASIDCFRKKVVL-----TSFKFKGAGIVCIPKVISAMKASKLLSQGTWSILASVVDTREPEVSLSFEPVVREY

Query:  PDVFPNELLRLSPPRKIDFAIELEPGTASISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKNDGSMRLCINYRELNKVTVKNRYPLP
        PDVFP+EL  L PPR++DFAIELEPGTA ISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKK DGSMRLCI+YRELNKVT+KNRYPLP
Subjt:  PDVFPNELLRLSPPRKIDFAIELEPGTASISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKNDGSMRLCINYRELNKVTVKNRYPLP

Query:  RIDDLFDQLQGATIFSKIDLQ
        RIDDLFDQLQGAT+FSKIDL+
Subjt:  RIDDLFDQLQGATIFSKIDLQ

A0A5A7UUX8 Reverse transcriptase | 1.1e-100 | 86.43
Query:  MLDVILLVLDMQDFNVILGMNWLSANPASIDCFRKKVVL-----TSFKFKGAGIVCIPKVISAMKASKLLSQGTWSILASVVDTREPEVSLSFEPVVREY
        MLDV LLVLDMQDF+VILGM+WLSAN A+IDCF K+VV       SFKF+GAG+VCIPKVISAMKASKLLSQGTW ILASVVD REPEVSLS EPVVREY
Subjt:  MLDVILLVLDMQDFNVILGMNWLSANPASIDCFRKKVVL-----TSFKFKGAGIVCIPKVISAMKASKLLSQGTWSILASVVDTREPEVSLSFEPVVREY

Query:  PDVFPNELLRLSPPRKIDFAIELEPGTASISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKNDGSMRLCINYRELNKVTVKNRYPLP
        PDVFP+EL  L PPR++DFAIELEPGTA ISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKK DGSMRLCI+YRELNKVTVKNRYPLP
Subjt:  PDVFPNELLRLSPPRKIDFAIELEPGTASISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKNDGSMRLCINYRELNKVTVKNRYPLP

Query:  RIDDLFDQLQGATIFSKIDLQ
        RIDDLFDQLQGAT+FSKIDL+
Subjt:  RIDDLFDQLQGATIFSKIDLQ

A0A5D3D6H5 DNA/RNA polymerases superfamily protein | 1.1e-100 | 87.78
Query:  MLDVILLVLDMQDFNVILGMNWLSANPASIDCFRKKVVL-----TSFKFKGAGIVCIPKVISAMKASKLLSQGTWSILASVVDTREPEVSLSFEPVVREY
        MLDV LLVLDM+DF+VILGM+WLSAN ASIDCFRK+VV      TSFKFKGAGIVC+PKVISAMKASKLLSQGTW+ILASVVDTRE EVSLS EPVVREY
Subjt:  MLDVILLVLDMQDFNVILGMNWLSANPASIDCFRKKVVL-----TSFKFKGAGIVCIPKVISAMKASKLLSQGTWSILASVVDTREPEVSLSFEPVVREY

Query:  PDVFPNELLRLSPPRKIDFAIELEPGTASISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKNDGSMRLCINYRELNKVTVKNRYPLP
        PDVFPNELL L PPR+IDFAIELEPGTA ISRAPYRMAPAELKELKVQLQELLD+GFIRPSVSPWGAPVLFVKK DGSMRLCI+YRELNKVTVKNRY LP
Subjt:  PDVFPNELLRLSPPRKIDFAIELEPGTASISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKNDGSMRLCINYRELNKVTVKNRYPLP

Query:  RIDDLFDQLQGATIFSKIDLQ
        RIDDLFDQLQ AT+FSKIDL+
Subjt:  RIDDLFDQLQGATIFSKIDLQ

SwissProt top hits | e value | % identity
P0CT34 Transposon Tf2-1 polyprotein | 4.0e-15 | 30.94
Query:  REPEVSLSFEPVVREYPDVF--PNELLRLSPPRKIDFAIELEPGTASISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKNDGSMRLC
        +EPE+      + +E+ D+    N      P + ++F +EL      +    Y + P +++ +  ++ + L  G IR S +    PV+FV K +G++R+ 
Subjt:  REPEVSLSFEPVVREYPDVF--PNELLRLSPPRKIDFAIELEPGTASISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKNDGSMRLC

Query:  INYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLQ
        ++Y+ LNK    N YPLP I+ L  ++QG+TIF+K+DL+
Subjt:  INYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLQ

P0CT41 Transposon Tf2-12 polyprotein | 4.0e-15 | 30.94
Query:  REPEVSLSFEPVVREYPDVF--PNELLRLSPPRKIDFAIELEPGTASISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKNDGSMRLC
        +EPE+      + +E+ D+    N      P + ++F +EL      +    Y + P +++ +  ++ + L  G IR S +    PV+FV K +G++R+ 
Subjt:  REPEVSLSFEPVVREYPDVF--PNELLRLSPPRKIDFAIELEPGTASISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKNDGSMRLC

Query:  INYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLQ
        ++Y+ LNK    N YPLP I+ L  ++QG+TIF+K+DL+
Subjt:  INYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLQ

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein | 1.1e-20 | 35.47
Query:  KASKLLSQGTWSILASVVDTREPEV-------SLSFEPV--VREYPDVFPNELLRLSPPRKIDF-------AIELEPGTASISRAPYRMAPAELKELKVQ
        +AS L   G +S + S + + EP         +    PV   ++Y ++  N+L    PPR  D         IE++PG       PY +     +E+   
Subjt:  KASKLLSQGTWSILASVVDTREPEV-------SLSFEPV--VREYPDVFPNELLRLSPPRKIDF-------AIELEPGTASISRAPYRMAPAELKELKVQ

Query:  LQELLDKGFIRPSVSPWGAPVLFVKKNDGSMRLCINYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDL
        +Q+LLD  FI PS SP  +PV+ V K DG+ RLC++YR LNK T+ + +PLPRID+L  ++  A IF+ +DL
Subjt:  LQELLDKGFIRPSVSPWGAPVLFVKKNDGSMRLCINYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDL

Q99315 Transposon Ty3-G Gag-Pol polyprotein | 1.1e-20 | 35.47
Query:  KASKLLSQGTWSILASVVDTREPEV-------SLSFEPV--VREYPDVFPNELLRLSPPRKIDF-------AIELEPGTASISRAPYRMAPAELKELKVQ
        +AS L   G +S + S + + EP         +    PV   ++Y ++  N+L    PPR  D         IE++PG       PY +     +E+   
Subjt:  KASKLLSQGTWSILASVVDTREPEV-------SLSFEPV--VREYPDVFPNELLRLSPPRKIDF-------AIELEPGTASISRAPYRMAPAELKELKVQ

Query:  LQELLDKGFIRPSVSPWGAPVLFVKKNDGSMRLCINYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDL
        +Q+LLD  FI PS SP  +PV+ V K DG+ RLC++YR LNK T+ + +PLPRID+L  ++  A IF+ +DL
Subjt:  LQELLDKGFIRPSVSPWGAPVLFVKKNDGSMRLCINYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDL

Q9UR07 Transposon Tf2-11 polyprotein | 4.0e-15 | 30.94
Query:  REPEVSLSFEPVVREYPDVF--PNELLRLSPPRKIDFAIELEPGTASISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKNDGSMRLC
        +EPE+      + +E+ D+    N      P + ++F +EL      +    Y + P +++ +  ++ + L  G IR S +    PV+FV K +G++R+ 
Subjt:  REPEVSLSFEPVVREYPDVF--PNELLRLSPPRKIDFAIELEPGTASISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKNDGSMRLC

Query:  INYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLQ
        ++Y+ LNK    N YPLP I+ L  ++QG+TIF+K+DL+
Subjt:  INYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLQ

Arabidopsis top hits | e value | % identity
No hits found

Sequences
CDS sequence
ATGTTAGATGTGATCTTACTAGTGTTAGACATGCAGGATTTCAATGTAATCCTAGGCATGAATTGGCTGTCTGCTAACCCTGCAAGTATAGACTGCTTCCGTAAGAAAGT
CGTTTTGACTAGTTTCAAATTTAAGGGGGCAGGAATCGTATGTATACCCAAGGTCATCTCAGCCATGAAGGCTAGTAAACTACTCAGCCAGGGTACCTGGAGCATCTTGG
CAAGCGTAGTAGATACCAGAGAACCAGAAGTTTCCCTGTCCTTCGAACCAGTAGTAAGGGAGTACCCCGATGTTTTCCCCAACGAGCTTCTAAGACTTTCGCCTCCCAGG
AAGATAGACTTCGCCATCGAGTTAGAGCCAGGCACTGCTTCTATCTCGAGGGCCCCTTATAGAATGGCTCCAGCTGAGCTAAAGGAGCTGAAGGTGCAGTTGCAGGAGTT
ACTGGACAAGGGTTTTATTCGACCCAGTGTGTCACCTTGGGGAGCACCAGTGTTGTTTGTGAAGAAGAATGATGGGTCGATGCGCCTTTGCATTAACTACAGGGAGCTGA
ACAAGGTGACAGTTAAGAATCGCTATCCCTTGCCCAGGATTGATGATTTGTTCGATCAGTTGCAAGGAGCCACCATCTTTTCTAAGATCGACCTGCAATAA
mRNA sequence
ATGTTAGATGTGATCTTACTAGTGTTAGACATGCAGGATTTCAATGTAATCCTAGGCATGAATTGGCTGTCTGCTAACCCTGCAAGTATAGACTGCTTCCGTAAGAAAGT
CGTTTTGACTAGTTTCAAATTTAAGGGGGCAGGAATCGTATGTATACCCAAGGTCATCTCAGCCATGAAGGCTAGTAAACTACTCAGCCAGGGTACCTGGAGCATCTTGG
CAAGCGTAGTAGATACCAGAGAACCAGAAGTTTCCCTGTCCTTCGAACCAGTAGTAAGGGAGTACCCCGATGTTTTCCCCAACGAGCTTCTAAGACTTTCGCCTCCCAGG
AAGATAGACTTCGCCATCGAGTTAGAGCCAGGCACTGCTTCTATCTCGAGGGCCCCTTATAGAATGGCTCCAGCTGAGCTAAAGGAGCTGAAGGTGCAGTTGCAGGAGTT
ACTGGACAAGGGTTTTATTCGACCCAGTGTGTCACCTTGGGGAGCACCAGTGTTGTTTGTGAAGAAGAATGATGGGTCGATGCGCCTTTGCATTAACTACAGGGAGCTGA
ACAAGGTGACAGTTAAGAATCGCTATCCCTTGCCCAGGATTGATGATTTGTTCGATCAGTTGCAAGGAGCCACCATCTTTTCTAAGATCGACCTGCAATAA
Protein sequence
MLDVILLVLDMQDFNVILGMNWLSANPASIDCFRKKVVLTSFKFKGAGIVCIPKVISAMKASKLLSQGTWSILASVVDTREPEVSLSFEPVVREYPDVFPNELLRLSPPR
KIDFAIELEPGTASISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKNDGSMRLCINYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLQ
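
The protein sequence above should correspond to the in-frame translation of the CDS. The following is a minimal sketch for checking that, assuming the standard nuclear genetic code (NCBI translation table 1) and a CDS that starts in frame at position 1. `translate` and `CODON_TABLE` are illustrative names, not part of any CuGenDB tool.

```python
from itertools import product

# Standard genetic code in the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 24 and last 12 nucleotides of the CDS listed in this record.
print(translate("ATGTTAGATGTGATCTTACTAGTG"))
print(translate("GACCTGCAATAA"))
```

To check the whole record, join the CDS lines into one string, translate it, and compare the result with the joined protein lines.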