; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g03760 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g03760
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase
Genome locationchr1:2390661..2392519
RNA-Seq ExpressionMoc01g03760
SyntenyMoc01g03760
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
InterPro domainsIPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0042379.1 reverse transcriptase [Cucumis melo var. makuwa]2.6e-3928.01Show/hide
Query:  MCKEAHRVSDCPHRAALKALQATRANGPEVETKTIETKVPTEEPTDKPYMGALKFMSALQRKVGE-----------------------------------
        +CK  H   +CP++ A  A QA+ A+    +    E +    E  D P MGALKF+S+LQ+KVGE                                   
Subjt:  MCKEAHRVSDCPHRAALKALQATRANGPEVETKTIETKVPTEEPTDKPYMGALKFMSALQRKVGE-----------------------------------

Query:  ----AKEPNCKIE-------------------------------GSLNFVIVRMDDFNVVLGMEFLLEHKVIPMPLAKCLVVTGSDPTVVLTNIKQLSGV
            AK  N + E                               G ++FV+V+MDDF+VVLGMEFLLEH+VIPMPLAKCLV+TG  P+VV T+++Q  G+
Subjt:  ----AKEPNCKIE-------------------------------GSLNFVIVRMDDFNVVLGMEFLLEHKVIPMPLAKCLVVTGSDPTVVLTNIKQLSGV

Query:  KMISALHLKRGLAHDEPTFMAIPVIEEESSKEVIPR----------------------------------------------------------------
        KMISA+ LK+GL+ DEPTFMAIP+   E+S+E +P+                                                                
Subjt:  KMISALHLKRGLAHDEPTFMAIPVIEEESSKEVIPR----------------------------------------------------------------

Query:  ------------------------------------------------------------------------KTEKSRLF--------------------
                                                                                K ++++L+                    
Subjt:  ------------------------------------------------------------------------KTEKSRLF--------------------

Query:  -------KIGKMAV-------------------------------------------------------AFDKLKKAMMEGPVLGITDVTKSFEAETDAL
               + GK+AV                                                       AF+ LK+AMMEGP+LGI DVTK FE ETDA 
Subjt:  -------KIGKMAV-------------------------------------------------------AFDKLKKAMMEGPVLGITDVTKSFEAETDAL

Query:  DFALGGVLLQDGHPVAYESRKLNDAEKRYVASEKEMLAV
        D+ALGGVLLQ+GHP+AYESRKLN AE+ Y  SEKEMLAV
Subjt:  DFALGGVLLQDGHPVAYESRKLNDAEKRYVASEKEMLAV

KAA0045292.1 reverse transcriptase [Cucumis melo var. makuwa]7.9e-3651.55Show/hide
Query:  DCPHRAALKALQATRANGPEVETKTIETKVPTEEPTDKPYMGALKFMSALQRKVGEAKEPNCK------IEGSLNFVIVRMDDFNVVLGMEFLLEHKVIP
        +CP++ A  A QA+ A+    +    E +V   E  D P MGALKF+S+LQ+KVGE   P  +        G ++FV+V+MDDF+VVLGMEFLLEH+VIP
Subjt:  DCPHRAALKALQATRANGPEVETKTIETKVPTEEPTDKPYMGALKFMSALQRKVGEAKEPNCK------IEGSLNFVIVRMDDFNVVLGMEFLLEHKVIP

Query:  MPLAKCLVVTGSDPTVVLTNIKQLSGVKMISALHLKRGLAHDEPTFMAIPVIEEESSKEVI
        MPLAKCLV+TGS P+VV T+++Q  G+KMISA+ LK+GL+ +EPTFM IP+   ++S E++
Subjt:  MPLAKCLVVTGSDPTVVLTNIKQLSGVKMISALHLKRGLAHDEPTFMAIPVIEEESSKEVI

XP_022155185.1 uncharacterized protein LOC111022320 [Momordica charantia]9.0e-4829.58Show/hide
Query:  MCKEAHRVSDCPHRAALKALQATRANGPEVETKTIETKVPTEEPTDKPYMGALKFMSALQRKVGEAKEP-------------------------------
        +CK  HRV +CP+RAAL+A QAT  N   +E +  ET  P E+  D P MGALKF+SALQ+K  E KEP                               
Subjt:  MCKEAHRVSDCPHRAALKALQATRANGPEVETKTIETKVPTEEPTDKPYMGALKFMSALQRKVGEAKEP-------------------------------

Query:  ------------------------------------NCKI---EGSLNFVIVRMDDFNVVLGMEFLLEHKVIPMPLAKCLVVTGSDPTVVLTNIKQLSGV
                                            + K+    G ++FVIVRMDDF+VVLG++FLLEHKVIPMPLAKCLVVT SDP VV T+IKQ SGV
Subjt:  ------------------------------------NCKI---EGSLNFVIVRMDDFNVVLGMEFLLEHKVIPMPLAKCLVVTGSDPTVVLTNIKQLSGV

Query:  KMISALHLKRGLAHDEPTFMAIPVIEEESSKEVIPRKTEK------------------------------------------------------------
        KMISAL LK+G+A DEPTFMAIPV E  +S+E++PR+ ++                                                            
Subjt:  KMISALHLKRGLAHDEPTFMAIPVIEEESSKEVIPRKTEK------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------SRLFKIGKMAV---------------------------------------------------------
                                          + + G++ +                                                         
Subjt:  --------------------------------SRLFKIGKMAV---------------------------------------------------------

Query:  -----AFDKLKKAMMEGPVLGITDVTKSFEAETDALDFALGGVLLQDGHPVAYESRKLNDAEKRYVASEKEMLAVVHC
             AF+ LKKAMMEG VLGI DVT+ FE ETDA DFALGGVLLQDGHP+AYES+KLNDAE+RY ASEKEMLAVVHC
Subjt:  -----AFDKLKKAMMEGPVLGITDVTKSFEAETDALDFALGGVLLQDGHPVAYESRKLNDAEKRYVASEKEMLAVVHC

XP_022972954.1 uncharacterized protein LOC111471473 [Cucurbita maxima]1.0e-3853.33Show/hide
Query:  MCKEAHRVSDCPHRAALKALQATRANGPEVETKTIETKVPTEEPTDKPYMGALKFMSALQRKVGEAKEPNCKIE-------------GSLNFVIVRMDDF
        +CK  H+VS CPHRA+L ALQ +     E     IET +  +E  D P MGALKF+SALQRKV    EP   +E             G L+ V+ RMDDF
Subjt:  MCKEAHRVSDCPHRAALKALQATRANGPEVETKTIETKVPTEEPTDKPYMGALKFMSALQRKVGEAKEPNCKIE-------------GSLNFVIVRMDDF

Query:  NVVLGMEFLLEHKVIPMPLAKCLVVTGSDPTVVLTNIKQLSGVKMISALHLKRGLAHDEPTFMAIPVIEEESSKEVIPRK
        +VVLGMEFLLEHKVIPMPLAKCLV+T S+PTV+  +IKQ   ++MISA+ LKRGLA +EPTFM IP++E  +++E +P +
Subjt:  NVVLGMEFLLEHKVIPMPLAKCLVVTGSDPTVVLTNIKQLSGVKMISALHLKRGLAHDEPTFMAIPVIEEESSKEVIPRK

XP_022975176.1 uncharacterized protein LOC111474215 [Cucurbita maxima]1.3e-4129.63Show/hide
Query:  MCKEAHRVSDCPHRAALKALQATRANGPEVETKTIETKVPTEEPTDKPYMGALKFMSALQRKVGEAKEPNCKIE-------------GSLNFVIVRMDDF
        +CK  H+VS CPHRA+L ALQ +     + +   +ET +  +E  D P MGALKF+SALQRKV    EP   +E             G L+ V+ RMDDF
Subjt:  MCKEAHRVSDCPHRAALKALQATRANGPEVETKTIETKVPTEEPTDKPYMGALKFMSALQRKVGEAKEPNCKIE-------------GSLNFVIVRMDDF

Query:  NVVLGMEFLLEHKVIPMPLAKCLVVTGSDPTVVLTNIKQLSGVKMISALHLKRGLAHDEPTFMAIPVIEEESSKEVIPR---------------------
        +VVLGMEFLLEHKVIPMPLAKCLVVT  +PTV+  +IKQ   ++MIS + LKRGLA +E TFMAIP++E  +++E +P                      
Subjt:  NVVLGMEFLLEHKVIPMPLAKCLVVTGSDPTVVLTNIKQLSGVKMISALHLKRGLAHDEPTFMAIPVIEEESSKEVIPR---------------------

Query:  -------------------KTEKSRLFKIG---------------------------------KMAV---------------------------------
                           + +   L K G                                 K+ V                                 
Subjt:  -------------------KTEKSRLFKIG---------------------------------KMAV---------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------AFDKLKKAMMEGPVLGITDVTKSFEAETDALDFALGGVLLQDGHPVAYESRKLNDAEKRY
                                                AF+ LK  MM GPVLG+ DVTK FE ETDA DFALGGVL+Q+GHP+A+ESRKLNDAE+RY
Subjt:  ----------------------------------------AFDKLKKAMMEGPVLGITDVTKSFEAETDALDFALGGVLLQDGHPVAYESRKLNDAEKRY

Query:  VASEKEMLAVVHC
        + SEK+ML VVHC
Subjt:  VASEKEMLAVVHC

TrEMBL top hitse value%identityAlignment
A0A5D3CW71 Reverse transcriptase1.3e-3928.01Show/hide
Query:  MCKEAHRVSDCPHRAALKALQATRANGPEVETKTIETKVPTEEPTDKPYMGALKFMSALQRKVGE-----------------------------------
        +CK  H   +CP++ A  A QA+ A+    +    E +    E  D P MGALKF+S+LQ+KVGE                                   
Subjt:  MCKEAHRVSDCPHRAALKALQATRANGPEVETKTIETKVPTEEPTDKPYMGALKFMSALQRKVGE-----------------------------------

Query:  ----AKEPNCKIE-------------------------------GSLNFVIVRMDDFNVVLGMEFLLEHKVIPMPLAKCLVVTGSDPTVVLTNIKQLSGV
            AK  N + E                               G ++FV+V+MDDF+VVLGMEFLLEH+VIPMPLAKCLV+TG  P+VV T+++Q  G+
Subjt:  ----AKEPNCKIE-------------------------------GSLNFVIVRMDDFNVVLGMEFLLEHKVIPMPLAKCLVVTGSDPTVVLTNIKQLSGV

Query:  KMISALHLKRGLAHDEPTFMAIPVIEEESSKEVIPR----------------------------------------------------------------
        KMISA+ LK+GL+ DEPTFMAIP+   E+S+E +P+                                                                
Subjt:  KMISALHLKRGLAHDEPTFMAIPVIEEESSKEVIPR----------------------------------------------------------------

Query:  ------------------------------------------------------------------------KTEKSRLF--------------------
                                                                                K ++++L+                    
Subjt:  ------------------------------------------------------------------------KTEKSRLF--------------------

Query:  -------KIGKMAV-------------------------------------------------------AFDKLKKAMMEGPVLGITDVTKSFEAETDAL
               + GK+AV                                                       AF+ LK+AMMEGP+LGI DVTK FE ETDA 
Subjt:  -------KIGKMAV-------------------------------------------------------AFDKLKKAMMEGPVLGITDVTKSFEAETDAL

Query:  DFALGGVLLQDGHPVAYESRKLNDAEKRYVASEKEMLAV
        D+ALGGVLLQ+GHP+AYESRKLN AE+ Y  SEKEMLAV
Subjt:  DFALGGVLLQDGHPVAYESRKLNDAEKRYVASEKEMLAV

A0A6J1DLQ6 uncharacterized protein LOC1110223204.4e-4829.58Show/hide
Query:  MCKEAHRVSDCPHRAALKALQATRANGPEVETKTIETKVPTEEPTDKPYMGALKFMSALQRKVGEAKEP-------------------------------
        +CK  HRV +CP+RAAL+A QAT  N   +E +  ET  P E+  D P MGALKF+SALQ+K  E KEP                               
Subjt:  MCKEAHRVSDCPHRAALKALQATRANGPEVETKTIETKVPTEEPTDKPYMGALKFMSALQRKVGEAKEP-------------------------------

Query:  ------------------------------------NCKI---EGSLNFVIVRMDDFNVVLGMEFLLEHKVIPMPLAKCLVVTGSDPTVVLTNIKQLSGV
                                            + K+    G ++FVIVRMDDF+VVLG++FLLEHKVIPMPLAKCLVVT SDP VV T+IKQ SGV
Subjt:  ------------------------------------NCKI---EGSLNFVIVRMDDFNVVLGMEFLLEHKVIPMPLAKCLVVTGSDPTVVLTNIKQLSGV

Query:  KMISALHLKRGLAHDEPTFMAIPVIEEESSKEVIPRKTEK------------------------------------------------------------
        KMISAL LK+G+A DEPTFMAIPV E  +S+E++PR+ ++                                                            
Subjt:  KMISALHLKRGLAHDEPTFMAIPVIEEESSKEVIPRKTEK------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------SRLFKIGKMAV---------------------------------------------------------
                                          + + G++ +                                                         
Subjt:  --------------------------------SRLFKIGKMAV---------------------------------------------------------

Query:  -----AFDKLKKAMMEGPVLGITDVTKSFEAETDALDFALGGVLLQDGHPVAYESRKLNDAEKRYVASEKEMLAVVHC
             AF+ LKKAMMEG VLGI DVT+ FE ETDA DFALGGVLLQDGHP+AYES+KLNDAE+RY ASEKEMLAVVHC
Subjt:  -----AFDKLKKAMMEGPVLGITDVTKSFEAETDALDFALGGVLLQDGHPVAYESRKLNDAEKRYVASEKEMLAVVHC

A0A6J1ID35 uncharacterized protein LOC1114714734.8e-3953.33Show/hide
Query:  MCKEAHRVSDCPHRAALKALQATRANGPEVETKTIETKVPTEEPTDKPYMGALKFMSALQRKVGEAKEPNCKIE-------------GSLNFVIVRMDDF
        +CK  H+VS CPHRA+L ALQ +     E     IET +  +E  D P MGALKF+SALQRKV    EP   +E             G L+ V+ RMDDF
Subjt:  MCKEAHRVSDCPHRAALKALQATRANGPEVETKTIETKVPTEEPTDKPYMGALKFMSALQRKVGEAKEPNCKIE-------------GSLNFVIVRMDDF

Query:  NVVLGMEFLLEHKVIPMPLAKCLVVTGSDPTVVLTNIKQLSGVKMISALHLKRGLAHDEPTFMAIPVIEEESSKEVIPRK
        +VVLGMEFLLEHKVIPMPLAKCLV+T S+PTV+  +IKQ   ++MISA+ LKRGLA +EPTFM IP++E  +++E +P +
Subjt:  NVVLGMEFLLEHKVIPMPLAKCLVVTGSDPTVVLTNIKQLSGVKMISALHLKRGLAHDEPTFMAIPVIEEESSKEVIPRK

A0A6J1IDF7 uncharacterized protein LOC1114742156.1e-4229.63Show/hide
Query:  MCKEAHRVSDCPHRAALKALQATRANGPEVETKTIETKVPTEEPTDKPYMGALKFMSALQRKVGEAKEPNCKIE-------------GSLNFVIVRMDDF
        +CK  H+VS CPHRA+L ALQ +     + +   +ET +  +E  D P MGALKF+SALQRKV    EP   +E             G L+ V+ RMDDF
Subjt:  MCKEAHRVSDCPHRAALKALQATRANGPEVETKTIETKVPTEEPTDKPYMGALKFMSALQRKVGEAKEPNCKIE-------------GSLNFVIVRMDDF

Query:  NVVLGMEFLLEHKVIPMPLAKCLVVTGSDPTVVLTNIKQLSGVKMISALHLKRGLAHDEPTFMAIPVIEEESSKEVIPR---------------------
        +VVLGMEFLLEHKVIPMPLAKCLVVT  +PTV+  +IKQ   ++MIS + LKRGLA +E TFMAIP++E  +++E +P                      
Subjt:  NVVLGMEFLLEHKVIPMPLAKCLVVTGSDPTVVLTNIKQLSGVKMISALHLKRGLAHDEPTFMAIPVIEEESSKEVIPR---------------------

Query:  -------------------KTEKSRLFKIG---------------------------------KMAV---------------------------------
                           + +   L K G                                 K+ V                                 
Subjt:  -------------------KTEKSRLFKIG---------------------------------KMAV---------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------AFDKLKKAMMEGPVLGITDVTKSFEAETDALDFALGGVLLQDGHPVAYESRKLNDAEKRY
                                                AF+ LK  MM GPVLG+ DVTK FE ETDA DFALGGVL+Q+GHP+A+ESRKLNDAE+RY
Subjt:  ----------------------------------------AFDKLKKAMMEGPVLGITDVTKSFEAETDALDFALGGVLLQDGHPVAYESRKLNDAEKRY

Query:  VASEKEMLAVVHC
        + SEK+ML VVHC
Subjt:  VASEKEMLAVVHC

A0A803N8Q7 Uncharacterized protein2.9e-3640.71Show/hide
Query:  GALKFMSALQRKV-GEAKEPNCKI---EGSLNFVIVRMDDFNVVLGMEFLLEHKVIPMPLAKCLVVTGSDPTVVLT-NIKQLSGVKMISALHLKRGLAHD
        G++K +++  + V G A+    K+    G+L+F +  MDDFN+VLGM+FL   K +PMP    ++V G  P ++ T  +++ S   ++SA+ LK+GL  +
Subjt:  GALKFMSALQRKV-GEAKEPNCKI---EGSLNFVIVRMDDFNVVLGMEFLLEHKVIPMPLAKCLVVTGSDPTVVLT-NIKQLSGVKMISALHLKRGLAHD

Query:  EPTFMAIPVIEEESSKEVIPRKTEK-----------------SRLFKIGKMAVAFDKLKKAMMEGPVLGITDVTKSFEAETDALDFALGGVLLQDGHPVA
        EPTF+A   ++E+ +   IP   +K                  R     K   AF KLK +++  PVL + +  K FE ETDA DF+LGGVLLQDGHP+A
Subjt:  EPTFMAIPVIEEESSKEVIPRKTEK-----------------SRLFKIGKMAVAFDKLKKAMMEGPVLGITDVTKSFEAETDALDFALGGVLLQDGHPVA

Query:  YESRKLNDAEKRYVASEKEMLAVVHC
        +ESRKLN  E++Y A EKE+LA+VHC
Subjt:  YESRKLNDAEKRYVASEKEMLAVVHC

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.62.6e-1354.93Show/hide
Query:  AFDKLKKAMMEGPVLGITDVTKSFEAETDALDFALGGVLLQDGHPVAYESRKLNDAEKRYVASEKEMLAVV
        AF KLK  + E P+L + D TK F   TDA D ALG VL QDGHP++Y SR LN+ E  Y   EKE+LA+V
Subjt:  AFDKLKKAMMEGPVLGITDVTKSFEAETDALDFALGGVLLQDGHPVAYESRKLNDAEKRYVASEKEMLAVV

P10273 Gag-Pol polyprotein3.7e-0439.73Show/hide
Query:  TEKSRLFKIG-KMAVAFDKLKKAMMEGPVLGITDVTKSFEAETDALDFALGGVLLQD----GHPVAYESRKLN
        T    LF+ G +  +AF+ +KKA++  P LG+ D+TK FE   D       GVL+Q       PVAY S+KL+
Subjt:  TEKSRLFKIG-KMAVAFDKLKKAMMEGPVLGITDVTKSFEAETDALDFALGGVLLQD----GHPVAYESRKLN

P10401 Retrovirus-related Pol polyprotein from transposon gypsy1.2e-0537.5Show/hide
Query:  AFDKLKKAM-MEGPVLGITDVTKSFEAETDALDFALGGVLLQDGHPVAYESRKLNDAEKRYVASEKEMLAVV
        AF +L+  +  E  +L   D  K F+  TDA    +G VL Q+G P+   SR L   E+ Y  +E+E+LA+V
Subjt:  AFDKLKKAM-MEGPVLGITDVTKSFEAETDALDFALGGVLLQDGHPVAYESRKLNDAEKRYVASEKEMLAVV

P20825 Retrovirus-related Pol polyprotein from transposon 2972.8e-1250.7Show/hide
Query:  AFDKLKKAMMEGPVLGITDVTKSFEAETDALDFALGGVLLQDGHPVAYESRKLNDAEKRYVASEKEMLAVV
        AF+KLK  ++  P+L + D  K F   TDA + ALG VL Q+GHP+++ SR LND E  Y A EKE+LA+V
Subjt:  AFDKLKKAMMEGPVLGITDVTKSFEAETDALDFALGGVLLQDGHPVAYESRKLNDAEKRYVASEKEMLAVV

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus9.5e-0841.33Show/hide
Query:  AFDKLKKAMMEGPVLGITDVTKSFEAETDALDFALGGVLLQD----GHPVAYESRKLNDAEKRYVASEKEMLAVV
        +F+ LK  +    +L     TK F   TDA ++A+G VL QD      P+AY SR LN  E+ Y   EKEMLA++
Subjt:  AFDKLKKAMMEGPVLGITDVTKSFEAETDALDFALGGVLLQD----GHPVAYESRKLNDAEKRYVASEKEMLAVV

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGTAAAGAGGCTCATCGCGTAAGCGACTGCCCGCATCGAGCTGCCCTGAAGGCACTCCAAGCTACAAGAGCCAATGGTCCTGAGGTCGAGACAAAAACTATCGAGAC
AAAGGTACCGACCGAAGAGCCTACTGACAAGCCCTACATGGGGGCACTGAAATTCATGTCGGCTCTCCAAAGGAAAGTAGGGGAAGCAAAAGAACCGAACTGCAAGATAG
AGGGCTCATTAAATTTTGTGATTGTTAGAATGGATGACTTCAACGTCGTGCTTGGAATGGAATTTCTTCTAGAGCACAAAGTCATCCCCATGCCCCTGGCAAAGTGTTTA
GTCGTTACTGGATCTGATCCCACCGTTGTTCTGACTAACATAAAACAACTAAGTGGGGTTAAGATGATATCAGCTTTACATTTGAAGAGGGGTCTTGCTCACGACGAACC
AACATTTATGGCCATCCCAGTGATCGAAGAAGAGAGCAGTAAAGAAGTGATTCCCAGGAAGACGGAAAAGTCAAGGCTATTCAAGATTGGAAAAATGGCGGTGGCTTTCG
ATAAACTGAAGAAAGCCATGATGGAAGGACCAGTGCTTGGTATCACAGATGTCACCAAGTCGTTCGAGGCAGAAACTGATGCCTTAGACTTTGCCCTAGGTGGTGTTCTC
CTTCAAGATGGACACCCAGTGGCGTACGAAAGCCGAAAGTTGAACGACGCAGAAAAGAGGTATGTAGCCTCCGAGAAAGAGATGTTGGCAGTAGTCCACTGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGTGTAAAGAGGCTCATCGCGTAAGCGACTGCCCGCATCGAGCTGCCCTGAAGGCACTCCAAGCTACAAGAGCCAATGGTCCTGAGGTCGAGACAAAAACTATCGAGAC
AAAGGTACCGACCGAAGAGCCTACTGACAAGCCCTACATGGGGGCACTGAAATTCATGTCGGCTCTCCAAAGGAAAGTAGGGGAAGCAAAAGAACCGAACTGCAAGATAG
AGGGCTCATTAAATTTTGTGATTGTTAGAATGGATGACTTCAACGTCGTGCTTGGAATGGAATTTCTTCTAGAGCACAAAGTCATCCCCATGCCCCTGGCAAAGTGTTTA
GTCGTTACTGGATCTGATCCCACCGTTGTTCTGACTAACATAAAACAACTAAGTGGGGTTAAGATGATATCAGCTTTACATTTGAAGAGGGGTCTTGCTCACGACGAACC
AACATTTATGGCCATCCCAGTGATCGAAGAAGAGAGCAGTAAAGAAGTGATTCCCAGGAAGACGGAAAAGTCAAGGCTATTCAAGATTGGAAAAATGGCGGTGGCTTTCG
ATAAACTGAAGAAAGCCATGATGGAAGGACCAGTGCTTGGTATCACAGATGTCACCAAGTCGTTCGAGGCAGAAACTGATGCCTTAGACTTTGCCCTAGGTGGTGTTCTC
CTTCAAGATGGACACCCAGTGGCGTACGAAAGCCGAAAGTTGAACGACGCAGAAAAGAGGTATGTAGCCTCCGAGAAAGAGATGTTGGCAGTAGTCCACTGCTGA
Protein sequenceShow/hide protein sequence
MCKEAHRVSDCPHRAALKALQATRANGPEVETKTIETKVPTEEPTDKPYMGALKFMSALQRKVGEAKEPNCKIEGSLNFVIVRMDDFNVVLGMEFLLEHKVIPMPLAKCL
VVTGSDPTVVLTNIKQLSGVKMISALHLKRGLAHDEPTFMAIPVIEEESSKEVIPRKTEKSRLFKIGKMAVAFDKLKKAMMEGPVLGITDVTKSFEAETDALDFALGGVL
LQDGHPVAYESRKLNDAEKRYVASEKEMLAVVHC