CuGenDBv2

Moc07g11370 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc07g11370
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Reverse transcriptase
Genome location: chr7:8783748..8787161
RNA-Seq Expression: Moc07g11370
Synteny: Moc07g11370
Gene Ontology terms: GO:0003824 - catalytic activity (molecular function)
InterPro domains:
  IPR005162 - Retrotransposon gag domain
  IPR021109 - Aspartic peptidase domain superfamily
  IPR041373 - Reverse transcriptase, RNase H-like domain
  IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (columns: e value, % identity, alignment)
TYK03099.1 reverse transcriptase [Cucumis melo var. makuwa] | e value: 1.2e-231 | % identity: 43.73
Query:  MAATKQLSKSHVDRLVEIEEQLLYLREVSDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKAM--RPGSFERGDSSTNPNTQIEVRM
        M+++  L K+  DRLVE+EEQ+LYL EV DS+R LE+R++E SEK   IDAV  RV+G PIQ++  RV+ LE+     R  ++ERGDSST     IE R+
Subjt:  MAATKQLSKSHVDRLVEIEEQLLYLREVSDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKAM--RPGSFERGDSSTNPNTQIEVRM

Query:  GELNNSHSAMMQLFNEMTEDLKA-----------------------PNQANMG----FNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK
         EL++S   ++++ N M+ED +A                        NQA  G     +++K+PEPKPF G RDAK LEN++FD+EQYF+AT T +EE K
Subjt:  GELNNSHSAMMQLFNEMTEDLKA-----------------------PNQANMG----FNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK

Query:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKSWA
        VTLATMHL++DAKLWWRS+  DIQ GRCTI++WD LK+ELR QFFP+NVE +ARRKLREL+HTG+IR+YVKQF+ +MLDIRDMSEKDKVF F+EGLK WA
Subjt:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKSWA

Query:  RTKLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQG------PNPGPS-RGPYPQSQNVQRPISCFLCKGPHRV
        +TKLYEQRVQDL +A A+AERL D S++    +++ ++ +GG++  +P +PK+ G D+R  G       N G S RG   Q+ +  RP+SCF+CKGPH  
Subjt:  RTKLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQG------PNPGPS-RGPYPQSQNVQRPISCFLCKGPHRV

Query:  AECPHRAALTALQASVQSCN---EPEVGTDCEKEEDEETPRMGALKFLSAIQKRVNGPKGTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEACRL
         ECP++ A  A QAS+ S +   + +   +  + E+ + PRMGALKFLS++QK+V       E+GLM+VD  IN  P KSTMVDSGATHNFI+E EA RL
Subjt:  AECPHRAALTALQASVQSCN---EPEVGTDCEKEEDEETPRMGALKFLSAIQKRVNGPKGTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEACRL

Query:  KLTIEKDTGKMKAVNSEALPIVGVSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAKCMIVTSNNPTVVTASIKQPGGIRMISALQL
         L  EKD G+MKAVNS ALPI+G+ KR  ++LG W+G VDFVVV+MDDFDVVLGMEFL+EH+VIPMPLAKC+++T   P+VV   ++QP G++MISA+QL
Subjt:  KLTIEKDTGKMKAVNSEALPIVGVSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAKCMIVTSNNPTVVTASIKQPGGIRMISALQL

Query:  KKGLNREEPTFMAI--------------------------------------SMVEQPVET------------RDVPPEI--------------------
        KKGL+R+EPTFMAI                                       M++  +E             R  PPE+                    
Subjt:  KKGLNREEPTFMAI--------------------------------------SMVEQPVET------------RDVPPEI--------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------QDAFEDLKAAMMKGPVLGLAHVTKPFEVETDASDYALGGVLLQDGHPIAYESRKLNNAERRYT
                                             Q AF+ LK A+M+GP+LG+A VTKPFEVETDASDYALGGVLLQ+GHPIAYESRKLN AERRYT
Subjt:  -------------------------------------QDAFEDLKAAMMKGPVLGLAHVTKPFEVETDASDYALGGVLLQDGHPIAYESRKLNNAERRYT

Query:  VSEKEMLAVVHCLRSWRQYLLGSYFVVKTDNSAICHFFNQPKLTSKQARWQELLAEFDFKFEHKVGKSNQAADALSRKGEHAALCMLAHIHASKVDGSIR
        VSEKEMLAVVHCLR+WRQYLLGS FVVKTDNSA CHFF QPKLTSKQARWQE LAEFDF+FEHK G SNQAADALSRK EHAA+C+LAH+  S++ GS+R
Subjt:  VSEKEMLAVVHCLRSWRQYLLGSYFVVKTDNSAICHFFNQPKLTSKQARWQELLAEFDFKFEHKVGKSNQAADALSRKGEHAALCMLAHIHASKVDGSIR

Query:  DLIREYLQK
        D +RE+LQK
Subjt:  DLIREYLQK

XP_022150099.1 uncharacterized protein LOC111018360 [Momordica charantia] | e value: 0.0e+00 | % identity: 66.24
Query:  MAATKQLSKSHVDRLVEIEEQLLYLREVSDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKAMRPGSFERGDSSTNPNTQIEVRMGE
        M+ TKQLSKSHVDRLVEIEEQLLYLREV D LRLLEARVDEFSEKFGEIDAVNAR+DGLPIQDIAMRVETLESKA RPGSFERGDSSTNPNTQIEVRMGE
Subjt:  MAATKQLSKSHVDRLVEIEEQLLYLREVSDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKAMRPGSFERGDSSTNPNTQIEVRMGE

Query:  LNNSHSAMMQLFNEMTEDLK---------------------------APNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT
        LNNSHSAMMQLFNEMTED K                           APNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT
Subjt:  LNNSHSAMMQLFNEMTEDLK---------------------------APNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT

Query:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKSWART
        LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVM+DIRDMSEKDKVFVFI+GLK WART
Subjt:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKSWART

Query:  KLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNVQRPISCFLCKGPHRVAECPHRAAL
        KLYEQRVQDLATAMA+AERLLDY+SEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQN QR +S FLCKGPHRVAECPHRAAL
Subjt:  KLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNVQRPISCFLCKGPHRVAECPHRAAL

Query:  TALQASVQSCNEPEVGTDCEKEEDEETPRMGALKFLSAIQKRVNGPKGTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEACRLKLTIEKDTGKMK
        TALQASVQSCNEPEV TDCEKEEDEETPRM ALKFLSAIQKRVNGPKGTSEKGLMFVDATINCN AKSTMVDSGATHNFISEQEA RLKLTIEKDTGKMK
Subjt:  TALQASVQSCNEPEVGTDCEKEEDEETPRMGALKFLSAIQKRVNGPKGTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEACRLKLTIEKDTGKMK

Query:  AVNSEALPIVGVSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAKCMIVTSNNPTVVTASIKQPGGIRMISALQLKKGLNREEPTFM
         VN EALPIVGVSKRV LKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAKCMIVTSN+PTVVT SIKQPGGIRMISALQLKKGLNREEPTFM
Subjt:  AVNSEALPIVGVSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAKCMIVTSNNPTVVTASIKQPGGIRMISALQLKKGLNREEPTFM

Query:  AISMVEQPVETRDVPPEI----------------------------------------------------------------------------------
        AI MVEQPVETRDVPPEI                                                                                  
Subjt:  AISMVEQPVETRDVPPEI----------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------QDAFEDLKAAMMKGPVLGLAHVTKPFEVETDASDYALGGVLLQDGHPIAYESRKLNNAERRYTVSEKEMLAVVHC
                                 QDAFEDLKAAMMKGPVLGLA VTKPFEVETDASDYALGGVLLQD HPI YESRKLNNAERRYTVSEKEMLAVVHC
Subjt:  -------------------------QDAFEDLKAAMMKGPVLGLAHVTKPFEVETDASDYALGGVLLQDGHPIAYESRKLNNAERRYTVSEKEMLAVVHC

Query:  LRSWRQYLLGSYFVVKTDNSAICHFFNQPKLTSKQARWQELLAEFDFKFEHKVGKSNQAADALSRKGEHAALCMLAHIHASKVDGSIRDLIREYLQKTP
        LRSWRQYLLGS FVVKTDNSAICHFFNQPKLTSKQARWQELLAEFDFKFEHK GKSNQAADALSRKGEHA LCMLAHIH SK DGSIRDLI EYLQ  P
Subjt:  LRSWRQYLLGSYFVVKTDNSAICHFFNQPKLTSKQARWQELLAEFDFKFEHKVGKSNQAADALSRKGEHAALCMLAHIHASKVDGSIRDLIREYLQKTP

XP_022154605.1 uncharacterized protein LOC111021829 [Momordica charantia] | e value: 0.0e+00 | % identity: 63.42
Query:  MAATKQLSKSHVDRLVEIEEQLLYLREVSDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKAMRPGSFERGDSSTNPNTQIEVRMGE
        M+ATKQLSKSHVDRLVEIEEQLLYLREV DSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVET ESKA RPGSFERGDSSTNPNTQIEVRMGE
Subjt:  MAATKQLSKSHVDRLVEIEEQLLYLREVSDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKAMRPGSFERGDSSTNPNTQIEVRMGE

Query:  LNNSHSAMMQLFNEMTEDLK---------------------------APNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT
        LNNSHS MMQLFNEMTED K                           APNQ NMGFNKLKVPEPKPFNGNR  KDLENF FDVEQYFK TGT SE MKVT
Subjt:  LNNSHSAMMQLFNEMTEDLK---------------------------APNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT

Query:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKSWART
        LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFF DNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLK WART
Subjt:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKSWART

Query:  KLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNVQRPISCFLCKGPHRVAECPHRAAL
        KLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPK GGADKRP GPNPGPSRGPYPQSQN QRP SCFLC+GPH+VAECPHRAAL
Subjt:  KLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNVQRPISCFLCKGPHRVAECPHRAAL

Query:  TALQASVQSCNEPEVGTDCEKEEDEETPRMGALKFLSAIQKRVNGPKGTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEACRLKLTIEKDTGKMK
        TALQASVQSCNEPEVGTDCEKEEDEETPRMGALKFLSAIQKRVNGPKGTSEKGLMFVDA INCNPAKS MVDSGATHNFISEQEA RLKLTIEKDTGKMK
Subjt:  TALQASVQSCNEPEVGTDCEKEEDEETPRMGALKFLSAIQKRVNGPKGTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEACRLKLTIEKDTGKMK

Query:  AVNSEALPIVGVSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAKCMIVTSNNPTVVTASIKQPGGIRMISALQLKKGLNREEPTFM
        AVNSEALPIVGVSKRVTLKLGTWTGS DFVVVRMDDFDVVLGMEFLIEHKVIPMPLAKCMIVTSN+PTVVTASIKQPGGIRMISALQLKKGLNREEPTFM
Subjt:  AVNSEALPIVGVSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAKCMIVTSNNPTVVTASIKQPGGIRMISALQLKKGLNREEPTFM

Query:  AISMVEQPVETRDVPPEI----------------------------------------------------------------------------------
              QPVETRDVPPEI                                                                                  
Subjt:  AISMVEQPVETRDVPPEI----------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------QDAFEDLKAAMMKGPVLGLAHVTKPFEVETDASDYALGGVLLQDGHPIAYESRKLNNAERRYTVSEKEMLAVVHC
                                 QDAFEDLKAAMMKGPVLGLA VTKPFEVETDASDYALGGVLLQD HPIAYESRKLNNAERRYTVSEKEMLAVVHC
Subjt:  -------------------------QDAFEDLKAAMMKGPVLGLAHVTKPFEVETDASDYALGGVLLQDGHPIAYESRKLNNAERRYTVSEKEMLAVVHC

Query:  LRSWRQ
        LRSWRQ
Subjt:  LRSWRQ

XP_023524533.1 uncharacterized protein LOC111788429 [Cucurbita pepo subsp. pepo] | e value: 9.6e-237 | % identity: 43.57
Query:  MAATKQLSKSHVDRLVEIEEQLLYLREVSDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKAMRPGSFERGDSSTNPNTQIEVRMGE
        M+ TK   K+  DRL  IEE++L+L+EV D+LR LE RV E SEK  ++DA+  R+DGLPI ++  RV +LE +     S +  DS  +     E R  E
Subjt:  MAATKQLSKSHVDRLVEIEEQLLYLREVSDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKAMRPGSFERGDSSTNPNTQIEVRMGE

Query:  LNNSHSAMMQLFNEMTEDLKA---------------------------PNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT
         +   + MM LFN + ++ ++                             Q + G NKL+ P+P+ F GNRDAK+LENF+FDVEQYFKAT   +++ KVT
Subjt:  LNNSHSAMMQLFNEMTEDLKA---------------------------PNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT

Query:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKSWART
        +A M+L DDAKLWWR+KV DI++G CTI+SW+DLK+ELR QF P+N   +A  K+  L+HTG IRDYV+QFS +MLDIR  +EKDKVF FI GL+ WA+T
Subjt:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKSWART

Query:  KLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGG-------ADKRPQGPNPGPSRGPYPQSQNVQRPISCFLCKGPHRVAE
        K++E RVQ LA AMA AERL+D  +E    ++    P  G K ++P   ++G         D+ PQ    G SRGPY Q  +   P+ C LCKGPH+V+ 
Subjt:  KLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGG-------ADKRPQGPNPGPSRGPYPQSQNVQRPISCFLCKGPHRVAE

Query:  CPHRAALTALQASVQSCNEPEVGTDCEKEEDEETPRMGALKFLSAIQKRVNGPKGTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEACRLKLTIE
        CPHRA+LTALQ S+Q  NE  V T  +K+ED + PRMGALKFLSA+Q++V  PK   EKGLMFVDATIN   +KST++DSGATHNFI++QEA RL LTI 
Subjt:  CPHRAALTALQASVQSCNEPEVGTDCEKEEDEETPRMGALKFLSAIQKRVNGPKGTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEACRLKLTIE

Query:  KDTGKMKAVNSEALPIVGVSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAKCMIVTSNNPTVVTASIKQPGGIRMISALQLKKGLN
        KD GKMKAVNSEALPIVGVSK V  K+G WTG +D VVVRMDDFDVVLGMEFL+EHKVIPMPLAKC+++T  NPTV+ ASIKQPG +RMISA+QLK+GL 
Subjt:  KDTGKMKAVNSEALPIVGVSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAKCMIVTSNNPTVVTASIKQPGGIRMISALQLKKGLN

Query:  REEPTFMAISMVEQ-------PVETRDV-------------------------------------------PPEI-------------------------
        REEPTFMAI ++E+       P E ++V                                           PPE+                         
Subjt:  REEPTFMAISMVEQ-------PVETRDV-------------------------------------------PPEI-------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------QDAFEDLKAAMMKGPVLGLAHVTKPFEVETDASDYALGGVLLQDGHPIAYESRKLNNAERRYTVSEKE
                                        Q AFE+LK  M +GPVLGL  VTKPFEVETDASD+ALGGVL+Q+GHPIAYESRKLN+AERRYTVSEKE
Subjt:  --------------------------------QDAFEDLKAAMMKGPVLGLAHVTKPFEVETDASDYALGGVLLQDGHPIAYESRKLNNAERRYTVSEKE

Query:  MLAVVHCLRSWRQYLLGSYFVVKTDNSAICHFFNQPKLTSKQARWQELLAEFDFKFEHKVGKSNQAADALSRKGEHAALCMLAHIHASKVDGSIRDLIRE
        MLAVVHCLR WRQYLLGS FVVKTDNSA CHFF+QPKLT+KQARWQE LAEFDFKFEHK GKSNQAADALSRKGEHAALCMLAHIH+SK+DGS+RD+I+E
Subjt:  MLAVVHCLRSWRQYLLGSYFVVKTDNSAICHFFNQPKLTSKQARWQELLAEFDFKFEHKVGKSNQAADALSRKGEHAALCMLAHIHASKVDGSIRDLIRE

Query:  YLQKTPPPNLWSSWLRPGRPASSGLKG
        +L K P   +     + G+     ++G
Subjt:  YLQKTPPPNLWSSWLRPGRPASSGLKG

XP_023537907.1 uncharacterized protein LOC111798805 [Cucurbita pepo subsp. pepo] | e value: 7.3e-237 | % identity: 44.3
Query:  MAATKQLSKSHVDRLVEIEEQLLYLREVSDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKAMRPGSFERGDSSTNPNTQIEVRMGE
        M+ TK   K+  DRL  IEE++L+L+EV D+LR LE RV E SEK  ++DA+  R+DGLPI ++  RV +LE +     S +  DS  +     E R  E
Subjt:  MAATKQLSKSHVDRLVEIEEQLLYLREVSDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKAMRPGSFERGDSSTNPNTQIEVRMGE

Query:  LNNSHSAMMQLFNEMTEDLKA---------------------------PNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT
         +   + MM LFN + ++ ++                             Q + G NKL+ P+P+ F GNRDAK+LENF+FDVEQYFKAT   +++ KVT
Subjt:  LNNSHSAMMQLFNEMTEDLKA---------------------------PNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT

Query:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKSWART
        +A M+L DDAKLWWR+KV DI++G CTI+SW+DLK+ELR QF P+N   +A  K+  L+HTG IRDYV+QFS +MLDIR  +EKDKVF FI GL+ WA+T
Subjt:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKSWART

Query:  KLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGG-------ADKRPQGPNPGPSRGPYPQSQNVQRPISCFLCKGPHRVAE
        K++E RVQ LA AMA AERL+D  +E    ++    P  G K ++P   ++G         D+ PQ    G SRGPY Q  +   P+ C LCKGPH+V+ 
Subjt:  KLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGG-------ADKRPQGPNPGPSRGPYPQSQNVQRPISCFLCKGPHRVAE

Query:  CPHRAALTALQASVQSCNEPEVGTDCEKEEDEETPRMGALKFLSAIQKRVNGPKGTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEACRLKLTIE
        CPHRA+LTALQ S+Q  NE  V T  +K+ED + PRMGALKFLSA+Q++V  PK   EKGLMFVDATIN   +KST++DSGATHNFI++QEA RL LTI 
Subjt:  CPHRAALTALQASVQSCNEPEVGTDCEKEEDEETPRMGALKFLSAIQKRVNGPKGTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEACRLKLTIE

Query:  KDTGKMKAVNSEALPIVGVSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAKCMIVTSNNPTVVTASIKQPGGIRMISALQLKKGLN
        KD GKMKAVNSEALPIVGVSK V  K+G WTG +D VVVRMDDFDVVLGMEFL+EHKVIPMPLAKC+++T  NPTV+ ASIKQPG +RMISA+QLK+GL 
Subjt:  KDTGKMKAVNSEALPIVGVSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAKCMIVTSNNPTVVTASIKQPGGIRMISALQLKKGLN

Query:  REEPTFMAISMVEQ-------PVETRDV-------------------------------------------PPEI-------------------------
        REEPTFMAI ++E+       P E +DV                                           PPE+                         
Subjt:  REEPTFMAISMVEQ-------PVETRDV-------------------------------------------PPEI-------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------QDAFEDLKAAMMKGPVLGLAHVTKPFEVETDASDYALGGVLLQDGHPIAYESRKLNNAERRYTVSEKE
                                        Q AFE+LK  M +GPVLGL  VTKPFEVETDASD+ALGGVL+Q+GHPIAYESRKLN+AERRYTVSEKE
Subjt:  --------------------------------QDAFEDLKAAMMKGPVLGLAHVTKPFEVETDASDYALGGVLLQDGHPIAYESRKLNNAERRYTVSEKE

Query:  MLAVVHCLRSWRQYLLGSYFVVKTDNSAICHFFNQPKLTSKQARWQELLAEFDFKFEHKVGKSNQAADALSRKGEHAALCMLAHIHASKVDGSIRDLIRE
        MLAVVHCLR WRQYLLGS FVVKTDNSA CHFF+QPKLT+KQARWQE LAEFDFKFEHK GKSNQAADALSRKGEHAALCMLAHIH+SK+DGS+RD+I+E
Subjt:  MLAVVHCLRSWRQYLLGSYFVVKTDNSAICHFFNQPKLTSKQARWQELLAEFDFKFEHKVGKSNQAADALSRKGEHAALCMLAHIHASKVDGSIRDLIRE

Query:  YLQKTP
        +L K P
Subjt:  YLQKTP

TrEMBL top hits (columns: e value, % identity, alignment)
A0A5A7T0E2 Reverse transcriptase | e value: 2.2e-231 | % identity: 43.73
Query:  MAATKQLSKSHVDRLVEIEEQLLYLREVSDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKAM--RPGSFERGDSSTNPNTQIEVRM
        M+++    K+  DRLVE+EEQ+LYL EV DS+R LE+R++E SEK   IDAV  RV+G PIQ++  RV+ LE+     R  ++ERGDSST     IE R+
Subjt:  MAATKQLSKSHVDRLVEIEEQLLYLREVSDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKAM--RPGSFERGDSSTNPNTQIEVRM

Query:  GELNNSHSAMMQLFNEMTEDLKA-----------------------PNQANMG----FNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK
         EL++S   ++++ N M+ED +A                        NQA  G     +++K+PEPKPF G RDAK LEN++FD+EQYF+AT T +EE K
Subjt:  GELNNSHSAMMQLFNEMTEDLKA-----------------------PNQANMG----FNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK

Query:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKSWA
        VTLATMHL++DAKLWWRS+  DIQ GRCTI++WD LK+ELR QFFP+NVE +ARRKLREL+HTG+IR+YVKQF+ +MLDIRDMSEKDKVF F+EGLK WA
Subjt:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKSWA

Query:  RTKLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQG------PNPGPS-RGPYPQSQNVQRPISCFLCKGPHRV
        +TKLYEQRVQDL +A A+AERL D S++    +++ ++ +GG++  +P +PK+ G D+R  G       N G S RG   Q+ +  RP+SCF+CKGPH  
Subjt:  RTKLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQG------PNPGPS-RGPYPQSQNVQRPISCFLCKGPHRV

Query:  AECPHRAALTALQASVQSCN---EPEVGTDCEKEEDEETPRMGALKFLSAIQKRVNGPKGTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEACRL
         ECP++ A  A QAS+ S +   + +   +  + E+ + PRMGALKFLS++QK+V       E+GLM+VD  IN  P KSTMVDSGATHNFI+E EA RL
Subjt:  AECPHRAALTALQASVQSCN---EPEVGTDCEKEEDEETPRMGALKFLSAIQKRVNGPKGTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEACRL

Query:  KLTIEKDTGKMKAVNSEALPIVGVSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAKCMIVTSNNPTVVTASIKQPGGIRMISALQL
         L  EKD G+MKAVNS ALPI+G+ KR  ++LG W+G VDFVVV+MDDFDVVLGMEFL+EH+VIPMPLAKC+++T   P+VV   ++QP G++MISA+QL
Subjt:  KLTIEKDTGKMKAVNSEALPIVGVSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAKCMIVTSNNPTVVTASIKQPGGIRMISALQL

Query:  KKGLNREEPTFMAI--------------------------------------SMVEQPVET------------RDVPPEI--------------------
        KKGL+R+EPTFMAI                                       M++  +E             R  PPE+                    
Subjt:  KKGLNREEPTFMAI--------------------------------------SMVEQPVET------------RDVPPEI--------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------QDAFEDLKAAMMKGPVLGLAHVTKPFEVETDASDYALGGVLLQDGHPIAYESRKLNNAERRYT
                                             Q AF+ LK A+M+GP+LG+A VTKPFEVETDASDYALGGVLLQ+GHPIAYESRKLN AERRYT
Subjt:  -------------------------------------QDAFEDLKAAMMKGPVLGLAHVTKPFEVETDASDYALGGVLLQDGHPIAYESRKLNNAERRYT

Query:  VSEKEMLAVVHCLRSWRQYLLGSYFVVKTDNSAICHFFNQPKLTSKQARWQELLAEFDFKFEHKVGKSNQAADALSRKGEHAALCMLAHIHASKVDGSIR
        VSEKEMLAVVHCLR+WRQYLLGS FVVKTDNSA CHFF QPKLTSKQARWQE LAEFDF+FEHK G SNQAADALSRK EHAA+C+LAH+  S++ GSIR
Subjt:  VSEKEMLAVVHCLRSWRQYLLGSYFVVKTDNSAICHFFNQPKLTSKQARWQELLAEFDFKFEHKVGKSNQAADALSRKGEHAALCMLAHIHASKVDGSIR

Query:  DLIREYLQK
        D +RE+LQK
Subjt:  DLIREYLQK

A0A5A7UIP7 Reverse transcriptase | e value: 2.2e-231 | % identity: 43.64
Query:  MAATKQLSKSHVDRLVEIEEQLLYLREVSDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKAM--RPGSFERGDSSTNPNTQIEVRM
        M+++    K+  DRLVE+EEQ+LYL EV DS+R LE+R++E SEK   IDAV  RV+G PIQ++  RV+ LE+     R  ++ERGDSST     IE R+
Subjt:  MAATKQLSKSHVDRLVEIEEQLLYLREVSDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKAM--RPGSFERGDSSTNPNTQIEVRM

Query:  GELNNSHSAMMQLFNEMTEDLKA-----------------------PNQANMG----FNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK
         EL++S   ++++ N M+ED +A                        NQA  G     +++K+PEPKPF G RDAK LEN++FD+EQYF+AT T +EE K
Subjt:  GELNNSHSAMMQLFNEMTEDLKA-----------------------PNQANMG----FNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK

Query:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKSWA
        VTLATMHL++DAKLWWRS+  DIQ GRCTI++WD LK+ELR QFFP+NVE +ARRKLREL+HTG+IR+YVKQF+ +MLDIRDMSEKDKVF F+EGLK WA
Subjt:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKSWA

Query:  RTKLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQG------PNPGPS-RGPYPQSQNVQRPISCFLCKGPHRV
        +TKLYEQRVQDL +A A+AERL D S++    +++ ++ +GG++  +P +PK+ G D+R  G       N G S RG   Q+ +  RP+SCF+CKGPH  
Subjt:  RTKLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQG------PNPGPS-RGPYPQSQNVQRPISCFLCKGPHRV

Query:  AECPHRAALTALQASVQSCN---EPEVGTDCEKEEDEETPRMGALKFLSAIQKRVNGPKGTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEACRL
         ECP++ A  A QAS+ S +   + +   +  + E+ + PRMGALKFLS++QK+V       E+GLM+VD  IN  P KSTMVDSGATHNFI+E EA RL
Subjt:  AECPHRAALTALQASVQSCN---EPEVGTDCEKEEDEETPRMGALKFLSAIQKRVNGPKGTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEACRL

Query:  KLTIEKDTGKMKAVNSEALPIVGVSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAKCMIVTSNNPTVVTASIKQPGGIRMISALQL
         L  EKD G+MKAVNS ALPI+G+ KR  ++LG W+G VDFVVV+MDDFDVVLGMEFL+EH+VIPMPLAKC+++T   P+VV   ++QP G++MISA+QL
Subjt:  KLTIEKDTGKMKAVNSEALPIVGVSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAKCMIVTSNNPTVVTASIKQPGGIRMISALQL

Query:  KKGLNREEPTFMAI--------------------------------------SMVEQPVET------------RDVPPEI--------------------
        KKGL+R+EPTFMAI                                       M++  +E             R  PPE+                    
Subjt:  KKGLNREEPTFMAI--------------------------------------SMVEQPVET------------RDVPPEI--------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------QDAFEDLKAAMMKGPVLGLAHVTKPFEVETDASDYALGGVLLQDGHPIAYESRKLNNAERRYT
                                             Q AF+ LK A+M+GP+LG+A VTKPFEVETDASDYALGGVLLQ+GHPIAYESRKLN AERRYT
Subjt:  -------------------------------------QDAFEDLKAAMMKGPVLGLAHVTKPFEVETDASDYALGGVLLQDGHPIAYESRKLNNAERRYT

Query:  VSEKEMLAVVHCLRSWRQYLLGSYFVVKTDNSAICHFFNQPKLTSKQARWQELLAEFDFKFEHKVGKSNQAADALSRKGEHAALCMLAHIHASKVDGSIR
        VSEKEMLAVVHCLR+WRQYLLGS FVVKTDNSA CHFF QPKLTSKQARWQE LAEFDF+FEHK G SNQAADALSRK EHAA+C+LAH+  S++ GS+R
Subjt:  VSEKEMLAVVHCLRSWRQYLLGSYFVVKTDNSAICHFFNQPKLTSKQARWQELLAEFDFKFEHKVGKSNQAADALSRKGEHAALCMLAHIHASKVDGSIR

Query:  DLIREYLQK
        D +RE+LQK
Subjt:  DLIREYLQK

A0A5D3BYE6 Reverse transcriptase | e value: 5.9e-232 | % identity: 43.73
Query:  MAATKQLSKSHVDRLVEIEEQLLYLREVSDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKAM--RPGSFERGDSSTNPNTQIEVRM
        M+++  L K+  DRLVE+EEQ+LYL EV DS+R LE+R++E SEK   IDAV  RV+G PIQ++  RV+ LE+     R  ++ERGDSST     IE R+
Subjt:  MAATKQLSKSHVDRLVEIEEQLLYLREVSDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKAM--RPGSFERGDSSTNPNTQIEVRM

Query:  GELNNSHSAMMQLFNEMTEDLKA-----------------------PNQANMG----FNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK
         EL++S   ++++ N M+ED +A                        NQA  G     +++K+PEPKPF G RDAK LEN++FD+EQYF+AT T +EE K
Subjt:  GELNNSHSAMMQLFNEMTEDLKA-----------------------PNQANMG----FNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK

Query:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKSWA
        VTLATMHL++DAKLWWRS+  DIQ GRCTI++WD LK+ELR QFFP+NVE +ARRKLREL+HTG+IR+YVKQF+ +MLDIRDMSEKDKVF F+EGLK WA
Subjt:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKSWA

Query:  RTKLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQG------PNPGPS-RGPYPQSQNVQRPISCFLCKGPHRV
        +TKLYEQRVQDL +A A+AERL D S++    +++ ++ +GG++  +P +PK+ G D+R  G       N G S RG   Q+ +  RP+SCF+CKGPH  
Subjt:  RTKLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQG------PNPGPS-RGPYPQSQNVQRPISCFLCKGPHRV

Query:  AECPHRAALTALQASVQSCN---EPEVGTDCEKEEDEETPRMGALKFLSAIQKRVNGPKGTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEACRL
         ECP++ A  A QAS+ S +   + +   +  + E+ + PRMGALKFLS++QK+V       E+GLM+VD  IN  P KSTMVDSGATHNFI+E EA RL
Subjt:  AECPHRAALTALQASVQSCN---EPEVGTDCEKEEDEETPRMGALKFLSAIQKRVNGPKGTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEACRL

Query:  KLTIEKDTGKMKAVNSEALPIVGVSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAKCMIVTSNNPTVVTASIKQPGGIRMISALQL
         L  EKD G+MKAVNS ALPI+G+ KR  ++LG W+G VDFVVV+MDDFDVVLGMEFL+EH+VIPMPLAKC+++T   P+VV   ++QP G++MISA+QL
Subjt:  KLTIEKDTGKMKAVNSEALPIVGVSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAKCMIVTSNNPTVVTASIKQPGGIRMISALQL

Query:  KKGLNREEPTFMAI--------------------------------------SMVEQPVET------------RDVPPEI--------------------
        KKGL+R+EPTFMAI                                       M++  +E             R  PPE+                    
Subjt:  KKGLNREEPTFMAI--------------------------------------SMVEQPVET------------RDVPPEI--------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------QDAFEDLKAAMMKGPVLGLAHVTKPFEVETDASDYALGGVLLQDGHPIAYESRKLNNAERRYT
                                             Q AF+ LK A+M+GP+LG+A VTKPFEVETDASDYALGGVLLQ+GHPIAYESRKLN AERRYT
Subjt:  -------------------------------------QDAFEDLKAAMMKGPVLGLAHVTKPFEVETDASDYALGGVLLQDGHPIAYESRKLNNAERRYT

Query:  VSEKEMLAVVHCLRSWRQYLLGSYFVVKTDNSAICHFFNQPKLTSKQARWQELLAEFDFKFEHKVGKSNQAADALSRKGEHAALCMLAHIHASKVDGSIR
        VSEKEMLAVVHCLR+WRQYLLGS FVVKTDNSA CHFF QPKLTSKQARWQE LAEFDF+FEHK G SNQAADALSRK EHAA+C+LAH+  S++ GS+R
Subjt:  VSEKEMLAVVHCLRSWRQYLLGSYFVVKTDNSAICHFFNQPKLTSKQARWQELLAEFDFKFEHKVGKSNQAADALSRKGEHAALCMLAHIHASKVDGSIR

Query:  DLIREYLQK
        D +RE+LQK
Subjt:  DLIREYLQK

A0A6J1D906 Reverse transcriptase | e value: 0.0e+00 | % identity: 66.33
Query:  MAATKQLSKSHVDRLVEIEEQLLYLREVSDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKAMRPGSFERGDSSTNPNTQIEVRMGE
        M+ TKQLSKSHVDRLVEIEEQLLYLREV D LRLLEARVDEFSEKFGEIDAVNAR+DGLPIQDIAMRVETLESKA RPGSFERGDSSTNPNTQIEVRMGE
Subjt:  MAATKQLSKSHVDRLVEIEEQLLYLREVSDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKAMRPGSFERGDSSTNPNTQIEVRMGE

Query:  LNNSHSAMMQLFNEMTEDLK---------------------------APNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT
        LNNSHSAMMQLFNEMTED K                           APNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT
Subjt:  LNNSHSAMMQLFNEMTEDLK---------------------------APNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT

Query:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKSWART
        LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVM+DIRDMSEKDKVFVFIEGLK WART
Subjt:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKSWART

Query:  KLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNVQRPISCFLCKGPHRVAECPHRAAL
        KLYEQRVQDLATAMA+AERLLDY+SEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQN QR +S FLCKGPHRVAECPHRAAL
Subjt:  KLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNVQRPISCFLCKGPHRVAECPHRAAL

Query:  TALQASVQSCNEPEVGTDCEKEEDEETPRMGALKFLSAIQKRVNGPKGTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEACRLKLTIEKDTGKMK
        TALQASVQSCNEPEV TDCEKEEDEETPRM ALKFLSAIQKRVNGPKGTSEKGLMFVDATINCN AKSTMVDSGATHNFISEQEA RLKLTIEKDTGKMK
Subjt:  TALQASVQSCNEPEVGTDCEKEEDEETPRMGALKFLSAIQKRVNGPKGTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEACRLKLTIEKDTGKMK

Query:  AVNSEALPIVGVSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAKCMIVTSNNPTVVTASIKQPGGIRMISALQLKKGLNREEPTFM
         VN EALPIVGVSKRV LKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAKCMIVTSN+PTVVT SIKQPGGIRMISALQLKKGLNREEPTFM
Subjt:  AVNSEALPIVGVSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAKCMIVTSNNPTVVTASIKQPGGIRMISALQLKKGLNREEPTFM

Query:  AISMVEQPVETRDVPPEI----------------------------------------------------------------------------------
        AI MVEQPVETRDVPPEI                                                                                  
Subjt:  AISMVEQPVETRDVPPEI----------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------QDAFEDLKAAMMKGPVLGLAHVTKPFEVETDASDYALGGVLLQDGHPIAYESRKLNNAERRYTVSEKEMLAVVHC
                                 QDAFEDLKAAMMKGPVLGLA VTKPFEVETDASDYALGGVLLQD HPI YESRKLNNAERRYTVSEKEMLAVVHC
Subjt:  -------------------------QDAFEDLKAAMMKGPVLGLAHVTKPFEVETDASDYALGGVLLQDGHPIAYESRKLNNAERRYTVSEKEMLAVVHC

Query:  LRSWRQYLLGSYFVVKTDNSAICHFFNQPKLTSKQARWQELLAEFDFKFEHKVGKSNQAADALSRKGEHAALCMLAHIHASKVDGSIRDLIREYLQKTP
        LRSWRQYLLGS FVVKTDNSAICHFFNQPKLTSKQARWQELLAEFDFKFEHK GKSNQAADALSRKGEHA LCMLAHIH SK DGSIRDLI EYLQ  P
Subjt:  LRSWRQYLLGSYFVVKTDNSAICHFFNQPKLTSKQARWQELLAEFDFKFEHKVGKSNQAADALSRKGEHAALCMLAHIHASKVDGSIRDLIREYLQKTP

A0A6J1DK29 uncharacterized protein LOC111021829 (e value: 0.0e+00, % identity: 63.42)
Query:  MAATKQLSKSHVDRLVEIEEQLLYLREVSDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKAMRPGSFERGDSSTNPNTQIEVRMGE
        M+ATKQLSKSHVDRLVEIEEQLLYLREV DSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVET ESKA RPGSFERGDSSTNPNTQIEVRMGE
Subjt:  MAATKQLSKSHVDRLVEIEEQLLYLREVSDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKAMRPGSFERGDSSTNPNTQIEVRMGE

Query:  LNNSHSAMMQLFNEMTEDLK---------------------------APNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT
        LNNSHS MMQLFNEMTED K                           APNQ NMGFNKLKVPEPKPFNGNR  KDLENF FDVEQYFK TGT SE MKVT
Subjt:  LNNSHSAMMQLFNEMTEDLK---------------------------APNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT

Query:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKSWART
        LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFF DNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLK WART
Subjt:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKSWART

Query:  KLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNVQRPISCFLCKGPHRVAECPHRAAL
        KLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPK GGADKRP GPNPGPSRGPYPQSQN QRP SCFLC+GPH+VAECPHRAAL
Subjt:  KLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNKTFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNVQRPISCFLCKGPHRVAECPHRAAL

Query:  TALQASVQSCNEPEVGTDCEKEEDEETPRMGALKFLSAIQKRVNGPKGTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEACRLKLTIEKDTGKMK
        TALQASVQSCNEPEVGTDCEKEEDEETPRMGALKFLSAIQKRVNGPKGTSEKGLMFVDA INCNPAKS MVDSGATHNFISEQEA RLKLTIEKDTGKMK
Subjt:  TALQASVQSCNEPEVGTDCEKEEDEETPRMGALKFLSAIQKRVNGPKGTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEACRLKLTIEKDTGKMK

Query:  AVNSEALPIVGVSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAKCMIVTSNNPTVVTASIKQPGGIRMISALQLKKGLNREEPTFM
        AVNSEALPIVGVSKRVTLKLGTWTGS DFVVVRMDDFDVVLGMEFLIEHKVIPMPLAKCMIVTSN+PTVVTASIKQPGGIRMISALQLKKGLNREEPTFM
Subjt:  AVNSEALPIVGVSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVIPMPLAKCMIVTSNNPTVVTASIKQPGGIRMISALQLKKGLNREEPTFM

Query:  AISMVEQPVETRDVPPEI----------------------------------------------------------------------------------
              QPVETRDVPPEI                                                                                  
Subjt:  AISMVEQPVETRDVPPEI----------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------QDAFEDLKAAMMKGPVLGLAHVTKPFEVETDASDYALGGVLLQDGHPIAYESRKLNNAERRYTVSEKEMLAVVHC
                                 QDAFEDLKAAMMKGPVLGLA VTKPFEVETDASDYALGGVLLQD HPIAYESRKLNNAERRYTVSEKEMLAVVHC
Subjt:  -------------------------QDAFEDLKAAMMKGPVLGLAHVTKPFEVETDASDYALGGVLLQDGHPIAYESRKLNNAERRYTVSEKEMLAVVHC

Query:  LRSWRQ
        LRSWRQ
Subjt:  LRSWRQ

SwissProt top hits (e value, % identity, alignment)
P04323 Retrovirus-related Pol polyprotein from transposon 17.6 (e value: 3.1e-28, % identity: 44.76)
Query:  PEIQDAFEDLKAAMMKGPVLGLAHVTKPFEVETDASDYALGGVLLQDGHPIAYESRKLNNAERRYTVSEKEMLAVVHCLRSWRQYLLGSYFVVKTDNSAI
        PE   AF+ LK  + + P+L +   TK F + TDASD ALG VL QDGHP++Y SR LN  E  Y+  EKE+LA+V   +++R YLLG +F + +D+  +
Subjt:  PEIQDAFEDLKAAMMKGPVLGLAHVTKPFEVETDASDYALGGVLLQDGHPIAYESRKLNNAERRYTVSEKEMLAVVHCLRSWRQYLLGSYFVVKTDNSAI

Query:  CHFFNQPKLTSKQARWQELLAEFDFKFEHKVGKSNQAADALSR
           +      SK  RW+  L+EFDF  ++  GK N  ADALSR
Subjt:  CHFFNQPKLTSKQARWQELLAEFDFKFEHKVGKSNQAADALSR

P0CT41 Transposon Tf2-12 polyprotein (e value: 2.6e-19, % identity: 36.36)
Query:  SMVEQPVETRDVPPEIQDAFEDLKAAMMKGPVLGLAHVTKPFEVETDASDYALGGVLLQDG-----HPIAYESRKLNNAERRYTVSEKEMLAVVHCLRSW
        +++++ V  +  P + Q A E++K  ++  PVL     +K   +ETDASD A+G VL Q       +P+ Y S K++ A+  Y+VS+KEMLA++  L+ W
Subjt:  SMVEQPVETRDVPPEIQDAFEDLKAAMMKGPVLGLAHVTKPFEVETDASDYALGGVLLQDG-----HPIAYESRKLNNAERRYTVSEKEMLAVVHCLRSW

Query:  RQYLLGSY--FVVKTDN-SAICHFFNQPKLTSKQ-ARWQELLAEFDFKFEHKVGKSNQAADALSR
        R YL  +   F + TD+ + I    N+ +  +K+ ARWQ  L +F+F+  ++ G +N  ADALSR
Subjt:  RQYLLGSY--FVVKTDN-SAICHFFNQPKLTSKQ-ARWQELLAEFDFKFEHKVGKSNQAADALSR

P20825 Retrovirus-related Pol polyprotein from transposon 297 (e value: 1.7e-26, % identity: 43.66)
Query:  EIQDAFEDLKAAMMKGPVLGLAHVTKPFEVETDASDYALGGVLLQDGHPIAYESRKLNNAERRYTVSEKEMLAVVHCLRSWRQYLLGSYFVVKTDNSAIC
        E  +AFE LKA +++ P+L L    K F + TDAS+ ALG VL Q+GHPI++ SR LN+ E  Y+  EKE+LA+V   +++R YLLG  F++ +D+  + 
Subjt:  EIQDAFEDLKAAMMKGPVLGLAHVTKPFEVETDASDYALGGVLLQDGHPIAYESRKLNNAERRYTVSEKEMLAVVHCLRSWRQYLLGSYFVVKTDNSAIC

Query:  HFFNQPKLTSKQARWQELLAEFDFKFEHKVGKSNQAADALSR
           N  +  +K  RW+  L+E+ FK ++  GK N  ADALSR
Subjt:  HFFNQPKLTSKQARWQELLAEFDFKFEHKVGKSNQAADALSR

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus (e value: 6.2e-21, % identity: 40.56)
Query:  AFEDLKAAMMKGPVLGLAHVTKPFEVETDASDYALGGVLLQD----GHPIAYESRKLNNAERRYTVSEKEMLAVVHCLRSWRQYLLGSYFV-VKTDNSAI
        +F DLK+ +    +L     TKPF + TDAS++A+G VL QD      PIAY SR LN  E  Y   EKEMLA++  L + R YL G+  + V TD+  +
Subjt:  AFEDLKAAMMKGPVLGLAHVTKPFEVETDASDYALGGVLLQD----GHPIAYESRKLNNAERRYTVSEKEMLAVVHCLRSWRQYLLGSYFV-VKTDNSAI

Query:  CHFFNQPKLTSKQARWQELLAEFDFKFEHKVGKSNQAADALSR
                  +K  RW+  + E++ +  +K GKSN  ADALSR
Subjt:  CHFFNQPKLTSKQARWQELLAEFDFKFEHKVGKSNQAADALSR

Q9UR07 Transposon Tf2-11 polyprotein (e value: 2.6e-19, % identity: 36.36)
Query:  SMVEQPVETRDVPPEIQDAFEDLKAAMMKGPVLGLAHVTKPFEVETDASDYALGGVLLQDG-----HPIAYESRKLNNAERRYTVSEKEMLAVVHCLRSW
        +++++ V  +  P + Q A E++K  ++  PVL     +K   +ETDASD A+G VL Q       +P+ Y S K++ A+  Y+VS+KEMLA++  L+ W
Subjt:  SMVEQPVETRDVPPEIQDAFEDLKAAMMKGPVLGLAHVTKPFEVETDASDYALGGVLLQDG-----HPIAYESRKLNNAERRYTVSEKEMLAVVHCLRSW

Query:  RQYLLGSY--FVVKTDN-SAICHFFNQPKLTSKQ-ARWQELLAEFDFKFEHKVGKSNQAADALSR
        R YL  +   F + TD+ + I    N+ +  +K+ ARWQ  L +F+F+  ++ G +N  ADALSR
Subjt:  RQYLLGSY--FVVKTDN-SAICHFFNQPKLTSKQ-ARWQELLAEFDFKFEHKVGKSNQAADALSR

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGCGGCGACAAAACAACTGAGCAAGTCGCACGTCGACCGACTGGTGGAGATAGAAGAACAACTTCTCTACCTAAGAGAAGTCTCAGACTCCCTCCGTCTGCTG
GAGGCGCGAGTTGATGAATTCTCCGAGAAGTTTGGAGAAATAGACGCAGTGAATGCCCGTGTAGACGGGTTGCCGATACAAGATATAGCCATGAGGGTTGAGACC
CTAGAAAGCAAAGCTATGCGTCCTGGTAGCTTTGAACGTGGAGATAGTTCCACGAACCCAAACACACAGATAGAAGTACGTATGGGAGAGTTAAACAACTCGCAT
TCGGCAATGATGCAATTGTTTAACGAAATGACAGAAGACCTCAAAGCTCCCAACCAAGCGAACATGGGGTTCAACAAGTTGAAGGTCCCAGAGCCCAAACCATTC
AATGGCAATAGAGACGCAAAAGATCTCGAGAACTTCCTGTTCGACGTAGAACAATACTTCAAGGCTACGGGGACAACGTCAGAAGAGATGAAAGTGACTTTGGCC
ACCATGCATCTTACTGATGATGCAAAGTTGTGGTGGAGATCTAAAGTCAACGACATTCAGAATGGTCGATGCACGATCAATAGCTGGGATGATCTAAAGAAAGAA
TTGAGGGGTCAGTTCTTCCCCGACAATGTCGAGTTCATGGCTAGAAGGAAGCTACGTGAACTCCGACACACTGGAACAATCCGGGACTACGTGAAACAATTCTCT
GCCGTGATGCTGGATATTCGCGACATGTCAGAGAAAGACAAGGTGTTCGTCTTTATCGAAGGATTGAAATCGTGGGCCCGAACCAAGCTTTATGAACAAAGGGTA
CAAGACCTTGCCACCGCCATGGCCTCTGCCGAAAGATTGCTAGACTATAGCAGTGAACCATCCCACCCAAAAAAGAATGCTACAAACCCCACTGGGGGAAACAAG
ACGTTCAAACCCTTTACCCCAAAGAGTGGGGGAGCTGACAAGAGACCTCAAGGCCCGAATCCTGGTCCTTCCCGAGGACCCTATCCACAAAGTCAAAACGTTCAA
AGGCCGATATCATGCTTCTTGTGCAAAGGTCCCCACCGGGTAGCTGAATGCCCACACCGAGCTGCCCTTACTGCCCTTCAAGCATCCGTTCAGTCGTGCAATGAA
CCTGAAGTTGGAACGGATTGTGAGAAGGAAGAAGACGAAGAAACACCTAGAATGGGGGCGTTAAAATTCCTATCTGCCATTCAAAAGAGGGTCAATGGACCCAAA
GGAACGTCTGAAAAGGGGCTTATGTTCGTAGATGCTACGATCAATTGTAACCCTGCAAAGAGCACCATGGTGGATTCGGGTGCAACCCACAACTTCATATCAGAA
CAGGAAGCCTGCCGACTGAAATTGACTATTGAGAAGGATACGGGTAAGATGAAGGCCGTCAACTCCGAAGCTCTACCCATTGTGGGAGTTTCTAAAAGAGTGACG
CTGAAATTAGGGACGTGGACAGGCAGTGTCGATTTCGTAGTAGTTCGCATGGACGACTTCGACGTTGTACTGGGGATGGAGTTCCTCATCGAACATAAAGTCATA
CCGATGCCCCTGGCCAAGTGTATGATTGTCACTAGTAATAATCCCACTGTGGTCACAGCTAGCATCAAACAACCTGGTGGCATAAGGATGATATCCGCGTTACAG
TTGAAGAAGGGCCTCAACCGGGAGGAACCTACCTTTATGGCCATTTCGATGGTCGAACAGCCGGTGGAGACAAGAGATGTCCCACCTGAAATTCAAGACGCCTTC
GAAGATCTAAAGGCGGCCATGATGAAAGGCCCAGTACTTGGGCTAGCCCACGTGACGAAGCCGTTTGAAGTAGAAACAGACGCTTCAGACTACGCTTTAGGTGGT
GTCCTTCTCCAAGACGGTCACCCGATCGCTTACGAAAGTCGCAAGCTGAATAATGCGGAACGACGTTATACCGTATCAGAGAAAGAAATGCTAGCAGTCGTTCAC
TGCCTCAGGTCTTGGAGGCAGTACTTGTTGGGCTCATATTTTGTGGTAAAGACGGACAACAGTGCAATCTGCCACTTCTTCAATCAGCCTAAGCTAACATCCAAA
CAGGCCAGGTGGCAGGAATTGTTAGCCGAGTTTGATTTTAAGTTCGAACACAAGGTAGGAAAGTCCAACCAAGCTGCCGACGCTCTTAGCCGTAAAGGAGAACAT
GCGGCCCTATGCATGTTAGCCCACATTCACGCCAGCAAAGTGGACGGGTCGATCCGTGACCTCATTAGAGAGTATCTTCAAAAGACCCCTCCACCCAATCTGTGG
TCGAGCTGGCTAAGACCGGGAAGACCCGCCAGTTCTGGGTTGAAGGGGACCTATTATTTACAAGAGGAAACAGACTGTACGTGCCAAGAAAGGGAGACCTGA
mRNA sequence
ATGGCGGCGACAAAACAACTGAGCAAGTCGCACGTCGACCGACTGGTGGAGATAGAAGAACAACTTCTCTACCTAAGAGAAGTCTCAGACTCCCTCCGTCTGCTG
GAGGCGCGAGTTGATGAATTCTCCGAGAAGTTTGGAGAAATAGACGCAGTGAATGCCCGTGTAGACGGGTTGCCGATACAAGATATAGCCATGAGGGTTGAGACC
CTAGAAAGCAAAGCTATGCGTCCTGGTAGCTTTGAACGTGGAGATAGTTCCACGAACCCAAACACACAGATAGAAGTACGTATGGGAGAGTTAAACAACTCGCAT
TCGGCAATGATGCAATTGTTTAACGAAATGACAGAAGACCTCAAAGCTCCCAACCAAGCGAACATGGGGTTCAACAAGTTGAAGGTCCCAGAGCCCAAACCATTC
AATGGCAATAGAGACGCAAAAGATCTCGAGAACTTCCTGTTCGACGTAGAACAATACTTCAAGGCTACGGGGACAACGTCAGAAGAGATGAAAGTGACTTTGGCC
ACCATGCATCTTACTGATGATGCAAAGTTGTGGTGGAGATCTAAAGTCAACGACATTCAGAATGGTCGATGCACGATCAATAGCTGGGATGATCTAAAGAAAGAA
TTGAGGGGTCAGTTCTTCCCCGACAATGTCGAGTTCATGGCTAGAAGGAAGCTACGTGAACTCCGACACACTGGAACAATCCGGGACTACGTGAAACAATTCTCT
GCCGTGATGCTGGATATTCGCGACATGTCAGAGAAAGACAAGGTGTTCGTCTTTATCGAAGGATTGAAATCGTGGGCCCGAACCAAGCTTTATGAACAAAGGGTA
CAAGACCTTGCCACCGCCATGGCCTCTGCCGAAAGATTGCTAGACTATAGCAGTGAACCATCCCACCCAAAAAAGAATGCTACAAACCCCACTGGGGGAAACAAG
ACGTTCAAACCCTTTACCCCAAAGAGTGGGGGAGCTGACAAGAGACCTCAAGGCCCGAATCCTGGTCCTTCCCGAGGACCCTATCCACAAAGTCAAAACGTTCAA
AGGCCGATATCATGCTTCTTGTGCAAAGGTCCCCACCGGGTAGCTGAATGCCCACACCGAGCTGCCCTTACTGCCCTTCAAGCATCCGTTCAGTCGTGCAATGAA
CCTGAAGTTGGAACGGATTGTGAGAAGGAAGAAGACGAAGAAACACCTAGAATGGGGGCGTTAAAATTCCTATCTGCCATTCAAAAGAGGGTCAATGGACCCAAA
GGAACGTCTGAAAAGGGGCTTATGTTCGTAGATGCTACGATCAATTGTAACCCTGCAAAGAGCACCATGGTGGATTCGGGTGCAACCCACAACTTCATATCAGAA
CAGGAAGCCTGCCGACTGAAATTGACTATTGAGAAGGATACGGGTAAGATGAAGGCCGTCAACTCCGAAGCTCTACCCATTGTGGGAGTTTCTAAAAGAGTGACG
CTGAAATTAGGGACGTGGACAGGCAGTGTCGATTTCGTAGTAGTTCGCATGGACGACTTCGACGTTGTACTGGGGATGGAGTTCCTCATCGAACATAAAGTCATA
CCGATGCCCCTGGCCAAGTGTATGATTGTCACTAGTAATAATCCCACTGTGGTCACAGCTAGCATCAAACAACCTGGTGGCATAAGGATGATATCCGCGTTACAG
TTGAAGAAGGGCCTCAACCGGGAGGAACCTACCTTTATGGCCATTTCGATGGTCGAACAGCCGGTGGAGACAAGAGATGTCCCACCTGAAATTCAAGACGCCTTC
GAAGATCTAAAGGCGGCCATGATGAAAGGCCCAGTACTTGGGCTAGCCCACGTGACGAAGCCGTTTGAAGTAGAAACAGACGCTTCAGACTACGCTTTAGGTGGT
GTCCTTCTCCAAGACGGTCACCCGATCGCTTACGAAAGTCGCAAGCTGAATAATGCGGAACGACGTTATACCGTATCAGAGAAAGAAATGCTAGCAGTCGTTCAC
TGCCTCAGGTCTTGGAGGCAGTACTTGTTGGGCTCATATTTTGTGGTAAAGACGGACAACAGTGCAATCTGCCACTTCTTCAATCAGCCTAAGCTAACATCCAAA
CAGGCCAGGTGGCAGGAATTGTTAGCCGAGTTTGATTTTAAGTTCGAACACAAGGTAGGAAAGTCCAACCAAGCTGCCGACGCTCTTAGCCGTAAAGGAGAACAT
GCGGCCCTATGCATGTTAGCCCACATTCACGCCAGCAAAGTGGACGGGTCGATCCGTGACCTCATTAGAGAGTATCTTCAAAAGACCCCTCCACCCAATCTGTGG
TCGAGCTGGCTAAGACCGGGAAGACCCGCCAGTTCTGGGTTGAAGGGGACCTATTATTTACAAGAGGAAACAGACTGTACGTGCCAAGAAAGGGAGACCTGA
Protein sequence
MAATKQLSKSHVDRLVEIEEQLLYLREVSDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKAMRPGSFERGDSSTNPNTQIEVRMGELNNSH
SAMMQLFNEMTEDLKAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVTLATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKE
LRGQFFPDNVEFMARRKLRELRHTGTIRDYVKQFSAVMLDIRDMSEKDKVFVFIEGLKSWARTKLYEQRVQDLATAMASAERLLDYSSEPSHPKKNATNPTGGNK
TFKPFTPKSGGADKRPQGPNPGPSRGPYPQSQNVQRPISCFLCKGPHRVAECPHRAALTALQASVQSCNEPEVGTDCEKEEDEETPRMGALKFLSAIQKRVNGPK
GTSEKGLMFVDATINCNPAKSTMVDSGATHNFISEQEACRLKLTIEKDTGKMKAVNSEALPIVGVSKRVTLKLGTWTGSVDFVVVRMDDFDVVLGMEFLIEHKVI
PMPLAKCMIVTSNNPTVVTASIKQPGGIRMISALQLKKGLNREEPTFMAISMVEQPVETRDVPPEIQDAFEDLKAAMMKGPVLGLAHVTKPFEVETDASDYALGG
VLLQDGHPIAYESRKLNNAERRYTVSEKEMLAVVHCLRSWRQYLLGSYFVVKTDNSAICHFFNQPKLTSKQARWQELLAEFDFKFEHKVGKSNQAADALSRKGEH
AALCMLAHIHASKVDGSIRDLIREYLQKTPPPNLWSSWLRPGRPASSGLKGTYYLQEETDCTCQERET