; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022038 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022038
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr7:16728249..16734255
RNA-Seq ExpressionLag0022038
SyntenyLag0022038
Gene Ontology termsGO:0090304 - nucleic acid metabolic process (biological process)
GO:0016740 - transferase activity (molecular function)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_031395197.1 uncharacterized protein LOC116206453 [Punica granatum]2.9e-27044.64Show/hide
Query:  VDPEIDRTYHIRRRNRRPRRQPGEMAAPNQPNRPLNPIQMADD--------RARGIRDYAATANCNFNPGIVQPTLETERFELKPVMFQMFTAAGQFGGS
        +DPEI+RT H  RR  R R +             L  ++MADD         AR +RDYA          I +PT+    FELKP + QM   + QFGG 
Subjt:  VDPEIDRTYHIRRRNRRPRRQPGEMAAPNQPNRPLNPIQMADD--------RARGIRDYAATANCNFNPGIVQPTLETERFELKPVMFQMFTAAGQFGGS

Query:  ATEDPHSHLKSFLDICNKFIIPGVTPEQLQVILFPYYLRDAAKVWLNQQPANSIATWNVLAEKFLEKYFPPTTNSRLRKEIMTFTQLDDETLSEAWERFK
          E P  H+  FL  CN   +  VT + +++ LFP+ LRD A+ W N  P  SI TW  L+ KFL ++FPP   +RLR EI  FT+ + E+L EAWERFK
Subjt:  ATEDPHSHLKSFLDICNKFIIPGVTPEQLQVILFPYYLRDAAKVWLNQQPANSIATWNVLAEKFLEKYFPPTTNSRLRKEIMTFTQLDDETLSEAWERFK

Query:  GLLQTCPHHGLPLCIQLETFYVGLNKNSQVLVDSSANGALLRKTYDEAHEILDRIARNNYEWGTAEDRRRRPIKTNSGSFEVDHMTSVNAKIVALTLKMD
          ++ CPHHGLP  + +E FY+ L+   + LVD++A GAL+ K YDEA  +++ +A + + W   E  + R    N    ++D + ++  +I ALT ++ 
Subjt:  GLLQTCPHHGLPLCIQLETFYVGLNKNSQVLVDSSANGALLRKTYDEAHEILDRIARNNYEWGTAEDRRRRPIKTNSGSFEVDHMTSVNAKIVALTLKMD

Query:  ALTARTAPLIAQLNAFGCEICGEGHTNDQCPS-------NPESVLYMGSSGATRKNPYSSTYNPGWRNHPNFSWSGSAN-------MQQQGNVNHAPRLF
         LT+  +    Q+ AF CE+C   H+  +C S       N E V ++ +   + + PYS+TYNPGWRNHPNFSW    N        Q+QG   +AP   
Subjt:  ALTARTAPLIAQLNAFGCEICGEGHTNDQCPS-------NPESVLYMGSSGATRKNPYSSTYNPGWRNHPNFSWSGSAN-------MQQQGNVNHAPRLF

Query:  NSNNDQSSQGHFQRNMDG--GNQRASSSNALENMMKEFMTSQKEYMAKNDAKNEPTPTQDDP-----------CKNATTEGHSQPPEEKATQEKGSEKKE
        + +  +     + +  D    NQ+A+  N LE  + +      + ++   + + P+ T+++P            +       +Q  EE   ++KG +K E
Subjt:  NSNNDQSSQGHFQRNMDG--GNQRASSSNALENMMKEFMTSQKEYMAKNDAKNEPTPTQDDP-----------CKNATTEGHSQPPEEKATQEKGSEKKE

Query:  AHAEKELDT--YKHALLFPQRLRKKNDEAQFKKFLNVLRELHINVPLVEALEKMPSYAKFLKDILMNKRPWKEFETVALTSECSAILAN---KLPPKLKD
           +K L    Y   + FP+RL+++  +AQF KFL+V ++L IN+P  EAL++MPSYA+F+KD+L  KR +   E V LT ECS IL      LP K +D
Subjt:  AHAEKELDT--YKHALLFPQRLRKKNDEAQFKKFLNVLRELHINVPLVEALEKMPSYAKFLKDILMNKRPWKEFETVALTSECSAILAN---KLPPKLKD

Query:  PGSFTIPCSIGGIDVGRALCDLGASINLMPLSIYKRLGIGEARPTTITLQLADRSVVHPEGKIEDVLVKVDKFIFPADFIILDYETDKEIPIILGRPFLA
         GSFT+PC+IG       L D GASINLMPLSI+++LG+GE + T +TLQLADRS+ +P+G +E+VLVKVDKFIFP DFI+L+ E D+E+P+ILGRPFLA
Subjt:  PGSFTIPCSIGGIDVGRALCDLGASINLMPLSIYKRLGIGEARPTTITLQLADRSVVHPEGKIEDVLVKVDKFIFPADFIILDYETDKEIPIILGRPFLA

Query:  TRQTLIDVQKGELTMRMNDQKVVFNILSSLKYLDDPKECNMVARIDVCYPQEI--------LSNAVSPTEGGNQEDQERGNRILPPHEVVTHMLDLEGRN
        T + LIDV++G+LT+R+ ++++ FN+  ++K  DD K C  +  ID    + +        + + +   +  + +D+     +    E+     +  G +
Subjt:  TRQTLIDVQKGELTMRMNDQKVVFNILSSLKYLDDPKECNMVARIDVCYPQEI--------LSNAVSPTEGGNQEDQERGNRILPPHEVVTHMLDLEGRN

Query:  PTLP--SITQPPALELKTLPSHLKYRFLGEKESLPVIISSKLDSSQEHLLMEVLARHKKAIGWSLADIKGISPSYCMHKIRLIDESNEFVERQRRLNPAM
         T P  S+TQ P LELK LPSHLKY +LG  ++LP+IISS L   QE  L+ VL  HK+AIGW++ADIKGISP  C H+I L  E    V+ QRRLNP +
Subjt:  PTLP--SITQPPALELKTLPSHLKYRFLGEKESLPVIISSKLDSSQEHLLMEVLARHKKAIGWSLADIKGISPSYCMHKIRLIDESNEFVERQRRLNPAM

Query:  MEVVQKEIIKWLDAGVIYLIADSRWESPIQCVPKKGGMTVVANANNELIPSRPTT---------------------------------GQAYYCFLDGYS
         EVV+KE++K LDAG+IY I+DS+W SP+Q VPKKGGMTVV N  N+LIP+R  T                                 G  YYCFLDGYS
Subjt:  MEVVQKEIIKWLDAGVIYLIADSRWESPIQCVPKKGGMTVVANANNELIPSRPTT---------------------------------GQAYYCFLDGYS

Query:  GYNQITIAPDDQEKTIFTCPYGTFAFRRMPFGLCNAPGTFQRCMMPIFSELLEKSVEVFMDDFSVYGKSFADCLTQLEQVLERCEETHLVLNWEKCHFMV
        GYNQI IAP+DQEKT FTCPYGTFAFRRMPFGLCNAP TFQRCMM IFS++LE  +E+FMDDFSV+GKSF  CLT L  VL+RC+ET+L+LNWEKCHFMV
Subjt:  GYNQITIAPDDQEKTIFTCPYGTFAFRRMPFGLCNAPGTFQRCMMPIFSELLEKSVEVFMDDFSVYGKSFADCLTQLEQVLERCEETHLVLNWEKCHFMV

Query:  EEGIVLGHRISSAGIEVDKAKIDVIEKLLPPTSVKGIRSFLGHAGFYRRHLNPSSLLLLSIAAIL
         EGIVLGH++S  GIEVD+AK+++IEKL PPTS KG+RSFLGHAGFYRR +   S +   +  +L
Subjt:  EEGIVLGHRISSAGIEVDKAKIDVIEKLLPPTSVKGIRSFLGHAGFYRRHLNPSSLLLLSIAAIL

XP_038972405.1 uncharacterized protein LOC120104748 [Phoenix dactylifera]1.5e-27446.01Show/hide
Query:  MADDR----ARGIRDYAATANCNFNPGIVQPTLETERFELKPVMFQMFTAAGQFGGSATEDPHSHLKSFLDICNKFIIPGVTPEQLQVILFPYYLRDAAK
        MAD++     R + DYA        P IV+PT+    FE+KP + QM     QFGG  +EDPH+HL +FL+IC+   + GV+ + +++ LFP+ L+D AK
Subjt:  MADDR----ARGIRDYAATANCNFNPGIVQPTLETERFELKPVMFQMFTAAGQFGGSATEDPHSHLKSFLDICNKFIIPGVTPEQLQVILFPYYLRDAAK

Query:  VWLNQQPANSIATWNVLAEKFLEKYFPPTTNSRLRKEIMTFTQLDDETLSEAWERFKGLLQTCPHHGLPLCIQLETFYVGLNKNSQVLVDSSANGALLRK
         WLN +  NS  TWN L++ FL KYFPP   ++LR +I +F Q D E+L EAWERFK L + CPHHGLP  + ++TFY GL  + ++ +D++A G L+ K
Subjt:  VWLNQQPANSIATWNVLAEKFLEKYFPPTTNSRLRKEIMTFTQLDDETLSEAWERFKGLLQTCPHHGLPLCIQLETFYVGLNKNSQVLVDSSANGALLRK

Query:  TYDEAHEILDRIARNNYEWGTAEDRRRRPIKTNSGSFEVDHMTSVNAKIVALTL---KMDALTARTAPLIAQLNAFGCEICGEGHTNDQCPSNPESVLYM
        + +EA+E+L+ +A NNY+W    + R  P K   G ++VD +  +NAK+ +L     K+  + + ++P+++      C+ CG  H +  C        Y 
Subjt:  TYDEAHEILDRIARNNYEWGTAEDRRRRPIKTNSGSFEVDHMTSVNAKIVALTL---KMDALTARTAPLIAQLNAFGCEICGEGHTNDQCPSNPESVLYM

Query:  GSSGATRKNPYSSTYNPGWRNHPNFSWSGSANMQQQGNVNHAPRLFNSNNDQ----------------SSQGHFQRNMDGGNQRASSSNALENMMKEFMT
         +    + NPYS+TYNPGWRNHPNFSW    N        H P  F     Q                +S   F+R     +Q ASS+  +E  + +   
Subjt:  GSSGATRKNPYSSTYNPGWRNHPNFSWSGSANMQQQGNVNHAPRLFNSNNDQ----------------SSQGHFQRNMDGGNQRASSSNALENMMKEFMT

Query:  SQKEYMAKNDAKNEPTPTQDDP---CKNATTEGHSQPPE--------EKATQEKGSEKKEAHAE---------KELDTYKHALLFPQRLRKKNDEAQFKK
        S    +      N P+ T+ +P   CK  T     Q  +        +K   E+ ++K     E           ++ Y   + FPQRL++   + QF+K
Subjt:  SQKEYMAKNDAKNEPTPTQDDP---CKNATTEGHSQPPE--------EKATQEKGSEKKEAHAE---------KELDTYKHALLFPQRLRKKNDEAQFKK

Query:  FLNVLRELHINVPLVEALEKMPSYAKFLKDILMNKRPWKEFETVALTSECSAILANKLPPKLKDPGSFTIPCSIGGIDVGRALCDLGASINLMPLSIYKR
        FL V R+LHIN+P  +AL ++P+Y KFLK+I+  KR  ++FET+ALT ECSAI+ NKLPPKL+DPGSF+IPC+IG +D  RALCDLGAS++LMPLS+ ++
Subjt:  FLNVLRELHINVPLVEALEKMPSYAKFLKDILMNKRPWKEFETVALTSECSAILANKLPPKLKDPGSFTIPCSIGGIDVGRALCDLGASINLMPLSIYKR

Query:  LGIGEARPTTITLQLADRSVVHPEGKIEDVLVKVDKFIFPADFIILDYETDKEIPIILGRPFLATRQTLIDVQKGELTMRMNDQKVVFNILSSLKYLDDP
        LG+ E +PTTI+LQLADRSV +P G +E+VL+KV KFI P DFI+L+ E D EIPIILGRPFLAT   +IDV+ G LT+++ +++V FN+  + KY   P
Subjt:  LGIGEARPTTITLQLADRSVVHPEGKIEDVLVKVDKFIFPADFIILDYETDKEIPIILGRPFLATRQTLIDVQKGELTMRMNDQKVVFNILSSLKYLDDP

Query:  KECNMVARIDVC--YPQEIL--SNAVSPTE------GGNQEDQERGNRIL--------PPHEVVTHMLDL-EGRNPTLPSITQPPALELKTLPSHLKYRF
           + V R+DV     +E     N   P E      G +++D     ++          P +   +  D+ +G+ P  PS  Q P LELK LPSHL Y F
Subjt:  KECNMVARIDVC--YPQEIL--SNAVSPTE------GGNQEDQERGNRIL--------PPHEVVTHMLDL-EGRNPTLPSITQPPALELKTLPSHLKYRF

Query:  LGEKESLPVIISSKLDSSQEHLLMEVLARHKKAIGWSLADIKGISPSYCMHKIRLIDESNEFVERQRRLNPAMMEVVQKEIIKWLDAGVIYLIADSRWES
        LGE  +LPVI+S  L   Q   L+ +L   KKAIGW+++D++GISPS CMH+I + D     VE QRRLNP M EVV+ E++KWLDAG+IY I+DS W S
Subjt:  LGEKESLPVIISSKLDSSQEHLLMEVLARHKKAIGWSLADIKGISPSYCMHKIRLIDESNEFVERQRRLNPAMMEVVQKEIIKWLDAGVIYLIADSRWES

Query:  PIQCVPKKGGMTVVANANNELIPSRPTT---------------------------------GQAYYCFLDGYSGYNQITIAPDDQEKTIFTCPYGTFAFR
        P+Q VPKKGGMTVV N NNELIP+R  T                                 G AYYCFLDGYSGYNQI+I+P+DQEKT FTCPYGTFAFR
Subjt:  PIQCVPKKGGMTVVANANNELIPSRPTT---------------------------------GQAYYCFLDGYSGYNQITIAPDDQEKTIFTCPYGTFAFR

Query:  RMPFGLCNAPGTFQRCMMPIFSELLEKSVEVFMDDFSVYGKSFADCLTQLEQVLERCEETHLVLNWEKCHFMVEEGIVLGHRISSAGIEVDKAKIDVIEK
        RMPFGLCNAP TFQRCMM IFS+ +EK +EVFMDDFSV+G SF  CL  L +VL+RCEET+LVLNWEKCHFMV+EGIVLGH+IS+ G+EVD+AKI++IEK
Subjt:  RMPFGLCNAPGTFQRCMMPIFSELLEKSVEVFMDDFSVYGKSFADCLTQLEQVLERCEETHLVLNWEKCHFMVEEGIVLGHRISSAGIEVDKAKIDVIEK

Query:  LLPPTSVKGIRSFLGHAGFYRRHLNPSSLLLLSIAAILS
        L PPT+VKG+RSFLGH GFYRR +   S +   +  +L+
Subjt:  LLPPTSVKGIRSFLGHAGFYRRHLNPSSLLLLSIAAILS

XP_038973683.1 uncharacterized protein LOC120105384 [Phoenix dactylifera]1.5e-27446.01Show/hide
Query:  MADDR----ARGIRDYAATANCNFNPGIVQPTLETERFELKPVMFQMFTAAGQFGGSATEDPHSHLKSFLDICNKFIIPGVTPEQLQVILFPYYLRDAAK
        MAD++     R + DYA        P IV+PT+    FE+KP + QM     QFGG  +EDPH+HL +FL+IC+   + GV+ + +++ LFP+ L+D AK
Subjt:  MADDR----ARGIRDYAATANCNFNPGIVQPTLETERFELKPVMFQMFTAAGQFGGSATEDPHSHLKSFLDICNKFIIPGVTPEQLQVILFPYYLRDAAK

Query:  VWLNQQPANSIATWNVLAEKFLEKYFPPTTNSRLRKEIMTFTQLDDETLSEAWERFKGLLQTCPHHGLPLCIQLETFYVGLNKNSQVLVDSSANGALLRK
         WLN +  NS  TWN L++ FL KYFPP   ++LR +I +F Q D E+L EAWERFK L + CPHHGLP  + ++TFY GL  + ++ +D++A G L+ K
Subjt:  VWLNQQPANSIATWNVLAEKFLEKYFPPTTNSRLRKEIMTFTQLDDETLSEAWERFKGLLQTCPHHGLPLCIQLETFYVGLNKNSQVLVDSSANGALLRK

Query:  TYDEAHEILDRIARNNYEWGTAEDRRRRPIKTNSGSFEVDHMTSVNAKIVALTL---KMDALTARTAPLIAQLNAFGCEICGEGHTNDQCPSNPESVLYM
        + +EA+E+L+ +A NNY+W    + R  P K   G ++VD +  +NAK+ +L     K+  + + ++P+++      C+ CG  H +  C        Y 
Subjt:  TYDEAHEILDRIARNNYEWGTAEDRRRRPIKTNSGSFEVDHMTSVNAKIVALTL---KMDALTARTAPLIAQLNAFGCEICGEGHTNDQCPSNPESVLYM

Query:  GSSGATRKNPYSSTYNPGWRNHPNFSWSGSANMQQQGNVNHAPRLFNSNNDQ----------------SSQGHFQRNMDGGNQRASSSNALENMMKEFMT
         +    + NPYS+TYNPGWRNHPNFSW    N        H P  F     Q                +S   F+R     +Q ASS+  +E  + +   
Subjt:  GSSGATRKNPYSSTYNPGWRNHPNFSWSGSANMQQQGNVNHAPRLFNSNNDQ----------------SSQGHFQRNMDGGNQRASSSNALENMMKEFMT

Query:  SQKEYMAKNDAKNEPTPTQDDP---CKNATTEGHSQPPE--------EKATQEKGSEKKEAHAE---------KELDTYKHALLFPQRLRKKNDEAQFKK
        S    +      N P+ T+ +P   CK  T     Q  +        +K   E+ ++K     E           ++ Y   + FPQRL++   + QF+K
Subjt:  SQKEYMAKNDAKNEPTPTQDDP---CKNATTEGHSQPPE--------EKATQEKGSEKKEAHAE---------KELDTYKHALLFPQRLRKKNDEAQFKK

Query:  FLNVLRELHINVPLVEALEKMPSYAKFLKDILMNKRPWKEFETVALTSECSAILANKLPPKLKDPGSFTIPCSIGGIDVGRALCDLGASINLMPLSIYKR
        FL V R+LHIN+P  +AL ++P+Y KFLK+I+  KR  ++FET+ALT ECSAI+ NKLPPKL+DPGSF+IPC+IG +D  RALCDLGAS++LMPLS+ ++
Subjt:  FLNVLRELHINVPLVEALEKMPSYAKFLKDILMNKRPWKEFETVALTSECSAILANKLPPKLKDPGSFTIPCSIGGIDVGRALCDLGASINLMPLSIYKR

Query:  LGIGEARPTTITLQLADRSVVHPEGKIEDVLVKVDKFIFPADFIILDYETDKEIPIILGRPFLATRQTLIDVQKGELTMRMNDQKVVFNILSSLKYLDDP
        LG+ E +PTTI+LQLADRSV +P G +E+VL+KV KFI P DFI+L+ E D EIPIILGRPFLAT   +IDV+ G LT+++ +++V FN+  + KY   P
Subjt:  LGIGEARPTTITLQLADRSVVHPEGKIEDVLVKVDKFIFPADFIILDYETDKEIPIILGRPFLATRQTLIDVQKGELTMRMNDQKVVFNILSSLKYLDDP

Query:  KECNMVARIDVC--YPQEIL--SNAVSPTE------GGNQEDQERGNRIL--------PPHEVVTHMLDL-EGRNPTLPSITQPPALELKTLPSHLKYRF
           + V R+DV     +E     N   P E      G +++D     ++          P +   +  D+ +G+ P  PS  Q P LELK LPSHL Y F
Subjt:  KECNMVARIDVC--YPQEIL--SNAVSPTE------GGNQEDQERGNRIL--------PPHEVVTHMLDL-EGRNPTLPSITQPPALELKTLPSHLKYRF

Query:  LGEKESLPVIISSKLDSSQEHLLMEVLARHKKAIGWSLADIKGISPSYCMHKIRLIDESNEFVERQRRLNPAMMEVVQKEIIKWLDAGVIYLIADSRWES
        LGE  +LPVI+S  L   Q   L+ +L   KKAIGW+++D++GISPS CMH+I + D     VE QRRLNP M EVV+ E++KWLDAG+IY I+DS W S
Subjt:  LGEKESLPVIISSKLDSSQEHLLMEVLARHKKAIGWSLADIKGISPSYCMHKIRLIDESNEFVERQRRLNPAMMEVVQKEIIKWLDAGVIYLIADSRWES

Query:  PIQCVPKKGGMTVVANANNELIPSRPTT---------------------------------GQAYYCFLDGYSGYNQITIAPDDQEKTIFTCPYGTFAFR
        P+Q VPKKGGMTVV N NNELIP+R  T                                 G AYYCFLDGYSGYNQI+I+P+DQEKT FTCPYGTFAFR
Subjt:  PIQCVPKKGGMTVVANANNELIPSRPTT---------------------------------GQAYYCFLDGYSGYNQITIAPDDQEKTIFTCPYGTFAFR

Query:  RMPFGLCNAPGTFQRCMMPIFSELLEKSVEVFMDDFSVYGKSFADCLTQLEQVLERCEETHLVLNWEKCHFMVEEGIVLGHRISSAGIEVDKAKIDVIEK
        RMPFGLCNAP TFQRCMM IFS+ +EK +EVFMDDFSV+G SF  CL  L +VL+RCEET+LVLNWEKCHFMV+EGIVLGH+IS+ G+EVD+AKI++IEK
Subjt:  RMPFGLCNAPGTFQRCMMPIFSELLEKSVEVFMDDFSVYGKSFADCLTQLEQVLERCEETHLVLNWEKCHFMVEEGIVLGHRISSAGIEVDKAKIDVIEK

Query:  LLPPTSVKGIRSFLGHAGFYRRHLNPSSLLLLSIAAILS
        L PPT+VKG+RSFLGH GFYRR +   S +   +  +L+
Subjt:  LLPPTSVKGIRSFLGHAGFYRRHLNPSSLLLLSIAAILS

XP_038976300.1 uncharacterized protein LOC120107204 [Phoenix dactylifera]1.1e-27445.92Show/hide
Query:  MADDR----ARGIRDYAATANCNFNPGIVQPTLETERFELKPVMFQMFTAAGQFGGSATEDPHSHLKSFLDICNKFIIPGVTPEQLQVILFPYYLRDAAK
        MAD++     R + DYA        P IV+PT+    FE+KP + QM     QFGG  +EDPH+HL +FL+IC+   + GV+ + +++ LFP+ L+D AK
Subjt:  MADDR----ARGIRDYAATANCNFNPGIVQPTLETERFELKPVMFQMFTAAGQFGGSATEDPHSHLKSFLDICNKFIIPGVTPEQLQVILFPYYLRDAAK

Query:  VWLNQQPANSIATWNVLAEKFLEKYFPPTTNSRLRKEIMTFTQLDDETLSEAWERFKGLLQTCPHHGLPLCIQLETFYVGLNKNSQVLVDSSANGALLRK
         WLN +  NS  TWN L++ FL KYFPP   ++LR +I +F Q D E+L EAWERFK L + CPHHGLP  + ++TFY GL  + ++ +D++A G L+ K
Subjt:  VWLNQQPANSIATWNVLAEKFLEKYFPPTTNSRLRKEIMTFTQLDDETLSEAWERFKGLLQTCPHHGLPLCIQLETFYVGLNKNSQVLVDSSANGALLRK

Query:  TYDEAHEILDRIARNNYEWGTAEDRRRRPIKTNSGSFEVDHMTSVNAKI---VALTLKMDALTARTAPLIAQLNAFGCEICGEGHTNDQCPSNPESVLYM
        + +EA+E+L+ +A NNY+W    + R  P K   G ++VD +  +NAK+   V +  K+  + + ++P+++      C+ CG  H +  C        Y 
Subjt:  TYDEAHEILDRIARNNYEWGTAEDRRRRPIKTNSGSFEVDHMTSVNAKI---VALTLKMDALTARTAPLIAQLNAFGCEICGEGHTNDQCPSNPESVLYM

Query:  GSSGATRKNPYSSTYNPGWRNHPNFSWSGSANMQQQGNVNHAPRLFNSNNDQ----------------SSQGHFQRNMDGGNQRASSSNALENMMKEFMT
         +    + NPYS+TYNPGWRNHPNFSW    N        H P  F     Q                +S   F+R     +Q ASS+  +E  + +   
Subjt:  GSSGATRKNPYSSTYNPGWRNHPNFSWSGSANMQQQGNVNHAPRLFNSNNDQ----------------SSQGHFQRNMDGGNQRASSSNALENMMKEFMT

Query:  SQKEYMAKNDAKNEPTPTQDDP---CKNATTEGHSQPPE--------EKATQEKGSEKKEAHAE---------KELDTYKHALLFPQRLRKKNDEAQFKK
        S    +      N P+ T+ +P   CK  T     Q  +        +K   E+ ++K     E           ++ Y   + FPQRL++   + QF+K
Subjt:  SQKEYMAKNDAKNEPTPTQDDP---CKNATTEGHSQPPE--------EKATQEKGSEKKEAHAE---------KELDTYKHALLFPQRLRKKNDEAQFKK

Query:  FLNVLRELHINVPLVEALEKMPSYAKFLKDILMNKRPWKEFETVALTSECSAILANKLPPKLKDPGSFTIPCSIGGIDVGRALCDLGASINLMPLSIYKR
        FL V R+LHIN+P  +AL ++P+Y KFLK+I+  KR  ++FET+ALT ECSAI+ NKLPPKL+DPGSF+IPC+IG +D  RALCDLGAS++LMPLS+ ++
Subjt:  FLNVLRELHINVPLVEALEKMPSYAKFLKDILMNKRPWKEFETVALTSECSAILANKLPPKLKDPGSFTIPCSIGGIDVGRALCDLGASINLMPLSIYKR

Query:  LGIGEARPTTITLQLADRSVVHPEGKIEDVLVKVDKFIFPADFIILDYETDKEIPIILGRPFLATRQTLIDVQKGELTMRMNDQKVVFNILSSLKYLDDP
        LG+ E +PTTI+LQLADRSV +P G +E+VL+KV KFI P DFI+L+ E D EIPIILGRPFLAT   +IDV+ G LT+++ +++V FN+  + KY   P
Subjt:  LGIGEARPTTITLQLADRSVVHPEGKIEDVLVKVDKFIFPADFIILDYETDKEIPIILGRPFLATRQTLIDVQKGELTMRMNDQKVVFNILSSLKYLDDP

Query:  KECNMVARIDVC--YPQEIL--SNAVSPTE------GGNQEDQERGNRIL--------PPHEVVTHMLDL-EGRNPTLPSITQPPALELKTLPSHLKYRF
           + V R+DV     +E     N   P E      G +++D     ++          P +   +  D+ +G+ P  PS  Q P LELK LPSHL Y F
Subjt:  KECNMVARIDVC--YPQEIL--SNAVSPTE------GGNQEDQERGNRIL--------PPHEVVTHMLDL-EGRNPTLPSITQPPALELKTLPSHLKYRF

Query:  LGEKESLPVIISSKLDSSQEHLLMEVLARHKKAIGWSLADIKGISPSYCMHKIRLIDESNEFVERQRRLNPAMMEVVQKEIIKWLDAGVIYLIADSRWES
        LGE  +LPVI+S  L + Q   L+ +L   KKAIGW+++D++GISPS CMH+I + D     VE QRRLNP M EVV+ E++KWLDAG+IY I+DS W S
Subjt:  LGEKESLPVIISSKLDSSQEHLLMEVLARHKKAIGWSLADIKGISPSYCMHKIRLIDESNEFVERQRRLNPAMMEVVQKEIIKWLDAGVIYLIADSRWES

Query:  PIQCVPKKGGMTVVANANNELIPSRPTT---------------------------------GQAYYCFLDGYSGYNQITIAPDDQEKTIFTCPYGTFAFR
        P+Q VPKKGGMTVV N NNELIP+R  T                                 G AYYCFLDGYSGYNQI+I+P+DQEKT FTCPYGTFAFR
Subjt:  PIQCVPKKGGMTVVANANNELIPSRPTT---------------------------------GQAYYCFLDGYSGYNQITIAPDDQEKTIFTCPYGTFAFR

Query:  RMPFGLCNAPGTFQRCMMPIFSELLEKSVEVFMDDFSVYGKSFADCLTQLEQVLERCEETHLVLNWEKCHFMVEEGIVLGHRISSAGIEVDKAKIDVIEK
        RMPFGLCNAP TFQRCMM IFS+ +EK +EVFMDDFSV+G SF  CL  L +VL+RCEET+LVLNWEKCHFMV+EGI+LGH+IS+ G+EVD+AKI++IEK
Subjt:  RMPFGLCNAPGTFQRCMMPIFSELLEKSVEVFMDDFSVYGKSFADCLTQLEQVLERCEETHLVLNWEKCHFMVEEGIVLGHRISSAGIEVDKAKIDVIEK

Query:  LLPPTSVKGIRSFLGHAGFYRRHLNPSSLLLLSIAAILS
        L PPT+VKG+RSFLGH GFYRR +   S +   +  +L+
Subjt:  LLPPTSVKGIRSFLGHAGFYRRHLNPSSLLLLSIAAILS

XP_038976409.1 uncharacterized protein LOC113461320 [Phoenix dactylifera]1.1e-27445.96Show/hide
Query:  MADDR----ARGIRDYAATANCNFNPGIVQPTLETERFELKPVMFQMFTAAGQFGGSATEDPHSHLKSFLDICNKFIIPGVTPEQLQVILFPYYLRDAAK
        MAD++     R + DYA        P IV+PT+    FE+KP + QM     QFGG  +EDPH+HL +FL+IC+   + GV+ + +++ LFP+ L+D AK
Subjt:  MADDR----ARGIRDYAATANCNFNPGIVQPTLETERFELKPVMFQMFTAAGQFGGSATEDPHSHLKSFLDICNKFIIPGVTPEQLQVILFPYYLRDAAK

Query:  VWLNQQPANSIATWNVLAEKFLEKYFPPTTNSRLRKEIMTFTQLDDETLSEAWERFKGLLQTCPHHGLPLCIQLETFYVGLNKNSQVLVDSSANGALLRK
         WLN +  NS  TWN L++ FL KYFPP   ++LR +I +F Q D E+L EAWERFK L + CPHHGLP  + ++TFY GL  + ++ +D++A G L+ K
Subjt:  VWLNQQPANSIATWNVLAEKFLEKYFPPTTNSRLRKEIMTFTQLDDETLSEAWERFKGLLQTCPHHGLPLCIQLETFYVGLNKNSQVLVDSSANGALLRK

Query:  TYDEAHEILDRIARNNYEWGTAEDRRRRPIKTNSGSFEVDHMTSVNAKIVALTL---KMDALTARTAPLIAQLNAFGCEICGEGHTNDQCPSNPESVLYM
        + +EA+E+L+ +A NNY+W    + R  P K   G ++VD +  +NAK+ +L     K+  + + ++P+++      C+ CG  H +    S+   V ++
Subjt:  TYDEAHEILDRIARNNYEWGTAEDRRRRPIKTNSGSFEVDHMTSVNAKIVALTL---KMDALTARTAPLIAQLNAFGCEICGEGHTNDQCPSNPESVLYM

Query:  GSSGATRK--NPYSSTYNPGWRNHPNFSWSGSANMQQQGNVNH----APRLFNSNNDQS-----------SQGHFQRNMDGGNQRASSSNALENMMKEFM
         +    ++  NPYS+TYNPGWRNHPNFSW    N        H     P+L    + QS           S   F+R     +Q ASS+  +E  + +  
Subjt:  GSSGATRK--NPYSSTYNPGWRNHPNFSWSGSANMQQQGNVNH----APRLFNSNNDQS-----------SQGHFQRNMDGGNQRASSSNALENMMKEFM

Query:  TSQKEYMAKNDAKNEPTPTQDDP---CKNATTEGHSQPPE--------EKATQEKGSEKKEAHAE---------KELDTYKHALLFPQRLRKKNDEAQFK
         S    +      N P+ T+ +P   CK  T     Q  +        +K   E+ ++K     E           ++ Y   + FPQRL++   + QF+
Subjt:  TSQKEYMAKNDAKNEPTPTQDDP---CKNATTEGHSQPPE--------EKATQEKGSEKKEAHAE---------KELDTYKHALLFPQRLRKKNDEAQFK

Query:  KFLNVLRELHINVPLVEALEKMPSYAKFLKDILMNKRPWKEFETVALTSECSAILANKLPPKLKDPGSFTIPCSIGGIDVGRALCDLGASINLMPLSIYK
        KFL V R+LHIN+P  +AL ++P+Y KFLK+I+  KR  ++FET+ALT ECSAI+ NKLPPKL+DPGSF+IPC+IG +D  RALCDLGAS++LMPLS+ +
Subjt:  KFLNVLRELHINVPLVEALEKMPSYAKFLKDILMNKRPWKEFETVALTSECSAILANKLPPKLKDPGSFTIPCSIGGIDVGRALCDLGASINLMPLSIYK

Query:  RLGIGEARPTTITLQLADRSVVHPEGKIEDVLVKVDKFIFPADFIILDYETDKEIPIILGRPFLATRQTLIDVQKGELTMRMNDQKVVFNILSSLKYLDD
        +LG+ E +PTTI+LQLADRSV +P G +E+VL+KV KFI P DFI+L+ E D EIPIILGRPFLAT   +IDV+ G LT+++ +++V FN+  + KY   
Subjt:  RLGIGEARPTTITLQLADRSVVHPEGKIEDVLVKVDKFIFPADFIILDYETDKEIPIILGRPFLATRQTLIDVQKGELTMRMNDQKVVFNILSSLKYLDD

Query:  PKECNMVARIDVC--YPQEIL--SNAVSPTE------GGNQEDQERGNRIL--------PPHEVVTHMLDL-EGRNPTLPSITQPPALELKTLPSHLKYR
        P   + V R+DV     +E     N   P E      G +++D     ++          P +   +  D+ +G+ P  PS  Q P LELK LPSHL Y 
Subjt:  PKECNMVARIDVC--YPQEIL--SNAVSPTE------GGNQEDQERGNRIL--------PPHEVVTHMLDL-EGRNPTLPSITQPPALELKTLPSHLKYR

Query:  FLGEKESLPVIISSKLDSSQEHLLMEVLARHKKAIGWSLADIKGISPSYCMHKIRLIDESNEFVERQRRLNPAMMEVVQKEIIKWLDAGVIYLIADSRWE
        FLGE  +LPVI+S  L   Q   L+ +L   KKAIGW+++D++GISPS CMH+I + D     VE QRRLNP M EVV+ E++KWLDAG+IY I+DS W 
Subjt:  FLGEKESLPVIISSKLDSSQEHLLMEVLARHKKAIGWSLADIKGISPSYCMHKIRLIDESNEFVERQRRLNPAMMEVVQKEIIKWLDAGVIYLIADSRWE

Query:  SPIQCVPKKGGMTVVANANNELIPSRPTT---------------------------------GQAYYCFLDGYSGYNQITIAPDDQEKTIFTCPYGTFAF
        SP+Q VPKKGGMTVV N NNELIP+R  T                                 G AYYCFLDGYSGYNQI+I+P+DQEKT FTCPYGTFAF
Subjt:  SPIQCVPKKGGMTVVANANNELIPSRPTT---------------------------------GQAYYCFLDGYSGYNQITIAPDDQEKTIFTCPYGTFAF

Query:  RRMPFGLCNAPGTFQRCMMPIFSELLEKSVEVFMDDFSVYGKSFADCLTQLEQVLERCEETHLVLNWEKCHFMVEEGIVLGHRISSAGIEVDKAKIDVIE
        RRMPFGLCNAP TFQRCMM IFS+ +EK +E+FMDDFSV+G SF  CL  L +VL+RCEET+LVLNWEKCHFMV+EGIVLGH+IS+ G+EVD+AKI++IE
Subjt:  RRMPFGLCNAPGTFQRCMMPIFSELLEKSVEVFMDDFSVYGKSFADCLTQLEQVLERCEETHLVLNWEKCHFMVEEGIVLGHRISSAGIEVDKAKIDVIE

Query:  KLLPPTSVKGIRSFLGHAGFYRRHLNPSSLLLLSIAAILS
        KL PPT+VKG+RSFLGH GFYRR +   S +   +  +L+
Subjt:  KLLPPTSVKGIRSFLGHAGFYRRHLNPSSLLLLSIAAILS

TrEMBL top hitse value%identityAlignment
A0A2G9HH15 Reverse transcriptase3.3e-26446.69Show/hide
Query:  DPEIDRTYHIRRRNRRPRRQPGEMAAPNQPNRPLNPIQMADD-RARGIRDYAATANCNFNPGIVQPTLETERFELKPVMFQMFTAAGQFGGSATEDPHSH
        D EI+RT+  RRR      +       NQ       I MAD+     +R+ A          +V P L     +++  M +M     QF G + E+P+ H
Subjt:  DPEIDRTYHIRRRNRRPRRQPGEMAAPNQPNRPLNPIQMADD-RARGIRDYAATANCNFNPGIVQPTLETERFELKPVMFQMFTAAGQFGGSATEDPHSH

Query:  LKSFLDICNKFIIPGVTPEQLQVILFPYYLRDAAKVWLNQQPANSIATWNVLAEKFLEKYFPPTTNSRLRKEIMTFTQLDDETLSEAWERFKGLLQTCPH
        + +FL IC+     GV+ + L++ LF + L   A  W    P +SI TW  L E+F+ K+F P   + LR EIMTF Q   ET+ EAW RF+ +L+ CP+
Subjt:  LKSFLDICNKFIIPGVTPEQLQVILFPYYLRDAAKVWLNQQPANSIATWNVLAEKFLEKYFPPTTNSRLRKEIMTFTQLDDETLSEAWERFKGLLQTCPH

Query:  HGLPLCIQLETFYVGLNKNSQVLVDSSANGALLRKTYDEAHEILDRIARNNYEWGTAEDRRRRPIKTNSGSFEVDHMTSVNAKIVALTLKMDALTARTAP
        H +P  IQ+ TFY GL    +  +D     + L  T  E H +L+ +  N+YE    +  R  P K  +G  EVD +T++NAKI  L   M         
Subjt:  HGLPLCIQLETFYVGLNKNSQVLVDSSANGALLRKTYDEAHEILDRIARNNYEWGTAEDRRRRPIKTNSGSFEVDHMTSVNAKIVALTLKMDALTARTAP

Query:  LIAQLNAFGCEICGEGHTNDQCPSNPESVLYMGSSGATRKNPYSSTYNPGWRNHPNFSWSGSANMQQQGNVNHAPRLFNSNNDQSSQGHFQRNMDGGNQR
           Q     CE CGEGH +DQCP + ES+ ++ ++   + NPYS+TYNPGWR HPNFSW                      N+   QG   R   GG Q+
Subjt:  LIAQLNAFGCEICGEGHTNDQCPSNPESVLYMGSSGATRKNPYSSTYNPGWRNHPNFSWSGSANMQQQGNVNHAPRLFNSNNDQSSQGHFQRNMDGGNQR

Query:  ASSSNALENMMKEFMTSQKE----YMAKNDAKNEPTPTQDDPCKNATTEGHSQPPEEKATQEKGSEKKEAHAEKELDTYKHALLFPQRLRKKNDEAQFKK
              ++  M+E   S +E    +MA   A  +   TQ     NA    +S+P     +  + + ++++ A+ +  T ++     Q + K+  +++ K+
Subjt:  ASSSNALENMMKEFMTSQKE----YMAKNDAKNEPTPTQDDPCKNATTEGHSQPPEEKATQEKGSEKKEAHAEKELDTYKHALLFPQRLRKKNDEAQFKK

Query:  FLNVLRELHINVPL-VEALEKMPSYAKFLKDILMNKRPWKEFETVALTSECSAILANKLPPKLKDPGSFTIPCSIGGIDVGRALCDLGASINLMPLSIYK
         ++  +   I  PL V+ALE+MPSY KF+KDIL  KR   ++ETVALT ECSAI+ NKLPPKLKDPGSFTIPC+IG    GRALCDLGASINLMP SIY+
Subjt:  FLNVLRELHINVPL-VEALEKMPSYAKFLKDILMNKRPWKEFETVALTSECSAILANKLPPKLKDPGSFTIPCSIGGIDVGRALCDLGASINLMPLSIYK

Query:  RLGIGEARPTTITLQLADRSVVHPEGKIEDVLVKVDKFIFPADFIILDYETDKEIPIILGRPFLATRQTLIDVQKGELTMRMNDQKVVFNILSSLKYLDD
         LG+GEA+PT+ITLQLADRS+ +P G IED+LVKVDKFIFPADF++LD E D E+PIILGRPFLAT +TLIDVQKGELTMR+ DQ++ FN+  ++K+ ++
Subjt:  RLGIGEARPTTITLQLADRSVVHPEGKIEDVLVKVDKFIFPADFIILDYETDKEIPIILGRPFLATRQTLIDVQKGELTMRMNDQKVVFNILSSLKYLDD

Query:  PKECNMVARID-VCYPQEILSNAVSPTEGG--------NQEDQERGNRILPPHEVVTHMLD-LEGRNPT---LPSITQPPALELKTLPSHLKYRFLGEKE
          EC  V+  D +   + I    + P E          N+ED E    +       +  ++ LE   P+    PSI +PP LELK LPSHL Y +LGE +
Subjt:  PKECNMVARID-VCYPQEILSNAVSPTEGG--------NQEDQERGNRILPPHEVVTHMLD-LEGRNPT---LPSITQPPALELKTLPSHLKYRFLGEKE

Query:  SLPVIISSKLDSSQEHLLMEVLARHKKAIGWSLADIKGISPSYCMHKIRLIDESNEFVERQRRLNPAMMEVVQKEIIKWLDAGVIYLIADSRWESPIQCV
        +LPVIISS L   Q   L+ VL  HK AIGW++ADIKGISPS+CMHKI L D     VE QRRLNP M EVV+KEIIKWLDAG+IY I+DS W SP+QCV
Subjt:  SLPVIISSKLDSSQEHLLMEVLARHKKAIGWSLADIKGISPSYCMHKIRLIDESNEFVERQRRLNPAMMEVVQKEIIKWLDAGVIYLIADSRWESPIQCV

Query:  PKKGGMTVVANANNELIPSRPTT---------------------------------GQAYYCFLDGYSGYNQITIAPDDQEKTIFTCPYGTFAFRRMPFG
        PKKGG+TVV N +NELIP+R  T                                 G+ +YCFLDGYSGYNQI I P+DQEKT FTCPYGTF FR+MPFG
Subjt:  PKKGGMTVVANANNELIPSRPTT---------------------------------GQAYYCFLDGYSGYNQITIAPDDQEKTIFTCPYGTFAFRRMPFG

Query:  LCNAPGTFQRCMMPIFSELLEKSVEVFMDDFSVYGKSFADCLTQLEQVLERCEETHLVLNWEKCHFMVEEGIVLGHRISSAGIEVDKAKIDVIEKLLPPT
        LCNAP TFQRCMM IF++++E  +EVFMDDFSVYG SF +CL  L  VL+RCE+T+L+LNWEKCHFMV+EGIVL H++S+ GIEVDKAK++ IEKL PPT
Subjt:  LCNAPGTFQRCMMPIFSELLEKSVEVFMDDFSVYGKSFADCLTQLEQVLERCEETHLVLNWEKCHFMVEEGIVLGHRISSAGIEVDKAKIDVIEKLLPPT

Query:  SVKGIRSFLGHAGFYRRHLNPSSLLLLSIAAIL
        SVKG+RSFLGHAGFYRR +   S +   +  +L
Subjt:  SVKGIRSFLGHAGFYRRHLNPSSLLLLSIAAIL

A0A2G9HWF8 Reverse transcriptase1.0e-26546.82Show/hide
Query:  DPEIDRTYHIRRRNRRPRRQPGEMAAPNQPNRPLNPIQMADD-RARGIRDYAATANCNFNPGIVQPTLETERFELKPVMFQMFTAAGQFGGSATEDPHSH
        DPEI+RT+  RRR      +       NQ       I MAD+     +R+ A          +V P L     +++  M QM     QF G + E+P+ H
Subjt:  DPEIDRTYHIRRRNRRPRRQPGEMAAPNQPNRPLNPIQMADD-RARGIRDYAATANCNFNPGIVQPTLETERFELKPVMFQMFTAAGQFGGSATEDPHSH

Query:  LKSFLDICNKFIIPGVTPEQLQVILFPYYLRDAAKVWLNQQPANSIATWNVLAEKFLEKYFPPTTNSRLRKEIMTFTQLDDETLSEAWERFKGLLQTCPH
        + +FL IC+     GV+ + L++ LF + L   A  W    P +SI TW  L E+F+ K+F P   + LR EIMTF Q   ET+ EAW RF+ +L+ CP+
Subjt:  LKSFLDICNKFIIPGVTPEQLQVILFPYYLRDAAKVWLNQQPANSIATWNVLAEKFLEKYFPPTTNSRLRKEIMTFTQLDDETLSEAWERFKGLLQTCPH

Query:  HGLPLCIQLETFYVGLNKNSQVLVDSSANGALLRKTYDEAHEILDRIARNNYEWGTAEDRRRRPIKTNSGSFEVDHMTSVNAKIVALTLKMDALTARTAP
        H +P  IQ+ TFY GL +  +  +D     + L  T  E H +L+ +  N+YE    +  R  P K  +G  EVD +T++NAKI  L   M         
Subjt:  HGLPLCIQLETFYVGLNKNSQVLVDSSANGALLRKTYDEAHEILDRIARNNYEWGTAEDRRRRPIKTNSGSFEVDHMTSVNAKIVALTLKMDALTARTAP

Query:  LIAQLNAFGCEICGEGHTNDQCPSNPESVLYMGSSGATRKNPYSSTYNPGWRNHPNFSWSGSANMQQQGNVNHAPRLFNSNNDQSSQGHFQRNMDGGNQR
                    CGEGH +DQCP + ES+ ++ ++   + NPYS+TYNPGWR HPNFSW                   N+N  Q S   FQ+   GG   
Subjt:  LIAQLNAFGCEICGEGHTNDQCPSNPESVLYMGSSGATRKNPYSSTYNPGWRNHPNFSWSGSANMQQQGNVNHAPRLFNSNNDQSSQGHFQRNMDGGNQR

Query:  ASSSNALENMMKEFMTSQKEYMAKNDAKNEPTPTQDDPCKNATTEGHSQPPEEKATQEKGSEKKEAHAEKELDTYKHALL---FPQRLRKKNDEAQFKKF
           +   +   +    + +      +   EPT +++                    +E  SE+KE   E  L+  K   L   FPQRL+K+  + QF KF
Subjt:  ASSSNALENMMKEFMTSQKEYMAKNDAKNEPTPTQDDPCKNATTEGHSQPPEEKATQEKGSEKKEAHAEKELDTYKHALL---FPQRLRKKNDEAQFKKF

Query:  LNVLRELHINVPLVEALEKMPSYAKFLKDILMNKRPWKEFETVALTSECSAILANKLPPKLKDPGSFTIPCSIGGIDVGRALCDLGASINLMPLSIYKRL
        L V ++LHIN+P  EALE+MPSY KF+KDIL  KR   ++ETVALT ECSAI+ NKLPPKLKDPG              RALCDLGASINLMP SIY+ L
Subjt:  LNVLRELHINVPLVEALEKMPSYAKFLKDILMNKRPWKEFETVALTSECSAILANKLPPKLKDPGSFTIPCSIGGIDVGRALCDLGASINLMPLSIYKRL

Query:  GIGEARPTTITLQLADRSVVHPEGKIEDVLVKVDKFIFPADFIILDYETDKEIPIILGRPFLATRQTLIDVQKGELTMRMNDQKVVFNILSSLKYLDDPK
        G+ EA+PT+ITLQLADRS+ +P+G IED+LVKVDKFIFPADF++LD E D E+PIILGRPFLAT +TLIDVQKGELTMR+ DQ++ FN+  ++K+ ++  
Subjt:  GIGEARPTTITLQLADRSVVHPEGKIEDVLVKVDKFIFPADFIILDYETDKEIPIILGRPFLATRQTLIDVQKGELTMRMNDQKVVFNILSSLKYLDDPK

Query:  ECNMVARID-VCYPQEILSNAVSPTEGG----NQEDQERGNRILPPHEVVTH-----MLDLEGRNPT---LPSITQPPALELKTLPSHLKYRFLGEKESL
        EC  V+  D +   + I    + P E        ED E    ++   +   +     +  LE   P+    PSI +PP LELK LPSHL Y +LGE ++L
Subjt:  ECNMVARID-VCYPQEILSNAVSPTEGG----NQEDQERGNRILPPHEVVTH-----MLDLEGRNPT---LPSITQPPALELKTLPSHLKYRFLGEKESL

Query:  PVIISSKLDSSQEHLLMEVLARHKKAIGWSLADIKGISPSYCMHKIRLIDESNEFVERQRRLNPAMMEVVQKEIIKWLDAGVIYLIADSRWESPIQCVPK
        PVIISS L   Q   L+ VL  HK AIGW++ADIKGISPS+CMHKI L D+    VE QRRLNP M EVV+KEIIKWLDAG+IY I+D  W SP+QCVPK
Subjt:  PVIISSKLDSSQEHLLMEVLARHKKAIGWSLADIKGISPSYCMHKIRLIDESNEFVERQRRLNPAMMEVVQKEIIKWLDAGVIYLIADSRWESPIQCVPK

Query:  KGGMTVVANANNELIPSRPTT---------------------------------GQAYYCFLDGYSGYNQITIAPDDQEKTIFTCPYGTFAFRRMPFGLC
        KGG+TVV N +NE IP++  T                                 G+ +YCFLDGYSGYNQI IAP+DQEKT FTCPYGTFAFRR+PF LC
Subjt:  KGGMTVVANANNELIPSRPTT---------------------------------GQAYYCFLDGYSGYNQITIAPDDQEKTIFTCPYGTFAFRRMPFGLC

Query:  NAPGTFQRCMMPIFSELLEKSVEVFMDDFSVYGKSFADCLTQLEQVLERCEETHLVLNWEKCHFMVEEGIVLGHRISSAGIEVDKAKIDVIEKLLPPTSV
        NAP TFQRCMM IF++++E  +EVFMDDFSVYG SF +CL  L  VL+RCE+T+LVLNWEKCHFMV+EGIVLGH++S+ GIEVDKAK++ IEKL P TSV
Subjt:  NAPGTFQRCMMPIFSELLEKSVEVFMDDFSVYGKSFADCLTQLEQVLERCEETHLVLNWEKCHFMVEEGIVLGHRISSAGIEVDKAKIDVIEKLLPPTSV

Query:  KGIRSFLGHAGFYRRHL
        KG+RSFLGHAGFYRR +
Subjt:  KGIRSFLGHAGFYRRHL

A0A6P8CBX2 Reverse transcriptase2.5e-25945.62Show/hide
Query:  AGQFGGSATEDPHSHLKSFLDICNKFIIPGVTPEQLQVILFPYYLRDAAKVWLNQQPANSIATWNVLAEKFLEKYFPPTTNSRLRKEIMTFTQLDDETLS
        + QFGG   E P  H+  FL  CN   +  VT + +++ LFP+ LRD A+ W N  P  SI TW  L+ KFL ++FPP   +RLR EI  FT+ + E+L 
Subjt:  AGQFGGSATEDPHSHLKSFLDICNKFIIPGVTPEQLQVILFPYYLRDAAKVWLNQQPANSIATWNVLAEKFLEKYFPPTTNSRLRKEIMTFTQLDDETLS

Query:  EAWERFKGLLQTCPHHGLPLCIQLETFYVGLNKNSQVLVDSSANGALLRKTYDEAHEILDRIARNNYEWGTAEDRRRRPIKTNSGSFEVDHMTSVNAKIV
        EAWERFK  ++ CPHHGLP  + +E FY+ L+   + LVD++A GAL+ K YDEA  +++ +A + + W   E  + R    N    ++D + ++  +I 
Subjt:  EAWERFKGLLQTCPHHGLPLCIQLETFYVGLNKNSQVLVDSSANGALLRKTYDEAHEILDRIARNNYEWGTAEDRRRRPIKTNSGSFEVDHMTSVNAKIV

Query:  ALTLKMDALTARTAPLIAQLNAFGCEICGEGHTNDQCPS-------NPESVLYMGSSGATRKNPYSSTYNPGWRNHPNFSWSGSAN-------MQQQGNV
        ALT ++  LT+  +    Q+ AF CE+C   H+  +C S       N E V ++ +   + + PYS+TYNPGWRNHPNFSW    N        Q+QG  
Subjt:  ALTLKMDALTARTAPLIAQLNAFGCEICGEGHTNDQCPS-------NPESVLYMGSSGATRKNPYSSTYNPGWRNHPNFSWSGSAN-------MQQQGNV

Query:  NHAPRLFNSNNDQSSQGHFQRNMDG--GNQRASSSNALENMMKEFMTSQKEYMAKNDAKNEPTPTQDDP-----------CKNATTEGHSQPPEEKATQE
         +AP   + +  +     + +  D    NQ+A+  N LE  + +      + ++   + + P+ T+++P            +       +Q  EE   ++
Subjt:  NHAPRLFNSNNDQSSQGHFQRNMDG--GNQRASSSNALENMMKEFMTSQKEYMAKNDAKNEPTPTQDDP-----------CKNATTEGHSQPPEEKATQE

Query:  KGSEKKEAHAEKELDT--YKHALLFPQRLRKKNDEAQFKKFLNVLRELHINVPLVEALEKMPSYAKFLKDILMNKRPWKEFETVALTSECSAILAN---K
        KG +K E   +K L    Y   + FP RL+++  +AQF KFL+V ++L IN+P  EAL++MPSYA+F+KD+L  KR +   E V LT ECS IL      
Subjt:  KGSEKKEAHAEKELDT--YKHALLFPQRLRKKNDEAQFKKFLNVLRELHINVPLVEALEKMPSYAKFLKDILMNKRPWKEFETVALTSECSAILAN---K

Query:  LPPKLKDPGSFTIPCSIGGIDVGRALCDLGASINLMPLSIYKRLGIGEARPTTITLQLADRSVVHPEGKIEDVLVKVDKFIFPADFIILDYETDKEIPII
        LP K +D GSFT+PC+IG       L D GASINLMPLSI+++LG+GE + T ITLQLADRS+ +P+G +E+VLVKVDKFIFP DFI+L+ E D+E+P+I
Subjt:  LPPKLKDPGSFTIPCSIGGIDVGRALCDLGASINLMPLSIYKRLGIGEARPTTITLQLADRSVVHPEGKIEDVLVKVDKFIFPADFIILDYETDKEIPII

Query:  LGRPFLATRQTLIDVQKGELTMRMNDQKVVFNILSSLKYLDDPKECNMVARIDVCYPQEI--------LSNAVSPTEGGNQEDQERGNRILPPHEVVTHM
        LGRPFLAT + LIDV++G+LT+R+ ++++ FN+  ++K  DD K C  +  ID    + +        + + +   +  + +D+     +    E+    
Subjt:  LGRPFLATRQTLIDVQKGELTMRMNDQKVVFNILSSLKYLDDPKECNMVARIDVCYPQEI--------LSNAVSPTEGGNQEDQERGNRILPPHEVVTHM

Query:  LDLEGRNPTLP--SITQPPALELKTLPSHLKYRFLGEKESLPVIISSKLDSSQEHLLMEVLARHKKAIGWSLADIKGISPSYCMHKIRLIDESNEFVERQ
         +  G + T P  S+TQ P LELK LPSHLKY +LG  ++LP+IISS L   QE  L+ VL  HK+AIGW++ADIKGISP  C H+I L  E    V+ Q
Subjt:  LDLEGRNPTLP--SITQPPALELKTLPSHLKYRFLGEKESLPVIISSKLDSSQEHLLMEVLARHKKAIGWSLADIKGISPSYCMHKIRLIDESNEFVERQ

Query:  RRLNPAMMEVVQKEIIKWLDAGVIYLIADSRWESPIQCVPKKGGMTVVANANNELIPSRPTT---------------------------------GQAYY
        RRLNP + EVV+KE++K LDAG+IY I+DS+W SP+Q VPKKGGMTVV N  N+LIP+R  T                                 G  YY
Subjt:  RRLNPAMMEVVQKEIIKWLDAGVIYLIADSRWESPIQCVPKKGGMTVVANANNELIPSRPTT---------------------------------GQAYY

Query:  CFLDGYSGYNQITIAPDDQEKTIFTCPYGTFAFRRMPFGLCNAPGTFQRCMMPIFSELLEKSVEVFMDDFSVYGKSFADCLTQLEQVLERCEETHLVLNW
        CFLDGYSGYNQI IAP+DQEKT FTCPYGTFAFRRMPFGLCNAP TFQRCMM IFS++LE  +E+FMDDFSV+GKSF  CLT L  VL+RC+ET+L+LNW
Subjt:  CFLDGYSGYNQITIAPDDQEKTIFTCPYGTFAFRRMPFGLCNAPGTFQRCMMPIFSELLEKSVEVFMDDFSVYGKSFADCLTQLEQVLERCEETHLVLNW

Query:  EKCHFMVEEGIVLGHRISSAGIEVDKAKIDVIEKLLPPTSVKGIRSFLGHAGFYRRHLNPSSLLLLSIAAIL
        EKCHFMV EGIVLGH++S  GIEVD+AK+++IEKL PPTS KG+RSFLGHAGFYRR +   S +   +  +L
Subjt:  EKCHFMVEEGIVLGHRISSAGIEVDKAKIDVIEKLLPPTSVKGIRSFLGHAGFYRRHLNPSSLLLLSIAAIL

A0A6P8DD93 uncharacterized protein LOC1162064531.4e-27044.64Show/hide
Query:  VDPEIDRTYHIRRRNRRPRRQPGEMAAPNQPNRPLNPIQMADD--------RARGIRDYAATANCNFNPGIVQPTLETERFELKPVMFQMFTAAGQFGGS
        +DPEI+RT H  RR  R R +             L  ++MADD         AR +RDYA          I +PT+    FELKP + QM   + QFGG 
Subjt:  VDPEIDRTYHIRRRNRRPRRQPGEMAAPNQPNRPLNPIQMADD--------RARGIRDYAATANCNFNPGIVQPTLETERFELKPVMFQMFTAAGQFGGS

Query:  ATEDPHSHLKSFLDICNKFIIPGVTPEQLQVILFPYYLRDAAKVWLNQQPANSIATWNVLAEKFLEKYFPPTTNSRLRKEIMTFTQLDDETLSEAWERFK
          E P  H+  FL  CN   +  VT + +++ LFP+ LRD A+ W N  P  SI TW  L+ KFL ++FPP   +RLR EI  FT+ + E+L EAWERFK
Subjt:  ATEDPHSHLKSFLDICNKFIIPGVTPEQLQVILFPYYLRDAAKVWLNQQPANSIATWNVLAEKFLEKYFPPTTNSRLRKEIMTFTQLDDETLSEAWERFK

Query:  GLLQTCPHHGLPLCIQLETFYVGLNKNSQVLVDSSANGALLRKTYDEAHEILDRIARNNYEWGTAEDRRRRPIKTNSGSFEVDHMTSVNAKIVALTLKMD
          ++ CPHHGLP  + +E FY+ L+   + LVD++A GAL+ K YDEA  +++ +A + + W   E  + R    N    ++D + ++  +I ALT ++ 
Subjt:  GLLQTCPHHGLPLCIQLETFYVGLNKNSQVLVDSSANGALLRKTYDEAHEILDRIARNNYEWGTAEDRRRRPIKTNSGSFEVDHMTSVNAKIVALTLKMD

Query:  ALTARTAPLIAQLNAFGCEICGEGHTNDQCPS-------NPESVLYMGSSGATRKNPYSSTYNPGWRNHPNFSWSGSAN-------MQQQGNVNHAPRLF
         LT+  +    Q+ AF CE+C   H+  +C S       N E V ++ +   + + PYS+TYNPGWRNHPNFSW    N        Q+QG   +AP   
Subjt:  ALTARTAPLIAQLNAFGCEICGEGHTNDQCPS-------NPESVLYMGSSGATRKNPYSSTYNPGWRNHPNFSWSGSAN-------MQQQGNVNHAPRLF

Query:  NSNNDQSSQGHFQRNMDG--GNQRASSSNALENMMKEFMTSQKEYMAKNDAKNEPTPTQDDP-----------CKNATTEGHSQPPEEKATQEKGSEKKE
        + +  +     + +  D    NQ+A+  N LE  + +      + ++   + + P+ T+++P            +       +Q  EE   ++KG +K E
Subjt:  NSNNDQSSQGHFQRNMDG--GNQRASSSNALENMMKEFMTSQKEYMAKNDAKNEPTPTQDDP-----------CKNATTEGHSQPPEEKATQEKGSEKKE

Query:  AHAEKELDT--YKHALLFPQRLRKKNDEAQFKKFLNVLRELHINVPLVEALEKMPSYAKFLKDILMNKRPWKEFETVALTSECSAILAN---KLPPKLKD
           +K L    Y   + FP+RL+++  +AQF KFL+V ++L IN+P  EAL++MPSYA+F+KD+L  KR +   E V LT ECS IL      LP K +D
Subjt:  AHAEKELDT--YKHALLFPQRLRKKNDEAQFKKFLNVLRELHINVPLVEALEKMPSYAKFLKDILMNKRPWKEFETVALTSECSAILAN---KLPPKLKD

Query:  PGSFTIPCSIGGIDVGRALCDLGASINLMPLSIYKRLGIGEARPTTITLQLADRSVVHPEGKIEDVLVKVDKFIFPADFIILDYETDKEIPIILGRPFLA
         GSFT+PC+IG       L D GASINLMPLSI+++LG+GE + T +TLQLADRS+ +P+G +E+VLVKVDKFIFP DFI+L+ E D+E+P+ILGRPFLA
Subjt:  PGSFTIPCSIGGIDVGRALCDLGASINLMPLSIYKRLGIGEARPTTITLQLADRSVVHPEGKIEDVLVKVDKFIFPADFIILDYETDKEIPIILGRPFLA

Query:  TRQTLIDVQKGELTMRMNDQKVVFNILSSLKYLDDPKECNMVARIDVCYPQEI--------LSNAVSPTEGGNQEDQERGNRILPPHEVVTHMLDLEGRN
        T + LIDV++G+LT+R+ ++++ FN+  ++K  DD K C  +  ID    + +        + + +   +  + +D+     +    E+     +  G +
Subjt:  TRQTLIDVQKGELTMRMNDQKVVFNILSSLKYLDDPKECNMVARIDVCYPQEI--------LSNAVSPTEGGNQEDQERGNRILPPHEVVTHMLDLEGRN

Query:  PTLP--SITQPPALELKTLPSHLKYRFLGEKESLPVIISSKLDSSQEHLLMEVLARHKKAIGWSLADIKGISPSYCMHKIRLIDESNEFVERQRRLNPAM
         T P  S+TQ P LELK LPSHLKY +LG  ++LP+IISS L   QE  L+ VL  HK+AIGW++ADIKGISP  C H+I L  E    V+ QRRLNP +
Subjt:  PTLP--SITQPPALELKTLPSHLKYRFLGEKESLPVIISSKLDSSQEHLLMEVLARHKKAIGWSLADIKGISPSYCMHKIRLIDESNEFVERQRRLNPAM

Query:  MEVVQKEIIKWLDAGVIYLIADSRWESPIQCVPKKGGMTVVANANNELIPSRPTT---------------------------------GQAYYCFLDGYS
         EVV+KE++K LDAG+IY I+DS+W SP+Q VPKKGGMTVV N  N+LIP+R  T                                 G  YYCFLDGYS
Subjt:  MEVVQKEIIKWLDAGVIYLIADSRWESPIQCVPKKGGMTVVANANNELIPSRPTT---------------------------------GQAYYCFLDGYS

Query:  GYNQITIAPDDQEKTIFTCPYGTFAFRRMPFGLCNAPGTFQRCMMPIFSELLEKSVEVFMDDFSVYGKSFADCLTQLEQVLERCEETHLVLNWEKCHFMV
        GYNQI IAP+DQEKT FTCPYGTFAFRRMPFGLCNAP TFQRCMM IFS++LE  +E+FMDDFSV+GKSF  CLT L  VL+RC+ET+L+LNWEKCHFMV
Subjt:  GYNQITIAPDDQEKTIFTCPYGTFAFRRMPFGLCNAPGTFQRCMMPIFSELLEKSVEVFMDDFSVYGKSFADCLTQLEQVLERCEETHLVLNWEKCHFMV

Query:  EEGIVLGHRISSAGIEVDKAKIDVIEKLLPPTSVKGIRSFLGHAGFYRRHLNPSSLLLLSIAAIL
         EGIVLGH++S  GIEVD+AK+++IEKL PPTS KG+RSFLGHAGFYRR +   S +   +  +L
Subjt:  EEGIVLGHRISSAGIEVDKAKIDVIEKLLPPTSVKGIRSFLGHAGFYRRHLNPSSLLLLSIAAIL

A0A6P8DKJ2 uncharacterized protein LOC1162042314.1e-27044.72Show/hide
Query:  VDPEIDRTYHIRRRNRRPRRQPGEMAAPNQPNRPLNPIQMADD--------RARGIRDYAATANCNFNPGIVQPTLETERFELKPVMFQMFTAAGQFGGS
        +DPEI+RT H  RR  R R +             L  ++MADD         AR +RDYA          I +PT+    FELKP + QM   + QFGG 
Subjt:  VDPEIDRTYHIRRRNRRPRRQPGEMAAPNQPNRPLNPIQMADD--------RARGIRDYAATANCNFNPGIVQPTLETERFELKPVMFQMFTAAGQFGGS

Query:  ATEDPHSHLKSFLDICNKFIIPGVTPEQLQVILFPYYLRDAAKVWLNQQPANSIATWNVLAEKFLEKYFPPTTNSRLRKEIMTFTQLDDETLSEAWERFK
          E P  H+  FL  CN   +  VT + +++ LFP+ LRD A+ W N  P  SI TW  L+ KFL ++FPP   +RLR EI  FT+ + E+L EAWERFK
Subjt:  ATEDPHSHLKSFLDICNKFIIPGVTPEQLQVILFPYYLRDAAKVWLNQQPANSIATWNVLAEKFLEKYFPPTTNSRLRKEIMTFTQLDDETLSEAWERFK

Query:  GLLQTCPHHGLPLCIQLETFYVGLNKNSQVLVDSSANGALLRKTYDEAHEILDRIARNNYEWGTAEDRRRRPIKTNSGSFEVDHMTSVNAKIVALTLKMD
          ++ CPHHGLP  + +E FY+ L+   + LVD++A GAL+ K YDEA  +++ +A + + W   E  + R    N    ++D + ++  +I ALT ++ 
Subjt:  GLLQTCPHHGLPLCIQLETFYVGLNKNSQVLVDSSANGALLRKTYDEAHEILDRIARNNYEWGTAEDRRRRPIKTNSGSFEVDHMTSVNAKIVALTLKMD

Query:  ALTARTAPLIAQLNAFGCEICGEGHTNDQCPS-------NPESVLYMGSSGATRKNPYSSTYNPGWRNHPNFSWSGSAN-------MQQQGNVNHAPRLF
         LT+  +    Q+ AF CE+C   H+  +C S       N E V ++ +   + + PYS+TYNPGWRNHPNFSW    N        Q+QG   +AP   
Subjt:  ALTARTAPLIAQLNAFGCEICGEGHTNDQCPS-------NPESVLYMGSSGATRKNPYSSTYNPGWRNHPNFSWSGSAN-------MQQQGNVNHAPRLF

Query:  NSNNDQSSQGHFQRNMDG--GNQRASSSNALENMMKEFMTSQKEYMAKNDAKNEPTPTQDDP-----------CKNATTEGHSQPPEEKATQEKGSEKKE
        + +  +     + +  D    NQ+A+  N LE  + +      + ++   + + P+ T+++P            +       +Q  EE   ++KG +K E
Subjt:  NSNNDQSSQGHFQRNMDG--GNQRASSSNALENMMKEFMTSQKEYMAKNDAKNEPTPTQDDP-----------CKNATTEGHSQPPEEKATQEKGSEKKE

Query:  AHAEKELDT--YKHALLFPQRLRKKNDEAQFKKFLNVLRELHINVPLVEALEKMPSYAKFLKDILMNKRPWKEFETVALTSECSAILAN---KLPPKLKD
            K L    Y   + FP RL+++  +AQF KFL+V ++L IN+P  EAL++MPSYA+F+KD+L  KR +   E V LT ECS IL      LP K +D
Subjt:  AHAEKELDT--YKHALLFPQRLRKKNDEAQFKKFLNVLRELHINVPLVEALEKMPSYAKFLKDILMNKRPWKEFETVALTSECSAILAN---KLPPKLKD

Query:  PGSFTIPCSIGGIDVGRALCDLGASINLMPLSIYKRLGIGEARPTTITLQLADRSVVHPEGKIEDVLVKVDKFIFPADFIILDYETDKEIPIILGRPFLA
         GSFT+PC+IG       L D GASINLMPLSI+++LG+GE + T ITLQLADRS+ +P+G +E+VLVKVDKFIFP DFI+L+ E D+E+P+ILGRPFLA
Subjt:  PGSFTIPCSIGGIDVGRALCDLGASINLMPLSIYKRLGIGEARPTTITLQLADRSVVHPEGKIEDVLVKVDKFIFPADFIILDYETDKEIPIILGRPFLA

Query:  TRQTLIDVQKGELTMRMNDQKVVFNILSSLKYLDDPKECNMVARIDVCYPQEI--------LSNAVSPTEGGNQEDQERGNRILPPHEVVTHMLDLEGRN
        T + LIDV++G+LT+R+ ++++ FN+  ++K  DD K C  +  ID    + +        + + +   +  + +D+     +    E+     +  G +
Subjt:  TRQTLIDVQKGELTMRMNDQKVVFNILSSLKYLDDPKECNMVARIDVCYPQEI--------LSNAVSPTEGGNQEDQERGNRILPPHEVVTHMLDLEGRN

Query:  PTLP--SITQPPALELKTLPSHLKYRFLGEKESLPVIISSKLDSSQEHLLMEVLARHKKAIGWSLADIKGISPSYCMHKIRLIDESNEFVERQRRLNPAM
         T P  S+TQ P LELK LPSHLKY +LG  ++LP+IISS L   QE  L+ VL  HK+AIGW++ADIKGISP  C H+I L  E    V+ QRRLNP +
Subjt:  PTLP--SITQPPALELKTLPSHLKYRFLGEKESLPVIISSKLDSSQEHLLMEVLARHKKAIGWSLADIKGISPSYCMHKIRLIDESNEFVERQRRLNPAM

Query:  MEVVQKEIIKWLDAGVIYLIADSRWESPIQCVPKKGGMTVVANANNELIPSRPTT---------------------------------GQAYYCFLDGYS
         EVV+KE++K LDAG+IY I+DS+W SP+Q VPKKGGMTVV N  N+LIP+R  T                                 G  YYCFLDGYS
Subjt:  MEVVQKEIIKWLDAGVIYLIADSRWESPIQCVPKKGGMTVVANANNELIPSRPTT---------------------------------GQAYYCFLDGYS

Query:  GYNQITIAPDDQEKTIFTCPYGTFAFRRMPFGLCNAPGTFQRCMMPIFSELLEKSVEVFMDDFSVYGKSFADCLTQLEQVLERCEETHLVLNWEKCHFMV
        GYNQI IAP+DQEKT FTCPYGTFAFRRMPFGLCNAP TFQRCMM IFS++LE  +E+FMDDFSV+GKSF  CLT L  VL+RC+ET+L+LNWEKCHFMV
Subjt:  GYNQITIAPDDQEKTIFTCPYGTFAFRRMPFGLCNAPGTFQRCMMPIFSELLEKSVEVFMDDFSVYGKSFADCLTQLEQVLERCEETHLVLNWEKCHFMV

Query:  EEGIVLGHRISSAGIEVDKAKIDVIEKLLPPTSVKGIRSFLGHAGFYRRHLNPSSLLLLSIAAIL
         EGIVLGH++S  GIEVD+AK+++IEKL PPTS KG+RSFLGHAGFYRR +   S +   +  +L
Subjt:  EEGIVLGHRISSAGIEVDKAKIDVIEKLLPPTSVKGIRSFLGHAGFYRRHLNPSSLLLLSIAAIL

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.63.5e-2923.8Show/hide
Query:  RALCDLGASINLMPLSIYKRLGIGEARPTTITLQLADRSVVHPEGKIEDVLVKVDKFIFPADFIILDYETDKEIPIILGRPFLATRQTLIDVQKGELTMR
        + L D G+++N+   +I+        + T+  +  ++  ++     +   ++   K +FP     L +   +   ++LGR  LA  +  I  +  E+T+ 
Subjt:  RALCDLGASINLMPLSIYKRLGIGEARPTTITLQLADRSVVHPEGKIEDVLVKVDKFIFPADFIILDYETDKEIPIILGRPFLATRQTLIDVQKGELTMR

Query:  MNDQKVVFNILSSLKYLDDPKECNMVARIDVCYPQE---ILSNAVSPTEGGNQEDQERGNRILPP-HEVVTHMLDLEGRNPTLPSITQPPALELKTLPSH
         N  K++  I +  +     +  NM+    +  P +   IL + +   E  N E+++R   +L   H++  H    EG   T  + T+        LP +
Subjt:  MNDQKVVFNILSSLKYLDDPKECNMVARIDVCYPQE---ILSNAVSPTEGGNQEDQERGNRILPP-HEVVTHMLDLEGRNPTLPSITQPPALELKTLPSH

Query:  LKYRFLGEKESLPVIISSKLDSSQEHLLMEVLARHKKAIG----WSLADIKGISPSYCMHKIRLIDESNEFVERQRRLNPAMMEVVQKEIIKWLDAGVIY
         KY       S P     +++S  + +L + + R   +      W +   +  S       +    + NE     R   P M E++ K            
Subjt:  LKYRFLGEKESLPVIISSKLDSSQEHLLMEVLARHKKAIG----WSLADIKGISPSYCMHKIRLIDESNEFVERQRRLNPAMMEVVQKEIIKWLDAGVIY

Query:  LIADSRWESPIQCVPKKGGMTVVANANNELIPSRPTTGQAYYCFLDGYSGYNQITIAPDDQEKTIFTCPYGTFAFRRMPFGLCNAPGTFQRCMMPIFSEL
                              +   N             Y+  +D   G++QI + P+   KT F+  +G + + RMPFGL NAP TFQRCM  I   L
Subjt:  LIADSRWESPIQCVPKKGGMTVVANANNELIPSRPTTGQAYYCFLDGYSGYNQITIAPDDQEKTIFTCPYGTFAFRRMPFGLCNAPGTFQRCMMPIFSEL

Query:  LEKSVEVFMDDFSVYGKSFADCLTQLEQVLERCEETHLVLNWEKCHFMVEEGIVLGHRISSAGIEVDKAKIDVIEKLLPPTSVKGIRSFLGHAGFYRRHL
        L K   V++DD  V+  S  + L  L  V E+  + +L L  +KC F+ +E   LGH ++  GI+ +  KI+ I+K   PT  K I++FLG  G+YR+ +
Subjt:  LEKSVEVFMDDFSVYGKSFADCLTQLEQVLERCEETHLVLNWEKCHFMVEEGIVLGHRISSAGIEVDKAKIDVIEKLLPPTSVKGIRSFLGHAGFYRRHL

P10394 Retrovirus-related Pol polyprotein from transposon 4121.5e-2423.66Show/hide
Query:  LCDLGASINLMPLSIYKRLGIGEARPTTITLQLADRSVVHPEGKIEDVLVKVDKFIFPADFIILDYETDKEIPI----ILGRPFLATRQTLIDVQKGELT
        L D GA I+++  +  K   I       I +Q   +  +   G+   + ++  K++ P DF ++    DK  PI    I+G  F+      ID+ + E  
Subjt:  LCDLGASINLMPLSIYKRLGIGEARPTTITLQLADRSVVHPEGKIEDVLVKVDKFIFPADFIILDYETDKEIPI----ILGRPFLATRQTLIDVQKGELT

Query:  MRMNDQKVVFNILSSLKYLDDPKECNMVARIDVCYPQEILSNAVSPTEGGNQEDQER---GNRILPPHEVVTHMLDLEGRNP--TLPSITQPPALELKTL
          +    + F I   + Y        + AR  V   + I+S+        NQE Q      N I         +L+    +    + ++   P      +
Subjt:  MRMNDQKVVFNILSSLKYLDDPKECNMVARIDVCYPQEILSNAVSPTEGGNQEDQER---GNRILPPHEVVTHMLDLEGRNP--TLPSITQPPALELKTL

Query:  PSHLKYR---FLGE-KESLPVIISSKLDSSQEHLLMEVLARHKKAIGWSLADIKGISPSYCMHKIRLIDESNEFVERQRRLNPAMMEVVQKEIIKWLDAG
         ++ ++R    L + K++ P +  S+L++     + ++ A   + I  +         +    ++RL D+   + +  R  + + +E +Q ++ K +   
Subjt:  PSHLKYR---FLGE-KESLPVIISSKLDSSQEHLLMEVLARHKKAIGWSLADIKGISPSYCMHKIRLIDESNEFVERQRRLNPAMMEVVQKEIIKWLDAG

Query:  VIYLIADSRWESPIQCVPKKGG----------MTVVANANNELIPSR----------PTTGQA-YYCFLDGYSGYNQITIAPDDQEKTIFTCPYGTFAFR
        ++   + S++ SP+  VPKK            +      N +L+  +             G+A Y+  LD  SG++QI +    ++ T F+   G++ F 
Subjt:  VIYLIADSRWESPIQCVPKKGG----------MTVVANANNELIPSR----------PTTGQA-YYCFLDGYSGYNQITIAPDDQEKTIFTCPYGTFAFR

Query:  RMPFGLCNAPGTFQRCMMPIFSELLEKSVEVFMDDFSVYGKSFADCLTQLEQVLERCEETHLVLNWEKCHFMVEEGIVLGHRISSAGIEVDKAKIDVIEK
        R+PFGL  AP +FQR M   FS +      ++MDD  V G S    L  L +V  +C E +L L+ EKC F + E   LGH+ +  GI  D  K DVI+ 
Subjt:  RMPFGLCNAPGTFQRCMMPIFSELLEKSVEVFMDDFSVYGKSFADCLTQLEQVLERCEETHLVLNWEKCHFMVEEGIVLGHRISSAGIEVDKAKIDVIEK

Query:  LLPPTSVKGIRSFLGHAGFYRRHL
           P      R F+    +YRR +
Subjt:  LLPPTSVKGIRSFLGHAGFYRRHL

P10401 Retrovirus-related Pol polyprotein from transposon gypsy3.2e-2230.8Show/hide
Query:  RLNPAMMEV---VQKEIIKWLDAGVIYLIADSRWESPIQCVPKKGGMTVVANANNELI-----------------PSRP----TTGQA-YYCFLDGYSGY
        R  P +M V   V  E+ + L  G+I   + S + SP   V KK G     N N  L+                 PS P      G+A ++  LD  SGY
Subjt:  RLNPAMMEV---VQKEIIKWLDAGVIYLIADSRWESPIQCVPKKGGMTVVANANNELI-----------------PSRP----TTGQA-YYCFLDGYSGY

Query:  NQITIAPDDQEKTIFTCPYGTFAFRRMPFGLCNAPGTFQRCMMPIFSELLEKSVEVFMDDFSVYGKSFADCLTQLEQVLERCEETHLVLNWEKCHFMVEE
        +QI +A  D+EKT F+   G + F R+PFGL NA   FQR +  +  E + K   V++DD  ++ ++ +D +  ++ VL+   + ++ ++ EK  F  E 
Subjt:  NQITIAPDDQEKTIFTCPYGTFAFRRMPFGLCNAPGTFQRCMMPIFSELLEKSVEVFMDDFSVYGKSFADCLTQLEQVLERCEETHLVLNWEKCHFMVEE

Query:  GIVLGHRISSAGIEVDKAKIDVIEKLLPPTSVKGIRSFLGHAGFYRRHLNPSSLLLLSIAAIL
           LG  +S  G + D  K+  I++   P  V  +RSFLG A +YR  +   + +   I  IL
Subjt:  GIVLGHRISSAGIEVDKAKIDVIEKLLPPTSVKGIRSFLGHAGFYRRHLNPSSLLLLSIAAIL

P20825 Retrovirus-related Pol polyprotein from transposon 2972.8e-2632.34Show/hide
Query:  VQKEIIKWLDAGVIYLIADSRWESPIQCVPKKGGMT------VVAN---ANNELIPSR---PTTGQ--------AYYCFLDGYSGYNQITIAPDDQEKTI
        V+ ++ + L+ G+I   ++S + SP   VPKK   +      VV +    N   IP R   P   +         Y+  +D   G++QI +  +   KT 
Subjt:  VQKEIIKWLDAGVIYLIADSRWESPIQCVPKKGGMT------VVAN---ANNELIPSR---PTTGQ--------AYYCFLDGYSGYNQITIAPDDQEKTI

Query:  FTCPYGTFAFRRMPFGLCNAPGTFQRCMMPIFSELLEKSVEVFMDDFSVYGKSFADCLTQLEQVLERCEETHLVLNWEKCHFMVEEGIVLGHRISSAGIE
        F+   G + + RMPFGL NAP TFQRCM  I   LL K   V++DD  ++  S  + L  ++ V  +  + +L L  +KC F+ +E   LGH ++  GI+
Subjt:  FTCPYGTFAFRRMPFGLCNAPGTFQRCMMPIFSELLEKSVEVFMDDFSVYGKSFADCLTQLEQVLERCEETHLVLNWEKCHFMVEEGIVLGHRISSAGIE

Query:  VDKAKIDVIEKLLPPTSVKGIRSFLGHAGFYRRHL
         +  K+  I     PT  K IR+FLG  G+YR+ +
Subjt:  VDKAKIDVIEKLLPPTSVKGIRSFLGHAGFYRRHL

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus4.1e-2532.77Show/hide
Query:  VQKEIIKWLDAGVIYLIADSRWESPIQCVPKK---------------GGMTVVANANNELIPSRPTT----GQA-YYCFLDGYSGYNQITIAPDDQEKTI
        V+++I + L  G+I   ++S + SPI  VPKK                 +  V   +   IP    T    G A Y+  LD  SG++QI +   D  KT 
Subjt:  VQKEIIKWLDAGVIYLIADSRWESPIQCVPKK---------------GGMTVVANANNELIPSRPTT----GQA-YYCFLDGYSGYNQITIAPDDQEKTI

Query:  FTCPYGTFAFRRMPFGLCNAPGTFQRCMMPIFSELLEKSVEVFMDDFSVYGKSFADCLTQLEQVLERCEETHLVLNWEKCHFMVEEGIVLGHRISSAGIE
        F+   G + F R+PFGL NAP  FQR +  I  E + K   V++DD  V+ + +      L  VL    + +L +N EK HF+  +   LG+ +++ GI+
Subjt:  FTCPYGTFAFRRMPFGLCNAPGTFQRCMMPIFSELLEKSVEVFMDDFSVYGKSFADCLTQLEQVLERCEETHLVLNWEKCHFMVEEGIVLGHRISSAGIE

Query:  VDKAKIDVIEKLLPPTSVKGIRSFLGHAGFYRRHL
         D  K+  I ++ PPTSVK ++ FLG   +YR+ +
Subjt:  VDKAKIDVIEKLLPPTSVKGIRSFLGHAGFYRRHL

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTCAAGTTCGTGTTTTAGTTTGTTTTGGTGGAGTATGGGATGAGAAACAAAGAAAATATGAAGGAGGCTTGTTGAAAGGTGTTGTCGTACGTAAGGAAATCTCACA
CAAAGACTTAGTGGTTGATTTATATGCCCTCACAGAAGTCAACCCTAAAGAGTTTGATATGAAAATAAGATGGAAATATGAGGTAAAATTTGAAGATGAAGCTCCTCTAT
TTGAGATAAGTAATGATCGTGATTTGAATTTTTATCTTCTAGGTGAAAATATGTTTGCGGTCCCATTGTACATCTCATTTGAACCTAGGAACAAAGATTTACATGTTGAA
AGTTCTAAAGTTGTTGAACTTGTGCTCCCTGTTGAGTCTTTTTCTCCAGGAACAAGAAATTGTATGTGTGCCGCCCGACGAATTGAATTCGAGGTAGATCCTGAAATCGA
TAGAACATACCACATAAGAAGAAGAAATCGAAGACCTCGAAGGCAACCAGGAGAAATGGCAGCACCAAATCAACCAAATAGACCATTAAACCCCATCCAGATGGCAGATG
ATCGGGCCAGAGGAATTAGAGATTATGCAGCCACTGCAAATTGCAATTTCAATCCAGGGATAGTTCAACCAACTCTGGAAACAGAAAGGTTTGAGTTAAAACCTGTTATG
TTTCAAATGTTCACAGCTGCAGGACAATTTGGAGGAAGTGCAACGGAAGATCCTCACTCTCACCTCAAGTCATTCCTCGACATTTGCAACAAGTTTATCATTCCAGGAGT
TACCCCAGAACAACTTCAAGTCATATTATTTCCTTACTATCTACGAGATGCGGCGAAAGTATGGTTAAACCAGCAGCCAGCAAACTCCATCGCAACATGGAATGTTTTGG
CAGAAAAGTTCCTAGAAAAATACTTTCCACCCACCACAAACTCTAGATTGAGAAAGGAAATAATGACGTTCACACAACTAGACGATGAAACGTTGAGCGAAGCTTGGGAA
CGATTCAAGGGTCTGCTACAAACATGCCCGCACCATGGTTTACCACTGTGCATTCAGCTAGAGACCTTCTATGTAGGTCTTAACAAGAATTCGCAAGTTTTGGTAGACTC
ATCCGCAAATGGTGCATTACTCAGAAAGACTTATGACGAAGCCCACGAAATTCTAGATCGAATTGCACGCAATAACTATGAGTGGGGCACTGCAGAAGATAGAAGACGGA
GACCCATTAAGACCAATTCAGGAAGTTTTGAAGTAGATCATATGACTTCAGTAAACGCAAAGATTGTTGCGTTGACCTTGAAAATGGATGCGCTCACAGCCAGAACCGCA
CCTTTGATCGCGCAACTCAACGCATTTGGATGCGAAATCTGCGGTGAAGGACACACAAATGATCAGTGTCCTTCAAACCCAGAATCAGTTCTCTATATGGGATCAAGTGG
AGCTACGCGCAAAAACCCATATTCAAGCACATATAACCCAGGGTGGAGAAATCACCCCAATTTTTCATGGAGTGGGAGCGCCAATATGCAACAACAAGGTAATGTAAATC
ATGCACCAAGATTATTCAATTCTAACAATGACCAATCTTCGCAGGGACATTTCCAGCGCAACATGGATGGAGGAAACCAAAGAGCATCATCCTCGAACGCATTAGAAAAC
ATGATGAAGGAATTCATGACCAGTCAAAAAGAGTATATGGCCAAGAATGACGCAAAAAATGAACCTACACCAACGCAAGATGACCCATGCAAAAACGCAACCACAGAAGG
GCACAGCCAGCCGCCAGAAGAGAAAGCAACTCAGGAGAAAGGATCCGAAAAAAAGGAAGCTCACGCAGAAAAAGAGCTTGACACATATAAACATGCACTACTTTTTCCTC
AAAGGCTGCGAAAGAAAAATGATGAGGCTCAGTTTAAGAAATTTTTAAATGTTTTGCGAGAACTGCATATTAATGTCCCACTTGTGGAGGCCTTAGAAAAGATGCCAAGT
TATGCTAAGTTCTTGAAGGACATTCTGATGAACAAGCGGCCATGGAAGGAATTCGAAACTGTTGCGCTAACAAGTGAGTGTAGCGCAATACTTGCCAACAAACTTCCTCC
AAAATTGAAAGATCCTGGAAGTTTCACAATTCCCTGTTCGATAGGAGGAATTGATGTTGGCAGAGCACTATGTGACCTGGGCGCAAGTATAAATTTGATGCCATTGTCTA
TCTACAAGCGCCTAGGGATCGGAGAAGCAAGACCGACAACCATTACACTACAGTTAGCAGACAGATCCGTTGTCCACCCTGAGGGAAAGATTGAAGACGTTTTAGTCAAG
GTTGACAAATTCATTTTTCCAGCAGATTTCATCATTTTGGACTATGAAACCGATAAAGAAATTCCCATTATTTTGGGAAGACCATTCCTTGCCACCAGACAAACGTTAAT
TGACGTACAAAAGGGAGAACTGACAATGCGCATGAACGATCAGAAGGTTGTCTTTAATATTCTCTCTTCACTAAAATATCTTGATGATCCCAAGGAATGCAACATGGTAG
CCCGCATAGATGTATGTTATCCGCAGGAAATACTATCAAATGCGGTCAGCCCCACAGAAGGAGGAAACCAAGAAGATCAGGAAAGAGGAAACCGCATCTTACCGCCGCAT
GAAGTAGTTACCCACATGCTGGATCTAGAAGGAAGAAACCCCACCCTGCCTTCCATAACGCAACCTCCTGCACTAGAACTTAAAACCTTGCCATCGCATTTGAAGTATAG
ATTCTTGGGAGAAAAAGAATCACTCCCAGTCATAATCTCATCAAAGCTTGATTCATCGCAAGAACACCTACTGATGGAGGTACTCGCAAGACACAAAAAGGCCATTGGGT
GGAGTTTAGCAGATATTAAAGGTATTAGCCCTTCCTATTGCATGCACAAAATTCGATTGATAGATGAATCCAATGAATTCGTTGAAAGGCAAAGGAGGCTAAATCCCGCA
ATGATGGAAGTAGTACAAAAGGAGATAATTAAGTGGCTCGACGCAGGCGTAATATATCTCATCGCAGATAGCAGATGGGAGAGCCCAATCCAATGCGTGCCCAAGAAAGG
AGGCATGACAGTAGTGGCTAACGCAAACAACGAGCTGATTCCATCCCGCCCCACTACAGGACAAGCTTATTACTGCTTTCTCGATGGATATTCTGGATATAACCAAATTA
CCATTGCGCCAGACGACCAAGAGAAGACCATTTTCACATGTCCATATGGCACCTTCGCATTTAGAAGAATGCCCTTTGGATTGTGTAATGCGCCTGGGACATTCCAAAGG
TGCATGATGCCCATCTTCTCAGAACTTCTAGAGAAATCGGTGGAAGTTTTTATGGATGACTTCTCAGTATATGGGAAGTCTTTCGCAGACTGTTTAACACAGCTTGAGCA
GGTATTGGAAAGATGCGAAGAAACCCACCTTGTGTTGAATTGGGAAAAATGTCATTTCATGGTGGAAGAAGGCATTGTCTTAGGCCACCGCATATCAAGCGCAGGTATTG
AAGTGGATAAAGCTAAGATTGATGTCATTGAGAAATTACTACCTCCTACATCTGTAAAAGGAATTCGCAGCTTCCTGGGACATGCGGGATTCTATCGCAGGCACCTCAAC
CCTAGCTCCCTTCTTCTACTCTCAATCGCCGCAATCCTCTCCCTTCTTCATTGTTTTTCTTCAACGCTGCCGCAACCCTCTCCCTTCTTCATTGTTCTTCTTCAATGTCG
CCGCAGCTCTCTCCCCGAAGGCACATCAACGACCATCGCAGCAGGCTCTTCGTCCGCCGTTCGCAACAGGTTCTTCGTCCGCCGCAGCTTCCCTTCTTCGTCCACCGTTC
GTCGCGGCTTCCCTTCTTCGTCCGCCATTCGTCTGCCGTTCGTCGGAGCTTTGCTTCCCCTCCACGACGGTGAACAAACCTCCAAGACAGCAGTAAGGCGGCGACTAGCA
GCGTGGGTCCGAAGGCGTCTGGACGTCGACTGGACTAAAAGTATCTGA
mRNA sequenceShow/hide mRNA sequence
ATGACTCAAGTTCGTGTTTTAGTTTGTTTTGGTGGAGTATGGGATGAGAAACAAAGAAAATATGAAGGAGGCTTGTTGAAAGGTGTTGTCGTACGTAAGGAAATCTCACA
CAAAGACTTAGTGGTTGATTTATATGCCCTCACAGAAGTCAACCCTAAAGAGTTTGATATGAAAATAAGATGGAAATATGAGGTAAAATTTGAAGATGAAGCTCCTCTAT
TTGAGATAAGTAATGATCGTGATTTGAATTTTTATCTTCTAGGTGAAAATATGTTTGCGGTCCCATTGTACATCTCATTTGAACCTAGGAACAAAGATTTACATGTTGAA
AGTTCTAAAGTTGTTGAACTTGTGCTCCCTGTTGAGTCTTTTTCTCCAGGAACAAGAAATTGTATGTGTGCCGCCCGACGAATTGAATTCGAGGTAGATCCTGAAATCGA
TAGAACATACCACATAAGAAGAAGAAATCGAAGACCTCGAAGGCAACCAGGAGAAATGGCAGCACCAAATCAACCAAATAGACCATTAAACCCCATCCAGATGGCAGATG
ATCGGGCCAGAGGAATTAGAGATTATGCAGCCACTGCAAATTGCAATTTCAATCCAGGGATAGTTCAACCAACTCTGGAAACAGAAAGGTTTGAGTTAAAACCTGTTATG
TTTCAAATGTTCACAGCTGCAGGACAATTTGGAGGAAGTGCAACGGAAGATCCTCACTCTCACCTCAAGTCATTCCTCGACATTTGCAACAAGTTTATCATTCCAGGAGT
TACCCCAGAACAACTTCAAGTCATATTATTTCCTTACTATCTACGAGATGCGGCGAAAGTATGGTTAAACCAGCAGCCAGCAAACTCCATCGCAACATGGAATGTTTTGG
CAGAAAAGTTCCTAGAAAAATACTTTCCACCCACCACAAACTCTAGATTGAGAAAGGAAATAATGACGTTCACACAACTAGACGATGAAACGTTGAGCGAAGCTTGGGAA
CGATTCAAGGGTCTGCTACAAACATGCCCGCACCATGGTTTACCACTGTGCATTCAGCTAGAGACCTTCTATGTAGGTCTTAACAAGAATTCGCAAGTTTTGGTAGACTC
ATCCGCAAATGGTGCATTACTCAGAAAGACTTATGACGAAGCCCACGAAATTCTAGATCGAATTGCACGCAATAACTATGAGTGGGGCACTGCAGAAGATAGAAGACGGA
GACCCATTAAGACCAATTCAGGAAGTTTTGAAGTAGATCATATGACTTCAGTAAACGCAAAGATTGTTGCGTTGACCTTGAAAATGGATGCGCTCACAGCCAGAACCGCA
CCTTTGATCGCGCAACTCAACGCATTTGGATGCGAAATCTGCGGTGAAGGACACACAAATGATCAGTGTCCTTCAAACCCAGAATCAGTTCTCTATATGGGATCAAGTGG
AGCTACGCGCAAAAACCCATATTCAAGCACATATAACCCAGGGTGGAGAAATCACCCCAATTTTTCATGGAGTGGGAGCGCCAATATGCAACAACAAGGTAATGTAAATC
ATGCACCAAGATTATTCAATTCTAACAATGACCAATCTTCGCAGGGACATTTCCAGCGCAACATGGATGGAGGAAACCAAAGAGCATCATCCTCGAACGCATTAGAAAAC
ATGATGAAGGAATTCATGACCAGTCAAAAAGAGTATATGGCCAAGAATGACGCAAAAAATGAACCTACACCAACGCAAGATGACCCATGCAAAAACGCAACCACAGAAGG
GCACAGCCAGCCGCCAGAAGAGAAAGCAACTCAGGAGAAAGGATCCGAAAAAAAGGAAGCTCACGCAGAAAAAGAGCTTGACACATATAAACATGCACTACTTTTTCCTC
AAAGGCTGCGAAAGAAAAATGATGAGGCTCAGTTTAAGAAATTTTTAAATGTTTTGCGAGAACTGCATATTAATGTCCCACTTGTGGAGGCCTTAGAAAAGATGCCAAGT
TATGCTAAGTTCTTGAAGGACATTCTGATGAACAAGCGGCCATGGAAGGAATTCGAAACTGTTGCGCTAACAAGTGAGTGTAGCGCAATACTTGCCAACAAACTTCCTCC
AAAATTGAAAGATCCTGGAAGTTTCACAATTCCCTGTTCGATAGGAGGAATTGATGTTGGCAGAGCACTATGTGACCTGGGCGCAAGTATAAATTTGATGCCATTGTCTA
TCTACAAGCGCCTAGGGATCGGAGAAGCAAGACCGACAACCATTACACTACAGTTAGCAGACAGATCCGTTGTCCACCCTGAGGGAAAGATTGAAGACGTTTTAGTCAAG
GTTGACAAATTCATTTTTCCAGCAGATTTCATCATTTTGGACTATGAAACCGATAAAGAAATTCCCATTATTTTGGGAAGACCATTCCTTGCCACCAGACAAACGTTAAT
TGACGTACAAAAGGGAGAACTGACAATGCGCATGAACGATCAGAAGGTTGTCTTTAATATTCTCTCTTCACTAAAATATCTTGATGATCCCAAGGAATGCAACATGGTAG
CCCGCATAGATGTATGTTATCCGCAGGAAATACTATCAAATGCGGTCAGCCCCACAGAAGGAGGAAACCAAGAAGATCAGGAAAGAGGAAACCGCATCTTACCGCCGCAT
GAAGTAGTTACCCACATGCTGGATCTAGAAGGAAGAAACCCCACCCTGCCTTCCATAACGCAACCTCCTGCACTAGAACTTAAAACCTTGCCATCGCATTTGAAGTATAG
ATTCTTGGGAGAAAAAGAATCACTCCCAGTCATAATCTCATCAAAGCTTGATTCATCGCAAGAACACCTACTGATGGAGGTACTCGCAAGACACAAAAAGGCCATTGGGT
GGAGTTTAGCAGATATTAAAGGTATTAGCCCTTCCTATTGCATGCACAAAATTCGATTGATAGATGAATCCAATGAATTCGTTGAAAGGCAAAGGAGGCTAAATCCCGCA
ATGATGGAAGTAGTACAAAAGGAGATAATTAAGTGGCTCGACGCAGGCGTAATATATCTCATCGCAGATAGCAGATGGGAGAGCCCAATCCAATGCGTGCCCAAGAAAGG
AGGCATGACAGTAGTGGCTAACGCAAACAACGAGCTGATTCCATCCCGCCCCACTACAGGACAAGCTTATTACTGCTTTCTCGATGGATATTCTGGATATAACCAAATTA
CCATTGCGCCAGACGACCAAGAGAAGACCATTTTCACATGTCCATATGGCACCTTCGCATTTAGAAGAATGCCCTTTGGATTGTGTAATGCGCCTGGGACATTCCAAAGG
TGCATGATGCCCATCTTCTCAGAACTTCTAGAGAAATCGGTGGAAGTTTTTATGGATGACTTCTCAGTATATGGGAAGTCTTTCGCAGACTGTTTAACACAGCTTGAGCA
GGTATTGGAAAGATGCGAAGAAACCCACCTTGTGTTGAATTGGGAAAAATGTCATTTCATGGTGGAAGAAGGCATTGTCTTAGGCCACCGCATATCAAGCGCAGGTATTG
AAGTGGATAAAGCTAAGATTGATGTCATTGAGAAATTACTACCTCCTACATCTGTAAAAGGAATTCGCAGCTTCCTGGGACATGCGGGATTCTATCGCAGGCACCTCAAC
CCTAGCTCCCTTCTTCTACTCTCAATCGCCGCAATCCTCTCCCTTCTTCATTGTTTTTCTTCAACGCTGCCGCAACCCTCTCCCTTCTTCATTGTTCTTCTTCAATGTCG
CCGCAGCTCTCTCCCCGAAGGCACATCAACGACCATCGCAGCAGGCTCTTCGTCCGCCGTTCGCAACAGGTTCTTCGTCCGCCGCAGCTTCCCTTCTTCGTCCACCGTTC
GTCGCGGCTTCCCTTCTTCGTCCGCCATTCGTCTGCCGTTCGTCGGAGCTTTGCTTCCCCTCCACGACGGTGAACAAACCTCCAAGACAGCAGTAAGGCGGCGACTAGCA
GCGTGGGTCCGAAGGCGTCTGGACGTCGACTGGACTAAAAGTATCTGA
Protein sequenceShow/hide protein sequence
MTQVRVLVCFGGVWDEKQRKYEGGLLKGVVVRKEISHKDLVVDLYALTEVNPKEFDMKIRWKYEVKFEDEAPLFEISNDRDLNFYLLGENMFAVPLYISFEPRNKDLHVE
SSKVVELVLPVESFSPGTRNCMCAARRIEFEVDPEIDRTYHIRRRNRRPRRQPGEMAAPNQPNRPLNPIQMADDRARGIRDYAATANCNFNPGIVQPTLETERFELKPVM
FQMFTAAGQFGGSATEDPHSHLKSFLDICNKFIIPGVTPEQLQVILFPYYLRDAAKVWLNQQPANSIATWNVLAEKFLEKYFPPTTNSRLRKEIMTFTQLDDETLSEAWE
RFKGLLQTCPHHGLPLCIQLETFYVGLNKNSQVLVDSSANGALLRKTYDEAHEILDRIARNNYEWGTAEDRRRRPIKTNSGSFEVDHMTSVNAKIVALTLKMDALTARTA
PLIAQLNAFGCEICGEGHTNDQCPSNPESVLYMGSSGATRKNPYSSTYNPGWRNHPNFSWSGSANMQQQGNVNHAPRLFNSNNDQSSQGHFQRNMDGGNQRASSSNALEN
MMKEFMTSQKEYMAKNDAKNEPTPTQDDPCKNATTEGHSQPPEEKATQEKGSEKKEAHAEKELDTYKHALLFPQRLRKKNDEAQFKKFLNVLRELHINVPLVEALEKMPS
YAKFLKDILMNKRPWKEFETVALTSECSAILANKLPPKLKDPGSFTIPCSIGGIDVGRALCDLGASINLMPLSIYKRLGIGEARPTTITLQLADRSVVHPEGKIEDVLVK
VDKFIFPADFIILDYETDKEIPIILGRPFLATRQTLIDVQKGELTMRMNDQKVVFNILSSLKYLDDPKECNMVARIDVCYPQEILSNAVSPTEGGNQEDQERGNRILPPH
EVVTHMLDLEGRNPTLPSITQPPALELKTLPSHLKYRFLGEKESLPVIISSKLDSSQEHLLMEVLARHKKAIGWSLADIKGISPSYCMHKIRLIDESNEFVERQRRLNPA
MMEVVQKEIIKWLDAGVIYLIADSRWESPIQCVPKKGGMTVVANANNELIPSRPTTGQAYYCFLDGYSGYNQITIAPDDQEKTIFTCPYGTFAFRRMPFGLCNAPGTFQR
CMMPIFSELLEKSVEVFMDDFSVYGKSFADCLTQLEQVLERCEETHLVLNWEKCHFMVEEGIVLGHRISSAGIEVDKAKIDVIEKLLPPTSVKGIRSFLGHAGFYRRHLN
PSSLLLLSIAAILSLLHCFSSTLPQPSPFFIVLLQCRRSSLPEGTSTTIAAGSSSAVRNRFFVRRSFPSSSTVRRGFPSSSAIRLPFVGALLPLHDGEQTSKTAVRRRLA
AWVRRRLDVDWTKSI