; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0017822 (gene) of Snake gourd v1 genome

Gene IDTan0017822
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG06:41513850..41515614
RNA-Seq ExpressionTan0017822
SyntenyTan0017822
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0005488 - binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]1.8e-21565.87Show/hide
Query:  MMSYAHLPDSFWGYAVETVGFILNNVPSKSVSETPFELWKGRKGSLCHFSIWGCPAHMLVTNPKKLDSRSKLCLFVCYTKETRGGLFYDPKEDKVLVSTN
        MMSYA LPDSFWGYA+ET   ILNNVPSKSV ETP+ELWKGRK SL +F IWGCPAH+LV NPKKL+ RSKLCLFV Y KE+RGGLFY P+E+KV VSTN
Subjt:  MMSYAHLPDSFWGYAVETVGFILNNVPSKSVSETPFELWKGRKGSLCHFSIWGCPAHMLVTNPKKLDSRSKLCLFVCYTKETRGGLFYDPKEDKVLVSTN

Query:  TIFLEEDHIRDHKPRSKVVLSELDKTIANVANKASTSTNVV-YTNLSSHESPSQELSMPRRSGRVITQPDRYMGLSETQVVIPNDDCEDPLTYNQAMVNK
          FLEEDH R+H+PRSK+VL E+ K   N  +K S+ST VV   N+S     SQEL +PRRSGRV+ QP+RY+GL ETQ++IP+D  EDPLTY QAM + 
Subjt:  TIFLEEDHIRDHKPRSKVVLSELDKTIANVANKASTSTNVV-YTNLSSHESPSQELSMPRRSGRVITQPDRYMGLSETQVVIPNDDCEDPLTYNQAMVNK

Query:  DRQ---------------------MDKPDGVKPIGCKWIYKKRRGVDKNVQIFKARLVAKGYTQVEGVDYEETFSPVAM---------------------
        DR                      +D P  VKPIGCKWIYK++R     VQ FKARLVAKGYTQ EGVDYEETFSPVAM                     
Subjt:  DRQ---------------------MDKPDGVKPIGCKWIYKKRRGVDKNVQIFKARLVAKGYTQVEGVDYEETFSPVAM---------------------

Query:  ---TTFLNGNLDENIYMEQLIGFIEPGQEQKVCRLKRSIYGLKQASRSWNKRFDEPIKSFGFDPNDDEPCVYKKIINSSVAFLVLYVDDILLIRNDVGYL
           T FLNGNL+E+IYM Q  GFI   QEQKVC+L++SIYGLKQASRSWN RFD  IKS+GF+ N DEPCVYKKI+NS VAFL+LYVDDILLI NDV YL
Subjt:  ---TTFLNGNLDENIYMEQLIGFIEPGQEQKVCRLKRSIYGLKQASRSWNKRFDEPIKSFGFDPNDDEPCVYKKIINSSVAFLVLYVDDILLIRNDVGYL

Query:  TDIKNWLATQFKMKDLGEAQFVLGIQIVQNRKNNTLALSQTSYIDKVLLRFKMQESKK---------------CPKTPQGVEDMRRVPYASVVGSLMYVM
        TD+K WL TQF+MKDLGEAQ++LGIQIV+NRKN TLA+SQ SYIDKVL R+KMQ SKK               CPKTPQ VEDMR +PY+S VGSLMY M
Subjt:  TDIKNWLATQFKMKDLGEAQFVLGIQIVQNRKNNTLALSQTSYIDKVLLRFKMQESKK---------------CPKTPQGVEDMRRVPYASVVGSLMYVM

Query:  LCTRPNICFIAGMVSRHQSNPGPEHWTAVKTIFKYLRRTRNYMLMYWAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGVLVWRSIK
        LCTRP+IC+  G+VSR+QSNPG +HWTAVK I KYLRRTRNYML+Y AKDLILTGYTDSDFQ++KD+RKSTS SVFTLNGG +VWRS+K
Subjt:  LCTRPNICFIAGMVSRHQSNPGPEHWTAVKTIFKYLRRTRNYMLMYWAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGVLVWRSIK

KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]1.5e-21766.5Show/hide
Query:  MMSYAHLPDSFWGYAVETVGFILNNVPSKSVSETPFELWKGRKGSLCHFSIWGCPAHMLVTNPKKLDSRSKLCLFVCYTKETRGGLFYDPKEDKVLVSTN
        MMSYA LP SFWGYAVET   ILNNVPSKSVSETPFELW+GRK SL HF IWGCPAH+LVTNPKKL+ RS+LC FV Y KETRGGLF+DP+E++V VSTN
Subjt:  MMSYAHLPDSFWGYAVETVGFILNNVPSKSVSETPFELWKGRKGSLCHFSIWGCPAHMLVTNPKKLDSRSKLCLFVCYTKETRGGLFYDPKEDKVLVSTN

Query:  TIFLEEDHIRDHKPRSKVVLSELDKTIANVANKASTSTNVVYTNLSSHESPSQELSMPRRSGRVITQPDRYMGLSETQVVIPNDDCEDPLTYNQAM--VN
          FLEEDH+R+HKPRSK+VLSE       V ++   S+ V  T  S    PSQ L MPRRSGRV++QP+RY+GL+ETQVVIP+D  EDPL+Y QAM  V+
Subjt:  TIFLEEDHIRDHKPRSKVVLSELDKTIANVANKASTSTNVVYTNLSSHESPSQELSMPRRSGRVITQPDRYMGLSETQVVIPNDDCEDPLTYNQAM--VN

Query:  KDR-------------------QMDKPDGVKPIGCKWIYKKRRGVDKNVQIFKARLVAKGYTQVEGVDYEETFSPVAM----------------------
        KD+                    +D P+GVKPIGCKWIYK++R     VQ FKARLVAKGYTQ EGVDYEETFSPVAM                      
Subjt:  KDR-------------------QMDKPDGVKPIGCKWIYKKRRGVDKNVQIFKARLVAKGYTQVEGVDYEETFSPVAM----------------------

Query:  --TTFLNGNLDENIYMEQLIGFIEPGQEQKVCRLKRSIYGLKQASRSWNKRFDEPIKSFGFDPNDDEPCVYKKIINSSVAFLVLYVDDILLIRNDVGYLT
          T FLNGNL+E+I+M Q  GFI  GQEQKVC+L RSIYGLKQASRSWN RFD  IKS+GFD N DEPCVYKKI    VAFLVLYVDDILLI NDVGYLT
Subjt:  --TTFLNGNLDENIYMEQLIGFIEPGQEQKVCRLKRSIYGLKQASRSWNKRFDEPIKSFGFDPNDDEPCVYKKIINSSVAFLVLYVDDILLIRNDVGYLT

Query:  DIKNWLATQFKMKDLGEAQFVLGIQIVQNRKNNTLALSQTSYIDKVLLRFKMQESKK---------------CPKTPQGVEDMRRVPYASVVGSLMYVML
        D+K WLA QF+MKDLGEAQ+VLGIQI+++RKN TLALSQ +YIDK+L+R+ MQ SKK                PKTPQ VEDMRR+PYAS VGSLMY ML
Subjt:  DIKNWLATQFKMKDLGEAQFVLGIQIVQNRKNNTLALSQTSYIDKVLLRFKMQESKK---------------CPKTPQGVEDMRRVPYASVVGSLMYVML

Query:  CTRPNICFIAGMVSRHQSNPGPEHWTAVKTIFKYLRRTRNYMLMYWAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGVLVWRSIK
        CTRP+IC+  G+VSR+QSNPG +HWTAVK + KYLRRTR+YML+Y AKDLILTGYTDSDFQT+KDSRKSTS SVFTLNGG +VWRSIK
Subjt:  CTRPNICFIAGMVSRHQSNPGPEHWTAVKTIFKYLRRTRNYMLMYWAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGVLVWRSIK

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]1.2e-21666.33Show/hide
Query:  MMSYAHLPDSFWGYAVETVGFILNNVPSKSVSETPFELWKGRKGSLCHFSIWGCPAHMLVTNPKKLDSRSKLCLFVCYTKETRGGLFYDPKEDKVLVSTN
        MMSYA LP SFWGYAVET   ILNNVPSKSVSETPFELW+GRK SL HF IWGCPAH+LVTNPKKL+ RS+LC FV Y KETRGGLF+DPKE++V VSTN
Subjt:  MMSYAHLPDSFWGYAVETVGFILNNVPSKSVSETPFELWKGRKGSLCHFSIWGCPAHMLVTNPKKLDSRSKLCLFVCYTKETRGGLFYDPKEDKVLVSTN

Query:  TIFLEEDHIRDHKPRSKVVLSELDKTIANVANKASTSTNVVYTNLSSHESPSQELSMPRRSGRVITQPDRYMGLSETQVVIPNDDCEDPLTYNQAM--VN
          FLEEDH+R+HKPRSK+VLSE       V ++   S+ V  T  S    PSQ L MPRRSGRV++QP+RY+GL+ETQVVIP+D  EDPL+Y QAM  V+
Subjt:  TIFLEEDHIRDHKPRSKVVLSELDKTIANVANKASTSTNVVYTNLSSHESPSQELSMPRRSGRVITQPDRYMGLSETQVVIPNDDCEDPLTYNQAM--VN

Query:  KDR-------------------QMDKPDGVKPIGCKWIYKKRRGVDKNVQIFKARLVAKGYTQVEGVDYEETFSPVAM----------------------
        KD+                    +D P+GVKPIGCKWIYK++R     VQ FKARLVAKGYT+ EGVDYEETFS VAM                      
Subjt:  KDR-------------------QMDKPDGVKPIGCKWIYKKRRGVDKNVQIFKARLVAKGYTQVEGVDYEETFSPVAM----------------------

Query:  --TTFLNGNLDENIYMEQLIGFIEPGQEQKVCRLKRSIYGLKQASRSWNKRFDEPIKSFGFDPNDDEPCVYKKIINSSVAFLVLYVDDILLIRNDVGYLT
          T FLNGNL+E+I+M Q  GFI  GQEQKVC+L RSIYGLKQASRSWN RFD  IKS+GFD N DEPCVYKKI    VAFLVLYVDDILLI NDVGYLT
Subjt:  --TTFLNGNLDENIYMEQLIGFIEPGQEQKVCRLKRSIYGLKQASRSWNKRFDEPIKSFGFDPNDDEPCVYKKIINSSVAFLVLYVDDILLIRNDVGYLT

Query:  DIKNWLATQFKMKDLGEAQFVLGIQIVQNRKNNTLALSQTSYIDKVLLRFKMQESKK---------------CPKTPQGVEDMRRVPYASVVGSLMYVML
        D+K WLA QF+MKDLGE Q+VLGIQI+++RKN TLALSQ +YIDK+L+R+ MQ SKK                PKTPQ VEDMRR+PYAS VGSLMY ML
Subjt:  DIKNWLATQFKMKDLGEAQFVLGIQIVQNRKNNTLALSQTSYIDKVLLRFKMQESKK---------------CPKTPQGVEDMRRVPYASVVGSLMYVML

Query:  CTRPNICFIAGMVSRHQSNPGPEHWTAVKTIFKYLRRTRNYMLMYWAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGVLVWRSIK
        CTRP+IC+  G+VSR+QSNPG +HWTAVK I KYLRRTR+YML+Y AKDLILTGYT+SDFQT+KDSRKSTSRSVFTLNGG +VWRSIK
Subjt:  CTRPNICFIAGMVSRHQSNPGPEHWTAVKTIFKYLRRTRNYMLMYWAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGVLVWRSIK

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]2.9e-21364.86Show/hide
Query:  MMSYAHLPDSFWGYAVETVGFILNNVPSKSVSETPFELWKGRKGSLCHFSIWGCPAHMLVTNPKKLDSRSKLCLFVCYTKETRGGLFYDPKEDKVLVSTN
        MMSYAHLP+SFWGYAV+T  +ILN VPSKSVSETP +LW GRKGSL HF IWGCPAH+L  NPKKL+ RSKLCLFV Y K TRGG FYDPK++KV VSTN
Subjt:  MMSYAHLPDSFWGYAVETVGFILNNVPSKSVSETPFELWKGRKGSLCHFSIWGCPAHMLVTNPKKLDSRSKLCLFVCYTKETRGGLFYDPKEDKVLVSTN

Query:  TIFLEEDHIRDHKPRSKVVLSELDKTI----ANVANKASTSTNVVYTNLSSHESPSQELSMPRRSGRVITQPDRYMGLSETQVVIPNDDCEDPLTYNQAM
          FLEEDHIR+HKPRSK+VL+EL K        V  + S  T VV+   S+     Q L  PRRSGRV   P RYM L+ET  VI + D EDPLT+ +AM
Subjt:  TIFLEEDHIRDHKPRSKVVLSELDKTI----ANVANKASTSTNVVYTNLSSHESPSQELSMPRRSGRVITQPDRYMGLSETQVVIPNDDCEDPLTYNQAM

Query:  VNKDRQ---------------------MDKPDGVKPIGCKWIYKKRRGVDKNVQIFKARLVAKGYTQVEGVDYEETFSPVAM------------------
         + D+                      +D+PDGVKPIGCKWIYK++RG D  VQ FKARLVAKGYTQVEGVDYEETFSPVAM                  
Subjt:  VNKDRQ---------------------MDKPDGVKPIGCKWIYKKRRGVDKNVQIFKARLVAKGYTQVEGVDYEETFSPVAM------------------

Query:  ------TTFLNGNLDENIYMEQLIGFIEPGQEQKVCRLKRSIYGLKQASRSWNKRFDEPIKSFGFDPNDDEPCVYKKIINSSVAFLVLYVDDILLIRNDV
              T FLNGNL+E IYM+Q  GFI PGQEQK+C+L RSIYGLKQASRSWN RFD  IKS+GFD   DEPCVYK+IIN SVAFLVLYVDDILLI ND+
Subjt:  ------TTFLNGNLDENIYMEQLIGFIEPGQEQKVCRLKRSIYGLKQASRSWNKRFDEPIKSFGFDPNDDEPCVYKKIINSSVAFLVLYVDDILLIRNDV

Query:  GYLTDIKNWLATQFKMKDLGEAQFVLGIQIVQNRKNNTLALSQTSYIDKVLLRFKMQESKK---------------CPKTPQGVEDMRRVPYASVVGSLM
        G LTDIK WLATQF+MKDLGEAQFVLGIQI ++RKN  LALSQ SYIDK+++++ MQ SK+               CPKTPQ VE+MR +PYAS VGSLM
Subjt:  GYLTDIKNWLATQFKMKDLGEAQFVLGIQIVQNRKNNTLALSQTSYIDKVLLRFKMQESKK---------------CPKTPQGVEDMRRVPYASVVGSLM

Query:  YVMLCTRPNICFIAGMVSRHQSNPGPEHWTAVKTIFKYLRRTRNYMLMYWAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGVLVWRSIK
        Y MLCTRP+IC+  G+VSR+QSNPG  HWTAVKTI KYLRRTR+Y L+Y +KDLILTGYTDSDFQT++DSRKSTS SVFTLNGG +VWRSIK
Subjt:  YVMLCTRPNICFIAGMVSRHQSNPGPEHWTAVKTIFKYLRRTRNYMLMYWAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGVLVWRSIK

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]1.5e-21766.5Show/hide
Query:  MMSYAHLPDSFWGYAVETVGFILNNVPSKSVSETPFELWKGRKGSLCHFSIWGCPAHMLVTNPKKLDSRSKLCLFVCYTKETRGGLFYDPKEDKVLVSTN
        MMSYA LP SFWGYAVET   ILNNVPSKSVSETPFELW+GRK SL HF IWGCPAH+LVTNPKKL+ RS+LC FV Y KETRGGLF+DP+E++V VSTN
Subjt:  MMSYAHLPDSFWGYAVETVGFILNNVPSKSVSETPFELWKGRKGSLCHFSIWGCPAHMLVTNPKKLDSRSKLCLFVCYTKETRGGLFYDPKEDKVLVSTN

Query:  TIFLEEDHIRDHKPRSKVVLSELDKTIANVANKASTSTNVVYTNLSSHESPSQELSMPRRSGRVITQPDRYMGLSETQVVIPNDDCEDPLTYNQAM--VN
          FLEEDH+R+HKPRSK+VLSE       V ++   S+ V  T  S    PSQ L MPRRSGRV++QP+RY+GL+ETQVVIP+D  EDPL+Y QAM  V+
Subjt:  TIFLEEDHIRDHKPRSKVVLSELDKTIANVANKASTSTNVVYTNLSSHESPSQELSMPRRSGRVITQPDRYMGLSETQVVIPNDDCEDPLTYNQAM--VN

Query:  KDR-------------------QMDKPDGVKPIGCKWIYKKRRGVDKNVQIFKARLVAKGYTQVEGVDYEETFSPVAM----------------------
        KD+                    +D P+GVKPIGCKWIYK++R     VQ FKARLVAKGYTQ EGVDYEETFSPVAM                      
Subjt:  KDR-------------------QMDKPDGVKPIGCKWIYKKRRGVDKNVQIFKARLVAKGYTQVEGVDYEETFSPVAM----------------------

Query:  --TTFLNGNLDENIYMEQLIGFIEPGQEQKVCRLKRSIYGLKQASRSWNKRFDEPIKSFGFDPNDDEPCVYKKIINSSVAFLVLYVDDILLIRNDVGYLT
          T FLNGNL+E+I+M Q  GFI  GQEQKVC+L RSIYGLKQASRSWN RFD  IKS+GFD N DEPCVYKKI    VAFLVLYVDDILLI NDVGYLT
Subjt:  --TTFLNGNLDENIYMEQLIGFIEPGQEQKVCRLKRSIYGLKQASRSWNKRFDEPIKSFGFDPNDDEPCVYKKIINSSVAFLVLYVDDILLIRNDVGYLT

Query:  DIKNWLATQFKMKDLGEAQFVLGIQIVQNRKNNTLALSQTSYIDKVLLRFKMQESKK---------------CPKTPQGVEDMRRVPYASVVGSLMYVML
        D+K WLA QF+MKDLGEAQ+VLGIQI+++RKN TLALSQ +YIDK+L+R+ MQ SKK                PKTPQ VEDMRR+PYAS VGSLMY ML
Subjt:  DIKNWLATQFKMKDLGEAQFVLGIQIVQNRKNNTLALSQTSYIDKVLLRFKMQESKK---------------CPKTPQGVEDMRRVPYASVVGSLMYVML

Query:  CTRPNICFIAGMVSRHQSNPGPEHWTAVKTIFKYLRRTRNYMLMYWAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGVLVWRSIK
        CTRP+IC+  G+VSR+QSNPG +HWTAVK + KYLRRTR+YML+Y AKDLILTGYTDSDFQT+KDSRKSTS SVFTLNGG +VWRSIK
Subjt:  CTRPNICFIAGMVSRHQSNPGPEHWTAVKTIFKYLRRTRNYMLMYWAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGVLVWRSIK

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein1.4e-21364.86Show/hide
Query:  MMSYAHLPDSFWGYAVETVGFILNNVPSKSVSETPFELWKGRKGSLCHFSIWGCPAHMLVTNPKKLDSRSKLCLFVCYTKETRGGLFYDPKEDKVLVSTN
        MMSYAHLP+SFWGYAV+T  +ILN VPSKSVSETP +LW GRKGSL HF IWGCPAH+L  NPKKL+ RSKLCLFV Y K TRGG FYDPK++KV VSTN
Subjt:  MMSYAHLPDSFWGYAVETVGFILNNVPSKSVSETPFELWKGRKGSLCHFSIWGCPAHMLVTNPKKLDSRSKLCLFVCYTKETRGGLFYDPKEDKVLVSTN

Query:  TIFLEEDHIRDHKPRSKVVLSELDKTI----ANVANKASTSTNVVYTNLSSHESPSQELSMPRRSGRVITQPDRYMGLSETQVVIPNDDCEDPLTYNQAM
          FLEEDHIR+HKPRSK+VL+EL K        V  + S  T VV+   S+     Q L  PRRSGRV   P RYM L+ET  VI + D EDPLT+ +AM
Subjt:  TIFLEEDHIRDHKPRSKVVLSELDKTI----ANVANKASTSTNVVYTNLSSHESPSQELSMPRRSGRVITQPDRYMGLSETQVVIPNDDCEDPLTYNQAM

Query:  VNKDRQ---------------------MDKPDGVKPIGCKWIYKKRRGVDKNVQIFKARLVAKGYTQVEGVDYEETFSPVAM------------------
         + D+                      +D+PDGVKPIGCKWIYK++RG D  VQ FKARLVAKGYTQVEGVDYEETFSPVAM                  
Subjt:  VNKDRQ---------------------MDKPDGVKPIGCKWIYKKRRGVDKNVQIFKARLVAKGYTQVEGVDYEETFSPVAM------------------

Query:  ------TTFLNGNLDENIYMEQLIGFIEPGQEQKVCRLKRSIYGLKQASRSWNKRFDEPIKSFGFDPNDDEPCVYKKIINSSVAFLVLYVDDILLIRNDV
              T FLNGNL+E IYM+Q  GFI PGQEQK+C+L RSIYGLKQASRSWN RFD  IKS+GFD   DEPCVYK+IIN SVAFLVLYVDDILLI ND+
Subjt:  ------TTFLNGNLDENIYMEQLIGFIEPGQEQKVCRLKRSIYGLKQASRSWNKRFDEPIKSFGFDPNDDEPCVYKKIINSSVAFLVLYVDDILLIRNDV

Query:  GYLTDIKNWLATQFKMKDLGEAQFVLGIQIVQNRKNNTLALSQTSYIDKVLLRFKMQESKK---------------CPKTPQGVEDMRRVPYASVVGSLM
        G LTDIK WLATQF+MKDLGEAQFVLGIQI ++RKN  LALSQ SYIDK+++++ MQ SK+               CPKTPQ VE+MR +PYAS VGSLM
Subjt:  GYLTDIKNWLATQFKMKDLGEAQFVLGIQIVQNRKNNTLALSQTSYIDKVLLRFKMQESKK---------------CPKTPQGVEDMRRVPYASVVGSLM

Query:  YVMLCTRPNICFIAGMVSRHQSNPGPEHWTAVKTIFKYLRRTRNYMLMYWAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGVLVWRSIK
        Y MLCTRP+IC+  G+VSR+QSNPG  HWTAVKTI KYLRRTR+Y L+Y +KDLILTGYTDSDFQT++DSRKSTS SVFTLNGG +VWRSIK
Subjt:  YVMLCTRPNICFIAGMVSRHQSNPGPEHWTAVKTIFKYLRRTRNYMLMYWAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGVLVWRSIK

A0A5A7T2V9 Gag/pol protein6.0e-21766.33Show/hide
Query:  MMSYAHLPDSFWGYAVETVGFILNNVPSKSVSETPFELWKGRKGSLCHFSIWGCPAHMLVTNPKKLDSRSKLCLFVCYTKETRGGLFYDPKEDKVLVSTN
        MMSYA LP SFWGYAVET   ILNNVPSKSVSETPFELW+GRK SL HF IWGCPAH+LVTNPKKL+ RS+LC FV Y KETRGGLF+DPKE++V VSTN
Subjt:  MMSYAHLPDSFWGYAVETVGFILNNVPSKSVSETPFELWKGRKGSLCHFSIWGCPAHMLVTNPKKLDSRSKLCLFVCYTKETRGGLFYDPKEDKVLVSTN

Query:  TIFLEEDHIRDHKPRSKVVLSELDKTIANVANKASTSTNVVYTNLSSHESPSQELSMPRRSGRVITQPDRYMGLSETQVVIPNDDCEDPLTYNQAM--VN
          FLEEDH+R+HKPRSK+VLSE       V ++   S+ V  T  S    PSQ L MPRRSGRV++QP+RY+GL+ETQVVIP+D  EDPL+Y QAM  V+
Subjt:  TIFLEEDHIRDHKPRSKVVLSELDKTIANVANKASTSTNVVYTNLSSHESPSQELSMPRRSGRVITQPDRYMGLSETQVVIPNDDCEDPLTYNQAM--VN

Query:  KDR-------------------QMDKPDGVKPIGCKWIYKKRRGVDKNVQIFKARLVAKGYTQVEGVDYEETFSPVAM----------------------
        KD+                    +D P+GVKPIGCKWIYK++R     VQ FKARLVAKGYT+ EGVDYEETFS VAM                      
Subjt:  KDR-------------------QMDKPDGVKPIGCKWIYKKRRGVDKNVQIFKARLVAKGYTQVEGVDYEETFSPVAM----------------------

Query:  --TTFLNGNLDENIYMEQLIGFIEPGQEQKVCRLKRSIYGLKQASRSWNKRFDEPIKSFGFDPNDDEPCVYKKIINSSVAFLVLYVDDILLIRNDVGYLT
          T FLNGNL+E+I+M Q  GFI  GQEQKVC+L RSIYGLKQASRSWN RFD  IKS+GFD N DEPCVYKKI    VAFLVLYVDDILLI NDVGYLT
Subjt:  --TTFLNGNLDENIYMEQLIGFIEPGQEQKVCRLKRSIYGLKQASRSWNKRFDEPIKSFGFDPNDDEPCVYKKIINSSVAFLVLYVDDILLIRNDVGYLT

Query:  DIKNWLATQFKMKDLGEAQFVLGIQIVQNRKNNTLALSQTSYIDKVLLRFKMQESKK---------------CPKTPQGVEDMRRVPYASVVGSLMYVML
        D+K WLA QF+MKDLGE Q+VLGIQI+++RKN TLALSQ +YIDK+L+R+ MQ SKK                PKTPQ VEDMRR+PYAS VGSLMY ML
Subjt:  DIKNWLATQFKMKDLGEAQFVLGIQIVQNRKNNTLALSQTSYIDKVLLRFKMQESKK---------------CPKTPQGVEDMRRVPYASVVGSLMYVML

Query:  CTRPNICFIAGMVSRHQSNPGPEHWTAVKTIFKYLRRTRNYMLMYWAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGVLVWRSIK
        CTRP+IC+  G+VSR+QSNPG +HWTAVK I KYLRRTR+YML+Y AKDLILTGYT+SDFQT+KDSRKSTSRSVFTLNGG +VWRSIK
Subjt:  CTRPNICFIAGMVSRHQSNPGPEHWTAVKTIFKYLRRTRNYMLMYWAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGVLVWRSIK

A0A5A7TZD0 Gag/pol protein7.1e-21866.5Show/hide
Query:  MMSYAHLPDSFWGYAVETVGFILNNVPSKSVSETPFELWKGRKGSLCHFSIWGCPAHMLVTNPKKLDSRSKLCLFVCYTKETRGGLFYDPKEDKVLVSTN
        MMSYA LP SFWGYAVET   ILNNVPSKSVSETPFELW+GRK SL HF IWGCPAH+LVTNPKKL+ RS+LC FV Y KETRGGLF+DP+E++V VSTN
Subjt:  MMSYAHLPDSFWGYAVETVGFILNNVPSKSVSETPFELWKGRKGSLCHFSIWGCPAHMLVTNPKKLDSRSKLCLFVCYTKETRGGLFYDPKEDKVLVSTN

Query:  TIFLEEDHIRDHKPRSKVVLSELDKTIANVANKASTSTNVVYTNLSSHESPSQELSMPRRSGRVITQPDRYMGLSETQVVIPNDDCEDPLTYNQAM--VN
          FLEEDH+R+HKPRSK+VLSE       V ++   S+ V  T  S    PSQ L MPRRSGRV++QP+RY+GL+ETQVVIP+D  EDPL+Y QAM  V+
Subjt:  TIFLEEDHIRDHKPRSKVVLSELDKTIANVANKASTSTNVVYTNLSSHESPSQELSMPRRSGRVITQPDRYMGLSETQVVIPNDDCEDPLTYNQAM--VN

Query:  KDR-------------------QMDKPDGVKPIGCKWIYKKRRGVDKNVQIFKARLVAKGYTQVEGVDYEETFSPVAM----------------------
        KD+                    +D P+GVKPIGCKWIYK++R     VQ FKARLVAKGYTQ EGVDYEETFSPVAM                      
Subjt:  KDR-------------------QMDKPDGVKPIGCKWIYKKRRGVDKNVQIFKARLVAKGYTQVEGVDYEETFSPVAM----------------------

Query:  --TTFLNGNLDENIYMEQLIGFIEPGQEQKVCRLKRSIYGLKQASRSWNKRFDEPIKSFGFDPNDDEPCVYKKIINSSVAFLVLYVDDILLIRNDVGYLT
          T FLNGNL+E+I+M Q  GFI  GQEQKVC+L RSIYGLKQASRSWN RFD  IKS+GFD N DEPCVYKKI    VAFLVLYVDDILLI NDVGYLT
Subjt:  --TTFLNGNLDENIYMEQLIGFIEPGQEQKVCRLKRSIYGLKQASRSWNKRFDEPIKSFGFDPNDDEPCVYKKIINSSVAFLVLYVDDILLIRNDVGYLT

Query:  DIKNWLATQFKMKDLGEAQFVLGIQIVQNRKNNTLALSQTSYIDKVLLRFKMQESKK---------------CPKTPQGVEDMRRVPYASVVGSLMYVML
        D+K WLA QF+MKDLGEAQ+VLGIQI+++RKN TLALSQ +YIDK+L+R+ MQ SKK                PKTPQ VEDMRR+PYAS VGSLMY ML
Subjt:  DIKNWLATQFKMKDLGEAQFVLGIQIVQNRKNNTLALSQTSYIDKVLLRFKMQESKK---------------CPKTPQGVEDMRRVPYASVVGSLMYVML

Query:  CTRPNICFIAGMVSRHQSNPGPEHWTAVKTIFKYLRRTRNYMLMYWAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGVLVWRSIK
        CTRP+IC+  G+VSR+QSNPG +HWTAVK + KYLRRTR+YML+Y AKDLILTGYTDSDFQT+KDSRKSTS SVFTLNGG +VWRSIK
Subjt:  CTRPNICFIAGMVSRHQSNPGPEHWTAVKTIFKYLRRTRNYMLMYWAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGVLVWRSIK

A0A5A7UYE8 Gag/pol protein7.1e-21866.5Show/hide
Query:  MMSYAHLPDSFWGYAVETVGFILNNVPSKSVSETPFELWKGRKGSLCHFSIWGCPAHMLVTNPKKLDSRSKLCLFVCYTKETRGGLFYDPKEDKVLVSTN
        MMSYA LP SFWGYAVET   ILNNVPSKSVSETPFELW+GRK SL HF IWGCPAH+LVTNPKKL+ RS+LC FV Y KETRGGLF+DP+E++V VSTN
Subjt:  MMSYAHLPDSFWGYAVETVGFILNNVPSKSVSETPFELWKGRKGSLCHFSIWGCPAHMLVTNPKKLDSRSKLCLFVCYTKETRGGLFYDPKEDKVLVSTN

Query:  TIFLEEDHIRDHKPRSKVVLSELDKTIANVANKASTSTNVVYTNLSSHESPSQELSMPRRSGRVITQPDRYMGLSETQVVIPNDDCEDPLTYNQAM--VN
          FLEEDH+R+HKPRSK+VLSE       V ++   S+ V  T  S    PSQ L MPRRSGRV++QP+RY+GL+ETQVVIP+D  EDPL+Y QAM  V+
Subjt:  TIFLEEDHIRDHKPRSKVVLSELDKTIANVANKASTSTNVVYTNLSSHESPSQELSMPRRSGRVITQPDRYMGLSETQVVIPNDDCEDPLTYNQAM--VN

Query:  KDR-------------------QMDKPDGVKPIGCKWIYKKRRGVDKNVQIFKARLVAKGYTQVEGVDYEETFSPVAM----------------------
        KD+                    +D P+GVKPIGCKWIYK++R     VQ FKARLVAKGYTQ EGVDYEETFSPVAM                      
Subjt:  KDR-------------------QMDKPDGVKPIGCKWIYKKRRGVDKNVQIFKARLVAKGYTQVEGVDYEETFSPVAM----------------------

Query:  --TTFLNGNLDENIYMEQLIGFIEPGQEQKVCRLKRSIYGLKQASRSWNKRFDEPIKSFGFDPNDDEPCVYKKIINSSVAFLVLYVDDILLIRNDVGYLT
          T FLNGNL+E+I+M Q  GFI  GQEQKVC+L RSIYGLKQASRSWN RFD  IKS+GFD N DEPCVYKKI    VAFLVLYVDDILLI NDVGYLT
Subjt:  --TTFLNGNLDENIYMEQLIGFIEPGQEQKVCRLKRSIYGLKQASRSWNKRFDEPIKSFGFDPNDDEPCVYKKIINSSVAFLVLYVDDILLIRNDVGYLT

Query:  DIKNWLATQFKMKDLGEAQFVLGIQIVQNRKNNTLALSQTSYIDKVLLRFKMQESKK---------------CPKTPQGVEDMRRVPYASVVGSLMYVML
        D+K WLA QF+MKDLGEAQ+VLGIQI+++RKN TLALSQ +YIDK+L+R+ MQ SKK                PKTPQ VEDMRR+PYAS VGSLMY ML
Subjt:  DIKNWLATQFKMKDLGEAQFVLGIQIVQNRKNNTLALSQTSYIDKVLLRFKMQESKK---------------CPKTPQGVEDMRRVPYASVVGSLMYVML

Query:  CTRPNICFIAGMVSRHQSNPGPEHWTAVKTIFKYLRRTRNYMLMYWAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGVLVWRSIK
        CTRP+IC+  G+VSR+QSNPG +HWTAVK + KYLRRTR+YML+Y AKDLILTGYTDSDFQT+KDSRKSTS SVFTLNGG +VWRSIK
Subjt:  CTRPNICFIAGMVSRHQSNPGPEHWTAVKTIFKYLRRTRNYMLMYWAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGVLVWRSIK

E2GK51 Gag/pol protein (Fragment)8.7e-21665.87Show/hide
Query:  MMSYAHLPDSFWGYAVETVGFILNNVPSKSVSETPFELWKGRKGSLCHFSIWGCPAHMLVTNPKKLDSRSKLCLFVCYTKETRGGLFYDPKEDKVLVSTN
        MMSYA LPDSFWGYA+ET   ILNNVPSKSV ETP+ELWKGRK SL +F IWGCPAH+LV NPKKL+ RSKLCLFV Y KE+RGGLFY P+E+KV VSTN
Subjt:  MMSYAHLPDSFWGYAVETVGFILNNVPSKSVSETPFELWKGRKGSLCHFSIWGCPAHMLVTNPKKLDSRSKLCLFVCYTKETRGGLFYDPKEDKVLVSTN

Query:  TIFLEEDHIRDHKPRSKVVLSELDKTIANVANKASTSTNVV-YTNLSSHESPSQELSMPRRSGRVITQPDRYMGLSETQVVIPNDDCEDPLTYNQAMVNK
          FLEEDH R+H+PRSK+VL E+ K   N  +K S+ST VV   N+S     SQEL +PRRSGRV+ QP+RY+GL ETQ++IP+D  EDPLTY QAM + 
Subjt:  TIFLEEDHIRDHKPRSKVVLSELDKTIANVANKASTSTNVV-YTNLSSHESPSQELSMPRRSGRVITQPDRYMGLSETQVVIPNDDCEDPLTYNQAMVNK

Query:  DRQ---------------------MDKPDGVKPIGCKWIYKKRRGVDKNVQIFKARLVAKGYTQVEGVDYEETFSPVAM---------------------
        DR                      +D P  VKPIGCKWIYK++R     VQ FKARLVAKGYTQ EGVDYEETFSPVAM                     
Subjt:  DRQ---------------------MDKPDGVKPIGCKWIYKKRRGVDKNVQIFKARLVAKGYTQVEGVDYEETFSPVAM---------------------

Query:  ---TTFLNGNLDENIYMEQLIGFIEPGQEQKVCRLKRSIYGLKQASRSWNKRFDEPIKSFGFDPNDDEPCVYKKIINSSVAFLVLYVDDILLIRNDVGYL
           T FLNGNL+E+IYM Q  GFI   QEQKVC+L++SIYGLKQASRSWN RFD  IKS+GF+ N DEPCVYKKI+NS VAFL+LYVDDILLI NDV YL
Subjt:  ---TTFLNGNLDENIYMEQLIGFIEPGQEQKVCRLKRSIYGLKQASRSWNKRFDEPIKSFGFDPNDDEPCVYKKIINSSVAFLVLYVDDILLIRNDVGYL

Query:  TDIKNWLATQFKMKDLGEAQFVLGIQIVQNRKNNTLALSQTSYIDKVLLRFKMQESKK---------------CPKTPQGVEDMRRVPYASVVGSLMYVM
        TD+K WL TQF+MKDLGEAQ++LGIQIV+NRKN TLA+SQ SYIDKVL R+KMQ SKK               CPKTPQ VEDMR +PY+S VGSLMY M
Subjt:  TDIKNWLATQFKMKDLGEAQFVLGIQIVQNRKNNTLALSQTSYIDKVLLRFKMQESKK---------------CPKTPQGVEDMRRVPYASVVGSLMYVM

Query:  LCTRPNICFIAGMVSRHQSNPGPEHWTAVKTIFKYLRRTRNYMLMYWAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGVLVWRSIK
        LCTRP+IC+  G+VSR+QSNPG +HWTAVK I KYLRRTRNYML+Y AKDLILTGYTDSDFQ++KD+RKSTS SVFTLNGG +VWRS+K
Subjt:  LCTRPNICFIAGMVSRHQSNPGPEHWTAVKTIFKYLRRTRNYMLMYWAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGVLVWRSIK

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.4e-5026.35Show/hide
Query:  MMSYAHLPDSFWGYAVETVGFILNNVPSKSV---SETPFELWKGRKGSLCHFSIWGCPAHMLVTNPK-KLDSRSKLCLFVCYTKETRGGLFYDPKEDKVL
        M+S A L  SFWG AV T  +++N +PS+++   S+TP+E+W  +K  L H  ++G   ++ + N + K D +S   +FV Y  E  G   +D   +K +
Subjt:  MMSYAHLPDSFWGYAVETVGFILNNVPSKSV---SETPFELWKGRKGSLCHFSIWGCPAHMLVTNPK-KLDSRSKLCLFVCYTKETRGGLFYDPKEDKVL

Query:  VS-------TN----------TIFLEEDHIRDHK----PRSKVVLSEL--------------------DKTIANVANKA--------------------S
        V+       TN          T+FL++    ++K       K++ +E                     +K   N + K                     S
Subjt:  VS-------TN----------TIFLEEDHIRDHK----PRSKVVLSEL--------------------DKTIANVANKA--------------------S

Query:  TSTNVVYTNLS--------------------SHESPSQE------LSMP----------RRSGRVITQPDRYMGLSETQV------------VIPNDDCE
          +N  + N S                    S ES + E      +  P          RRS R+ T+P       +  +             +PN   E
Subjt:  TSTNVVYTNLS--------------------SHESPSQE------LSMP----------RRSGRVITQPDRYMGLSETQV------------VIPNDDCE

Query:  -----DPLTYNQAM--------VNKDRQMDK-PDGVKPIGCKWIYKKRRGVDKNVQIFKARLVAKGYTQVEGVDYEETFSPVA-----------------
             D  ++ +A+        +N    + K P+    +  +W++  +     N   +KARLVA+G+TQ   +DYEETF+PVA                 
Subjt:  -----DPLTYNQAM--------VNKDRQMDK-PDGVKPIGCKWIYKKRRGVDKNVQIFKARLVAKGYTQVEGVDYEETFSPVA-----------------

Query:  -------MTTFLNGNLDENIYMEQLIGFIEPGQEQKVCRLKRSIYGLKQASRSWNKRFDEPIKSFGFDPNDDEPCVY---KKIINSSVAFLVLYVDDILL
                T FLNG L E IYM    G         VC+L ++IYGLKQA+R W + F++ +K   F  +  + C+Y   K  IN ++ +++LYVDD+++
Subjt:  -------MTTFLNGNLDENIYMEQLIGFIEPGQEQKVCRLKRSIYGLKQASRSWNKRFDEPIKSFGFDPNDDEPCVY---KKIINSSVAFLVLYVDDILL

Query:  IRNDVGYLTDIKNWLATQFKMKDLGEAQFVLGIQIVQNRKNNTLALSQTSYIDKVLLRFKMQESKKCPKTP---------QGVEDMRRVPYASVVGSLMY
           D+  + + K +L  +F+M DL E +  +GI+I    + + + LSQ++Y+ K+L +F M E+     TP            ++    P  S++G LMY
Subjt:  IRNDVGYLTDIKNWLATQFKMKDLGEAQFVLGIQIVQNRKNNTLALSQTSYIDKVLLRFKMQESKKCPKTP---------QGVEDMRRVPYASVVGSLMY

Query:  VMLCTRPNICFIAGMVSRHQSNPGPEHWTAVKTIFKYLRRTRNYMLMY---WAKDLILTGYTDSDFQTNKDSRKSTSRSVFTL
        +MLCTRP++     ++SR+ S    E W  +K + +YL+ T +  L++    A +  + GY DSD+  ++  RKST+  +F +
Subjt:  VMLCTRPNICFIAGMVSRHQSNPGPEHWTAVKTIFKYLRRTRNYMLMY---WAKDLILTGYTDSDFQTNKDSRKSTSRSVFTL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.3e-9335.32Show/hide
Query:  MMSYAHLPDSFWGYAVETVGFILNNVPSKSVS-ETPFELWKGRKGSLCHFSIWGCP--AHMLVTNPKKLDSRSKLCLFVCYTKETRGGLFYDPKEDKVLV
        M+  A LP SFWG AV+T  +++N  PS  ++ E P  +W  ++ S  H  ++GC   AH+      KLD +S  C+F+ Y  E  G   +DP + KV+ 
Subjt:  MMSYAHLPDSFWGYAVETVGFILNNVPSKSVS-ETPFELWKGRKGSLCHFSIWGCP--AHMLVTNPKKLDSRSKLCLFVCYTKETRGGLFYDPKEDKVLV

Query:  STNTIFLEEDHIRDHKPRSKVVLSELDKT--IANVANKASTSTNVVYTNLSS----------------------------HESPSQELSMP-RRSGRVIT
        S + +F      R+ + R+   +SE  K   I N     STS N      ++                            H +  +E   P RRS R   
Subjt:  STNTIFLEEDHIRDHKPRSKVVLSELDKT--IANVANKASTSTNVVYTNLSS----------------------------HESPSQELSMP-RRSGRVIT

Query:  QPDRYMGLSETQVVIPNDDCE-----DPLTY---NQAMVNKDRQMDK------------PDGVKPIGCKWIYKKRRGVDKNVQIFKARLVAKGYTQVEGV
        +  RY     T+ V+ +DD E     + L++   NQ M     +M+             P G +P+ CKW++K ++  D  +  +KARLV KG+ Q +G+
Subjt:  QPDRYMGLSETQVVIPNDDCE-----DPLTY---NQAMVNKDRQMDK------------PDGVKPIGCKWIYKKRRGVDKNVQIFKARLVAKGYTQVEGV

Query:  DYEETFSPV------------------------AMTTFLNGNLDENIYMEQLIGFIEPGQEQKVCRLKRSIYGLKQASRSWNKRFDEPIKSFGFDPNDDE
        D++E FSPV                          T FL+G+L+E IYMEQ  GF   G++  VC+L +S+YGLKQA R W  +FD  +KS  +     +
Subjt:  DYEETFSPV------------------------AMTTFLNGNLDENIYMEQLIGFIEPGQEQKVCRLKRSIYGLKQASRSWNKRFDEPIKSFGFDPNDDE

Query:  PCVY-KKIINSSVAFLVLYVDDILLIRNDVGYLTDIKNWLATQFKMKDLGEAQFVLGIQIVQNRKNNTLALSQTSYIDKVLLRFKMQES-----------
        PCVY K+   ++   L+LYVDD+L++  D G +  +K  L+  F MKDLG AQ +LG++IV+ R +  L LSQ  YI++VL RF M+ +           
Subjt:  PCVY-KKIINSSVAFLVLYVDDILLIRNDVGYLTDIKNWLATQFKMKDLGEAQFVLGIQIVQNRKNNTLALSQTSYIDKVLLRFKMQES-----------

Query:  ----KKCPKTPQGVEDMRRVPYASVVGSLMYVMLCTRPNICFIAGMVSRHQSNPGPEHWTAVKTIFKYLRRTRNYMLMYWAKDLILTGYTDSDFQTNKDS
            K CP T +   +M +VPY+S VGSLMY M+CTRP+I    G+VSR   NPG EHW AVK I +YLR T    L +   D IL GYTD+D   + D+
Subjt:  ----KKCPKTPQGVEDMRRVPYASVVGSLMYVMLCTRPNICFIAGMVSRHQSNPGPEHWTAVKTIFKYLRRTRNYMLMYWAKDLILTGYTDSDFQTNKDS

Query:  RKSTSRSVFTLNGGVLVWRS
        RKS++  +FT +GG + W+S
Subjt:  RKSTSRSVFTLNGGVLVWRS

P25600 Putative transposon Ty5-1 protein YCL074W5.9e-2830.39Show/hide
Query:  TTFLNGNLDENIYMEQLIGFIEPGQEQKVCRLKRSIYGLKQASRSWNKRFDEPIKSFGFDPNDDEPCVYKKIINSSVAFLVLYVDDILLIRNDVGYLTDI
        T FLN  +DE IY++Q  GF+       V  L   +YGLKQA   WN+  +  +K  GF  ++ E  +Y +  +    ++ +YVDD+L+          +
Subjt:  TTFLNGNLDENIYMEQLIGFIEPGQEQKVCRLKRSIYGLKQASRSWNKRFDEPIKSFGFDPNDDEPCVYKKIINSSVAFLVLYVDDILLIRNDVGYLTDI

Query:  KNWLATQFKMKDLGEAQFVLGIQIVQNRKNNTLALSQTSYIDKV-----LLRFKMQESKKCPKTP------QGVEDMRRVPYASVVGSLMYVMLCTRPNI
        K  L   + MKDLG+    LG+ I Q+  N  + LS   YI K      +  FK+ ++  C   P        ++D+   PY S+VG L++     RP+I
Subjt:  KNWLATQFKMKDLGEAQFVLGIQIVQNRKNNTLALSQTSYIDKV-----LLRFKMQESKKCPKTP------QGVEDMRRVPYASVVGSLMYVMLCTRPNI

Query:  CFIAGMVSRHQSNPGPEHWTAVKTIFKYLRRTRNYMLMYWA-KDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGVLVWRSIK
         +   ++SR    P   H  + + + +YL  TR+  L Y +   L LT Y D+      D   ST   V  L G  + W S K
Subjt:  CFIAGMVSRHQSNPGPEHWTAVKTIFKYLRRTRNYMLMYWA-KDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGVLVWRSIK

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE16.5e-4332.77Show/hide
Query:  PDGVKPIGCKWIYKKRRGVDKNVQIFKARLVAKGYTQVEGVDYEETFSPVAMTT------------------------FLNGNLDENIYMEQLIGFIEPG
        P  V  +GC+WI+ K+   D ++  +KARLVAKGY Q  G+DY ETFSPV  +T                        FL G L +++YM Q  GFI+  
Subjt:  PDGVKPIGCKWIYKKRRGVDKNVQIFKARLVAKGYTQVEGVDYEETFSPVAMTT------------------------FLNGNLDENIYMEQLIGFIEPG

Query:  QEQKVCRLKRSIYGLKQASRSWNKRFDEPIKSFGFDPNDDEPCVYKKIINSSVAFLVLYVDDILLIRNDVGYLTDIKNWLATQFKMKDLGEAQFVLGIQI
        +   VC+L++++YGLKQA R+W       + + GF  +  +  ++      S+ ++++YVDDIL+  ND   L +  + L+ +F +KD  E  + LGI+ 
Subjt:  QEQKVCRLKRSIYGLKQASRSWNKRFDEPIKSFGFDPNDDEPCVYKKIINSSVAFLVLYVDDILLIRNDVGYLTDIKNWLATQFKMKDLGEAQFVLGIQI

Query:  VQNRKNNTLALSQTSYIDKVLLRFKMQESK--KCPKTP-------QGVEDMRRVPYASVVGSLMYVMLCTRPNICFIAGMVSRHQSNPGPEHWTAVKTIF
           R    L LSQ  YI  +L R  M  +K    P  P        G +      Y  +VGSL Y+   TRP+I +    +S+    P  EH  A+K I 
Subjt:  VQNRKNNTLALSQTSYIDKVLLRFKMQESK--KCPKTP-------QGVEDMRRVPYASVVGSLMYVMLCTRPNICFIAGMVSRHQSNPGPEHWTAVKTIF

Query:  KYLRRTRNY-MLMYWAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGVLVWRSIK
        +YL  T N+ + +     L L  Y+D+D+  +KD   ST+  +  L    + W S K
Subjt:  KYLRRTRNY-MLMYWAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGVLVWRSIK

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.6e-4131.09Show/hide
Query:  PDGVKPIGCKWIYKKRRGVDKNVQIFKARLVAKGYTQVEGVDYEETFSPVAMTT------------------------FLNGNLDENIYMEQLIGFIEPG
        P  V  +GC+WI+ K+   D ++  +KARLVAKGY Q  G+DY ETFSPV  +T                        FL G L + +YM Q  GF++  
Subjt:  PDGVKPIGCKWIYKKRRGVDKNVQIFKARLVAKGYTQVEGVDYEETFSPVAMTT------------------------FLNGNLDENIYMEQLIGFIEPG

Query:  QEQKVCRLKRSIYGLKQASRSWNKRFDEPIKSFGFDPNDDEPCVYKKIINSSVAFLVLYVDDILLIRNDVGYLTDIKNWLATQFKMKDLGEAQFVLGIQI
        +   VCRL+++IYGLKQA R+W       + + GF  +  +  ++      S+ ++++YVDDIL+  ND   L    + L+ +F +K+  +  + LGI+ 
Subjt:  QEQKVCRLKRSIYGLKQASRSWNKRFDEPIKSFGFDPNDDEPCVYKKIINSSVAFLVLYVDDILLIRNDVGYLTDIKNWLATQFKMKDLGEAQFVLGIQI

Query:  VQNRKNNTLALSQTSYIDKVLLRFKMQESK-------KCPKTP--QGVEDMRRVPYASVVGSLMYVMLCTRPNICFIAGMVSRHQSNPGPEHWTAVKTIF
           R    L LSQ  Y   +L R  M  +K         PK     G +      Y  +VGSL Y+   TRP++ +    +S++   P  +HW A+K + 
Subjt:  VQNRKNNTLALSQTSYIDKVLLRFKMQESK-------KCPKTP--QGVEDMRRVPYASVVGSLMYVMLCTRPNICFIAGMVSRHQSNPGPEHWTAVKTIF

Query:  KYLRRTRNY-MLMYWAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGVLVWRSIK
        +YL  T ++ + +     L L  Y+D+D+  + D   ST+  +  L    + W S K
Subjt:  KYLRRTRNY-MLMYWAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGVLVWRSIK

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.4e-4533.24Show/hide
Query:  PDGVKPIGCKWIYKKRRGVDKNVQIFKARLVAKGYTQVEGVDYEETFSPVAMTT------------------------FLNGNLDENIYMEQLIGFI---
        P   KPIGCKW+YK +   D  ++ +KARLVAKGYTQ EG+D+ ETFSPV   T                        FLNG+LDE IYM+   G+    
Subjt:  PDGVKPIGCKWIYKKRRGVDKNVQIFKARLVAKGYTQVEGVDYEETFSPVAMTT------------------------FLNGNLDENIYMEQLIGFI---

Query:  -EPGQEQKVCRLKRSIYGLKQASRSWNKRFDEPIKSFGFDPNDDEPCVYKKIINSSVAFLVLYVDDILLIRNDVGYLTDIKNWLATQFKMKDLGEAQFVL
         +      VC LK+SIYGLKQASR W  +F   +  FGF  +  +   + KI  +    +++YVDDI++  N+   + ++K+ L + FK++DLG  ++ L
Subjt:  -EPGQEQKVCRLKRSIYGLKQASRSWNKRFDEPIKSFGFDPNDDEPCVYKKIINSSVAFLVLYVDDILLIRNDVGYLTDIKNWLATQFKMKDLGEAQFVL

Query:  GIQIVQNRKNNTLALSQTSY----IDKV-LLRFKMQESKKCPKTP----QGVEDMRRVPYASVVGSLMYVMLCTRPNICFIAGMVSRHQSNPGPEHWTAV
        G++I ++     + + Q  Y    +D+  LL  K       P        G + +    Y  ++G LMY+ + TR +I F    +S+    P   H  AV
Subjt:  GIQIVQNRKNNTLALSQTSY----IDKV-LLRFKMQESKKCPKTP----QGVEDMRRVPYASVVGSLMYVMLCTRPNICFIAGMVSRHQSNPGPEHWTAV

Query:  KTIFKYLRRTRNYMLMYWAK-DLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGVLVWRSIK
          I  Y++ T    L Y ++ ++ L  ++D+ FQ+ KD+R+ST+     L   ++ W+S K
Subjt:  KTIFKYLRRTRNYMLMYWAK-DLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGVLVWRSIK

ATMG00810.1 DNA/RNA polymerases superfamily protein4.2e-1328.43Show/hide
Query:  FLVLYVDDILLIRNDVGYLTDIKNWLATQFKMKDLGEAQFVLGIQIVQNRKNNTLALSQTSYIDKVL--------------LRFKMQESKKCPKTPQGVE
        +L+LYVDDILL  +    L  +   L++ F MKDLG   + LGIQI  +     L LSQT Y +++L              L  K+  S    K P   +
Subjt:  FLVLYVDDILLIRNDVGYLTDIKNWLATQFKMKDLGEAQFVLGIQIVQNRKNNTLALSQTSYIDKVL--------------LRFKMQESKKCPKTPQGVE

Query:  DMRRVPYASVVGSLMYVMLCTRPNICFIAGMVSRHQSNPGPEHWTAVKTIFKYLRRTRNY-MLMYWAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGG
              + S+VG+L Y+ L TRP+I +   +V +    P    +  +K + +Y++ T  + + ++    L +  + DSD+     +R+ST+     L   
Subjt:  DMRRVPYASVVGSLMYVMLCTRPNICFIAGMVSRHQSNPGPEHWTAVKTIFKYLRRTRNY-MLMYWAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGG

Query:  VLVW
        ++ W
Subjt:  VLVW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)4.5e-0748.94Show/hide
Query:  IGCKWIYKKRRGVDKNVQIFKARLVAKGYTQVEGVDYEETFSPVAMT
        +GCKW++K +   D  +   KARLVAKG+ Q EG+ + ET+SPV  T
Subjt:  IGCKWIYKKRRGVDKNVQIFKARLVAKGYTQVEGVDYEETFSPVAMT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGAGTTATGCTCATCTCCCTGATTCCTTTTGGGGTTACGCAGTAGAGACTGTGGGATTCATTTTGAACAATGTGCCATCAAAAAGTGTATCTGAAACACCTTTTGA
GCTCTGGAAAGGACGTAAAGGTAGTTTATGTCATTTCAGTATTTGGGGTTGTCCAGCACATATGCTTGTGACAAATCCAAAGAAGTTGGATTCACGTTCAAAATTGTGCC
TATTCGTATGCTACACAAAAGAGACAAGAGGTGGATTATTCTATGATCCTAAGGAAGATAAGGTACTTGTGTCGACAAATACCATTTTCTTAGAGGAGGACCACATCAGG
GACCACAAACCAAGAAGTAAAGTTGTGTTAAGTGAGTTAGACAAAACAATAGCAAATGTTGCTAATAAAGCTAGTACGTCAACAAATGTTGTTTATACTAACTTGTCTAG
TCATGAGAGTCCATCTCAAGAGTTGAGTATGCCTCGACGTAGTGGGAGGGTTATAACACAACCTGACCGTTACATGGGTTTATCTGAAACCCAAGTTGTTATACCAAATG
ACGATTGTGAGGATCCATTGACTTATAATCAAGCAATGGTTAACAAAGACAGACAAATGGATAAACCTGATGGGGTAAAACCTATAGGTTGTAAGTGGATCTACAAGAAA
AGGCGTGGTGTAGATAAAAATGTGCAAATCTTTAAAGCTAGACTAGTAGCAAAGGGTTATACCCAGGTTGAAGGGGTTGACTATGAGGAAACCTTTTCACCTGTTGCTAT
GACAACATTCTTGAATGGCAATCTTGACGAGAACATCTACATGGAACAACTCATAGGGTTCATTGAACCAGGACAAGAGCAAAAGGTTTGCAGGCTTAAAAGGTCAATCT
ATGGATTGAAACAAGCCTCTAGGTCTTGGAATAAAAGATTTGATGAGCCAATCAAATCTTTTGGCTTTGATCCGAATGATGATGAACCTTGTGTCTACAAGAAAATTATC
AATAGTTCTGTCGCATTCCTAGTTCTATATGTGGATGATATCCTACTCATTCGGAATGATGTAGGTTATCTTACTGACATTAAGAATTGGCTAGCTACGCAATTCAAAAT
GAAAGATTTGGGTGAAGCGCAGTTTGTTCTTGGGATCCAGATTGTCCAGAACCGCAAGAATAATACACTAGCCTTGTCTCAGACATCTTACATCGACAAAGTGTTGTTGA
GGTTTAAGATGCAAGAGTCCAAAAAGTGTCCTAAGACACCTCAAGGAGTTGAGGACATGAGACGGGTTCCTTATGCATCAGTTGTTGGGAGCCTGATGTACGTCATGTTG
TGTACTAGGCCAAACATATGTTTTATAGCTGGAATGGTTAGTAGACATCAATCCAATCCAGGACCCGAACACTGGACAGCGGTTAAAACGATCTTTAAGTATCTTCGGAG
AACAAGGAACTACATGCTTATGTATTGGGCTAAAGATTTGATCCTTACAGGATACACGGATTCAGACTTTCAAACTAATAAAGATTCTCGAAAATCTACATCAAGGTCAG
TATTTACTCTTAACGGAGGAGTTTTAGTATGGCGAAGCATCAAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGATGAGTTATGCTCATCTCCCTGATTCCTTTTGGGGTTACGCAGTAGAGACTGTGGGATTCATTTTGAACAATGTGCCATCAAAAAGTGTATCTGAAACACCTTTTGA
GCTCTGGAAAGGACGTAAAGGTAGTTTATGTCATTTCAGTATTTGGGGTTGTCCAGCACATATGCTTGTGACAAATCCAAAGAAGTTGGATTCACGTTCAAAATTGTGCC
TATTCGTATGCTACACAAAAGAGACAAGAGGTGGATTATTCTATGATCCTAAGGAAGATAAGGTACTTGTGTCGACAAATACCATTTTCTTAGAGGAGGACCACATCAGG
GACCACAAACCAAGAAGTAAAGTTGTGTTAAGTGAGTTAGACAAAACAATAGCAAATGTTGCTAATAAAGCTAGTACGTCAACAAATGTTGTTTATACTAACTTGTCTAG
TCATGAGAGTCCATCTCAAGAGTTGAGTATGCCTCGACGTAGTGGGAGGGTTATAACACAACCTGACCGTTACATGGGTTTATCTGAAACCCAAGTTGTTATACCAAATG
ACGATTGTGAGGATCCATTGACTTATAATCAAGCAATGGTTAACAAAGACAGACAAATGGATAAACCTGATGGGGTAAAACCTATAGGTTGTAAGTGGATCTACAAGAAA
AGGCGTGGTGTAGATAAAAATGTGCAAATCTTTAAAGCTAGACTAGTAGCAAAGGGTTATACCCAGGTTGAAGGGGTTGACTATGAGGAAACCTTTTCACCTGTTGCTAT
GACAACATTCTTGAATGGCAATCTTGACGAGAACATCTACATGGAACAACTCATAGGGTTCATTGAACCAGGACAAGAGCAAAAGGTTTGCAGGCTTAAAAGGTCAATCT
ATGGATTGAAACAAGCCTCTAGGTCTTGGAATAAAAGATTTGATGAGCCAATCAAATCTTTTGGCTTTGATCCGAATGATGATGAACCTTGTGTCTACAAGAAAATTATC
AATAGTTCTGTCGCATTCCTAGTTCTATATGTGGATGATATCCTACTCATTCGGAATGATGTAGGTTATCTTACTGACATTAAGAATTGGCTAGCTACGCAATTCAAAAT
GAAAGATTTGGGTGAAGCGCAGTTTGTTCTTGGGATCCAGATTGTCCAGAACCGCAAGAATAATACACTAGCCTTGTCTCAGACATCTTACATCGACAAAGTGTTGTTGA
GGTTTAAGATGCAAGAGTCCAAAAAGTGTCCTAAGACACCTCAAGGAGTTGAGGACATGAGACGGGTTCCTTATGCATCAGTTGTTGGGAGCCTGATGTACGTCATGTTG
TGTACTAGGCCAAACATATGTTTTATAGCTGGAATGGTTAGTAGACATCAATCCAATCCAGGACCCGAACACTGGACAGCGGTTAAAACGATCTTTAAGTATCTTCGGAG
AACAAGGAACTACATGCTTATGTATTGGGCTAAAGATTTGATCCTTACAGGATACACGGATTCAGACTTTCAAACTAATAAAGATTCTCGAAAATCTACATCAAGGTCAG
TATTTACTCTTAACGGAGGAGTTTTAGTATGGCGAAGCATCAAGTAG
Protein sequenceShow/hide protein sequence
MMSYAHLPDSFWGYAVETVGFILNNVPSKSVSETPFELWKGRKGSLCHFSIWGCPAHMLVTNPKKLDSRSKLCLFVCYTKETRGGLFYDPKEDKVLVSTNTIFLEEDHIR
DHKPRSKVVLSELDKTIANVANKASTSTNVVYTNLSSHESPSQELSMPRRSGRVITQPDRYMGLSETQVVIPNDDCEDPLTYNQAMVNKDRQMDKPDGVKPIGCKWIYKK
RRGVDKNVQIFKARLVAKGYTQVEGVDYEETFSPVAMTTFLNGNLDENIYMEQLIGFIEPGQEQKVCRLKRSIYGLKQASRSWNKRFDEPIKSFGFDPNDDEPCVYKKII
NSSVAFLVLYVDDILLIRNDVGYLTDIKNWLATQFKMKDLGEAQFVLGIQIVQNRKNNTLALSQTSYIDKVLLRFKMQESKKCPKTPQGVEDMRRVPYASVVGSLMYVML
CTRPNICFIAGMVSRHQSNPGPEHWTAVKTIFKYLRRTRNYMLMYWAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGVLVWRSIK