; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0021993 (gene) of Snake gourd v1 genome

Gene IDTan0021993
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG09:26493386..26496306
RNA-Seq ExpressionTan0021993
SyntenyTan0021993
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]3.1e-28566.33Show/hide
Query:  DDYSRYGYLYLMHHKSEALEKFKEYKAEVENALRKTIKTLRSDRGGEYMDFRFQDYMIEHGIKSKLSAPNTPQQ--------------------------
        DDYSRYGYLYLM HKSEALEKFKEYK EVEN L K IK LRSDRGGEYMD RFQDYMIEHGI+S+LSAP TPQQ                          
Subjt:  DDYSRYGYLYLMHHKSEALEKFKEYKAEVENALRKTIKTLRSDRGGEYMDFRFQDYMIEHGIKSKLSAPNTPQQ--------------------------

Query:  -----IVETAVQILNTVPSKSVSEIPFELWKGHKPSLQHFRIWDCLAHCASDKPKETVISFKIMPICWLSQRNERWSFLQPTRKQGDCIDKRHFLGGRSY
              VETAV ILN VPSKSVSE PFELW+G KPSL HFRIW C AH     PK+     ++       +      F  P   +        FL     
Subjt:  -----IVETAVQILNTVPSKSVSEIPFELWKGHKPSLQHFRIWDCLAHCASDKPKETVISFKIMPICWLSQRNERWSFLQPTRKQGDCIDKRHFLGGRSY

Query:  EEPKPRSKLVLNEATDEPTRVVDQAGPSSRVDGEASTSSQSSPSQSLGMPRRSGRVISQPDRYLGLVETQVVIPDDGVEDPLSYRHAMNDVDKDQWIKAM
           KPRSKLVL+EATDE TRVVD+ GPSSRVD E +TS QS PSQSL MPRRSGRV+SQP+RYLGL ETQVVIPDDGVEDPLSY+ AMNDVDKDQW+KAM
Subjt:  EEPKPRSKLVLNEATDEPTRVVDQAGPSSRVDGEASTSSQSSPSQSLGMPRRSGRVISQPDRYLGLVETQVVIPDDGVEDPLSYRHAMNDVDKDQWIKAM

Query:  DLEIESMDFNSVWELVDQPDG-------------------------------------------------------------------------------
        DLE+ESM FNSVWELVD P+G                                                                               
Subjt:  DLEIESMDFNSVWELVDQPDG-------------------------------------------------------------------------------

Query:  ----------PEGFITQGQKRKVCKLNRSIYGLKQASRSWTIRFDTAIKSYGFDQNIDKPCVYKRIINDKVSFLVLYVDDIILIRNDVGYLTNIKNWLAT
                  PEGFITQGQ++KVCKLNRSIYGLKQASRSW IRFDTAIKSYGFDQN+D+PCVYK+I   KV+FLVLYVDDI+LI NDVGYLT++K WLA 
Subjt:  ----------PEGFITQGQKRKVCKLNRSIYGLKQASRSWTIRFDTAIKSYGFDQNIDKPCVYKRIINDKVSFLVLYVDDIILIRNDVGYLTNIKNWLAT

Query:  QFQMKYLREAQYVLRIQIFRNRENKTLALSQASYVDKILSRYSMQISKRGLLPFRHGVHLSKEQCSKTPQEVEDMRRIPYASAVGSLMYDMLCMRPDICY
        QFQMK L EAQYVL IQI R+R+NKTLALSQA+Y+DK+L RYSMQ SK+GLLPFRHGVHLSKEQ  KTPQEVEDMRRIPYASAVGSLMY MLC RPDICY
Subjt:  QFQMKYLREAQYVLRIQIFRNRENKTLALSQASYVDKILSRYSMQISKRGLLPFRHGVHLSKEQCSKTPQEVEDMRRIPYASAVGSLMYDMLCMRPDICY

Query:  VVGIVSRYQSNPGLDHWTTCKNILKYLRRTRDYTLVYGTKDLILTEYTDFDFQTDKDSRKSTSGSVFTLNGGATVWRSIKQGCIADSAIEVEYVATCEAA
         VGIVSRYQSNPGLDHWT  K +LKYLRRTRDY LVYG KDLILT YTD DFQTDKDSRKSTSGSVFTLNGGA VWRSIKQGCIADS +E EYVA CEAA
Subjt:  VVGIVSRYQSNPGLDHWTTCKNILKYLRRTRDYTLVYGTKDLILTEYTDFDFQTDKDSRKSTSGSVFTLNGGATVWRSIKQGCIADSAIEVEYVATCEAA

Query:  KEVVWLRKFLTNLEVVLNMEFPITLYCDNSGA-ANSKEPRRNKRDKHIERKYHLTREIVQREDVTVMKIASEHNIADPFTKTLTTQVFEGHLESLGLRDM
        KE VWLRKFL +LEVV NM  PITLYCDNSGA ANSKEPR +KR KHIERKYHL REIVQR DV V KIASEHNIADPFTKTLT +VFEGHLESLGLRDM
Subjt:  KEVVWLRKFLTNLEVVLNMEFPITLYCDNSGA-ANSKEPRRNKRDKHIERKYHLTREIVQREDVTVMKIASEHNIADPFTKTLTTQVFEGHLESLGLRDM

Query:  YI
        YI
Subjt:  YI

KAA0026233.1 gag/pol protein [Cucumis melo var. makuwa]3.7e-25460.45Show/hide
Query:  DDYSRYGYLYLMHHKSEALEKFKEYKAEVENALRKTIKTLRSDRGGEYMDFRFQDYMIEHGIKSKLSAPNTPQQ--------------------------
        DDYSRYGY+YLM HKSEALEKFKEYKAEVENAL KTIKT RSDRGGEYMD +FQ+Y++E GI S+LSAP+TPQQ                          
Subjt:  DDYSRYGYLYLMHHKSEALEKFKEYKAEVENALRKTIKTLRSDRGGEYMDFRFQDYMIEHGIKSKLSAPNTPQQ--------------------------

Query:  -----IVETAVQILNTVPSKSVSEIPFELWKGHKPSLQHFRIWDCLAHCASDKPKETVISFKIMPICWLSQRNERWSFLQPTRKQGDCIDKRHFLGGRSY
              V+TAV ILN VPSKSVSE P +LW GHK SL+HFRIW C AH   + PK+     K+       +      F  P   +        FL     
Subjt:  -----IVETAVQILNTVPSKSVSEIPFELWKGHKPSLQHFRIWDCLAHCASDKPKETVISFKIMPICWLSQRNERWSFLQPTRKQGDCIDKRHFLGGRSY

Query:  EEPKPRSKLVLN----EATDEPTRVVDQAGPSSRVDGEASTSSQSSPSQSLGMPRRSGRVISQPDRYLGLVETQVVIPDDGVEDPLSYRHAMNDVDKDQW
         E KPRSK+VLN    E T+  TRVV++     RV    S++    P QSL  PRRSGRV + P RY+ L ET  VI D  +EDPL+++ AM DVDKD+W
Subjt:  EEPKPRSKLVLN----EATDEPTRVVDQAGPSSRVDGEASTSSQSSPSQSLGMPRRSGRVISQPDRYLGLVETQVVIPDDGVEDPLSYRHAMNDVDKDQW

Query:  IKAMDLEIESMDFNSVWELVDQPDG---------------------------------------------------------------------------
        IKAM+LE+ESM FNSVW+LVDQPDG                                                                           
Subjt:  IKAMDLEIESMDFNSVWELVDQPDG---------------------------------------------------------------------------

Query:  --------------PEGFITQGQKRKVCKLNRSIYGLKQASRSWTIRFDTAIKSYGFDQNIDKPCVYKRIINDKVSFLVLYVDDIILIRNDVGYLTNIKN
                      PEGFI  GQ++K+CKLNRSIYGLKQASRSW IRFDTAIKSYGFDQ +D+PCVYKRIIN  V+FLVLYVDDI+LI ND+G LT+IK 
Subjt:  --------------PEGFITQGQKRKVCKLNRSIYGLKQASRSWTIRFDTAIKSYGFDQNIDKPCVYKRIINDKVSFLVLYVDDIILIRNDVGYLTNIKN

Query:  WLATQFQMKYLREAQYVLRIQIFRNRENKTLALSQASYVDKILSRYSMQISKRGLLPFRHGVHLSKEQCSKTPQEVEDMRRIPYASAVGSLMYDMLCMRP
        WLATQFQMK L EAQ+VL IQIFR+R+NK LALSQASY+DKI+ +YSMQ SKRGLLPFRHGV LSKEQC KTPQ+VE+MR IPYASAVGSLMY MLC RP
Subjt:  WLATQFQMKYLREAQYVLRIQIFRNRENKTLALSQASYVDKILSRYSMQISKRGLLPFRHGVHLSKEQCSKTPQEVEDMRRIPYASAVGSLMYDMLCMRP

Query:  DICYVVGIVSRYQSNPGLDHWTTCKNILKYLRRTRDYTLVYGTKDLILTEYTDFDFQTDKDSRKSTSGSVFTLNGGATVWRSIKQGCIADSAIEVEYVAT
        DICY VGIVSRYQSNPGL HWT  K ILKYLRRTRDYTLVYG+KDLILT YTD DFQTD+DSRKSTSGSVFTLNGGA VWRSIKQGCIADS +E EYVA 
Subjt:  DICYVVGIVSRYQSNPGLDHWTTCKNILKYLRRTRDYTLVYGTKDLILTEYTDFDFQTDKDSRKSTSGSVFTLNGGATVWRSIKQGCIADSAIEVEYVAT

Query:  CEAAKEVVWLRKFLTNLEVVLNMEFPITLYCDNSGA-ANSKEPRRNKRDKHIERKYHLTREIVQREDVTVMKIASEHNIADPFTKTLTTQVFEGHLESLG
        CEAAKE VWLR FL +LEVV NM  PITLYCDNSGA ANS+EPR +KR KHIERKYHL REIV R DV V +IAS HN+ADPFTK LT +VFEGHLESLG
Subjt:  CEAAKEVVWLRKFLTNLEVVLNMEFPITLYCDNSGA-ANSKEPRRNKRDKHIERKYHLTREIVQREDVTVMKIASEHNIADPFTKTLTTQVFEGHLESLG

Query:  LRDM
        LRDM
Subjt:  LRDM

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]1.2e-28165.71Show/hide
Query:  DDYSRYGYLYLMHHKSEALEKFKEYKAEVENALRKTIKTLRSDRGGEYMDFRFQDYMIEHGIKSKLSAPNTPQQ--------------------------
        DDYSRYGYLYLM HKSEALEKFKEYK EVEN L K IK  RSDRGGEYMD  FQDYMIEHGI+S+LSAP TPQQ                          
Subjt:  DDYSRYGYLYLMHHKSEALEKFKEYKAEVENALRKTIKTLRSDRGGEYMDFRFQDYMIEHGIKSKLSAPNTPQQ--------------------------

Query:  -----IVETAVQILNTVPSKSVSEIPFELWKGHKPSLQHFRIWDCLAHCASDKPKETVISFKIMPICWLSQRNERWSFLQPTRKQGDCIDKRHFLGGRSY
              VETAV ILN VPSKSVSE PFELW+G KPSL HFRIW C AH     PK+     ++       +      F  P   +        FL     
Subjt:  -----IVETAVQILNTVPSKSVSEIPFELWKGHKPSLQHFRIWDCLAHCASDKPKETVISFKIMPICWLSQRNERWSFLQPTRKQGDCIDKRHFLGGRSY

Query:  EEPKPRSKLVLNEATDEPTRVVDQAGPSSRVDGEASTSSQSSPSQSLGMPRRSGRVISQPDRYLGLVETQVVIPDDGVEDPLSYRHAMNDVDKDQWIKAM
           KPRSKLVL+EATDE TRVVD+ GPSSRVD E +TS QS PSQSL MPRRSGRV+SQP+RYLGL ETQVVIPDDGVEDPLSY+ AMNDVDKDQW+KAM
Subjt:  EEPKPRSKLVLNEATDEPTRVVDQAGPSSRVDGEASTSSQSSPSQSLGMPRRSGRVISQPDRYLGLVETQVVIPDDGVEDPLSYRHAMNDVDKDQWIKAM

Query:  DLEIESMDFNSVWELVDQPDG-------------------------------------------------------------------------------
        DLE+ESM FNSVWELVD P+G                                                                               
Subjt:  DLEIESMDFNSVWELVDQPDG-------------------------------------------------------------------------------

Query:  ----------PEGFITQGQKRKVCKLNRSIYGLKQASRSWTIRFDTAIKSYGFDQNIDKPCVYKRIINDKVSFLVLYVDDIILIRNDVGYLTNIKNWLAT
                  PEGFITQGQ++KVCKLNRSIYGLKQASRSW IRFDTAIKSYGFDQN+D+PCVYK+I   KV+FLVLYVDDI+LI NDVGYLT++K WLA 
Subjt:  ----------PEGFITQGQKRKVCKLNRSIYGLKQASRSWTIRFDTAIKSYGFDQNIDKPCVYKRIINDKVSFLVLYVDDIILIRNDVGYLTNIKNWLAT

Query:  QFQMKYLREAQYVLRIQIFRNRENKTLALSQASYVDKILSRYSMQISKRGLLPFRHGVHLSKEQCSKTPQEVEDMRRIPYASAVGSLMYDMLCMRPDICY
        QFQMK L E QYVL IQI R+R+NKTLALSQA+Y+DK+L RYSMQ SK+GLLPFRHGVHLSKEQ  KTPQEVEDMRRIPYASAVGSLMY MLC RPDICY
Subjt:  QFQMKYLREAQYVLRIQIFRNRENKTLALSQASYVDKILSRYSMQISKRGLLPFRHGVHLSKEQCSKTPQEVEDMRRIPYASAVGSLMYDMLCMRPDICY

Query:  VVGIVSRYQSNPGLDHWTTCKNILKYLRRTRDYTLVYGTKDLILTEYTDFDFQTDKDSRKSTSGSVFTLNGGATVWRSIKQGCIADSAIEVEYVATCEAA
         VGIVSRYQSNPGLDHWT  K ILKYLRRTRDY LVYG KDLILT YT+ DFQTDKDSRKSTS SVFTLNGGA VWRSIKQGCIADS +E EYVA CEAA
Subjt:  VVGIVSRYQSNPGLDHWTTCKNILKYLRRTRDYTLVYGTKDLILTEYTDFDFQTDKDSRKSTSGSVFTLNGGATVWRSIKQGCIADSAIEVEYVATCEAA

Query:  KEVVWLRKFLTNLEVVLNMEFPITLYCDNSGA-ANSKEPRRNKRDKHIERKYHLTREIVQREDVTVMKIASEHNIADPFTKTLTTQVFEGHLESLGLRDM
        KE VWL+KFL +LEVV NM  PITLYCDNSGA ANSKEPR +KR KHIERKYHL REIVQR DV V KIASEHNIADPFTKTLT +VFEGHLESLGLRDM
Subjt:  KEVVWLRKFLTNLEVVLNMEFPITLYCDNSGA-ANSKEPRRNKRDKHIERKYHLTREIVQREDVTVMKIASEHNIADPFTKTLTTQVFEGHLESLGLRDM

Query:  YI
        YI
Subjt:  YI

KAA0046800.1 gag/pol protein [Cucumis melo var. makuwa]5.6e-26368.93Show/hide
Query:  MHHKSEALEKFKEYKAEVENALRKTIKTLRSDRGGEYMDFRFQDYMIEHGIKSKLSAPNTPQQ-------------------------------IVETAV
        M HKSEALEKFKEYK EVEN L K IK LRSDRGGEYMD RFQDYMIEHGI+S+LSAP TPQQ                                VET V
Subjt:  MHHKSEALEKFKEYKAEVENALRKTIKTLRSDRGGEYMDFRFQDYMIEHGIKSKLSAPNTPQQ-------------------------------IVETAV

Query:  QILNTVPSKSVSEIPFELWKGHKPSLQHFRIWDCLAHCASDKPKETVISFKIMPICWLSQRNERWSFLQPTRKQGDCIDKRHFLGGRSYEEPKPRSKLVL
         ILN VPSKSVSE P +LW+G KPSL HF+IW C  H     PK      ++  +    +      F  P   +        FL      + KPRSKL+L
Subjt:  QILNTVPSKSVSEIPFELWKGHKPSLQHFRIWDCLAHCASDKPKETVISFKIMPICWLSQRNERWSFLQPTRKQGDCIDKRHFLGGRSYEEPKPRSKLVL

Query:  NEATDEPTRVVDQAGPSSRVDGEASTSSQSSPSQSLGMPRRSGRVISQPDRYLGLVETQVVIPDDGVEDPLSYRHAMNDVDKDQWIKAMDLEIESMDFNS
        NEATDE TRV D+ GPSSRVD E +TS QS PSQ L MPRRSGRV+S+P+ YLGL ETQVVIPDDG+EDPL ++ AMNDVDKDQW+KAMDLE++SM FNS
Subjt:  NEATDEPTRVVDQAGPSSRVDGEASTSSQSSPSQSLGMPRRSGRVISQPDRYLGLVETQVVIPDDGVEDPLSYRHAMNDVDKDQWIKAMDLEIESMDFNS

Query:  VWELVDQPDGPEGFITQGQKRKVCKLNRSIYGLKQASRSWTIRFDTAIKSYGFDQNIDKPCVYKRIINDKVSFLVLYVDDIILIRNDVGYLTNIKNWLAT
        +WELVD P+G EGFITQGQ++KVCKLNRSIYGLKQASRSW IRFD AIKSYGFD+N+D+PCVYK+I   KV+FLVLYVDDI+LI NDVGYLTN+K WLA 
Subjt:  VWELVDQPDGPEGFITQGQKRKVCKLNRSIYGLKQASRSWTIRFDTAIKSYGFDQNIDKPCVYKRIINDKVSFLVLYVDDIILIRNDVGYLTNIKNWLAT

Query:  QFQMKYLREAQYVLRIQIFRNRENKTLALSQASYVDKILSRYSMQISKRGLLPFRHGVHLSKEQCSKTPQEVEDMRRIPYASAVGSLMYDMLCMRPDICY
        QFQMK L EAQYVL IQI R+ +NKTLALSQA Y+DK+L RY MQ SK+GLLPFRHGVHLSKEQC KTPQEVED+RRIPYAS +GSLMY MLC RPDICY
Subjt:  QFQMKYLREAQYVLRIQIFRNRENKTLALSQASYVDKILSRYSMQISKRGLLPFRHGVHLSKEQCSKTPQEVEDMRRIPYASAVGSLMYDMLCMRPDICY

Query:  VVGIVSRYQSNPGLDHWTTCKNILKYLRRTRDYTLVYGTKDLILTEYTDFDFQTDKDSRKSTSGSVFTLNGGATVWRSIKQGCIADSAIEVEYVATCEAA
         VGIVSRYQSNPGLDHWT  K ILKYLRR RDY LVYG KDLILT Y D+DFQ DKDSRKSTSGS+FTLN  A VW SIKQGCIADS +E +YVA CEAA
Subjt:  VVGIVSRYQSNPGLDHWTTCKNILKYLRRTRDYTLVYGTKDLILTEYTDFDFQTDKDSRKSTSGSVFTLNGGATVWRSIKQGCIADSAIEVEYVATCEAA

Query:  KEVVWLRKFLTNLEVVLNMEFPITLYCDNSGA-ANSKEPRRNKRDKHIERKYHLTREIVQREDVTVMKIASEHNIADPFTKTLTTQVFEGHL
        KE  WLRKFL +LEVV NM  PIT Y DNS A ANSKEPR  KR KH ERKYHL REIVQR DV V KIASEHNI DPFTKT T +VFEGHL
Subjt:  KEVVWLRKFLTNLEVVLNMEFPITLYCDNSGA-ANSKEPRRNKRDKHIERKYHLTREIVQREDVTVMKIASEHNIADPFTKTLTTQVFEGHL

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]3.1e-28566.33Show/hide
Query:  DDYSRYGYLYLMHHKSEALEKFKEYKAEVENALRKTIKTLRSDRGGEYMDFRFQDYMIEHGIKSKLSAPNTPQQ--------------------------
        DDYSRYGYLYLM HKSEALEKFKEYK EVEN L K IK LRSDRGGEYMD RFQDYMIEHGI+S+LSAP TPQQ                          
Subjt:  DDYSRYGYLYLMHHKSEALEKFKEYKAEVENALRKTIKTLRSDRGGEYMDFRFQDYMIEHGIKSKLSAPNTPQQ--------------------------

Query:  -----IVETAVQILNTVPSKSVSEIPFELWKGHKPSLQHFRIWDCLAHCASDKPKETVISFKIMPICWLSQRNERWSFLQPTRKQGDCIDKRHFLGGRSY
              VETAV ILN VPSKSVSE PFELW+G KPSL HFRIW C AH     PK+     ++       +      F  P   +        FL     
Subjt:  -----IVETAVQILNTVPSKSVSEIPFELWKGHKPSLQHFRIWDCLAHCASDKPKETVISFKIMPICWLSQRNERWSFLQPTRKQGDCIDKRHFLGGRSY

Query:  EEPKPRSKLVLNEATDEPTRVVDQAGPSSRVDGEASTSSQSSPSQSLGMPRRSGRVISQPDRYLGLVETQVVIPDDGVEDPLSYRHAMNDVDKDQWIKAM
           KPRSKLVL+EATDE TRVVD+ GPSSRVD E +TS QS PSQSL MPRRSGRV+SQP+RYLGL ETQVVIPDDGVEDPLSY+ AMNDVDKDQW+KAM
Subjt:  EEPKPRSKLVLNEATDEPTRVVDQAGPSSRVDGEASTSSQSSPSQSLGMPRRSGRVISQPDRYLGLVETQVVIPDDGVEDPLSYRHAMNDVDKDQWIKAM

Query:  DLEIESMDFNSVWELVDQPDG-------------------------------------------------------------------------------
        DLE+ESM FNSVWELVD P+G                                                                               
Subjt:  DLEIESMDFNSVWELVDQPDG-------------------------------------------------------------------------------

Query:  ----------PEGFITQGQKRKVCKLNRSIYGLKQASRSWTIRFDTAIKSYGFDQNIDKPCVYKRIINDKVSFLVLYVDDIILIRNDVGYLTNIKNWLAT
                  PEGFITQGQ++KVCKLNRSIYGLKQASRSW IRFDTAIKSYGFDQN+D+PCVYK+I   KV+FLVLYVDDI+LI NDVGYLT++K WLA 
Subjt:  ----------PEGFITQGQKRKVCKLNRSIYGLKQASRSWTIRFDTAIKSYGFDQNIDKPCVYKRIINDKVSFLVLYVDDIILIRNDVGYLTNIKNWLAT

Query:  QFQMKYLREAQYVLRIQIFRNRENKTLALSQASYVDKILSRYSMQISKRGLLPFRHGVHLSKEQCSKTPQEVEDMRRIPYASAVGSLMYDMLCMRPDICY
        QFQMK L EAQYVL IQI R+R+NKTLALSQA+Y+DK+L RYSMQ SK+GLLPFRHGVHLSKEQ  KTPQEVEDMRRIPYASAVGSLMY MLC RPDICY
Subjt:  QFQMKYLREAQYVLRIQIFRNRENKTLALSQASYVDKILSRYSMQISKRGLLPFRHGVHLSKEQCSKTPQEVEDMRRIPYASAVGSLMYDMLCMRPDICY

Query:  VVGIVSRYQSNPGLDHWTTCKNILKYLRRTRDYTLVYGTKDLILTEYTDFDFQTDKDSRKSTSGSVFTLNGGATVWRSIKQGCIADSAIEVEYVATCEAA
         VGIVSRYQSNPGLDHWT  K +LKYLRRTRDY LVYG KDLILT YTD DFQTDKDSRKSTSGSVFTLNGGA VWRSIKQGCIADS +E EYVA CEAA
Subjt:  VVGIVSRYQSNPGLDHWTTCKNILKYLRRTRDYTLVYGTKDLILTEYTDFDFQTDKDSRKSTSGSVFTLNGGATVWRSIKQGCIADSAIEVEYVATCEAA

Query:  KEVVWLRKFLTNLEVVLNMEFPITLYCDNSGA-ANSKEPRRNKRDKHIERKYHLTREIVQREDVTVMKIASEHNIADPFTKTLTTQVFEGHLESLGLRDM
        KE VWLRKFL +LEVV NM  PITLYCDNSGA ANSKEPR +KR KHIERKYHL REIVQR DV V KIASEHNIADPFTKTLT +VFEGHLESLGLRDM
Subjt:  KEVVWLRKFLTNLEVVLNMEFPITLYCDNSGA-ANSKEPRRNKRDKHIERKYHLTREIVQREDVTVMKIASEHNIADPFTKTLTTQVFEGHLESLGLRDM

Query:  YI
        YI
Subjt:  YI

TrEMBL top hitse value%identityAlignment
A0A5A7SNP8 Gag/pol protein1.8e-25460.45Show/hide
Query:  DDYSRYGYLYLMHHKSEALEKFKEYKAEVENALRKTIKTLRSDRGGEYMDFRFQDYMIEHGIKSKLSAPNTPQQ--------------------------
        DDYSRYGY+YLM HKSEALEKFKEYKAEVENAL KTIKT RSDRGGEYMD +FQ+Y++E GI S+LSAP+TPQQ                          
Subjt:  DDYSRYGYLYLMHHKSEALEKFKEYKAEVENALRKTIKTLRSDRGGEYMDFRFQDYMIEHGIKSKLSAPNTPQQ--------------------------

Query:  -----IVETAVQILNTVPSKSVSEIPFELWKGHKPSLQHFRIWDCLAHCASDKPKETVISFKIMPICWLSQRNERWSFLQPTRKQGDCIDKRHFLGGRSY
              V+TAV ILN VPSKSVSE P +LW GHK SL+HFRIW C AH   + PK+     K+       +      F  P   +        FL     
Subjt:  -----IVETAVQILNTVPSKSVSEIPFELWKGHKPSLQHFRIWDCLAHCASDKPKETVISFKIMPICWLSQRNERWSFLQPTRKQGDCIDKRHFLGGRSY

Query:  EEPKPRSKLVLN----EATDEPTRVVDQAGPSSRVDGEASTSSQSSPSQSLGMPRRSGRVISQPDRYLGLVETQVVIPDDGVEDPLSYRHAMNDVDKDQW
         E KPRSK+VLN    E T+  TRVV++     RV    S++    P QSL  PRRSGRV + P RY+ L ET  VI D  +EDPL+++ AM DVDKD+W
Subjt:  EEPKPRSKLVLN----EATDEPTRVVDQAGPSSRVDGEASTSSQSSPSQSLGMPRRSGRVISQPDRYLGLVETQVVIPDDGVEDPLSYRHAMNDVDKDQW

Query:  IKAMDLEIESMDFNSVWELVDQPDG---------------------------------------------------------------------------
        IKAM+LE+ESM FNSVW+LVDQPDG                                                                           
Subjt:  IKAMDLEIESMDFNSVWELVDQPDG---------------------------------------------------------------------------

Query:  --------------PEGFITQGQKRKVCKLNRSIYGLKQASRSWTIRFDTAIKSYGFDQNIDKPCVYKRIINDKVSFLVLYVDDIILIRNDVGYLTNIKN
                      PEGFI  GQ++K+CKLNRSIYGLKQASRSW IRFDTAIKSYGFDQ +D+PCVYKRIIN  V+FLVLYVDDI+LI ND+G LT+IK 
Subjt:  --------------PEGFITQGQKRKVCKLNRSIYGLKQASRSWTIRFDTAIKSYGFDQNIDKPCVYKRIINDKVSFLVLYVDDIILIRNDVGYLTNIKN

Query:  WLATQFQMKYLREAQYVLRIQIFRNRENKTLALSQASYVDKILSRYSMQISKRGLLPFRHGVHLSKEQCSKTPQEVEDMRRIPYASAVGSLMYDMLCMRP
        WLATQFQMK L EAQ+VL IQIFR+R+NK LALSQASY+DKI+ +YSMQ SKRGLLPFRHGV LSKEQC KTPQ+VE+MR IPYASAVGSLMY MLC RP
Subjt:  WLATQFQMKYLREAQYVLRIQIFRNRENKTLALSQASYVDKILSRYSMQISKRGLLPFRHGVHLSKEQCSKTPQEVEDMRRIPYASAVGSLMYDMLCMRP

Query:  DICYVVGIVSRYQSNPGLDHWTTCKNILKYLRRTRDYTLVYGTKDLILTEYTDFDFQTDKDSRKSTSGSVFTLNGGATVWRSIKQGCIADSAIEVEYVAT
        DICY VGIVSRYQSNPGL HWT  K ILKYLRRTRDYTLVYG+KDLILT YTD DFQTD+DSRKSTSGSVFTLNGGA VWRSIKQGCIADS +E EYVA 
Subjt:  DICYVVGIVSRYQSNPGLDHWTTCKNILKYLRRTRDYTLVYGTKDLILTEYTDFDFQTDKDSRKSTSGSVFTLNGGATVWRSIKQGCIADSAIEVEYVAT

Query:  CEAAKEVVWLRKFLTNLEVVLNMEFPITLYCDNSGA-ANSKEPRRNKRDKHIERKYHLTREIVQREDVTVMKIASEHNIADPFTKTLTTQVFEGHLESLG
        CEAAKE VWLR FL +LEVV NM  PITLYCDNSGA ANS+EPR +KR KHIERKYHL REIV R DV V +IAS HN+ADPFTK LT +VFEGHLESLG
Subjt:  CEAAKEVVWLRKFLTNLEVVLNMEFPITLYCDNSGA-ANSKEPRRNKRDKHIERKYHLTREIVQREDVTVMKIASEHNIADPFTKTLTTQVFEGHLESLG

Query:  LRDM
        LRDM
Subjt:  LRDM

A0A5A7T2V9 Gag/pol protein5.8e-28265.71Show/hide
Query:  DDYSRYGYLYLMHHKSEALEKFKEYKAEVENALRKTIKTLRSDRGGEYMDFRFQDYMIEHGIKSKLSAPNTPQQ--------------------------
        DDYSRYGYLYLM HKSEALEKFKEYK EVEN L K IK  RSDRGGEYMD  FQDYMIEHGI+S+LSAP TPQQ                          
Subjt:  DDYSRYGYLYLMHHKSEALEKFKEYKAEVENALRKTIKTLRSDRGGEYMDFRFQDYMIEHGIKSKLSAPNTPQQ--------------------------

Query:  -----IVETAVQILNTVPSKSVSEIPFELWKGHKPSLQHFRIWDCLAHCASDKPKETVISFKIMPICWLSQRNERWSFLQPTRKQGDCIDKRHFLGGRSY
              VETAV ILN VPSKSVSE PFELW+G KPSL HFRIW C AH     PK+     ++       +      F  P   +        FL     
Subjt:  -----IVETAVQILNTVPSKSVSEIPFELWKGHKPSLQHFRIWDCLAHCASDKPKETVISFKIMPICWLSQRNERWSFLQPTRKQGDCIDKRHFLGGRSY

Query:  EEPKPRSKLVLNEATDEPTRVVDQAGPSSRVDGEASTSSQSSPSQSLGMPRRSGRVISQPDRYLGLVETQVVIPDDGVEDPLSYRHAMNDVDKDQWIKAM
           KPRSKLVL+EATDE TRVVD+ GPSSRVD E +TS QS PSQSL MPRRSGRV+SQP+RYLGL ETQVVIPDDGVEDPLSY+ AMNDVDKDQW+KAM
Subjt:  EEPKPRSKLVLNEATDEPTRVVDQAGPSSRVDGEASTSSQSSPSQSLGMPRRSGRVISQPDRYLGLVETQVVIPDDGVEDPLSYRHAMNDVDKDQWIKAM

Query:  DLEIESMDFNSVWELVDQPDG-------------------------------------------------------------------------------
        DLE+ESM FNSVWELVD P+G                                                                               
Subjt:  DLEIESMDFNSVWELVDQPDG-------------------------------------------------------------------------------

Query:  ----------PEGFITQGQKRKVCKLNRSIYGLKQASRSWTIRFDTAIKSYGFDQNIDKPCVYKRIINDKVSFLVLYVDDIILIRNDVGYLTNIKNWLAT
                  PEGFITQGQ++KVCKLNRSIYGLKQASRSW IRFDTAIKSYGFDQN+D+PCVYK+I   KV+FLVLYVDDI+LI NDVGYLT++K WLA 
Subjt:  ----------PEGFITQGQKRKVCKLNRSIYGLKQASRSWTIRFDTAIKSYGFDQNIDKPCVYKRIINDKVSFLVLYVDDIILIRNDVGYLTNIKNWLAT

Query:  QFQMKYLREAQYVLRIQIFRNRENKTLALSQASYVDKILSRYSMQISKRGLLPFRHGVHLSKEQCSKTPQEVEDMRRIPYASAVGSLMYDMLCMRPDICY
        QFQMK L E QYVL IQI R+R+NKTLALSQA+Y+DK+L RYSMQ SK+GLLPFRHGVHLSKEQ  KTPQEVEDMRRIPYASAVGSLMY MLC RPDICY
Subjt:  QFQMKYLREAQYVLRIQIFRNRENKTLALSQASYVDKILSRYSMQISKRGLLPFRHGVHLSKEQCSKTPQEVEDMRRIPYASAVGSLMYDMLCMRPDICY

Query:  VVGIVSRYQSNPGLDHWTTCKNILKYLRRTRDYTLVYGTKDLILTEYTDFDFQTDKDSRKSTSGSVFTLNGGATVWRSIKQGCIADSAIEVEYVATCEAA
         VGIVSRYQSNPGLDHWT  K ILKYLRRTRDY LVYG KDLILT YT+ DFQTDKDSRKSTS SVFTLNGGA VWRSIKQGCIADS +E EYVA CEAA
Subjt:  VVGIVSRYQSNPGLDHWTTCKNILKYLRRTRDYTLVYGTKDLILTEYTDFDFQTDKDSRKSTSGSVFTLNGGATVWRSIKQGCIADSAIEVEYVATCEAA

Query:  KEVVWLRKFLTNLEVVLNMEFPITLYCDNSGA-ANSKEPRRNKRDKHIERKYHLTREIVQREDVTVMKIASEHNIADPFTKTLTTQVFEGHLESLGLRDM
        KE VWL+KFL +LEVV NM  PITLYCDNSGA ANSKEPR +KR KHIERKYHL REIVQR DV V KIASEHNIADPFTKTLT +VFEGHLESLGLRDM
Subjt:  KEVVWLRKFLTNLEVVLNMEFPITLYCDNSGA-ANSKEPRRNKRDKHIERKYHLTREIVQREDVTVMKIASEHNIADPFTKTLTTQVFEGHLESLGLRDM

Query:  YI
        YI
Subjt:  YI

A0A5A7TUI8 Gag/pol protein2.7e-26368.93Show/hide
Query:  MHHKSEALEKFKEYKAEVENALRKTIKTLRSDRGGEYMDFRFQDYMIEHGIKSKLSAPNTPQQ-------------------------------IVETAV
        M HKSEALEKFKEYK EVEN L K IK LRSDRGGEYMD RFQDYMIEHGI+S+LSAP TPQQ                                VET V
Subjt:  MHHKSEALEKFKEYKAEVENALRKTIKTLRSDRGGEYMDFRFQDYMIEHGIKSKLSAPNTPQQ-------------------------------IVETAV

Query:  QILNTVPSKSVSEIPFELWKGHKPSLQHFRIWDCLAHCASDKPKETVISFKIMPICWLSQRNERWSFLQPTRKQGDCIDKRHFLGGRSYEEPKPRSKLVL
         ILN VPSKSVSE P +LW+G KPSL HF+IW C  H     PK      ++  +    +      F  P   +        FL      + KPRSKL+L
Subjt:  QILNTVPSKSVSEIPFELWKGHKPSLQHFRIWDCLAHCASDKPKETVISFKIMPICWLSQRNERWSFLQPTRKQGDCIDKRHFLGGRSYEEPKPRSKLVL

Query:  NEATDEPTRVVDQAGPSSRVDGEASTSSQSSPSQSLGMPRRSGRVISQPDRYLGLVETQVVIPDDGVEDPLSYRHAMNDVDKDQWIKAMDLEIESMDFNS
        NEATDE TRV D+ GPSSRVD E +TS QS PSQ L MPRRSGRV+S+P+ YLGL ETQVVIPDDG+EDPL ++ AMNDVDKDQW+KAMDLE++SM FNS
Subjt:  NEATDEPTRVVDQAGPSSRVDGEASTSSQSSPSQSLGMPRRSGRVISQPDRYLGLVETQVVIPDDGVEDPLSYRHAMNDVDKDQWIKAMDLEIESMDFNS

Query:  VWELVDQPDGPEGFITQGQKRKVCKLNRSIYGLKQASRSWTIRFDTAIKSYGFDQNIDKPCVYKRIINDKVSFLVLYVDDIILIRNDVGYLTNIKNWLAT
        +WELVD P+G EGFITQGQ++KVCKLNRSIYGLKQASRSW IRFD AIKSYGFD+N+D+PCVYK+I   KV+FLVLYVDDI+LI NDVGYLTN+K WLA 
Subjt:  VWELVDQPDGPEGFITQGQKRKVCKLNRSIYGLKQASRSWTIRFDTAIKSYGFDQNIDKPCVYKRIINDKVSFLVLYVDDIILIRNDVGYLTNIKNWLAT

Query:  QFQMKYLREAQYVLRIQIFRNRENKTLALSQASYVDKILSRYSMQISKRGLLPFRHGVHLSKEQCSKTPQEVEDMRRIPYASAVGSLMYDMLCMRPDICY
        QFQMK L EAQYVL IQI R+ +NKTLALSQA Y+DK+L RY MQ SK+GLLPFRHGVHLSKEQC KTPQEVED+RRIPYAS +GSLMY MLC RPDICY
Subjt:  QFQMKYLREAQYVLRIQIFRNRENKTLALSQASYVDKILSRYSMQISKRGLLPFRHGVHLSKEQCSKTPQEVEDMRRIPYASAVGSLMYDMLCMRPDICY

Query:  VVGIVSRYQSNPGLDHWTTCKNILKYLRRTRDYTLVYGTKDLILTEYTDFDFQTDKDSRKSTSGSVFTLNGGATVWRSIKQGCIADSAIEVEYVATCEAA
         VGIVSRYQSNPGLDHWT  K ILKYLRR RDY LVYG KDLILT Y D+DFQ DKDSRKSTSGS+FTLN  A VW SIKQGCIADS +E +YVA CEAA
Subjt:  VVGIVSRYQSNPGLDHWTTCKNILKYLRRTRDYTLVYGTKDLILTEYTDFDFQTDKDSRKSTSGSVFTLNGGATVWRSIKQGCIADSAIEVEYVATCEAA

Query:  KEVVWLRKFLTNLEVVLNMEFPITLYCDNSGA-ANSKEPRRNKRDKHIERKYHLTREIVQREDVTVMKIASEHNIADPFTKTLTTQVFEGHL
        KE  WLRKFL +LEVV NM  PIT Y DNS A ANSKEPR  KR KH ERKYHL REIVQR DV V KIASEHNI DPFTKT T +VFEGHL
Subjt:  KEVVWLRKFLTNLEVVLNMEFPITLYCDNSGA-ANSKEPRRNKRDKHIERKYHLTREIVQREDVTVMKIASEHNIADPFTKTLTTQVFEGHL

A0A5A7TZD0 Gag/pol protein1.5e-28566.33Show/hide
Query:  DDYSRYGYLYLMHHKSEALEKFKEYKAEVENALRKTIKTLRSDRGGEYMDFRFQDYMIEHGIKSKLSAPNTPQQ--------------------------
        DDYSRYGYLYLM HKSEALEKFKEYK EVEN L K IK LRSDRGGEYMD RFQDYMIEHGI+S+LSAP TPQQ                          
Subjt:  DDYSRYGYLYLMHHKSEALEKFKEYKAEVENALRKTIKTLRSDRGGEYMDFRFQDYMIEHGIKSKLSAPNTPQQ--------------------------

Query:  -----IVETAVQILNTVPSKSVSEIPFELWKGHKPSLQHFRIWDCLAHCASDKPKETVISFKIMPICWLSQRNERWSFLQPTRKQGDCIDKRHFLGGRSY
              VETAV ILN VPSKSVSE PFELW+G KPSL HFRIW C AH     PK+     ++       +      F  P   +        FL     
Subjt:  -----IVETAVQILNTVPSKSVSEIPFELWKGHKPSLQHFRIWDCLAHCASDKPKETVISFKIMPICWLSQRNERWSFLQPTRKQGDCIDKRHFLGGRSY

Query:  EEPKPRSKLVLNEATDEPTRVVDQAGPSSRVDGEASTSSQSSPSQSLGMPRRSGRVISQPDRYLGLVETQVVIPDDGVEDPLSYRHAMNDVDKDQWIKAM
           KPRSKLVL+EATDE TRVVD+ GPSSRVD E +TS QS PSQSL MPRRSGRV+SQP+RYLGL ETQVVIPDDGVEDPLSY+ AMNDVDKDQW+KAM
Subjt:  EEPKPRSKLVLNEATDEPTRVVDQAGPSSRVDGEASTSSQSSPSQSLGMPRRSGRVISQPDRYLGLVETQVVIPDDGVEDPLSYRHAMNDVDKDQWIKAM

Query:  DLEIESMDFNSVWELVDQPDG-------------------------------------------------------------------------------
        DLE+ESM FNSVWELVD P+G                                                                               
Subjt:  DLEIESMDFNSVWELVDQPDG-------------------------------------------------------------------------------

Query:  ----------PEGFITQGQKRKVCKLNRSIYGLKQASRSWTIRFDTAIKSYGFDQNIDKPCVYKRIINDKVSFLVLYVDDIILIRNDVGYLTNIKNWLAT
                  PEGFITQGQ++KVCKLNRSIYGLKQASRSW IRFDTAIKSYGFDQN+D+PCVYK+I   KV+FLVLYVDDI+LI NDVGYLT++K WLA 
Subjt:  ----------PEGFITQGQKRKVCKLNRSIYGLKQASRSWTIRFDTAIKSYGFDQNIDKPCVYKRIINDKVSFLVLYVDDIILIRNDVGYLTNIKNWLAT

Query:  QFQMKYLREAQYVLRIQIFRNRENKTLALSQASYVDKILSRYSMQISKRGLLPFRHGVHLSKEQCSKTPQEVEDMRRIPYASAVGSLMYDMLCMRPDICY
        QFQMK L EAQYVL IQI R+R+NKTLALSQA+Y+DK+L RYSMQ SK+GLLPFRHGVHLSKEQ  KTPQEVEDMRRIPYASAVGSLMY MLC RPDICY
Subjt:  QFQMKYLREAQYVLRIQIFRNRENKTLALSQASYVDKILSRYSMQISKRGLLPFRHGVHLSKEQCSKTPQEVEDMRRIPYASAVGSLMYDMLCMRPDICY

Query:  VVGIVSRYQSNPGLDHWTTCKNILKYLRRTRDYTLVYGTKDLILTEYTDFDFQTDKDSRKSTSGSVFTLNGGATVWRSIKQGCIADSAIEVEYVATCEAA
         VGIVSRYQSNPGLDHWT  K +LKYLRRTRDY LVYG KDLILT YTD DFQTDKDSRKSTSGSVFTLNGGA VWRSIKQGCIADS +E EYVA CEAA
Subjt:  VVGIVSRYQSNPGLDHWTTCKNILKYLRRTRDYTLVYGTKDLILTEYTDFDFQTDKDSRKSTSGSVFTLNGGATVWRSIKQGCIADSAIEVEYVATCEAA

Query:  KEVVWLRKFLTNLEVVLNMEFPITLYCDNSGA-ANSKEPRRNKRDKHIERKYHLTREIVQREDVTVMKIASEHNIADPFTKTLTTQVFEGHLESLGLRDM
        KE VWLRKFL +LEVV NM  PITLYCDNSGA ANSKEPR +KR KHIERKYHL REIVQR DV V KIASEHNIADPFTKTLT +VFEGHLESLGLRDM
Subjt:  KEVVWLRKFLTNLEVVLNMEFPITLYCDNSGA-ANSKEPRRNKRDKHIERKYHLTREIVQREDVTVMKIASEHNIADPFTKTLTTQVFEGHLESLGLRDM

Query:  YI
        YI
Subjt:  YI

A0A5A7UYE8 Gag/pol protein1.5e-28566.33Show/hide
Query:  DDYSRYGYLYLMHHKSEALEKFKEYKAEVENALRKTIKTLRSDRGGEYMDFRFQDYMIEHGIKSKLSAPNTPQQ--------------------------
        DDYSRYGYLYLM HKSEALEKFKEYK EVEN L K IK LRSDRGGEYMD RFQDYMIEHGI+S+LSAP TPQQ                          
Subjt:  DDYSRYGYLYLMHHKSEALEKFKEYKAEVENALRKTIKTLRSDRGGEYMDFRFQDYMIEHGIKSKLSAPNTPQQ--------------------------

Query:  -----IVETAVQILNTVPSKSVSEIPFELWKGHKPSLQHFRIWDCLAHCASDKPKETVISFKIMPICWLSQRNERWSFLQPTRKQGDCIDKRHFLGGRSY
              VETAV ILN VPSKSVSE PFELW+G KPSL HFRIW C AH     PK+     ++       +      F  P   +        FL     
Subjt:  -----IVETAVQILNTVPSKSVSEIPFELWKGHKPSLQHFRIWDCLAHCASDKPKETVISFKIMPICWLSQRNERWSFLQPTRKQGDCIDKRHFLGGRSY

Query:  EEPKPRSKLVLNEATDEPTRVVDQAGPSSRVDGEASTSSQSSPSQSLGMPRRSGRVISQPDRYLGLVETQVVIPDDGVEDPLSYRHAMNDVDKDQWIKAM
           KPRSKLVL+EATDE TRVVD+ GPSSRVD E +TS QS PSQSL MPRRSGRV+SQP+RYLGL ETQVVIPDDGVEDPLSY+ AMNDVDKDQW+KAM
Subjt:  EEPKPRSKLVLNEATDEPTRVVDQAGPSSRVDGEASTSSQSSPSQSLGMPRRSGRVISQPDRYLGLVETQVVIPDDGVEDPLSYRHAMNDVDKDQWIKAM

Query:  DLEIESMDFNSVWELVDQPDG-------------------------------------------------------------------------------
        DLE+ESM FNSVWELVD P+G                                                                               
Subjt:  DLEIESMDFNSVWELVDQPDG-------------------------------------------------------------------------------

Query:  ----------PEGFITQGQKRKVCKLNRSIYGLKQASRSWTIRFDTAIKSYGFDQNIDKPCVYKRIINDKVSFLVLYVDDIILIRNDVGYLTNIKNWLAT
                  PEGFITQGQ++KVCKLNRSIYGLKQASRSW IRFDTAIKSYGFDQN+D+PCVYK+I   KV+FLVLYVDDI+LI NDVGYLT++K WLA 
Subjt:  ----------PEGFITQGQKRKVCKLNRSIYGLKQASRSWTIRFDTAIKSYGFDQNIDKPCVYKRIINDKVSFLVLYVDDIILIRNDVGYLTNIKNWLAT

Query:  QFQMKYLREAQYVLRIQIFRNRENKTLALSQASYVDKILSRYSMQISKRGLLPFRHGVHLSKEQCSKTPQEVEDMRRIPYASAVGSLMYDMLCMRPDICY
        QFQMK L EAQYVL IQI R+R+NKTLALSQA+Y+DK+L RYSMQ SK+GLLPFRHGVHLSKEQ  KTPQEVEDMRRIPYASAVGSLMY MLC RPDICY
Subjt:  QFQMKYLREAQYVLRIQIFRNRENKTLALSQASYVDKILSRYSMQISKRGLLPFRHGVHLSKEQCSKTPQEVEDMRRIPYASAVGSLMYDMLCMRPDICY

Query:  VVGIVSRYQSNPGLDHWTTCKNILKYLRRTRDYTLVYGTKDLILTEYTDFDFQTDKDSRKSTSGSVFTLNGGATVWRSIKQGCIADSAIEVEYVATCEAA
         VGIVSRYQSNPGLDHWT  K +LKYLRRTRDY LVYG KDLILT YTD DFQTDKDSRKSTSGSVFTLNGGA VWRSIKQGCIADS +E EYVA CEAA
Subjt:  VVGIVSRYQSNPGLDHWTTCKNILKYLRRTRDYTLVYGTKDLILTEYTDFDFQTDKDSRKSTSGSVFTLNGGATVWRSIKQGCIADSAIEVEYVATCEAA

Query:  KEVVWLRKFLTNLEVVLNMEFPITLYCDNSGA-ANSKEPRRNKRDKHIERKYHLTREIVQREDVTVMKIASEHNIADPFTKTLTTQVFEGHLESLGLRDM
        KE VWLRKFL +LEVV NM  PITLYCDNSGA ANSKEPR +KR KHIERKYHL REIVQR DV V KIASEHNIADPFTKTLT +VFEGHLESLGLRDM
Subjt:  KEVVWLRKFLTNLEVVLNMEFPITLYCDNSGA-ANSKEPRRNKRDKHIERKYHLTREIVQREDVTVMKIASEHNIADPFTKTLTTQVFEGHLESLGLRDM

Query:  YI
        YI
Subjt:  YI

SwissProt top hitse value%identityAlignment
P04146 Copia protein4.5e-5335.14Show/hide
Query:  VCKLNRSIYGLKQASRSWTIRFDTAIKSYGFDQNIDKPCVY---KRIINDKVSFLVLYVDDIILIRNDVGYLTNIKNWLATQFQMKYLREAQYVLRIQIF
        VCKLN++IYGLKQA+R W   F+ A+K   F  +    C+Y   K  IN+ + +++LYVDD+++   D+  + N K +L  +F+M  L E ++ + I+I 
Subjt:  VCKLNRSIYGLKQASRSWTIRFDTAIKSYGFDQNIDKPCVY---KRIINDKVSFLVLYVDDIILIRNDVGYLTNIKNWLATQFQMKYLREAQYVLRIQIF

Query:  RNRENKTLALSQASYVDKILSRYSMQISKRGLLPFRHGVHL----SKEQCSKTPQEVEDMRRIPYASAVGSLMYDMLCMRPDICYVVGIVSRYQSNPGLD
           +   + LSQ++YV KILS+++M+       P    ++     S E C+            P  S +G LMY MLC RPD+   V I+SRY S    +
Subjt:  RNRENKTLALSQASYVDKILSRYSMQISKRGLLPFRHGVHL----SKEQCSKTPQEVEDMRRIPYASAVGSLMYDMLCMRPDICYVVGIVSRYQSNPGLD

Query:  HWTTCKNILKYLRRTRDYTLVYGTKDLI----LTEYTDFDFQTDKDSRKSTSGSVFTL-NGGATVWRSIKQGCIADSAIEVEYVATCEAAKEVVWLRKFL
         W   K +L+YL+ T D  L++  K+L     +  Y D D+   +  RKST+G +F + +     W + +Q  +A S+ E EY+A  EA +E +WL+  L
Subjt:  HWTTCKNILKYLRRTRDYTLVYGTKDLI----LTEYTDFDFQTDKDSRKSTSGSVFTL-NGGATVWRSIKQGCIADSAIEVEYVATCEAAKEVVWLRKFL

Query:  TNLEVVLNMEFPITLYCDNSGAAN-SKEPRRNKRDKHIERKYHLTREIVQREDVTVMKIASEHNIADPFTKTLTTQVFEGHLESLGL
        T++ +   +E PI +Y DN G  + +  P  +KR KHI+ KYH  RE VQ   + +  I +E+ +AD FTK L    F    + LGL
Subjt:  TNLEVVLNMEFPITLYCDNSGAAN-SKEPRRNKRDKHIERKYHLTREIVQREDVTVMKIASEHNIADPFTKTLTTQVFEGHLESLGL

P0CV72 Secreted RxLR effector protein 1611.0e-2545.93Show/hide
Query:  MRRIPYASAVGSLMYDMLCMRPDICYVVGIVSRYQSNPGLDHWTTCKNILKYLRRTRDYTLVY---GTKDLILTEYTDFDFQTDKDSRKSTSGSVFTLNG
        M+ +PY SAVG++MY M+  RPD+   VG++S++ S+P   HW   K +L+YL+ T+ Y L +   GT  L+   Y+D D+  D +SR+STSG +F LNG
Subjt:  MRRIPYASAVGSLMYDMLCMRPDICYVVGIVSRYQSNPGLDHWTTCKNILKYLRRTRDYTLVY---GTKDLILTEYTDFDFQTDKDSRKSTSGSVFTLNG

Query:  GATVWRSIKQGCIADSAIEVEYVATCEAAKEVVWL
        G   WRS KQ  +A S+ E EY+A  EA +E VWL
Subjt:  GATVWRSIKQGCIADSAIEVEYVATCEAAKEVVWL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.4e-9930.3Show/hide
Query:  DDYSRYGYLYLMHHKSEALEKFKEYKAEVENALRKTIKTLRSDRGGEYMDFRFQDYMIEHGIKSKLSAPNTPQ---------------------------
        DD SR  ++Y++  K +  + F+++ A VE    + +K LRSD GGEY    F++Y   HGI+ + + P TPQ                           
Subjt:  DDYSRYGYLYLMHHKSEALEKFKEYKAEVENALRKTIKTLRSDRGGEYMDFRFQDYMIEHGIKSKLSAPNTPQ---------------------------

Query:  ----QIVETAVQILNTVPSKSVS-EIPFELWKGHKPSLQHFRIWDCLAHCASDKPKETVISFKIMPICWLSQRNERWSF--LQPTRKQ----GDCIDKRH
            + V+TA  ++N  PS  ++ EIP  +W   + S  H +++ C A     K + T +  K +P  ++   +E + +    P +K+     D + +  
Subjt:  ----QIVETAVQILNTVPSKSVS-EIPFELWKGHKPSLQHFRIWDCLAHCASDKPKETVISFKIMPICWLSQRNERWSF--LQPTRKQ----GDCIDKRH

Query:  FLGGRSYEEPKPRSKLVLN---------------EATDEPTRVVDQAG----PSSRVDGEASTSSQSSPSQSLGMP-RRSGRVISQPDRYLGLVETQVVI
         +   +    K ++ ++ N                 TDE +   +Q G       ++D         +  +    P RRS R   +  RY       V+I
Subjt:  FLGGRSYEEPKPRSKLVLN---------------EATDEPTRVVDQAG----PSSRVDGEASTSSQSSPSQSLGMP-RRSGRVISQPDRYLGLVETQVVI

Query:  PDDGVEDPLSYRHAMNDVDKDQWIKAMDLEIESMDFNSVWELVDQPDG----------------------------------------------------
         DD   +P S +  ++  +K+Q +KAM  E+ES+  N  ++LV+ P G                                                    
Subjt:  PDDGVEDPLSYRHAMNDVDKDQWIKAMDLEIESMDFNSVWELVDQPDG----------------------------------------------------

Query:  -------------------------------------PEGFITQGQKRKVCKLNRSIYGLKQASRSWTIRFDTAIKSYGFDQNIDKPCVY-KRIINDKVS
                                             PEGF   G+K  VCKLN+S+YGLKQA R W ++FD+ +KS  + +    PCVY KR   +   
Subjt:  -------------------------------------PEGFITQGQKRKVCKLNRSIYGLKQASRSWTIRFDTAIKSYGFDQNIDKPCVY-KRIINDKVS

Query:  FLVLYVDDIILIRNDVGYLTNIKNWLATQFQMKYLREAQYVLRIQIFRNRENKTLALSQASYVDKILSRYSMQISKRGLLPFRHGVHLSKEQCSKTPQEV
         L+LYVDD++++  D G +  +K  L+  F MK L  AQ +L ++I R R ++ L LSQ  Y++++L R++M+ +K    P    + LSK+ C  T +E 
Subjt:  FLVLYVDDIILIRNDVGYLTNIKNWLATQFQMKYLREAQYVLRIQIFRNRENKTLALSQASYVDKILSRYSMQISKRGLLPFRHGVHLSKEQCSKTPQEV

Query:  EDMRRIPYASAVGSLMYDMLCMRPDICYVVGIVSRYQSNPGLDHWTTCKNILKYLRRTRDYTLVYGTKDLILTEYTDFDFQTDKDSRKSTSGSVFTLNGG
         +M ++PY+SAVGSLMY M+C RPDI + VG+VSR+  NPG +HW   K IL+YLR T    L +G  D IL  YTD D   D D+RKS++G +FT +GG
Subjt:  EDMRRIPYASAVGSLMYDMLCMRPDICYVVGIVSRYQSNPGLDHWTTCKNILKYLRRTRDYTLVYGTKDLILTEYTDFDFQTDKDSRKSTSGSVFTLNGG

Query:  ATVWRSIKQGCIADSAIEVEYVATCEAAKEVVWLRKFLTNLEVVLNMEFPITLYCDNSGAAN-SKEPRRNKRDKHIERKYHLTREIVQREDVTVMKIASE
        A  W+S  Q C+A S  E EY+A  E  KE++WL++FL  L +    +    +YCD+  A + SK    + R KHI+ +YH  RE+V  E + V+KI++ 
Subjt:  ATVWRSIKQGCIADSAIEVEYVATCEAAKEVVWLRKFLTNLEVVLNMEFPITLYCDNSGAAN-SKEPRRNKRDKHIERKYHLTREIVQREDVTVMKIASE

Query:  HNIADPFTKTLTTQVFEGHLESLGL
         N AD  TK +    FE   E +G+
Subjt:  HNIADPFTKTLTTQVFEGHLESLGL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.0e-4131.19Show/hide
Query:  PEGFITQGQKRKVCKLNRSIYGLKQASRSWTIRFDTAIKSYGFDQNIDKPCVYKRIINDKVSFLVLYVDDIILIRNDVGYLTNIKNWLATQFQMKYLREA
        P GFI + +   VCKL +++YGLKQA R+W +     + + GF  ++    ++       + ++++YVDDI++  ND   L N  + L+ +F +K   E 
Subjt:  PEGFITQGQKRKVCKLNRSIYGLKQASRSWTIRFDTAIKSYGFDQNIDKPCVYKRIINDKVSFLVLYVDDIILIRNDVGYLTNIKNWLATQFQMKYLREA

Query:  QYVLRIQIFRNRENKTLALSQASYVDKILSRYSMQISKRGLLPFRHGVHLSKEQCSKTPQEVEDMRRIPYASAVGSLMYDMLCMRPDICYVVGIVSRYQS
         Y L I+    R    L LSQ  Y+  +L+R +M  +K    P      LS    +K     E      Y   VGSL Y +   RPDI Y V  +S++  
Subjt:  QYVLRIQIFRNRENKTLALSQASYVDKILSRYSMQISKRGLLPFRHGVHLSKEQCSKTPQEVEDMRRIPYASAVGSLMYDMLCMRPDICYVVGIVSRYQS

Query:  NPGLDHWTTCKNILKYLRRTRDYTL-VYGTKDLILTEYTDFDFQTDKDSRKSTSGSVFTLNGGATVWRSIKQGCIADSAIEVEYVATCEAAKEVVWLRKF
         P  +H    K IL+YL  T ++ + +     L L  Y+D D+  DKD   ST+G +  L      W S KQ  +  S+ E EY +    + E+ W+   
Subjt:  NPGLDHWTTCKNILKYLRRTRDYTL-VYGTKDLILTEYTDFDFQTDKDSRKSTSGSVFTLNGGATVWRSIKQGCIADSAIEVEYVATCEAAKEVVWLRKF

Query:  LTNLEVVLNMEFPITLYCDNSGAAN-SKEPRRNKRDKHIERKYHLTREIVQREDVTVMKIASEHNIADPFTKTLTTQVFEGHLESLGL
        LT L + L    P  +YCDN GA      P  + R KHI   YH  R  VQ   + V+ +++   +AD  TK L+   F+     +G+
Subjt:  LTNLEVVLNMEFPITLYCDNSGAAN-SKEPRRNKRDKHIERKYHLTREIVQREDVTVMKIASEHNIADPFTKTLTTQVFEGHLESLGL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.9e-4431.19Show/hide
Query:  PEGFITQGQKRKVCKLNRSIYGLKQASRSWTIRFDTAIKSYGFDQNIDKPCVYKRIINDKVSFLVLYVDDIILIRNDVGYLTNIKNWLATQFQMKYLREA
        P GF+ + +   VC+L ++IYGLKQA R+W +   T + + GF  +I    ++       + ++++YVDDI++  ND   L +  + L+ +F +K   + 
Subjt:  PEGFITQGQKRKVCKLNRSIYGLKQASRSWTIRFDTAIKSYGFDQNIDKPCVYKRIINDKVSFLVLYVDDIILIRNDVGYLTNIKNWLATQFQMKYLREA

Query:  QYVLRIQIFRNRENKTLALSQASYVDKILSRYSMQISKRGLLPFRHGVHLSKEQCSKTPQEVEDMRRIPYASAVGSLMYDMLCMRPDICYVVGIVSRYQS
         Y L I+    R  + L LSQ  Y   +L+R +M  +K    P      L+    +K P   E      Y   VGSL Y +   RPD+ Y V  +S+Y  
Subjt:  QYVLRIQIFRNRENKTLALSQASYVDKILSRYSMQISKRGLLPFRHGVHLSKEQCSKTPQEVEDMRRIPYASAVGSLMYDMLCMRPDICYVVGIVSRYQS

Query:  NPGLDHWTTCKNILKYLRRTRDYTL-VYGTKDLILTEYTDFDFQTDKDSRKSTSGSVFTLNGGATVWRSIKQGCIADSAIEVEYVATCEAAKEVVWLRKF
         P  DHW   K +L+YL  T D+ + +     L L  Y+D D+  D D   ST+G +  L      W S KQ  +  S+ E EY +    + E+ W+   
Subjt:  NPGLDHWTTCKNILKYLRRTRDYTL-VYGTKDLILTEYTDFDFQTDKDSRKSTSGSVFTLNGGATVWRSIKQGCIADSAIEVEYVATCEAAKEVVWLRKF

Query:  LTNLEVVLNMEFPITLYCDNSGAAN-SKEPRRNKRDKHIERKYHLTREIVQREDVTVMKIASEHNIADPFTKTLTTQVFEGHLESLGL
        LT L + L+   P  +YCDN GA      P  + R KHI   YH  R  VQ   + V+ +++   +AD  TK L+   F+     +G+
Subjt:  LTNLEVVLNMEFPITLYCDNSGAAN-SKEPRRNKRDKHIERKYHLTREIVQREDVTVMKIASEHNIADPFTKTLTTQVFEGHLESLGL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.1e-0641.1Show/hide
Query:  DDYSRYGYLYLMHHKSEALEKFKEYKAEVENALRKTIKTLRSDRGGEYMDFRFQDYMIEHGIKSKLSAPNTPQ
        D ++RY +LY +  KS+  + F  +K+ VEN  +  I TL SD GGE++  R  DY+ +HGI    S P+TP+
Subjt:  DDYSRYGYLYLMHHKSEALEKFKEYKAEVENALRKTIKTLRSDRGGEYMDFRFQDYMIEHGIKSKLSAPNTPQ

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 84.8e-3430.36Show/hide
Query:  VCKLNRSIYGLKQASRSWTIRFDTAIKSYGFDQNIDKPCVYKRIINDKVSFLVLYVDDIILIRNDVGYLTNIKNWLATQFQMKYLREAQYVLRIQIFRNR
        VC L +SIYGLKQASR W ++F   +  +GF Q+      + +I       +++YVDDII+  N+   +  +K+ L + F+++ L   +Y L ++I R+ 
Subjt:  VCKLNRSIYGLKQASRSWTIRFDTAIKSYGFDQNIDKPCVYKRIINDKVSFLVLYVDDIILIRNDVGYLTNIKNWLATQFQMKYLREAQYVLRIQIFRNR

Query:  ENKTLALSQASYVDKILSRYSMQISKRGLLPFRHGVHLSKEQCSKTPQEVEDMRRIPYASAVGSLMYDMLCMRPDICYVVGIVSRYQSNPGLDHWTTCKN
            + + Q  Y   +L    +   K   +P    V  S    + +  +  D +   Y   +G LMY +   R DI + V  +S++   P L H      
Subjt:  ENKTLALSQASYVDKILSRYSMQISKRGLLPFRHGVHLSKEQCSKTPQEVEDMRRIPYASAVGSLMYDMLCMRPDICYVVGIVSRYQSNPGLDHWTTCKN

Query:  ILKYLRRTRDYTLVYGTK-DLILTEYTDFDFQTDKDSRKSTSGSVFTLNGGATVWRSIKQGCIADSAIEVEYVATCEAAKEVVWLRKFLTNLEVVLNMEF
        IL Y++ T    L Y ++ ++ L  ++D  FQ+ KD+R+ST+G    L      W+S KQ  ++ S+ E EY A   A  E++WL +F   L++ L+   
Subjt:  ILKYLRRTRDYTLVYGTK-DLILTEYTDFDFQTDKDSRKSTSGSVFTLNGGATVWRSIKQGCIADSAIEVEYVATCEAAKEVVWLRKFLTNLEVVLNMEF

Query:  PITLYCDNSGAAN-SKEPRRNKRDKHIERKYHLTRE
        P  L+CDN+ A + +     ++R KHIE   H  RE
Subjt:  PITLYCDNSGAAN-SKEPRRNKRDKHIERKYHLTRE

ATMG00810.1 DNA/RNA polymerases superfamily protein2.2e-1529.91Show/hide
Query:  FLVLYVDDIILIRNDVGYLTNIKNWLATQFQMKYLREAQYVLRIQIFRNRENKTLALSQASYVDKILSRYSMQISKRGLLPFRHGVHLSKEQCSKTPQEV
        +L+LYVDDI+L  +    L  +   L++ F MK L    Y L IQI        L LSQ  Y ++IL+   M   K    P    ++ S    +K P   
Subjt:  FLVLYVDDIILIRNDVGYLTNIKNWLATQFQMKYLREAQYVLRIQIFRNRENKTLALSQASYVDKILSRYSMQISKRGLLPFRHGVHLSKEQCSKTPQEV

Query:  EDMRRIPYASAVGSLMYDMLCMRPDICYVVGIVSRYQSNPGLDHWTTCKNILKYLRRTRDYTL-VYGTKDLILTEYTDFDFQTDKDSRKSTSGSVFTLNG
        +      + S VG+L Y +   RPDI Y V IV +    P L  +   K +L+Y++ T  + L ++    L +  + D D+     +R+ST+G    L  
Subjt:  EDMRRIPYASAVGSLMYDMLCMRPDICYVVGIVSRYQSNPGLDHWTTCKNILKYLRRTRDYTL-VYGTKDLILTEYTDFDFQTDKDSRKSTSGSVFTLNG

Query:  GATVWRSIKQGCIADSAIEVEYVATCEAAKEVVW
            W + +Q  ++ S+ E EY A    A E+ W
Subjt:  GATVWRSIKQGCIADSAIEVEYVATCEAAKEVVW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCACAGACTCAGGCGAAGGAAGCTCGGAGATCTTGTCGAGGGAAACAGATCTGGCCGGCGACTGGCGAGGGAGGGAAGCTCGGAGATCTGGCTGAGGGAAGCACAGGG
GAAGGAAACTCAGACGAAGGAAGCTCGGTGGAAGGTCAGATGGACAACAGACGGTGGCAAACGGGGGAAGAGGGGTCGACGGCGACAAACGGCGAAGAGGGAGAGTGAGA
GAGAGAGGGATGATTATTCGAGGTATGGGTATTTATACCTAATGCATCATAAGTCTGAGGCTCTTGAAAAGTTCAAAGAGTATAAGGCTGAAGTAGAGAATGCATTAAGG
AAAACCATTAAAACACTTCGATCCGATCGAGGTGGAGAGTATATGGATTTTAGATTCCAAGACTATATGATAGAACATGGAATTAAATCTAAACTCTCAGCACCTAATAC
ACCACAGCAAATTGTAGAGACTGCAGTTCAAATCTTGAACACTGTTCCATCAAAGAGTGTTTCAGAAATACCTTTTGAATTATGGAAGGGGCATAAACCTAGTTTACAAC
ACTTCAGGATTTGGGATTGTCTGGCACATTGTGCTAGTGACAAACCCAAAGAAACTGTAATCTCGTTCAAGATTATGCCAATTTGTTGGCTATCCCAAAGAAACGAGAGG
TGGTCTTTTCTTCAACCCACAAGAAAACAAGGTGATTGTATCGACAAACGCCACTTTCTTGGAGGAAGATCATATGAGGAACCAAAACCACGTAGTAAATTAGTGCTAAA
TGAAGCTACAGATGAACCAACAAGAGTTGTTGATCAAGCTGGACCTTCATCAAGAGTTGATGGAGAAGCCAGCACCTCAAGTCAGTCTAGTCCTTCTCAATCGTTGGGAA
TGCCTCGACGCAGTGGGAGGGTTATTTCCCAACCTGACCGCTACTTGGGTTTAGTTGAAACTCAAGTCGTCATACCTGATGACGGCGTAGAAGATCCATTGTCTTATAGA
CATGCAATGAATGACGTAGACAAAGACCAATGGATCAAAGCCATGGACCTTGAAATAGAGTCAATGGACTTCAATTCAGTGTGGGAACTTGTAGACCAACCTGATGGGCC
CGAAGGGTTCATAACCCAAGGTCAGAAGCGAAAAGTTTGCAAGCTCAATCGATCCATTTATGGGTTGAAACAAGCATCCAGATCTTGGACTATAAGATTTGATACTGCGA
TCAAGTCTTATGGTTTTGACCAAAACATTGATAAGCCTTGTGTTTACAAGAGGATCATCAACGACAAAGTATCTTTCTTAGTACTTTATGTGGATGATATCATACTCATT
CGGAATGATGTAGGATACCTTACTAACATAAAGAATTGGTTGGCGACCCAATTCCAAATGAAATATTTGAGAGAGGCGCAATATGTTCTTAGGATTCAGATCTTCCGGAA
TCGCGAGAACAAAACGCTAGCTCTATCTCAAGCATCTTATGTAGACAAAATTTTGTCCCGATATTCGATGCAGATATCCAAGAGGGGCTTATTACCCTTCAGGCATGGAG
TTCATCTGTCTAAGGAACAGTGTTCTAAGACACCTCAAGAAGTTGAGGATATGAGACGTATTCCCTATGCCTCTGCAGTAGGTAGCCTAATGTATGATATGTTGTGCATG
AGGCCAGACATTTGCTATGTAGTGGGAATAGTCAGTAGGTACCAATCCAATCCAGGGTTAGACCACTGGACAACATGTAAAAATATCCTCAAGTATCTTAGGAGAACGAG
GGACTATACACTTGTATATGGGACTAAGGATTTGATCCTTACTGAATACACTGATTTTGATTTTCAGACCGATAAGGATTCTAGAAAATCCACATCGGGATCAGTTTTCA
CCCTTAACGGGGGAGCTACAGTATGGCGAAGCATCAAGCAAGGATGCATCGCTGACTCCGCGATAGAGGTTGAGTATGTCGCTACTTGTGAAGCAGCTAAAGAGGTTGTT
TGGCTAAGAAAATTCCTTACTAATTTGGAAGTTGTTCTAAATATGGAATTTCCCATCACCTTATACTGTGACAACAGTGGTGCAGCCAATTCGAAGGAACCTCGTAGGAA
TAAGCGAGACAAGCACATCGAGAGGAAGTATCACCTGACACGAGAAATAGTGCAACGAGAAGATGTGACAGTCATGAAGATCGCTTCGGAGCACAACATTGCTGATCCGT
TTACAAAGACACTCACGACTCAAGTGTTCGAGGGTCATCTGGAGAGTCTAGGTCTACGAGACATGTACATAGGCTAA
mRNA sequenceShow/hide mRNA sequence
ATGCACAGACTCAGGCGAAGGAAGCTCGGAGATCTTGTCGAGGGAAACAGATCTGGCCGGCGACTGGCGAGGGAGGGAAGCTCGGAGATCTGGCTGAGGGAAGCACAGGG
GAAGGAAACTCAGACGAAGGAAGCTCGGTGGAAGGTCAGATGGACAACAGACGGTGGCAAACGGGGGAAGAGGGGTCGACGGCGACAAACGGCGAAGAGGGAGAGTGAGA
GAGAGAGGGATGATTATTCGAGGTATGGGTATTTATACCTAATGCATCATAAGTCTGAGGCTCTTGAAAAGTTCAAAGAGTATAAGGCTGAAGTAGAGAATGCATTAAGG
AAAACCATTAAAACACTTCGATCCGATCGAGGTGGAGAGTATATGGATTTTAGATTCCAAGACTATATGATAGAACATGGAATTAAATCTAAACTCTCAGCACCTAATAC
ACCACAGCAAATTGTAGAGACTGCAGTTCAAATCTTGAACACTGTTCCATCAAAGAGTGTTTCAGAAATACCTTTTGAATTATGGAAGGGGCATAAACCTAGTTTACAAC
ACTTCAGGATTTGGGATTGTCTGGCACATTGTGCTAGTGACAAACCCAAAGAAACTGTAATCTCGTTCAAGATTATGCCAATTTGTTGGCTATCCCAAAGAAACGAGAGG
TGGTCTTTTCTTCAACCCACAAGAAAACAAGGTGATTGTATCGACAAACGCCACTTTCTTGGAGGAAGATCATATGAGGAACCAAAACCACGTAGTAAATTAGTGCTAAA
TGAAGCTACAGATGAACCAACAAGAGTTGTTGATCAAGCTGGACCTTCATCAAGAGTTGATGGAGAAGCCAGCACCTCAAGTCAGTCTAGTCCTTCTCAATCGTTGGGAA
TGCCTCGACGCAGTGGGAGGGTTATTTCCCAACCTGACCGCTACTTGGGTTTAGTTGAAACTCAAGTCGTCATACCTGATGACGGCGTAGAAGATCCATTGTCTTATAGA
CATGCAATGAATGACGTAGACAAAGACCAATGGATCAAAGCCATGGACCTTGAAATAGAGTCAATGGACTTCAATTCAGTGTGGGAACTTGTAGACCAACCTGATGGGCC
CGAAGGGTTCATAACCCAAGGTCAGAAGCGAAAAGTTTGCAAGCTCAATCGATCCATTTATGGGTTGAAACAAGCATCCAGATCTTGGACTATAAGATTTGATACTGCGA
TCAAGTCTTATGGTTTTGACCAAAACATTGATAAGCCTTGTGTTTACAAGAGGATCATCAACGACAAAGTATCTTTCTTAGTACTTTATGTGGATGATATCATACTCATT
CGGAATGATGTAGGATACCTTACTAACATAAAGAATTGGTTGGCGACCCAATTCCAAATGAAATATTTGAGAGAGGCGCAATATGTTCTTAGGATTCAGATCTTCCGGAA
TCGCGAGAACAAAACGCTAGCTCTATCTCAAGCATCTTATGTAGACAAAATTTTGTCCCGATATTCGATGCAGATATCCAAGAGGGGCTTATTACCCTTCAGGCATGGAG
TTCATCTGTCTAAGGAACAGTGTTCTAAGACACCTCAAGAAGTTGAGGATATGAGACGTATTCCCTATGCCTCTGCAGTAGGTAGCCTAATGTATGATATGTTGTGCATG
AGGCCAGACATTTGCTATGTAGTGGGAATAGTCAGTAGGTACCAATCCAATCCAGGGTTAGACCACTGGACAACATGTAAAAATATCCTCAAGTATCTTAGGAGAACGAG
GGACTATACACTTGTATATGGGACTAAGGATTTGATCCTTACTGAATACACTGATTTTGATTTTCAGACCGATAAGGATTCTAGAAAATCCACATCGGGATCAGTTTTCA
CCCTTAACGGGGGAGCTACAGTATGGCGAAGCATCAAGCAAGGATGCATCGCTGACTCCGCGATAGAGGTTGAGTATGTCGCTACTTGTGAAGCAGCTAAAGAGGTTGTT
TGGCTAAGAAAATTCCTTACTAATTTGGAAGTTGTTCTAAATATGGAATTTCCCATCACCTTATACTGTGACAACAGTGGTGCAGCCAATTCGAAGGAACCTCGTAGGAA
TAAGCGAGACAAGCACATCGAGAGGAAGTATCACCTGACACGAGAAATAGTGCAACGAGAAGATGTGACAGTCATGAAGATCGCTTCGGAGCACAACATTGCTGATCCGT
TTACAAAGACACTCACGACTCAAGTGTTCGAGGGTCATCTGGAGAGTCTAGGTCTACGAGACATGTACATAGGCTAA
Protein sequenceShow/hide protein sequence
MHRLRRRKLGDLVEGNRSGRRLAREGSSEIWLREAQGKETQTKEARWKVRWTTDGGKRGKRGRRRQTAKRESERERDDYSRYGYLYLMHHKSEALEKFKEYKAEVENALR
KTIKTLRSDRGGEYMDFRFQDYMIEHGIKSKLSAPNTPQQIVETAVQILNTVPSKSVSEIPFELWKGHKPSLQHFRIWDCLAHCASDKPKETVISFKIMPICWLSQRNER
WSFLQPTRKQGDCIDKRHFLGGRSYEEPKPRSKLVLNEATDEPTRVVDQAGPSSRVDGEASTSSQSSPSQSLGMPRRSGRVISQPDRYLGLVETQVVIPDDGVEDPLSYR
HAMNDVDKDQWIKAMDLEIESMDFNSVWELVDQPDGPEGFITQGQKRKVCKLNRSIYGLKQASRSWTIRFDTAIKSYGFDQNIDKPCVYKRIINDKVSFLVLYVDDIILI
RNDVGYLTNIKNWLATQFQMKYLREAQYVLRIQIFRNRENKTLALSQASYVDKILSRYSMQISKRGLLPFRHGVHLSKEQCSKTPQEVEDMRRIPYASAVGSLMYDMLCM
RPDICYVVGIVSRYQSNPGLDHWTTCKNILKYLRRTRDYTLVYGTKDLILTEYTDFDFQTDKDSRKSTSGSVFTLNGGATVWRSIKQGCIADSAIEVEYVATCEAAKEVV
WLRKFLTNLEVVLNMEFPITLYCDNSGAANSKEPRRNKRDKHIERKYHLTREIVQREDVTVMKIASEHNIADPFTKTLTTQVFEGHLESLGLRDMYIG