; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0036124 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0036124
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr3:39596069..39605238
RNA-Seq ExpressionLag0036124
SyntenyLag0036124
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032431.1 retrotransposon protein, putative, Ty3-gypsy subclass [Cucumis melo var. makuwa]1.3e-16236.4Show/hide
Query:  RGLGYRFCRLVYIVDLRYVH-GLSTSIRSRQIMPPRARGRGKGR-GRGRGRGRTTPK-----QVVPPVG-----------------------QRENLPEN
        + L +RFCRL Y V L +V   L  S+   + MPPR   R  GR GRG+G GR  P+     Q   P                         Q++  P  
Subjt:  RGLGYRFCRLVYIVDLRYVH-GLSTSIRSRQIMPPRARGRGKGR-GRGRGRGRTTPK-----QVVPPVG-----------------------QRENLPEN

Query:  PQVEQPGEQSGPAAQVPQNNEAALSWETRCIRDFRKWDLCSFDGASRDPTVAELWLSSIETVFRHMNCLEDQKVYCVVFLLRDNALIWWQSIERTIDVSN
        P    P   S P    PQ     LS E + +RDFRK++  +FDG+  DPT A LWLSS+ET+FR+M C EDQK+ C VF+L D  + WW++ ER +    
Subjt:  PQVEQPGEQSGPAAQVPQNNEAALSWETRCIRDFRKWDLCSFDGASRDPTVAELWLSSIETVFRHMNCLEDQKVYCVVFLLRDNALIWWQSIERTIDVSN

Query:  GPVTWTQFRGEFFRKCYPAASRFKKQAEFVALTQGNRTVEEYETEFTRLSLFTPALVAVEADKVERFIPGLKEEIQGSVAAHEPQDYATAVRVAELIDRR
          +TW Q +  F+ K + A+ R  K+ EF+ L QG+ TVE+Y+ EF  LS F P ++A EA + ++F+ GL+ +IQG V A  P  +A A+R+A  +  +
Subjt:  GPVTWTQFRGEFFRKCYPAASRFKKQAEFVALTQGNRTVEEYETEFTRLSLFTPALVAVEADKVERFIPGLKEEIQGSVAAHEPQDYATAVRVAELIDRR

Query:  PSVVSSE-----PSSGYKR-------------------------------------------------RCV-----------------------------
            SS+      +SG KR                                                 RC+                             
Subjt:  PSVVSSE-----PSSGYKR-------------------------------------------------RCV-----------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------------------PEDLTGLPPAGVVDFCIELEPETNPISKAPYRMAPAELKELKVQIQELLDKGY
                                                       PE+L GLPP   V+F IELEP T PIS+APYRM PAELKELKVQ+QELLDKG+
Subjt:  -----------------------------------------------PEDLTGLPPAGVVDFCIELEPETNPISKAPYRMAPAELKELKVQIQELLDKGY

Query:  IRPSVSPWGAPVL-------------------------------IDDLFDQLQGAAMFSKIDLRSGYHQLRIREDDISKTAFRTSF---------HGLDE
        IRPSVSPWGAPVL                               IDDLFDQLQGA +FSKIDLRSGYHQLRI++ D+ KT FR+ +          GL  
Subjt:  IRPSVSPWGAPVL-------------------------------IDDLFDQLQGAAMFSKIDLRSGYHQLRIREDDISKTAFRTSF---------HGLDE

Query:  Q----------------------------------------------------------------------GHVVLKQGVTVDPAKIEEVTNWSRPTSVS
                                                                               GHVV K GV+VDPAKIE VT W+RP++VS
Subjt:  Q----------------------------------------------------------------------GHVVLKQGVTVDPAKIEEVTNWSRPTSVS

Query:  EIRSFLGLAGYYHRFVQGFSSIVAPLTQLTRKGTSFLWNEACEASFQELKQRLVSAPVLTVPDGSGEFVIYSNASKKGLGCVLMHHDYAARRLEEANSVL
        E+RSFLGLAGYY RFV+ FS I  PLTQLTRKG  F+W++ACE SFQ LKQ+LV+APVLTVPDGSG FVIYS+ASKKGLGCVLM                
Subjt:  EIRSFLGLAGYYHRFVQGFSSIVAPLTQLTRKGTSFLWNEACEASFQELKQRLVSAPVLTVPDGSGEFVIYSNASKKGLGCVLMHHDYAARRLEEANSVL

Query:  QQNWEQNCHITTRAYRSQRRDAATLASRRCVISLTETRVLKAASRRCLDSVSTLRRKKQNIDLDRDIKLVNLHENQLYSGEEPNTYVSHQLKDYEKNYLT
        QQ                                                                 K+V               Y S QLK +E+NY T
Subjt:  QQNWEQNCHITTRAYRSQRRDAATLASRRCVISLTETRVLKAASRRCLDSVSTLRRKKQNIDLDRDIKLVNLHENQLYSGEEPNTYVSHQLKDYEKNYLT

Query:  HDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYLFTQKELNMRQRRWLELVKDYGCDIQYHLGKANVVADALSRKTAHVSALITTQAHILDDLERAEI
        HDLELAAVVFALKIWRHYLYGEKIQIF DHKSLKY FT KELNMRQRRWLELVKDY C+I YH GKANVVADALSRK +H +ALIT QA +  D+ERAEI
Subjt:  HDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYLFTQKELNMRQRRWLELVKDYGCDIQYHLGKANVVADALSRKTAHVSALITTQAHILDDLERAEI

Query:  MIAVGKVTARLAQLTVKSSLRQRIIDEQPSDPDLVVKRNQVEA
          +VG VT +LAQLTV+ +LRQRII  Q +DP LV KR   EA
Subjt:  MIAVGKVTARLAQLTVKSSLRQRIIDEQPSDPDLVVKRNQVEA

KAA0040695.1 pol protein [Cucumis melo var. makuwa]7.6e-16335.99Show/hide
Query:  RGLGYRFCRLVYIVDLRYVH-GLSTSIRSRQIMPPRARGRGKGR-GRGRGRGRTTPKQVVPPVGQRENLPENP---------------QVEQPGEQSGPA
        + L +RFCRL Y V + +V   L  S+   + MPPR   R  GR GRGRG GR  P+  V PV Q  + P  P                + Q  EQ  PA
Subjt:  RGLGYRFCRLVYIVDLRYVH-GLSTSIRSRQIMPPRARGRGKGR-GRGRGRGRTTPKQVVPPVGQRENLPENP---------------QVEQPGEQSGPA

Query:  AQVPQNNEA-----------------------ALSWETRCIRDFRKWDLCSFDGASRDPTVAELWLSSIETVFRHMNCLEDQKVYCVVFLLRDNALIWWQ
        +  P  + A                        LS E + +RDFRK++  +FDG+  DPT A++WLSS+ET+FR+M C EDQKV C VF+L D    WW+
Subjt:  AQVPQNNEA-----------------------ALSWETRCIRDFRKWDLCSFDGASRDPTVAELWLSSIETVFRHMNCLEDQKVYCVVFLLRDNALIWWQ

Query:  SIERTIDVSNGPVTWTQFRGEFFRKCYPAASRFKKQAEFVALTQGNRTVEEYETEFTRLSLFTPALVAVEADKVERFIPGLKEEIQGSVAAHEPQDYATA
        + ER +      +TW QF+  F+ K + A+ R  KQ EF+ L QG+ TVE+Y+ EF  LS F P ++A EA + ++F+ GL+ +IQG V A  P  +A A
Subjt:  SIERTIDVSNGPVTWTQFRGEFFRKCYPAASRFKKQAEFVALTQGNRTVEEYETEFTRLSLFTPALVAVEADKVERFIPGLKEEIQGSVAAHEPQDYATA

Query:  VRVAELIDRRPSVVSSE-----PSSGYKR-------------------------------------------------RCV-------------------
        +R+A  +  +    SS+      +SG KR                                                 RC+                   
Subjt:  VRVAELIDRRPSVVSSE-----PSSGYKR-------------------------------------------------RCV-------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------PEDLTGLPPAGVVDFCIEL
                                                                                         P++L GLPP   V+F IEL
Subjt:  ---------------------------------------------------------------------------------PEDLTGLPPAGVVDFCIEL

Query:  EPETNPISKAPYRMAPAELKELKVQIQELLDKGYIRPSVSPWGAPVL-------------------------------IDDLFDQLQGAAMFSKIDLRSG
        EP T PIS+APYRMAPAELKELKVQ+QELLDKG+IRPSVSPWGAPVL                               IDDLFDQLQGA +FS+IDLRSG
Subjt:  EPETNPISKAPYRMAPAELKELKVQIQELLDKGYIRPSVSPWGAPVL-------------------------------IDDLFDQLQGAAMFSKIDLRSG

Query:  YHQLRIREDDISKTAFRTSF---------HGLDEQ-----------------------------------------------------------------
        YHQLRI+++D+ KTAFR+ +          GL                                                                    
Subjt:  YHQLRIREDDISKTAFRTSF---------HGLDEQ-----------------------------------------------------------------

Query:  -----GHVVLKQGVTVDPAKIEEVTNWSRPTSVSEIRSFLGLAGYYHRFVQGFSSIVAPLTQLTRKGTSFLWNEACEASFQELKQRLVSAPVLTVPDGSG
             GHVV K GV+VDPAKIE VT W+RP++VSE+RSFLGLAGYY +FV+ FS I  PLTQLTRKG  F+W++ACE SFQ LKQ+LV+APVLTV DGSG
Subjt:  -----GHVVLKQGVTVDPAKIEEVTNWSRPTSVSEIRSFLGLAGYYHRFVQGFSSIVAPLTQLTRKGTSFLWNEACEASFQELKQRLVSAPVLTVPDGSG

Query:  EFVIYSNASKKGLGCVLMHHDYAARRLEEANSVLQQNWEQNCHITTRAYRSQRRDAATLASRRCVISLTETRVLKAASRRCLDSVSTLRRKKQNIDLDRD
         FVIYS+ASKKGLGCVLM                QQ                                                                
Subjt:  EFVIYSNASKKGLGCVLMHHDYAARRLEEANSVLQQNWEQNCHITTRAYRSQRRDAATLASRRCVISLTETRVLKAASRRCLDSVSTLRRKKQNIDLDRD

Query:  IKLVNLHENQLYSGEEPNTYVSHQLKDYEKNYLTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYLFTQKELNMRQRRWLELVKDYGCDIQYHLGK
         K+V               Y S QLK +E+NY THDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKY FTQKELNMRQRRWLELVKDY C+I YH GK
Subjt:  IKLVNLHENQLYSGEEPNTYVSHQLKDYEKNYLTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYLFTQKELNMRQRRWLELVKDYGCDIQYHLGK

Query:  ANVVADALSRKTAHVSALITTQAHILDDLERAEIMIAVGKVTARLAQLTVKSSLRQRIIDEQPSDPDLVVKRNQVEAG
        ANVVADALSRK +H +ALIT QA +  DLERAEI ++VG VT +LAQLTV+ +LRQRIID Q +DP LV KR   EAG
Subjt:  ANVVADALSRKTAHVSALITTQAHILDDLERAEIMIAVGKVTARLAQLTVKSSLRQRIIDEQPSDPDLVVKRNQVEAG

KAA0046094.1 gag protease polyprotein [Cucumis melo var. makuwa]2.2e-16236.13Show/hide
Query:  RGLGYRFCRLVYIVDLRYVH-GLSTSIRSRQIMPPRARGRGKGR-GRGRGRGRTTPKQVVPPVGQRENLPENP---------------QVEQPGEQSGPA
        + L +RF RL Y V L +V   L  S+   + MPPR   R  GR GRGRG GR  P+  V PV Q  + P  P                + Q  EQ  PA
Subjt:  RGLGYRFCRLVYIVDLRYVH-GLSTSIRSRQIMPPRARGRGKGR-GRGRGRGRTTPKQVVPPVGQRENLPENP---------------QVEQPGEQSGPA

Query:  AQVPQNNEA-------------------ALSWETRCIRDFRKWDLCSFDGASRDPTVAELWLSSIETVFRHMNCLEDQKVYCVVFLLRDNALIWWQSIER
        +  P    A                    LS E + +RDFRK++  +FDG+  DPT A++WLSS+ET+FR+M C EDQKV C VF+L D    WW++ ER
Subjt:  AQVPQNNEA-------------------ALSWETRCIRDFRKWDLCSFDGASRDPTVAELWLSSIETVFRHMNCLEDQKVYCVVFLLRDNALIWWQSIER

Query:  TIDVSNGPVTWTQFRGEFFRKCYPAASRFKKQAEFVALTQGNRTVEEYETEFTRLSLFTPALVAVEADKVERFIPGLKEEIQGSVAAHEPQDYATAVRVA
         +      +TW QF+  F+ K + A+ R  K+ EF+ L QG+ TVE+Y+ EF  LS F P ++A EA +V++F+ GL+ +IQG V A  P  +A A+R+A
Subjt:  TIDVSNGPVTWTQFRGEFFRKCYPAASRFKKQAEFVALTQGNRTVEEYETEFTRLSLFTPALVAVEADKVERFIPGLKEEIQGSVAAHEPQDYATAVRVA

Query:  ELIDRRPSVVSSE-----PSSGYKR----------------------------------------------RC---------------------------
          +  +    SS+      +SG KR                                              RC                           
Subjt:  ELIDRRPSVVSSE-----PSSGYKR----------------------------------------------RC---------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------VPEDLTGLPPAGVVDFCIE
                                                                                          PE+L GLPP   V+F IE
Subjt:  ---------------------------------------------------------------------------------VPEDLTGLPPAGVVDFCIE

Query:  LEPETNPISKAPYRMAPAELKELKVQIQELLDKGYIRPSVSPWGAPVL-------------------------------IDDLFDQLQGAAMFSKIDLRS
        LEP T PIS+APYRMAPAELKELKVQ+QELLDKG+IRPSVSPWGAPVL                               IDDLFDQLQGA +FSKIDLRS
Subjt:  LEPETNPISKAPYRMAPAELKELKVQIQELLDKGYIRPSVSPWGAPVL-------------------------------IDDLFDQLQGAAMFSKIDLRS

Query:  GYHQLRIREDDISKTAFRTSF---------HGLDEQ----------------------------------------------------------------
        GYHQLRI+++D+ KTAF + +          GL                                                                   
Subjt:  GYHQLRIREDDISKTAFRTSF---------HGLDEQ----------------------------------------------------------------

Query:  ------GHVVLKQGVTVDPAKIEEVTNWSRPTSVSEIRSFLGLAGYYHRFVQGFSSIVAPLTQLTRKGTSFLWNEACEASFQELKQRLVSAPVLTVPDGS
              GHVV K GV+VDPAKIE VT W+RP++VSE+RSFLGLAGYY RFV+ FS I  PLTQLTRKG  F+W++ACE SFQ LKQ+LV+APVLTVPDGS
Subjt:  ------GHVVLKQGVTVDPAKIEEVTNWSRPTSVSEIRSFLGLAGYYHRFVQGFSSIVAPLTQLTRKGTSFLWNEACEASFQELKQRLVSAPVLTVPDGS

Query:  GEFVIYSNASKKGLGCVLMHHDYAARRLEEANSVLQQNWEQNCHITTRAYRSQRRDAATLASRRCVISLTETRVLKAASRRCLDSVSTLRRKKQNIDLDR
        G FVIYS+ASKKGLGCVLM                QQ                                                               
Subjt:  GEFVIYSNASKKGLGCVLMHHDYAARRLEEANSVLQQNWEQNCHITTRAYRSQRRDAATLASRRCVISLTETRVLKAASRRCLDSVSTLRRKKQNIDLDR

Query:  DIKLVNLHENQLYSGEEPNTYVSHQLKDYEKNYLTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYLFTQKELNMRQRRWLELVKDYGCDIQYHLG
          K+V               Y S QLK +E+NY THDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKY FTQKELNMRQRRWLELVKDY C+I YH G
Subjt:  DIKLVNLHENQLYSGEEPNTYVSHQLKDYEKNYLTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYLFTQKELNMRQRRWLELVKDYGCDIQYHLG

Query:  KANVVADALSRKTAHVSALITTQAHILDDLERAEIMIAVGKVTARLAQLTVKSSLRQRIIDEQPSDPDLVVKRNQVEAG
        KANVVADALSRK +H +ALIT QA +  DLER EI ++VG VT +LAQLTV+ +LRQRIID Q +DP LV KR   EAG
Subjt:  KANVVADALSRKTAHVSALITTQAHILDDLERAEIMIAVGKVTARLAQLTVKSSLRQRIIDEQPSDPDLVVKRNQVEAG

KAA0058399.1 pol protein [Cucumis melo var. makuwa]1.1e-16636.84Show/hide
Query:  RGLGYRFCRLVYIVDLRYVH-GLSTSIRSRQIMPPRARGRGKGR-GRGRGRGRTTPKQVVPPVGQRENLPENP---------------QVEQPGEQSGPA
        + L +RFCRL Y V L +V   L  S+   + MPPR   R  GR GRGRG GR  P+  V PV Q  + P  P                + Q  EQ  PA
Subjt:  RGLGYRFCRLVYIVDLRYVH-GLSTSIRSRQIMPPRARGRGKGR-GRGRGRGRTTPKQVVPPVGQRENLPENP---------------QVEQPGEQSGPA

Query:  AQVPQNNEA-----------------------ALSWETRCIRDFRKWDLCSFDGASRDPTVAELWLSSIETVFRHMNCLEDQKVYCVVFLLRDNALIWWQ
        +  P    A                        LS E + +RDFRK++  +FDG+  DPT A++WLSS+ET+FR+M C EDQK+ C VF+L D    WW+
Subjt:  AQVPQNNEA-----------------------ALSWETRCIRDFRKWDLCSFDGASRDPTVAELWLSSIETVFRHMNCLEDQKVYCVVFLLRDNALIWWQ

Query:  SIERTIDVSNGPVTWTQFRGEFFRKCYPAASRFKKQAEFVALTQGNRTVEEYETEFTRLSLFTPALVAVEADKVERFIPGLKEEIQGSVAAHEPQDYATA
        + ER +      +TW QF+  F+ K + A+ R  K+ EF+ L QG+ TVE+Y+ EF  LS F P ++A EA + ++F+ GL+ +IQG V A  P  +A A
Subjt:  SIERTIDVSNGPVTWTQFRGEFFRKCYPAASRFKKQAEFVALTQGNRTVEEYETEFTRLSLFTPALVAVEADKVERFIPGLKEEIQGSVAAHEPQDYATA

Query:  VRVAELIDRRPSVVSSE-----PSSGYKR-------------------------------------------------RCV-------------------
        +R+A  +  +    SS+      +SG KR                                                 RC+                   
Subjt:  VRVAELIDRRPSVVSSE-----PSSGYKR-------------------------------------------------RCV-------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------------PEDLTGLPPAGVVDFCIELEPETNPISKAPYRMAPAEL
                                                                      PE+L GLPP   V+F IELEP T PIS+APYRMAPAEL
Subjt:  --------------------------------------------------------------PEDLTGLPPAGVVDFCIELEPETNPISKAPYRMAPAEL

Query:  KELKVQIQELLDKGYIRPSVSPWGAPVL-------------------------------IDDLFDQLQGAAMFSKIDLRSGYHQLRIREDDISKTAFRTS
        KELKVQ+QELLDKG+IRPSVSPWGAPVL                               IDDLFDQLQGA +FSKIDLRSGYHQLRI+++D+ KTAFR+ 
Subjt:  KELKVQIQELLDKGYIRPSVSPWGAPVL-------------------------------IDDLFDQLQGAAMFSKIDLRSGYHQLRIREDDISKTAFRTS

Query:  F---------HGLDEQ----------------------------------------------------------------------GHVVLKQGVTVDPA
        +          GL                                                                         GHVV K GV+VDPA
Subjt:  F---------HGLDEQ----------------------------------------------------------------------GHVVLKQGVTVDPA

Query:  KIEEVTNWSRPTSVSEIRSFLGLAGYYHRFVQGFSSIVAPLTQLTRKGTSFLWNEACEASFQELKQRLVSAPVLTVPDGSGEFVIYSNASKKGLGCVLMH
        KIE VT W+RP++VSE+RSFLGLAGYY RFV+ FS I  PLTQLTRKG  F+W++ACE SFQ LKQ+LV+APVLTVPDGSG FVIYS+ASKKGLGCVLM 
Subjt:  KIEEVTNWSRPTSVSEIRSFLGLAGYYHRFVQGFSSIVAPLTQLTRKGTSFLWNEACEASFQELKQRLVSAPVLTVPDGSGEFVIYSNASKKGLGCVLMH

Query:  HDYAARRLEEANSVLQQNWEQNCHITTRAYRSQRRDAATLASRRCVISLTETRVLKAASRRCLDSVSTLRRKKQNIDLDRDIKLVNLHENQLYSGEEPNT
                       QQ                                                                 K+V               
Subjt:  HDYAARRLEEANSVLQQNWEQNCHITTRAYRSQRRDAATLASRRCVISLTETRVLKAASRRCLDSVSTLRRKKQNIDLDRDIKLVNLHENQLYSGEEPNT

Query:  YVSHQLKDYEKNYLTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYLFTQKELNMRQRRWLELVKDYGCDIQYHLGKANVVADALSRKTAHVSALI
        Y S QLK +EKNY THDLELAAVVFALKIWRHYLYGE IQIFTDHKSLKY FTQKELNMRQRRWLELVKDY C+I YH GKANVVADALSRK +H +ALI
Subjt:  YVSHQLKDYEKNYLTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYLFTQKELNMRQRRWLELVKDYGCDIQYHLGKANVVADALSRKTAHVSALI

Query:  TTQAHILDDLERAEIMIAVGKVTARLAQLTVKSSLRQRIIDEQPSDPDLVVKRNQVEAG
        T QA +  DLERAEI ++VG VT +LAQLTV+ +LRQRIID Q +DP LV KR   EAG
Subjt:  TTQAHILDDLERAEIMIAVGKVTARLAQLTVKSSLRQRIIDEQPSDPDLVVKRNQVEAG

KAA0061201.1 pol protein [Cucumis melo var. makuwa]2.6e-16338.45Show/hide
Query:  MPPR---ARGRGKGRGRGRGRGRTTPKQVVPPVGQRENLPEN--PQVEQPGEQSGPAAQVP----QNNEAA---------------------LSWETRCI
        MPPR    RG G+G GRG GRG+     V P V     + +     +EQ  +    AA  P    Q N+AA                     LS E + +
Subjt:  MPPR---ARGRGKGRGRGRGRGRTTPKQVVPPVGQRENLPEN--PQVEQPGEQSGPAAQVP----QNNEAA---------------------LSWETRCI

Query:  RDFRKWDLCSFDGASRDPTVAELWLSSIETVFRHMNCLEDQKVYCVVFLLRDNALIWWQSIERTIDVSNGPVTWTQFRGEFFRKCYPAASRFKKQAEFVA
        RDFRK++  +FDG+  +PT A++WL+SIET+FR+M C EDQKV C VF L D    WW++ ER +      +TW QF+  F+ K + A  +  K  EF+ 
Subjt:  RDFRKWDLCSFDGASRDPTVAELWLSSIETVFRHMNCLEDQKVYCVVFLLRDNALIWWQSIERTIDVSNGPVTWTQFRGEFFRKCYPAASRFKKQAEFVA

Query:  LTQGNRTVEEYETEFTRLSLFTPALVAVEADKVERFIPGLKEEIQGSVAAHEPQDYATAVRV--------------------------------------
        L QG+ TVE+Y+ EF  LS FTP +V  EA + E+F+ GL+ ++QG V A  P  +A A+R+                                      
Subjt:  LTQGNRTVEEYETEFTRLSLFTPALVAVEADKVERFIPGLKEEIQGSVAAHEPQDYATAVRV--------------------------------------

Query:  --------------------------AELIDRRP----------------------------------------------------SVVSS---------
                                  A++  R+P                                                    S +SS         
Subjt:  --------------------------AELIDRRP----------------------------------------------------SVVSS---------

Query:  -------------------------------------------EPSSG--YKRR-----CV---------------------------------------
                                                    P SG  +K R     C+                                       
Subjt:  -------------------------------------------EPSSG--YKRR-----CV---------------------------------------

Query:  --------PEDLTGLPPAGVVDFCIELEPETNPISKAPYRMAPAELKELKVQIQELLDKGYIRPSVSPWGAPVL-------IDDLFDQLQGAAMFSKIDL
                P++L GLPP   VDF IELE  T PIS+APYRMAPAELKELKVQ+QELLDKG+IRPSVSPWGAPVL       IDDLFDQLQGA +FSKIDL
Subjt:  --------PEDLTGLPPAGVVDFCIELEPETNPISKAPYRMAPAELKELKVQIQELLDKGYIRPSVSPWGAPVL-------IDDLFDQLQGAAMFSKIDL

Query:  RSGYHQLRIREDDISKTAFRTSFHGLDE------------------------------------------------------------------------
        RSGYHQLRIR+ DI KTAFR S +G DE                                                                        
Subjt:  RSGYHQLRIREDDISKTAFRTSFHGLDE------------------------------------------------------------------------

Query:  --------QGHVVLKQGVTVDPAKIEEVTNWSRPTSVSEIRSFLGLAGYYHRFVQGFSSIVAPLTQLTRKGTSFLWNEACEASFQELKQRLVSAPVLTVP
                 GHVV  +GV+VDPAKIE VTNW RP++VSEIRSFLGLAGYY RFV+ FS I +PLTQLTRKG  F+W+ ACE SFQELKQ+LV+APVLTVP
Subjt:  --------QGHVVLKQGVTVDPAKIEEVTNWSRPTSVSEIRSFLGLAGYYHRFVQGFSSIVAPLTQLTRKGTSFLWNEACEASFQELKQRLVSAPVLTVP

Query:  DGSGEFVIYSNASKKGLGCVLMHHDYAARRLEEANSVLQQNWEQNCHITTRAYRSQRRDAATLASRRCVISLTETRVLKAASRRCLDSVSTLRRKKQNID
        DGSG FVIYS+ASKKGLGCVLM                QQ                                                            
Subjt:  DGSGEFVIYSNASKKGLGCVLMHHDYAARRLEEANSVLQQNWEQNCHITTRAYRSQRRDAATLASRRCVISLTETRVLKAASRRCLDSVSTLRRKKQNID

Query:  LDRDIKLVNLHENQLYSGEEPNTYVSHQLKDYEKNYLTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYLFTQKELNMRQRRWLELVKDYGCDIQY
             K+V               Y S QLK +E+NY THDLELAAVVFALKIWRHYLYGEKIQI+TDHKSLKY FTQKELNMRQRRWLELVKDY C+I Y
Subjt:  LDRDIKLVNLHENQLYSGEEPNTYVSHQLKDYEKNYLTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYLFTQKELNMRQRRWLELVKDYGCDIQY

Query:  HLGKANVVADALSRKTAHVSALITTQAHILDDLERAEIMIAVGKVTARLAQLTVKSSLRQRIIDEQPSDPDLVVKRNQVEAG
        H GKAN+VADALSRK AH +ALIT Q  +L D ERAEI ++VG+VTA+LAQL+V+ +LRQ+II  Q +DP L  KR  VE G
Subjt:  HLGKANVVADALSRKTAHVSALITTQAHILDDLERAEIMIAVGKVTARLAQLTVKSSLRQRIIDEQPSDPDLVVKRNQVEAG

TrEMBL top hitse value%identityAlignment
A0A5A7SSA3 Reverse transcriptase6.3e-16336.4Show/hide
Query:  RGLGYRFCRLVYIVDLRYVH-GLSTSIRSRQIMPPRARGRGKGR-GRGRGRGRTTPK-----QVVPPVG-----------------------QRENLPEN
        + L +RFCRL Y V L +V   L  S+   + MPPR   R  GR GRG+G GR  P+     Q   P                         Q++  P  
Subjt:  RGLGYRFCRLVYIVDLRYVH-GLSTSIRSRQIMPPRARGRGKGR-GRGRGRGRTTPK-----QVVPPVG-----------------------QRENLPEN

Query:  PQVEQPGEQSGPAAQVPQNNEAALSWETRCIRDFRKWDLCSFDGASRDPTVAELWLSSIETVFRHMNCLEDQKVYCVVFLLRDNALIWWQSIERTIDVSN
        P    P   S P    PQ     LS E + +RDFRK++  +FDG+  DPT A LWLSS+ET+FR+M C EDQK+ C VF+L D  + WW++ ER +    
Subjt:  PQVEQPGEQSGPAAQVPQNNEAALSWETRCIRDFRKWDLCSFDGASRDPTVAELWLSSIETVFRHMNCLEDQKVYCVVFLLRDNALIWWQSIERTIDVSN

Query:  GPVTWTQFRGEFFRKCYPAASRFKKQAEFVALTQGNRTVEEYETEFTRLSLFTPALVAVEADKVERFIPGLKEEIQGSVAAHEPQDYATAVRVAELIDRR
          +TW Q +  F+ K + A+ R  K+ EF+ L QG+ TVE+Y+ EF  LS F P ++A EA + ++F+ GL+ +IQG V A  P  +A A+R+A  +  +
Subjt:  GPVTWTQFRGEFFRKCYPAASRFKKQAEFVALTQGNRTVEEYETEFTRLSLFTPALVAVEADKVERFIPGLKEEIQGSVAAHEPQDYATAVRVAELIDRR

Query:  PSVVSSE-----PSSGYKR-------------------------------------------------RCV-----------------------------
            SS+      +SG KR                                                 RC+                             
Subjt:  PSVVSSE-----PSSGYKR-------------------------------------------------RCV-----------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------------------PEDLTGLPPAGVVDFCIELEPETNPISKAPYRMAPAELKELKVQIQELLDKGY
                                                       PE+L GLPP   V+F IELEP T PIS+APYRM PAELKELKVQ+QELLDKG+
Subjt:  -----------------------------------------------PEDLTGLPPAGVVDFCIELEPETNPISKAPYRMAPAELKELKVQIQELLDKGY

Query:  IRPSVSPWGAPVL-------------------------------IDDLFDQLQGAAMFSKIDLRSGYHQLRIREDDISKTAFRTSF---------HGLDE
        IRPSVSPWGAPVL                               IDDLFDQLQGA +FSKIDLRSGYHQLRI++ D+ KT FR+ +          GL  
Subjt:  IRPSVSPWGAPVL-------------------------------IDDLFDQLQGAAMFSKIDLRSGYHQLRIREDDISKTAFRTSF---------HGLDE

Query:  Q----------------------------------------------------------------------GHVVLKQGVTVDPAKIEEVTNWSRPTSVS
                                                                               GHVV K GV+VDPAKIE VT W+RP++VS
Subjt:  Q----------------------------------------------------------------------GHVVLKQGVTVDPAKIEEVTNWSRPTSVS

Query:  EIRSFLGLAGYYHRFVQGFSSIVAPLTQLTRKGTSFLWNEACEASFQELKQRLVSAPVLTVPDGSGEFVIYSNASKKGLGCVLMHHDYAARRLEEANSVL
        E+RSFLGLAGYY RFV+ FS I  PLTQLTRKG  F+W++ACE SFQ LKQ+LV+APVLTVPDGSG FVIYS+ASKKGLGCVLM                
Subjt:  EIRSFLGLAGYYHRFVQGFSSIVAPLTQLTRKGTSFLWNEACEASFQELKQRLVSAPVLTVPDGSGEFVIYSNASKKGLGCVLMHHDYAARRLEEANSVL

Query:  QQNWEQNCHITTRAYRSQRRDAATLASRRCVISLTETRVLKAASRRCLDSVSTLRRKKQNIDLDRDIKLVNLHENQLYSGEEPNTYVSHQLKDYEKNYLT
        QQ                                                                 K+V               Y S QLK +E+NY T
Subjt:  QQNWEQNCHITTRAYRSQRRDAATLASRRCVISLTETRVLKAASRRCLDSVSTLRRKKQNIDLDRDIKLVNLHENQLYSGEEPNTYVSHQLKDYEKNYLT

Query:  HDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYLFTQKELNMRQRRWLELVKDYGCDIQYHLGKANVVADALSRKTAHVSALITTQAHILDDLERAEI
        HDLELAAVVFALKIWRHYLYGEKIQIF DHKSLKY FT KELNMRQRRWLELVKDY C+I YH GKANVVADALSRK +H +ALIT QA +  D+ERAEI
Subjt:  HDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYLFTQKELNMRQRRWLELVKDYGCDIQYHLGKANVVADALSRKTAHVSALITTQAHILDDLERAEI

Query:  MIAVGKVTARLAQLTVKSSLRQRIIDEQPSDPDLVVKRNQVEA
          +VG VT +LAQLTV+ +LRQRII  Q +DP LV KR   EA
Subjt:  MIAVGKVTARLAQLTVKSSLRQRIIDEQPSDPDLVVKRNQVEA

A0A5A7TGX4 Reverse transcriptase3.7e-16335.99Show/hide
Query:  RGLGYRFCRLVYIVDLRYVH-GLSTSIRSRQIMPPRARGRGKGR-GRGRGRGRTTPKQVVPPVGQRENLPENP---------------QVEQPGEQSGPA
        + L +RFCRL Y V + +V   L  S+   + MPPR   R  GR GRGRG GR  P+  V PV Q  + P  P                + Q  EQ  PA
Subjt:  RGLGYRFCRLVYIVDLRYVH-GLSTSIRSRQIMPPRARGRGKGR-GRGRGRGRTTPKQVVPPVGQRENLPENP---------------QVEQPGEQSGPA

Query:  AQVPQNNEA-----------------------ALSWETRCIRDFRKWDLCSFDGASRDPTVAELWLSSIETVFRHMNCLEDQKVYCVVFLLRDNALIWWQ
        +  P  + A                        LS E + +RDFRK++  +FDG+  DPT A++WLSS+ET+FR+M C EDQKV C VF+L D    WW+
Subjt:  AQVPQNNEA-----------------------ALSWETRCIRDFRKWDLCSFDGASRDPTVAELWLSSIETVFRHMNCLEDQKVYCVVFLLRDNALIWWQ

Query:  SIERTIDVSNGPVTWTQFRGEFFRKCYPAASRFKKQAEFVALTQGNRTVEEYETEFTRLSLFTPALVAVEADKVERFIPGLKEEIQGSVAAHEPQDYATA
        + ER +      +TW QF+  F+ K + A+ R  KQ EF+ L QG+ TVE+Y+ EF  LS F P ++A EA + ++F+ GL+ +IQG V A  P  +A A
Subjt:  SIERTIDVSNGPVTWTQFRGEFFRKCYPAASRFKKQAEFVALTQGNRTVEEYETEFTRLSLFTPALVAVEADKVERFIPGLKEEIQGSVAAHEPQDYATA

Query:  VRVAELIDRRPSVVSSE-----PSSGYKR-------------------------------------------------RCV-------------------
        +R+A  +  +    SS+      +SG KR                                                 RC+                   
Subjt:  VRVAELIDRRPSVVSSE-----PSSGYKR-------------------------------------------------RCV-------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------PEDLTGLPPAGVVDFCIEL
                                                                                         P++L GLPP   V+F IEL
Subjt:  ---------------------------------------------------------------------------------PEDLTGLPPAGVVDFCIEL

Query:  EPETNPISKAPYRMAPAELKELKVQIQELLDKGYIRPSVSPWGAPVL-------------------------------IDDLFDQLQGAAMFSKIDLRSG
        EP T PIS+APYRMAPAELKELKVQ+QELLDKG+IRPSVSPWGAPVL                               IDDLFDQLQGA +FS+IDLRSG
Subjt:  EPETNPISKAPYRMAPAELKELKVQIQELLDKGYIRPSVSPWGAPVL-------------------------------IDDLFDQLQGAAMFSKIDLRSG

Query:  YHQLRIREDDISKTAFRTSF---------HGLDEQ-----------------------------------------------------------------
        YHQLRI+++D+ KTAFR+ +          GL                                                                    
Subjt:  YHQLRIREDDISKTAFRTSF---------HGLDEQ-----------------------------------------------------------------

Query:  -----GHVVLKQGVTVDPAKIEEVTNWSRPTSVSEIRSFLGLAGYYHRFVQGFSSIVAPLTQLTRKGTSFLWNEACEASFQELKQRLVSAPVLTVPDGSG
             GHVV K GV+VDPAKIE VT W+RP++VSE+RSFLGLAGYY +FV+ FS I  PLTQLTRKG  F+W++ACE SFQ LKQ+LV+APVLTV DGSG
Subjt:  -----GHVVLKQGVTVDPAKIEEVTNWSRPTSVSEIRSFLGLAGYYHRFVQGFSSIVAPLTQLTRKGTSFLWNEACEASFQELKQRLVSAPVLTVPDGSG

Query:  EFVIYSNASKKGLGCVLMHHDYAARRLEEANSVLQQNWEQNCHITTRAYRSQRRDAATLASRRCVISLTETRVLKAASRRCLDSVSTLRRKKQNIDLDRD
         FVIYS+ASKKGLGCVLM                QQ                                                                
Subjt:  EFVIYSNASKKGLGCVLMHHDYAARRLEEANSVLQQNWEQNCHITTRAYRSQRRDAATLASRRCVISLTETRVLKAASRRCLDSVSTLRRKKQNIDLDRD

Query:  IKLVNLHENQLYSGEEPNTYVSHQLKDYEKNYLTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYLFTQKELNMRQRRWLELVKDYGCDIQYHLGK
         K+V               Y S QLK +E+NY THDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKY FTQKELNMRQRRWLELVKDY C+I YH GK
Subjt:  IKLVNLHENQLYSGEEPNTYVSHQLKDYEKNYLTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYLFTQKELNMRQRRWLELVKDYGCDIQYHLGK

Query:  ANVVADALSRKTAHVSALITTQAHILDDLERAEIMIAVGKVTARLAQLTVKSSLRQRIIDEQPSDPDLVVKRNQVEAG
        ANVVADALSRK +H +ALIT QA +  DLERAEI ++VG VT +LAQLTV+ +LRQRIID Q +DP LV KR   EAG
Subjt:  ANVVADALSRKTAHVSALITTQAHILDDLERAEIMIAVGKVTARLAQLTVKSSLRQRIIDEQPSDPDLVVKRNQVEAG

A0A5A7TSQ8 Reverse transcriptase1.1e-16236.13Show/hide
Query:  RGLGYRFCRLVYIVDLRYVH-GLSTSIRSRQIMPPRARGRGKGR-GRGRGRGRTTPKQVVPPVGQRENLPENP---------------QVEQPGEQSGPA
        + L +RF RL Y V L +V   L  S+   + MPPR   R  GR GRGRG GR  P+  V PV Q  + P  P                + Q  EQ  PA
Subjt:  RGLGYRFCRLVYIVDLRYVH-GLSTSIRSRQIMPPRARGRGKGR-GRGRGRGRTTPKQVVPPVGQRENLPENP---------------QVEQPGEQSGPA

Query:  AQVPQNNEA-------------------ALSWETRCIRDFRKWDLCSFDGASRDPTVAELWLSSIETVFRHMNCLEDQKVYCVVFLLRDNALIWWQSIER
        +  P    A                    LS E + +RDFRK++  +FDG+  DPT A++WLSS+ET+FR+M C EDQKV C VF+L D    WW++ ER
Subjt:  AQVPQNNEA-------------------ALSWETRCIRDFRKWDLCSFDGASRDPTVAELWLSSIETVFRHMNCLEDQKVYCVVFLLRDNALIWWQSIER

Query:  TIDVSNGPVTWTQFRGEFFRKCYPAASRFKKQAEFVALTQGNRTVEEYETEFTRLSLFTPALVAVEADKVERFIPGLKEEIQGSVAAHEPQDYATAVRVA
         +      +TW QF+  F+ K + A+ R  K+ EF+ L QG+ TVE+Y+ EF  LS F P ++A EA +V++F+ GL+ +IQG V A  P  +A A+R+A
Subjt:  TIDVSNGPVTWTQFRGEFFRKCYPAASRFKKQAEFVALTQGNRTVEEYETEFTRLSLFTPALVAVEADKVERFIPGLKEEIQGSVAAHEPQDYATAVRVA

Query:  ELIDRRPSVVSSE-----PSSGYKR----------------------------------------------RC---------------------------
          +  +    SS+      +SG KR                                              RC                           
Subjt:  ELIDRRPSVVSSE-----PSSGYKR----------------------------------------------RC---------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------VPEDLTGLPPAGVVDFCIE
                                                                                          PE+L GLPP   V+F IE
Subjt:  ---------------------------------------------------------------------------------VPEDLTGLPPAGVVDFCIE

Query:  LEPETNPISKAPYRMAPAELKELKVQIQELLDKGYIRPSVSPWGAPVL-------------------------------IDDLFDQLQGAAMFSKIDLRS
        LEP T PIS+APYRMAPAELKELKVQ+QELLDKG+IRPSVSPWGAPVL                               IDDLFDQLQGA +FSKIDLRS
Subjt:  LEPETNPISKAPYRMAPAELKELKVQIQELLDKGYIRPSVSPWGAPVL-------------------------------IDDLFDQLQGAAMFSKIDLRS

Query:  GYHQLRIREDDISKTAFRTSF---------HGLDEQ----------------------------------------------------------------
        GYHQLRI+++D+ KTAF + +          GL                                                                   
Subjt:  GYHQLRIREDDISKTAFRTSF---------HGLDEQ----------------------------------------------------------------

Query:  ------GHVVLKQGVTVDPAKIEEVTNWSRPTSVSEIRSFLGLAGYYHRFVQGFSSIVAPLTQLTRKGTSFLWNEACEASFQELKQRLVSAPVLTVPDGS
              GHVV K GV+VDPAKIE VT W+RP++VSE+RSFLGLAGYY RFV+ FS I  PLTQLTRKG  F+W++ACE SFQ LKQ+LV+APVLTVPDGS
Subjt:  ------GHVVLKQGVTVDPAKIEEVTNWSRPTSVSEIRSFLGLAGYYHRFVQGFSSIVAPLTQLTRKGTSFLWNEACEASFQELKQRLVSAPVLTVPDGS

Query:  GEFVIYSNASKKGLGCVLMHHDYAARRLEEANSVLQQNWEQNCHITTRAYRSQRRDAATLASRRCVISLTETRVLKAASRRCLDSVSTLRRKKQNIDLDR
        G FVIYS+ASKKGLGCVLM                QQ                                                               
Subjt:  GEFVIYSNASKKGLGCVLMHHDYAARRLEEANSVLQQNWEQNCHITTRAYRSQRRDAATLASRRCVISLTETRVLKAASRRCLDSVSTLRRKKQNIDLDR

Query:  DIKLVNLHENQLYSGEEPNTYVSHQLKDYEKNYLTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYLFTQKELNMRQRRWLELVKDYGCDIQYHLG
          K+V               Y S QLK +E+NY THDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKY FTQKELNMRQRRWLELVKDY C+I YH G
Subjt:  DIKLVNLHENQLYSGEEPNTYVSHQLKDYEKNYLTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYLFTQKELNMRQRRWLELVKDYGCDIQYHLG

Query:  KANVVADALSRKTAHVSALITTQAHILDDLERAEIMIAVGKVTARLAQLTVKSSLRQRIIDEQPSDPDLVVKRNQVEAG
        KANVVADALSRK +H +ALIT QA +  DLER EI ++VG VT +LAQLTV+ +LRQRIID Q +DP LV KR   EAG
Subjt:  KANVVADALSRKTAHVSALITTQAHILDDLERAEIMIAVGKVTARLAQLTVKSSLRQRIIDEQPSDPDLVVKRNQVEAG

A0A5A7UY04 Reverse transcriptase5.5e-16736.84Show/hide
Query:  RGLGYRFCRLVYIVDLRYVH-GLSTSIRSRQIMPPRARGRGKGR-GRGRGRGRTTPKQVVPPVGQRENLPENP---------------QVEQPGEQSGPA
        + L +RFCRL Y V L +V   L  S+   + MPPR   R  GR GRGRG GR  P+  V PV Q  + P  P                + Q  EQ  PA
Subjt:  RGLGYRFCRLVYIVDLRYVH-GLSTSIRSRQIMPPRARGRGKGR-GRGRGRGRTTPKQVVPPVGQRENLPENP---------------QVEQPGEQSGPA

Query:  AQVPQNNEA-----------------------ALSWETRCIRDFRKWDLCSFDGASRDPTVAELWLSSIETVFRHMNCLEDQKVYCVVFLLRDNALIWWQ
        +  P    A                        LS E + +RDFRK++  +FDG+  DPT A++WLSS+ET+FR+M C EDQK+ C VF+L D    WW+
Subjt:  AQVPQNNEA-----------------------ALSWETRCIRDFRKWDLCSFDGASRDPTVAELWLSSIETVFRHMNCLEDQKVYCVVFLLRDNALIWWQ

Query:  SIERTIDVSNGPVTWTQFRGEFFRKCYPAASRFKKQAEFVALTQGNRTVEEYETEFTRLSLFTPALVAVEADKVERFIPGLKEEIQGSVAAHEPQDYATA
        + ER +      +TW QF+  F+ K + A+ R  K+ EF+ L QG+ TVE+Y+ EF  LS F P ++A EA + ++F+ GL+ +IQG V A  P  +A A
Subjt:  SIERTIDVSNGPVTWTQFRGEFFRKCYPAASRFKKQAEFVALTQGNRTVEEYETEFTRLSLFTPALVAVEADKVERFIPGLKEEIQGSVAAHEPQDYATA

Query:  VRVAELIDRRPSVVSSE-----PSSGYKR-------------------------------------------------RCV-------------------
        +R+A  +  +    SS+      +SG KR                                                 RC+                   
Subjt:  VRVAELIDRRPSVVSSE-----PSSGYKR-------------------------------------------------RCV-------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------------PEDLTGLPPAGVVDFCIELEPETNPISKAPYRMAPAEL
                                                                      PE+L GLPP   V+F IELEP T PIS+APYRMAPAEL
Subjt:  --------------------------------------------------------------PEDLTGLPPAGVVDFCIELEPETNPISKAPYRMAPAEL

Query:  KELKVQIQELLDKGYIRPSVSPWGAPVL-------------------------------IDDLFDQLQGAAMFSKIDLRSGYHQLRIREDDISKTAFRTS
        KELKVQ+QELLDKG+IRPSVSPWGAPVL                               IDDLFDQLQGA +FSKIDLRSGYHQLRI+++D+ KTAFR+ 
Subjt:  KELKVQIQELLDKGYIRPSVSPWGAPVL-------------------------------IDDLFDQLQGAAMFSKIDLRSGYHQLRIREDDISKTAFRTS

Query:  F---------HGLDEQ----------------------------------------------------------------------GHVVLKQGVTVDPA
        +          GL                                                                         GHVV K GV+VDPA
Subjt:  F---------HGLDEQ----------------------------------------------------------------------GHVVLKQGVTVDPA

Query:  KIEEVTNWSRPTSVSEIRSFLGLAGYYHRFVQGFSSIVAPLTQLTRKGTSFLWNEACEASFQELKQRLVSAPVLTVPDGSGEFVIYSNASKKGLGCVLMH
        KIE VT W+RP++VSE+RSFLGLAGYY RFV+ FS I  PLTQLTRKG  F+W++ACE SFQ LKQ+LV+APVLTVPDGSG FVIYS+ASKKGLGCVLM 
Subjt:  KIEEVTNWSRPTSVSEIRSFLGLAGYYHRFVQGFSSIVAPLTQLTRKGTSFLWNEACEASFQELKQRLVSAPVLTVPDGSGEFVIYSNASKKGLGCVLMH

Query:  HDYAARRLEEANSVLQQNWEQNCHITTRAYRSQRRDAATLASRRCVISLTETRVLKAASRRCLDSVSTLRRKKQNIDLDRDIKLVNLHENQLYSGEEPNT
                       QQ                                                                 K+V               
Subjt:  HDYAARRLEEANSVLQQNWEQNCHITTRAYRSQRRDAATLASRRCVISLTETRVLKAASRRCLDSVSTLRRKKQNIDLDRDIKLVNLHENQLYSGEEPNT

Query:  YVSHQLKDYEKNYLTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYLFTQKELNMRQRRWLELVKDYGCDIQYHLGKANVVADALSRKTAHVSALI
        Y S QLK +EKNY THDLELAAVVFALKIWRHYLYGE IQIFTDHKSLKY FTQKELNMRQRRWLELVKDY C+I YH GKANVVADALSRK +H +ALI
Subjt:  YVSHQLKDYEKNYLTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYLFTQKELNMRQRRWLELVKDYGCDIQYHLGKANVVADALSRKTAHVSALI

Query:  TTQAHILDDLERAEIMIAVGKVTARLAQLTVKSSLRQRIIDEQPSDPDLVVKRNQVEAG
        T QA +  DLERAEI ++VG VT +LAQLTV+ +LRQRIID Q +DP LV KR   EAG
Subjt:  TTQAHILDDLERAEIMIAVGKVTARLAQLTVKSSLRQRIIDEQPSDPDLVVKRNQVEAG

A0A5A7V5X5 Pol protein1.3e-16338.45Show/hide
Query:  MPPR---ARGRGKGRGRGRGRGRTTPKQVVPPVGQRENLPEN--PQVEQPGEQSGPAAQVP----QNNEAA---------------------LSWETRCI
        MPPR    RG G+G GRG GRG+     V P V     + +     +EQ  +    AA  P    Q N+AA                     LS E + +
Subjt:  MPPR---ARGRGKGRGRGRGRGRTTPKQVVPPVGQRENLPEN--PQVEQPGEQSGPAAQVP----QNNEAA---------------------LSWETRCI

Query:  RDFRKWDLCSFDGASRDPTVAELWLSSIETVFRHMNCLEDQKVYCVVFLLRDNALIWWQSIERTIDVSNGPVTWTQFRGEFFRKCYPAASRFKKQAEFVA
        RDFRK++  +FDG+  +PT A++WL+SIET+FR+M C EDQKV C VF L D    WW++ ER +      +TW QF+  F+ K + A  +  K  EF+ 
Subjt:  RDFRKWDLCSFDGASRDPTVAELWLSSIETVFRHMNCLEDQKVYCVVFLLRDNALIWWQSIERTIDVSNGPVTWTQFRGEFFRKCYPAASRFKKQAEFVA

Query:  LTQGNRTVEEYETEFTRLSLFTPALVAVEADKVERFIPGLKEEIQGSVAAHEPQDYATAVRV--------------------------------------
        L QG+ TVE+Y+ EF  LS FTP +V  EA + E+F+ GL+ ++QG V A  P  +A A+R+                                      
Subjt:  LTQGNRTVEEYETEFTRLSLFTPALVAVEADKVERFIPGLKEEIQGSVAAHEPQDYATAVRV--------------------------------------

Query:  --------------------------AELIDRRP----------------------------------------------------SVVSS---------
                                  A++  R+P                                                    S +SS         
Subjt:  --------------------------AELIDRRP----------------------------------------------------SVVSS---------

Query:  -------------------------------------------EPSSG--YKRR-----CV---------------------------------------
                                                    P SG  +K R     C+                                       
Subjt:  -------------------------------------------EPSSG--YKRR-----CV---------------------------------------

Query:  --------PEDLTGLPPAGVVDFCIELEPETNPISKAPYRMAPAELKELKVQIQELLDKGYIRPSVSPWGAPVL-------IDDLFDQLQGAAMFSKIDL
                P++L GLPP   VDF IELE  T PIS+APYRMAPAELKELKVQ+QELLDKG+IRPSVSPWGAPVL       IDDLFDQLQGA +FSKIDL
Subjt:  --------PEDLTGLPPAGVVDFCIELEPETNPISKAPYRMAPAELKELKVQIQELLDKGYIRPSVSPWGAPVL-------IDDLFDQLQGAAMFSKIDL

Query:  RSGYHQLRIREDDISKTAFRTSFHGLDE------------------------------------------------------------------------
        RSGYHQLRIR+ DI KTAFR S +G DE                                                                        
Subjt:  RSGYHQLRIREDDISKTAFRTSFHGLDE------------------------------------------------------------------------

Query:  --------QGHVVLKQGVTVDPAKIEEVTNWSRPTSVSEIRSFLGLAGYYHRFVQGFSSIVAPLTQLTRKGTSFLWNEACEASFQELKQRLVSAPVLTVP
                 GHVV  +GV+VDPAKIE VTNW RP++VSEIRSFLGLAGYY RFV+ FS I +PLTQLTRKG  F+W+ ACE SFQELKQ+LV+APVLTVP
Subjt:  --------QGHVVLKQGVTVDPAKIEEVTNWSRPTSVSEIRSFLGLAGYYHRFVQGFSSIVAPLTQLTRKGTSFLWNEACEASFQELKQRLVSAPVLTVP

Query:  DGSGEFVIYSNASKKGLGCVLMHHDYAARRLEEANSVLQQNWEQNCHITTRAYRSQRRDAATLASRRCVISLTETRVLKAASRRCLDSVSTLRRKKQNID
        DGSG FVIYS+ASKKGLGCVLM                QQ                                                            
Subjt:  DGSGEFVIYSNASKKGLGCVLMHHDYAARRLEEANSVLQQNWEQNCHITTRAYRSQRRDAATLASRRCVISLTETRVLKAASRRCLDSVSTLRRKKQNID

Query:  LDRDIKLVNLHENQLYSGEEPNTYVSHQLKDYEKNYLTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYLFTQKELNMRQRRWLELVKDYGCDIQY
             K+V               Y S QLK +E+NY THDLELAAVVFALKIWRHYLYGEKIQI+TDHKSLKY FTQKELNMRQRRWLELVKDY C+I Y
Subjt:  LDRDIKLVNLHENQLYSGEEPNTYVSHQLKDYEKNYLTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYLFTQKELNMRQRRWLELVKDYGCDIQY

Query:  HLGKANVVADALSRKTAHVSALITTQAHILDDLERAEIMIAVGKVTARLAQLTVKSSLRQRIIDEQPSDPDLVVKRNQVEAG
        H GKAN+VADALSRK AH +ALIT Q  +L D ERAEI ++VG+VTA+LAQL+V+ +LRQ+II  Q +DP L  KR  VE G
Subjt:  HLGKANVVADALSRKTAHVSALITTQAHILDDLERAEIMIAVGKVTARLAQLTVKSSLRQRIIDEQPSDPDLVVKRNQVEAG

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein7.2e-2322.25Show/hide
Query:  ETNPISKAPYRMAPAELKELKVQIQELLDKGYIRPSVSPWGAPVLIDDLFDQLQGAAMFSKIDLRSGYHQLRIREDDISKTAFRTS--------------
        E+  I+  P    P +   L++ +       Y++P++ P     LI+ L  ++QG+ +F+K+DL+S YH +R+R+ D  K AFR                
Subjt:  ETNPISKAPYRMAPAELKELKVQIQELLDKGYIRPSVSPWGAPVLIDDLFDQLQGAAMFSKIDLRSGYHQLRIREDDISKTAFRTS--------------

Query:  ------------------------------FHGLDEQGHV-----------------------------------VLKQGVTVDPAKIEEVTNWSRPTSV
                                       H   E  HV                                   + ++G T     I++V  W +P + 
Subjt:  ------------------------------FHGLDEQGHV-----------------------------------VLKQGVTVDPAKIEEVTNWSRPTSV

Query:  SEIRSFLGLAGYYHRFVQGFSSIVAPLTQLTRKGTSFLWNEACEASFQELKQRLVSAPVLTVPDGSGEFVIYSNASKKGLGCVLMHHDYAARRLEEANSV
         E+R FLG   Y  +F+   S +  PL  L +K   + W      + + +KQ LVS PVL   D S + ++ ++AS   +G VL                
Subjt:  SEIRSFLGLAGYYHRFVQGFSSIVAPLTQLTRKGTSFLWNEACEASFQELKQRLVSAPVLTVPDGSGEFVIYSNASKKGLGCVLMHHDYAARRLEEANSV

Query:  LQQNWEQNCHITTRAYRSQRRDAATLASRRCVISLTETRVLKAASRRCLDSVSTLRRKKQNIDLDRDIKLVNLHENQLYSGEEPNTYVSHQLKDYEKNYL
                         SQ+ D                                                    +++ Y    P  Y S ++   + NY 
Subjt:  LQQNWEQNCHITTRAYRSQRRDAATLASRRCVISLTETRVLKAASRRCLDSVSTLRRKKQNIDLDRDIKLVNLHENQLYSGEEPNTYVSHQLKDYEKNYL

Query:  THDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYLFTQKE--LNMRQRRWLELVKDYGCDIQYHLGKANVVADALSR
          D E+ A++ +LK WRHYL    E  +I TDH++L    T +    N R  RW   ++D+  +I Y  G AN +ADALSR
Subjt:  THDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYLFTQKE--LNMRQRRWLELVKDYGCDIQYHLGKANVVADALSR

P0CT41 Transposon Tf2-12 polyprotein7.2e-2322.25Show/hide
Query:  ETNPISKAPYRMAPAELKELKVQIQELLDKGYIRPSVSPWGAPVLIDDLFDQLQGAAMFSKIDLRSGYHQLRIREDDISKTAFRTS--------------
        E+  I+  P    P +   L++ +       Y++P++ P     LI+ L  ++QG+ +F+K+DL+S YH +R+R+ D  K AFR                
Subjt:  ETNPISKAPYRMAPAELKELKVQIQELLDKGYIRPSVSPWGAPVLIDDLFDQLQGAAMFSKIDLRSGYHQLRIREDDISKTAFRTS--------------

Query:  ------------------------------FHGLDEQGHV-----------------------------------VLKQGVTVDPAKIEEVTNWSRPTSV
                                       H   E  HV                                   + ++G T     I++V  W +P + 
Subjt:  ------------------------------FHGLDEQGHV-----------------------------------VLKQGVTVDPAKIEEVTNWSRPTSV

Query:  SEIRSFLGLAGYYHRFVQGFSSIVAPLTQLTRKGTSFLWNEACEASFQELKQRLVSAPVLTVPDGSGEFVIYSNASKKGLGCVLMHHDYAARRLEEANSV
         E+R FLG   Y  +F+   S +  PL  L +K   + W      + + +KQ LVS PVL   D S + ++ ++AS   +G VL                
Subjt:  SEIRSFLGLAGYYHRFVQGFSSIVAPLTQLTRKGTSFLWNEACEASFQELKQRLVSAPVLTVPDGSGEFVIYSNASKKGLGCVLMHHDYAARRLEEANSV

Query:  LQQNWEQNCHITTRAYRSQRRDAATLASRRCVISLTETRVLKAASRRCLDSVSTLRRKKQNIDLDRDIKLVNLHENQLYSGEEPNTYVSHQLKDYEKNYL
                         SQ+ D                                                    +++ Y    P  Y S ++   + NY 
Subjt:  LQQNWEQNCHITTRAYRSQRRDAATLASRRCVISLTETRVLKAASRRCLDSVSTLRRKKQNIDLDRDIKLVNLHENQLYSGEEPNTYVSHQLKDYEKNYL

Query:  THDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYLFTQKE--LNMRQRRWLELVKDYGCDIQYHLGKANVVADALSR
          D E+ A++ +LK WRHYL    E  +I TDH++L    T +    N R  RW   ++D+  +I Y  G AN +ADALSR
Subjt:  THDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYLFTQKE--LNMRQRRWLELVKDYGCDIQYHLGKANVVADALSR

P10394 Retrovirus-related Pol polyprotein from transposon 4121.0e-2421.28Show/hide
Query:  ETNPISKAPYRMAPAELKELKVQIQELLDKGYIRPSVSPWGAPVL-------------------------------------IDDLFDQLQGAAMFSKID
        +  P+    YR   ++++E++ Q+Q+L+    + PSVS + +P+L                                     IDD+ DQL  A  FS +D
Subjt:  ETNPISKAPYRMAPAELKELKVQIQELLDKGYIRPSVSPWGAPVL-------------------------------------IDDLFDQLQGAAMFSKID

Query:  LRSGYHQLRIREDDISKTAFRTS---------------------------FHGLDEQ-------------------------------------------
        L SG+HQ+ + E     T+F TS                           F G++                                             
Subjt:  LRSGYHQLRIREDDISKTAFRTS---------------------------FHGLDEQ-------------------------------------------

Query:  ---------GHVVLKQGVTVDPAKIEEVTNWSRPTSVSEIRSFLGLAGYYHRFVQGFSSIVAPLTQLTRKGTSFLWNEACEASFQELKQRLVSAPVLTVP
                 GH    +G+  D  K + + N+  P      R F+    YY RF++ F+     +T+L +K   F W + C+ +F  LK +L++  +L  P
Subjt:  ---------GHVVLKQGVTVDPAKIEEVTNWSRPTSVSEIRSFLGLAGYYHRFVQGFSSIVAPLTQLTRKGTSFLWNEACEASFQELKQRLVSAPVLTVP

Query:  DGSGEFVIYSNASKKGLGCVLMHHDYAARRLEEANSVLQQNWEQNCHITTRAYRSQRRDAATLASRRCVISLTETRVLKAASRRCLDSVSTLRRKKQNID
        D S EF I ++ASK+  G VL                                                                               
Subjt:  DGSGEFVIYSNASKKGLGCVLMHHDYAARRLEEANSVLQQNWEQNCHITTRAYRSQRRDAATLASRRCVISLTETRVLKAASRRCLDSVSTLRRKKQNID

Query:  LDRDIKLVNLHENQLYSGEE-PNTYVSHQLKDYEKNYLTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYLFTQKELNMRQRRWLELVKDYGCDIQ
                     Q ++G + P  Y S      E N  T + ELAA+ +A+  +R Y+YG+   + TDH+ L YLF+    + +  R    +++Y   ++
Subjt:  LDRDIKLVNLHENQLYSGEE-PNTYVSHQLKDYEKNYLTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYLFTQKELNMRQRRWLELVKDYGCDIQ

Query:  YHLGKANVVADALSRKT
        Y  GK N VADALSR T
Subjt:  YHLGKANVVADALSRKT

P20825 Retrovirus-related Pol polyprotein from transposon 2975.2e-2923.78Show/hide
Query:  NPISKAPYRMAPAELKELKVQIQELLDKGYIRPSVSPWGAPVLI------------------------------------DDLFDQLQGAAMFSKIDLRS
        +PI    Y +A     E++ Q+QE+L++G IR S SP+ +P  +                                    D++  +L     F+ IDL  
Subjt:  NPISKAPYRMAPAELKELKVQIQELLDKGYIRPSVSPWGAPVLI------------------------------------DDLFDQLQGAAMFSKIDLRS

Query:  GYHQLRIREDDISKTAFRT-SFH---------------------------------------------GLDEQ---------------------------
        G+HQ+ + E+ ISKTAF T S H                                              L E                            
Subjt:  GYHQLRIREDDISKTAFRT-SFH---------------------------------------------GLDEQ---------------------------

Query:  ------GHVVLKQGVTVDPAKIEEVTNWSRPTSVSEIRSFLGLAGYYHRFVQGFSSIVAPLTQLTRKGTSFLWN--EACEASFQELKQRLVSAPVLTVPD
              GH+V   G+  +P K++ + ++  PT   EIR+FLGL GYY +F+  ++ I  P+T   +K T       E  EA F++LK  ++  P+L +PD
Subjt:  ------GHVVLKQGVTVDPAKIEEVTNWSRPTSVSEIRSFLGLAGYYHRFVQGFSSIVAPLTQLTRKGTSFLWN--EACEASFQELKQRLVSAPVLTVPD

Query:  GSGEFVIYSNASKKGLGCVLMHHDYAARRLEEANSVLQQNWEQNCHITTRAYRSQRRDAATLASRRCVISLTETRVLKAASRRCLDSVSTLRRKKQNIDL
           +FV+ ++AS   LG VL                      QN H                                                      
Subjt:  GSGEFVIYSNASKKGLGCVLMHHDYAARRLEEANSVLQQNWEQNCHITTRAYRSQRRDAATLASRRCVISLTETRVLKAASRRCLDSVSTLRRKKQNIDL

Query:  DRDIKLVNLHENQLYSGEEPNTYVSHQLKDYEKNYLTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYLFTQKELNMRQRRWLELVKDYGCDIQYH
                           P +++S  L D+E NY   + EL A+V+A K +RHYL G +  I +DH+ L++L   KE   +  RW   + +Y   I Y 
Subjt:  DRDIKLVNLHENQLYSGEEPNTYVSHQLKDYEKNYLTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYLFTQKELNMRQRRWLELVKDYGCDIQYH

Query:  LGKANVVADALSR
         GK N VADALSR
Subjt:  LGKANVVADALSR

Q9UR07 Transposon Tf2-11 polyprotein7.2e-2322.25Show/hide
Query:  ETNPISKAPYRMAPAELKELKVQIQELLDKGYIRPSVSPWGAPVLIDDLFDQLQGAAMFSKIDLRSGYHQLRIREDDISKTAFRTS--------------
        E+  I+  P    P +   L++ +       Y++P++ P     LI+ L  ++QG+ +F+K+DL+S YH +R+R+ D  K AFR                
Subjt:  ETNPISKAPYRMAPAELKELKVQIQELLDKGYIRPSVSPWGAPVLIDDLFDQLQGAAMFSKIDLRSGYHQLRIREDDISKTAFRTS--------------

Query:  ------------------------------FHGLDEQGHV-----------------------------------VLKQGVTVDPAKIEEVTNWSRPTSV
                                       H   E  HV                                   + ++G T     I++V  W +P + 
Subjt:  ------------------------------FHGLDEQGHV-----------------------------------VLKQGVTVDPAKIEEVTNWSRPTSV

Query:  SEIRSFLGLAGYYHRFVQGFSSIVAPLTQLTRKGTSFLWNEACEASFQELKQRLVSAPVLTVPDGSGEFVIYSNASKKGLGCVLMHHDYAARRLEEANSV
         E+R FLG   Y  +F+   S +  PL  L +K   + W      + + +KQ LVS PVL   D S + ++ ++AS   +G VL                
Subjt:  SEIRSFLGLAGYYHRFVQGFSSIVAPLTQLTRKGTSFLWNEACEASFQELKQRLVSAPVLTVPDGSGEFVIYSNASKKGLGCVLMHHDYAARRLEEANSV

Query:  LQQNWEQNCHITTRAYRSQRRDAATLASRRCVISLTETRVLKAASRRCLDSVSTLRRKKQNIDLDRDIKLVNLHENQLYSGEEPNTYVSHQLKDYEKNYL
                         SQ+ D                                                    +++ Y    P  Y S ++   + NY 
Subjt:  LQQNWEQNCHITTRAYRSQRRDAATLASRRCVISLTETRVLKAASRRCLDSVSTLRRKKQNIDLDRDIKLVNLHENQLYSGEEPNTYVSHQLKDYEKNYL

Query:  THDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYLFTQKE--LNMRQRRWLELVKDYGCDIQYHLGKANVVADALSR
          D E+ A++ +LK WRHYL    E  +I TDH++L    T +    N R  RW   ++D+  +I Y  G AN +ADALSR
Subjt:  THDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYLFTQKE--LNMRQRRWLELVKDYGCDIQYHLGKANVVADALSR

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein1.4e-1843.3Show/hide
Query:  HVVLKQGVTVDPAKIEEVTNWSRPTSVSEIRSFLGLAGYYHRFVQGFSSIVAPLTQLTRKGTSFLWNEACEASFQELKQRLVSAPVLTVPDGSGEFV
        H++  +GV+ DPAK+E +  W  P + +E+R FLGL GYY RFV+ +  IV PLT+L +K  S  W E    +F+ LK  + + PVL +PD    FV
Subjt:  HVVLKQGVTVDPAKIEEVTNWSRPTSVSEIRSFLGLAGYYHRFVQGFSSIVAPLTQLTRKGTSFLWNEACEASFQELKQRLVSAPVLTVPDGSGEFV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACAATATGCATGATGTTGGTGTGGATAGTGAGGATTCTGCAGATGCTCCTCTTGTACAAGAAGCCACTAAGACAAGAAAGCGTGGACCTACTAGGATGCATGAAAT
CACTCGTGTTAGTAGTGAAGGAAAAATCTTCATCTCTCGAAGATCCAACCGAATGATGTATTTTATGGAAAATGGCACGTATGAACAAGGAAAGATTGTTGATGAACGAA
CCCAAGAAGTGGTGGATCGTATTGATGAGATTTTGGCTACACAAGCTCATAAAGGTGATATTTCATCCCCCAACAAGGATGTGTTATCACTAGCATTAGGTACCCAAGAT
CGTCCTGGGAAAGTTCAAGGGGTAGGAAAGTTTATCACCCAAACCCAATACTTTCATAAACCAAAACAAACCTCAAAACCGAAGGTTGGAGAGCATGAAGCAAATGCTAA
ACAATTTGCTATTATGACAGCACACGTTAAATTGGAAGATGTTGTAGAAGAAGAGGTTGAAGGAATAGAGAAAAAGCAAAAGGTGAAGCCATCGACTAAAGGTGACTCAT
GTAGTAAGATGGTAACTACCACAACTAAGGATGGAACTCCTTGTCGGTTGGCATTTGGATCTATTGATAACATTGTCGCATGTGGGACAATATTTGATTCAAGTATAAAA
GCTCTCTCTCCCCCTCAGCCTTCAGCTCGCCGCCGCACGCCGCCTTCAGCTCACCGCCGCCGACCATTGCGTCTCCGCCTCTGTCGTTCGCCACGCCGCCGCCGCTCGCC
GACGGTCCGCCACCGCTTGTCACCTGCCGCCCGTCGCCTGCTGCTTGTCTCTCGCCGCCGTGAACCGCCGCCATCTCGCCGCCGTGAACCACCGCCATTCGCCGTCTCCC
GAAGCACCGCTGCAGGCATGAGCAGCCATTGCCACTCTCGATTTCGTCCTTCATTTCAGATCTGTTCGACGCCTTCAGCTACTTCCAATTTCAAGGTGCCCGTAGCGTTT
GGAGGTCGAGTTGTCGAAGTTGAGCGATTTTGGACTACTAGACAGCAAGCTGTGCTTCTGTTTTGGGTTCTAGTCATGAAAAGTAGGGTCGTTACAGTTGGTATCAGAGG
TCTAGGTTATAGGTTCTGTAGACTTGTTTACATCGTAGACCTTAGGTATGTCCATGGCTTATCAACAAGCATACGGTCACGCCAGATCATGCCTCCTCGAGCACGAGGAC
GTGGAAAAGGTCGAGGAAGAGGTCGTGGTAGGGGTCGCACAACCCCTAAACAAGTTGTGCCACCTGTGGGGCAAAGAGAAAATCTGCCAGAAAACCCACAAGTGGAACAG
CCTGGTGAACAGTCAGGGCCTGCGGCCCAAGTGCCTCAGAACAATGAAGCCGCCCTATCATGGGAGACTAGGTGCATAAGAGATTTTAGGAAGTGGGACCTCTGTTCGTT
CGATGGAGCCTCAAGAGACCCCACGGTAGCAGAGTTGTGGCTGTCCTCCATCGAGACTGTCTTTCGCCACATGAACTGCCTGGAGGACCAGAAAGTGTACTGCGTTGTGT
TCTTGTTACGAGATAACGCCTTGATTTGGTGGCAATCGATCGAAAGGACTATAGACGTCAGCAACGGACCTGTCACATGGACTCAGTTCAGGGGGGAGTTCTTTAGGAAA
TGTTACCCTGCAGCCTCTCGATTCAAGAAGCAAGCAGAGTTCGTGGCTCTTACGCAGGGAAACCGAACCGTAGAGGAGTACGAGACAGAGTTCACCAGACTGTCTCTATT
TACTCCTGCCCTGGTAGCTGTAGAGGCTGATAAGGTAGAACGTTTCATCCCAGGCTTAAAGGAAGAGATTCAGGGCAGTGTGGCAGCCCATGAACCCCAAGACTATGCCA
CAGCAGTCAGGGTGGCAGAGCTGATTGATCGACGACCGTCAGTTGTCTCTTCAGAACCTTCCTCGGGTTATAAGCGGAGATGTGTTCCCGAGGATCTGACAGGACTACCG
CCTGCAGGAGTGGTTGACTTCTGCATTGAACTTGAGCCAGAGACTAATCCCATTTCTAAAGCTCCATATAGGATGGCTCCAGCGGAATTGAAAGAGCTTAAGGTGCAGAT
TCAGGAGCTATTGGACAAAGGTTATATACGCCCCAGCGTCTCGCCGTGGGGTGCTCCAGTTTTGATTGATGACCTCTTTGATCAGCTTCAAGGTGCGGCAATGTTTTCTA
AAATAGATCTACGGTCAGGTTATCATCAACTGAGGATTAGAGAGGACGATATATCTAAGACAGCTTTCCGAACCAGTTTTCATGGACTTGATGAACAGGGGCACGTGGTG
TTGAAACAAGGTGTCACAGTAGATCCAGCCAAAATAGAGGAAGTGACCAATTGGTCTAGACCCACTTCAGTCAGCGAGATTCGGAGTTTCTTGGGGTTAGCAGGATATTA
CCATCGTTTTGTGCAAGGCTTTTCTAGTATTGTCGCACCCCTAACCCAGTTGACTCGAAAGGGTACTTCGTTTTTGTGGAACGAAGCTTGTGAGGCAAGCTTCCAAGAGC
TTAAGCAAAGACTAGTTTCAGCACCAGTACTTACAGTACCTGATGGCTCAGGAGAATTTGTCATTTACAGCAATGCTTCAAAGAAGGGACTGGGTTGTGTGTTAATGCAT
CATGATTATGCTGCTAGGCGACTTGAGGAAGCAAATTCTGTGCTGCAGCAAAACTGGGAGCAGAACTGCCACATCACAACTCGCGCGTATAGGTCACAGCGTCGAGACGC
TGCGACTTTAGCGTCTCGACGCTGTGTTATTTCACTTACTGAAACACGTGTTTTGAAGGCAGCGTCGAGACGTTGTCTTGACAGCGTCTCGACGCTACGACGAAAAAAAC
AGAATATAGACTTAGATAGGGATATCAAGTTGGTTAATTTGCATGAGAATCAATTATACTCGGGAGAGGAACCCAACACTTATGTGTCTCATCAGCTGAAAGATTATGAG
AAGAACTACCTGACCCATGACTTGGAATTAGCAGCAGTGGTCTTTGCTTTGAAGATTTGGAGGCACTATCTGTACGGTGAGAAGATCCAAATCTTCACGGACCACAAAAG
CTTGAAATATCTCTTCACCCAGAAGGAGTTAAATATGAGGCAGCGGAGATGGTTAGAGTTAGTGAAAGATTATGGTTGTGACATCCAGTACCACCTGGGCAAGGCAAATG
TGGTTGCAGATGCCCTCAGTAGGAAGACTGCTCATGTGTCTGCTTTGATTACTACTCAGGCTCATATATTGGATGACTTGGAACGTGCAGAGATTATGATTGCAGTGGGC
AAAGTTACAGCACGCCTAGCTCAGTTGACAGTTAAGTCGAGCTTGAGACAGAGGATCATAGATGAACAACCAAGTGACCCTGACCTTGTTGTTAAGCGCAATCAAGTAGA
GGCGGGGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGACAATATGCATGATGTTGGTGTGGATAGTGAGGATTCTGCAGATGCTCCTCTTGTACAAGAAGCCACTAAGACAAGAAAGCGTGGACCTACTAGGATGCATGAAAT
CACTCGTGTTAGTAGTGAAGGAAAAATCTTCATCTCTCGAAGATCCAACCGAATGATGTATTTTATGGAAAATGGCACGTATGAACAAGGAAAGATTGTTGATGAACGAA
CCCAAGAAGTGGTGGATCGTATTGATGAGATTTTGGCTACACAAGCTCATAAAGGTGATATTTCATCCCCCAACAAGGATGTGTTATCACTAGCATTAGGTACCCAAGAT
CGTCCTGGGAAAGTTCAAGGGGTAGGAAAGTTTATCACCCAAACCCAATACTTTCATAAACCAAAACAAACCTCAAAACCGAAGGTTGGAGAGCATGAAGCAAATGCTAA
ACAATTTGCTATTATGACAGCACACGTTAAATTGGAAGATGTTGTAGAAGAAGAGGTTGAAGGAATAGAGAAAAAGCAAAAGGTGAAGCCATCGACTAAAGGTGACTCAT
GTAGTAAGATGGTAACTACCACAACTAAGGATGGAACTCCTTGTCGGTTGGCATTTGGATCTATTGATAACATTGTCGCATGTGGGACAATATTTGATTCAAGTATAAAA
GCTCTCTCTCCCCCTCAGCCTTCAGCTCGCCGCCGCACGCCGCCTTCAGCTCACCGCCGCCGACCATTGCGTCTCCGCCTCTGTCGTTCGCCACGCCGCCGCCGCTCGCC
GACGGTCCGCCACCGCTTGTCACCTGCCGCCCGTCGCCTGCTGCTTGTCTCTCGCCGCCGTGAACCGCCGCCATCTCGCCGCCGTGAACCACCGCCATTCGCCGTCTCCC
GAAGCACCGCTGCAGGCATGAGCAGCCATTGCCACTCTCGATTTCGTCCTTCATTTCAGATCTGTTCGACGCCTTCAGCTACTTCCAATTTCAAGGTGCCCGTAGCGTTT
GGAGGTCGAGTTGTCGAAGTTGAGCGATTTTGGACTACTAGACAGCAAGCTGTGCTTCTGTTTTGGGTTCTAGTCATGAAAAGTAGGGTCGTTACAGTTGGTATCAGAGG
TCTAGGTTATAGGTTCTGTAGACTTGTTTACATCGTAGACCTTAGGTATGTCCATGGCTTATCAACAAGCATACGGTCACGCCAGATCATGCCTCCTCGAGCACGAGGAC
GTGGAAAAGGTCGAGGAAGAGGTCGTGGTAGGGGTCGCACAACCCCTAAACAAGTTGTGCCACCTGTGGGGCAAAGAGAAAATCTGCCAGAAAACCCACAAGTGGAACAG
CCTGGTGAACAGTCAGGGCCTGCGGCCCAAGTGCCTCAGAACAATGAAGCCGCCCTATCATGGGAGACTAGGTGCATAAGAGATTTTAGGAAGTGGGACCTCTGTTCGTT
CGATGGAGCCTCAAGAGACCCCACGGTAGCAGAGTTGTGGCTGTCCTCCATCGAGACTGTCTTTCGCCACATGAACTGCCTGGAGGACCAGAAAGTGTACTGCGTTGTGT
TCTTGTTACGAGATAACGCCTTGATTTGGTGGCAATCGATCGAAAGGACTATAGACGTCAGCAACGGACCTGTCACATGGACTCAGTTCAGGGGGGAGTTCTTTAGGAAA
TGTTACCCTGCAGCCTCTCGATTCAAGAAGCAAGCAGAGTTCGTGGCTCTTACGCAGGGAAACCGAACCGTAGAGGAGTACGAGACAGAGTTCACCAGACTGTCTCTATT
TACTCCTGCCCTGGTAGCTGTAGAGGCTGATAAGGTAGAACGTTTCATCCCAGGCTTAAAGGAAGAGATTCAGGGCAGTGTGGCAGCCCATGAACCCCAAGACTATGCCA
CAGCAGTCAGGGTGGCAGAGCTGATTGATCGACGACCGTCAGTTGTCTCTTCAGAACCTTCCTCGGGTTATAAGCGGAGATGTGTTCCCGAGGATCTGACAGGACTACCG
CCTGCAGGAGTGGTTGACTTCTGCATTGAACTTGAGCCAGAGACTAATCCCATTTCTAAAGCTCCATATAGGATGGCTCCAGCGGAATTGAAAGAGCTTAAGGTGCAGAT
TCAGGAGCTATTGGACAAAGGTTATATACGCCCCAGCGTCTCGCCGTGGGGTGCTCCAGTTTTGATTGATGACCTCTTTGATCAGCTTCAAGGTGCGGCAATGTTTTCTA
AAATAGATCTACGGTCAGGTTATCATCAACTGAGGATTAGAGAGGACGATATATCTAAGACAGCTTTCCGAACCAGTTTTCATGGACTTGATGAACAGGGGCACGTGGTG
TTGAAACAAGGTGTCACAGTAGATCCAGCCAAAATAGAGGAAGTGACCAATTGGTCTAGACCCACTTCAGTCAGCGAGATTCGGAGTTTCTTGGGGTTAGCAGGATATTA
CCATCGTTTTGTGCAAGGCTTTTCTAGTATTGTCGCACCCCTAACCCAGTTGACTCGAAAGGGTACTTCGTTTTTGTGGAACGAAGCTTGTGAGGCAAGCTTCCAAGAGC
TTAAGCAAAGACTAGTTTCAGCACCAGTACTTACAGTACCTGATGGCTCAGGAGAATTTGTCATTTACAGCAATGCTTCAAAGAAGGGACTGGGTTGTGTGTTAATGCAT
CATGATTATGCTGCTAGGCGACTTGAGGAAGCAAATTCTGTGCTGCAGCAAAACTGGGAGCAGAACTGCCACATCACAACTCGCGCGTATAGGTCACAGCGTCGAGACGC
TGCGACTTTAGCGTCTCGACGCTGTGTTATTTCACTTACTGAAACACGTGTTTTGAAGGCAGCGTCGAGACGTTGTCTTGACAGCGTCTCGACGCTACGACGAAAAAAAC
AGAATATAGACTTAGATAGGGATATCAAGTTGGTTAATTTGCATGAGAATCAATTATACTCGGGAGAGGAACCCAACACTTATGTGTCTCATCAGCTGAAAGATTATGAG
AAGAACTACCTGACCCATGACTTGGAATTAGCAGCAGTGGTCTTTGCTTTGAAGATTTGGAGGCACTATCTGTACGGTGAGAAGATCCAAATCTTCACGGACCACAAAAG
CTTGAAATATCTCTTCACCCAGAAGGAGTTAAATATGAGGCAGCGGAGATGGTTAGAGTTAGTGAAAGATTATGGTTGTGACATCCAGTACCACCTGGGCAAGGCAAATG
TGGTTGCAGATGCCCTCAGTAGGAAGACTGCTCATGTGTCTGCTTTGATTACTACTCAGGCTCATATATTGGATGACTTGGAACGTGCAGAGATTATGATTGCAGTGGGC
AAAGTTACAGCACGCCTAGCTCAGTTGACAGTTAAGTCGAGCTTGAGACAGAGGATCATAGATGAACAACCAAGTGACCCTGACCTTGTTGTTAAGCGCAATCAAGTAGA
GGCGGGGTAG
Protein sequenceShow/hide protein sequence
MDNMHDVGVDSEDSADAPLVQEATKTRKRGPTRMHEITRVSSEGKIFISRRSNRMMYFMENGTYEQGKIVDERTQEVVDRIDEILATQAHKGDISSPNKDVLSLALGTQD
RPGKVQGVGKFITQTQYFHKPKQTSKPKVGEHEANAKQFAIMTAHVKLEDVVEEEVEGIEKKQKVKPSTKGDSCSKMVTTTTKDGTPCRLAFGSIDNIVACGTIFDSSIK
ALSPPQPSARRRTPPSAHRRRPLRLRLCRSPRRRRSPTVRHRLSPAARRLLLVSRRREPPPSRRREPPPFAVSRSTAAGMSSHCHSRFRPSFQICSTPSATSNFKVPVAF
GGRVVEVERFWTTRQQAVLLFWVLVMKSRVVTVGIRGLGYRFCRLVYIVDLRYVHGLSTSIRSRQIMPPRARGRGKGRGRGRGRGRTTPKQVVPPVGQRENLPENPQVEQ
PGEQSGPAAQVPQNNEAALSWETRCIRDFRKWDLCSFDGASRDPTVAELWLSSIETVFRHMNCLEDQKVYCVVFLLRDNALIWWQSIERTIDVSNGPVTWTQFRGEFFRK
CYPAASRFKKQAEFVALTQGNRTVEEYETEFTRLSLFTPALVAVEADKVERFIPGLKEEIQGSVAAHEPQDYATAVRVAELIDRRPSVVSSEPSSGYKRRCVPEDLTGLP
PAGVVDFCIELEPETNPISKAPYRMAPAELKELKVQIQELLDKGYIRPSVSPWGAPVLIDDLFDQLQGAAMFSKIDLRSGYHQLRIREDDISKTAFRTSFHGLDEQGHVV
LKQGVTVDPAKIEEVTNWSRPTSVSEIRSFLGLAGYYHRFVQGFSSIVAPLTQLTRKGTSFLWNEACEASFQELKQRLVSAPVLTVPDGSGEFVIYSNASKKGLGCVLMH
HDYAARRLEEANSVLQQNWEQNCHITTRAYRSQRRDAATLASRRCVISLTETRVLKAASRRCLDSVSTLRRKKQNIDLDRDIKLVNLHENQLYSGEEPNTYVSHQLKDYE
KNYLTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYLFTQKELNMRQRRWLELVKDYGCDIQYHLGKANVVADALSRKTAHVSALITTQAHILDDLERAEIMIAVG
KVTARLAQLTVKSSLRQRIIDEQPSDPDLVVKRNQVEAG