; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0036297 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0036297
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr3:43622211..43628050
RNA-Seq ExpressionLag0036297
SyntenyLag0036297
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN81099.1 hypothetical protein VITISV_017741 [Vitis vinifera]2.2e-14337.11Show/hide
Query:  FSPSTTTYSTPLQLIVTDLWGPSYKLSTHGFRYYNSFVDAFSRYTWIYFLQSKSEAFQAFVKFKTLVEKQFGKPLISLQTDEGAEFKPFIPFLHNHGINH
        FS S TTY+ PL+LI  DLWGP+  LS  G+RYY  FVDAFSR++WI+ L++KSEA + FV FKT VE QF   + SLQTD G EF+ F  +L  +GI H
Subjt:  FSPSTTTYSTPLQLIVTDLWGPSYKLSTHGFRYYNSFVDAFSRYTWIYFLQSKSEAFQAFVKFKTLVEKQFGKPLISLQTDEGAEFKPFIPFLHNHGINH

Query:  RVSCPYTSQQNGIIERKHRHIVDVGLTLLSHSSMPLTFWDDAFSTSVFLINRLPSMVLGGLSPLEKLFQKQPDYFSLKVFDCNCFPCLRPYNSHKLSFRS
        RVSCP+T QQNG+ ERKHR IV+ GLTLL  +S+PL FWD++F T V+L NRLP+ +L    P+E LF+  PDY  LKVF C+CFP LRPYN+HKL +RS
Subjt:  RVSCPYTSQQNGIIERKHRHIVDVGLTLLSHSSMPLTFWDDAFSTSVFLINRLPSMVLGGLSPLEKLFQKQPDYFSLKVFDCNCFPCLRPYNSHKLSFRS

Query:  SPCTFIGYSHIHKGYKCLSTDGRVYISRHVLFDEHVFPFASVQNKSFASPAFVIQNLPIISKPCTSSCESVGNISLPVLNNSPSSPDTDL-----PHNDT
          CTF+GYS  HKGYKC+S++GRVYIS  V+F+E  FP++     S    + V  +   +S   +    S   +  P    S + P +++      H   
Subjt:  SPCTFIGYSHIHKGYKCLSTDGRVYISRHVLFDEHVFPFASVQNKSFASPAFVIQNLPIISKPCTSSCESVGNISLPVLNNSPSSPDTDL-----PHNDT

Query:  NGSKDSSSSCPNNSISQPITLPVTNINNQSSQLPVSSVL-----STHPMVTRSKRGIFKPKAHLPTFVDIEPPNVKETLKCSQWKNAMQEEYDALIKNET
          S D++ + P   +S P+  PV ++ +  +   V+  +     +THPM+TR+K GI KPK  +      EP +V   L+  +WK AM  EYDAL +N T
Subjt:  NGSKDSSSSCPNNSISQPITLPVTNINNQSSQLPVSSVL-----STHPMVTRSKRGIFKPKAHLPTFVDIEPPNVKETLKCSQWKNAMQEEYDALIKNET

Query:  WDLVPTP------------------------------------------------------------------------------LN-------------
        W LVP P                                                                              LN             
Subjt:  WDLVPTP------------------------------------------------------------------------------LN-------------

Query:  ---------------------QKAPCAWFERLSLLLNSLGFINSKADPSLLFRCSGTYCCYILVYVDDINITGNSLADISSLITELDGKFSLKDLGSLYY
                             ++AP AWFE+L   L S GF+++K+D SL  R +  +  Y+LVYVDDI + G+  A I+SLI +L+ +FSLKDLG ++Y
Subjt:  ---------------------QKAPCAWFERLSLLLNSLGFINSKADPSLLFRCSGTYCCYILVYVDDINITGNSLADISSLITELDGKFSLKDLGSLYY

Query:  FLGIKVSYPLSGGLFLSQSKYIMDVLHRTKMVGANLIATPMVSGPLLSAHQGELFHDVHLYRSVVVAPRAQDLESEFGTSRPRHSPATWPRHHTRLKPLL
        FLGI+VS+  + GL LSQ+KYI D+L +TKMV      TP+ +G  L    G+   D+H YRS V A               ++   T P     +  + 
Subjt:  FLGIKVSYPLSGGLFLSQSKYIMDVLHRTKMVGANLIATPMVSGPLLSAHQGELFHDVHLYRSVVVAPRAQDLESEFGTSRPRHSPATWPRHHTRLKPLL

Query:  RLKTVPTNQH-----------GGTYNYGSMLHKSSGFSLHGFADVDWASDPDDRKSTSGFCVFFGSNLIMWGSKKQSIISRSNTEAEYRCLALAATEL--
        +    PT +H            GT  +G  L KSS   L GF D DWASD DDR+STSG CVF G NLI W SKKQ I+SRS+ E EYR LA    E+  
Subjt:  RLKTVPTNQH-----------GGTYNYGSMLHKSSGFSLHGFADVDWASDPDDRKSTSGFCVFFGSNLIMWGSKKQSIISRSNTEAEYRCLALAATEL--

Query:  ----------------------------------------------------------VQHLPASDQIVDVLTKPLSVVSFLKLRSKLNVREPSSIGLRG
                                                                  V+H+P++DQ+ DVLTK +S   F++ R KL +   S++ LRG
Subjt:  ----------------------------------------------------------VQHLPASDQIVDVLTKPLSVVSFLKLRSKLNVREPSSIGLRG

KYP61341.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]3.2e-14237.03Show/hide
Query:  STTTYSTPLQLIVTDLWGPSYKLSTHGFRYYNSFVDAFSRYTWIYFLQSKSEAFQAFVKFKTLVEKQFGKPLISLQTDEGAEFKPFIPFLHNHGINHRVS
        S T Y+TP  L+ +DLWGP+   ST G++YY +FVDA +R+TWIY L+SK++ F  F +F  +V+ Q+  P+ +LQTD G E++ F  +L+  GI HR+ 
Subjt:  STTTYSTPLQLIVTDLWGPSYKLSTHGFRYYNSFVDAFSRYTWIYFLQSKSEAFQAFVKFKTLVEKQFGKPLISLQTDEGAEFKPFIPFLHNHGINHRVS

Query:  CPYTSQQNGIIERKHRHIVDVGLTLLSHSSMPLTFWDDAFSTSVFLINRLPSMVLGGLSPLEKLFQKQPDYFSLKVFDCNCFPCLRPYNSHKLSFRSSPC
        CP+T  QNG+ ERKHRHIV++GLTL++ + +P+ FWD +F T+V+LINRLPS  +    P  KLF + PDY SL++F C+CFP LRPYN HKL FRS  C
Subjt:  CPYTSQQNGIIERKHRHIVDVGLTLLSHSSMPLTFWDDAFSTSVFLINRLPSMVLGGLSPLEKLFQKQPDYFSLKVFDCNCFPCLRPYNSHKLSFRSSPC

Query:  TFIGYSHIHKGYKCLSTDGRVYISRHVLFDEHVFPFASVQNKSFASPAFVIQNLPIISKPCTSSCESVGNISLPVLNNSPSSPDTDLPHNDTNGSKDSSS
         F+GYS  HKGYKCL+ DGR+YIS+ V+F+E  FP+  + + S AS + +  ++P+   P  +   +    S      SPS        N    +  S+ 
Subjt:  TFIGYSHIHKGYKCLSTDGRVYISRHVLFDEHVFPFASVQNKSFASPAFVIQNLPIISKPCTSSCESVGNISLPVLNNSPSSPDTDLPHNDTNGSKDSSS

Query:  SCPNNSISQPITLPVTNINNQSSQLPVSSVLSTHPMVTRSKRGIFKPKAHLPTFV--DIEPPNVKETLKCSQWKNAMQEEYDALIKNETWDLVPTP----
        S P +S  +P + P+ + +N             HPM TR+K GI KP+   PT +   +EP   K+ L    W  AMQ EY+AL+ N TW LVP P    
Subjt:  SCPNNSISQPITLPVTNINNQSSQLPVSSVLSTHPMVTRSKRGIFKPKAHLPTFV--DIEPPNVKETLKCSQWKNAMQEEYDALIKNETWDLVPTP----

Query:  --------------------------------------------------------------------------LN------------------------
                                                                                  LN                        
Subjt:  --------------------------------------------------------------------------LN------------------------

Query:  ---------QKAPCAWFERLSLLLNSLGFINSKADPSLLFRCSGTYCCYILVYVDDINITGNSLADISSLITELDGKFSLKDLGSLYYFLGIKVSYPLSG
                 ++AP AWF++L   L  L F  SK DPSL     G    YILVYVDDI ITGN+ + + +L+++L   FSLKDLG L +FLGI+V     G
Subjt:  ---------QKAPCAWFERLSLLLNSLGFINSKADPSLLFRCSGTYCCYILVYVDDINITGNSLADISSLITELDGKFSLKDLGSLYYFLGIKVSYPLSG

Query:  GLFLSQSKYIMDVLHRTKMVGANLIATPMVSGPLLSAHQGELFHDVHLYRSVVVAPRAQDL---ESEFGTSRPRHSPATWPRHH-TRLKPLLRLKTVPTN
         L L+QSKYI D+L+RT M G+  I++PM+SG  LS    E F D  LYRSVV A +   +   E  F  ++     +    HH   +K +LR       
Subjt:  GLFLSQSKYIMDVLHRTKMVGANLIATPMVSGPLLSAHQGELFHDVHLYRSVVVAPRAQDL---ESEFGTSRPRHSPATWPRHH-TRLKPLLRLKTVPTN

Query:  QHGGTYNYGSMLH---KSSGFSLHGFADVDWASDPDDRKSTSGFCVFFGSNLIMWGSKKQSIISRSNTEAEYRCLALAATEL------------------
           GT ++G  L     SS  S+H + D DWASDPDDR+STSG  +F G NL+ W SKKQ++++RS+TEAEYR LALA TE+                  
Subjt:  QHGGTYNYGSMLH---KSSGFSLHGFADVDWASDPDDRKSTSGFCVFFGSNLIMWGSKKQSIISRSNTEAEYRCLALAATEL------------------

Query:  -----------------------------------------VQHLPASDQIVDVLTKPLSVVSFLKLRSKLNVRE
                                                 V H+PA DQ  DVLTK LS   F +LRSKL V E
Subjt:  -----------------------------------------VQHLPASDQIVDVLTKPLSVVSFLKLRSKLNVRE

RVW44519.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]9.6e-14738.2Show/hide
Query:  SSSSGSVCVRPVQARQPSCHALFSPSTTTYSTPLQLIVTDLWGPSYKLSTHGFRYYNSFVDAFSRYTWIYFLQSKSEAFQAFVKFKTLVEKQFGKPLISL
        S+ SGS      Q  + S +  F  S T Y+ PLQL+V+DLWGP+   S++GF YY SFVDA+SRYTW+YFL++KS+  +AF+ FK   E QFG  L + 
Subjt:  SSSSGSVCVRPVQARQPSCHALFSPSTTTYSTPLQLIVTDLWGPSYKLSTHGFRYYNSFVDAFSRYTWIYFLQSKSEAFQAFVKFKTLVEKQFGKPLISL

Query:  QTDEGAEFKPFIPFLHNHGINHRVSCPYTSQQNGIIERKHRHIVDVGLTLLSHSSMPLTFWDDAFSTSVFLINRLPSMVLGGLSPLEKLFQKQPDYFSLK
        QTD G EF+    +   +GI HR+SCP+TS+QNGIIERKHRHIV++GLTLL+ +S+PL +W DAFST+VFLINRLP+ VL    P E LF  +P+Y  LK
Subjt:  QTDEGAEFKPFIPFLHNHGINHRVSCPYTSQQNGIIERKHRHIVDVGLTLLSHSSMPLTFWDDAFSTSVFLINRLPSMVLGGLSPLEKLFQKQPDYFSLK

Query:  VFDCNCFPCLRPYNSHKLSFRSSPCTFIGYSHIHKGYKCLSTDGRVYISRHVLFDEHVFPFAS--------VQNKSFASPAF-VIQNL------PIISKP
        VF C CFP LRPYN HKL FRSSPCTF+GYS  HKGYKCL+  GR++ISR V+FDE  FPFA         V + +   P   +++NL      P +S P
Subjt:  VFDCNCFPCLRPYNSHKLSFRSSPCTFIGYSHIHKGYKCLSTDGRVYISRHVLFDEHVFPFAS--------VQNKSFASPAF-VIQNL------PIISKP

Query:  CTSSCESVGNISLPVLNNSPSSPDTDLPHNDTNGS---KDSSSSCPNNS--ISQPITLPVTNINNQSSQ----LPVSSVLSTHPMVTRSKRGIFKPKAHL
         TSS +S   +    L +   S   DL + D++ +    + S+S P++S   + P T+P++  +++ ++     PV+     H MVTRSK GIFKPK + 
Subjt:  CTSSCESVGNISLPVLNNSPSSPDTDLPHNDTNGS---KDSSSSCPNNS--ISQPITLPVTNINNQSSQ----LPVSSVLSTHPMVTRSKRGIFKPKAHL

Query:  PTFVDIEPPNVKETLKCSQWKNAMQEEYDALIKNETWDLVPTPLN-------------------------------------------------------
              EP   +E +   +WK AM EE+ AL+KN+TW LV  P N                                                       
Subjt:  PTFVDIEPPNVKETLKCSQWKNAMQEEYDALIKNETWDLVPTPLN-------------------------------------------------------

Query:  -----------------------------------------------------------QKAPCAWFERLSLLLNSLGFINSKADPSLLFRCSGTYCCYI
                                                                   ++AP AWF++L + L   GF ++K+D SL  R +     ++
Subjt:  -----------------------------------------------------------QKAPCAWFERLSLLLNSLGFINSKADPSLLFRCSGTYCCYI

Query:  LVYVDDINITGNSLADISSLITELDGKFSLKDLGSLYYFLGIKVSYPLSGGLFLSQSKYIMDVLHRTKMVGANLIATPMVSGPLLSAHQGELFHDVHLYR
        LVYVDDI +TG+S  +I  LI+ L G FSLKDLG L YFLGI+V     GGL LSQ KYI D+L +TKM GA  + TPM+SG  LSA  G+   +V  YR
Subjt:  LVYVDDINITGNSLADISSLITELDGKFSLKDLGSLYYFLGIKVSYPLSGGLFLSQSKYIMDVLHRTKMVGANLIATPMVSGPLLSAHQGELFHDVHLYR

Query:  SVVVAPRAQDL---ESEFGTSRP-RHSPATWPRHHTRLKPLLRLKTVPTNQHGGTYNYGSMLHKSSGFSLHGFADVDWASDPDDRKSTSGFCVFFGSNLI
        SVV A +   +   E  F  ++  +        H   +K +LR          GT + G +L  S   +L GF D DW SD DDR+STSG CVF G +L+
Subjt:  SVVVAPRAQDL---ESEFGTSRP-RHSPATWPRHHTRLKPLLRLKTVPTNQHGGTYNYGSMLHKSSGFSLHGFADVDWASDPDDRKSTSGFCVFFGSNLI

Query:  MWGSKKQSIISRSNTEAEYRCLALAATEL------------------------------------------------------------VQHLPASDQIV
         W SKKQ   SRS+TEAEYR LA   +E+                                                            V H+P  DQ+ 
Subjt:  MWGSKKQSIISRSNTEAEYRCLALAATEL------------------------------------------------------------VQHLPASDQIV

Query:  DVLTKPLSVVSFLKLRSKLNVREPSSIGLRGG
        DV TKPLS   F KLR KL V   +S+ L+ G
Subjt:  DVLTKPLSVVSFLKLRSKLNVREPSSIGLRGG

RVX03712.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]8.4e-14340.23Show/hide
Query:  FSPSTTTYSTPLQLIVTDLWGPSYKLSTHGFRYYNSFVDAFSRYTWIYFLQSKSEAFQAFVKFKTLVEKQFGKPLISLQTDEGAEFKPFIPFLHNHGINH
        F  ST++Y+ PL+LI TDLWGP+   S+HG +YY  F+DA+SR+TWIY L+ KSEAFQ F+ FK+ VE Q G  + ++Q+D G E++ F  +L ++GI H
Subjt:  FSPSTTTYSTPLQLIVTDLWGPSYKLSTHGFRYYNSFVDAFSRYTWIYFLQSKSEAFQAFVKFKTLVEKQFGKPLISLQTDEGAEFKPFIPFLHNHGINH

Query:  RVSCPYTSQQNGIIERKHRHIVDVGLTLLSHSSMPLTFWDDAFSTSVFLINRLPSMVLGGLSPLEKLFQKQPDYFSLKVFDCNCFPCLRPYNSHKLSFRS
        R+SCPYT +QNG+ ERKHRHIV+ G+ LL+ +S+P  +WD+AF TSV LINRLP+ VL   SPLE LF ++P Y  LKVF C C+P LRP+N HKL FRS
Subjt:  RVSCPYTSQQNGIIERKHRHIVDVGLTLLSHSSMPLTFWDDAFSTSVFLINRLPSMVLGGLSPLEKLFQKQPDYFSLKVFDCNCFPCLRPYNSHKLSFRS

Query:  SPCTFIGYSHIHKGYKCLSTDGRVYISRHVLFDEHVFPFASVQNKSFASPAFVIQNLPIISKPCTSSCESVGNISLPVLNNSPSSPDTDLPHNDTNGSKD
         PCTF+GYS   KGYKCLS +G + ISR V+FDEH FPFA +Q++                K  TSS  S  + SLP      S P   LP         
Subjt:  SPCTFIGYSHIHKGYKCLSTDGRVYISRHVLFDEHVFPFASVQNKSFASPAFVIQNLPIISKPCTSSCESVGNISLPVLNNSPSSPDTDLPHNDTNGSKD

Query:  SSSSCPNNSISQPITLPVTNINNQSSQLPVSSV--LSTHPMVTRSKRGIFKPKAHLPTFVDIEPPNVKETLKCSQWKNAMQEEYDALIKNETWDLVPTPL
        SS+ C  +S + P   P T+ +N +SQ P SS     TH M+TRSK GIFKPKA+L   +   P +V E L+   WK AM +EY AL++N TWDLVP P 
Subjt:  SSSSCPNNSISQPITLPVTNINNQSSQLPVSSV--LSTHPMVTRSKRGIFKPKAHLPTFVDIEPPNVKETLKCSQWKNAMQEEYDALIKNETWDLVPTPL

Query:  N---------------------------------------------------------------------------------------------------
        +                                                                                                   
Subjt:  N---------------------------------------------------------------------------------------------------

Query:  -------------QKAPCAWFERLSLLLNSLGFINSKADPSLLFRCSGTYCCYILVYVDDINITGNSLADISSLITELDGKFSLKDLGSLYYFLGIKVSY
                     ++AP AWFE+L   L  LGF ++K+D SL    + T+  YILVYVDDI ITGN+   +  +IT+L+ +F+LKDLG + YF+GI+V +
Subjt:  -------------QKAPCAWFERLSLLLNSLGFINSKADPSLLFRCSGTYCCYILVYVDDINITGNSLADISSLITELDGKFSLKDLGSLYYFLGIKVSY

Query:  PLSGGLFLSQSKYIMDVLHRTKMVGANLIATPMVSGPLLSAHQGELFHDVHLYRSVVVAPRAQDL---ESEFGTSRP-RHSPATWPRHHTRLKPLLRLKT
          S G+ LSQ+KYI ++L +TKM+    + TPMVS   LS      F +  LYRS V A +   +   +  +  +R  +   A    H   +K +LR   
Subjt:  PLSGGLFLSQSKYIMDVLHRTKMVGANLIATPMVSGPLLSAHQGELFHDVHLYRSVVVAPRAQDL---ESEFGTSRP-RHSPATWPRHHTRLKPLLRLKT

Query:  VPTNQHGGTYNYGSMLHKSSGFSLHGFADVDWASDPDDRKSTSGFCVFFGSNLIMWGSKKQSIISRSNTEAEYRCLALAATEL
        V T  HG    + S  H     ++ GF D DWASD DDR STSG+C+F G NL+ W S+KQ  + +S+TEAEYR +A    E+
Subjt:  VPTNQHGGTYNYGSMLHKSSGFSLHGFADVDWASDPDDRKSTSGFCVFFGSNLIMWGSKKQSIISRSNTEAEYRCLALAATEL

RVX14937.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]6.2e-14637.96Show/hide
Query:  FSPSTTTYSTPLQLIVTDLWGPSYKLSTHGFRYYNSFVDAFSRYTWIYFLQSKSEAFQAFVKFKTLVEKQFGKPLISLQTDEGAEFKPFIPFLHNHGINH
        FS S TTY+ PL+LI +DLWGP+  LS  G+RYY  FVDAFSR++WI+ L++KSEA + FV FKT VE QF   + SLQTD G EF+ F  +L  +GI H
Subjt:  FSPSTTTYSTPLQLIVTDLWGPSYKLSTHGFRYYNSFVDAFSRYTWIYFLQSKSEAFQAFVKFKTLVEKQFGKPLISLQTDEGAEFKPFIPFLHNHGINH

Query:  RVSCPYTSQQNGIIERKHRHIVDVGLTLLSHSSMPLTFWDDAFSTSVFLINRLPSMVLGGLSPLEKLFQKQPDYFSLKVFDCNCFPCLRPYNSHKLSFRS
        RVSCP+T QQNG+ ERKHR IV+ GLTLL   S+PL FWD++F T V+L NRLP+ VL    P+E LF+  PDY  LKVF C+CFP LRPYN+HKL +RS
Subjt:  RVSCPYTSQQNGIIERKHRHIVDVGLTLLSHSSMPLTFWDDAFSTSVFLINRLPSMVLGGLSPLEKLFQKQPDYFSLKVFDCNCFPCLRPYNSHKLSFRS

Query:  SPCTFIGYSHIHKGYKCLSTDGRVYISRHVLFDEHVFPFASVQNKSFASPAFVIQNLPIISKPCTSSCESVGNISLPVLNNSPSSPDTDL-----PHNDT
          CTF+GYS  HKGYKC+S++GRVYISR V+F+E  FP++     S   P+ V  +   +S   +    S   +  P    S + P +++      H   
Subjt:  SPCTFIGYSHIHKGYKCLSTDGRVYISRHVLFDEHVFPFASVQNKSFASPAFVIQNLPIISKPCTSSCESVGNISLPVLNNSPSSPDTDL-----PHNDT

Query:  NGSKDSSSSCPNNSISQPITLPVTNINNQSSQLPVSSVL-----STHPMVTRSKRGIFKPKAHLPTFVDIEPPNVKETLKCSQWKNAMQEEYDALIKNET
          S D++ + P   +S P+  PV ++ +  +   V+  +     +THPM+TR+K GI KPK  +      EP +V   L+  +WK AM  EYDAL +N T
Subjt:  NGSKDSSSSCPNNSISQPITLPVTNINNQSSQLPVSSVL-----STHPMVTRSKRGIFKPKAHLPTFVDIEPPNVKETLKCSQWKNAMQEEYDALIKNET

Query:  WDLVPTP------------------------------------------------------------------------------LN-------------
        W LVP P                                                                              LN             
Subjt:  WDLVPTP------------------------------------------------------------------------------LN-------------

Query:  ---------------------QKAPCAWFERLSLLLNSLGFINSKADPSLLFRCSGTYCCYILVYVDDINITGNSLADISSLITELDGKFSLKDLGSLYY
                             ++AP AWFE+L   L S GF+++K+D SL  R + ++  Y+LVYVDDI + G+    I+SLI +L+ +FSLKDLG ++Y
Subjt:  ---------------------QKAPCAWFERLSLLLNSLGFINSKADPSLLFRCSGTYCCYILVYVDDINITGNSLADISSLITELDGKFSLKDLGSLYY

Query:  FLGIKVSYPLSGGLFLSQSKYIMDVLHRTKMVGANLIATPMVSGPLLSAHQGELFHDVHLYRSVVVAPRAQDL---ESEFGTSRP-RHSPATWPRHHTRL
        FLGI+VS+  + GL LSQ+KYI D+L +TKMV      TP+ +G  L A  G+   D+H YRS V A +   +   E  F  ++  +        H   +
Subjt:  FLGIKVSYPLSGGLFLSQSKYIMDVLHRTKMVGANLIATPMVSGPLLSAHQGELFHDVHLYRSVVVAPRAQDL---ESEFGTSRP-RHSPATWPRHHTRL

Query:  KPLLRLKTVPTNQHGGTYNYGSMLHKSSGFSLHGFADVDWASDPDDRKSTSGFCVFFGSNLIMWGSKKQSIISRSNTEAEYRCLALAATEL---------
        K +LR          GT  +G  L KSS   L GF D DWASD DDR+STSG CVF G NLI W SKKQ  +SRS+TEAEYR LA    E+         
Subjt:  KPLLRLKTVPTNQHGGTYNYGSMLHKSSGFSLHGFADVDWASDPDDRKSTSGFCVFFGSNLIMWGSKKQSIISRSNTEAEYRCLALAATEL---------

Query:  ---------------------------------------------------VQHLPASDQIVDVLTKPLSVVSFLKLRSKLNVREPSSIGLRG
                                                           V+H+P++DQ+ DVLTK +S   F++ R KL +   S++ LRG
Subjt:  ---------------------------------------------------VQHLPASDQIVDVLTKPLSVVSFLKLRSKLNVREPSSIGLRG

TrEMBL top hitse value%identityAlignment
A0A151RUP0 Retrovirus-related Pol polyprotein from transposon TNT 1-941.6e-14237.03Show/hide
Query:  STTTYSTPLQLIVTDLWGPSYKLSTHGFRYYNSFVDAFSRYTWIYFLQSKSEAFQAFVKFKTLVEKQFGKPLISLQTDEGAEFKPFIPFLHNHGINHRVS
        S T Y+TP  L+ +DLWGP+   ST G++YY +FVDA +R+TWIY L+SK++ F  F +F  +V+ Q+  P+ +LQTD G E++ F  +L+  GI HR+ 
Subjt:  STTTYSTPLQLIVTDLWGPSYKLSTHGFRYYNSFVDAFSRYTWIYFLQSKSEAFQAFVKFKTLVEKQFGKPLISLQTDEGAEFKPFIPFLHNHGINHRVS

Query:  CPYTSQQNGIIERKHRHIVDVGLTLLSHSSMPLTFWDDAFSTSVFLINRLPSMVLGGLSPLEKLFQKQPDYFSLKVFDCNCFPCLRPYNSHKLSFRSSPC
        CP+T  QNG+ ERKHRHIV++GLTL++ + +P+ FWD +F T+V+LINRLPS  +    P  KLF + PDY SL++F C+CFP LRPYN HKL FRS  C
Subjt:  CPYTSQQNGIIERKHRHIVDVGLTLLSHSSMPLTFWDDAFSTSVFLINRLPSMVLGGLSPLEKLFQKQPDYFSLKVFDCNCFPCLRPYNSHKLSFRSSPC

Query:  TFIGYSHIHKGYKCLSTDGRVYISRHVLFDEHVFPFASVQNKSFASPAFVIQNLPIISKPCTSSCESVGNISLPVLNNSPSSPDTDLPHNDTNGSKDSSS
         F+GYS  HKGYKCL+ DGR+YIS+ V+F+E  FP+  + + S AS + +  ++P+   P  +   +    S      SPS        N    +  S+ 
Subjt:  TFIGYSHIHKGYKCLSTDGRVYISRHVLFDEHVFPFASVQNKSFASPAFVIQNLPIISKPCTSSCESVGNISLPVLNNSPSSPDTDLPHNDTNGSKDSSS

Query:  SCPNNSISQPITLPVTNINNQSSQLPVSSVLSTHPMVTRSKRGIFKPKAHLPTFV--DIEPPNVKETLKCSQWKNAMQEEYDALIKNETWDLVPTP----
        S P +S  +P + P+ + +N             HPM TR+K GI KP+   PT +   +EP   K+ L    W  AMQ EY+AL+ N TW LVP P    
Subjt:  SCPNNSISQPITLPVTNINNQSSQLPVSSVLSTHPMVTRSKRGIFKPKAHLPTFV--DIEPPNVKETLKCSQWKNAMQEEYDALIKNETWDLVPTP----

Query:  --------------------------------------------------------------------------LN------------------------
                                                                                  LN                        
Subjt:  --------------------------------------------------------------------------LN------------------------

Query:  ---------QKAPCAWFERLSLLLNSLGFINSKADPSLLFRCSGTYCCYILVYVDDINITGNSLADISSLITELDGKFSLKDLGSLYYFLGIKVSYPLSG
                 ++AP AWF++L   L  L F  SK DPSL     G    YILVYVDDI ITGN+ + + +L+++L   FSLKDLG L +FLGI+V     G
Subjt:  ---------QKAPCAWFERLSLLLNSLGFINSKADPSLLFRCSGTYCCYILVYVDDINITGNSLADISSLITELDGKFSLKDLGSLYYFLGIKVSYPLSG

Query:  GLFLSQSKYIMDVLHRTKMVGANLIATPMVSGPLLSAHQGELFHDVHLYRSVVVAPRAQDL---ESEFGTSRPRHSPATWPRHH-TRLKPLLRLKTVPTN
         L L+QSKYI D+L+RT M G+  I++PM+SG  LS    E F D  LYRSVV A +   +   E  F  ++     +    HH   +K +LR       
Subjt:  GLFLSQSKYIMDVLHRTKMVGANLIATPMVSGPLLSAHQGELFHDVHLYRSVVVAPRAQDL---ESEFGTSRPRHSPATWPRHH-TRLKPLLRLKTVPTN

Query:  QHGGTYNYGSMLH---KSSGFSLHGFADVDWASDPDDRKSTSGFCVFFGSNLIMWGSKKQSIISRSNTEAEYRCLALAATEL------------------
           GT ++G  L     SS  S+H + D DWASDPDDR+STSG  +F G NL+ W SKKQ++++RS+TEAEYR LALA TE+                  
Subjt:  QHGGTYNYGSMLH---KSSGFSLHGFADVDWASDPDDRKSTSGFCVFFGSNLIMWGSKKQSIISRSNTEAEYRCLALAATEL------------------

Query:  -----------------------------------------VQHLPASDQIVDVLTKPLSVVSFLKLRSKLNVRE
                                                 V H+PA DQ  DVLTK LS   F +LRSKL V E
Subjt:  -----------------------------------------VQHLPASDQIVDVLTKPLSVVSFLKLRSKLNVRE

A0A438EA49 Retrovirus-related Pol polyprotein from transposon TNT 1-944.7e-14738.2Show/hide
Query:  SSSSGSVCVRPVQARQPSCHALFSPSTTTYSTPLQLIVTDLWGPSYKLSTHGFRYYNSFVDAFSRYTWIYFLQSKSEAFQAFVKFKTLVEKQFGKPLISL
        S+ SGS      Q  + S +  F  S T Y+ PLQL+V+DLWGP+   S++GF YY SFVDA+SRYTW+YFL++KS+  +AF+ FK   E QFG  L + 
Subjt:  SSSSGSVCVRPVQARQPSCHALFSPSTTTYSTPLQLIVTDLWGPSYKLSTHGFRYYNSFVDAFSRYTWIYFLQSKSEAFQAFVKFKTLVEKQFGKPLISL

Query:  QTDEGAEFKPFIPFLHNHGINHRVSCPYTSQQNGIIERKHRHIVDVGLTLLSHSSMPLTFWDDAFSTSVFLINRLPSMVLGGLSPLEKLFQKQPDYFSLK
        QTD G EF+    +   +GI HR+SCP+TS+QNGIIERKHRHIV++GLTLL+ +S+PL +W DAFST+VFLINRLP+ VL    P E LF  +P+Y  LK
Subjt:  QTDEGAEFKPFIPFLHNHGINHRVSCPYTSQQNGIIERKHRHIVDVGLTLLSHSSMPLTFWDDAFSTSVFLINRLPSMVLGGLSPLEKLFQKQPDYFSLK

Query:  VFDCNCFPCLRPYNSHKLSFRSSPCTFIGYSHIHKGYKCLSTDGRVYISRHVLFDEHVFPFAS--------VQNKSFASPAF-VIQNL------PIISKP
        VF C CFP LRPYN HKL FRSSPCTF+GYS  HKGYKCL+  GR++ISR V+FDE  FPFA         V + +   P   +++NL      P +S P
Subjt:  VFDCNCFPCLRPYNSHKLSFRSSPCTFIGYSHIHKGYKCLSTDGRVYISRHVLFDEHVFPFAS--------VQNKSFASPAF-VIQNL------PIISKP

Query:  CTSSCESVGNISLPVLNNSPSSPDTDLPHNDTNGS---KDSSSSCPNNS--ISQPITLPVTNINNQSSQ----LPVSSVLSTHPMVTRSKRGIFKPKAHL
         TSS +S   +    L +   S   DL + D++ +    + S+S P++S   + P T+P++  +++ ++     PV+     H MVTRSK GIFKPK + 
Subjt:  CTSSCESVGNISLPVLNNSPSSPDTDLPHNDTNGS---KDSSSSCPNNS--ISQPITLPVTNINNQSSQ----LPVSSVLSTHPMVTRSKRGIFKPKAHL

Query:  PTFVDIEPPNVKETLKCSQWKNAMQEEYDALIKNETWDLVPTPLN-------------------------------------------------------
              EP   +E +   +WK AM EE+ AL+KN+TW LV  P N                                                       
Subjt:  PTFVDIEPPNVKETLKCSQWKNAMQEEYDALIKNETWDLVPTPLN-------------------------------------------------------

Query:  -----------------------------------------------------------QKAPCAWFERLSLLLNSLGFINSKADPSLLFRCSGTYCCYI
                                                                   ++AP AWF++L + L   GF ++K+D SL  R +     ++
Subjt:  -----------------------------------------------------------QKAPCAWFERLSLLLNSLGFINSKADPSLLFRCSGTYCCYI

Query:  LVYVDDINITGNSLADISSLITELDGKFSLKDLGSLYYFLGIKVSYPLSGGLFLSQSKYIMDVLHRTKMVGANLIATPMVSGPLLSAHQGELFHDVHLYR
        LVYVDDI +TG+S  +I  LI+ L G FSLKDLG L YFLGI+V     GGL LSQ KYI D+L +TKM GA  + TPM+SG  LSA  G+   +V  YR
Subjt:  LVYVDDINITGNSLADISSLITELDGKFSLKDLGSLYYFLGIKVSYPLSGGLFLSQSKYIMDVLHRTKMVGANLIATPMVSGPLLSAHQGELFHDVHLYR

Query:  SVVVAPRAQDL---ESEFGTSRP-RHSPATWPRHHTRLKPLLRLKTVPTNQHGGTYNYGSMLHKSSGFSLHGFADVDWASDPDDRKSTSGFCVFFGSNLI
        SVV A +   +   E  F  ++  +        H   +K +LR          GT + G +L  S   +L GF D DW SD DDR+STSG CVF G +L+
Subjt:  SVVVAPRAQDL---ESEFGTSRP-RHSPATWPRHHTRLKPLLRLKTVPTNQHGGTYNYGSMLHKSSGFSLHGFADVDWASDPDDRKSTSGFCVFFGSNLI

Query:  MWGSKKQSIISRSNTEAEYRCLALAATEL------------------------------------------------------------VQHLPASDQIV
         W SKKQ   SRS+TEAEYR LA   +E+                                                            V H+P  DQ+ 
Subjt:  MWGSKKQSIISRSNTEAEYRCLALAATEL------------------------------------------------------------VQHLPASDQIV

Query:  DVLTKPLSVVSFLKLRSKLNVREPSSIGLRGG
        DV TKPLS   F KLR KL V   +S+ L+ G
Subjt:  DVLTKPLSVVSFLKLRSKLNVREPSSIGLRGG

A0A438J431 Retrovirus-related Pol polyprotein from transposon TNT 1-944.1e-14340.23Show/hide
Query:  FSPSTTTYSTPLQLIVTDLWGPSYKLSTHGFRYYNSFVDAFSRYTWIYFLQSKSEAFQAFVKFKTLVEKQFGKPLISLQTDEGAEFKPFIPFLHNHGINH
        F  ST++Y+ PL+LI TDLWGP+   S+HG +YY  F+DA+SR+TWIY L+ KSEAFQ F+ FK+ VE Q G  + ++Q+D G E++ F  +L ++GI H
Subjt:  FSPSTTTYSTPLQLIVTDLWGPSYKLSTHGFRYYNSFVDAFSRYTWIYFLQSKSEAFQAFVKFKTLVEKQFGKPLISLQTDEGAEFKPFIPFLHNHGINH

Query:  RVSCPYTSQQNGIIERKHRHIVDVGLTLLSHSSMPLTFWDDAFSTSVFLINRLPSMVLGGLSPLEKLFQKQPDYFSLKVFDCNCFPCLRPYNSHKLSFRS
        R+SCPYT +QNG+ ERKHRHIV+ G+ LL+ +S+P  +WD+AF TSV LINRLP+ VL   SPLE LF ++P Y  LKVF C C+P LRP+N HKL FRS
Subjt:  RVSCPYTSQQNGIIERKHRHIVDVGLTLLSHSSMPLTFWDDAFSTSVFLINRLPSMVLGGLSPLEKLFQKQPDYFSLKVFDCNCFPCLRPYNSHKLSFRS

Query:  SPCTFIGYSHIHKGYKCLSTDGRVYISRHVLFDEHVFPFASVQNKSFASPAFVIQNLPIISKPCTSSCESVGNISLPVLNNSPSSPDTDLPHNDTNGSKD
         PCTF+GYS   KGYKCLS +G + ISR V+FDEH FPFA +Q++                K  TSS  S  + SLP      S P   LP         
Subjt:  SPCTFIGYSHIHKGYKCLSTDGRVYISRHVLFDEHVFPFASVQNKSFASPAFVIQNLPIISKPCTSSCESVGNISLPVLNNSPSSPDTDLPHNDTNGSKD

Query:  SSSSCPNNSISQPITLPVTNINNQSSQLPVSSV--LSTHPMVTRSKRGIFKPKAHLPTFVDIEPPNVKETLKCSQWKNAMQEEYDALIKNETWDLVPTPL
        SS+ C  +S + P   P T+ +N +SQ P SS     TH M+TRSK GIFKPKA+L   +   P +V E L+   WK AM +EY AL++N TWDLVP P 
Subjt:  SSSSCPNNSISQPITLPVTNINNQSSQLPVSSV--LSTHPMVTRSKRGIFKPKAHLPTFVDIEPPNVKETLKCSQWKNAMQEEYDALIKNETWDLVPTPL

Query:  N---------------------------------------------------------------------------------------------------
        +                                                                                                   
Subjt:  N---------------------------------------------------------------------------------------------------

Query:  -------------QKAPCAWFERLSLLLNSLGFINSKADPSLLFRCSGTYCCYILVYVDDINITGNSLADISSLITELDGKFSLKDLGSLYYFLGIKVSY
                     ++AP AWFE+L   L  LGF ++K+D SL    + T+  YILVYVDDI ITGN+   +  +IT+L+ +F+LKDLG + YF+GI+V +
Subjt:  -------------QKAPCAWFERLSLLLNSLGFINSKADPSLLFRCSGTYCCYILVYVDDINITGNSLADISSLITELDGKFSLKDLGSLYYFLGIKVSY

Query:  PLSGGLFLSQSKYIMDVLHRTKMVGANLIATPMVSGPLLSAHQGELFHDVHLYRSVVVAPRAQDL---ESEFGTSRP-RHSPATWPRHHTRLKPLLRLKT
          S G+ LSQ+KYI ++L +TKM+    + TPMVS   LS      F +  LYRS V A +   +   +  +  +R  +   A    H   +K +LR   
Subjt:  PLSGGLFLSQSKYIMDVLHRTKMVGANLIATPMVSGPLLSAHQGELFHDVHLYRSVVVAPRAQDL---ESEFGTSRP-RHSPATWPRHHTRLKPLLRLKT

Query:  VPTNQHGGTYNYGSMLHKSSGFSLHGFADVDWASDPDDRKSTSGFCVFFGSNLIMWGSKKQSIISRSNTEAEYRCLALAATEL
        V T  HG    + S  H     ++ GF D DWASD DDR STSG+C+F G NL+ W S+KQ  + +S+TEAEYR +A    E+
Subjt:  VPTNQHGGTYNYGSMLHKSSGFSLHGFADVDWASDPDDRKSTSGFCVFFGSNLIMWGSKKQSIISRSNTEAEYRCLALAATEL

A0A438K147 Retrovirus-related Pol polyprotein from transposon TNT 1-943.0e-14637.96Show/hide
Query:  FSPSTTTYSTPLQLIVTDLWGPSYKLSTHGFRYYNSFVDAFSRYTWIYFLQSKSEAFQAFVKFKTLVEKQFGKPLISLQTDEGAEFKPFIPFLHNHGINH
        FS S TTY+ PL+LI +DLWGP+  LS  G+RYY  FVDAFSR++WI+ L++KSEA + FV FKT VE QF   + SLQTD G EF+ F  +L  +GI H
Subjt:  FSPSTTTYSTPLQLIVTDLWGPSYKLSTHGFRYYNSFVDAFSRYTWIYFLQSKSEAFQAFVKFKTLVEKQFGKPLISLQTDEGAEFKPFIPFLHNHGINH

Query:  RVSCPYTSQQNGIIERKHRHIVDVGLTLLSHSSMPLTFWDDAFSTSVFLINRLPSMVLGGLSPLEKLFQKQPDYFSLKVFDCNCFPCLRPYNSHKLSFRS
        RVSCP+T QQNG+ ERKHR IV+ GLTLL   S+PL FWD++F T V+L NRLP+ VL    P+E LF+  PDY  LKVF C+CFP LRPYN+HKL +RS
Subjt:  RVSCPYTSQQNGIIERKHRHIVDVGLTLLSHSSMPLTFWDDAFSTSVFLINRLPSMVLGGLSPLEKLFQKQPDYFSLKVFDCNCFPCLRPYNSHKLSFRS

Query:  SPCTFIGYSHIHKGYKCLSTDGRVYISRHVLFDEHVFPFASVQNKSFASPAFVIQNLPIISKPCTSSCESVGNISLPVLNNSPSSPDTDL-----PHNDT
          CTF+GYS  HKGYKC+S++GRVYISR V+F+E  FP++     S   P+ V  +   +S   +    S   +  P    S + P +++      H   
Subjt:  SPCTFIGYSHIHKGYKCLSTDGRVYISRHVLFDEHVFPFASVQNKSFASPAFVIQNLPIISKPCTSSCESVGNISLPVLNNSPSSPDTDL-----PHNDT

Query:  NGSKDSSSSCPNNSISQPITLPVTNINNQSSQLPVSSVL-----STHPMVTRSKRGIFKPKAHLPTFVDIEPPNVKETLKCSQWKNAMQEEYDALIKNET
          S D++ + P   +S P+  PV ++ +  +   V+  +     +THPM+TR+K GI KPK  +      EP +V   L+  +WK AM  EYDAL +N T
Subjt:  NGSKDSSSSCPNNSISQPITLPVTNINNQSSQLPVSSVL-----STHPMVTRSKRGIFKPKAHLPTFVDIEPPNVKETLKCSQWKNAMQEEYDALIKNET

Query:  WDLVPTP------------------------------------------------------------------------------LN-------------
        W LVP P                                                                              LN             
Subjt:  WDLVPTP------------------------------------------------------------------------------LN-------------

Query:  ---------------------QKAPCAWFERLSLLLNSLGFINSKADPSLLFRCSGTYCCYILVYVDDINITGNSLADISSLITELDGKFSLKDLGSLYY
                             ++AP AWFE+L   L S GF+++K+D SL  R + ++  Y+LVYVDDI + G+    I+SLI +L+ +FSLKDLG ++Y
Subjt:  ---------------------QKAPCAWFERLSLLLNSLGFINSKADPSLLFRCSGTYCCYILVYVDDINITGNSLADISSLITELDGKFSLKDLGSLYY

Query:  FLGIKVSYPLSGGLFLSQSKYIMDVLHRTKMVGANLIATPMVSGPLLSAHQGELFHDVHLYRSVVVAPRAQDL---ESEFGTSRP-RHSPATWPRHHTRL
        FLGI+VS+  + GL LSQ+KYI D+L +TKMV      TP+ +G  L A  G+   D+H YRS V A +   +   E  F  ++  +        H   +
Subjt:  FLGIKVSYPLSGGLFLSQSKYIMDVLHRTKMVGANLIATPMVSGPLLSAHQGELFHDVHLYRSVVVAPRAQDL---ESEFGTSRP-RHSPATWPRHHTRL

Query:  KPLLRLKTVPTNQHGGTYNYGSMLHKSSGFSLHGFADVDWASDPDDRKSTSGFCVFFGSNLIMWGSKKQSIISRSNTEAEYRCLALAATEL---------
        K +LR          GT  +G  L KSS   L GF D DWASD DDR+STSG CVF G NLI W SKKQ  +SRS+TEAEYR LA    E+         
Subjt:  KPLLRLKTVPTNQHGGTYNYGSMLHKSSGFSLHGFADVDWASDPDDRKSTSGFCVFFGSNLIMWGSKKQSIISRSNTEAEYRCLALAATEL---------

Query:  ---------------------------------------------------VQHLPASDQIVDVLTKPLSVVSFLKLRSKLNVREPSSIGLRG
                                                           V+H+P++DQ+ DVLTK +S   F++ R KL +   S++ LRG
Subjt:  ---------------------------------------------------VQHLPASDQIVDVLTKPLSVVSFLKLRSKLNVREPSSIGLRG

A5BFT3 Integrase catalytic domain-containing protein1.1e-14337.11Show/hide
Query:  FSPSTTTYSTPLQLIVTDLWGPSYKLSTHGFRYYNSFVDAFSRYTWIYFLQSKSEAFQAFVKFKTLVEKQFGKPLISLQTDEGAEFKPFIPFLHNHGINH
        FS S TTY+ PL+LI  DLWGP+  LS  G+RYY  FVDAFSR++WI+ L++KSEA + FV FKT VE QF   + SLQTD G EF+ F  +L  +GI H
Subjt:  FSPSTTTYSTPLQLIVTDLWGPSYKLSTHGFRYYNSFVDAFSRYTWIYFLQSKSEAFQAFVKFKTLVEKQFGKPLISLQTDEGAEFKPFIPFLHNHGINH

Query:  RVSCPYTSQQNGIIERKHRHIVDVGLTLLSHSSMPLTFWDDAFSTSVFLINRLPSMVLGGLSPLEKLFQKQPDYFSLKVFDCNCFPCLRPYNSHKLSFRS
        RVSCP+T QQNG+ ERKHR IV+ GLTLL  +S+PL FWD++F T V+L NRLP+ +L    P+E LF+  PDY  LKVF C+CFP LRPYN+HKL +RS
Subjt:  RVSCPYTSQQNGIIERKHRHIVDVGLTLLSHSSMPLTFWDDAFSTSVFLINRLPSMVLGGLSPLEKLFQKQPDYFSLKVFDCNCFPCLRPYNSHKLSFRS

Query:  SPCTFIGYSHIHKGYKCLSTDGRVYISRHVLFDEHVFPFASVQNKSFASPAFVIQNLPIISKPCTSSCESVGNISLPVLNNSPSSPDTDL-----PHNDT
          CTF+GYS  HKGYKC+S++GRVYIS  V+F+E  FP++     S    + V  +   +S   +    S   +  P    S + P +++      H   
Subjt:  SPCTFIGYSHIHKGYKCLSTDGRVYISRHVLFDEHVFPFASVQNKSFASPAFVIQNLPIISKPCTSSCESVGNISLPVLNNSPSSPDTDL-----PHNDT

Query:  NGSKDSSSSCPNNSISQPITLPVTNINNQSSQLPVSSVL-----STHPMVTRSKRGIFKPKAHLPTFVDIEPPNVKETLKCSQWKNAMQEEYDALIKNET
          S D++ + P   +S P+  PV ++ +  +   V+  +     +THPM+TR+K GI KPK  +      EP +V   L+  +WK AM  EYDAL +N T
Subjt:  NGSKDSSSSCPNNSISQPITLPVTNINNQSSQLPVSSVL-----STHPMVTRSKRGIFKPKAHLPTFVDIEPPNVKETLKCSQWKNAMQEEYDALIKNET

Query:  WDLVPTP------------------------------------------------------------------------------LN-------------
        W LVP P                                                                              LN             
Subjt:  WDLVPTP------------------------------------------------------------------------------LN-------------

Query:  ---------------------QKAPCAWFERLSLLLNSLGFINSKADPSLLFRCSGTYCCYILVYVDDINITGNSLADISSLITELDGKFSLKDLGSLYY
                             ++AP AWFE+L   L S GF+++K+D SL  R +  +  Y+LVYVDDI + G+  A I+SLI +L+ +FSLKDLG ++Y
Subjt:  ---------------------QKAPCAWFERLSLLLNSLGFINSKADPSLLFRCSGTYCCYILVYVDDINITGNSLADISSLITELDGKFSLKDLGSLYY

Query:  FLGIKVSYPLSGGLFLSQSKYIMDVLHRTKMVGANLIATPMVSGPLLSAHQGELFHDVHLYRSVVVAPRAQDLESEFGTSRPRHSPATWPRHHTRLKPLL
        FLGI+VS+  + GL LSQ+KYI D+L +TKMV      TP+ +G  L    G+   D+H YRS V A               ++   T P     +  + 
Subjt:  FLGIKVSYPLSGGLFLSQSKYIMDVLHRTKMVGANLIATPMVSGPLLSAHQGELFHDVHLYRSVVVAPRAQDLESEFGTSRPRHSPATWPRHHTRLKPLL

Query:  RLKTVPTNQH-----------GGTYNYGSMLHKSSGFSLHGFADVDWASDPDDRKSTSGFCVFFGSNLIMWGSKKQSIISRSNTEAEYRCLALAATEL--
        +    PT +H            GT  +G  L KSS   L GF D DWASD DDR+STSG CVF G NLI W SKKQ I+SRS+ E EYR LA    E+  
Subjt:  RLKTVPTNQH-----------GGTYNYGSMLHKSSGFSLHGFADVDWASDPDDRKSTSGFCVFFGSNLIMWGSKKQSIISRSNTEAEYRCLALAATEL--

Query:  ----------------------------------------------------------VQHLPASDQIVDVLTKPLSVVSFLKLRSKLNVREPSSIGLRG
                                                                  V+H+P++DQ+ DVLTK +S   F++ R KL +   S++ LRG
Subjt:  ----------------------------------------------------------VQHLPASDQIVDVLTKPLSVVSFLKLRSKLNVREPSSIGLRG

SwissProt top hitse value%identityAlignment
P04146 Copia protein3.1e-3121.55Show/hide
Query:  CVRPVQARQPSCHALFSPSTTTYSTPLQLIVTDLWGPSYKLSTHGFRYYNSFVDAFSRYTWIYFLQSKSEAFQAFVKFKTLVEKQFGKPLISLQTDEGAE
        C+   QAR P          T    PL ++ +D+ GP   ++     Y+  FVD F+ Y   Y ++ KS+ F  F  F    E  F   ++ L  D G E
Subjt:  CVRPVQARQPSCHALFSPSTTTYSTPLQLIVTDLWGPSYKLSTHGFRYYNSFVDAFSRYTWIYFLQSKSEAFQAFVKFKTLVEKQFGKPLISLQTDEGAE

Query:  F--KPFIPFLHNHGINHRVSCPYTSQQNGIIERKHRHIVDVGLTLLSHSSMPLTFWDDAFSTSVFLINRLPSMVL--GGLSPLEKLFQKQPDYFSLKVFD
        +       F    GI++ ++ P+T Q NG+ ER  R I +   T++S + +  +FW +A  T+ +LINR+PS  L     +P E    K+P    L+VF 
Subjt:  F--KPFIPFLHNHGINHRVSCPYTSQQNGIIERKHRHIVDVGLTLLSHSSMPLTFWDDAFSTSVFLINRLPSMVL--GGLSPLEKLFQKQPDYFSLKVFD

Query:  CNCFPCLRPYNSHKLSFRSSPCTFIGYSHIHKGYKCL-STDGRVYISRHVLFDE------HVFPFASV--------QNKSFA--SPAFVIQNLPIISKPC
           +  ++     K   +S    F+GY     G+K   + + +  ++R V+ DE          F +V        +NK+F   S   +    P  SK C
Subjt:  CNCFPCLRPYNSHKLSFRSSPCTFIGYSHIHKGYKCL-STDGRVYISRHVLFDE------HVFPFASV--------QNKSFA--SPAFVIQNLPIISKPC

Query:  TS-----SCESVGNISLPVLNNSPSSPDTDLPH-----------------------------NDTNGSKDSSSSCPNNSISQPIT--LPVTNINNQSSQL
         +       +   N + P  N+S     T+ P+                              D + ++   S  PN S        L    I+N +   
Subjt:  TS-----SCESVGNISLPVLNNSPSSPDTDLPH-----------------------------NDTNGSKDSSSSCPNNSISQPIT--LPVTNINNQSSQL

Query:  PV------SSVLSTHPMVTRSKRGIFKPKAHL---PTFVDIEPPNVKETLKC----SQWKNAMQEEYDALIKNETWDLVPTPLNQK--------------
         +      S  L T P ++ ++      K  L     F D+  PN  + ++     S W+ A+  E +A   N TW +   P N+               
Subjt:  PV------SSVLSTHPMVTRSKRGIFKPKAHL---PTFVDIEPPNVKETLKC----SQWKNAMQEEYDALIKNETWDLVPTPLNQK--------------

Query:  ----------------------------APCA--------------------------------------------------------------------
                                    AP A                                                                    
Subjt:  ----------------------------APCA--------------------------------------------------------------------

Query:  WFERLSLLLNSLGFINSKADPSLLFRCSGTY--CCYILVYVDDINITGNSLADISSLITELDGKFSLKDLGSLYYFLGIKVSYPLSGGLFLSQSKYIMDV
        WFE     L    F+NS  D  +     G      Y+L+YVDD+ I    +  +++    L  KF + DL  + +F+GI++       ++LSQS Y+  +
Subjt:  WFERLSLLLNSLGFINSKADPSLLFRCSGTY--CCYILVYVDDINITGNSLADISSLITELDGKFSLKDLGSLYYFLGIKVSYPLSGGLFLSQSKYIMDV

Query:  LHRTKMVGANLIATPM---VSGPLLSAHQG-----ELFHDVHLYRSVVVAPRAQDLESEFGTSRPRHSPATWPRHHTRLKPLLRLKTVPTNQHGGTYNYG
        L +  M   N ++TP+   ++  LL++ +             +Y  +   P      +       +++   W      LK +LR          GT +  
Subjt:  LHRTKMVGANLIATPM---VSGPLLSAHQG-----ELFHDVHLYRSVVVAPRAQDLESEFGTSRPRHSPATWPRHHTRLKPLLRLKTVPTNQHGGTYNYG

Query:  SMLHKSSGF--SLHGFADVDWASDPDDRKSTSGFCV-FFGSNLIMWGSKKQSIISRSNTEAEYR-----------------------------------C
         +  K+  F   + G+ D DWA    DRKST+G+    F  NLI W +K+Q+ ++ S+TEAEY                                    C
Subjt:  SMLHKSSGF--SLHGFADVDWASDPDDRKSTSGFCV-FFGSNLIMWGSKKQSIISRSNTEAEYR-----------------------------------C

Query:  LALA------------------ATELVQ-------HLPASDQIVDVLTKPLSVVSFLKLRSKLNV
        +++A                  A E VQ       ++P  +Q+ D+ TKPL    F++LR KL +
Subjt:  LALA------------------ATELVQ-------HLPASDQIVDVLTKPLSVVSFLKLRSKLNV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.1e-4926.12Show/hide
Query:  FSPSTTTYSTPLQLIVTDLWGPSYKLSTHGFRYYNSFVDAFSRYTWIYFLQSKSEAFQAFVKFKTLVEKQFGKPLISLQTDEGAEF--KPFIPFLHNHGI
        F  S+      L L+ +D+ GP    S  G +Y+ +F+D  SR  W+Y L++K + FQ F KF  LVE++ G+ L  L++D G E+  + F  +  +HGI
Subjt:  FSPSTTTYSTPLQLIVTDLWGPSYKLSTHGFRYYNSFVDAFSRYTWIYFLQSKSEAFQAFVKFKTLVEKQFGKPLISLQTDEGAEF--KPFIPFLHNHGI

Query:  NHRVSCPYTSQQNGIIERKHRHIVDVGLTLLSHSSMPLTFWDDAFSTSVFLINRLPSMVLGGLSPLEKLFQKQPDYFSLKVFDCNCFPCLRPYNSHKLSF
         H  + P T Q NG+ ER +R IV+   ++L  + +P +FW +A  T+ +LINR PS+ L    P      K+  Y  LKVF C  F  +      KL  
Subjt:  NHRVSCPYTSQQNGIIERKHRHIVDVGLTLLSHSSMPLTFWDDAFSTSVFLINRLPSMVLGGLSPLEKLFQKQPDYFSLKVFDCNCFPCLRPYNSHKLSF

Query:  RSSPCTFIGYSHIHKGYKCLS-TDGRVYISRHVLFDEHVFPFASVQNKSFASPAFVIQNLPIISKPCTSSCESVGNISLPVLNNSPSSPDTDLPHNDTNG
        +S PC FIGY     GY+       +V  SR V+F E     A+  ++   +   +I N                 +++P  +N+P+S ++        G
Subjt:  RSSPCTFIGYSHIHKGYKCLS-TDGRVYISRHVLFDEHVFPFASVQNKSFASPAFVIQNLPIISKPCTSSCESVGNISLPVLNNSPSSPDTDLPHNDTNG

Query:  SKDSSSSCPNNSISQPITLPVTNINNQSSQLPVSSVLSTHPMVTRSKRGIFKPKAHLPTFVDIEPPNVKETL---KCSQWKNAMQEEYDALIKNETWDLV
         +          + + +         +    P+    S  P V  S+R  +    ++    D EP ++KE L   + +Q   AMQEE ++L KN T+ LV
Subjt:  SKDSSSSCPNNSISQPITLPVTNINNQSSQLPVSSVLSTHPMVTRSKRGIFKPKAHLPTFVDIEPPNVKETL---KCSQWKNAMQEEYDALIKNETWDLV

Query:  PTP-------------------------------------------------------------------------------------------------
          P                                                                                                 
Subjt:  PTP-------------------------------------------------------------------------------------------------

Query:  ---------LN------QKAPCAWFERLSLLLNSLGFINSKADPSLLF-RCSGTYCCYILVYVDDINITGNSLADISSLITELDGKFSLKDLGSLYYFLG
                 LN      ++AP  W+ +    + S  ++ + +DP + F R S      +L+YVDD+ I G     I+ L  +L   F +KDLG     LG
Subjt:  ---------LN------QKAPCAWFERLSLLLNSLGFINSKADPSLLF-RCSGTYCCYILVYVDDINITGNSLADISSLITELDGKFSLKDLGSLYYFLG

Query:  IK-VSYPLSGGLFLSQSKYIMDVLHRTKMVGANLIATPMVSG--------PLLSAHQGELFHDVH-------LYRSVVVAPRAQDLESEFG-TSRPRHSP
        +K V    S  L+LSQ KYI  VL R  M  A  ++TP+           P     +G +    +       +Y  V   P   D+    G  SR   +P
Subjt:  IK-VSYPLSGGLFLSQSKYIMDVLHRTKMVGANLIATPMVSG--------PLLSAHQGELFHDVH-------LYRSVVVAPRAQDLESEFG-TSRPRHSP

Query:  ATWPRHHTRLKPLLRLKTVPTNQHGGTYNYGSMLHKSSGFSLHGFADVDWASDPDDRKSTSGFCVFFGSNLIMWGSKKQSIISRSNTEAEYRCLALAATE
             H   +K +LR     T   G    +G      S   L G+ D D A D D+RKS++G+   F    I W SK Q  ++ S TEAEY    +AATE
Subjt:  ATWPRHHTRLKPLLRLKTVPTNQHGGTYNYGSMLHKSSGFSLHGFADVDWASDPDDRKSTSGFCVFFGSNLIMWGSKKQSIISRSNTEAEYRCLALAATE

P92519 Uncharacterized mitochondrial protein AtMg008101.1e-3139.66Show/hide
Query:  YILVYVDDINITGNSLADISSLITELDGKFSLKDLGSLYYFLGIKV-SYPLSGGLFLSQSKYIMDVLHRTKMVGANLIATPMVSGPLLSAHQGELFHDVH
        Y+L+YVDDI +TG+S   ++ LI +L   FS+KDLG ++YFLGI++ ++P   GLFLSQ+KY   +L+   M+    ++TP+    L S+     + D  
Subjt:  YILVYVDDINITGNSLADISSLITELDGKFSLKDLGSLYYFLGIKV-SYPLSGGLFLSQSKYIMDVLHRTKMVGANLIATPMVSGPLLSAHQGELFHDVH

Query:  LYRSVVVAPRAQDLESEFGTSRPRHSPAT----WPRHHTRLKPLLRLKTVPTNQHGGTYNYGSMLHKSSGFSLHGFADVDWASDPDDRKSTSGFCVFFGS
         +RS+V A +   L      +RP  S A        H   L     LK V      GT  +G  +HK+S  ++  F D DWA     R+ST+GFC F G 
Subjt:  LYRSVVVAPRAQDLESEFGTSRPRHSPAT----WPRHHTRLKPLLRLKTVPTNQHGGTYNYGSMLHKSSGFSLHGFADVDWASDPDDRKSTSGFCVFFGS

Query:  NLIMWGSKKQSIISRSNTEAEYRCLALAATEL
        N+I W +K+Q  +SRS+TE EYR LAL A EL
Subjt:  NLIMWGSKKQSIISRSNTEAEYRCLALAATEL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.6e-10731.58Show/hide
Query:  FSPSTTTYSTPLQLIVTDLWGPSYKLSTHGFRYYNSFVDAFSRYTWIYFLQSKSEAFQAFVKFKTLVEKQFGKPLISLQTDEGAEFKPFIPFLHNHGINH
        FS ST   + PL+ I +D+W  S  LS   +RYY  FVD F+RYTW+Y L+ KS+  + F+ FK L+E +F   + +  +D G EF     +   HGI+H
Subjt:  FSPSTTTYSTPLQLIVTDLWGPSYKLSTHGFRYYNSFVDAFSRYTWIYFLQSKSEAFQAFVKFKTLVEKQFGKPLISLQTDEGAEFKPFIPFLHNHGINH

Query:  RVSCPYTSQQNGIIERKHRHIVDVGLTLLSHSSMPLTFWDDAFSTSVFLINRLPSMVLGGLSPLEKLFQKQPDYFSLKVFDCNCFPCLRPYNSHKLSFRS
          S P+T + NG+ ERKHRHIV+ GLTLLSH+S+P T+W  AF+ +V+LINRLP+ +L   SP +KLF   P+Y  L+VF C C+P LRPYN HKL  +S
Subjt:  RVSCPYTSQQNGIIERKHRHIVDVGLTLLSHSSMPLTFWDDAFSTSVFLINRLPSMVLGGLSPLEKLFQKQPDYFSLKVFDCNCFPCLRPYNSHKLSFRS

Query:  SPCTFIGYSHIHKGYKCLSTD-GRVYISRHVLFDEHVFPFASV------------QNKSFASPAFVI-QNLPII------------------SKPCTSSC
          C F+GYS     Y CL     R+YISRHV FDE+ FPF++             ++    SP   +    P++                  S P  +S 
Subjt:  SPCTFIGYSHIHKGYKCLSTD-GRVYISRHVLFDEHVFPFASV------------QNKSFASPAFVI-QNLPII------------------SKPCTSSC

Query:  ESVGNISLPVLNNSPSSPDTDLP--------------------------HNDTNGS-------------KDSSSSCPNNSISQPITL------------P
         S  N+     ++ PSSP+   P                          +N TN S               SSS  P  S S   T             P
Subjt:  ESVGNISLPVLNNSPSSPDTDLP--------------------------HNDTNGS-------------KDSSSSCPNNSISQPITL------------P

Query:  VTNINNQSSQLPVSSVLSTHPMVTRSKRGIFKPKAHLPTFVDI----EPPNVKETLKCSQWKNAMQEEYDALIKNETWDLVPTP----------------
        +  I N ++Q P    L+TH M TR+K GI KP       V +    EP    + LK  +W+NAM  E +A I N TWDLVP P                
Subjt:  VTNINNQSSQLPVSSVLSTHPMVTRSKRGIFKPKAHLPTFVDI----EPPNVKETLKCSQWKNAMQEEYDALIKNETWDLVPTP----------------

Query:  ------LN-------------------------------------------------------------------------------------------Q
              LN                                                                                           +
Subjt:  ------LN-------------------------------------------------------------------------------------------Q

Query:  KAPCAWFERLSLLLNSLGFINSKADPSLLFRCSGTYCCYILVYVDDINITGNSLADISSLITELDGKFSLKDLGSLYYFLGIKVSYPLSGGLFLSQSKYI
        +AP AW+  L   L ++GF+NS +D SL     G    Y+LVYVDDI ITGN    + + +  L  +FS+KD   L+YFLGI+    +  GL LSQ +YI
Subjt:  KAPCAWFERLSLLLNSLGFINSKADPSLLFRCSGTYCCYILVYVDDINITGNSLADISSLITELDGKFSLKDLGSLYYFLGIKVSYPLSGGLFLSQSKYI

Query:  MDVLHRTKMVGANLIATPMVSGPLLSAHQGELFHDVHLYRSVV-----VAPRAQDLESEFG-TSRPRHSPATWPRHHTRLKPLLRLKTVPTNQHGGTYNY
        +D+L RT M+ A  + TPM   P LS + G    D   YR +V     +A    D+       S+  H P     H   LK +LR          GT N+
Subjt:  MDVLHRTKMVGANLIATPMVSGPLLSAHQGELFHDVHLYRSVV-----VAPRAQDLESEFG-TSRPRHSPATWPRHHTRLKPLLRLKTVPTNQHGGTYNY

Query:  GSMLHKSSGFSLHGFADVDWASDPDDRKSTSGFCVFFGSNLIMWGSKKQSIISRSNTEAEYRCLALAATEL-----------------------------
        G  L K +  SLH ++D DWA D DD  ST+G+ V+ G + I W SKKQ  + RS+TEAEYR +A  ++E+                             
Subjt:  GSMLHKSSGFSLHGFADVDWASDPDDRKSTSGFCVFFGSNLIMWGSKKQSIISRSNTEAEYRCLALAATEL-----------------------------

Query:  -------------------------------VQHLPASDQIVDVLTKPLSVVSFLKLRSKLNV-REPSS
                                       V H+   DQ+ D LTKPLS  +F    SK+ V R P S
Subjt:  -------------------------------VQHLPASDQIVDVLTKPLSVVSFLKLRSKLNV-REPSS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE28.3e-10932.21Show/hide
Query:  FSPSTTTYSTPLQLIVTDLWGPSYKLSTHGFRYYNSFVDAFSRYTWIYFLQSKSEAFQAFVKFKTLVEKQFGKPLISLQTDEGAEFKPFIPFLHNHGINH
        FS ST T S PL+ I +D+W  S  LS   +RYY  FVD F+RYTW+Y L+ KS+    F+ FK+LVE +F   + +L +D G EF     +L  HGI+H
Subjt:  FSPSTTTYSTPLQLIVTDLWGPSYKLSTHGFRYYNSFVDAFSRYTWIYFLQSKSEAFQAFVKFKTLVEKQFGKPLISLQTDEGAEFKPFIPFLHNHGINH

Query:  RVSCPYTSQQNGIIERKHRHIVDVGLTLLSHSSMPLTFWDDAFSTSVFLINRLPSMVLGGLSPLEKLFQKQPDYFSLKVFDCNCFPCLRPYNSHKLSFRS
          S P+T + NG+ ERKHRHIV++GLTLLSH+S+P T+W  AFS +V+LINRLP+ +L   SP +KLF + P+Y  LKVF C C+P LRPYN HKL  +S
Subjt:  RVSCPYTSQQNGIIERKHRHIVDVGLTLLSHSSMPLTFWDDAFSTSVFLINRLPSMVLGGLSPLEKLFQKQPDYFSLKVFDCNCFPCLRPYNSHKLSFRS

Query:  SPCTFIGYSHIHKGYKCLS-TDGRVYISRHVLFDEHVFPFA---------------------------------------------SVQNKSFASPAFVI
          C F+GYS     Y CL    GR+Y SRHV FDE  FPF+                                             S +  S  SP    
Subjt:  SPCTFIGYSHIHKGYKCLS-TDGRVYISRHVLFDEHVFPFA---------------------------------------------SVQNKSFASPAFVI

Query:  Q----NLPIISKPCTSSCESV------------------GNISLPVLNN------SPSSPDTD---------LPHNDTNGSKDSSSSCPNNSISQ----P
        Q    NLP  S    SS E                     N + P+LNN      SP+SP+ +          PH  T  +  S  + P++S +     P
Subjt:  Q----NLPIISKPCTSSCESV------------------GNISLPVLNN------SPSSPDTD---------LPHNDTNGSKDSSSSCPNNSISQ----P

Query:  ITLPVTNINNQSSQLPVSSVLSTHPMVTRSKRGIFKPKAHLPTFVDI----EPPNVKETLKCSQWKNAMQEEYDALIKNETWDLVPTP------------
          LP   I   ++Q PV    +TH M TR+K GI KP         +    EP    + +K  +W+ AM  E +A I N TWDLVP P            
Subjt:  ITLPVTNINNQSSQLPVSSVLSTHPMVTRSKRGIFKPKAHLPTFVDI----EPPNVKETLKCSQWKNAMQEEYDALIKNETWDLVPTP------------

Query:  ----------LN----------------------------------------------------------------------------------------
                  LN                                                                                        
Subjt:  ----------LN----------------------------------------------------------------------------------------

Query:  ---QKAPCAWFERLSLLLNSLGFINSKADPSLLFRCSGTYCCYILVYVDDINITGNSLADISSLITELDGKFSLKDLGSLYYFLGIKVSYPLSGGLFLSQ
           ++AP AW+  L   L ++GF+NS +D SL     G    Y+LVYVDDI ITGN    +   +  L  +FS+K+   L+YFLGI+    +  GL LSQ
Subjt:  ---QKAPCAWFERLSLLLNSLGFINSKADPSLLFRCSGTYCCYILVYVDDINITGNSLADISSLITELDGKFSLKDLGSLYYFLGIKVSYPLSGGLFLSQ

Query:  SKYIMDVLHRTKMVGANLIATPMVSGPLLSAHQGELFHDVHLYRSVV-----VAPRAQDLESEFG-TSRPRHSPATWPRHHTRLKPLLRLKTVPTNQHGG
         +Y +D+L RT M+ A  +ATPM + P L+ H G    D   YR +V     +A    DL       S+  H P     H   LK +LR          G
Subjt:  SKYIMDVLHRTKMVGANLIATPMVSGPLLSAHQGELFHDVHLYRSVV-----VAPRAQDLESEFG-TSRPRHSPATWPRHHTRLKPLLRLKTVPTNQHGG

Query:  TYNYGSMLHKSSGFSLHGFADVDWASDPDDRKSTSGFCVFFGSNLIMWGSKKQSIISRSNTEAEYRCLALAATEL-------------------------
        T ++G  L K +  SLH ++D DWA D DD  ST+G+ V+ G + I W SKKQ  + RS+TEAEYR +A  ++EL                         
Subjt:  TYNYGSMLHKSSGFSLHGFADVDWASDPDDRKSTSGFCVFFGSNLIMWGSKKQSIISRSNTEAEYRCLALAATEL-------------------------

Query:  -----------------------------------VQHLPASDQIVDVLTKPLSVVSFLKLRSKLNV-REPSSIG
                                           V H+   DQ+ D LTKPLS V+F     K+ V + P S G
Subjt:  -----------------------------------VQHLPASDQIVDVLTKPLSVVSFLKLRSKLNV-REPSSIG

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.4e-3432.47Show/hide
Query:  QKAPCAWFERLSLLLNSLGFINSKADPSLLFRCSGTYCCYILVYVDDINITGNSLADISSLITELDGKFSLKDLGSLYYFLGIKVSYPLSGGLFLSQSKY
        ++A   WF + S+ L   GF+ S +D +   + + T    +LVYVDDI I  N+ A +  L ++L   F L+DLG L YFLG++++   + G+ + Q KY
Subjt:  QKAPCAWFERLSLLLNSLGFINSKADPSLLFRCSGTYCCYILVYVDDINITGNSLADISSLITELDGKFSLKDLGSLYYFLGIKVSYPLSGGLFLSQSKY

Query:  IMDVLHRTKMVGANLIATPMVSGPLLSAHQGELFHDVHLYRSVV---VAPRAQDLESEFGTSR-PRHSPATWPRHHTRLKPLLRLKTVPTNQHGGTYNYG
         +D+L  T ++G    + PM      SAH G  F D   YR ++   +  +   L+  F  ++  + S A    H   +  +L       +   GT   G
Subjt:  IMDVLHRTKMVGANLIATPMVSGPLLSAHQGELFHDVHLYRSVV---VAPRAQDLESEFGTSR-PRHSPATWPRHHTRLKPLLRLKTVPTNQHGGTYNYG

Query:  SMLHKSSGFSLHGFADVDWASDPDDRKSTSGFCVFFGSNLIMWGSKKQSIISRSNTEAEYRCLALAATELV
              +   L  F+D  + S  D R+ST+G+C+F G++LI W SKKQ ++S+S+ EAEYR L+ A  E++
Subjt:  SMLHKSSGFSLHGFADVDWASDPDDRKSTSGFCVFFGSNLIMWGSKKQSIISRSNTEAEYRCLALAATELV

ATMG00240.1 Gag-Pol-related retrotransposon family protein2.0e-0440.85Show/hide
Query:  GTYNYGSMLHKSSGFSLHGFADVDWASDPDDRKSTSGFCVFFGSNLIMW--GSKKQSIIS-----RSNTEA
        GT   G     +S   L  FAD DWAS PD R+S +GFC    S + +W  G+ ++SI+S     R N EA
Subjt:  GTYNYGSMLHKSSGFSLHGFADVDWASDPDDRKSTSGFCVFFGSNLIMW--GSKKQSIIS-----RSNTEA

ATMG00810.1 DNA/RNA polymerases superfamily protein7.6e-3339.66Show/hide
Query:  YILVYVDDINITGNSLADISSLITELDGKFSLKDLGSLYYFLGIKV-SYPLSGGLFLSQSKYIMDVLHRTKMVGANLIATPMVSGPLLSAHQGELFHDVH
        Y+L+YVDDI +TG+S   ++ LI +L   FS+KDLG ++YFLGI++ ++P   GLFLSQ+KY   +L+   M+    ++TP+    L S+     + D  
Subjt:  YILVYVDDINITGNSLADISSLITELDGKFSLKDLGSLYYFLGIKV-SYPLSGGLFLSQSKYIMDVLHRTKMVGANLIATPMVSGPLLSAHQGELFHDVH

Query:  LYRSVVVAPRAQDLESEFGTSRPRHSPAT----WPRHHTRLKPLLRLKTVPTNQHGGTYNYGSMLHKSSGFSLHGFADVDWASDPDDRKSTSGFCVFFGS
         +RS+V A +   L      +RP  S A        H   L     LK V      GT  +G  +HK+S  ++  F D DWA     R+ST+GFC F G 
Subjt:  LYRSVVVAPRAQDLESEFGTSRPRHSPAT----WPRHHTRLKPLLRLKTVPTNQHGGTYNYGSMLHKSSGFSLHGFADVDWASDPDDRKSTSGFCVFFGS

Query:  NLIMWGSKKQSIISRSNTEAEYRCLALAATEL
        N+I W +K+Q  +SRS+TE EYR LAL A EL
Subjt:  NLIMWGSKKQSIISRSNTEAEYRCLALAATEL

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)7.7e-0948.1Show/hide
Query:  MVTRSKRGIFK--PKAHL--PTFVDIEPPNVKETLKCSQWKNAMQEEYDALIKNETWDLVPTPLNQK-APCAWFERLSL
        M+TRSK GI K  PK  L   T +  EP +V   LK   W  AMQEE DAL +N+TW LVP P+NQ    C W  +  L
Subjt:  MVTRSKRGIFK--PKAHL--PTFVDIEPPNVKETLKCSQWKNAMQEEYDALIKNETWDLVPTPLNQK-APCAWFERLSL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCCCTACCCTCTTTATGGCACGAGAGGGATTTCTGTTTGCTGGTTGGACCACAAACAGGTTGTTCATTAGAGGAGCACTGGTACTTAAGGACCAAGAGGTAGCCCA
GGGAAATATATCTGCAGTGAGAAGAGTGCAGCTATGGTTCTATAGTGGAGTGAACCACAGTCCATTAGGTCCCACCGGTAGCTCTATAAGGGCGTTGAGTGGAGAACCAA
GGAATTTTGGCGAAGCAATTCAAGAAATCTTCAAGGGTAGGGCTGACGTCAGCGAGGCGCGGTTCAGGCGCAAGGAGGAGGCGCGGTCCAGCTCGTCCGGTTCGGTCTGC
GTCAGGCCGGTTCAGGCGCGGCAACCATCATGCCATGCCCTTTTCTCTCCATCCACTACTACTTACTCTACACCTTTACAACTCATTGTAACAGATTTATGGGGACCTTC
TTACAAGCTTTCCACACATGGTTTTAGATATTATAATAGTTTTGTTGATGCTTTTTCTCGATACACATGGATATATTTCCTTCAATCTAAGTCTGAAGCATTTCAAGCTT
TTGTTAAATTCAAAACTCTTGTTGAAAAACAGTTTGGGAAGCCTCTTATTTCTCTTCAAACCGATGAGGGTGCGGAGTTTAAGCCTTTCATTCCTTTTCTACATAATCAT
GGCATTAATCATCGTGTCTCATGTCCATATACATCTCAACAGAATGGCATAATTGAACGAAAGCATAGACATATTGTTGATGTTGGTCTTACCTTGTTGTCTCATTCCTC
TATGCCTCTAACATTTTGGGATGATGCCTTTTCTACAAGTGTCTTTCTTATTAACAGGTTACCTTCTATGGTCCTTGGTGGTTTGAGTCCCTTGGAGAAGCTCTTCCAGA
AGCAACCAGATTATTTCTCACTTAAGGTATTCGATTGTAACTGCTTTCCCTGTCTTCGCCCTTATAACTCTCACAAACTCAGTTTTCGATCAAGTCCATGTACATTCATT
GGGTATAGTCATATTCACAAGGGTTATAAGTGTTTGTCTACTGATGGTCGTGTGTATATATCTCGACATGTGTTGTTTGATGAGCATGTGTTTCCTTTTGCTTCTGTTCA
AAATAAATCTTTTGCTTCTCCTGCATTTGTCATTCAAAATTTACCTATTATATCTAAACCATGTACATCTTCTTGTGAATCTGTTGGCAACATCTCTCTGCCTGTTCTTA
ATAATAGTCCTTCTTCGCCTGACACTGATTTACCTCATAATGATACTAATGGTTCTAAAGACTCTAGTTCATCTTGCCCGAACAATTCCATTTCTCAACCTATTACTTTA
CCTGTCACAAATATCAATAATCAATCTAGCCAGTTACCTGTGTCTAGTGTCCTAAGTACTCATCCCATGGTTACTCGAAGTAAAAGGGGCATATTCAAACCTAAGGCTCA
CTTACCTACATTTGTTGACATAGAACCCCCCAATGTTAAAGAGACCCTTAAATGTTCTCAGTGGAAGAATGCTATGCAAGAAGAATATGATGCTCTAATAAAAAATGAAA
CTTGGGATCTTGTTCCTACACCTTTAAATCAAAAGGCCCCGTGCGCTTGGTTTGAACGACTGAGTTTGCTTCTTAACTCTCTTGGTTTTATAAACTCTAAGGCTGATCCT
TCTTTATTGTTTCGATGCTCGGGTACTTACTGTTGCTACATTCTTGTTTATGTTGATGACATAAATATCACAGGGAATTCTTTGGCTGATATTTCCTCTCTTATTACAGA
GTTGGATGGGAAGTTCTCTCTCAAAGATCTTGGTTCTCTTTACTATTTTTTGGGCATCAAGGTATCTTACCCCCTTTCTGGTGGCTTATTTTTGTCTCAAAGCAAATATA
TTATGGATGTTTTACATAGAACTAAGATGGTTGGTGCAAATCTCATCGCTACTCCTATGGTTAGTGGCCCTTTACTATCAGCACATCAAGGTGAATTGTTTCATGATGTT
CACTTATATAGGAGTGTCGTTGTAGCGCCCCGGGCCCAGGATTTGGAATCTGAATTCGGCACCTCACGGCCCCGACATTCCCCTGCGACCTGGCCACGTCACCATACTCG
TCTTAAACCGCTTCTAAGATTGAAGACTGTCCCCACAAACCAACACGGGGGTACGTACAATTATGGGAGTATGTTGCATAAGTCATCTGGTTTCTCTCTTCATGGCTTTG
CTGATGTCGATTGGGCATCTGATCCTGATGATCGTAAGTCCACCTCTGGCTTTTGTGTTTTCTTTGGAAGCAATTTAATTATGTGGGGCTCTAAGAAACAATCAATTATA
TCTCGGTCCAACACTGAAGCTGAATATCGTTGCTTAGCTCTTGCTGCAACTGAGTTGGTCCAACATTTACCAGCTTCTGATCAGATTGTTGATGTTCTTACCAAGCCTCT
GTCCGTTGTTTCTTTTCTCAAGCTTCGGTCCAAGCTCAATGTTCGAGAGCCCTCTTCCATTGGCTTGAGGGGGGGGGGTGTTAACATTAAGGCAGCCCATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGCCCTACCCTCTTTATGGCACGAGAGGGATTTCTGTTTGCTGGTTGGACCACAAACAGGTTGTTCATTAGAGGAGCACTGGTACTTAAGGACCAAGAGGTAGCCCA
GGGAAATATATCTGCAGTGAGAAGAGTGCAGCTATGGTTCTATAGTGGAGTGAACCACAGTCCATTAGGTCCCACCGGTAGCTCTATAAGGGCGTTGAGTGGAGAACCAA
GGAATTTTGGCGAAGCAATTCAAGAAATCTTCAAGGGTAGGGCTGACGTCAGCGAGGCGCGGTTCAGGCGCAAGGAGGAGGCGCGGTCCAGCTCGTCCGGTTCGGTCTGC
GTCAGGCCGGTTCAGGCGCGGCAACCATCATGCCATGCCCTTTTCTCTCCATCCACTACTACTTACTCTACACCTTTACAACTCATTGTAACAGATTTATGGGGACCTTC
TTACAAGCTTTCCACACATGGTTTTAGATATTATAATAGTTTTGTTGATGCTTTTTCTCGATACACATGGATATATTTCCTTCAATCTAAGTCTGAAGCATTTCAAGCTT
TTGTTAAATTCAAAACTCTTGTTGAAAAACAGTTTGGGAAGCCTCTTATTTCTCTTCAAACCGATGAGGGTGCGGAGTTTAAGCCTTTCATTCCTTTTCTACATAATCAT
GGCATTAATCATCGTGTCTCATGTCCATATACATCTCAACAGAATGGCATAATTGAACGAAAGCATAGACATATTGTTGATGTTGGTCTTACCTTGTTGTCTCATTCCTC
TATGCCTCTAACATTTTGGGATGATGCCTTTTCTACAAGTGTCTTTCTTATTAACAGGTTACCTTCTATGGTCCTTGGTGGTTTGAGTCCCTTGGAGAAGCTCTTCCAGA
AGCAACCAGATTATTTCTCACTTAAGGTATTCGATTGTAACTGCTTTCCCTGTCTTCGCCCTTATAACTCTCACAAACTCAGTTTTCGATCAAGTCCATGTACATTCATT
GGGTATAGTCATATTCACAAGGGTTATAAGTGTTTGTCTACTGATGGTCGTGTGTATATATCTCGACATGTGTTGTTTGATGAGCATGTGTTTCCTTTTGCTTCTGTTCA
AAATAAATCTTTTGCTTCTCCTGCATTTGTCATTCAAAATTTACCTATTATATCTAAACCATGTACATCTTCTTGTGAATCTGTTGGCAACATCTCTCTGCCTGTTCTTA
ATAATAGTCCTTCTTCGCCTGACACTGATTTACCTCATAATGATACTAATGGTTCTAAAGACTCTAGTTCATCTTGCCCGAACAATTCCATTTCTCAACCTATTACTTTA
CCTGTCACAAATATCAATAATCAATCTAGCCAGTTACCTGTGTCTAGTGTCCTAAGTACTCATCCCATGGTTACTCGAAGTAAAAGGGGCATATTCAAACCTAAGGCTCA
CTTACCTACATTTGTTGACATAGAACCCCCCAATGTTAAAGAGACCCTTAAATGTTCTCAGTGGAAGAATGCTATGCAAGAAGAATATGATGCTCTAATAAAAAATGAAA
CTTGGGATCTTGTTCCTACACCTTTAAATCAAAAGGCCCCGTGCGCTTGGTTTGAACGACTGAGTTTGCTTCTTAACTCTCTTGGTTTTATAAACTCTAAGGCTGATCCT
TCTTTATTGTTTCGATGCTCGGGTACTTACTGTTGCTACATTCTTGTTTATGTTGATGACATAAATATCACAGGGAATTCTTTGGCTGATATTTCCTCTCTTATTACAGA
GTTGGATGGGAAGTTCTCTCTCAAAGATCTTGGTTCTCTTTACTATTTTTTGGGCATCAAGGTATCTTACCCCCTTTCTGGTGGCTTATTTTTGTCTCAAAGCAAATATA
TTATGGATGTTTTACATAGAACTAAGATGGTTGGTGCAAATCTCATCGCTACTCCTATGGTTAGTGGCCCTTTACTATCAGCACATCAAGGTGAATTGTTTCATGATGTT
CACTTATATAGGAGTGTCGTTGTAGCGCCCCGGGCCCAGGATTTGGAATCTGAATTCGGCACCTCACGGCCCCGACATTCCCCTGCGACCTGGCCACGTCACCATACTCG
TCTTAAACCGCTTCTAAGATTGAAGACTGTCCCCACAAACCAACACGGGGGTACGTACAATTATGGGAGTATGTTGCATAAGTCATCTGGTTTCTCTCTTCATGGCTTTG
CTGATGTCGATTGGGCATCTGATCCTGATGATCGTAAGTCCACCTCTGGCTTTTGTGTTTTCTTTGGAAGCAATTTAATTATGTGGGGCTCTAAGAAACAATCAATTATA
TCTCGGTCCAACACTGAAGCTGAATATCGTTGCTTAGCTCTTGCTGCAACTGAGTTGGTCCAACATTTACCAGCTTCTGATCAGATTGTTGATGTTCTTACCAAGCCTCT
GTCCGTTGTTTCTTTTCTCAAGCTTCGGTCCAAGCTCAATGTTCGAGAGCCCTCTTCCATTGGCTTGAGGGGGGGGGGTGTTAACATTAAGGCAGCCCATTGA
Protein sequenceShow/hide protein sequence
MGPTLFMAREGFLFAGWTTNRLFIRGALVLKDQEVAQGNISAVRRVQLWFYSGVNHSPLGPTGSSIRALSGEPRNFGEAIQEIFKGRADVSEARFRRKEEARSSSSGSVC
VRPVQARQPSCHALFSPSTTTYSTPLQLIVTDLWGPSYKLSTHGFRYYNSFVDAFSRYTWIYFLQSKSEAFQAFVKFKTLVEKQFGKPLISLQTDEGAEFKPFIPFLHNH
GINHRVSCPYTSQQNGIIERKHRHIVDVGLTLLSHSSMPLTFWDDAFSTSVFLINRLPSMVLGGLSPLEKLFQKQPDYFSLKVFDCNCFPCLRPYNSHKLSFRSSPCTFI
GYSHIHKGYKCLSTDGRVYISRHVLFDEHVFPFASVQNKSFASPAFVIQNLPIISKPCTSSCESVGNISLPVLNNSPSSPDTDLPHNDTNGSKDSSSSCPNNSISQPITL
PVTNINNQSSQLPVSSVLSTHPMVTRSKRGIFKPKAHLPTFVDIEPPNVKETLKCSQWKNAMQEEYDALIKNETWDLVPTPLNQKAPCAWFERLSLLLNSLGFINSKADP
SLLFRCSGTYCCYILVYVDDINITGNSLADISSLITELDGKFSLKDLGSLYYFLGIKVSYPLSGGLFLSQSKYIMDVLHRTKMVGANLIATPMVSGPLLSAHQGELFHDV
HLYRSVVVAPRAQDLESEFGTSRPRHSPATWPRHHTRLKPLLRLKTVPTNQHGGTYNYGSMLHKSSGFSLHGFADVDWASDPDDRKSTSGFCVFFGSNLIMWGSKKQSII
SRSNTEAEYRCLALAATELVQHLPASDQIVDVLTKPLSVVSFLKLRSKLNVREPSSIGLRGGGVNIKAAH