; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0008934 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0008934
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr9:32708017..32711370
RNA-Seq ExpressionLag0008934
SyntenyLag0008934
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU19483.1 hypothetical protein TSUD_77270 [Trifolium subterraneum]3.5e-20439.56Show/hide
Query:  AVAASSSATNIISSSLGHPLSTVLTVKLDDKNYLLWKDMVLAILRGQKVDMYVLGTKAQPSELIETTTESGKMLLSNMLYEEWMTVDQALSGWLFGSMSP
        A AA S+  N + SS        ++VKLD  NY LWK +VL ++RG K+D Y+LGT+  P E I T+++S K    N  + EW   DQ L GW+  SM+ 
Subjt:  AVAASSSATNIISSSLGHPLSTVLTVKLDDKNYLLWKDMVLAILRGQKVDMYVLGTKAQPSELIETTTESGKMLLSNMLYEEWMTVDQALSGWLFGSMSP

Query:  AIAADVINFKTLREVWKALEEVYGDTSKACVNQFRGILQNTKKGSMKMIDYLAIMKQASENLKLVGNPVSLDDLVSYVLAGLDSEYIPIVCAIDDKDLKT
         IA  +++ +T +++W   + + G  +++ +   +    + +KG MKM DYL  MK   + LKL GNPVS  DL+   L GLDSEY P+V  + D+   +
Subjt:  AIAADVINFKTLREVWKALEEVYGDTSKACVNQFRGILQNTKKGSMKMIDYLAIMKQASENLKLVGNPVSLDDLVSYVLAGLDSEYIPIVCAIDDKDLKT

Query:  WQELSSILINFEGTLARYSTPTNAHFDLPDLATHLALNRQSMFDNQRQFNPSNGNRGNDNNSGSYYGSGNGQMGNNPTQGTAMVEIEEEEVEGATIFREE
        W +L + L+ FE  + + +  TN   +    AT    NR     +  +   SN N    N+ G   G G G+ G NP Q                +    
Subjt:  WQELSSILINFEGTLARYSTPTNAHFDLPDLATHLALNRQSMFDNQRQFNPSNGNRGNDNNSGSYYGSGNGQMGNNPTQGTAMVEIEEEEVEGATIFREE

Query:  ITTCYMRFEEHFNNLHASGNGSVQGNNSNNASSSAYIATPEILHDPKWLADSGATNHVT--------------------ANARNLAV------KMDYNVL
           C+ RF++ ++  + S     QG      S +A++A+   + D  W  DSGA+NHVT                     N   LA+      K+    L
Subjt:  ITTCYMRFEEHFNNLHASGNGSVQGNNSNNASSSAYIATPEILHDPKWLADSGATNHVT--------------------ANARNLAV------KMDYNVL

Query:  NNILHVPEIRKNLISIASLTVDNNVVVEFHSNYCVVKDKASKKVMLHEILRNDLYQIELPSIQTPKSEIRSTSFAGLWHNRLGHASSKVIKSVLKSCNVS
        ++IL+VP I KNL+S++ L  DNN++VEF  N C VKDK + KV+L  +L++ LYQ+      T ++     S    WH RLGH ++KV+  VL+SC V 
Subjt:  NNILHVPEIRKNLISIASLTVDNNVVVEFHSNYCVVKDKASKKVMLHEILRNDLYQIELPSIQTPKSEIRSTSFAGLWHNRLGHASSKVIKSVLKSCNVS

Query:  TFLNESLHFCDACQKGKSHRLPFSRSVSHTWQPLELVHCDLWGPSPIVSIVGYKYYISFVDDFTRLAHIYPLKTKGEAFSSFSQYKLLVANRFEKKIKTL
           +++  FC+ACQ GK H LPF  S SH  +PLELVH D+WGP+PI++  G+KYY+ FVDDF+R   IYPLK K E   +F Q+K L  N+F K+IK +
Subjt:  TFLNESLHFCDACQKGKSHRLPFSRSVSHTWQPLELVHCDLWGPSPIVSIVGYKYYISFVDDFTRLAHIYPLKTKGEAFSSFSQYKLLVANRFEKKIKTL

Query:  QTDWGGEFRSFTSFLRDNGIEFRHSCPHTSQQNGIVERKHRHIVEMGLTLLAQASMPLTYWWEAFSSAVYIINCLPTPILGDVSPWEQAFH---------
        Q D GGE++       + GI+FR SCP+TSQQNG  ERKHRHI E GLTLLAQA MPL YWWEAFS+AVY+IN LP+ +  + SP+              
Subjt:  QTDWGGEFRSFTSFLRDNGIEFRHSCPHTSQQNGIVERKHRHIVEMGLTLLAQASMPLTYWWEAFSSAVYIINCLPTPILGDVSPWEQAFH---------

Query:  ------YPCLRLYQSHKFQHHSTKCVFLGYSLAHKGYKCLSSSGRLFISCHVVFNESEFPFKSELVPSSGPSNIAL---------------VPHHSSPVA
              YPCL+ Y  HK Q+H+T+CVFLGYS +HKGYKCL+S GR+FIS HV+FNE  FPF    + +  P    +               +   S P+ 
Subjt:  ------YPCLRLYQSHKFQHHSTKCVFLGYSLAHKGYKCLSSSGRLFISCHVVFNESEFPFKSELVPSSGPSNIAL---------------VPHHSSPVA

Query:  EFQTSSPTLSTPPQDQYVSPQLSAHSPEGHSCPASVMPLYTSSMLPVDGSSDGSPELPLQISTALNSHPMQTRAKSGIFKQKDWGAFLVNSSFSPEVEPT
        E +  + T +   QD      +++ + + ++ P+     +  ++      S G             SH + TR+KSGI K K      +  ++   +EP 
Subjt:  EFQTSSPTLSTPPQDQYVSPQLSAHSPEGHSCPASVMPLYTSSMLPVDGSSDGSPELPLQISTALNSHPMQTRAKSGIFKQKDWGAFLVNSSFSPEVEPT

Query:  SVKEALKSSQWKAAMNDEIAVLTRNKTWTLVPLLPNLNLIGSKWIFKVKRKSNSSFDRCKARLVAQGFNQVPGVDFHETFSPVVKAPTIQIILAVAVMKN
        + KEAL    WK AM  E   L  NKTW LVP     N++ SKW+FK K K + S +R KARLVA+GF Q  G+D+ ETFSPV+KA T++IIL++AV  N
Subjt:  SVKEALKSSQWKAAMNDEIAVLTRNKTWTLVPLLPNLNLIGSKWIFKVKRKSNSSFDRCKARLVAQGFNQVPGVDFHETFSPVVKAPTIQIILAVAVMKN

Query:  WSIRQLDVNNIFLNGRLQEAVYMRQLTGYIDQSCPDYVCKLDKALYGLRQASRA
        W +RQLD+NN FLNG L+E V+M Q  G++D + P+++CKL KA+YGL+QA RA
Subjt:  WSIRQLDVNNIFLNGRLQEAVYMRQLTGYIDQSCPDYVCKLDKALYGLRQASRA

GAU51268.1 hypothetical protein TSUD_412550 [Trifolium subterraneum]5.5e-20539.81Show/hide
Query:  SSATNIISSSLGHPLSTVLTVKLDDKNYLLWKDMVLAILRGQKVDMYVLGTKAQPSELIETTTESGKMLLSNMLYEEWMTVDQALSGWLFGSMSPAIAAD
        SSA N   S   + L ++++VKLD  NY LWK +VL+++RG K+D Y+LGT   P + + +  +S K+   N  + +W+  DQAL GWL  SM+  IA  
Subjt:  SSATNIISSSLGHPLSTVLTVKLDDKNYLLWKDMVLAILRGQKVDMYVLGTKAQPSELIETTTESGKMLLSNMLYEEWMTVDQALSGWLFGSMSPAIAAD

Query:  VINFKTLREVWKALEEVYGDTSKACVNQFRGILQNTKKGSMKMIDYLAIMKQASENLKLVGNPVSLDDLVSYVLAGLDSEYIPIVCAIDDKDLKTWQELS
        +++ +T +++W   + + G  +K+ +   +    NT+KG MKM +YL  MK  S+ LKL G+P+S  DL+   L GLD+EY P+V  + D+   +W ++ 
Subjt:  VINFKTLREVWKALEEVYGDTSKACVNQFRGILQNTKKGSMKMIDYLAIMKQASENLKLVGNPVSLDDLVSYVLAGLDSEYIPIVCAIDDKDLKTWQELS

Query:  SILINFEGTLARYSTPTNAHFDLPDLATHLALNRQSMFDNQRQFN----PSNGNRGNDNNSGSYYGSGNGQMGNNPTQ---GTAMVEIEEEEVEGATIFR
        + L+ FE  L +++             + L LN  + F N+ +F      S GN    N  G   G G G+M N   Q   GT  + ++           
Subjt:  SILINFEGTLARYSTPTNAHFDLPDLATHLALNRQSMFDNQRQFN----PSNGNRGNDNNSGSYYGSGNGQMGNNPTQ---GTAMVEIEEEEVEGATIFR

Query:  EEITTCYMRFEEHFNNLHASGNGSVQGNNSNNASSSAYIATPEILHDPKWLADSGATNHVT--------------------ANARNLAV------KMDYN
             C  RF+  +   + S     QG      S SA+IA+P    D +W  DSGA NHVT                     N   L +      K++  
Subjt:  EEITTCYMRFEEHFNNLHASGNGSVQGNNSNNASSSAYIATPEILHDPKWLADSGATNHVT--------------------ANARNLAV------KMDYN

Query:  VLNNILHVPEIRKNLISIASLTVDNNVVVEFHSNYCVVKDKASKKVMLHEILRNDLYQIELPSIQTPKSEIRSTSFAGLWHNRLGHASSKVIKSVLKSCN
         L+++L+VP+I KNL+S++ LT DNN++VEF +N C VKDK + + +L   L++ LYQ+      + K      S    WH +LGH ++KV+  VLK CN
Subjt:  VLNNILHVPEIRKNLISIASLTVDNNVVVEFHSNYCVVKDKASKKVMLHEILRNDLYQIELPSIQTPKSEIRSTSFAGLWHNRLGHASSKVIKSVLKSCN

Query:  VSTFLNESLHFCDACQKGKSHRLPFSRSVSHTWQPLELVHCDLWGPSPIVSIVGYKYYISFVDDFTRLAHIYPLKTKGEAFSSFSQYKLLVANRFEKKIK
        V    ++   FC+ACQ GK H LPF  S SH  +PL L+H D+WGP+PI+S  G+KYY+ F+DDF+R   I+PLK K +   +F Q+K L  N+F KKIK
Subjt:  VSTFLNESLHFCDACQKGKSHRLPFSRSVSHTWQPLELVHCDLWGPSPIVSIVGYKYYISFVDDFTRLAHIYPLKTKGEAFSSFSQYKLLVANRFEKKIK

Query:  TLQTDWGGEFRSFTSFLRDNGIEFRHSCPHTSQQNGIVERKHRHIVEMGLTLLAQASMPLTYWWEAFSSAVYIINCLPTPILGDVSPWEQAFH-------
         +Q D GGE+++      + GI+FR SCP+TSQQNG  ERKHRH+ E+GLTLLAQA MPL YWWEAFS+AVY+IN LP+ +  + SP+   F        
Subjt:  TLQTDWGGEFRSFTSFLRDNGIEFRHSCPHTSQQNGIVERKHRHIVEMGLTLLAQASMPLTYWWEAFSSAVYIINCLPTPILGDVSPWEQAFH-------

Query:  --------YPCLRLYQSHKFQHHSTKCVFLGYSLAHKGYKCLSSSGRLFISCHVVFNESEFPFKSELVPSSGP------SNIALVPHHSSPVAEFQTSSP
                YPCL+ Y  HK Q H+T+CVF+GYS +HKGYKC++S GR+F+S HV+FNE+ FPF    + +  P      ++  L+P  S+         P
Subjt:  --------YPCLRLYQSHKFQHHSTKCVFLGYSLAHKGYKCLSSSGRLFISCHVVFNESEFPFKSELVPSSGP------SNIALVPHHSSPVAEFQTSSP

Query:  TLSTPPQDQYVSPQLSAHSPEGHSCPASVMPLYTSSMLPVDGSSDGSPELPLQISTAL-------------NSHPMQTRAKSGIFKQKDWGAFLVNSSFS
          +T       S + S ++       +S   + T++    D  +D S +   + ++ +             N+H M+TR+K GI K K     +  +  S
Subjt:  TLSTPPQDQYVSPQLSAHSPEGHSCPASVMPLYTSSMLPVDGSSDGSPELPLQISTAL-------------NSHPMQTRAKSGIFKQKDWGAFLVNSSFS

Query:  PEVEPTSVKEALKSSQWKAAMNDEIAVLTRNKTWTLVPLLPNLNLIGSKWIFKVKRKSNSSFDRCKARLVAQGFNQVPGVDFHETFSPVVKAPTIQIILA
         E EP SVKEAL    WK AM+ E   L  N TWTLVP     N+I SKWIFK K KS+ S +R KARLVA+GF Q  G+DF ETFSPVVK+ T++IIL 
Subjt:  PEVEPTSVKEALKSSQWKAAMNDEIAVLTRNKTWTLVPLLPNLNLIGSKWIFKVKRKSNSSFDRCKARLVAQGFNQVPGVDFHETFSPVVKAPTIQIILA

Query:  VAVMKNWSIRQLDVNNIFLNGRLQEAVYMRQLTGYIDQSCPDYVCKLDKALYGLRQASRA
        +AV  NW +RQLD+NN FLNG+L+E V+M Q  GYID + P+++CKL KA+YGL+QA RA
Subjt:  VAVMKNWSIRQLDVNNIFLNGRLQEAVYMRQLTGYIDQSCPDYVCKLDKALYGLRQASRA

KYP50444.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]1.4e-18940.02Show/hide
Query:  MSPAIAADVINFKTLREVWKALEEVYGDTSKACVNQFRGILQNTKKGSMKMIDYLAIMKQASENLKLVGNPVSLDDLVSYVLAGLDSEYIPIVCAIDDKD
        M+  +A  +++ +T +++W+  + + G  +++ +   +     T+KG +KM +YL  MK+ +++L L G+ VS  DLV+  LAGLD+EY PIV  + DK+
Subjt:  MSPAIAADVINFKTLREVWKALEEVYGDTSKACVNQFRGILQNTKKGSMKMIDYLAIMKQASENLKLVGNPVSLDDLVSYVLAGLDSEYIPIVCAIDDKD

Query:  LKTWQELSSILINFEGTLARYSTPTNAHFDLPDLATHLALNRQSMFDNQR-QFNPSNGNRGNDNNSGSYYGSGNGQMGNNPTQGTAMVEIEEEEVEGATI
          TW E+ + L+ +E  L + +  +N       L  + + N  ++  N+R + N   G RG   N G+  G G G+     T+   + ++  +    A  
Subjt:  LKTWQELSSILINFEGTLARYSTPTNAHFDLPDLATHLALNRQSMFDNQR-QFNPSNGNRGNDNNSGSYYGSGNGQMGNNPTQGTAMVEIEEEEVEGATI

Query:  FREEITTCYMRFEEHFNNLHASGNGSVQGNNSNNASSSAYIATPEILHDPKWLADSGATNHVT--------------------ANARNLAV------KMD
             + CY RF +++   ++    S + +   N + +AY+A+P  + D  W  DSGA+NHVT                     N  NL +       +D
Subjt:  FREEITTCYMRFEEHFNNLHASGNGSVQGNNSNNASSSAYIATPEILHDPKWLADSGATNHVT--------------------ANARNLAV------KMD

Query:  YNV----LNNILHVPEIRKNLISIASLTVDNNVVVEFHSNYCVVKDKASKKVMLHEILRNDLYQIELPSIQTPKSEIRSTSFAGLWHNRLGHASSKVIKS
               L +IL+VP+I KNL+SI+ LT DN++ VEFH   C VKDK + +++L   +++ LYQ+   S  T K      S    WH +LGH +SKV+  
Subjt:  YNV----LNNILHVPEIRKNLISIASLTVDNNVVVEFHSNYCVVKDKASKKVMLHEILRNDLYQIELPSIQTPKSEIRSTSFAGLWHNRLGHASSKVIKS

Query:  VLKSCNVSTFLNESLHFCDACQKGKSHRLPFSRSVSHTWQPLELVHCDLWGPSPIVSIVGYKYYISFVDDFTRLAHIYPLKTKGEAFSSFSQYKLLVANR
        V+K CN+     E+  FC+ACQ GK+H LPF  SVS   +PL+LVH D+WGP+PI S+ G+KYY+ F+DD++R   IYPLK K + F +F Q++ LV N+
Subjt:  VLKSCNVSTFLNESLHFCDACQKGKSHRLPFSRSVSHTWQPLELVHCDLWGPSPIVSIVGYKYYISFVDDFTRLAHIYPLKTKGEAFSSFSQYKLLVANR

Query:  FEKKIKTLQTDWGGEFRSFTSFLRDNGIEFRHSCPHTSQQNGIVERKHRHIVEMGLTLLAQASMPLTYWWEAFSSAVYIINCLPTPILGDVSPWEQAFH-
        F K+IKTLQ D GGEF+S +  L   GI+ R SCP+TS QNG  ERKHRH+VE GLTLLAQA MPL YWWEAFS+AV++IN LPT ++ + SP++Q F  
Subjt:  FEKKIKTLQTDWGGEFRSFTSFLRDNGIEFRHSCPHTSQQNGIVERKHRHIVEMGLTLLAQASMPLTYWWEAFSSAVYIINCLPTPILGDVSPWEQAFH-

Query:  --------------YPCLRLYQSHKFQHHSTKCVFLGYSLAHKGYKCLSSSGRLFISCHVVFNESEFPFKSELVPSSGPSNIALVPHHSSPVAEFQTSSP
                      YPCL+ Y  HK Q H+TKCVFLGYS +HKGYKCL+S+GR+FIS HVVFNE  FPF    + +  P+ I   P  +S +     +  
Subjt:  --------------YPCLRLYQSHKFQHHSTKCVFLGYSLAHKGYKCLSSSGRLFISCHVVFNESEFPFKSELVPSSGPSNIALVPHHSSPVAEFQTSSP

Query:  TLSTPPQDQYVSPQLSAHSPEGHSCPASVMPLYTSSMLPVDGSSDGSPELPLQISTALNSHPMQTRAKSGIFKQKDWGAFLVNSSFSPEVEPTSVKEALK
         ++   Q  + +   S+++   H    +       + +  +  ++   E  ++   ++N H M TR+K GI K K      V  +   E EP +  EAL+
Subjt:  TLSTPPQDQYVSPQLSAHSPEGHSCPASVMPLYTSSMLPVDGSSDGSPELPLQISTALNSHPMQTRAKSGIFKQKDWGAFLVNSSFSPEVEPTSVKEALK

Query:  SSQWKAAMNDEIAVLTRNKTWTLVPLLPNLNLIGSKWIFKVKRKSNSSFDRCKARLVAQGFNQVPGVDFHETFSPVVKAPTIQIILAVAVMKNWSIRQLD
        + +WK AM  E   L  NKTWTLVP     N+I  KW+FK K K++ + +R KARLVA+GF Q  G+D+ ETFSPV+KA T++IIL++AV  NW IRQ+D
Subjt:  SSQWKAAMNDEIAVLTRNKTWTLVPLLPNLNLIGSKWIFKVKRKSNSSFDRCKARLVAQGFNQVPGVDFHETFSPVVKAPTIQIILAVAVMKNWSIRQLD

Query:  VNNIFLNGRLQEAVYMRQLTGYIDQSCPDYVCKLDKALYGLRQASRA
        +NN FLNG L+E V+MRQ  G++D+S P ++CKL KA+YGL+QA R+
Subjt:  VNNIFLNGRLQEAVYMRQLTGYIDQSCPDYVCKLDKALYGLRQASRA

PNX94503.1 putative retrotransposon Ty1-copia subclass protein, partial [Trifolium pratense]6.3e-20139.79Show/hide
Query:  LSTVLTVKLDDKNYLLWKDMVLAILRGQKVDMYVLGTKAQPSELIETTTESGKMLLSNMLYEEWMTVDQALSGWLFGSMSPAIAADVINFKTLREVWKAL
        L + ++VKLD  N+ LWK +VL ++RG K D Y+LGTK  P + + +   + K+   N  Y++W   DQAL GWL  SM+  IA  V++ +T +++W   
Subjt:  LSTVLTVKLDDKNYLLWKDMVLAILRGQKVDMYVLGTKAQPSELIETTTESGKMLLSNMLYEEWMTVDQALSGWLFGSMSPAIAADVINFKTLREVWKAL

Query:  EEVYGDTSKACVNQFRGILQNTKKGSMKMIDYLAIMKQASENLKLVGNPVSLDDLVSYVLAGLDSEYIPIVCAIDDKDLKTWQELSSILINFEGTLARYS
        + + G  +++ +   +    NT K  MKM  YLA MK  ++ LKL G+P+S  DL+   L GLDSEY P+V  + D+   +W +  + L+ FE  L + +
Subjt:  EEVYGDTSKACVNQFRGILQNTKKGSMKMIDYLAIMKQASENLKLVGNPVSLDDLVSYVLAGLDSEYIPIVCAIDDKDLKTWQELSSILINFEGTLARYS

Query:  TPTNAHFDLPDLATHLALNRQSMFDNQRQFNPSNGNRGNDNNSGSYYGSGNGQMGNNPTQGTAMVEIEEEEVEGATIFREEITTCYMRFEEHF--NNLHA
           N +    + + + A   +S      +F    G RG+ N+ G   G G  +M   P              +    F      CY RF++ +   N +A
Subjt:  TPTNAHFDLPDLATHLALNRQSMFDNQRQFNPSNGNRGNDNNSGSYYGSGNGQMGNNPTQGTAMVEIEEEEVEGATIFREEITTCYMRFEEHF--NNLHA

Query:  SGNGSVQGNNSNNASSSAYIATPEILHDPKWLADSGATNHVTANARNL--------------------------AVKMDYNVLNNILHVPEIRKNLISIA
         G G          S SA++A+P    D +W  DSGA+NHVT  +  L                          + K++   L N+L+VPEI KNL+S++
Subjt:  SGNGSVQGNNSNNASSSAYIATPEILHDPKWLADSGATNHVTANARNL--------------------------AVKMDYNVLNNILHVPEIRKNLISIA

Query:  SLTVDNNVVVEFHSNYCVVKDKASKKVMLHEILRNDLYQI----ELPSIQTPKSEIRSTSFAGLWHNRLGHASSKVIKSVLKSCNVSTFLNESLHFCDAC
         LT+DNN +VEF  NYC VKDK + K +L   L++ LYQ+    E P+ + P + I   S   +WH +LGH ++KV++ VLK  NV    ++   FC+AC
Subjt:  SLTVDNNVVVEFHSNYCVVKDKASKKVMLHEILRNDLYQI----ELPSIQTPKSEIRSTSFAGLWHNRLGHASSKVIKSVLKSCNVSTFLNESLHFCDAC

Query:  QKGKSHRLPFSRSVSHTWQPLELVHCDLWGPSPIVSIVGYKYYISFVDDFTRLAHIYPLKTKGEAFSSFSQYKLLVANRFEKKIKTLQTDWGGEFRSFTS
        Q GK H LPF  S SH  +PL+L+H D+WGP+PI+S   +KYY+ F+DDF+R   I+PLK K E   +F+Q+K LV N+F KKIK ++ D GGE++    
Subjt:  QKGKSHRLPFSRSVSHTWQPLELVHCDLWGPSPIVSIVGYKYYISFVDDFTRLAHIYPLKTKGEAFSSFSQYKLLVANRFEKKIKTLQTDWGGEFRSFTS

Query:  FLRDNGIEFRHSCPHTSQQNGIVERKHRHIVEMGLTLLAQASMPLTYWWEAFSSAVYIINCLPTPILGDVSPWEQAFH---------------YPCLRLY
           D+GI+F+ SCP+TSQQNG  ERKHRH+ E+GLTLLAQA MPL+YWWEAFS+AVY+IN LP+ +  + SP+   F                YPCL+ Y
Subjt:  FLRDNGIEFRHSCPHTSQQNGIVERKHRHIVEMGLTLLAQASMPLTYWWEAFSSAVYIINCLPTPILGDVSPWEQAFH---------------YPCLRLY

Query:  QSHKFQHHSTKCVFLGYSLAHKGYKCLSSSGRLFISCHVVFNESEFPFKSELVPSSGPSNIAL------VPHHSSPVAEFQTSSPTLSTPPQ--------
          HK Q H+T+CVFLGYS +HKGYKC++S GR+F+S HVVFNE+ FPF+   + +  P  +         P   + +    T+  T +   Q        
Subjt:  QSHKFQHHSTKCVFLGYSLAHKGYKCLSSSGRLFISCHVVFNESEFPFKSELVPSSGPSNIAL------VPHHSSPVAEFQTSSPTLSTPPQ--------

Query:  ----DQYVSPQLSAHSPEGH----SCPASVMPLYTSSMLPVDGSSDGSPELPLQISTALNSHPMQTRAKSGIFKQKDWGAFLVNSSFSPEVEPTSVKEAL
            DQ V      H+ E +        S       SM  +      +   P Q  T  N+H M+TR+K+G++K K     L   +   + EP SV EAL
Subjt:  ----DQYVSPQLSAHSPEGH----SCPASVMPLYTSSMLPVDGSSDGSPELPLQISTALNSHPMQTRAKSGIFKQKDWGAFLVNSSFSPEVEPTSVKEAL

Query:  KSSQWKAAMNDEIAVLTRNKTWTLVPLLPNLNLIGSKWIFKVKRKSNSSFDRCKARLVAQGFNQVPGVDFHETFSPVVKAPTIQIILAVAVMKNWSIRQL
           +W  AM+ E   L  NKTWTLVP     N+I SKWIFK K K++ + +R KARLVA+GF Q  GVD+ ETFSPVVK+ T++IIL++AV  +W +RQL
Subjt:  KSSQWKAAMNDEIAVLTRNKTWTLVPLLPNLNLIGSKWIFKVKRKSNSSFDRCKARLVAQGFNQVPGVDFHETFSPVVKAPTIQIILAVAVMKNWSIRQL

Query:  DVNNIFLNGRLQEAVYMRQLTGYIDQSCPDYVCKLDKALYGLRQASRA
        D+NN FLNG L+E+V+M Q  GYIDQ+ P ++C+L+KA+YGL+QA RA
Subjt:  DVNNIFLNGRLQEAVYMRQLTGYIDQSCPDYVCKLDKALYGLRQASRA

PNY01489.1 copia-like polyprotein, partial [Trifolium pratense]1.1e-19239.58Show/hide
Query:  SSATNIISSSLGHPLSTVLTVKLDDKNYLLWKDMVLAILRGQKVDMYVLGTKAQPSELIETTTESGKMLLSNMLYEEWMTVDQALSGWLFGSMSPAIAAD
        SSA N   S+  + L ++++VKLD  NY LWK +VL ++RG K D Y+LGTK  P + + +  +S K+   N  +++WM  DQAL GWL  SM+  IA  
Subjt:  SSATNIISSSLGHPLSTVLTVKLDDKNYLLWKDMVLAILRGQKVDMYVLGTKAQPSELIETTTESGKMLLSNMLYEEWMTVDQALSGWLFGSMSPAIAAD

Query:  VINFKTLREVWKALEEVYGDTSKACVNQFRGILQNTKKGSMKMIDYLAIMKQASENLKLVGNPVSLDDLVSYVLAGLDSEYIPIVCAIDDKDLKTWQELS
        +++ +T +++W   + + G  +K+ +   +    NT+KG MKM +YL  MK  S+ LKL G+P+S  DL+   L GLD+EY P+V  + D+   +W ++ 
Subjt:  VINFKTLREVWKALEEVYGDTSKACVNQFRGILQNTKKGSMKMIDYLAIMKQASENLKLVGNPVSLDDLVSYVLAGLDSEYIPIVCAIDDKDLKTWQELS

Query:  SILINFEGTLARYSTPTNAHFDLPDLATHLALNRQSMFDNQRQFN----PSNGNRGNDNNSGSYYGSGNGQMGNNPTQ---GTAMVEIEEEEVEGATIFR
        + L+ FE  L           D  +  + L LN  + F N+ +F      S GN    N  G   G G G+M N   Q   GT    ++           
Subjt:  SILINFEGTLARYSTPTNAHFDLPDLATHLALNRQSMFDNQRQFN----PSNGNRGNDNNSGSYYGSGNGQMGNNPTQ---GTAMVEIEEEEVEGATIFR

Query:  EEITTCYMRFEEHFNNLHASGNGSVQGNNSNNASSSAYIATPEILHDPKWLADSGATNHVT--------------------ANARNLAV------KMDYN
             C  RF+  +   + S     QG      S SA++A+P    D +W  DSGA+NHVT                     N   L +      K++  
Subjt:  EEITTCYMRFEEHFNNLHASGNGSVQGNNSNNASSSAYIATPEILHDPKWLADSGATNHVT--------------------ANARNLAV------KMDYN

Query:  VLNNILHVPEIRKNLISIASLTVDNNVVVEFHSNYCVVKDKASKKVMLHEILRNDLYQIELPSIQTPKSEIRSTSFAGLWHNRLGHASSKVIKSVLKSCN
         L+++L+VP+I KNL+S++ LT DNN+ VEF +N C VKDK + + +L   L++ LYQ+   S Q+ K      S    WH +LGH ++KV++ VLK CN
Subjt:  VLNNILHVPEIRKNLISIASLTVDNNVVVEFHSNYCVVKDKASKKVMLHEILRNDLYQIELPSIQTPKSEIRSTSFAGLWHNRLGHASSKVIKSVLKSCN

Query:  VSTFLNESLHFCDACQKGKSHRLPFSRSVSHTWQPLELVHCDLWGPSPIVSIVGYKYYISFVDDFTRLAHIYPLKTKGEAFSSFSQYKLLVANRFEKKIK
        V    ++   FC+ACQ GK H LPF  S SH  +PL L+H D+WGP+PI+S  G+KYY+ F+DDF+R   I+PLK K +   +F Q+K L  N+F KKIK
Subjt:  VSTFLNESLHFCDACQKGKSHRLPFSRSVSHTWQPLELVHCDLWGPSPIVSIVGYKYYISFVDDFTRLAHIYPLKTKGEAFSSFSQYKLLVANRFEKKIK

Query:  TLQTDWGGEFRSFTSFLRDNGIEFRHSCPHTSQQNGIVERKHRHIVEMGLTLLAQASMPLTYWWEAFSSAVYIINCLPTPILGDVSPWEQAFH-------
         +Q D GGE+++      + GI+FR SCP+TSQQNG  ERKHRH+VE+GLTLLAQA MPL YWWEAFS+AVY+IN L + +  + SP+   F        
Subjt:  TLQTDWGGEFRSFTSFLRDNGIEFRHSCPHTSQQNGIVERKHRHIVEMGLTLLAQASMPLTYWWEAFSSAVYIINCLPTPILGDVSPWEQAFH-------

Query:  --------YPCLRLYQSHKFQHHSTKCVFLGYSLAHKGYKCLSSSGRLFISCHVVFNESEFPFKSELVPSSGPSNIALVPHHSSPVAEFQTSSPTLSTPP
                YPCL+ Y  HK Q H+T+CVF+GYS +HKG     + G    S + + N+ +     +   S+  S+     H  +  +   T++ +     
Subjt:  --------YPCLRLYQSHKFQHHSTKCVFLGYSLAHKGYKCLSSSGRLFISCHVVFNESEFPFKSELVPSSGPSNIALVPHHSSPVAEFQTSSPTLSTPP

Query:  QDQYVSPQLSAHSPEGHSCPASVMPLYTSSMLPVDGSSDGSPELPLQISTALNSHPMQTRAKSGIFKQKDWGAFLVNSSFSPEVEPTSVKEALKSSQWKA
         D +V  +   +SP                   + G+S        Q +T  N+H ++TR+K+GI K K     +  +  S E EP SVKEAL    WK 
Subjt:  QDQYVSPQLSAHSPEGHSCPASVMPLYTSSMLPVDGSSDGSPELPLQISTALNSHPMQTRAKSGIFKQKDWGAFLVNSSFSPEVEPTSVKEALKSSQWKA

Query:  AMNDEIAVLTRNKTWTLVPLLPNLNLIGSKWIFKVKRKSNSSFDRCKARLVAQGFNQVPGVDFHETFSPVVKAPTIQIILAVAVMKNWSIRQLDVNNIFL
        AM+ E   L  N TWTLVP     N+I SKWIFK K KS+ S +R KARLVA+GF Q  G+DFHETFSPVVK+ T++IIL +AV  NW +RQLD+NN FL
Subjt:  AMNDEIAVLTRNKTWTLVPLLPNLNLIGSKWIFKVKRKSNSSFDRCKARLVAQGFNQVPGVDFHETFSPVVKAPTIQIILAVAVMKNWSIRQLDVNNIFL

Query:  NGRLQEAVYMRQLTGYIDQSCPDYVCKLDKALYGLRQASRA
        NG+L+E V+M Q  GYID + P+++CKL KA+YGL+QA RA
Subjt:  NGRLQEAVYMRQLTGYIDQSCPDYVCKLDKALYGLRQASRA

TrEMBL top hitse value%identityAlignment
A0A2K3MUJ9 Putative retrotransposon Ty1-copia subclass protein (Fragment)3.0e-20139.79Show/hide
Query:  LSTVLTVKLDDKNYLLWKDMVLAILRGQKVDMYVLGTKAQPSELIETTTESGKMLLSNMLYEEWMTVDQALSGWLFGSMSPAIAADVINFKTLREVWKAL
        L + ++VKLD  N+ LWK +VL ++RG K D Y+LGTK  P + + +   + K+   N  Y++W   DQAL GWL  SM+  IA  V++ +T +++W   
Subjt:  LSTVLTVKLDDKNYLLWKDMVLAILRGQKVDMYVLGTKAQPSELIETTTESGKMLLSNMLYEEWMTVDQALSGWLFGSMSPAIAADVINFKTLREVWKAL

Query:  EEVYGDTSKACVNQFRGILQNTKKGSMKMIDYLAIMKQASENLKLVGNPVSLDDLVSYVLAGLDSEYIPIVCAIDDKDLKTWQELSSILINFEGTLARYS
        + + G  +++ +   +    NT K  MKM  YLA MK  ++ LKL G+P+S  DL+   L GLDSEY P+V  + D+   +W +  + L+ FE  L + +
Subjt:  EEVYGDTSKACVNQFRGILQNTKKGSMKMIDYLAIMKQASENLKLVGNPVSLDDLVSYVLAGLDSEYIPIVCAIDDKDLKTWQELSSILINFEGTLARYS

Query:  TPTNAHFDLPDLATHLALNRQSMFDNQRQFNPSNGNRGNDNNSGSYYGSGNGQMGNNPTQGTAMVEIEEEEVEGATIFREEITTCYMRFEEHF--NNLHA
           N +    + + + A   +S      +F    G RG+ N+ G   G G  +M   P              +    F      CY RF++ +   N +A
Subjt:  TPTNAHFDLPDLATHLALNRQSMFDNQRQFNPSNGNRGNDNNSGSYYGSGNGQMGNNPTQGTAMVEIEEEEVEGATIFREEITTCYMRFEEHF--NNLHA

Query:  SGNGSVQGNNSNNASSSAYIATPEILHDPKWLADSGATNHVTANARNL--------------------------AVKMDYNVLNNILHVPEIRKNLISIA
         G G          S SA++A+P    D +W  DSGA+NHVT  +  L                          + K++   L N+L+VPEI KNL+S++
Subjt:  SGNGSVQGNNSNNASSSAYIATPEILHDPKWLADSGATNHVTANARNL--------------------------AVKMDYNVLNNILHVPEIRKNLISIA

Query:  SLTVDNNVVVEFHSNYCVVKDKASKKVMLHEILRNDLYQI----ELPSIQTPKSEIRSTSFAGLWHNRLGHASSKVIKSVLKSCNVSTFLNESLHFCDAC
         LT+DNN +VEF  NYC VKDK + K +L   L++ LYQ+    E P+ + P + I   S   +WH +LGH ++KV++ VLK  NV    ++   FC+AC
Subjt:  SLTVDNNVVVEFHSNYCVVKDKASKKVMLHEILRNDLYQI----ELPSIQTPKSEIRSTSFAGLWHNRLGHASSKVIKSVLKSCNVSTFLNESLHFCDAC

Query:  QKGKSHRLPFSRSVSHTWQPLELVHCDLWGPSPIVSIVGYKYYISFVDDFTRLAHIYPLKTKGEAFSSFSQYKLLVANRFEKKIKTLQTDWGGEFRSFTS
        Q GK H LPF  S SH  +PL+L+H D+WGP+PI+S   +KYY+ F+DDF+R   I+PLK K E   +F+Q+K LV N+F KKIK ++ D GGE++    
Subjt:  QKGKSHRLPFSRSVSHTWQPLELVHCDLWGPSPIVSIVGYKYYISFVDDFTRLAHIYPLKTKGEAFSSFSQYKLLVANRFEKKIKTLQTDWGGEFRSFTS

Query:  FLRDNGIEFRHSCPHTSQQNGIVERKHRHIVEMGLTLLAQASMPLTYWWEAFSSAVYIINCLPTPILGDVSPWEQAFH---------------YPCLRLY
           D+GI+F+ SCP+TSQQNG  ERKHRH+ E+GLTLLAQA MPL+YWWEAFS+AVY+IN LP+ +  + SP+   F                YPCL+ Y
Subjt:  FLRDNGIEFRHSCPHTSQQNGIVERKHRHIVEMGLTLLAQASMPLTYWWEAFSSAVYIINCLPTPILGDVSPWEQAFH---------------YPCLRLY

Query:  QSHKFQHHSTKCVFLGYSLAHKGYKCLSSSGRLFISCHVVFNESEFPFKSELVPSSGPSNIAL------VPHHSSPVAEFQTSSPTLSTPPQ--------
          HK Q H+T+CVFLGYS +HKGYKC++S GR+F+S HVVFNE+ FPF+   + +  P  +         P   + +    T+  T +   Q        
Subjt:  QSHKFQHHSTKCVFLGYSLAHKGYKCLSSSGRLFISCHVVFNESEFPFKSELVPSSGPSNIAL------VPHHSSPVAEFQTSSPTLSTPPQ--------

Query:  ----DQYVSPQLSAHSPEGH----SCPASVMPLYTSSMLPVDGSSDGSPELPLQISTALNSHPMQTRAKSGIFKQKDWGAFLVNSSFSPEVEPTSVKEAL
            DQ V      H+ E +        S       SM  +      +   P Q  T  N+H M+TR+K+G++K K     L   +   + EP SV EAL
Subjt:  ----DQYVSPQLSAHSPEGH----SCPASVMPLYTSSMLPVDGSSDGSPELPLQISTALNSHPMQTRAKSGIFKQKDWGAFLVNSSFSPEVEPTSVKEAL

Query:  KSSQWKAAMNDEIAVLTRNKTWTLVPLLPNLNLIGSKWIFKVKRKSNSSFDRCKARLVAQGFNQVPGVDFHETFSPVVKAPTIQIILAVAVMKNWSIRQL
           +W  AM+ E   L  NKTWTLVP     N+I SKWIFK K K++ + +R KARLVA+GF Q  GVD+ ETFSPVVK+ T++IIL++AV  +W +RQL
Subjt:  KSSQWKAAMNDEIAVLTRNKTWTLVPLLPNLNLIGSKWIFKVKRKSNSSFDRCKARLVAQGFNQVPGVDFHETFSPVVKAPTIQIILAVAVMKNWSIRQL

Query:  DVNNIFLNGRLQEAVYMRQLTGYIDQSCPDYVCKLDKALYGLRQASRA
        D+NN FLNG L+E+V+M Q  GYIDQ+ P ++C+L+KA+YGL+QA RA
Subjt:  DVNNIFLNGRLQEAVYMRQLTGYIDQSCPDYVCKLDKALYGLRQASRA

A0A2K3NEN7 Copia-like polyprotein (Fragment)5.2e-19339.58Show/hide
Query:  SSATNIISSSLGHPLSTVLTVKLDDKNYLLWKDMVLAILRGQKVDMYVLGTKAQPSELIETTTESGKMLLSNMLYEEWMTVDQALSGWLFGSMSPAIAAD
        SSA N   S+  + L ++++VKLD  NY LWK +VL ++RG K D Y+LGTK  P + + +  +S K+   N  +++WM  DQAL GWL  SM+  IA  
Subjt:  SSATNIISSSLGHPLSTVLTVKLDDKNYLLWKDMVLAILRGQKVDMYVLGTKAQPSELIETTTESGKMLLSNMLYEEWMTVDQALSGWLFGSMSPAIAAD

Query:  VINFKTLREVWKALEEVYGDTSKACVNQFRGILQNTKKGSMKMIDYLAIMKQASENLKLVGNPVSLDDLVSYVLAGLDSEYIPIVCAIDDKDLKTWQELS
        +++ +T +++W   + + G  +K+ +   +    NT+KG MKM +YL  MK  S+ LKL G+P+S  DL+   L GLD+EY P+V  + D+   +W ++ 
Subjt:  VINFKTLREVWKALEEVYGDTSKACVNQFRGILQNTKKGSMKMIDYLAIMKQASENLKLVGNPVSLDDLVSYVLAGLDSEYIPIVCAIDDKDLKTWQELS

Query:  SILINFEGTLARYSTPTNAHFDLPDLATHLALNRQSMFDNQRQFN----PSNGNRGNDNNSGSYYGSGNGQMGNNPTQ---GTAMVEIEEEEVEGATIFR
        + L+ FE  L           D  +  + L LN  + F N+ +F      S GN    N  G   G G G+M N   Q   GT    ++           
Subjt:  SILINFEGTLARYSTPTNAHFDLPDLATHLALNRQSMFDNQRQFN----PSNGNRGNDNNSGSYYGSGNGQMGNNPTQ---GTAMVEIEEEEVEGATIFR

Query:  EEITTCYMRFEEHFNNLHASGNGSVQGNNSNNASSSAYIATPEILHDPKWLADSGATNHVT--------------------ANARNLAV------KMDYN
             C  RF+  +   + S     QG      S SA++A+P    D +W  DSGA+NHVT                     N   L +      K++  
Subjt:  EEITTCYMRFEEHFNNLHASGNGSVQGNNSNNASSSAYIATPEILHDPKWLADSGATNHVT--------------------ANARNLAV------KMDYN

Query:  VLNNILHVPEIRKNLISIASLTVDNNVVVEFHSNYCVVKDKASKKVMLHEILRNDLYQIELPSIQTPKSEIRSTSFAGLWHNRLGHASSKVIKSVLKSCN
         L+++L+VP+I KNL+S++ LT DNN+ VEF +N C VKDK + + +L   L++ LYQ+   S Q+ K      S    WH +LGH ++KV++ VLK CN
Subjt:  VLNNILHVPEIRKNLISIASLTVDNNVVVEFHSNYCVVKDKASKKVMLHEILRNDLYQIELPSIQTPKSEIRSTSFAGLWHNRLGHASSKVIKSVLKSCN

Query:  VSTFLNESLHFCDACQKGKSHRLPFSRSVSHTWQPLELVHCDLWGPSPIVSIVGYKYYISFVDDFTRLAHIYPLKTKGEAFSSFSQYKLLVANRFEKKIK
        V    ++   FC+ACQ GK H LPF  S SH  +PL L+H D+WGP+PI+S  G+KYY+ F+DDF+R   I+PLK K +   +F Q+K L  N+F KKIK
Subjt:  VSTFLNESLHFCDACQKGKSHRLPFSRSVSHTWQPLELVHCDLWGPSPIVSIVGYKYYISFVDDFTRLAHIYPLKTKGEAFSSFSQYKLLVANRFEKKIK

Query:  TLQTDWGGEFRSFTSFLRDNGIEFRHSCPHTSQQNGIVERKHRHIVEMGLTLLAQASMPLTYWWEAFSSAVYIINCLPTPILGDVSPWEQAFH-------
         +Q D GGE+++      + GI+FR SCP+TSQQNG  ERKHRH+VE+GLTLLAQA MPL YWWEAFS+AVY+IN L + +  + SP+   F        
Subjt:  TLQTDWGGEFRSFTSFLRDNGIEFRHSCPHTSQQNGIVERKHRHIVEMGLTLLAQASMPLTYWWEAFSSAVYIINCLPTPILGDVSPWEQAFH-------

Query:  --------YPCLRLYQSHKFQHHSTKCVFLGYSLAHKGYKCLSSSGRLFISCHVVFNESEFPFKSELVPSSGPSNIALVPHHSSPVAEFQTSSPTLSTPP
                YPCL+ Y  HK Q H+T+CVF+GYS +HKG     + G    S + + N+ +     +   S+  S+     H  +  +   T++ +     
Subjt:  --------YPCLRLYQSHKFQHHSTKCVFLGYSLAHKGYKCLSSSGRLFISCHVVFNESEFPFKSELVPSSGPSNIALVPHHSSPVAEFQTSSPTLSTPP

Query:  QDQYVSPQLSAHSPEGHSCPASVMPLYTSSMLPVDGSSDGSPELPLQISTALNSHPMQTRAKSGIFKQKDWGAFLVNSSFSPEVEPTSVKEALKSSQWKA
         D +V  +   +SP                   + G+S        Q +T  N+H ++TR+K+GI K K     +  +  S E EP SVKEAL    WK 
Subjt:  QDQYVSPQLSAHSPEGHSCPASVMPLYTSSMLPVDGSSDGSPELPLQISTALNSHPMQTRAKSGIFKQKDWGAFLVNSSFSPEVEPTSVKEALKSSQWKA

Query:  AMNDEIAVLTRNKTWTLVPLLPNLNLIGSKWIFKVKRKSNSSFDRCKARLVAQGFNQVPGVDFHETFSPVVKAPTIQIILAVAVMKNWSIRQLDVNNIFL
        AM+ E   L  N TWTLVP     N+I SKWIFK K KS+ S +R KARLVA+GF Q  G+DFHETFSPVVK+ T++IIL +AV  NW +RQLD+NN FL
Subjt:  AMNDEIAVLTRNKTWTLVPLLPNLNLIGSKWIFKVKRKSNSSFDRCKARLVAQGFNQVPGVDFHETFSPVVKAPTIQIILAVAVMKNWSIRQLDVNNIFL

Query:  NGRLQEAVYMRQLTGYIDQSCPDYVCKLDKALYGLRQASRA
        NG+L+E V+M Q  GYID + P+++CKL KA+YGL+QA RA
Subjt:  NGRLQEAVYMRQLTGYIDQSCPDYVCKLDKALYGLRQASRA

A0A2Z6MBG6 Integrase catalytic domain-containing protein1.7e-20439.56Show/hide
Query:  AVAASSSATNIISSSLGHPLSTVLTVKLDDKNYLLWKDMVLAILRGQKVDMYVLGTKAQPSELIETTTESGKMLLSNMLYEEWMTVDQALSGWLFGSMSP
        A AA S+  N + SS        ++VKLD  NY LWK +VL ++RG K+D Y+LGT+  P E I T+++S K    N  + EW   DQ L GW+  SM+ 
Subjt:  AVAASSSATNIISSSLGHPLSTVLTVKLDDKNYLLWKDMVLAILRGQKVDMYVLGTKAQPSELIETTTESGKMLLSNMLYEEWMTVDQALSGWLFGSMSP

Query:  AIAADVINFKTLREVWKALEEVYGDTSKACVNQFRGILQNTKKGSMKMIDYLAIMKQASENLKLVGNPVSLDDLVSYVLAGLDSEYIPIVCAIDDKDLKT
         IA  +++ +T +++W   + + G  +++ +   +    + +KG MKM DYL  MK   + LKL GNPVS  DL+   L GLDSEY P+V  + D+   +
Subjt:  AIAADVINFKTLREVWKALEEVYGDTSKACVNQFRGILQNTKKGSMKMIDYLAIMKQASENLKLVGNPVSLDDLVSYVLAGLDSEYIPIVCAIDDKDLKT

Query:  WQELSSILINFEGTLARYSTPTNAHFDLPDLATHLALNRQSMFDNQRQFNPSNGNRGNDNNSGSYYGSGNGQMGNNPTQGTAMVEIEEEEVEGATIFREE
        W +L + L+ FE  + + +  TN   +    AT    NR     +  +   SN N    N+ G   G G G+ G NP Q                +    
Subjt:  WQELSSILINFEGTLARYSTPTNAHFDLPDLATHLALNRQSMFDNQRQFNPSNGNRGNDNNSGSYYGSGNGQMGNNPTQGTAMVEIEEEEVEGATIFREE

Query:  ITTCYMRFEEHFNNLHASGNGSVQGNNSNNASSSAYIATPEILHDPKWLADSGATNHVT--------------------ANARNLAV------KMDYNVL
           C+ RF++ ++  + S     QG      S +A++A+   + D  W  DSGA+NHVT                     N   LA+      K+    L
Subjt:  ITTCYMRFEEHFNNLHASGNGSVQGNNSNNASSSAYIATPEILHDPKWLADSGATNHVT--------------------ANARNLAV------KMDYNVL

Query:  NNILHVPEIRKNLISIASLTVDNNVVVEFHSNYCVVKDKASKKVMLHEILRNDLYQIELPSIQTPKSEIRSTSFAGLWHNRLGHASSKVIKSVLKSCNVS
        ++IL+VP I KNL+S++ L  DNN++VEF  N C VKDK + KV+L  +L++ LYQ+      T ++     S    WH RLGH ++KV+  VL+SC V 
Subjt:  NNILHVPEIRKNLISIASLTVDNNVVVEFHSNYCVVKDKASKKVMLHEILRNDLYQIELPSIQTPKSEIRSTSFAGLWHNRLGHASSKVIKSVLKSCNVS

Query:  TFLNESLHFCDACQKGKSHRLPFSRSVSHTWQPLELVHCDLWGPSPIVSIVGYKYYISFVDDFTRLAHIYPLKTKGEAFSSFSQYKLLVANRFEKKIKTL
           +++  FC+ACQ GK H LPF  S SH  +PLELVH D+WGP+PI++  G+KYY+ FVDDF+R   IYPLK K E   +F Q+K L  N+F K+IK +
Subjt:  TFLNESLHFCDACQKGKSHRLPFSRSVSHTWQPLELVHCDLWGPSPIVSIVGYKYYISFVDDFTRLAHIYPLKTKGEAFSSFSQYKLLVANRFEKKIKTL

Query:  QTDWGGEFRSFTSFLRDNGIEFRHSCPHTSQQNGIVERKHRHIVEMGLTLLAQASMPLTYWWEAFSSAVYIINCLPTPILGDVSPWEQAFH---------
        Q D GGE++       + GI+FR SCP+TSQQNG  ERKHRHI E GLTLLAQA MPL YWWEAFS+AVY+IN LP+ +  + SP+              
Subjt:  QTDWGGEFRSFTSFLRDNGIEFRHSCPHTSQQNGIVERKHRHIVEMGLTLLAQASMPLTYWWEAFSSAVYIINCLPTPILGDVSPWEQAFH---------

Query:  ------YPCLRLYQSHKFQHHSTKCVFLGYSLAHKGYKCLSSSGRLFISCHVVFNESEFPFKSELVPSSGPSNIAL---------------VPHHSSPVA
              YPCL+ Y  HK Q+H+T+CVFLGYS +HKGYKCL+S GR+FIS HV+FNE  FPF    + +  P    +               +   S P+ 
Subjt:  ------YPCLRLYQSHKFQHHSTKCVFLGYSLAHKGYKCLSSSGRLFISCHVVFNESEFPFKSELVPSSGPSNIAL---------------VPHHSSPVA

Query:  EFQTSSPTLSTPPQDQYVSPQLSAHSPEGHSCPASVMPLYTSSMLPVDGSSDGSPELPLQISTALNSHPMQTRAKSGIFKQKDWGAFLVNSSFSPEVEPT
        E +  + T +   QD      +++ + + ++ P+     +  ++      S G             SH + TR+KSGI K K      +  ++   +EP 
Subjt:  EFQTSSPTLSTPPQDQYVSPQLSAHSPEGHSCPASVMPLYTSSMLPVDGSSDGSPELPLQISTALNSHPMQTRAKSGIFKQKDWGAFLVNSSFSPEVEPT

Query:  SVKEALKSSQWKAAMNDEIAVLTRNKTWTLVPLLPNLNLIGSKWIFKVKRKSNSSFDRCKARLVAQGFNQVPGVDFHETFSPVVKAPTIQIILAVAVMKN
        + KEAL    WK AM  E   L  NKTW LVP     N++ SKW+FK K K + S +R KARLVA+GF Q  G+D+ ETFSPV+KA T++IIL++AV  N
Subjt:  SVKEALKSSQWKAAMNDEIAVLTRNKTWTLVPLLPNLNLIGSKWIFKVKRKSNSSFDRCKARLVAQGFNQVPGVDFHETFSPVVKAPTIQIILAVAVMKN

Query:  WSIRQLDVNNIFLNGRLQEAVYMRQLTGYIDQSCPDYVCKLDKALYGLRQASRA
        W +RQLD+NN FLNG L+E V+M Q  G++D + P+++CKL KA+YGL+QA RA
Subjt:  WSIRQLDVNNIFLNGRLQEAVYMRQLTGYIDQSCPDYVCKLDKALYGLRQASRA

A0A2Z6P4D5 Integrase catalytic domain-containing protein2.7e-20539.81Show/hide
Query:  SSATNIISSSLGHPLSTVLTVKLDDKNYLLWKDMVLAILRGQKVDMYVLGTKAQPSELIETTTESGKMLLSNMLYEEWMTVDQALSGWLFGSMSPAIAAD
        SSA N   S   + L ++++VKLD  NY LWK +VL+++RG K+D Y+LGT   P + + +  +S K+   N  + +W+  DQAL GWL  SM+  IA  
Subjt:  SSATNIISSSLGHPLSTVLTVKLDDKNYLLWKDMVLAILRGQKVDMYVLGTKAQPSELIETTTESGKMLLSNMLYEEWMTVDQALSGWLFGSMSPAIAAD

Query:  VINFKTLREVWKALEEVYGDTSKACVNQFRGILQNTKKGSMKMIDYLAIMKQASENLKLVGNPVSLDDLVSYVLAGLDSEYIPIVCAIDDKDLKTWQELS
        +++ +T +++W   + + G  +K+ +   +    NT+KG MKM +YL  MK  S+ LKL G+P+S  DL+   L GLD+EY P+V  + D+   +W ++ 
Subjt:  VINFKTLREVWKALEEVYGDTSKACVNQFRGILQNTKKGSMKMIDYLAIMKQASENLKLVGNPVSLDDLVSYVLAGLDSEYIPIVCAIDDKDLKTWQELS

Query:  SILINFEGTLARYSTPTNAHFDLPDLATHLALNRQSMFDNQRQFN----PSNGNRGNDNNSGSYYGSGNGQMGNNPTQ---GTAMVEIEEEEVEGATIFR
        + L+ FE  L +++             + L LN  + F N+ +F      S GN    N  G   G G G+M N   Q   GT  + ++           
Subjt:  SILINFEGTLARYSTPTNAHFDLPDLATHLALNRQSMFDNQRQFN----PSNGNRGNDNNSGSYYGSGNGQMGNNPTQ---GTAMVEIEEEEVEGATIFR

Query:  EEITTCYMRFEEHFNNLHASGNGSVQGNNSNNASSSAYIATPEILHDPKWLADSGATNHVT--------------------ANARNLAV------KMDYN
             C  RF+  +   + S     QG      S SA+IA+P    D +W  DSGA NHVT                     N   L +      K++  
Subjt:  EEITTCYMRFEEHFNNLHASGNGSVQGNNSNNASSSAYIATPEILHDPKWLADSGATNHVT--------------------ANARNLAV------KMDYN

Query:  VLNNILHVPEIRKNLISIASLTVDNNVVVEFHSNYCVVKDKASKKVMLHEILRNDLYQIELPSIQTPKSEIRSTSFAGLWHNRLGHASSKVIKSVLKSCN
         L+++L+VP+I KNL+S++ LT DNN++VEF +N C VKDK + + +L   L++ LYQ+      + K      S    WH +LGH ++KV+  VLK CN
Subjt:  VLNNILHVPEIRKNLISIASLTVDNNVVVEFHSNYCVVKDKASKKVMLHEILRNDLYQIELPSIQTPKSEIRSTSFAGLWHNRLGHASSKVIKSVLKSCN

Query:  VSTFLNESLHFCDACQKGKSHRLPFSRSVSHTWQPLELVHCDLWGPSPIVSIVGYKYYISFVDDFTRLAHIYPLKTKGEAFSSFSQYKLLVANRFEKKIK
        V    ++   FC+ACQ GK H LPF  S SH  +PL L+H D+WGP+PI+S  G+KYY+ F+DDF+R   I+PLK K +   +F Q+K L  N+F KKIK
Subjt:  VSTFLNESLHFCDACQKGKSHRLPFSRSVSHTWQPLELVHCDLWGPSPIVSIVGYKYYISFVDDFTRLAHIYPLKTKGEAFSSFSQYKLLVANRFEKKIK

Query:  TLQTDWGGEFRSFTSFLRDNGIEFRHSCPHTSQQNGIVERKHRHIVEMGLTLLAQASMPLTYWWEAFSSAVYIINCLPTPILGDVSPWEQAFH-------
         +Q D GGE+++      + GI+FR SCP+TSQQNG  ERKHRH+ E+GLTLLAQA MPL YWWEAFS+AVY+IN LP+ +  + SP+   F        
Subjt:  TLQTDWGGEFRSFTSFLRDNGIEFRHSCPHTSQQNGIVERKHRHIVEMGLTLLAQASMPLTYWWEAFSSAVYIINCLPTPILGDVSPWEQAFH-------

Query:  --------YPCLRLYQSHKFQHHSTKCVFLGYSLAHKGYKCLSSSGRLFISCHVVFNESEFPFKSELVPSSGP------SNIALVPHHSSPVAEFQTSSP
                YPCL+ Y  HK Q H+T+CVF+GYS +HKGYKC++S GR+F+S HV+FNE+ FPF    + +  P      ++  L+P  S+         P
Subjt:  --------YPCLRLYQSHKFQHHSTKCVFLGYSLAHKGYKCLSSSGRLFISCHVVFNESEFPFKSELVPSSGP------SNIALVPHHSSPVAEFQTSSP

Query:  TLSTPPQDQYVSPQLSAHSPEGHSCPASVMPLYTSSMLPVDGSSDGSPELPLQISTAL-------------NSHPMQTRAKSGIFKQKDWGAFLVNSSFS
          +T       S + S ++       +S   + T++    D  +D S +   + ++ +             N+H M+TR+K GI K K     +  +  S
Subjt:  TLSTPPQDQYVSPQLSAHSPEGHSCPASVMPLYTSSMLPVDGSSDGSPELPLQISTAL-------------NSHPMQTRAKSGIFKQKDWGAFLVNSSFS

Query:  PEVEPTSVKEALKSSQWKAAMNDEIAVLTRNKTWTLVPLLPNLNLIGSKWIFKVKRKSNSSFDRCKARLVAQGFNQVPGVDFHETFSPVVKAPTIQIILA
         E EP SVKEAL    WK AM+ E   L  N TWTLVP     N+I SKWIFK K KS+ S +R KARLVA+GF Q  G+DF ETFSPVVK+ T++IIL 
Subjt:  PEVEPTSVKEALKSSQWKAAMNDEIAVLTRNKTWTLVPLLPNLNLIGSKWIFKVKRKSNSSFDRCKARLVAQGFNQVPGVDFHETFSPVVKAPTIQIILA

Query:  VAVMKNWSIRQLDVNNIFLNGRLQEAVYMRQLTGYIDQSCPDYVCKLDKALYGLRQASRA
        +AV  NW +RQLD+NN FLNG+L+E V+M Q  GYID + P+++CKL KA+YGL+QA RA
Subjt:  VAVMKNWSIRQLDVNNIFLNGRLQEAVYMRQLTGYIDQSCPDYVCKLDKALYGLRQASRA

A0A803PM38 Uncharacterized protein8.8e-20941.55Show/hide
Query:  ATNIISSSLGHPLSTVLTVKLDDKNYLLWKDMVLAILRGQKVDMYVLGTKAQPSELIETTTESGKML---LSNMLYEEWMTVDQALSGWLFGSMSPAIAA
        A NI+    G  L+    +KLD  N+ LW+ MV AI+RG ++D Y+ GT  +P E + +T   G +      N  +E+W+  DQ L GWL+GSM+  IA 
Subjt:  ATNIISSSLGHPLSTVLTVKLDDKNYLLWKDMVLAILRGQKVDMYVLGTKAQPSELIETTTESGKML---LSNMLYEEWMTVDQALSGWLFGSMSPAIAA

Query:  DVINFKTLREVWKALEEVYGDTSKACVNQFRGILQNTKKGSMKMIDYLAIMKQASENLKLVGNPVSLDDLVSYVLAGLDSEYIPIVCAIDDKDLKTWQEL
        +V+   +   +W ALEE++G  SKA ++++R  +Q  +KG++ M DYL   +Q ++ L L G P   + LVS VL+GLD EY+P+V  I+ +   TWQ+L
Subjt:  DVINFKTLREVWKALEEVYGDTSKACVNQFRGILQNTKKGSMKMIDYLAIMKQASENLKLVGNPVSLDDLVSYVLAGLDSEYIPIVCAIDDKDLKTWQEL

Query:  SSILINFEGTLARYSTPTNAHFDLPDLATHLALNRQSMFDNQRQFNPSNGNRGNDNNSGSYYGSGNGQMGNNPTQGTAMVEIEEEEVEGATIF-REEITT
          +L++ +  + R  +     F      T + +N  +   N+     +N    N+NN G   G  N +  NN ++G            G T   R     
Subjt:  SSILINFEGTLARYSTPTNAHFDLPDLATHLALNRQSMFDNQRQFNPSNGNRGNDNNSGSYYGSGNGQMGNNPTQGTAMVEIEEEEVEGATIF-REEITT

Query:  C--YMRFEEHFNNLHASGNGSVQGNNSN-----NASSSAYIATPEILHDPKWLADSGATNHVTANARNLAVKMDYNVLNNILHVPEIRKNLISIASLTVD
        C  Y     H  N  AS + + + N  N     N      +A    L     +   G  +  T +A  L       +L  ILHVP I KNL+SI+ LT D
Subjt:  C--YMRFEEHFNNLHASGNGSVQGNNSN-----NASSSAYIATPEILHDPKWLADSGATNHVTANARNLAVKMDYNVLNNILHVPEIRKNLISIASLTVD

Query:  NNVVVEFHSNYCVVKDKASKKVMLHEILRNDLYQIELPSIQTPKSEIRS----TSFAGL------------------------WHNRLGHASSKVIKSVL
        NNV VEF S+ C VKDK + +V+L   L++ LYQ + P+  T  S  RS    TSF+GL                        WH RLGH S +V+ +VL
Subjt:  NNVVVEFHSNYCVVKDKASKKVMLHEILRNDLYQIELPSIQTPKSEIRS----TSFAGL------------------------WHNRLGHASSKVIKSVL

Query:  KSCNVSTFLNESLHFCDACQKGKSHRLPFSRSVSHTWQPLELVHCDLWGPSPIVSIVGYKYYISFVDDFTRLAHIYPLKTKGEAFSSFSQYKLLVANRFE
           NV   +N SL FCDACQ GKSH LPF  +      PLELVH D+WGPSPI+S   ++YYI F+DDF+R   IYPLK K EA ++F Q+KLLV N+F 
Subjt:  KSCNVSTFLNESLHFCDACQKGKSHRLPFSRSVSHTWQPLELVHCDLWGPSPIVSIVGYKYYISFVDDFTRLAHIYPLKTKGEAFSSFSQYKLLVANRFE

Query:  KKIKTLQTDWGGEFRSFTSFLRDNGIEFRHSCPHTSQQNGIVERKHRHIVEMGLTLLAQASMPLTYWWEAFSSAVYIINCLPTPILGDVSPWEQAFH---
         ++K +QTDWGGE++ F  F  D+GI F+H CPHTS QNG  ERKHRHIVEMGLTLLAQA +P  YWW+AF +AVY+IN LPTP+L   +P+E  F    
Subjt:  KKIKTLQTDWGGEFRSFTSFLRDNGIEFRHSCPHTSQQNGIVERKHRHIVEMGLTLLAQASMPLTYWWEAFSSAVYIINCLPTPILGDVSPWEQAFH---

Query:  ------------YPCLRLYQSHKFQHHSTKCVFLGYSLAHKGYKCLSSSGRLFISCHVVFNESEFPFKSELVPSSGPSN--IALVP--------------
                    +PCLR YQ+HKFQ HSTKCV LGYS  HKGYKCLSS+GRL+IS  V+FNE EFPFKS  + ++ P      LVP              
Subjt:  ------------YPCLRLYQSHKFQHHSTKCVFLGYSLAHKGYKCLSSSGRLFISCHVVFNESEFPFKSELVPSSGPSN--IALVP--------------

Query:  --HHSSPVAEFQTSSPTLSTPPQDQYVSPQLSAH---------SPEGHSCPASVMPL--------YTSSMLPVDGSSDGSPELPLQISTALNSHPMQTRA
            SS +   QT      TP   + V P LS           S  G+    S + +          S+  P+D S+         +   +++HPM TRA
Subjt:  --HHSSPVAEFQTSSPTLSTPPQDQYVSPQLSAH---------SPEGHSCPASVMPL--------YTSSMLPVDGSSDGSPELPLQISTALNSHPMQTRA

Query:  KSGIFKQKDW---GAFLVNSSFSPEVEPTSVKEALKSSQWKAAMNDEIAVLTRNKTWTLVPLLPNLNLIGSKWIFKVKRKSNSSFDRCKARLVAQGFNQV
        K+GIFK K +     ++ NSS     EP S++EAL+   W  AM+ E+  L RN TW LVP LP++++I +KW++K KR ++ SF R KARLVA+GF Q 
Subjt:  KSGIFKQKDW---GAFLVNSSFSPEVEPTSVKEALKSSQWKAAMNDEIAVLTRNKTWTLVPLLPNLNLIGSKWIFKVKRKSNSSFDRCKARLVAQGFNQV

Query:  PGVDFHETFSPVVKAPTIQIILAVAVMKNWSIRQLDVNNIFLNGRLQEAVYMRQLTGYIDQSCPDYVCKLDKALYGLRQASRA
        PGVDF ETFSPV+KA T++I+L++AV K W +RQLD+NN FLNG + E +YM+Q  G+ D++ P++VCKL K++YGLRQA RA
Subjt:  PGVDFHETFSPVVKAPTIQIILAVAVMKNWSIRQLDVNNIFLNGRLQEAVYMRQLTGYIDQSCPDYVCKLDKALYGLRQASRA

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.0e-5324.42Show/hide
Query:  DDKNYLLWKDMVLAILRGQKVDMYVLGTKAQPSELIETTTESGKMLLSNMLYEEWMTVDQALSGWLFGSMSPAIAADVINFKTLREVWKALEEVYGDTSK
        D + Y +WK  + A+L  Q V   V G                  L+ N + + W   ++     +   +S +      +  T R++ + L+ VY   S 
Subjt:  DDKNYLLWKDMVLAILRGQKVDMYVLGTKAQPSELIETTTESGKMLLSNMLYEEWMTVDQALSGWLFGSMSPAIAADVINFKTLREVWKALEEVYGDTSK

Query:  ACVNQFRGILQNTKKGS-MKMIDYLAIMKQASENLKLVGNPVSLDDLVSYVLAGLDSEYIPIVCAID--DKDLKTWQELSSILINFEGTLARYSTPTNAH
        A     R  L + K  S M ++ +  I  +    L   G  +   D +S++L  L S Y  I+ AI+   ++  T   + + L++ E  +      T+  
Subjt:  ACVNQFRGILQNTKKGS-MKMIDYLAIMKQASENLKLVGNPVSLDDLVSYVLAGLDSEYIPIVCAID--DKDLKTWQELSSILINFEGTLARYSTPTNAH

Query:  FDLPDLATHLALNRQSMFDNQRQFNPSNGNRGNDNNSGSYYGSG-NGQMGNNPTQGTAMVEIEEEEVEGATIFREEITTCYMRFEEHFNNLHASGN-GSV
             +  +    + ++F N R   P    +GN       +  G  G +  +      ++  + +E E            +M   +  NN     N G V
Subjt:  FDLPDLATHLALNRQSMFDNQRQFNPSNGNRGNDNNSGSYYGSG-NGQMGNNPTQGTAMVEIEEEEVEGATIFREEITTCYMRFEEHFNNLHASGN-GSV

Query:  QGNNSNN---ASSSAYIATPEILHDPKWLADSGATNHVTANARNLA-VKMDYNV-LNNILHVPEIRKNLISIASLTVDNNVVVEFHSNYCVVKDKASKKV
          + +++      S Y  + E++  P  +A +     + A  R +  ++ D+ + L ++L   E   NL+S+  L  +  + +EF  +   +       V
Subjt:  QGNNSNN---ASSSAYIATPEILHDPKWLADSGATNHVTANARNLA-VKMDYNV-LNNILHVPEIRKNLISIASLTVDNNVVVEFHSNYCVVKDKASKKV

Query:  MLHEILRNDLYQIELPSIQTPKSEIRSTSFAGLWHNRLGHASSKVI-----KSVLKSCNVSTFLNESLHFCDACQKGKSHRLPFS--RSVSHTWQPLELV
            +L N    + + + Q      +  +   LWH R GH S   +     K++    ++   L  S   C+ C  GK  RLPF   +  +H  +PL +V
Subjt:  MLHEILRNDLYQIELPSIQTPKSEIRSTSFAGLWHNRLGHASSKVI-----KSVLKSCNVSTFLNESLHFCDACQKGKSHRLPFS--RSVSHTWQPLELV

Query:  HCDLWGPSPIVSIVGYKYYISFVDDFTRLAHIYPLKTKGEAFSSFSQYKLLVANRFEKKIKTLQTDWGGEFRS--FTSFLRDNGIEFRHSCPHTSQQNGI
        H D+ GP   V++    Y++ FVD FT     Y +K K + FS F  +       F  K+  L  D G E+ S     F    GI +  + PHT Q NG+
Subjt:  HCDLWGPSPIVSIVGYKYYISFVDDFTRLAHIYPLKTKGEAFSSFSQYKLLVANRFEKKIKTLQTDWGGEFRS--FTSFLRDNGIEFRHSCPHTSQQNGI

Query:  VERKHRHIVEMGLTLLAQASMPLTYWWEAFSSAVYIINCLPTPILGDVS--PWE----QAFHYPCLRLY----------QSHKFQHHSTKCVFLGYSLAH
         ER  R I E   T+++ A +  ++W EA  +A Y+IN +P+  L D S  P+E    +  +   LR++          +  KF   S K +F+GY    
Subjt:  VERKHRHIVEMGLTLLAQASMPLTYWWEAFSSAVYIINCLPTPILGDVS--PWE----QAFHYPCLRLY----------QSHKFQHHSTKCVFLGYSLAH

Query:  KGYKCLSSSGRLFISCHVV-------FNESEFPFKSELVPSSGPSNIALVPHHSSPVAEFQTSSPTLSTP------------------PQD--QYVSPQL
         G+K   +    FI    V        N     F++  +  S  S     P+ S  +   QT  P  S                    P D  + +  + 
Subjt:  KGYKCLSSSGRLFISCHVV-------FNESEFPFKSELVPSSGPSNIALVPHHSSPVAEFQTSSPTLSTP------------------PQD--QYVSPQL

Query:  SAHSPE-------GHSCPASVMPLYTSSMLPVD-----GSSDGSPELPLQISTA-------------------LNSHPMQTRAKSGI-FKQKD--WGAFL
           S E         S  ++   L  S     D         G+P    +  TA                   +N    + + K  I + ++D      +
Subjt:  SAHSPE-------GHSCPASVMPLYTSSMLPVD-----GSSDGSPELPLQISTA-------------------LNSHPMQTRAKSGI-FKQKD--WGAFL

Query:  VNSSFSPEVEPTSVKEAL---KSSQWKAAMNDEIAVLTRNKTWTLVPLLPNLNLIGSKWIFKVKRKSNSSFDRCKARLVAQGFNQVPGVDFHETFSPVVK
        +N+       P S  E       S W+ A+N E+     N TWT+     N N++ S+W+F VK     +  R KARLVA+GF Q   +D+ ETF+PV +
Subjt:  VNSSFSPEVEPTSVKEAL---KSSQWKAAMNDEIAVLTRNKTWTLVPLLPNLNLIGSKWIFKVKRKSNSSFDRCKARLVAQGFNQVPGVDFHETFSPVVK

Query:  APTIQIILAVAVMKNWSIRQLDVNNIFLNGRLQEAVYMRQLTGYIDQSC-PDYVCKLDKALYGLRQASR
          + + IL++ +  N  + Q+DV   FLNG L+E +YMR   G    SC  D VCKL+KA+YGL+QA+R
Subjt:  APTIQIILAVAVMKNWSIRQLDVNNIFLNGRLQEAVYMRQLTGYIDQSC-PDYVCKLDKALYGLRQASR

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.2e-7326.06Show/hide
Query:  MLYEEWMTVDQALSGWLFGSMSPAIAADVINFKTLREVWKALEEVYGDTSKACVNQF---RGILQNTKKGSMKMIDYLAIMKQASENLKLVGNPVSLDDL
        M  E+W  +D+  +  +   +S  +  ++I+  T R +W  LE +Y   SK   N+    + +           + +L +       L  +G  +  +D 
Subjt:  MLYEEWMTVDQALSGWLFGSMSPAIAADVINFKTLREVWKALEEVYGDTSKACVNQF---RGILQNTKKGSMKMIDYLAIMKQASENLKLVGNPVSLDDL

Query:  VSYVLAGLDSEYIPIVCAIDDKDLKTWQELSSILINFEGTLARYSTPTNAHFDLPDLATHLALNR--QSMFDNQRQFNPSNGNRGNDNNSGSYYGSGNGQ
           +L  L S Y                 L++ +++ + T+           +L D+ + L LN   +   +NQ Q   + G   +   S + YG  +G 
Subjt:  VSYVLAGLDSEYIPIVCAIDDKDLKTWQELSSILINFEGTLARYSTPTNAHFDLPDLATHLALNR--QSMFDNQRQFNPSNGNRGNDNNSGSYYGSGNGQ

Query:  MGNNPTQGTAMVEIEEEEVEGATIFREEITTCY-MRFEEHF-----NNLHASGNGSVQGNNSNNASSSA--------YIATPEILH----DPKWLADSGA
         G +  +                  +  +  CY      HF     N     G  S Q N+ N A+                E +H    + +W+ D+ A
Subjt:  MGNNPTQGTAMVEIEEEEVEGATIFREEITTCY-MRFEEHF-----NNLHASGNGSVQGNNSNNASSSA--------YIATPEILH----DPKWLADSGA

Query:  TNHVT---------------------------ANARNLAVKMDYN---VLNNILHVPEIRKNLISIASLTVDNNVVVEFHSNYCVVKDKASKKVMLHEIL
        ++H T                           A   ++ +K +     VL ++ HVP++R NLIS   + +D +    + +N      K S  V+   + 
Subjt:  TNHVT---------------------------ANARNLAVKMDYN---VLNNILHVPEIRKNLISIASLTVDNNVVVEFHSNYCVVKDKASKKVMLHEIL

Query:  RNDLYQIELPSIQTPKSEIRSTSFAGLWHNRLGHASSKVIKSVLKSCNVSTFLNESLHFCDACQKGKSHRLPFSRSVSHTWQPLELVHCDLWGPSPIVSI
        R  LY+      Q   +  +      LWH R+GH S K ++ + K   +S     ++  CD C  GK HR+ F  S       L+LV+ D+ GP  I S+
Subjt:  RNDLYQIELPSIQTPKSEIRSTSFAGLWHNRLGHASSKVIKSVLKSCNVSTFLNESLHFCDACQKGKSHRLPFSRSVSHTWQPLELVHCDLWGPSPIVSI

Query:  VGYKYYISFVDDFTRLAHIYPLKTKGEAFSSFSQYKLLVANRFEKKIKTLQTDWGGEF--RSFTSFLRDNGIEFRHSCPHTSQQNGIVERKHRHIVEMGL
         G KY+++F+DD +R   +Y LKTK + F  F ++  LV     +K+K L++D GGE+  R F  +   +GI    + P T Q NG+ ER +R IVE   
Subjt:  VGYKYYISFVDDFTRLAHIYPLKTKGEAFSSFSQYKLLVANRFEKKIKTLQTDWGGEF--RSFTSFLRDNGIEFRHSCPHTSQQNGIVERKHRHIVEMGL

Query:  TLLAQASMPLTYWWEAFSSAVYIINCLPTPILGDVSP---W-EQAFHYPCLRLYQSHKFQH-----------HSTKCVFLGYSLAHKGYKCLSSSGRLFI
        ++L  A +P ++W EA  +A Y+IN  P+  L    P   W  +   Y  L+++    F H            S  C+F+GY     GY+      +  I
Subjt:  TLLAQASMPLTYWWEAFSSAVYIINCLPTPILGDVSP---W-EQAFHYPCLRLYQSHKFQH-----------HSTKCVFLGYSLAHKGYKCLSSSGRLFI

Query:  -SCHVVFNESEFPFKSELVPSSGPSNIALVPHHSSPVAEFQTSSPTLSTPPQDQYVSPQLSAHSPEGHSCPASVMPLYTSSMLPVDGSSDGSPELPLQIS
         S  VVF ESE    +++   S      ++P+       F T   T + P   +  + ++S    +    P  V+          +   +G  E+     
Subjt:  -SCHVVFNESEFPFKSELVPSSGPSNIALVPHHSSPVAEFQTSSPTLSTPPQDQYVSPQLSAHSPEGHSCPASVMPLYTSSMLPVDGSSDGSPELPLQIS

Query:  TALNSHPMQTRAKSGIFKQKDWGAFLVNSSFSPEVEPTSVKEAL---KSSQWKAAMNDEIAVLTRNKTWTLVPLLPNLNLIGSKWIFKVKRKSNSSFDRC
              P++   +  +  ++      V    S + EP S+KE L   + +Q   AM +E+  L +N T+ LV L      +  KW+FK+K+  +    R 
Subjt:  TALNSHPMQTRAKSGIFKQKDWGAFLVNSSFSPEVEPTSVKEAL---KSSQWKAAMNDEIAVLTRNKTWTLVPLLPNLNLIGSKWIFKVKRKSNSSFDRC

Query:  KARLVAQGFNQVPGVDFHETFSPVVKAPTIQIILAVAVMKNWSIRQLDVNNIFLNGRLQEAVYMRQLTGYIDQSCPDYVCKLDKALYGLRQASR
        KARLV +GF Q  G+DF E FSPVVK  +I+ IL++A   +  + QLDV   FL+G L+E +YM Q  G+        VCKL+K+LYGL+QA R
Subjt:  KARLVAQGFNQVPGVDFHETFSPVVKAPTIQIILAVAVMKNWSIRQLDVNNIFLNGRLQEAVYMRQLTGYIDQSCPDYVCKLDKALYGLRQASR

P92520 Uncharacterized mitochondrial protein AtMg008204.9e-2348.03Show/hide
Query:  MQTRAKSGIFKQKDWGAFLVNSSFSPEVEPTSVKEALKSSQWKAAMNDEIAVLTRNKTWTLVPLLPNLNLIGSKWIFKVKRKSNSSFDRCKARLVAQGFN
        M TR+K+GI K     +  + ++   + EP SV  ALK   W  AM +E+  L+RNKTW LVP   N N++G KW+FK K  S+ + DR KARLVA+GF+
Subjt:  MQTRAKSGIFKQKDWGAFLVNSSFSPEVEPTSVKEALKSSQWKAAMNDEIAVLTRNKTWTLVPLLPNLNLIGSKWIFKVKRKSNSSFDRCKARLVAQGFN

Query:  QVPGVDFHETFSPVVKAPTIQIILAVA
        Q  G+ F ET+SPVV+  TI+ IL VA
Subjt:  QVPGVDFHETFSPVVKAPTIQIILAVA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE14.7e-14333.75Show/hide
Query:  KLDDKNYLLWKDMVLAILRGQKVDMYVLGTKAQPSELIETTTESGKMLLSNMLYEEWMTVDQALSGWLFGSMSPAIAADVINFKTLREVWKALEEVYGDT
        KL   NYL+W   V A+  G ++  ++ G+   P   I T          N  Y  W   D+ +   + G++S ++   V    T  ++W+ L ++Y + 
Subjt:  KLDDKNYLLWKDMVLAILRGQKVDMYVLGTKAQPSELIETTTESGKMLLSNMLYEEWMTVDQALSGWLFGSMSPAIAADVINFKTLREVWKALEEVYGDT

Query:  SKACVNQFRGILQNTKKGSMKMIDYLAIMKQASENLKLVGNPVSLDDLVSYVLAGLDSEYIPIVCAIDDKDL-KTWQELSSILINFEGTLARYSTPTNAH
        S   V Q R  L+   KG+  + DY+  +    + L L+G P+  D+ V  VL  L  EY P++  I  KD   T  E+   L+N E  +   S+ T   
Subjt:  SKACVNQFRGILQNTKKGSMKMIDYLAIMKQASENLKLVGNPVSLDDLVSYVLAGLDSEYIPIVCAIDDKDL-KTWQELSSILINFEGTLARYSTPTNAH

Query:  FDLPDLATHLALNRQSMFDNQRQFNPSN--GNRGNDNNSGSYYGSGNGQMGNNPTQGTAMVEIEEEEVEGATIFREEITTCYMRFEEHFNNLHASGNGSV
          +P  A  ++    +  +N    N +N   NR N+NNS  +  S      NN      + + +   V+G +  R    +    F    N+       + 
Subjt:  FDLPDLATHLALNRQSMFDNQRQFNPSN--GNRGNDNNSGSYYGSGNGQMGNNPTQGTAMVEIEEEEVEGATIFREEITTCYMRFEEHFNNLHASGNGSV

Query:  QGNNSNNASSSAYIATPEILHDPKWLADSGATNHVTANARNLAVKMDYN------------------------------VLNNILHVPEIRKNLISIASL
            +N A  S Y +         WL DSGAT+H+T++  NL++   Y                                L+NIL+VP I KNLIS+  L
Subjt:  QGNNSNNASSSAYIATPEILHDPKWLADSGATNHVTANARNLAVKMDYN------------------------------VLNNILHVPEIRKNLISIASL

Query:  TVDNNVVVEFHSNYCVVKDKASKKVMLHEILRNDLYQIELPSIQ------TPKSEIRSTSFAGLWHNRLGHASSKVIKSVLKSCNVSTFLNESLHF--CD
           N V VEF      VKD  +   +L    +++LY+  + S Q      +P S+   +S    WH RLGH +  ++ SV+ + ++S  LN S  F  C 
Subjt:  TVDNNVVVEFHSNYCVVKDKASKKVMLHEILRNDLYQIELPSIQ------TPKSEIRSTSFAGLWHNRLGHASSKVIKSVLKSCNVSTFLNESLHF--CD

Query:  ACQKGKSHRLPFSRSVSHTWQPLELVHCDLWGPSPIVSIVGYKYYISFVDDFTRLAHIYPLKTKGEAFSSFSQYKLLVANRFEKKIKTLQTDWGGEFRSF
         C   KS+++PFS+S  ++ +PLE ++ D+W  SPI+S   Y+YY+ FVD FTR   +YPLK K +   +F  +K L+ NRF+ +I T  +D GGEF + 
Subjt:  ACQKGKSHRLPFSRSVSHTWQPLELVHCDLWGPSPIVSIVGYKYYISFVDDFTRLAHIYPLKTKGEAFSSFSQYKLLVANRFEKKIKTLQTDWGGEFRSF

Query:  TSFLRDNGIEFRHSCPHTSQQNGIVERKHRHIVEMGLTLLAQASMPLTYWWEAFSSAVYIINCLPTPILGDVSPWEQAFH---------------YPCLR
          +   +GI    S PHT + NG+ ERKHRHIVE GLTLL+ AS+P TYW  AF+ AVY+IN LPTP+L   SP+++ F                YP LR
Subjt:  TSFLRDNGIEFRHSCPHTSQQNGIVERKHRHIVEMGLTLLAQASMPLTYWWEAFSSAVYIINCLPTPILGDVSPWEQAFH---------------YPCLR

Query:  LYQSHKFQHHSTKCVFLGYSLAHKGYKCLS-SSGRLFISCHVVFNESEFPFKSELVPSSG-------------------------PSNIALVPHH-----
         Y  HK    S +CVFLGYSL    Y CL   + RL+IS HV F+E+ FPF + L   S                          P+     PHH     
Subjt:  LYQSHKFQHHSTKCVFLGYSLAHKGYKCLS-SSGRLFISCHVVFNESEFPFKSELVPSSG-------------------------PSNIALVPHH-----

Query:  SSPVAEFQT-----------------SSPTLSTP------PQDQYVSPQLSAHSPEGHS--CPASVMPLYTSSMLPVDGSSDGSPELPLQISTA------
        SSP A F+                  SSP  + P      P  Q    Q   HS +  S   P +  P   +  L     S  S   P   +++      
Subjt:  SSPVAEFQT-----------------SSPTLSTP------PQDQYVSPQLSAHSPEGHS--CPASVMPLYTSSMLPVDGSSDGSPELPLQISTA------

Query:  -----------------------LNSHPMQTRAKSGIFKQKDWGAFLVNSSFSPEVEPTSVKEALKSSQWKAAMNDEIAVLTRNKTWTLVPLLP-NLNLI
                               LN+H M TRAK+GI K     +  V  S + E EP +  +ALK  +W+ AM  EI     N TW LVP  P ++ ++
Subjt:  -----------------------LNSHPMQTRAKSGIFKQKDWGAFLVNSSFSPEVEPTSVKEALKSSQWKAAMNDEIAVLTRNKTWTLVPLLP-NLNLI

Query:  GSKWIFKVKRKSNSSFDRCKARLVAQGFNQVPGVDFHETFSPVVKAPTIQIILAVAVMKNWSIRQLDVNNIFLNGRLQEAVYMRQLTGYIDQSCPDYVCK
        G +WIF  K  S+ S +R KARLVA+G+NQ PG+D+ ETFSPV+K+ +I+I+L VAV ++W IRQLDVNN FL G L + VYM Q  G+ID+  P+YVCK
Subjt:  GSKWIFKVKRKSNSSFDRCKARLVAQGFNQVPGVDFHETFSPVVKAPTIQIILAVAVMKNWSIRQLDVNNIFLNGRLQEAVYMRQLTGYIDQSCPDYVCK

Query:  LDKALYGLRQASRA
        L KALYGL+QA RA
Subjt:  LDKALYGLRQASRA

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE25.1e-13732.69Show/hide
Query:  TNIISSSLGHPLSTVLTVKLDDKNYLLWKDMVLAILRGQKVDMYVLGTKAQPSELIETTTESGKMLLSNMLYEEWMTVDQALSGWLFGSMSPAIAADVIN
        TNI++ ++ +        KL   NYL+W   V A+  G ++  ++ G+   P   I T      +   N  Y  W   D+ +   + G++S ++   V  
Subjt:  TNIISSSLGHPLSTVLTVKLDDKNYLLWKDMVLAILRGQKVDMYVLGTKAQPSELIETTTESGKMLLSNMLYEEWMTVDQALSGWLFGSMSPAIAADVIN

Query:  FKTLREVWKALEEVYGDTSKACVNQFRGILQNTKKGSMKMIDYLAIMKQASENLKLVGNPVSLDDLVSYVLAGLDSEYIPIVCAIDDKDL-KTWQELSSI
          T  ++W+ L ++Y + S   V Q R I +                    + L L+G P+  D+ V  VL  L  +Y P++  I  KD   +  E+   
Subjt:  FKTLREVWKALEEVYGDTSKACVNQFRGILQNTKKGSMKMIDYLAIMKQASENLKLVGNPVSLDDLVSYVLAGLDSEYIPIVCAIDDKDL-KTWQELSSI

Query:  LINFEGTLARYSTPTNAHFDLPDLATHLALNRQSMFDNQRQFNPSNGNRGNDNN-SGSYYGSGNGQMGNNPTQGTAMVEIEEEEVEGATIFREEITTCYM
        LIN E  L   ++      ++  +  ++  +R +  +  +     N N  N+NN S S+  S +G   +N      +   +   V+G +  R        
Subjt:  LINFEGTLARYSTPTNAHFDLPDLATHLALNRQSMFDNQRQFNPSNGNRGNDNN-SGSYYGSGNGQMGNNPTQGTAMVEIEEEEVEGATIFREEITTCYM

Query:  RFEEHFNNLHASGNGSVQGNNSNNASSSAYIATPEILHDPKWLADSGATNHVTANARNLAVKMDYN------------------------------VLNN
        +F+   N   ++   +     +N A +S Y A         WL DSGAT+H+T++  NL+    Y                                LN 
Subjt:  RFEEHFNNLHASGNGSVQGNNSNNASSSAYIATPEILHDPKWLADSGATNHVTANARNLAVKMDYN------------------------------VLNN

Query:  ILHVPEIRKNLISIASLTVDNNVVVEFHSNYCVVKDKASKKVMLHEILRNDLYQIELPSIQTPKSEIRSTSFA--GLWHNRLGHASSKVIKSVLKSCNVS
        +L+VP I KNLIS+  L   N V VEF      VKD  +   +L    +++LY+  + S Q         S A    WH+RLGH S  ++ SV+ + ++ 
Subjt:  ILHVPEIRKNLISIASLTVDNNVVVEFHSNYCVVKDKASKKVMLHEILRNDLYQIELPSIQTPKSEIRSTSFA--GLWHNRLGHASSKVIKSVLKSCNVS

Query:  TFLNES--LHFCDACQKGKSHRLPFSRSVSHTWQPLELVHCDLWGPSPIVSIVGYKYYISFVDDFTRLAHIYPLKTKGEAFSSFSQYKLLVANRFEKKIK
          LN S  L  C  C   KSH++PFS S   + +PLE ++ D+W  SPI+SI  Y+YY+ FVD FTR   +YPLK K +   +F  +K LV NRF+ +I 
Subjt:  TFLNES--LHFCDACQKGKSHRLPFSRSVSHTWQPLELVHCDLWGPSPIVSIVGYKYYISFVDDFTRLAHIYPLKTKGEAFSSFSQYKLLVANRFEKKIK

Query:  TLQTDWGGEFRSFTSFLRDNGIEFRHSCPHTSQQNGIVERKHRHIVEMGLTLLAQASMPLTYWWEAFSSAVYIINCLPTPILGDVSPWEQAFH-------
        TL +D GGEF     +L  +GI    S PHT + NG+ ERKHRHIVEMGLTLL+ AS+P TYW  AFS AVY+IN LPTP+L   SP+++ F        
Subjt:  TLQTDWGGEFRSFTSFLRDNGIEFRHSCPHTSQQNGIVERKHRHIVEMGLTLLAQASMPLTYWWEAFSSAVYIINCLPTPILGDVSPWEQAFH-------

Query:  --------YPCLRLYQSHKFQHHSTKCVFLGYSLAHKGYKCLS-SSGRLFISCHVVFNESEFPFK----------------------------SELV---
                YP LR Y  HK +  S +C F+GYSL    Y CL   +GRL+ S HV F+E  FPF                             + LV   
Subjt:  --------YPCLRLYQSHKFQHHSTKCVFLGYSLAHKGYKCLS-SSGRLFISCHVVFNESEFPFK----------------------------SELV---

Query:  -------------PSSGPSNIALVPHHSSPVAEFQTSSPTLSTPPQDQYVSPQLSA--------------------HSPEGHSCPASVMPL----YTSSM
                     P S PS +      SS +     SSP+ S P    +  PQ +A                    +SP  +S P    PL     +S  
Subjt:  -------------PSSGPSNIALVPHHSSPVAEFQTSSPTLSTPPQDQYVSPQLSA--------------------HSPEGHSCPASVMPL----YTSSM

Query:  LPVDGSSDGSPELPLQISTA---------------------LNSHPMQTRAKSGIFKQKDWGAFLVNSSFSPEVEPTSVKEALKSSQWKAAMNDEIAVLT
        +P   +S   P  P   ST+                     +N+H M TRAK GI K      +   +S +   EP +  +A+K  +W+ AM  EI    
Subjt:  LPVDGSSDGSPELPLQISTA---------------------LNSHPMQTRAKSGIFKQKDWGAFLVNSSFSPEVEPTSVKEALKSSQWKAAMNDEIAVLT

Query:  RNKTWTLV-PLLPNLNLIGSKWIFKVKRKSNSSFDRCKARLVAQGFNQVPGVDFHETFSPVVKAPTIQIILAVAVMKNWSIRQLDVNNIFLNGRLQEAVY
         N TW LV P  P++ ++G +WIF  K  S+ S +R KARLVA+G+NQ PG+D+ ETFSPV+K+ +I+I+L VAV ++W IRQLDVNN FL G L + VY
Subjt:  RNKTWTLV-PLLPNLNLIGSKWIFKVKRKSNSSFDRCKARLVAQGFNQVPGVDFHETFSPVVKAPTIQIILAVAVMKNWSIRQLDVNNIFLNGRLQEAVY

Query:  MRQLTGYIDQSCPDYVCKLDKALYGLRQASRA
        M Q  G++D+  PDYVC+L KA+YGL+QA RA
Subjt:  MRQLTGYIDQSCPDYVCKLDKALYGLRQASRA

Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)4.6e-0822.86Show/hide
Query:  LDDKNYLLWKDMVLAILRGQKVDMYVLGTKAQPSELIETTTESGKMLLSNMLYEEWMTVDQALSGWLFGSMSP-AIAADVINFKTLREVWKALEEVYGDT
        +++ NY  W+++ L       V  ++ GT                +L +N     W   D  +   L+G+++P       +   T R++W  ++  + + 
Subjt:  LDDKNYLLWKDMVLAILRGQKVDMYVLGTKAQPSELIETTTESGKMLLSNMLYEEWMTVDQALSGWLFGSMSP-AIAADVINFKTLREVWKALEEVYGDT

Query:  SKACVNQFRGILQNTKKGSMKMIDYLAIMKQASENLKLVGNPVSLDDLVSYVLAGLDSEYIPIVCAIDDKD-LKTWQELSSILINFEGTLARYSTPTNAH
          A   +    L+    G M++ DY   MK+ +++L+ V  PV+  +LV YVL GL+ ++  I+  I  +    ++ + +++L   E  L R   P   H
Subjt:  SKACVNQFRGILQNTKKGSMKMIDYLAIMKQASENLKLVGNPVSLDDLVSYVLAGLDSEYIPIVCAIDDKD-LKTWQELSSILINFEGTLARYSTPTNAH

Query:  FDLPDLATHLALNRQSMFDN-QRQFNPSNGNRGNDNNSGSYYGSG
         D    +T LA +      N QR      G RG    +  + G G
Subjt:  FDLPDLATHLALNRQSMFDN-QRQFNPSNGNRGNDNNSGSYYGSG

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.6e-3545Show/hide
Query:  EPTSVKEALKSSQWKAAMNDEIAVLTRNKTWTLVPLLPNLNLIGSKWIFKVKRKSNSSFDRCKARLVAQGFNQVPGVDFHETFSPVVKAPTIQIILAVAV
        EP++  EA +   W  AM+DEI  +    TW +  L PN   IG KW++K+K  S+ + +R KARLVA+G+ Q  G+DF ETFSPV K  ++++ILA++ 
Subjt:  EPTSVKEALKSSQWKAAMNDEIAVLTRNKTWTLVPLLPNLNLIGSKWIFKVKRKSNSSFDRCKARLVAQGFNQVPGVDFHETFSPVVKAPTIQIILAVAV

Query:  MKNWSIRQLDVNNIFLNGRLQEAVYMRQLTGYI----DQSCPDYVCKLDKALYGLRQASR
        + N+++ QLD++N FLNG L E +YM+   GY     D   P+ VC L K++YGL+QASR
Subjt:  MKNWSIRQLDVNNIFLNGRLQEAVYMRQLTGYI----DQSCPDYVCKLDKALYGLRQASR

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)7.5e-1124.12Show/hide
Query:  LTVKLDDKNYLLWKDMVLAILRGQKVDMYVLGTKAQPSELIETTTESGKMLLSNMLYEEWMTVDQALSGWLFGSMSPAIAADVINFK-TLREVWKALEEV
        +T+ L+  NY +W+++   +     V  ++ G+ + P+ + E               + W   D  +  W++G+++ ++   +I    T R++W +LE +
Subjt:  LTVKLDDKNYLLWKDMVLAILRGQKVDMYVLGTKAQPSELIETTTESGKMLLSNMLYEEWMTVDQALSGWLFGSMSPAIAADVINFK-TLREVWKALEEV

Query:  YGDTSKACVNQFRGILQNTKKGSMKMIDYLAIMKQASENLKLVGNPVSLDDLVSYVLAGLDSEYIPIVCAIDDKD-LKTWQELSSILINFEGTLARYSTP
        + D  +A   QF   L+ T    + + +Y   +K  S+ L  V +P+S   LV ++L GL  +Y  I+  I  K    ++ E  S+L+  E  L+  S  
Subjt:  YGDTSKACVNQFRGILQNTKKGSMKMIDYLAIMKQASENLKLVGNPVSLDDLVSYVLAGLDSEYIPIVCAIDDKD-LKTWQELSSILINFEGTLARYSTP

Query:  TNAHFDLPDLATHL--ALNRQSMFDNQRQFNPSNGNRGNDNNSGSYYGSGNGQMGNN
        + +H + P L+  L     +Q  +  +   N SN  RG         GS +G+  NN
Subjt:  TNAHFDLPDLATHL--ALNRQSMFDNQRQFNPSNGNRGNDNNSGSYYGSGNGQMGNN

ATMG00300.1 Gag-Pol-related retrotransposon family protein9.8e-1136.84Show/hide
Query:  RNDLYQIELPSIQTPKSEIRSTS--FAGLWHNRLGHASSKVIKSVLKSCNVSTFLNESLHFCDACQKGKSHRLPFSRSVSHTWQPLELVHCDLWG
        R+D   I   S++T +S +  T+     LWH+RL H S + ++ ++K   + +    SL FC+ C  GK+HR+ FS     T  PL+ VH DLWG
Subjt:  RNDLYQIELPSIQTPKSEIRSTS--FAGLWHNRLGHASSKVIKSVLKSCNVSTFLNESLHFCDACQKGKSHRLPFSRSVSHTWQPLELVHCDLWG

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)3.5e-2448.03Show/hide
Query:  MQTRAKSGIFKQKDWGAFLVNSSFSPEVEPTSVKEALKSSQWKAAMNDEIAVLTRNKTWTLVPLLPNLNLIGSKWIFKVKRKSNSSFDRCKARLVAQGFN
        M TR+K+GI K     +  + ++   + EP SV  ALK   W  AM +E+  L+RNKTW LVP   N N++G KW+FK K  S+ + DR KARLVA+GF+
Subjt:  MQTRAKSGIFKQKDWGAFLVNSSFSPEVEPTSVKEALKSSQWKAAMNDEIAVLTRNKTWTLVPLLPNLNLIGSKWIFKVKRKSNSSFDRCKARLVAQGFN

Query:  QVPGVDFHETFSPVVKAPTIQIILAVA
        Q  G+ F ET+SPVV+  TI+ IL VA
Subjt:  QVPGVDFHETFSPVVKAPTIQIILAVA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTATCTTCGTCCTCATCTTCGTCAGCAGCAACAGCAGCTGTTGCAGCATCTTCCTCTGCAACAAATATTATCAGTTCTTCATTGGGGCATCCGCTGAGTACGGTTCT
TACCGTAAAGCTTGATGATAAGAACTATCTATTGTGGAAAGATATGGTTCTGGCTATTCTTAGAGGTCAGAAAGTTGATATGTATGTCTTGGGGACAAAAGCCCAACCCT
CAGAGTTGATTGAGACCACGACTGAGTCAGGTAAGATGTTACTTTCGAATATGCTTTATGAGGAATGGATGACAGTGGATCAGGCGCTTTCTGGCTGGCTGTTTGGCTCG
ATGTCTCCAGCTATTGCCGCAGATGTGATAAATTTCAAGACATTACGTGAAGTTTGGAAAGCTTTGGAGGAGGTTTATGGCGACACAAGCAAGGCGTGCGTGAATCAGTT
CAGAGGTATACTTCAGAATACAAAGAAAGGCTCCATGAAGATGATCGATTACCTGGCGATCATGAAACAAGCATCAGAGAATTTAAAACTTGTCGGTAACCCTGTTTCTC
TTGATGATTTAGTGTCCTATGTCCTGGCTGGATTAGATTCTGAGTACATTCCAATTGTGTGTGCGATAGATGATAAAGATTTAAAAACTTGGCAAGAACTTAGCTCGATT
CTGATAAATTTTGAAGGAACTTTGGCTCGATATTCCACTCCTACTAATGCTCATTTTGACCTACCTGATTTAGCTACTCATTTAGCTCTTAATAGACAAAGCATGTTCGA
TAATCAAAGGCAATTTAACCCTAGTAATGGAAATCGAGGGAATGACAATAATTCTGGGAGTTATTATGGTTCTGGAAATGGTCAAATGGGCAACAATCCTACTCAAGGAA
CTGCAATGGTGGAAATAGAGGAGGAAGAGGTCGAGGGCGCAACAATTTTCAGAGAGGAAATAACAACGTGCTACATGAGGTTTGAAGAACATTTTAACAATCTGCACGCT
TCTGGTAATGGTTCTGTGCAAGGAAACAACTCAAACAATGCCAGTTCTTCGGCTTACATTGCAACCCCAGAAATTCTTCATGATCCAAAGTGGTTGGCTGATAGCGGAGC
TACCAACCATGTCACGGCTAATGCAAGGAATCTTGCAGTAAAGATGGACTATAATGTTTTGAATAATATTCTTCATGTGCCTGAAATTAGAAAGAATTTGATAAGCATTG
CTAGTTTGACTGTCGATAATAATGTTGTGGTTGAGTTTCATTCTAACTATTGTGTTGTGAAGGACAAGGCTTCAAAGAAGGTGATGTTGCACGAAATTCTTAGAAACGAC
CTCTACCAAATCGAGCTTCCTTCAATACAAACTCCAAAGTCTGAAATCAGGTCAACTTCCTTTGCTGGTTTATGGCATAATCGTTTGGGTCATGCGTCTTCTAAAGTTAT
CAAAAGTGTCCTAAAGTCCTGTAATGTTTCAACTTTTTTGAATGAAAGTCTTCATTTCTGTGATGCTTGTCAAAAAGGAAAGTCTCATCGCTTACCATTTTCTCGTTCTG
TTTCACACACTTGGCAACCACTTGAACTAGTACACTGTGATCTTTGGGGCCCTTCCCCTATTGTATCCATTGTTGGTTATAAATACTACATTAGTTTCGTGGACGATTTC
ACTAGACTTGCCCACATCTATCCTCTTAAAACTAAAGGGGAGGCATTTTCTTCTTTCTCTCAATACAAACTTTTAGTTGCGAATCGTTTTGAGAAGAAAATTAAAACTCT
TCAAACAGATTGGGGGGGTGAGTTTCGATCCTTTACTTCATTTCTTAGAGATAATGGTATTGAGTTTCGTCATTCATGTCCTCACACTAGTCAGCAAAATGGAATAGTTG
AGCGTAAGCATAGGCACATTGTTGAAATGGGGTTGACTCTTTTAGCTCAAGCCTCGATGCCTTTAACCTACTGGTGGGAGGCTTTTTCTTCTGCTGTCTACATTATAAAT
TGCCTCCCTACTCCTATCCTAGGTGATGTTTCTCCATGGGAGCAAGCCTTTCATTATCCCTGTTTAAGGCTTTACCAATCACATAAATTTCAGCATCATAGCACAAAATG
TGTCTTTTTGGGTTATAGTCTTGCACACAAAGGTTATAAGTGTTTAAGCTCAAGTGGTCGTCTCTTTATCTCCTGTCATGTTGTGTTTAATGAGTCTGAATTTCCCTTTA
AATCTGAATTAGTTCCCTCATCCGGTCCTTCAAATATAGCTCTTGTTCCTCATCATTCTTCTCCTGTTGCTGAATTCCAGACATCATCACCTACTTTGTCGACTCCTCCT
CAAGATCAGTATGTTAGTCCACAACTAAGTGCTCACAGTCCAGAAGGTCATTCTTGTCCTGCATCTGTTATGCCATTATATACCTCATCTATGTTACCAGTTGATGGCTC
GTCTGATGGTTCTCCTGAGCTTCCTCTTCAGATATCCACTGCTTTGAACAGTCATCCTATGCAGACTAGAGCGAAAAGTGGAATTTTTAAACAAAAGGACTGGGGTGCAT
TTCTAGTCAACAGTTCTTTTTCTCCAGAGGTTGAACCTACATCAGTTAAGGAGGCGCTTAAGTCAAGTCAGTGGAAGGCAGCAATGAATGATGAAATAGCAGTTTTAACT
CGTAACAAAACATGGACTCTTGTCCCTCTTCTGCCCAACTTAAACTTAATTGGTTCTAAATGGATTTTCAAAGTTAAACGTAAGTCAAACAGCTCGTTTGATCGTTGTAA
GGCCAGATTGGTTGCTCAAGGATTCAATCAAGTTCCAGGAGTTGATTTTCATGAGACGTTTAGTCCAGTGGTTAAAGCTCCAACGATTCAAATCATACTTGCTGTGGCGG
TTATGAAAAATTGGTCCATTCGTCAGTTGGATGTCAATAACATTTTTCTCAATGGCAGGTTGCAAGAAGCTGTTTACATGAGGCAACTGACTGGATATATTGATCAGTCT
TGTCCTGATTATGTGTGTAAGCTTGATAAAGCTTTGTATGGTCTTCGACAAGCTTCTCGGGCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTATCTTCGTCCTCATCTTCGTCAGCAGCAACAGCAGCTGTTGCAGCATCTTCCTCTGCAACAAATATTATCAGTTCTTCATTGGGGCATCCGCTGAGTACGGTTCT
TACCGTAAAGCTTGATGATAAGAACTATCTATTGTGGAAAGATATGGTTCTGGCTATTCTTAGAGGTCAGAAAGTTGATATGTATGTCTTGGGGACAAAAGCCCAACCCT
CAGAGTTGATTGAGACCACGACTGAGTCAGGTAAGATGTTACTTTCGAATATGCTTTATGAGGAATGGATGACAGTGGATCAGGCGCTTTCTGGCTGGCTGTTTGGCTCG
ATGTCTCCAGCTATTGCCGCAGATGTGATAAATTTCAAGACATTACGTGAAGTTTGGAAAGCTTTGGAGGAGGTTTATGGCGACACAAGCAAGGCGTGCGTGAATCAGTT
CAGAGGTATACTTCAGAATACAAAGAAAGGCTCCATGAAGATGATCGATTACCTGGCGATCATGAAACAAGCATCAGAGAATTTAAAACTTGTCGGTAACCCTGTTTCTC
TTGATGATTTAGTGTCCTATGTCCTGGCTGGATTAGATTCTGAGTACATTCCAATTGTGTGTGCGATAGATGATAAAGATTTAAAAACTTGGCAAGAACTTAGCTCGATT
CTGATAAATTTTGAAGGAACTTTGGCTCGATATTCCACTCCTACTAATGCTCATTTTGACCTACCTGATTTAGCTACTCATTTAGCTCTTAATAGACAAAGCATGTTCGA
TAATCAAAGGCAATTTAACCCTAGTAATGGAAATCGAGGGAATGACAATAATTCTGGGAGTTATTATGGTTCTGGAAATGGTCAAATGGGCAACAATCCTACTCAAGGAA
CTGCAATGGTGGAAATAGAGGAGGAAGAGGTCGAGGGCGCAACAATTTTCAGAGAGGAAATAACAACGTGCTACATGAGGTTTGAAGAACATTTTAACAATCTGCACGCT
TCTGGTAATGGTTCTGTGCAAGGAAACAACTCAAACAATGCCAGTTCTTCGGCTTACATTGCAACCCCAGAAATTCTTCATGATCCAAAGTGGTTGGCTGATAGCGGAGC
TACCAACCATGTCACGGCTAATGCAAGGAATCTTGCAGTAAAGATGGACTATAATGTTTTGAATAATATTCTTCATGTGCCTGAAATTAGAAAGAATTTGATAAGCATTG
CTAGTTTGACTGTCGATAATAATGTTGTGGTTGAGTTTCATTCTAACTATTGTGTTGTGAAGGACAAGGCTTCAAAGAAGGTGATGTTGCACGAAATTCTTAGAAACGAC
CTCTACCAAATCGAGCTTCCTTCAATACAAACTCCAAAGTCTGAAATCAGGTCAACTTCCTTTGCTGGTTTATGGCATAATCGTTTGGGTCATGCGTCTTCTAAAGTTAT
CAAAAGTGTCCTAAAGTCCTGTAATGTTTCAACTTTTTTGAATGAAAGTCTTCATTTCTGTGATGCTTGTCAAAAAGGAAAGTCTCATCGCTTACCATTTTCTCGTTCTG
TTTCACACACTTGGCAACCACTTGAACTAGTACACTGTGATCTTTGGGGCCCTTCCCCTATTGTATCCATTGTTGGTTATAAATACTACATTAGTTTCGTGGACGATTTC
ACTAGACTTGCCCACATCTATCCTCTTAAAACTAAAGGGGAGGCATTTTCTTCTTTCTCTCAATACAAACTTTTAGTTGCGAATCGTTTTGAGAAGAAAATTAAAACTCT
TCAAACAGATTGGGGGGGTGAGTTTCGATCCTTTACTTCATTTCTTAGAGATAATGGTATTGAGTTTCGTCATTCATGTCCTCACACTAGTCAGCAAAATGGAATAGTTG
AGCGTAAGCATAGGCACATTGTTGAAATGGGGTTGACTCTTTTAGCTCAAGCCTCGATGCCTTTAACCTACTGGTGGGAGGCTTTTTCTTCTGCTGTCTACATTATAAAT
TGCCTCCCTACTCCTATCCTAGGTGATGTTTCTCCATGGGAGCAAGCCTTTCATTATCCCTGTTTAAGGCTTTACCAATCACATAAATTTCAGCATCATAGCACAAAATG
TGTCTTTTTGGGTTATAGTCTTGCACACAAAGGTTATAAGTGTTTAAGCTCAAGTGGTCGTCTCTTTATCTCCTGTCATGTTGTGTTTAATGAGTCTGAATTTCCCTTTA
AATCTGAATTAGTTCCCTCATCCGGTCCTTCAAATATAGCTCTTGTTCCTCATCATTCTTCTCCTGTTGCTGAATTCCAGACATCATCACCTACTTTGTCGACTCCTCCT
CAAGATCAGTATGTTAGTCCACAACTAAGTGCTCACAGTCCAGAAGGTCATTCTTGTCCTGCATCTGTTATGCCATTATATACCTCATCTATGTTACCAGTTGATGGCTC
GTCTGATGGTTCTCCTGAGCTTCCTCTTCAGATATCCACTGCTTTGAACAGTCATCCTATGCAGACTAGAGCGAAAAGTGGAATTTTTAAACAAAAGGACTGGGGTGCAT
TTCTAGTCAACAGTTCTTTTTCTCCAGAGGTTGAACCTACATCAGTTAAGGAGGCGCTTAAGTCAAGTCAGTGGAAGGCAGCAATGAATGATGAAATAGCAGTTTTAACT
CGTAACAAAACATGGACTCTTGTCCCTCTTCTGCCCAACTTAAACTTAATTGGTTCTAAATGGATTTTCAAAGTTAAACGTAAGTCAAACAGCTCGTTTGATCGTTGTAA
GGCCAGATTGGTTGCTCAAGGATTCAATCAAGTTCCAGGAGTTGATTTTCATGAGACGTTTAGTCCAGTGGTTAAAGCTCCAACGATTCAAATCATACTTGCTGTGGCGG
TTATGAAAAATTGGTCCATTCGTCAGTTGGATGTCAATAACATTTTTCTCAATGGCAGGTTGCAAGAAGCTGTTTACATGAGGCAACTGACTGGATATATTGATCAGTCT
TGTCCTGATTATGTGTGTAAGCTTGATAAAGCTTTGTATGGTCTTCGACAAGCTTCTCGGGCCTAG
Protein sequenceShow/hide protein sequence
MLSSSSSSSAATAAVAASSSATNIISSSLGHPLSTVLTVKLDDKNYLLWKDMVLAILRGQKVDMYVLGTKAQPSELIETTTESGKMLLSNMLYEEWMTVDQALSGWLFGS
MSPAIAADVINFKTLREVWKALEEVYGDTSKACVNQFRGILQNTKKGSMKMIDYLAIMKQASENLKLVGNPVSLDDLVSYVLAGLDSEYIPIVCAIDDKDLKTWQELSSI
LINFEGTLARYSTPTNAHFDLPDLATHLALNRQSMFDNQRQFNPSNGNRGNDNNSGSYYGSGNGQMGNNPTQGTAMVEIEEEEVEGATIFREEITTCYMRFEEHFNNLHA
SGNGSVQGNNSNNASSSAYIATPEILHDPKWLADSGATNHVTANARNLAVKMDYNVLNNILHVPEIRKNLISIASLTVDNNVVVEFHSNYCVVKDKASKKVMLHEILRND
LYQIELPSIQTPKSEIRSTSFAGLWHNRLGHASSKVIKSVLKSCNVSTFLNESLHFCDACQKGKSHRLPFSRSVSHTWQPLELVHCDLWGPSPIVSIVGYKYYISFVDDF
TRLAHIYPLKTKGEAFSSFSQYKLLVANRFEKKIKTLQTDWGGEFRSFTSFLRDNGIEFRHSCPHTSQQNGIVERKHRHIVEMGLTLLAQASMPLTYWWEAFSSAVYIIN
CLPTPILGDVSPWEQAFHYPCLRLYQSHKFQHHSTKCVFLGYSLAHKGYKCLSSSGRLFISCHVVFNESEFPFKSELVPSSGPSNIALVPHHSSPVAEFQTSSPTLSTPP
QDQYVSPQLSAHSPEGHSCPASVMPLYTSSMLPVDGSSDGSPELPLQISTALNSHPMQTRAKSGIFKQKDWGAFLVNSSFSPEVEPTSVKEALKSSQWKAAMNDEIAVLT
RNKTWTLVPLLPNLNLIGSKWIFKVKRKSNSSFDRCKARLVAQGFNQVPGVDFHETFSPVVKAPTIQIILAVAVMKNWSIRQLDVNNIFLNGRLQEAVYMRQLTGYIDQS
CPDYVCKLDKALYGLRQASRA