; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0015285 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0015285
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr12:9627226..9636320
RNA-Seq ExpressionLag0015285
SyntenyLag0015285
Gene Ontology termsNA
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU17915.1 hypothetical protein TSUD_330400, partial [Trifolium subterraneum]1.6e-10331.23Show/hide
Query:  SNPSLNQLLNQIMMIKLDRSNFLLWKNLAMPILRSYKLEGHRLGENPCPPQFVQPSVGISTSSDAPAEEGATSTQSGAIGASASVSAAKVVNPLFDSWVA
        +N S    L   + +KLDR N+ LWK++ +PI+R  +L+G+ LG+  CP +F+       T++D                      ++K  NP F+ W A
Subjt:  SNPSLNQLLNQIMMIKLDRSNFLLWKNLAMPILRSYKLEGHRLGENPCPPQFVQPSVGISTSSDAPAEEGATSTQSGAIGASASVSAAKVVNPLFDSWVA

Query:  VDQLLLGWLYNSMTPEVATQVMGFDNAQDLWAAIQDLFGVQSHAEEDYLRQVFQQCQKGSLKMVDYLRVMKSYADNLGQVGSPVTSRSLVSQVLLGLDEE
         DQ LLGWL NSMT  +ATQ++  + +  LW     L G  + ++  YL+  F   +KG +KM DYL  MK+ AD L   G+P+++  L+ Q L GLD E
Subjt:  VDQLLLGWLYNSMTPEVATQVMGFDNAQDLWAAIQDLFGVQSHAEEDYLRQVFQQCQKGSLKMVDYLRVMKSYADNLGQVGSPVTSRSLVSQVLLGLDEE

Query:  YNPVVAMVQGRESTTWSELQAELLVFEKQLEMQTNLKSNVSFSGGVSVNMANNKEGGNQRGQQSFPNNRQNYSGRGGHRGGGGTRGRGRARGYGSFNHKP
        YNPVV  +  + +  W +LQA+LL FE ++E   NL +N++ +   + N+AN     N RG +   NN    S   G RGG   RGRGR       + K 
Subjt:  YNPVVAMVQGRESTTWSELQAELLVFEKQLEMQTNLKSNVSFSGGVSVNMANNKEGGNQRGQQSFPNNRQNYSGRGGHRGGGGTRGRGRARGYGSFNHKP

Query:  ICQVCGKVGHIALMCYQRFNKEFFGPSQVQNRNDGNNNRQNTQTQPTAFVANQNSNSFLATPESVIDPNWYADNGESNHVTSDYNCIANPANYEGNERVI
         CQVCG+  HIA+ C+ RF+K +      ++ +  NN++Q +             N FLA+  SV D +WY D+G SNHVT   N   + A + G   ++
Subjt:  ICQVCGKVGHIALMCYQRFNKEFFGPSQVQNRNDGNNNRQNTQTQPTAFVANQNSNSFLATPESVIDPNWYADNGESNHVTSDYNCIANPANYEGNERVI

Query:  IGDGHKLQISHVGNS----CLIDGYNNLSLENVLCVPTIAK---------NLVSVSKLARDN-------------HVYVEFQDDFC-------LVKTKDT
        +G+G KL+I   G +     L DG   LS ++     ++ +         N   + +L   +               YV F DDF        L +  DT
Subjt:  IGDGHKLQISHVGNS----CLIDGYNNLSLENVLCVPTIAK---------NLVSVSKLARDN-------------HVYVEFQDDFC-------LVKTKDT

Query:  G------KVLPRGTLN--------DGLYQFEHVKATSV-----------YVSKTANQDPSRVNKVVSPVFVGL----------------SSVNLVE--SK
               K +     N        DG  +++ V+  ++           Y S+  N    R ++ ++   + L                ++V L+     
Subjt:  G------KVLPRGTLN--------DGLYQFEHVKATSV-----------YVSKTANQDPSRVNKVVSPVFVGL----------------SSVNLVE--SK

Query:  AVLNGKSPMEQLLDKKADVKALRTFGCACFPCLRPYQAHKFQFHTEKCIYLGTSPVQKGHRCLNSNGRIFIPRHVRFNENEFPFRDGFGVTSGMSGSSSA
         V + +SP   L  K+ D  +L+ FGCAC+PCL+PY  HK QFHT KC++LG S   KG++C+NS+GR+FI RHV FNE+ FPF DGF          + 
Subjt:  AVLNGKSPMEQLLDKKADVKALRTFGCACFPCLRPYQAHKFQFHTEKCIYLGTSPVQKGHRCLNSNGRIFIPRHVRFNENEFPFRDGFGVTSGMSGSSSA

Query:  TLPSPNITMWFPYPNDPVSFVSSTNPSTSTDDPPSTPLQPNASLQSPCPNLQSATPRIQNQTQDQSCPLMNHEPNASNSASTSIASTQQPQPTSPQTTSS
         +P   +T   P  + P+     T+ ST + +      Q +  L                 TQD    +                              +
Subjt:  TLPSPNITMWFPYPNDPVSFVSSTNPSTSTDDPPSTPLQPNASLQSPCPNLQSATPRIQNQTQDQSCPLMNHEPNASNSASTSIASTQQPQPTSPQTTSS

Query:  SPNPPTEPSATQIAQPPPVQAIHPMVTRGKAGIFKPKL-WMSQSKIDWTITEPTRITKALATSQWHKAMDQEYSTLVSNNTWHLVSPSSSYNVVGNKWIF
        +P+  T P   + +        H M TR K GI KPKL ++  ++      EP    +AL   +W +AM +E+  L++N TW L+      +++ ++W+F
Subjt:  SPNPPTEPSATQIAQPPPVQAIHPMVTRGKAGIFKPKL-WMSQSKIDWTITEPTRITKALATSQWHKAMDQEYSTLVSNNTWHLVSPSSSYNVVGNKWIF

Query:  RIKRNSDGSVQRFKARLVAKRFHQSSGVDFFETFSPVVKASTIRVVLSIVV
        +IK  +DG+++R KARLVAK F Q++G+ + ETFSPVVKASTIR++LSI V
Subjt:  RIKRNSDGSVQRFKARLVAKRFHQSSGVDFFETFSPVVKASTIRVVLSIVV

GAU19483.1 hypothetical protein TSUD_77270 [Trifolium subterraneum]3.2e-11231.39Show/hide
Query:  IKLDRSNFLLWKNLAMPILRSYKLEGHRLGENPCPPQFVQPSVGISTSSDAPAEEGATSTQSGAIGASASVSAAKVVNPLFDSWVAVDQLLLGWLYNSMT
        +KLDR+N+ LWK+L +P++R  KL+G+ LG   CP +F+       TSSD                      ++K  N  F  W A DQ LLGW+ NSMT
Subjt:  IKLDRSNFLLWKNLAMPILRSYKLEGHRLGENPCPPQFVQPSVGISTSSDAPAEEGATSTQSGAIGASASVSAAKVVNPLFDSWVAVDQLLLGWLYNSMT

Query:  PEVATQVMGFDNAQDLWAAIQDLFGVQSHAEEDYLRQVFQQCQKGSLKMVDYLRVMKSYADNLGQVGSPVTSRSLVSQVLLGLDEEYNPVVAMVQGREST
         E+ATQ++  + ++ LW   Q L G  + ++  YL+  F   +KG +KM DYL  MK+  D L   G+PV++  L+ Q L GLD EYNPVV  +  + + 
Subjt:  PEVATQVMGFDNAQDLWAAIQDLFGVQSHAEEDYLRQVFQQCQKGSLKMVDYLRVMKSYADNLGQVGSPVTSRSLVSQVLLGLDEEYNPVVAMVQGREST

Query:  TWSELQAELLVFEKQLEMQTNLKSNVSFSGGVSVNMANNKEGGNQRGQQSFPNNRQNYSGRGGHRGGGGTRGRGRARGYGSFNHKPICQVCGKVGHIALM
        +W +LQA+LL FE ++E   NL +N++ +   + N+AN  +    RG+ S  NN    S   G RGG   RGRG+       + K  CQVCG   HIA+ 
Subjt:  TWSELQAELLVFEKQLEMQTNLKSNVSFSGGVSVNMANNKEGGNQRGQQSFPNNRQNYSGRGGHRGGGGTRGRGRARGYGSFNHKPICQVCGKVGHIALM

Query:  CYQRFNKEFFGPSQVQNRNDGNNNRQNTQTQPTAFVANQNSNSFLATPESVIDPNWYADNGESNHVTSDYNCIANPANYEGNERVIIGDGHKLQISHVGN
        C+ RF+K +      ++ +   +++Q +             N+FLA+  SV D +WY D+G SNHVT       +   + G   +++G+G KL I   G+
Subjt:  CYQRFNKEFFGPSQVQNRNDGNNNRQNTQTQPTAFVANQNSNSFLATPESVIDPNWYADNGESNHVTSDYNCIANPANYEGNERVIIGDGHKLQISHVGN

Query:  SCLIDGYNNLSLENVLCVPTIAKNLVSVSKLARDNHVYVEFQDDFCLVKTKDTGKVLPRGTLNDGLYQF-------------------------------
        S L     +L+L ++L VP I KNL+SVSKLA DN++ VEF ++ C VK K TGKV+ +G L DGLYQ                                
Subjt:  SCLIDGYNNLSLENVLCVPTIAKNLVSVSKLARDNHVYVEFQDDFCLVKTKDTGKVLPRGTLNDGLYQF-------------------------------

Query:  ------------------------------------------------------------------------------------EHVKATSVYVSKTANQ
                                                                                            E V+A   + + T NQ
Subjt:  ------------------------------------------------------------------------------------EHVKATSVYVSKTANQ

Query:  DPSRVN----------KVVSPVFV------------------------------GLS---------------------SVNLVESKAVLNGKSPMEQLLD
           R+           K V  + V                              GL+                      +N + S+   N +SP   +L 
Subjt:  DPSRVN----------KVVSPVFV------------------------------GLS---------------------SVNLVESKAVLNGKSPMEQLLD

Query:  KKADVKALRTFGCACFPCLRPYQAHKFQFHTEKCIYLGTSPVQKGHRCLNSNGRIFIPRHVRFNENEFPFRDGFGVTSGMSGSSSATLPSPNITMWFPYP
        K+ D K L+TFGCAC+PCL+PY  HK Q+HT +C++LG S   KG++CLNS+GRIFI RHV FNE+ FPF DG                           
Subjt:  KKADVKALRTFGCACFPCLRPYQAHKFQFHTEKCIYLGTSPVQKGHRCLNSNGRIFIPRHVRFNENEFPFRDGFGVTSGMSGSSSATLPSPNITMWFPYP

Query:  NDPVSFVSSTNPSTSTDDPPSTPLQPNASLQSPCPNLQSATPRIQNQTQDQSCPLMNHEPNASNSASTSIASTQQPQPTSPQTTSS-SPNPPTEPSATQI
             F+++ +P  +T + PST              L +A     N   D S P++     A N A T+   +Q     + QT +  S +  T      I
Subjt:  NDPVSFVSSTNPSTSTDDPPSTPLQPNASLQSPCPNLQSATPRIQNQTQDQSCPLMNHEPNASNSASTSIASTQQPQPTSPQTTSS-SPNPPTEPSATQI

Query:  AQPPPV-------QAIHPMVTRGKAGIFKPKL-WMSQSKIDWTITEPTRITKALATSQWHKAMDQEYSTLVSNNTWHLVSPSSSYNVVGNKWIFRIKRNS
         Q   V          H + TR K+GI KPKL ++  ++      EP    +AL+   W +AM +E+  L+SN TW LV   +  N+V +KW+F+ K   
Subjt:  AQPPPV-------QAIHPMVTRGKAGIFKPKL-WMSQSKIDWTITEPTRITKALATSQWHKAMDQEYSTLVSNNTWHLVSPSSSYNVVGNKWIFRIKRNS

Query:  DGSVQRFKARLVAKRFHQSSGVDFFETFSPVVKASTIRVVLSIVV
        DGS++R KARLVAK F Q++G+D+ ETFSPV+KAST+R++LSI V
Subjt:  DGSVQRFKARLVAKRFHQSSGVDFFETFSPVVKASTIRVVLSIVV

GAU51268.1 hypothetical protein TSUD_412550 [Trifolium subterraneum]4.8e-10831.79Show/hide
Query:  SNPSLNQLLNQIMMIKLDRSNFLLWKNLAMPILRSYKLEGHRLGENPCPPQFVQPSVGISTSSDAPAEEGATSTQSGAIGASASVSAAKVVNPLFDSWVA
        ++P  N  L  I+ +KLDR N+ LWK+L + ++R  KL+G+ LG   CP QFV       TS+D                       +K VNP F  W+A
Subjt:  SNPSLNQLLNQIMMIKLDRSNFLLWKNLAMPILRSYKLEGHRLGENPCPPQFVQPSVGISTSSDAPAEEGATSTQSGAIGASASVSAAKVVNPLFDSWVA

Query:  VDQLLLGWLYNSMTPEVATQVMGFDNAQDLWAAIQDLFGVQSHAEEDYLRQVFQQCQKGSLKMVDYLRVMKSYADNLGQVGSPVTSRSLVSQVLLGLDEE
         DQ LLGWL NSM  ++ATQ++  + ++ LW   Q L G  + +   YL+  F   +KG +KM +YL  MK+ +D L   GSP+++  L+ Q L GLD E
Subjt:  VDQLLLGWLYNSMTPEVATQVMGFDNAQDLWAAIQDLFGVQSHAEEDYLRQVFQQCQKGSLKMVDYLRVMKSYADNLGQVGSPVTSRSLVSQVLLGLDEE

Query:  YNPVVAMVQGRESTTWSELQAELLVFEKQLEMQTNLKSNVSFSG---GVSVNMANNKEGGNQRGQQSFPNNRQNYSGRGGHRGGGGTRGRGRARGYGSFN
        YNPVV  +  + + +W ++QA+LL FE +L+   N      FSG     S N AN  E    RG +   N+R N+  R   RG  G RG+GR       N
Subjt:  YNPVVAMVQGRESTTWSELQAELLVFEKQLEMQTNLKSNVSFSG---GVSVNMANNKEGGNQRGQQSFPNNRQNYSGRGGHRGGGGTRGRGRARGYGSFN

Query:  HKPICQVCGKVGHIALMCYQRFNKEFFGPSQVQNRNDGNNNRQNTQTQPTAFVANQNSNS-FLATPESVIDPNWYADNGESNHVTSDYNCIANPANYEGN
         K  CQVC   GHIA+ C  RF++ + G              +N  T+       Q S+S F+A+P    D  WY D+G +NHVT   +       + G 
Subjt:  HKPICQVCGKVGHIALMCYQRFNKEFFGPSQVQNRNDGNNNRQNTQTQPTAFVANQNSNS-FLATPESVIDPNWYADNGESNHVTSDYNCIANPANYEGN

Query:  ERVIIGDGHKLQISHVGNSCLIDGYNNLSLENVLCVPTIAKNLVSVSKLARDNHVYVEFQDDFCLVKTKDTGKVLPRGTLNDGLYQFEHVKATSVYVS--
          +++G+G KL+I   G++ L    NNL+L +VL VP I KNL+SVSKL  DN++ VEF  + C VK K TG+ L +G L DGLYQ  + K   VY+S  
Subjt:  ERVIIGDGHKLQISHVGNSCLIDGYNNLSLENVLCVPTIAKNLVSVSKLARDNHVYVEFQDDFCLVKTKDTGKVLPRGTLNDGLYQFEHVKATSVYVS--

Query:  -----KTANQDPSRVNKVVSPVFVGLSS------------------------------------------------------------------------
             K  + +   ++KV+    V +S                                                                         
Subjt:  -----KTANQDPSRVNKVVSPVFVGLSS------------------------------------------------------------------------

Query:  ----------VNLVES-----------------KAV--------------------LNGK----------------------------------------
                   NL E+                 KAV                     NG+                                        
Subjt:  ----------VNLVES-----------------KAV--------------------LNGK----------------------------------------

Query:  --------SPMEQLLDKKADVKALRTFGCACFPCLRPYQAHKFQFHTEKCIYLGTSPVQKGHRCLNSNGRIFIPRHVRFNENEFPFRDGFGVTSG----M
                SP   +  ++ D  AL+ FGCAC+PCL+PY  HK QFHT +C+++G S   KG++C+NS+GRIF+ RHV FNEN FPF  GF  T      +
Subjt:  --------SPMEQLLDKKADVKALRTFGCACFPCLRPYQAHKFQFHTEKCIYLGTSPVQKGHRCLNSNGRIFIPRHVRFNENEFPFRDGFGVTSG----M

Query:  SGSSSATLPSPNITMWFPYPNDPVSFVSSTNPSTSTDDPPSTPLQPNASLQSPCPNLQSATPRIQNQTQDQSCPLMNHEPNASNSASTSIASTQQPQPTS
        + +SS  LP+ +                +T       D  +T  Q   S++S   N        +N+ Q  S        N +NS++  I +        
Subjt:  SGSSSATLPSPNITMWFPYPNDPVSFVSSTNPSTSTDDPPSTPLQPNASLQSPCPNLQSATPRIQNQTQDQSCPLMNHEPNASNSASTSIASTQQPQPTS

Query:  PQTTSSSPNPPTEPSATQIAQPPPVQAIHPMVTRGKAGIFKPKL-WMSQSKIDWTITEPTRITKALATSQWHKAMDQEYSTLVSNNTWHLVSPSSSYNVV
            S   N  T     Q          H M TR K GI KPK+ ++  ++ D    EP  + +AL    W +AMD+EY  LVSN+TW LV      N++
Subjt:  PQTTSSSPNPPTEPSATQIAQPPPVQAIHPMVTRGKAGIFKPKL-WMSQSKIDWTITEPTRITKALATSQWHKAMDQEYSTLVSNNTWHLVSPSSSYNVV

Query:  GNKWIFRIKRNSDGSVQRFKARLVAKRFHQSSGVDFFETFSPVVKASTIRVVLSIVV
         +KWIF+ K  SDGS++R KARLVAK F Q++G+DF ETFSPVVK+ST+R++L+I V
Subjt:  GNKWIFRIKRNSDGSVQRFKARLVAKRFHQSSGVDFFETFSPVVKASTIRVVLSIVV

PNX94503.1 putative retrotransposon Ty1-copia subclass protein, partial [Trifolium pratense]1.4e-11230.39Show/hide
Query:  LNQIMMIKLDRSNFLLWKNLAMPILRSYKLEGHRLGENPCPPQFVQPSVGISTSSDAPAEEGATSTQSGAIGASASVSAAKVVNPLFDSWVAVDQLLLGW
        L   + +KLDR NF LWK+L +P++R  K +G+ LG   CP QFV                              S+   + +NP +  W A DQ LLGW
Subjt:  LNQIMMIKLDRSNFLLWKNLAMPILRSYKLEGHRLGENPCPPQFVQPSVGISTSSDAPAEEGATSTQSGAIGASASVSAAKVVNPLFDSWVAVDQLLLGW

Query:  LYNSMTPEVATQVMGFDNAQDLWAAIQDLFGVQSHAEEDYLRQVFQQCQKGSLKMVDYLRVMKSYADNLGQVGSPVTSRSLVSQVLLGLDEEYNPVVAMV
        L NSMT ++ATQV+  + ++ LW   Q L G  + +   YL+  F    K  +KM  YL  MK+ AD L   GSP++S  L+ Q L GLD EYNPVV  +
Subjt:  LYNSMTPEVATQVMGFDNAQDLWAAIQDLFGVQSHAEEDYLRQVFQQCQKGSLKMVDYLRVMKSYADNLGQVGSPVTSRSLVSQVLLGLDEEYNPVVAMV

Query:  QGRESTTWSELQAELLVFEKQLEMQTNLKSNVSFSGGVSVNMANNKEGGNQRGQQSFPNNRQNYSGRGGHRGGG--GTRGRGRARGYGSFNHKPICQVCG
          + + +W + QA+LL FE +L+ Q N  +N++ +   S N A+  E G  +           +  RGG RG    G RG GR R   S   +PICQ+CG
Subjt:  QGRESTTWSELQAELLVFEKQLEMQTNLKSNVSFSGGVSVNMANNKEGGNQRGQQSFPNNRQNYSGRGGHRGGG--GTRGRGRARGYGSFNHKPICQVCG

Query:  KVGHIALMCYQRFNKEFFGPSQVQNRNDGNNNRQNTQTQPTAFVANQNSNS-FLATPESVIDPNWYADNGESNHVTSDYNCIANPANYEGNERVIIGDGH
        K GH A  CY RF+K +                    T+   +   + S+S F+A+P    D  WY D+G SNHVT     + +     G   +++G+G 
Subjt:  KVGHIALMCYQRFNKEFFGPSQVQNRNDGNNNRQNTQTQPTAFVANQNSNS-FLATPESVIDPNWYADNGESNHVTSDYNCIANPANYEGNERVIIGDGH

Query:  KLQISHVGNSCLIDGYNNLSLENVLCVPTIAKNLVSVSKLARDNHVYVEFQDDFCLVKTKDTGKVLPRGTLNDGLYQFE-------------HVKATSVY
        KL+I   G++ L    N+++L NVL VP I KNL+SVSKL  DN+  VEF +++C VK K TGK L +G L DGLYQ               ++    ++
Subjt:  KLQISHVGNSCLIDGYNNLSLENVLCVPTIAKNLVSVSKLARDNHVYVEFQDDFCLVKTKDTGKVLPRGTLNDGLYQFE-------------HVKATSVY

Query:  VSKTANQDPSRVNKVVSPVFVGLSS---------------------------------------------------------------------------
          K  + +   + KV+    V +S                                                                            
Subjt:  VSKTANQDPSRVNKVVSPVFVGLSS---------------------------------------------------------------------------

Query:  -------VNLVESK-------------------------------------------------------------------------------------A
                NLVE++                                                                                     +
Subjt:  -------VNLVESK-------------------------------------------------------------------------------------A

Query:  VLNGKSPMEQLLDKKADVKALRTFGCACFPCLRPYQAHKFQFHTEKCIYLGTSPVQKGHRCLNSNGRIFIPRHVRFNENEFPFRDGFGVTSGMSGSSSAT
        V   +SP   +  K+ D  AL+ FGCAC+PCL+PY  HK QFHT +C++LG S   KG++C+NS+GR+F+ RHV FNEN FPF++GF  T          
Subjt:  VLNGKSPMEQLLDKKADVKALRTFGCACFPCLRPYQAHKFQFHTEKCIYLGTSPVQKGHRCLNSNGRIFIPRHVRFNENEFPFRDGFGVTSGMSGSSSAT

Query:  LPSPNITMWFPYPNDPVSFVSSTNPSTSTDDPPSTPLQPNASLQSPCPNLQSATPRIQNQTQDQSCPLMNHEPNASNSASTSIASTQQPQPTSPQTTSSS
                      +P+  V++  P      P        A       + Q       N   DQS      E    N+ S    +    +    ++    
Subjt:  LPSPNITMWFPYPNDPVSFVSSTNPSTSTDDPPSTPLQPNASLQSPCPNLQSATPRIQNQTQDQSCPLMNHEPNASNSASTSIASTQQPQPTSPQTTSSS

Query:  PNPPTEPSATQIAQPPPVQAI---HPMVTRGKAGIFKPKL-WMSQSKIDWTITEPTRITKALATSQWHKAMDQEYSTLVSNNTWHLVSPSSSYNVVGNKW
          P TE        PPP Q I   H M TR KAG++KPKL ++  ++      EP  +++AL+  +W  AMD EY  L++N TW LV      NV+ +KW
Subjt:  PNPPTEPSATQIAQPPPVQAI---HPMVTRGKAGIFKPKL-WMSQSKIDWTITEPTRITKALATSQWHKAMDQEYSTLVSNNTWHLVSPSSSYNVVGNKW

Query:  IFRIKRNSDGSVQRFKARLVAKRFHQSSGVDFFETFSPVVKASTIRVVLSIVV
        IF+ K  +DG+++R KARLVA+ F Q++GVD+ ETFSPVVK+ST+R++LSI V
Subjt:  IFRIKRNSDGSVQRFKARLVAKRFHQSSGVDFFETFSPVVKASTIRVVLSIVV

XP_016902197.1 PREDICTED: uncharacterized protein LOC107991581 isoform X1 [Cucumis melo]4.8e-10846.19Show/hide
Query:  TSITSSIGTTNFSNPSLNQLLNQIMMIKLDRSNFLLWKNLAMPILRSYKLEGHRLGENPCPPQFVQPSVGISTSSDAPAEEGATSTQSGAIGASASVSAA
        T+   S+ +  FSNP LNQ+LNQ+  +KLDR N+LLWK LA+PIL+ YKLEGH   E PCP  FV   +  S+S+    EEGA +T    IGAS+S++  
Subjt:  TSITSSIGTTNFSNPSLNQLLNQIMMIKLDRSNFLLWKNLAMPILRSYKLEGHRLGENPCPPQFVQPSVGISTSSDAPAEEGATSTQSGAIGASASVSAA

Query:  KVVNPLFDSWVAVDQLLLGWLYNSMTPEVATQVMGFDNAQDLWAAIQDLFGVQSHAEEDYLRQVFQQCQKGSLKMVDYLRVMKSYADNLGQVGSPVTSRS
        ++VNPLF+ WV  D LLLGWLYNSMTP+VA Q+MGF N +DLW A QD FGVQS AEED+LRQ+ Q  +K                              
Subjt:  KVVNPLFDSWVAVDQLLLGWLYNSMTPEVATQVMGFDNAQDLWAAIQDLFGVQSHAEEDYLRQVFQQCQKGSLKMVDYLRVMKSYADNLGQVGSPVTSRS

Query:  LVSQVLLGLDEEYNPVVAMVQGRESTTWSELQAELLVFEKQLEMQTNLKSNV-SFSGGVSVNMANNKEGGNQRGQQS---FPNNRQNYSGRGGHRGGGGT
               GLDE YN V+ ++QG+   +W ++Q++LL+FEK+L+ Q   K N  + +   ++NMA       QR Q +   +  NRQ++SG+         
Subjt:  LVSQVLLGLDEEYNPVVAMVQGRESTTWSELQAELLVFEKQLEMQTNLKSNV-SFSGGVSVNMANNKEGGNQRGQQS---FPNNRQNYSGRGGHRGGGGT

Query:  RGRGRARGYGSFNHKPICQVCGKVGHIALMCYQRFNKEFFGPSQVQNRNDGNNNRQNTQTQPTAFVANQNSNSFLATPESVIDPNWYADNGESNHVTSDY
                 G+ N+ P CQ+CGK GH AL+CY RFNKEF  P  VQNRN+ ++N  +    P  FV+ QN+  F ATP++V+DPNWY D+G +NHVT + 
Subjt:  RGRGRARGYGSFNHKPICQVCGKVGHIALMCYQRFNKEFFGPSQVQNRNDGNNNRQNTQTQPTAFVANQNSNSFLATPESVIDPNWYADNGESNHVTSDY

Query:  NCIANPANYEGNERVIIGDGHKLQISHVGNSCLIDGYNNLSLENVLCVPTIAKNLVSVSKLARDNHVYVEFQDDFCLVKTKDTGK
        + + NP  Y G E+V +G+G++L IS+VGN+CL DG  +L L+N+LCVP IAKNL+SVSKLA+DNH+Y+EF    C +K K TGK
Subjt:  NCIANPANYEGNERVIIGDGHKLQISHVGNSCLIDGYNNLSLENVLCVPTIAKNLVSVSKLARDNHVYVEFQDDFCLVKTKDTGK

TrEMBL top hitse value%identityAlignment
A0A2K3MUJ9 Putative retrotransposon Ty1-copia subclass protein (Fragment)6.9e-11330.39Show/hide
Query:  LNQIMMIKLDRSNFLLWKNLAMPILRSYKLEGHRLGENPCPPQFVQPSVGISTSSDAPAEEGATSTQSGAIGASASVSAAKVVNPLFDSWVAVDQLLLGW
        L   + +KLDR NF LWK+L +P++R  K +G+ LG   CP QFV                              S+   + +NP +  W A DQ LLGW
Subjt:  LNQIMMIKLDRSNFLLWKNLAMPILRSYKLEGHRLGENPCPPQFVQPSVGISTSSDAPAEEGATSTQSGAIGASASVSAAKVVNPLFDSWVAVDQLLLGW

Query:  LYNSMTPEVATQVMGFDNAQDLWAAIQDLFGVQSHAEEDYLRQVFQQCQKGSLKMVDYLRVMKSYADNLGQVGSPVTSRSLVSQVLLGLDEEYNPVVAMV
        L NSMT ++ATQV+  + ++ LW   Q L G  + +   YL+  F    K  +KM  YL  MK+ AD L   GSP++S  L+ Q L GLD EYNPVV  +
Subjt:  LYNSMTPEVATQVMGFDNAQDLWAAIQDLFGVQSHAEEDYLRQVFQQCQKGSLKMVDYLRVMKSYADNLGQVGSPVTSRSLVSQVLLGLDEEYNPVVAMV

Query:  QGRESTTWSELQAELLVFEKQLEMQTNLKSNVSFSGGVSVNMANNKEGGNQRGQQSFPNNRQNYSGRGGHRGGG--GTRGRGRARGYGSFNHKPICQVCG
          + + +W + QA+LL FE +L+ Q N  +N++ +   S N A+  E G  +           +  RGG RG    G RG GR R   S   +PICQ+CG
Subjt:  QGRESTTWSELQAELLVFEKQLEMQTNLKSNVSFSGGVSVNMANNKEGGNQRGQQSFPNNRQNYSGRGGHRGGG--GTRGRGRARGYGSFNHKPICQVCG

Query:  KVGHIALMCYQRFNKEFFGPSQVQNRNDGNNNRQNTQTQPTAFVANQNSNS-FLATPESVIDPNWYADNGESNHVTSDYNCIANPANYEGNERVIIGDGH
        K GH A  CY RF+K +                    T+   +   + S+S F+A+P    D  WY D+G SNHVT     + +     G   +++G+G 
Subjt:  KVGHIALMCYQRFNKEFFGPSQVQNRNDGNNNRQNTQTQPTAFVANQNSNS-FLATPESVIDPNWYADNGESNHVTSDYNCIANPANYEGNERVIIGDGH

Query:  KLQISHVGNSCLIDGYNNLSLENVLCVPTIAKNLVSVSKLARDNHVYVEFQDDFCLVKTKDTGKVLPRGTLNDGLYQFE-------------HVKATSVY
        KL+I   G++ L    N+++L NVL VP I KNL+SVSKL  DN+  VEF +++C VK K TGK L +G L DGLYQ               ++    ++
Subjt:  KLQISHVGNSCLIDGYNNLSLENVLCVPTIAKNLVSVSKLARDNHVYVEFQDDFCLVKTKDTGKVLPRGTLNDGLYQFE-------------HVKATSVY

Query:  VSKTANQDPSRVNKVVSPVFVGLSS---------------------------------------------------------------------------
          K  + +   + KV+    V +S                                                                            
Subjt:  VSKTANQDPSRVNKVVSPVFVGLSS---------------------------------------------------------------------------

Query:  -------VNLVESK-------------------------------------------------------------------------------------A
                NLVE++                                                                                     +
Subjt:  -------VNLVESK-------------------------------------------------------------------------------------A

Query:  VLNGKSPMEQLLDKKADVKALRTFGCACFPCLRPYQAHKFQFHTEKCIYLGTSPVQKGHRCLNSNGRIFIPRHVRFNENEFPFRDGFGVTSGMSGSSSAT
        V   +SP   +  K+ D  AL+ FGCAC+PCL+PY  HK QFHT +C++LG S   KG++C+NS+GR+F+ RHV FNEN FPF++GF  T          
Subjt:  VLNGKSPMEQLLDKKADVKALRTFGCACFPCLRPYQAHKFQFHTEKCIYLGTSPVQKGHRCLNSNGRIFIPRHVRFNENEFPFRDGFGVTSGMSGSSSAT

Query:  LPSPNITMWFPYPNDPVSFVSSTNPSTSTDDPPSTPLQPNASLQSPCPNLQSATPRIQNQTQDQSCPLMNHEPNASNSASTSIASTQQPQPTSPQTTSSS
                      +P+  V++  P      P        A       + Q       N   DQS      E    N+ S    +    +    ++    
Subjt:  LPSPNITMWFPYPNDPVSFVSSTNPSTSTDDPPSTPLQPNASLQSPCPNLQSATPRIQNQTQDQSCPLMNHEPNASNSASTSIASTQQPQPTSPQTTSSS

Query:  PNPPTEPSATQIAQPPPVQAI---HPMVTRGKAGIFKPKL-WMSQSKIDWTITEPTRITKALATSQWHKAMDQEYSTLVSNNTWHLVSPSSSYNVVGNKW
          P TE        PPP Q I   H M TR KAG++KPKL ++  ++      EP  +++AL+  +W  AMD EY  L++N TW LV      NV+ +KW
Subjt:  PNPPTEPSATQIAQPPPVQAI---HPMVTRGKAGIFKPKL-WMSQSKIDWTITEPTRITKALATSQWHKAMDQEYSTLVSNNTWHLVSPSSSYNVVGNKW

Query:  IFRIKRNSDGSVQRFKARLVAKRFHQSSGVDFFETFSPVVKASTIRVVLSIVV
        IF+ K  +DG+++R KARLVA+ F Q++GVD+ ETFSPVVK+ST+R++LSI V
Subjt:  IFRIKRNSDGSVQRFKARLVAKRFHQSSGVDFFETFSPVVKASTIRVVLSIVV

A0A2Z6MBG6 Integrase catalytic domain-containing protein1.5e-11231.39Show/hide
Query:  IKLDRSNFLLWKNLAMPILRSYKLEGHRLGENPCPPQFVQPSVGISTSSDAPAEEGATSTQSGAIGASASVSAAKVVNPLFDSWVAVDQLLLGWLYNSMT
        +KLDR+N+ LWK+L +P++R  KL+G+ LG   CP +F+       TSSD                      ++K  N  F  W A DQ LLGW+ NSMT
Subjt:  IKLDRSNFLLWKNLAMPILRSYKLEGHRLGENPCPPQFVQPSVGISTSSDAPAEEGATSTQSGAIGASASVSAAKVVNPLFDSWVAVDQLLLGWLYNSMT

Query:  PEVATQVMGFDNAQDLWAAIQDLFGVQSHAEEDYLRQVFQQCQKGSLKMVDYLRVMKSYADNLGQVGSPVTSRSLVSQVLLGLDEEYNPVVAMVQGREST
         E+ATQ++  + ++ LW   Q L G  + ++  YL+  F   +KG +KM DYL  MK+  D L   G+PV++  L+ Q L GLD EYNPVV  +  + + 
Subjt:  PEVATQVMGFDNAQDLWAAIQDLFGVQSHAEEDYLRQVFQQCQKGSLKMVDYLRVMKSYADNLGQVGSPVTSRSLVSQVLLGLDEEYNPVVAMVQGREST

Query:  TWSELQAELLVFEKQLEMQTNLKSNVSFSGGVSVNMANNKEGGNQRGQQSFPNNRQNYSGRGGHRGGGGTRGRGRARGYGSFNHKPICQVCGKVGHIALM
        +W +LQA+LL FE ++E   NL +N++ +   + N+AN  +    RG+ S  NN    S   G RGG   RGRG+       + K  CQVCG   HIA+ 
Subjt:  TWSELQAELLVFEKQLEMQTNLKSNVSFSGGVSVNMANNKEGGNQRGQQSFPNNRQNYSGRGGHRGGGGTRGRGRARGYGSFNHKPICQVCGKVGHIALM

Query:  CYQRFNKEFFGPSQVQNRNDGNNNRQNTQTQPTAFVANQNSNSFLATPESVIDPNWYADNGESNHVTSDYNCIANPANYEGNERVIIGDGHKLQISHVGN
        C+ RF+K +      ++ +   +++Q +             N+FLA+  SV D +WY D+G SNHVT       +   + G   +++G+G KL I   G+
Subjt:  CYQRFNKEFFGPSQVQNRNDGNNNRQNTQTQPTAFVANQNSNSFLATPESVIDPNWYADNGESNHVTSDYNCIANPANYEGNERVIIGDGHKLQISHVGN

Query:  SCLIDGYNNLSLENVLCVPTIAKNLVSVSKLARDNHVYVEFQDDFCLVKTKDTGKVLPRGTLNDGLYQF-------------------------------
        S L     +L+L ++L VP I KNL+SVSKLA DN++ VEF ++ C VK K TGKV+ +G L DGLYQ                                
Subjt:  SCLIDGYNNLSLENVLCVPTIAKNLVSVSKLARDNHVYVEFQDDFCLVKTKDTGKVLPRGTLNDGLYQF-------------------------------

Query:  ------------------------------------------------------------------------------------EHVKATSVYVSKTANQ
                                                                                            E V+A   + + T NQ
Subjt:  ------------------------------------------------------------------------------------EHVKATSVYVSKTANQ

Query:  DPSRVN----------KVVSPVFV------------------------------GLS---------------------SVNLVESKAVLNGKSPMEQLLD
           R+           K V  + V                              GL+                      +N + S+   N +SP   +L 
Subjt:  DPSRVN----------KVVSPVFV------------------------------GLS---------------------SVNLVESKAVLNGKSPMEQLLD

Query:  KKADVKALRTFGCACFPCLRPYQAHKFQFHTEKCIYLGTSPVQKGHRCLNSNGRIFIPRHVRFNENEFPFRDGFGVTSGMSGSSSATLPSPNITMWFPYP
        K+ D K L+TFGCAC+PCL+PY  HK Q+HT +C++LG S   KG++CLNS+GRIFI RHV FNE+ FPF DG                           
Subjt:  KKADVKALRTFGCACFPCLRPYQAHKFQFHTEKCIYLGTSPVQKGHRCLNSNGRIFIPRHVRFNENEFPFRDGFGVTSGMSGSSSATLPSPNITMWFPYP

Query:  NDPVSFVSSTNPSTSTDDPPSTPLQPNASLQSPCPNLQSATPRIQNQTQDQSCPLMNHEPNASNSASTSIASTQQPQPTSPQTTSS-SPNPPTEPSATQI
             F+++ +P  +T + PST              L +A     N   D S P++     A N A T+   +Q     + QT +  S +  T      I
Subjt:  NDPVSFVSSTNPSTSTDDPPSTPLQPNASLQSPCPNLQSATPRIQNQTQDQSCPLMNHEPNASNSASTSIASTQQPQPTSPQTTSS-SPNPPTEPSATQI

Query:  AQPPPV-------QAIHPMVTRGKAGIFKPKL-WMSQSKIDWTITEPTRITKALATSQWHKAMDQEYSTLVSNNTWHLVSPSSSYNVVGNKWIFRIKRNS
         Q   V          H + TR K+GI KPKL ++  ++      EP    +AL+   W +AM +E+  L+SN TW LV   +  N+V +KW+F+ K   
Subjt:  AQPPPV-------QAIHPMVTRGKAGIFKPKL-WMSQSKIDWTITEPTRITKALATSQWHKAMDQEYSTLVSNNTWHLVSPSSSYNVVGNKWIFRIKRNS

Query:  DGSVQRFKARLVAKRFHQSSGVDFFETFSPVVKASTIRVVLSIVV
        DGS++R KARLVAK F Q++G+D+ ETFSPV+KAST+R++LSI V
Subjt:  DGSVQRFKARLVAKRFHQSSGVDFFETFSPVVKASTIRVVLSIVV

A0A803NU85 Uncharacterized protein3.8e-12731.88Show/hide
Query:  GSGQFCPYNRSGSGRKVEGAKNDGEKKACDVDRRRTTMSSPSSFRSI------CIKA-AIKKMDMQATKELILAEIKVLTRVHHLEPGFHVDGQSFAVYL
        G+ Q+ PY  +  G   E  ++D  KK C    R   +S   S   +      C ++  ++K  +         +I      +  +P     GQ+     
Subjt:  GSGQFCPYNRSGSGRKVEGAKNDGEKKACDVDRRRTTMSSPSSFRSI------CIKA-AIKKMDMQATKELILAEIKVLTRVHHLEPGFHVDGQSFAVYL

Query:  MVRDHTSITSSIGTTNFSNPSLNQLLNQIMMIKLDRSNFLLWKNLAMPILRSYKLEGHRLGENPCPPQFVQPSVGISTSSDAPAEEGATSTQSGAIGASA
             T+   +   T  +NP  N  L+Q   +KLDR+NF LWK +   I+R ++LEG+  G    PPQF+            P+E  A  T         
Subjt:  MVRDHTSITSSIGTTNFSNPSLNQLLNQIMMIKLDRSNFLLWKNLAMPILRSYKLEGHRLGENPCPPQFVQPSVGISTSSDAPAEEGATSTQSGAIGASA

Query:  SVSAAKVVNPLFDSWVAVDQLLLGWLYNSMTPEVATQVMGFDNAQDLWAAIQDLFGVQSHAEEDYLRQVFQQCQKGSLKMVDYLRVMKSYADNLGQVGSP
               VNP +++W+  DQLL+GWL                    LW+A+++L+G  S A  D +R   Q  +KG+  M DYL+  + +AD+L   G P
Subjt:  SVSAAKVVNPLFDSWVAVDQLLLGWLYNSMTPEVATQVMGFDNAQDLWAAIQDLFGVQSHAEEDYLRQVFQQCQKGSLKMVDYLRVMKSYADNLGQVGSP

Query:  VTSRSLVSQVLLGLDEEYNPVVAMVQGRESTTWSELQAELLVFEKQLEMQTNLKSNVSFSGGVSVNMANNKEGGN--------QRGQQSFPNNRQNYSGR
           + LVS VL GLD EY  +V +++ RE TTW +LQ  LL F+ +LE    + +N       S N A    G N        +RG     NNR N++  
Subjt:  VTSRSLVSQVLLGLDEEYNPVVAMVQGRESTTWSELQAELLVFEKQLEMQTNLKSNVSFSGGVSVNMANNKEGGN--------QRGQQSFPNNRQNYSGR

Query:  GGHRGGGGTRGRGR-ARGYGSFNHKPICQVCGKVGHIALMCYQRFNKEFFGPSQVQNRNDGNNNRQNTQTQPTAFVANQNSNSFLATPESVIDPNWYADN
        G  RGG    GRGR  RG GS   KP CQVCGK GH A +CY R+++ F G    Q  N+ N+++Q               ++ +A PE + D +WYAD+
Subjt:  GGHRGGGGTRGRGR-ARGYGSFNHKPICQVCGKVGHIALMCYQRFNKEFFGPSQVQNRNDGNNNRQNTQTQPTAFVANQNSNSFLATPESVIDPNWYADN

Query:  GESNHVTSDYNCIANPANYEGNERVIIGDGHKLQISHVGNSCLIDGYNNLSLENVLCVPTIAKNLVSVSKLARDNHVYVEFQDDFCLVKTKDTGKVLPRG
        G SNH+TSD   I N + Y G E++ IGDG KL I HVG   L    + L L N+L VP+I+KNL+SVSKL  DN+V +EF  D C+VK + TG+V+ +G
Subjt:  GESNHVTSDYNCIANPANYEGNERVIIGDGHKLQISHVGNSCLIDGYNNLSLENVLCVPTIAKNLVSVSKLARDNHVYVEFQDDFCLVKTKDTGKVLPRG

Query:  TLNDGLYQFEHVKATS---------VYVSKT----------------------------------------------------ANQDPSRVNKVVSP---
        TL DGLYQ    +++           +VS T                                                     NQ  +++ K+ +    
Subjt:  TLNDGLYQFEHVKATS---------VYVSKT----------------------------------------------------ANQDPSRVNKVVSP---

Query:  -------------------------------------VFVGL------------------SSVNLVE--SKAVLNGKSPMEQLLDKKADVKALRTFGCAC
                                             V +GL                  ++V L+     AVL  +SP   L  K+ D K L+TFG AC
Subjt:  -------------------------------------VFVGL------------------SSVNLVE--SKAVLNGKSPMEQLLDKKADVKALRTFGCAC

Query:  FPCLRPYQAHKFQFHTEKCIYLGTSPVQKGHRCLNSNGRIFIPRHVRFNENEFPFRDGFGVTSGMSGSSSATLPSPNITMWFPYPNDPVSFVSSTNPSTS
        FPCLR Y  HKFQFH+ KC+ LG S   KG++CL+ +GRI+I RHV FNE+EFPF+ GF   +  +      +P P+  +WF  P    +  SS++  TS
Subjt:  FPCLRPYQAHKFQFHTEKCIYLGTSPVQKGHRCLNSNGRIFIPRHVRFNENEFPFRDGFGVTSGMSGSSSATLPSPNITMWFPYPNDPVSFVSSTNPSTS

Query:  TDDPPSTPLQPNASLQSPCPNLQSATPRIQNQTQDQSCPLMNHEPNASNSASTSIASTQQPQPTSPQTTSSSPNPPTEPSATQIAQPPPVQAIHPMVTRG
         D  P+ PL                                                        PQ  S +                            
Subjt:  TDDPPSTPLQPNASLQSPCPNLQSATPRIQNQTQDQSCPLMNHEPNASNSASTSIASTQQPQPTSPQTTSSSPNPPTEPSATQIAQPPPVQAIHPMVTRG

Query:  KAGIFKPKLWMSQSKIDWTITEPTRITKALATSQWHKAMDQEYSTLVSNNTWHLVSPSSSYNVVGNKWIFRIKRNSDGSVQRFKARLVAKRFHQSSGVDF
        +AGIFKP++++ ++       EP  +  AL    W+ AM  E   L  N TW+LV PS  YNVVGNKW++++K N+D S QR+KARLVAK FHQ  G+DF
Subjt:  KAGIFKPKLWMSQSKIDWTITEPTRITKALATSQWHKAMDQEYSTLVSNNTWHLVSPSSSYNVVGNKWIFRIKRNSDGSVQRFKARLVAKRFHQSSGVDF

Query:  FETFSPVVKASTIRVVLSIVVCR
         ETFSPV+KAST+RVVL+I V +
Subjt:  FETFSPVVKASTIRVVLSIVVCR

A0A803PEH4 Uncharacterized protein1.4e-12934.62Show/hide
Query:  LNQIMMIKLDRSNFLLWKNLAMPILRSYKLEGHRLGENPCPPQFVQPSVGISTSSDAPAEEGATSTQSGAIGASASVSAAKVVNPLFDSWVAVDQLLLGW
        LNQ   +KLDR+N+ LWK +   I+R ++L G+  G   CPP+FV                               V   +V NP +++W+  DQLL+GW
Subjt:  LNQIMMIKLDRSNFLLWKNLAMPILRSYKLEGHRLGENPCPPQFVQPSVGISTSSDAPAEEGATSTQSGAIGASASVSAAKVVNPLFDSWVAVDQLLLGW

Query:  LYNSMTPEVATQVMGFDNAQDLWAAIQDLFGVQSHAEEDYLRQVFQQCQKGSLKMVDYLRVMKSYADNLGQVGSPVTSRSLVSQVLLGLDEEYNPVVAMV
        LY+SMT  +AT+VMG  +A +L   ++ L+G  S ++ D  R + Q  +KGS  M +YLR  K++++ L   G P     LV+ VL GLD EY  +V  +
Subjt:  LYNSMTPEVATQVMGFDNAQDLWAAIQDLFGVQSHAEEDYLRQVFQQCQKGSLKMVDYLRVMKSYADNLGQVGSPVTSRSLVSQVLLGLDEEYNPVVAMV

Query:  QGRESTTWSELQAELLVFEKQLEMQTN--LKSNVSFSGGVSVNM-ANNKEGGNQRGQQSFPNNRQNYSGRGGHRGGGGTRGRGRARGYGSFNHKPICQVC
        + R +TTW ELQ  LL F+ ++E   N  L SN + S     NM A     G  RG QS  N   N  G   +  G   R RGR RG GS   +P CQV 
Subjt:  QGRESTTWSELQAELLVFEKQLEMQTN--LKSNVSFSGGVSVNM-ANNKEGGNQRGQQSFPNNRQNYSGRGGHRGGGGTRGRGRARGYGSFNHKPICQVC

Query:  GKVGHIALMCYQRFNKEFFGPSQVQNRNDGNNNRQNTQTQPTAFVANQNSNSFLATPESVIDPNWYADNGESNHVTSDYNCIANPANYEGNERVIIGDGH
        GK GH A +CY RF++ + G       +D NN     +   T    N N ++F+ATPE +    W+AD+G SNH+TSD   +    +Y G E V++G+G 
Subjt:  GKVGHIALMCYQRFNKEFFGPSQVQNRNDGNNNRQNTQTQPTAFVANQNSNSFLATPESVIDPNWYADNGESNHVTSDYNCIANPANYEGNERVIIGDGH

Query:  KLQISHVGNSCL-IDGYNNLSLENVLCVPTIAKNLVSVSKLARDNHVYVEFQDDFCLVKTKDTGKVLPRGTLNDGLYQFE-------HVKATSVYVSKTA
        KL+I+H+GN  L I+  N L L+++L VP IAKNLVSVSKLA DN+V +EF  +FCLVK K T KVL  G L D LYQ +       H    S ++S   
Subjt:  KLQISHVGNSCL-IDGYNNLSLENVLCVPTIAKNLVSVSKLARDNHVYVEFQDDFCLVKTKDTGKVLPRGTLNDGLYQFE-------HVKATSVYVSKTA

Query:  NQDPSRVNK------VVSPVFVG---------------LSSVNLVESKAVL----------------------NGKSPMEQL------------------
            S VN+      ++S + VG               L SVN+  SK  +                        KS ++ +                  
Subjt:  NQDPSRVNK------VVSPVFVG---------------LSSVNLVESKAVL----------------------NGKSPMEQL------------------

Query:  ----------------LDKKAD-------------------VKALR---------------TFG----------------------------------CA
                        L  K+D                   +K+LR               T G                                    
Subjt:  ----------------LDKKAD-------------------VKALR---------------TFG----------------------------------CA

Query:  CFPCLRPYQAHKFQFHTEKCIYLGTSPVQKGHRCLNSNGRIFIPRHVRFNENEFPFRDGFGVTSGMSGSSSATLPSPNI---TMWFPYPNDPVSFVSSTN
        CFPCLR YQ+HKFQFH+ KC+ LG S   KG++CL+  GRI+I + V FNE  FPF+ GF         ++   P P I   T W       +  +   +
Subjt:  CFPCLRPYQAHKFQFHTEKCIYLGTSPVQKGHRCLNSNGRIFIPRHVRFNENEFPFRDGFGVTSGMSGSSSATLPSPNI---TMWFPYPNDPVSFVSSTN

Query:  PSTSTDDPPSTPLQPNASLQSPCPNLQSATPRIQNQTQDQSCPLMNHEPNASNSASTSIASTQQPQPTSPQTTSSSPNPPTEPSATQIAQPPPVQ-----
        PS S+    ST   P AS     P   S++PR Q+       P++ H  + S+ +S    S Q       +      +   EP+  +I Q  P +     
Subjt:  PSTSTDDPPSTPLQPNASLQSPCPNLQSATPRIQNQTQDQSCPLMNHEPNASNSASTSIASTQQPQPTSPQTTSSSPNPPTEPSATQIAQPPPVQ-----

Query:  AIHPMVTRGKAGIFKPKLWMSQSKIDWTITEPTRITKALATSQWHKAMDQEYSTLVSNNTWHLVSPSSSYNVVGNKWIFRIKRNSDGSVQRFKARLVAKR
          HPM+TRGK GIFKP+ ++SQ + + TITEP  + +A+    W KAM+ E+  L S  T  LV PS +YN+VGNKW++RIK N+DG+V R KARLVAK 
Subjt:  AIHPMVTRGKAGIFKPKLWMSQSKIDWTITEPTRITKALATSQWHKAMDQEYSTLVSNNTWHLVSPSSSYNVVGNKWIFRIKRNSDGSVQRFKARLVAKR

Query:  FHQSSGVDFFETFSPVVKASTIRVVLSIVVCRQLNGW
        FHQ  G+++ ETFSP++KA+T+R++L+IVV R   GW
Subjt:  FHQSSGVDFFETFSPVVKASTIRVVLSIVVCRQLNGW

A0A803PYD1 Uncharacterized protein4.0e-13736.76Show/hide
Query:  HTSITSSIGTTNFSNPSLNQLLNQIMMIKLDRSNFLLWKNLAMPILRSYKLEGHRLGENPCPPQFVQPSVGISTSSDAPAEEGATSTQSGAIGASASVSA
        +TS+ + +  T FSN      L+Q   +KLDR+N+ LWK +   I+R ++L+G   G+  CPP+        S +  A  EEG  ST S           
Subjt:  HTSITSSIGTTNFSNPSLNQLLNQIMMIKLDRSNFLLWKNLAMPILRSYKLEGHRLGENPCPPQFVQPSVGISTSSDAPAEEGATSTQSGAIGASASVSA

Query:  AKVVNPLFDSWVAVDQLLLGWLYNSMTPEVATQVMGFDNAQDLWAAIQDLFGVQSHAEEDYLRQVFQQCQKGSLKMVDYLRVMKSYADNLGQVGSPVTSR
          + N  ++S +  DQLL+GWLY SMT  + ++VMG  +A  LW A+++L+G QS A  D L+   Q  +KG   M +YLR  +  AD L  VG P   +
Subjt:  AKVVNPLFDSWVAVDQLLLGWLYNSMTPEVATQVMGFDNAQDLWAAIQDLFGVQSHAEEDYLRQVFQQCQKGSLKMVDYLRVMKSYADNLGQVGSPVTSR

Query:  SLVSQVLLGLDEEYNPVVAMVQGRESTTWSELQAELLVFEKQLEMQTNLKSNVSFSGGVSVNMANNKEG----GNQRGQQSFPNNRQNYSGRG------G
         L   +L GLD EY  +V  ++ R   +  ELQ+ LL ++ +LE   +L  N       SV +A    G    G  +G+ S  +N  N SGRG       
Subjt:  SLVSQVLLGLDEEYNPVVAMVQGRESTTWSELQAELLVFEKQLEMQTNLKSNVSFSGGVSVNMANNKEG----GNQRGQQSFPNNRQNYSGRG------G

Query:  HRGGGGTRGRGRARGYGSFNHKPICQVCGKVGHIALMCYQRFNKEFFGPSQVQNRNDGNNNRQNTQTQPTAFVANQNSNS----FLATPESVIDPNWYAD
         RGG G RGRGR    G  N KP CQVCGK GH A +C  RF++ + G                 Q  P+ F  +Q+  +     +ATP+++ D +WYAD
Subjt:  HRGGGGTRGRGRARGYGSFNHKPICQVCGKVGHIALMCYQRFNKEFFGPSQVQNRNDGNNNRQNTQTQPTAFVANQNSNS----FLATPESVIDPNWYAD

Query:  NGESNHVTSDYNCIANPANYEGNERVIIGDGHKLQISHVGNSCL-IDGYNNLSLENVLCVPTIAKNLVSVSKLARDNHVYVEFQDDFCLVKTKDTGKVLP
        +G +NH+T D + + N   Y G E++I+GDG KL I H+G++ L I     L L+++L VP+I KNL+S+S L  DN V VEF  DFC VK + TGKV+ 
Subjt:  NGESNHVTSDYNCIANPANYEGNERVIIGDGHKLQISHVGNSCL-IDGYNNLSLENVLCVPTIAKNLVSVSKLARDNHVYVEFQDDFCLVKTKDTGKVLP

Query:  RGTLNDGLYQFEHVKATSV-----------------------------------------YVSKTANQDPSRVNKVVSPVFVGL----------------
        + TL DGLYQF      ++                                          +S   +Q+    +K    V +GL                
Subjt:  RGTLNDGLYQFEHVKATSV-----------------------------------------YVSKTANQDPSRVNKVVSPVFVGL----------------

Query:  --SSVNLVESKAV--LNGKSPMEQLLDKKADVKALRTFGCACFPCLRPYQAHKFQFHTEKCIYLGTSPVQKGHRCLNSNGRIFIPRHVRFNENEFPFRDG
          +SV L+       L GKSP E L  K  D K L+ FG  CFP LRPYQ HKFQ+H+ KC+ LG S V KG++CL+  GRI+I R+V FNE+EFP   G
Subjt:  --SSVNLVESKAV--LNGKSPMEQLLDKKADVKALRTFGCACFPCLRPYQAHKFQFHTEKCIYLGTSPVQKGHRCLNSNGRIFIPRHVRFNENEFPFRDG

Query:  FGVTSGMSGSSSATLPSPNITMWFPYPNDPVSFVSSTNPSTSTDDPPSTPLQPNASLQSPCPNLQSATPRIQNQTQDQSCPLMNHEPNASNSASTSIAST
        F   +        TL +P+   WF  P+ P+    S++ S S    P+ P  P ++ QS           + + +     P+    P A++    SI+  
Subjt:  FGVTSGMSGSSSATLPSPNITMWFPYPNDPVSFVSSTNPSTSTDDPPSTPLQPNASLQSPCPNLQSATPRIQNQTQDQSCPLMNHEPNASNSASTSIAST

Query:  QQPQPTSPQTTSSSPNPPTEPSATQIAQPPPVQAIHPMVTRGKAGIFKPKLWMSQSKIDWTITEPTRITKALATSQWHKAMDQEYSTLVSNNTWHLVSPS
          P P      S   +PP + +             HPM+TR KAGIFKPK ++S +KI      P  + +AL    W+ AM  E+  L    TW LV  S
Subjt:  QQPQPTSPQTTSSSPNPPTEPSATQIAQPPPVQAIHPMVTRGKAGIFKPKLWMSQSKIDWTITEPTRITKALATSQWHKAMDQEYSTLVSNNTWHLVSPS

Query:  SSYNVVGNKWIFRIKRNSDGSVQRFKARLVAKRFHQSSGVDFFETFSPVVKASTIRVVLSIVV
         + N+VG KWIFR K N+DGS QR KARLVAK FHQ  GVDF ETFSPVVKA TIR+VL++VV
Subjt:  SSYNVVGNKWIFRIKRNSDGSVQRFKARLVAKRFHQSSGVDFFETFSPVVKASTIRVVLSIVV

SwissProt top hitse value%identityAlignment
P04146 Copia protein5.4e-0622.69Show/hide
Query:  VNLVESKAVL-NGKSPMEQLLDKKADVKALRTFGCACFPCLRPYQAHKFQFHTEKCIYLGTSPVQKGHRCLNSNGRIFIPRHV------RFNENEFPFRD
        +N + S+A++ + K+P E   +KK  +K LR FG   +  ++  Q  KF   + K I++G  P          N +  + R V        N     F  
Subjt:  VNLVESKAVL-NGKSPMEQLLDKKADVKALRTFGCACFPCLRPYQAHKFQFHTEKCIYLGTSPVQKGHRCLNSNGRIFIPRHV------RFNENEFPFRD

Query:  GFGVTSGMSGSSSATLPSPNITMWFPYPN-----DPVSFVSSTNPSTSTDDPPSTPLQPNASLQSPCPNLQSATPRIQ--NQTQDQSCPLMN------HE
         F   S  S + +    S  I +   +PN     D + F+  +  S + + P  +       +Q+  PN       IQ    +++ +   +N       +
Subjt:  GFGVTSGMSGSSSATLPSPNITMWFPYPN-----DPVSFVSSTNPSTSTDDPPSTPLQPNASLQSPCPNLQSATPRIQ--NQTQDQSCPLMN------HE

Query:  PNASNSASTSIASTQQPQPTSPQTTSSSPNPPTEPSATQIAQPPPVQ-AIHPMVTRGKAGIFKPKLWMSQSKIDWTITEP-TRITKALATSQWHKAMDQE
         + + S  +   +  +   T+        + PT+    +I      +    P ++  +      K+ ++   I   +      I      S W +A++ E
Subjt:  PNASNSASTSIASTQQPQPTSPQTTSSSPNPPTEPSATQIAQPPPVQ-AIHPMVTRGKAGIFKPKLWMSQSKIDWTITEP-TRITKALATSQWHKAMDQE

Query:  YSTLVSNNTWHLVSPSSSYNVVGNKWIFRIKRNSDGSVQRFKARLVAKRFHQSSGVDFFETFSPVVKASTIRVVLSIVV
         +    NNTW +     + N+V ++W+F +K N  G+  R+KARLVA+ F Q   +D+ ETF+PV + S+ R +LS+V+
Subjt:  YSTLVSNNTWHLVSPSSSYNVVGNKWIFRIKRNSDGSVQRFKARLVAKRFHQSSGVDFFETFSPVVKASTIRVVLSIVV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.3e-1239.25Show/hide
Query:  EPTRITKALA---TSQWHKAMDQEYSTLVSNNTWHLVSPSSSYNVVGNKWIFRIKRNSDGSVQRFKARLVAKRFHQSSGVDFFETFSPVVKASTIRVVLS
        EP  + + L+    +Q  KAM +E  +L  N T+ LV        +  KW+F++K++ D  + R+KARLV K F Q  G+DF E FSPVVK ++IR +LS
Subjt:  EPTRITKALA---TSQWHKAMDQEYSTLVSNNTWHLVSPSSSYNVVGNKWIFRIKRNSDGSVQRFKARLVAKRFHQSSGVDFFETFSPVVKASTIRVVLS

Query:  IVVCRQL
        +     L
Subjt:  IVVCRQL

P92520 Uncharacterized mitochondrial protein AtMg008209.2e-2245.16Show/hide
Query:  MVTRGKAGIFKPKLWMSQSKIDWTITEPTRITKALATSQWHKAMDQEYSTLVSNNTWHLVSPSSSYNVVGNKWIFRIKRNSDGSVQRFKARLVAKRFHQS
        M+TR KAGI K     S +       EP  +  AL    W +AM +E   L  N TW LV P  + N++G KW+F+ K +SDG++ R KARLVAK FHQ 
Subjt:  MVTRGKAGIFKPKLWMSQSKIDWTITEPTRITKALATSQWHKAMDQEYSTLVSNNTWHLVSPSSSYNVVGNKWIFRIKRNSDGSVQRFKARLVAKRFHQS

Query:  SGVDFFETFSPVVKASTIRVVLSI
         G+ F ET+SPVV+ +TIR +L++
Subjt:  SGVDFFETFSPVVKASTIRVVLSI

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE16.1e-6625.07Show/hide
Query:  LNQIMMIKLDRSNFLLWKNLAMPILRSYKLEGHRLGENPCPPQFVQPSVGISTSSDAPAEEGATSTQSGAIGASASVSAAKVVNPLFDSWVAVDQLLLGW
        +N   + KL  +N+L+W      +   Y+L G   G    PP                              A+    AA  VNP +  W   D+L+   
Subjt:  LNQIMMIKLDRSNFLLWKNLAMPILRSYKLEGHRLGENPCPPQFVQPSVGISTSSDAPAEEGATSTQSGAIGASASVSAAKVVNPLFDSWVAVDQLLLGW

Query:  LYNSMTPEVATQVMGFDNAQDLWAAIQDLFGVQSHAEEDYLRQVFQQCQKGSLKMVDYLRVMKSYADNLGQVGSPVTSRSLVSQVLLGLDEEYNPVVAMV
        +  +++  V   V     A  +W  ++ ++   S+     LR   +Q  KG+  + DY++ + +  D L  +G P+     V +VL  L EEY PV+  +
Subjt:  LYNSMTPEVATQVMGFDNAQDLWAAIQDLFGVQSHAEEDYLRQVFQQCQKGSLKMVDYLRVMKSYADNLGQVGSPVTSRSLVSQVLLGLDEEYNPVVAMV

Query:  QGRES-TTWSELQAELLVFEKQL---EMQTNLKSNVSFSGGVSVNMANNKEGGNQRGQQSFPNNRQN-----YSGRGGHRGGGGTRGRGRARGYGSFNHK
          +++  T +E+   LL  E ++      T +    +     +    NN   GN+  +    NN  N      S    H     ++            + 
Subjt:  QGRES-TTWSELQAELLVFEKQL---EMQTNLKSNVSFSGGVSVNMANNKEGGNQRGQQSFPNNRQN-----YSGRGGHRGGGGTRGRGRARGYGSFNHK

Query:  PICQVCGKVGHIALMCYQRFNKEFFGPSQVQNRNDGNNNRQNTQTQPTAFVANQ-NSNSFLATPESVIDPNWYADNGESNHVTSDYNCIANPANYEGNER
          CQ+CG  GH A  C           SQ+Q+     N++Q     P+ F   Q  +N  L +P S    NW  D+G ++H+TSD+N ++    Y G + 
Subjt:  PICQVCGKVGHIALMCYQRFNKEFFGPSQVQNRNDGNNNRQNTQTQPTAFVANQ-NSNSFLATPESVIDPNWYADNGESNHVTSDYNCIANPANYEGNER

Query:  VIIGDGHKLQISHVGNSCLIDGYNNLSLENVLCVPTIAKNLVSVSKLARDNHVYVEFQDDFCLVKTKDTGKVLPRGTLNDGLYQF-------------EH
        V++ DG  + ISH G++ L      L+L N+L VP I KNL+SV +L   N V VEF      VK  +TG  L +G   D LY++               
Subjt:  VIIGDGHKLQISHVGNSCLIDGYNNLSLENVLCVPTIAKNLVSVSKLARDNHVYVEFQDDFCLVKTKDTGKVLPRGTLNDGLYQF-------------EH

Query:  VKAT-SVYVSKTANQDPSRVNKVVS-------------------------------------------------------------PVFVG---------
         KAT S + ++  +  PS +N V+S                                                              +FV          
Subjt:  VKAT-SVYVSKTANQDPSRVNKVVS-------------------------------------------------------------PVFVG---------

Query:  ------------LSSVNLVESK------------------------------------------------------------------------------
                    ++  NL+E++                                                                              
Subjt:  ------------LSSVNLVESK------------------------------------------------------------------------------

Query:  -------AVLNGKSPMEQLLDKKADVKALRTFGCACFPCLRPYQAHKFQFHTEKCIYLGTSPVQKGHRCLN-SNGRIFIPRHVRFNENEFPFRDGFGVTS
                +L  +SP ++L     +   LR FGCAC+P LRPY  HK    + +C++LG S  Q  + CL+    R++I RHVRF+EN FPF +     S
Subjt:  -------AVLNGKSPMEQLLDKKADVKALRTFGCACFPCLRPYQAHKFQFHTEKCIYLGTSPVQKGHRCLN-SNGRIFIPRHVRFNENEFPFRDGFGVTS

Query:  GMSGS--SSATLPSPNITMWFPYPNDPVSFVSSTNPSTSTDDPPSTPLQPNASLQSPCPNLQSATPRIQNQTQDQSCPLMN-----HEPNASNSASTSIA
         +      S+ + SP+ T+    P  P    S  + + +   PPS+P  P  + Q    NL S+       + + + P  N      +P  + + + S  
Subjt:  GMSGS--SSATLPSPNITMWFPYPNDPVSFVSSTNPSTSTDDPPSTPLQPNASLQSPCPNLQSATPRIQNQTQDQSCPLMN-----HEPNASNSASTSIA

Query:  STQQPQPTS-------------PQTTSSSPNPPTEPSATQIAQPPPVQAIHP--------------------MVTRGKAGIFKPKLWMSQSKIDWTITEP
        +T Q  PT+              Q++SSSP+P T  S++  +  PP   IHP                    M TR KAGI KP    S +      +EP
Subjt:  STQQPQPTS-------------PQTTSSSPNPPTEPSATQIAQPPPVQAIHP--------------------MVTRGKAGIFKPKLWMSQSKIDWTITEP

Query:  TRITKALATSQWHKAMDQEYSTLVSNNTWHLVSPSSSY-NVVGNKWIFRIKRNSDGSVQRFKARLVAKRFHQSSGVDFFETFSPVVKASTIRVVLSIVVC
            +AL   +W  AM  E +  + N+TW LV P  S+  +VG +WIF  K NSDGS+ R+KARLVAK ++Q  G+D+ ETFSPV+K+++IR+VL + V 
Subjt:  TRITKALATSQWHKAMDQEYSTLVSNNTWHLVSPSSSY-NVVGNKWIFRIKRNSDGSVQRFKARLVAKRFHQSSGVDFFETFSPVVKASTIRVVLSIVVC

Query:  R
        R
Subjt:  R

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.1e-5423.75Show/hide
Query:  LNQIMMIKLDRSNFLLWKNLAMPILRSYKLEGHRLGENPCPPQFVQPSVGISTSSDAPAEEGATSTQSGAIGASASVSAAKVVNPLFDSWVAVDQLLLGW
        +N   + KL  +N+L+W      +   Y+L G   G  P PP                              A+    A   VNP +  W   D+L+   
Subjt:  LNQIMMIKLDRSNFLLWKNLAMPILRSYKLEGHRLGENPCPPQFVQPSVGISTSSDAPAEEGATSTQSGAIGASASVSAAKVVNPLFDSWVAVDQLLLGW

Query:  LYNSMTPEVATQVMGFDNAQDLWAAIQDLFGVQSHAEEDYLRQVFQQCQKGSLKMVDYLRVMKSYADNLGQVGSPVTSRSLVSQVLLGLDEEYNPVVAMV
        +  +++  V   V     A  +W  ++ ++   S+     LR +                   +  D L  +G P+     V +VL  L ++Y PV+  +
Subjt:  LYNSMTPEVATQVMGFDNAQDLWAAIQDLFGVQSHAEEDYLRQVFQQCQKGSLKMVDYLRVMKSYADNLGQVGSPVTSRSLVSQVLLGLDEEYNPVVAMV

Query:  QGRES-TTWSELQAELLVFEKQLEMQTNLKSNVSFSGGVSVNMANNKEGGNQRGQQSFPNNRQNYSGRGGHRGGGGTRGRGRARGYGSFNHKPICQVCGK
          +++  + +E+   L+  E +L    + +     +  V+    N     N RG     NN  N S        G      + + Y        CQ+C  
Subjt:  QGRES-TTWSELQAELLVFEKQLEMQTNLKSNVSFSGGVSVNMANNKEGGNQRGQQSFPNNRQNYSGRGGHRGGGGTRGRGRARGYGSFNHKPICQVCGK

Query:  VGHIALMCYQRFNKEFFGPSQVQNRNDGNNNRQNTQTQPTAFVANQNSNSFLATPESVIDPNWYADNGESNHVTSDYNCIANPANYEGNERVIIGDGHKL
         GH A  C Q             ++     N+Q + +  T +    N    LA        NW  D+G ++H+TSD+N ++    Y G + V+I DG  +
Subjt:  VGHIALMCYQRFNKEFFGPSQVQNRNDGNNNRQNTQTQPTAFVANQNSNSFLATPESVIDPNWYADNGESNHVTSDYNCIANPANYEGNERVIIGDGHKL

Query:  QISHVGNSCLIDGYNNLSLENVLCVPTIAKNLVSVSKLARDNHVYVEFQDDFCLVKTKDTGKVLPRGTLNDGLYQFEHV-------------KAT-SVYV
         I+H G++ L     +L L  VL VP I KNL+SV +L   N V VEF      VK  +TG  L +G   D LY++                KAT S + 
Subjt:  QISHVGNSCLIDGYNNLSLENVLCVPTIAKNLVSVSKLARDNHVYVEFQDDFCLVKTKDTGKVLPRGTLNDGLYQFEHV-------------KAT-SVYV

Query:  SKTANQDPSRVNKVVS----PVF-------------------VGLSSVNLVESK----------------------------------------------
        S+  +   + +N V+S    PV                    V  S+  +  SK                                              
Subjt:  SKTANQDPSRVNKVVS----PVF-------------------VGLSSVNLVESK----------------------------------------------

Query:  --------------------------------------------------------------------------------------------------AV
                                                                                                           +
Subjt:  --------------------------------------------------------------------------------------------------AV

Query:  LNGKSPMEQLLDKKADVKALRTFGCACFPCLRPYQAHKFQFHTEKCIYLGTSPVQKGHRCLN-SNGRIFIPRHVRFNENEFPF-RDGFGVT--------S
        L  +SP ++L  +  + + L+ FGCAC+P LRPY  HK +  +++C ++G S  Q  + CL+   GR++  RHV+F+E  FPF    FGV+        S
Subjt:  LNGKSPMEQLLDKKADVKALRTFGCACFPCLRPYQAHKFQFHTEKCIYLGTSPVQKGHRCLN-SNGRIFIPRHVRFNENEFPF-RDGFGVT--------S

Query:  GMSGSSSATLPSPNITMWFP-----------------------------YPNDPVSFVSSTNPSTSTDDPPSTPLQP----NASLQSPC---PNLQSATP
          +  S  TLP+  + +  P                              P+  +S  SS+ P+  + + P    QP    N++  SP    PN  S +P
Subjt:  GMSGSSSATLPSPNITMWFP-----------------------------YPNDPVSFVSSTNPSTSTDDPPSTPLQP----NASLQSPC---PNLQSATP

Query:  RIQNQTQDQSCPLMNHEPNASNSASTSIASTQQPQPTSPQTTSSSPNPPT--EPSATQIAQPPPVQAIHPMVTRGKAGIFKPKLWMSQSKIDWTITEPTR
           NQ        +   P +S    T   S  +P   S  +TS+ P PP    P   Q+    PV   H M TR K GI KP    S +      +EP  
Subjt:  RIQNQTQDQSCPLMNHEPNASNSASTSIASTQQPQPTSPQTTSSSPNPPT--EPSATQIAQPPPVQAIHPMVTRGKAGIFKPKLWMSQSKIDWTITEPTR

Query:  ITKALATSQWHKAMDQEYSTLVSNNTWHLV-SPSSSYNVVGNKWIFRIKRNSDGSVQRFKARLVAKRFHQSSGVDFFETFSPVVKASTIRVVLSIVVCR
          +A+   +W +AM  E +  + N+TW LV  P  S  +VG +WIF  K NSDGS+ R+KARLVAK ++Q  G+D+ ETFSPV+K+++IR+VL + V R
Subjt:  ITKALATSQWHKAMDQEYSTLVSNNTWHLV-SPSSSYNVVGNKWIFRIKRNSDGSVQRFKARLVAKRFHQSSGVDFFETFSPVVKASTIRVVLSIVVCR

Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)8.6e-0727.13Show/hide
Query:  SWVAVDQLLLGWLYNSMTP-EVATQVMGFDNAQDLWAAIQDLFGVQSHAEEDYLRQVFQQCQKGSLKMVDYLRVMKSYADNLGQVGSPVTSRSLVSQVLL
        +W   D ++   LY ++TP +     +    ++D+W  I++ F     A    L    +    G +++ DY R MK  AD+L  V  PVT R+LV  VL 
Subjt:  SWVAVDQLLLGWLYNSMTP-EVATQVMGFDNAQDLWAAIQDLFGVQSHAEEDYLRQVFQQCQKGSLKMVDYLRVMKSYADNLGQVGSPVTSRSLVSQVLL

Query:  GLDEEYNPVVAMVQGRESTTWSELQAELLVFEKQLEMQTNLKSN---VSFSGGVSVNMAN--------NKEGGNQRGQQSFPNNRQNYSGRGGHRGGGGT
        GL+ +++ ++ +++ R+    S   A  ++ E++  ++  +K N   V  S   +V   +         + GGNQ G          Y GRG  RG    
Subjt:  GLDEEYNPVVAMVQGRESTTWSELQAELLVFEKQLEMQTNLKSN---VSFSGGVSVNMAN--------NKEGGNQRGQQSFPNNRQNYSGRGGHRGGGGT

Query:  RGRGRARGY------GSFNHKPICQVCGKVGHIALMCYQRFNKEFFGPSQVQNRNDGN
        RGRG    Y       S+N  P  Q            YQ +N  +  P  V N N GN
Subjt:  RGRGRARGY------GSFNHKPICQVCGKVGHIALMCYQRFNKEFFGPSQVQNRNDGN

AT3G21630.1 chitin elicitor receptor kinase 11.2e-0555.77Show/hide
Query:  KAAIKKMDMQATKELILAEIKVLTRVHHLE----PGFHVDGQSFAVYLMVRD
        KAAIKKMDM+A+K+  LAE+KVLTRVHH+      G+ V+G  F VY  V +
Subjt:  KAAIKKMDMQATKELILAEIKVLTRVHHLE----PGFHVDGQSFAVYLMVRD

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.1e-1740.82Show/hide
Query:  EPTRITKALATSQWHKAMDQEYSTLVSNNTWHLVSPSSSYNVVGNKWIFRIKRNSDGSVQRFKARLVAKRFHQSSGVDFFETFSPVVKASTIRVVLSI
        EP+   +A     W  AMD E   + + +TW + +   +   +G KW+++IK NSDG+++R+KARLVAK + Q  G+DF ETFSPV K ++++++L+I
Subjt:  EPTRITKALATSQWHKAMDQEYSTLVSNNTWHLVSPSSSYNVVGNKWIFRIKRNSDGSVQRFKARLVAKRFHQSSGVDFFETFSPVVKASTIRVVLSI

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)5.0e-0725.11Show/hide
Query:  AIGASASVSAAKVVNPLFDS-WVAVDQLLLGWLYNSMTPEVATQVMGFD-NAQDLWAAIQDLFGVQSHAEEDYLRQVFQQCQKGSLKMVDYLRVMKSYAD
        + G    +  +    P+ +  W   D L+  W+Y ++T  +   ++     A+DLW ++++LF     A         +      L + +Y + +KS +D
Subjt:  AIGASASVSAAKVVNPLFDS-WVAVDQLLLGWLYNSMTPEVATQVMGFD-NAQDLWAAIQDLFGVQSHAEEDYLRQVFQQCQKGSLKMVDYLRVMKSYAD

Query:  NLGQVGSPVTSRSLVSQVLLGLDEEYNPVVAMVQGRES-TTWSELQAELLVFEKQLEMQTNLKSNVSFSGGVSV-NMANNKEGGNQRGQQSFPNNRQNYS
         L  V SP++ R LV  +L GL E+Y+ ++ +++ +    +++E ++ LL+ E +L  ++  KS++S +   S+ N+        +R  Q + NN  N  
Subjt:  NLGQVGSPVTSRSLVSQVLLGLDEEYNPVVAMVQGRES-TTWSELQAELLVFEKQLEMQTNLKSNVSFSGGVSV-NMANNKEGGNQRGQQSFPNNRQNYS

Query:  GRG----GHRGGGGTRGRGRARGYGSFNHKP
        GRG     +RGGG + GR         N  P
Subjt:  GRG----GHRGGGGTRGRGRARGYGSFNHKP

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)6.6e-2345.16Show/hide
Query:  MVTRGKAGIFKPKLWMSQSKIDWTITEPTRITKALATSQWHKAMDQEYSTLVSNNTWHLVSPSSSYNVVGNKWIFRIKRNSDGSVQRFKARLVAKRFHQS
        M+TR KAGI K     S +       EP  +  AL    W +AM +E   L  N TW LV P  + N++G KW+F+ K +SDG++ R KARLVAK FHQ 
Subjt:  MVTRGKAGIFKPKLWMSQSKIDWTITEPTRITKALATSQWHKAMDQEYSTLVSNNTWHLVSPSSSYNVVGNKWIFRIKRNSDGSVQRFKARLVAKRFHQS

Query:  SGVDFFETFSPVVKASTIRVVLSI
         G+ F ET+SPVV+ +TIR +L++
Subjt:  SGVDFFETFSPVVKASTIRVVLSI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCACGACCCGACCCGACCCGATTTGGGTCAGTCTTGATGCGGGTCGGGTCGGGTCAGTTTTGTCCGTACAACCGATCAGGATCGGGTCGGAAAGTTGAAGGTGCGAA
GAATGACGGGGAGAAGAAGGCATGCGATGTCGACCGAAGAAGGACGACGATGTCGTCGCCGTCATCTTTCAGATCGATATGCATTAAAGCAGCAATAAAGAAGATGGATA
TGCAAGCTACAAAGGAGCTGATCCTTGCAGAAATTAAGGTTCTAACACGCGTTCACCACCTTGAACCTGGATTTCATGTTGATGGACAATCTTTTGCGGTGTATTTGATG
GTTAGGGACCATACATCAATCACCTCGTCCATCGGAACCACCAACTTCAGCAATCCATCGCTCAACCAGCTATTAAATCAGATCATGATGATTAAGCTGGATCGAAGCAA
TTTTCTTCTATGGAAAAATCTAGCTATGCCCATTCTTCGCAGCTACAAGCTGGAAGGACACCGATTAGGTGAGAATCCATGCCCTCCTCAATTTGTTCAGCCGTCTGTTG
GAATTTCAACCTCGAGTGATGCACCAGCAGAGGAAGGAGCAACAAGCACTCAGTCGGGCGCCATTGGAGCGTCTGCTTCAGTTTCAGCCGCAAAAGTGGTAAATCCACTC
TTCGATTCTTGGGTTGCGGTTGATCAGTTGCTACTTGGGTGGTTGTACAACTCAATGACACCCGAAGTAGCAACACAGGTGATGGGCTTTGATAATGCTCAAGACCTTTG
GGCAGCGATACAAGATTTATTTGGAGTTCAGTCGCATGCAGAAGAGGATTACTTACGTCAGGTGTTCCAACAATGCCAAAAAGGGAGTCTGAAGATGGTTGATTACTTGA
GAGTCATGAAAAGCTACGCAGACAATCTGGGACAGGTAGGAAGTCCGGTTACATCTCGATCCTTAGTCTCGCAAGTTCTCTTGGGCCTCGATGAAGAATACAACCCCGTG
GTTGCAATGGTGCAAGGTAGAGAAAGCACAACGTGGTCAGAATTGCAAGCTGAGCTTCTCGTATTTGAGAAGCAATTGGAAATGCAAACCAACCTGAAAAGCAATGTGTC
ATTCAGTGGAGGTGTTTCAGTTAACATGGCAAACAACAAAGAGGGTGGAAATCAAAGAGGTCAGCAAAGTTTCCCAAATAATCGGCAAAATTACTCTGGCAGAGGAGGTC
ATAGAGGAGGAGGAGGAACCCGTGGTCGTGGTAGAGCTCGTGGTTATGGTTCTTTCAACCACAAACCAATCTGCCAGGTGTGTGGGAAGGTTGGTCATATTGCTCTTATG
TGTTATCAAAGGTTTAACAAGGAATTTTTTGGTCCTTCCCAAGTTCAAAATAGGAATGATGGCAACAACAATCGACAAAATACTCAGACACAACCTACTGCCTTTGTTGC
CAACCAGAATTCAAATTCGTTTTTGGCCACACCTGAATCTGTCATTGATCCAAATTGGTATGCCGACAATGGGGAATCAAATCATGTGACCAGTGACTACAATTGCATTG
CCAATCCTGCCAATTATGAAGGTAATGAACGTGTAATTATAGGGGATGGTCATAAGTTACAAATATCTCATGTTGGGAACTCATGCTTGATTGATGGCTATAATAATCTA
AGTTTGGAAAATGTGTTATGTGTTCCTACTATTGCAAAGAACTTAGTAAGTGTTTCCAAGTTAGCTCGTGATAATCATGTGTATGTTGAATTTCAGGATGATTTTTGTCT
TGTTAAGACCAAGGATACGGGCAAGGTACTGCCGAGAGGAACACTTAATGATGGACTATACCAATTTGAGCATGTGAAGGCCACCTCTGTTTATGTTTCAAAGACAGCTA
ACCAAGATCCATCTAGAGTCAATAAAGTTGTTTCCCCTGTTTTTGTTGGTTTAAGTAGTGTAAATCTGGTTGAGTCTAAAGCTGTTTTAAATGGGAAGTCACCAATGGAA
CAACTTTTAGACAAAAAGGCTGATGTCAAGGCTTTAAGGACATTCGGTTGTGCCTGTTTCCCATGCCTACGACCTTACCAGGCTCACAAGTTCCAGTTCCACACAGAAAA
ATGCATTTACTTGGGGACTAGTCCTGTTCAAAAAGGCCATCGTTGTCTCAACTCCAATGGAAGAATATTTATCCCACGACATGTCAGATTCAATGAAAATGAGTTCCCCT
TCCGTGATGGTTTTGGTGTTACCTCTGGTATGAGTGGCTCCTCTTCAGCAACACTTCCTTCACCAAATATAACCATGTGGTTCCCTTACCCAAATGACCCTGTCAGTTTT
GTGTCATCTACCAATCCCTCTACCAGTACCGATGATCCACCTTCCACTCCTTTACAGCCTAATGCTTCACTACAAAGCCCTTGCCCCAATTTACAGTCAGCAACTCCTCG
AATTCAAAACCAAACTCAAGATCAGTCGTGTCCCTTGATGAACCATGAACCAAATGCCTCAAACTCAGCCTCAACCTCTATTGCCTCAACCCAACAGCCTCAACCTACAT
CACCACAAACTACTTCCAGTTCACCTAATCCACCTACTGAGCCATCTGCCACACAGATTGCCCAACCTCCTCCTGTACAGGCCATTCATCCCATGGTAACTAGGGGGAAA
GCTGGAATTTTTAAACCAAAGTTGTGGATGTCTCAGTCCAAGATTGATTGGACAATTACGGAGCCAACACGAATTACAAAAGCACTTGCCACTTCTCAATGGCACAAGGC
TATGGATCAAGAGTATTCTACTTTGGTGTCAAATAATACATGGCACCTAGTTTCTCCCTCGTCTTCATATAATGTTGTGGGAAACAAATGGATTTTTCGAATAAAGAGGA
ACTCAGATGGATCGGTACAAAGATTCAAGGCTCGGTTGGTAGCCAAGAGGTTTCATCAAAGTTCCGGAGTTGACTTCTTTGAAACGTTTAGCCCCGTCGTCAAAGCTTCG
ACTATTCGAGTTGTGTTGAGCATAGTGGTGTGTCGTCAGCTGAATGGATGGATGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCACGACCCGACCCGACCCGATTTGGGTCAGTCTTGATGCGGGTCGGGTCGGGTCAGTTTTGTCCGTACAACCGATCAGGATCGGGTCGGAAAGTTGAAGGTGCGAA
GAATGACGGGGAGAAGAAGGCATGCGATGTCGACCGAAGAAGGACGACGATGTCGTCGCCGTCATCTTTCAGATCGATATGCATTAAAGCAGCAATAAAGAAGATGGATA
TGCAAGCTACAAAGGAGCTGATCCTTGCAGAAATTAAGGTTCTAACACGCGTTCACCACCTTGAACCTGGATTTCATGTTGATGGACAATCTTTTGCGGTGTATTTGATG
GTTAGGGACCATACATCAATCACCTCGTCCATCGGAACCACCAACTTCAGCAATCCATCGCTCAACCAGCTATTAAATCAGATCATGATGATTAAGCTGGATCGAAGCAA
TTTTCTTCTATGGAAAAATCTAGCTATGCCCATTCTTCGCAGCTACAAGCTGGAAGGACACCGATTAGGTGAGAATCCATGCCCTCCTCAATTTGTTCAGCCGTCTGTTG
GAATTTCAACCTCGAGTGATGCACCAGCAGAGGAAGGAGCAACAAGCACTCAGTCGGGCGCCATTGGAGCGTCTGCTTCAGTTTCAGCCGCAAAAGTGGTAAATCCACTC
TTCGATTCTTGGGTTGCGGTTGATCAGTTGCTACTTGGGTGGTTGTACAACTCAATGACACCCGAAGTAGCAACACAGGTGATGGGCTTTGATAATGCTCAAGACCTTTG
GGCAGCGATACAAGATTTATTTGGAGTTCAGTCGCATGCAGAAGAGGATTACTTACGTCAGGTGTTCCAACAATGCCAAAAAGGGAGTCTGAAGATGGTTGATTACTTGA
GAGTCATGAAAAGCTACGCAGACAATCTGGGACAGGTAGGAAGTCCGGTTACATCTCGATCCTTAGTCTCGCAAGTTCTCTTGGGCCTCGATGAAGAATACAACCCCGTG
GTTGCAATGGTGCAAGGTAGAGAAAGCACAACGTGGTCAGAATTGCAAGCTGAGCTTCTCGTATTTGAGAAGCAATTGGAAATGCAAACCAACCTGAAAAGCAATGTGTC
ATTCAGTGGAGGTGTTTCAGTTAACATGGCAAACAACAAAGAGGGTGGAAATCAAAGAGGTCAGCAAAGTTTCCCAAATAATCGGCAAAATTACTCTGGCAGAGGAGGTC
ATAGAGGAGGAGGAGGAACCCGTGGTCGTGGTAGAGCTCGTGGTTATGGTTCTTTCAACCACAAACCAATCTGCCAGGTGTGTGGGAAGGTTGGTCATATTGCTCTTATG
TGTTATCAAAGGTTTAACAAGGAATTTTTTGGTCCTTCCCAAGTTCAAAATAGGAATGATGGCAACAACAATCGACAAAATACTCAGACACAACCTACTGCCTTTGTTGC
CAACCAGAATTCAAATTCGTTTTTGGCCACACCTGAATCTGTCATTGATCCAAATTGGTATGCCGACAATGGGGAATCAAATCATGTGACCAGTGACTACAATTGCATTG
CCAATCCTGCCAATTATGAAGGTAATGAACGTGTAATTATAGGGGATGGTCATAAGTTACAAATATCTCATGTTGGGAACTCATGCTTGATTGATGGCTATAATAATCTA
AGTTTGGAAAATGTGTTATGTGTTCCTACTATTGCAAAGAACTTAGTAAGTGTTTCCAAGTTAGCTCGTGATAATCATGTGTATGTTGAATTTCAGGATGATTTTTGTCT
TGTTAAGACCAAGGATACGGGCAAGGTACTGCCGAGAGGAACACTTAATGATGGACTATACCAATTTGAGCATGTGAAGGCCACCTCTGTTTATGTTTCAAAGACAGCTA
ACCAAGATCCATCTAGAGTCAATAAAGTTGTTTCCCCTGTTTTTGTTGGTTTAAGTAGTGTAAATCTGGTTGAGTCTAAAGCTGTTTTAAATGGGAAGTCACCAATGGAA
CAACTTTTAGACAAAAAGGCTGATGTCAAGGCTTTAAGGACATTCGGTTGTGCCTGTTTCCCATGCCTACGACCTTACCAGGCTCACAAGTTCCAGTTCCACACAGAAAA
ATGCATTTACTTGGGGACTAGTCCTGTTCAAAAAGGCCATCGTTGTCTCAACTCCAATGGAAGAATATTTATCCCACGACATGTCAGATTCAATGAAAATGAGTTCCCCT
TCCGTGATGGTTTTGGTGTTACCTCTGGTATGAGTGGCTCCTCTTCAGCAACACTTCCTTCACCAAATATAACCATGTGGTTCCCTTACCCAAATGACCCTGTCAGTTTT
GTGTCATCTACCAATCCCTCTACCAGTACCGATGATCCACCTTCCACTCCTTTACAGCCTAATGCTTCACTACAAAGCCCTTGCCCCAATTTACAGTCAGCAACTCCTCG
AATTCAAAACCAAACTCAAGATCAGTCGTGTCCCTTGATGAACCATGAACCAAATGCCTCAAACTCAGCCTCAACCTCTATTGCCTCAACCCAACAGCCTCAACCTACAT
CACCACAAACTACTTCCAGTTCACCTAATCCACCTACTGAGCCATCTGCCACACAGATTGCCCAACCTCCTCCTGTACAGGCCATTCATCCCATGGTAACTAGGGGGAAA
GCTGGAATTTTTAAACCAAAGTTGTGGATGTCTCAGTCCAAGATTGATTGGACAATTACGGAGCCAACACGAATTACAAAAGCACTTGCCACTTCTCAATGGCACAAGGC
TATGGATCAAGAGTATTCTACTTTGGTGTCAAATAATACATGGCACCTAGTTTCTCCCTCGTCTTCATATAATGTTGTGGGAAACAAATGGATTTTTCGAATAAAGAGGA
ACTCAGATGGATCGGTACAAAGATTCAAGGCTCGGTTGGTAGCCAAGAGGTTTCATCAAAGTTCCGGAGTTGACTTCTTTGAAACGTTTAGCCCCGTCGTCAAAGCTTCG
ACTATTCGAGTTGTGTTGAGCATAGTGGTGTGTCGTCAGCTGAATGGATGGATGTGA
Protein sequenceShow/hide protein sequence
MARPDPTRFGSVLMRVGSGQFCPYNRSGSGRKVEGAKNDGEKKACDVDRRRTTMSSPSSFRSICIKAAIKKMDMQATKELILAEIKVLTRVHHLEPGFHVDGQSFAVYLM
VRDHTSITSSIGTTNFSNPSLNQLLNQIMMIKLDRSNFLLWKNLAMPILRSYKLEGHRLGENPCPPQFVQPSVGISTSSDAPAEEGATSTQSGAIGASASVSAAKVVNPL
FDSWVAVDQLLLGWLYNSMTPEVATQVMGFDNAQDLWAAIQDLFGVQSHAEEDYLRQVFQQCQKGSLKMVDYLRVMKSYADNLGQVGSPVTSRSLVSQVLLGLDEEYNPV
VAMVQGRESTTWSELQAELLVFEKQLEMQTNLKSNVSFSGGVSVNMANNKEGGNQRGQQSFPNNRQNYSGRGGHRGGGGTRGRGRARGYGSFNHKPICQVCGKVGHIALM
CYQRFNKEFFGPSQVQNRNDGNNNRQNTQTQPTAFVANQNSNSFLATPESVIDPNWYADNGESNHVTSDYNCIANPANYEGNERVIIGDGHKLQISHVGNSCLIDGYNNL
SLENVLCVPTIAKNLVSVSKLARDNHVYVEFQDDFCLVKTKDTGKVLPRGTLNDGLYQFEHVKATSVYVSKTANQDPSRVNKVVSPVFVGLSSVNLVESKAVLNGKSPME
QLLDKKADVKALRTFGCACFPCLRPYQAHKFQFHTEKCIYLGTSPVQKGHRCLNSNGRIFIPRHVRFNENEFPFRDGFGVTSGMSGSSSATLPSPNITMWFPYPNDPVSF
VSSTNPSTSTDDPPSTPLQPNASLQSPCPNLQSATPRIQNQTQDQSCPLMNHEPNASNSASTSIASTQQPQPTSPQTTSSSPNPPTEPSATQIAQPPPVQAIHPMVTRGK
AGIFKPKLWMSQSKIDWTITEPTRITKALATSQWHKAMDQEYSTLVSNNTWHLVSPSSSYNVVGNKWIFRIKRNSDGSVQRFKARLVAKRFHQSSGVDFFETFSPVVKAS
TIRVVLSIVVCRQLNGWM