; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0026361 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0026361
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotran_gag_3 domain-containing protein
Genome locationchr10:35463868..35465127
RNA-Seq ExpressionLag0026361
SyntenyLag0026361
Gene Ontology termsNA
InterPro domainsIPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8645659.1 hypothetical protein Csa_020439 [Cucumis sativus]4.0e-8650.37Show/hide
Query:  AASSEKDSHSPIFLFSNICNLISILLDSTNFVLWKFQLQSILKAHKHFGFIDGSIVCPPKTIPASSSTSTETPLEPALASTTPSINPFYEDWIAKDQALM
        ++S+EKDS SPIFL SNICNLIS+ LDSTNFVLWKFQL +ILKAHK FGF+DG+  C P+T P+++ST              P  NP YEDWIAKDQALM
Subjt:  AASSEKDSHSPIFLFSNICNLISILLDSTNFVLWKFQLQSILKAHKHFGFIDGSIVCPPKTIPASSSTSTETPLEPALASTTPSINPFYEDWIAKDQALM

Query:  TLINATLSLAALAYVVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSDLQSISKK----------------------SVIIDEEDLLIYTLNGLPSDYNAFR
        T+INATLS  ALAYVVG TSSKQ W+VL K YSS SR+N+VNLKSDLQ+I KK                      S  I+EEDLLIY LNGLP++YN FR
Subjt:  TLINATLSLAALAYVVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSDLQSISKK----------------------SVIIDEEDLLIYTLNGLPSDYNAFR

Query:  TSMRTRSQTTTFDELHVLLKSEESAIEKQTKSEEPLTQSSAML--------VAQTHSTSSPRLPSSG-NFERGRSNFNC------------PLHQG----
        TSMRTRSQ  TF+ELHVLL++EESA+ KQ+K ++   Q + +L         A T + +  R    G N+  GR +F+             P+H      
Subjt:  TSMRTRSQTTTFDELHVLLKSEESAIEKQTKSEEPLTQSSAML--------VAQTHSTSSPRLPSSG-NFERGRSNFNC------------PLHQG----

Query:  ------------SWAWTNYHFQGRHPPPQLVAMVASQNVAYCSNSVAGPSGTPSFSNWLDDSGCNAHLTADLNHLSLSSEYAGDDQISVGSGQFLPISHT
                     +   NY+FQGRHPP QL AMVASQN A+   S+   S        L DSGCN H+T+D+N++SL+ EY G++Q+ VG+GQ  PISH+
Subjt:  ------------SWAWTNYHFQGRHPPPQLVAMVASQNVAYCSNSVAGPSGTPSFSNWLDDSGCNAHLTADLNHLSLSSEYAGDDQISVGSGQFLPISHT

Query:  G
        G
Subjt:  G

XP_008448007.1 PREDICTED: uncharacterized protein LOC103490319 isoform X2 [Cucumis melo]1.3e-8450.25Show/hide
Query:  AASSEKDSHSPIFLFSNICNLISILLDSTNFVLWKFQLQSILKAHKHFGFIDGSIVCPPKTIPASSSTSTETPLEPALASTTPSINPFYEDWIAKDQALM
        ++S+EKDS SPIFL SNICNLIS+ LDSTNFVLWKFQL +ILKAHK +GFIDG+  CPP+T   SSSTST            P  NP YEDWIAKDQALM
Subjt:  AASSEKDSHSPIFLFSNICNLISILLDSTNFVLWKFQLQSILKAHKHFGFIDGSIVCPPKTIPASSSTSTETPLEPALASTTPSINPFYEDWIAKDQALM

Query:  TLINATLSLAALAYVVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSDLQSISKK----------------------SVIIDEEDLLIYTLNGLPSDYNAFR
        T+INATLS  ALAYVVG TSSKQ W+VL K YSS SR+N+VNLKSDLQ+I KK                      S  I+EEDLLIY LNGLP++YN FR
Subjt:  TLINATLSLAALAYVVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSDLQSISKK----------------------SVIIDEEDLLIYTLNGLPSDYNAFR

Query:  TSMRTRSQTTTFDELHVLLKSEESAIEKQTKSEEPLTQSSAMLVAQTHSTSSPRLPSSGNFERGR-------------------------------SNFN
        TSMRTRSQ  TF+ELHVLL++EESA+ KQ+K ++   Q + +L++ + S  S       NF RG                                ++  
Subjt:  TSMRTRSQTTTFDELHVLLKSEESAIEKQTKSEEPLTQSSAMLVAQTHSTSSPRLPSSGNFERGR-------------------------------SNFN

Query:  CPL--HQGSWAW-----TNYHFQGRHPPPQLVAMVASQNVAYCSNSVAGPSGTPSFSNWLDDSGCNAHLTADLNHLSLSSEYAGDDQISVGSGQFLPISH
        C +   +G  A       NY+FQGRHPP QL AMVASQN A+   S+   S        L DSGCN  +T+D+N++SL+ EY G++Q+ +G+GQ  P+SH
Subjt:  CPL--HQGSWAW-----TNYHFQGRHPPPQLVAMVASQNVAYCSNSVAGPSGTPSFSNWLDDSGCNAHLTADLNHLSLSSEYAGDDQISVGSGQFLPISH

Query:  TG
        +G
Subjt:  TG

XP_008448008.1 PREDICTED: uncharacterized protein LOC103490319 isoform X3 [Cucumis melo]1.3e-8450.25Show/hide
Query:  AASSEKDSHSPIFLFSNICNLISILLDSTNFVLWKFQLQSILKAHKHFGFIDGSIVCPPKTIPASSSTSTETPLEPALASTTPSINPFYEDWIAKDQALM
        ++S+EKDS SPIFL SNICNLIS+ LDSTNFVLWKFQL +ILKAHK +GFIDG+  CPP+T   SSSTST            P  NP YEDWIAKDQALM
Subjt:  AASSEKDSHSPIFLFSNICNLISILLDSTNFVLWKFQLQSILKAHKHFGFIDGSIVCPPKTIPASSSTSTETPLEPALASTTPSINPFYEDWIAKDQALM

Query:  TLINATLSLAALAYVVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSDLQSISKK----------------------SVIIDEEDLLIYTLNGLPSDYNAFR
        T+INATLS  ALAYVVG TSSKQ W+VL K YSS SR+N+VNLKSDLQ+I KK                      S  I+EEDLLIY LNGLP++YN FR
Subjt:  TLINATLSLAALAYVVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSDLQSISKK----------------------SVIIDEEDLLIYTLNGLPSDYNAFR

Query:  TSMRTRSQTTTFDELHVLLKSEESAIEKQTKSEEPLTQSSAMLVAQTHSTSSPRLPSSGNFERGR-------------------------------SNFN
        TSMRTRSQ  TF+ELHVLL++EESA+ KQ+K ++   Q + +L++ + S  S       NF RG                                ++  
Subjt:  TSMRTRSQTTTFDELHVLLKSEESAIEKQTKSEEPLTQSSAMLVAQTHSTSSPRLPSSGNFERGR-------------------------------SNFN

Query:  CPL--HQGSWAW-----TNYHFQGRHPPPQLVAMVASQNVAYCSNSVAGPSGTPSFSNWLDDSGCNAHLTADLNHLSLSSEYAGDDQISVGSGQFLPISH
        C +   +G  A       NY+FQGRHPP QL AMVASQN A+   S+   S        L DSGCN  +T+D+N++SL+ EY G++Q+ +G+GQ  P+SH
Subjt:  CPL--HQGSWAW-----TNYHFQGRHPPPQLVAMVASQNVAYCSNSVAGPSGTPSFSNWLDDSGCNAHLTADLNHLSLSSEYAGDDQISVGSGQFLPISH

Query:  TG
        +G
Subjt:  TG

XP_011658579.1 uncharacterized protein LOC105436058 [Cucumis sativus]4.0e-8650.37Show/hide
Query:  AASSEKDSHSPIFLFSNICNLISILLDSTNFVLWKFQLQSILKAHKHFGFIDGSIVCPPKTIPASSSTSTETPLEPALASTTPSINPFYEDWIAKDQALM
        ++S+EKDS SPIFL SNICNLIS+ LDSTNFVLWKFQL +ILKAHK FGF+DG+  C P+T P+++ST              P  NP YEDWIAKDQALM
Subjt:  AASSEKDSHSPIFLFSNICNLISILLDSTNFVLWKFQLQSILKAHKHFGFIDGSIVCPPKTIPASSSTSTETPLEPALASTTPSINPFYEDWIAKDQALM

Query:  TLINATLSLAALAYVVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSDLQSISKK----------------------SVIIDEEDLLIYTLNGLPSDYNAFR
        T+INATLS  ALAYVVG TSSKQ W+VL K YSS SR+N+VNLKSDLQ+I KK                      S  I+EEDLLIY LNGLP++YN FR
Subjt:  TLINATLSLAALAYVVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSDLQSISKK----------------------SVIIDEEDLLIYTLNGLPSDYNAFR

Query:  TSMRTRSQTTTFDELHVLLKSEESAIEKQTKSEEPLTQSSAML--------VAQTHSTSSPRLPSSG-NFERGRSNFNC------------PLHQG----
        TSMRTRSQ  TF+ELHVLL++EESA+ KQ+K ++   Q + +L         A T + +  R    G N+  GR +F+             P+H      
Subjt:  TSMRTRSQTTTFDELHVLLKSEESAIEKQTKSEEPLTQSSAML--------VAQTHSTSSPRLPSSG-NFERGRSNFNC------------PLHQG----

Query:  ------------SWAWTNYHFQGRHPPPQLVAMVASQNVAYCSNSVAGPSGTPSFSNWLDDSGCNAHLTADLNHLSLSSEYAGDDQISVGSGQFLPISHT
                     +   NY+FQGRHPP QL AMVASQN A+   S+   S        L DSGCN H+T+D+N++SL+ EY G++Q+ VG+GQ  PISH+
Subjt:  ------------SWAWTNYHFQGRHPPPQLVAMVASQNVAYCSNSVAGPSGTPSFSNWLDDSGCNAHLTADLNHLSLSSEYAGDDQISVGSGQFLPISHT

Query:  G
        G
Subjt:  G

XP_022150845.1 uncharacterized protein LOC111018892 [Momordica charantia]8.0e-8749.63Show/hide
Query:  AASSEKDSHSPIFLFSNICNLISILLDSTNFVLWKFQLQSILKAHKHFGFIDGSIVCPPKTIPASSSTSTETPLEPALASTTPSINPFYEDWIAKDQALM
        + +++KD HSPIFL SNICNL+SI LDST+F+LWKFQL +ILKAHK FGFIDGS+  P + + +SS    ET  +P   ++ P INP +EDWIAKDQALM
Subjt:  AASSEKDSHSPIFLFSNICNLISILLDSTNFVLWKFQLQSILKAHKHFGFIDGSIVCPPKTIPASSSTSTETPLEPALASTTPSINPFYEDWIAKDQALM

Query:  TLINATLSLAALAYVVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSDLQSISKK----------------------SVIIDEEDLLIYTLNGLPSDYNAFR
        TLINATLS  ALAYVV   +SKQ WEVLEKHYSS+SRTN+VNLKSDLQSI KK                      S+ I++E LLIY LNGL ++YN   
Subjt:  TLINATLSLAALAYVVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSDLQSISKK----------------------SVIIDEEDLLIYTLNGLPSDYNAFR

Query:  TSMRTRSQTTTFDELHVLLKSEESAIEKQTKSEEPLTQSSAMLVA--------------QTHSTSSPR---------LPSSGNFERGRSNFN--------
        TSMRTR+Q+ +F+ELHV +KSEESAIEKQ K E+ +TQ +A+  +              Q+H     +          P+  N  RGRS+ N        
Subjt:  TSMRTRSQTTTFDELHVLLKSEESAIEKQTKSEEPLTQSSAMLVA--------------QTHSTSSPR---------LPSSGNFERGRSNFN--------

Query:  ----CPL-----HQGSWAWT--NYHFQGRHPPPQLVAMVASQNVAYCSNSVAGPSGTPSFSNWLDDSGCNAHLTADLNHL---SLSSEYAGDDQISVGSG
            C +     H     +   N+HFQGRHPPPQL AMVA QN +Y +       G  S + WL DS CN H+TADL++L   S++S+Y G++ ISVGSG
Subjt:  ----CPL-----HQGSWAWT--NYHFQGRHPPPQLVAMVASQNVAYCSNSVAGPSGTPSFSNWLDDSGCNAHLTADLNHL---SLSSEYAGDDQISVGSG

Query:  QFLPISHTG
        Q  PI+H G
Subjt:  QFLPISHTG

TrEMBL top hitse value%identityAlignment
A0A1S3BI58 uncharacterized protein LOC103490319 isoform X26.2e-8550.25Show/hide
Query:  AASSEKDSHSPIFLFSNICNLISILLDSTNFVLWKFQLQSILKAHKHFGFIDGSIVCPPKTIPASSSTSTETPLEPALASTTPSINPFYEDWIAKDQALM
        ++S+EKDS SPIFL SNICNLIS+ LDSTNFVLWKFQL +ILKAHK +GFIDG+  CPP+T   SSSTST            P  NP YEDWIAKDQALM
Subjt:  AASSEKDSHSPIFLFSNICNLISILLDSTNFVLWKFQLQSILKAHKHFGFIDGSIVCPPKTIPASSSTSTETPLEPALASTTPSINPFYEDWIAKDQALM

Query:  TLINATLSLAALAYVVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSDLQSISKK----------------------SVIIDEEDLLIYTLNGLPSDYNAFR
        T+INATLS  ALAYVVG TSSKQ W+VL K YSS SR+N+VNLKSDLQ+I KK                      S  I+EEDLLIY LNGLP++YN FR
Subjt:  TLINATLSLAALAYVVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSDLQSISKK----------------------SVIIDEEDLLIYTLNGLPSDYNAFR

Query:  TSMRTRSQTTTFDELHVLLKSEESAIEKQTKSEEPLTQSSAMLVAQTHSTSSPRLPSSGNFERGR-------------------------------SNFN
        TSMRTRSQ  TF+ELHVLL++EESA+ KQ+K ++   Q + +L++ + S  S       NF RG                                ++  
Subjt:  TSMRTRSQTTTFDELHVLLKSEESAIEKQTKSEEPLTQSSAMLVAQTHSTSSPRLPSSGNFERGR-------------------------------SNFN

Query:  CPL--HQGSWAW-----TNYHFQGRHPPPQLVAMVASQNVAYCSNSVAGPSGTPSFSNWLDDSGCNAHLTADLNHLSLSSEYAGDDQISVGSGQFLPISH
        C +   +G  A       NY+FQGRHPP QL AMVASQN A+   S+   S        L DSGCN  +T+D+N++SL+ EY G++Q+ +G+GQ  P+SH
Subjt:  CPL--HQGSWAW-----TNYHFQGRHPPPQLVAMVASQNVAYCSNSVAGPSGTPSFSNWLDDSGCNAHLTADLNHLSLSSEYAGDDQISVGSGQFLPISH

Query:  TG
        +G
Subjt:  TG

A0A1S3BIR3 uncharacterized protein LOC103490319 isoform X36.2e-8550.25Show/hide
Query:  AASSEKDSHSPIFLFSNICNLISILLDSTNFVLWKFQLQSILKAHKHFGFIDGSIVCPPKTIPASSSTSTETPLEPALASTTPSINPFYEDWIAKDQALM
        ++S+EKDS SPIFL SNICNLIS+ LDSTNFVLWKFQL +ILKAHK +GFIDG+  CPP+T   SSSTST            P  NP YEDWIAKDQALM
Subjt:  AASSEKDSHSPIFLFSNICNLISILLDSTNFVLWKFQLQSILKAHKHFGFIDGSIVCPPKTIPASSSTSTETPLEPALASTTPSINPFYEDWIAKDQALM

Query:  TLINATLSLAALAYVVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSDLQSISKK----------------------SVIIDEEDLLIYTLNGLPSDYNAFR
        T+INATLS  ALAYVVG TSSKQ W+VL K YSS SR+N+VNLKSDLQ+I KK                      S  I+EEDLLIY LNGLP++YN FR
Subjt:  TLINATLSLAALAYVVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSDLQSISKK----------------------SVIIDEEDLLIYTLNGLPSDYNAFR

Query:  TSMRTRSQTTTFDELHVLLKSEESAIEKQTKSEEPLTQSSAMLVAQTHSTSSPRLPSSGNFERGR-------------------------------SNFN
        TSMRTRSQ  TF+ELHVLL++EESA+ KQ+K ++   Q + +L++ + S  S       NF RG                                ++  
Subjt:  TSMRTRSQTTTFDELHVLLKSEESAIEKQTKSEEPLTQSSAMLVAQTHSTSSPRLPSSGNFERGR-------------------------------SNFN

Query:  CPL--HQGSWAW-----TNYHFQGRHPPPQLVAMVASQNVAYCSNSVAGPSGTPSFSNWLDDSGCNAHLTADLNHLSLSSEYAGDDQISVGSGQFLPISH
        C +   +G  A       NY+FQGRHPP QL AMVASQN A+   S+   S        L DSGCN  +T+D+N++SL+ EY G++Q+ +G+GQ  P+SH
Subjt:  CPL--HQGSWAW-----TNYHFQGRHPPPQLVAMVASQNVAYCSNSVAGPSGTPSFSNWLDDSGCNAHLTADLNHLSLSSEYAGDDQISVGSGQFLPISH

Query:  TG
        +G
Subjt:  TG

A0A1S4DWT9 uncharacterized protein LOC103490319 isoform X16.2e-8550.25Show/hide
Query:  AASSEKDSHSPIFLFSNICNLISILLDSTNFVLWKFQLQSILKAHKHFGFIDGSIVCPPKTIPASSSTSTETPLEPALASTTPSINPFYEDWIAKDQALM
        ++S+EKDS SPIFL SNICNLIS+ LDSTNFVLWKFQL +ILKAHK +GFIDG+  CPP+T   SSSTST            P  NP YEDWIAKDQALM
Subjt:  AASSEKDSHSPIFLFSNICNLISILLDSTNFVLWKFQLQSILKAHKHFGFIDGSIVCPPKTIPASSSTSTETPLEPALASTTPSINPFYEDWIAKDQALM

Query:  TLINATLSLAALAYVVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSDLQSISKK----------------------SVIIDEEDLLIYTLNGLPSDYNAFR
        T+INATLS  ALAYVVG TSSKQ W+VL K YSS SR+N+VNLKSDLQ+I KK                      S  I+EEDLLIY LNGLP++YN FR
Subjt:  TLINATLSLAALAYVVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSDLQSISKK----------------------SVIIDEEDLLIYTLNGLPSDYNAFR

Query:  TSMRTRSQTTTFDELHVLLKSEESAIEKQTKSEEPLTQSSAMLVAQTHSTSSPRLPSSGNFERGR-------------------------------SNFN
        TSMRTRSQ  TF+ELHVLL++EESA+ KQ+K ++   Q + +L++ + S  S       NF RG                                ++  
Subjt:  TSMRTRSQTTTFDELHVLLKSEESAIEKQTKSEEPLTQSSAMLVAQTHSTSSPRLPSSGNFERGR-------------------------------SNFN

Query:  CPL--HQGSWAW-----TNYHFQGRHPPPQLVAMVASQNVAYCSNSVAGPSGTPSFSNWLDDSGCNAHLTADLNHLSLSSEYAGDDQISVGSGQFLPISH
        C +   +G  A       NY+FQGRHPP QL AMVASQN A+   S+   S        L DSGCN  +T+D+N++SL+ EY G++Q+ +G+GQ  P+SH
Subjt:  CPL--HQGSWAW-----TNYHFQGRHPPPQLVAMVASQNVAYCSNSVAGPSGTPSFSNWLDDSGCNAHLTADLNHLSLSSEYAGDDQISVGSGQFLPISH

Query:  TG
        +G
Subjt:  TG

A0A5D3CLI6 T4.53.1e-8450.12Show/hide
Query:  AASSEKDSHSPIFLFSNICNLISILLDSTNFVLWKFQLQSILKAHKHFGFIDGSIVCPPKTIPASSSTSTETPLEPALASTTPSINPFYEDWIAKDQALM
        ++S+EKDS SPIFL SNICNLIS+ LDSTNFVLWKFQL +ILKAHK +GFIDG+  CPP+T   SSSTST            P  NP YEDWIAKDQALM
Subjt:  AASSEKDSHSPIFLFSNICNLISILLDSTNFVLWKFQLQSILKAHKHFGFIDGSIVCPPKTIPASSSTSTETPLEPALASTTPSINPFYEDWIAKDQALM

Query:  TLINATLSLAALAYVVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSDLQSISKK----------------------SVIIDEEDLLIYTLNGLPSDYNAFR
        T+INATLS  ALAYVVG TSSKQ W+VL K YSS SR+N+VNLKSDLQ+I KK                      S  I+EEDLLIY LNGLP++YN FR
Subjt:  TLINATLSLAALAYVVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSDLQSISKK----------------------SVIIDEEDLLIYTLNGLPSDYNAFR

Query:  TSMRTRSQTTTFDELHVLLKSEESAIEKQTKSEEPLTQSSAMLVAQTHSTSSPRLPSSGNFERGR-------------------------------SNFN
        TSMRTRSQ  TF+ELHVLL++EESA+ KQ+K ++   Q + +L++ + S  S       NF RG                                ++  
Subjt:  TSMRTRSQTTTFDELHVLLKSEESAIEKQTKSEEPLTQSSAMLVAQTHSTSSPRLPSSGNFERGR-------------------------------SNFN

Query:  CPL--HQGSWAW-----TNYHFQGRHPPPQLVAMVASQNVAYCSNSVAGPSGTPSFSNWLDDSGCNAHLTADLNHLSLSSEYAGDDQISVGSGQFLPISH
        C +   +G  A       NY+FQGRHPP QL AMVASQN A+   S+   S        L DSGCN  +T+D+N++SL+ EY G++Q+ +G+GQ  P+SH
Subjt:  CPL--HQGSWAW-----TNYHFQGRHPPPQLVAMVASQNVAYCSNSVAGPSGTPSFSNWLDDSGCNAHLTADLNHLSLSSEYAGDDQISVGSGQFLPISH

Query:  T
        +
Subjt:  T

A0A6J1D9L6 uncharacterized protein LOC1110188923.9e-8749.63Show/hide
Query:  AASSEKDSHSPIFLFSNICNLISILLDSTNFVLWKFQLQSILKAHKHFGFIDGSIVCPPKTIPASSSTSTETPLEPALASTTPSINPFYEDWIAKDQALM
        + +++KD HSPIFL SNICNL+SI LDST+F+LWKFQL +ILKAHK FGFIDGS+  P + + +SS    ET  +P   ++ P INP +EDWIAKDQALM
Subjt:  AASSEKDSHSPIFLFSNICNLISILLDSTNFVLWKFQLQSILKAHKHFGFIDGSIVCPPKTIPASSSTSTETPLEPALASTTPSINPFYEDWIAKDQALM

Query:  TLINATLSLAALAYVVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSDLQSISKK----------------------SVIIDEEDLLIYTLNGLPSDYNAFR
        TLINATLS  ALAYVV   +SKQ WEVLEKHYSS+SRTN+VNLKSDLQSI KK                      S+ I++E LLIY LNGL ++YN   
Subjt:  TLINATLSLAALAYVVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSDLQSISKK----------------------SVIIDEEDLLIYTLNGLPSDYNAFR

Query:  TSMRTRSQTTTFDELHVLLKSEESAIEKQTKSEEPLTQSSAMLVA--------------QTHSTSSPR---------LPSSGNFERGRSNFN--------
        TSMRTR+Q+ +F+ELHV +KSEESAIEKQ K E+ +TQ +A+  +              Q+H     +          P+  N  RGRS+ N        
Subjt:  TSMRTRSQTTTFDELHVLLKSEESAIEKQTKSEEPLTQSSAMLVA--------------QTHSTSSPR---------LPSSGNFERGRSNFN--------

Query:  ----CPL-----HQGSWAWT--NYHFQGRHPPPQLVAMVASQNVAYCSNSVAGPSGTPSFSNWLDDSGCNAHLTADLNHL---SLSSEYAGDDQISVGSG
            C +     H     +   N+HFQGRHPPPQL AMVA QN +Y +       G  S + WL DS CN H+TADL++L   S++S+Y G++ ISVGSG
Subjt:  ----CPL-----HQGSWAWT--NYHFQGRHPPPQLVAMVASQNVAYCSNSVAGPSGTPSFSNWLDDSGCNAHLTADLNHL---SLSSEYAGDDQISVGSG

Query:  QFLPISHTG
        Q  PI+H G
Subjt:  QFLPISHTG

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.8e-2326.16Show/hide
Query:  LDSTNFVLWKFQLQSILKAHKHFGFIDGSIVCPPKTIPASSSTSTETPLEPALASTTPSINPFYEDWIAKDQALMTLINATLSLAALAYVVGCTSSKQAW
        L STN+++W  Q+ ++   ++  GF+DGS   PP TI   ++               P +NP Y  W  +D+ + + +   +S++    V   T++ Q W
Subjt:  LDSTNFVLWKFQLQSILKAHKHFGFIDGSIVCPPKTIPASSSTSTETPLEPALASTTPSINPFYEDWIAKDQALMTLINATLSLAALAYVVGCTSSKQAW

Query:  EVLEKHYSSSSRTNIVNLKSDLQSISKKSVIIDE---------------------EDLLIYTLNGLPSDYNAFRTSMRTRSQTTTFDELHVLLKSEESAI
        E L K Y++ S  ++  L++ L+  +K +  ID+                     ++ +   L  LP +Y      +  +    T  E+H  L + ES I
Subjt:  EVLEKHYSSSSRTNIVNLKSDLQSISKKSVIIDE---------------------EDLLIYTLNGLPSDYNAFRTSMRTRSQTTTFDELHVLLKSEESAI

Query:  EKQTKSEEPLTQSSAMLVAQTHSTSSPRLPSSGNFERGRSNFN--CPLHQGSWAWTNYHFQGRHPPPQL----VAMVASQNVAYCS---------NSVAG
           + +      ++A+    T +T++    +  N    R+N N   P  Q S   TN+H       P L    +  V   +   CS         NS   
Subjt:  EKQTKSEEPLTQSSAMLVAQTHSTSSPRLPSSGNFERGRSNFN--CPLHQGSWAWTNYHFQGRHPPPQL----VAMVASQNVAYCS---------NSVAG

Query:  PS-------------GTP-SFSNWLDDSGCNAHLTADLNHLSLSSEYAGDDQISVGSGQFLPISHTG
        PS             G+P S +NWL DSG   H+T+D N+LSL   Y G D + V  G  +PISHTG
Subjt:  PS-------------GTP-SFSNWLDDSGCNAHLTADLNHLSLSSEYAGDDQISVGSGQFLPISHTG

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.5e-1923.92Show/hide
Query:  LDSTNFVLWKFQLQSILKAHKHFGFIDGSIVCPPKTIPASSSTSTETPLEPALASTTPSINPFYEDWIAKDQALMTLINATLSLAALAYVVGCTSSKQAW
        L STN+++W  Q+ ++   ++  GF+DGS   PP TI   +                P +NP Y  W  +D+ + + I   +S++    V   T++ Q W
Subjt:  LDSTNFVLWKFQLQSILKAHKHFGFIDGSIVCPPKTIPASSSTSTETPLEPALASTTPSINPFYEDWIAKDQALMTLINATLSLAALAYVVGCTSSKQAW

Query:  EVLEKHYSSSSRTNIVNLK--SDLQSISKKSVIIDEEDLLIYTLNGLPSDYNAFRTSMRTRSQTTTFDELHVLLKSEESAIEKQTKSE-EPLTQSSAMLV
        E L K Y++ S  ++  L+  +    ++     +D ++ +   L  LP DY      +  +    +  E+H  L + ES +     +E  P+T +    V
Subjt:  EVLEKHYSSSSRTNIVNLK--SDLQSISKKSVIIDEEDLLIYTLNGLPSDYNAFRTSMRTRSQTTTFDELHVLLKSEESAIEKQTKSE-EPLTQSSAMLV

Query:  AQTHSTSSPRLPSSGNFERGRSNFNCPLHQGSWAWTNYHFQGRHPPPQL----VAMVASQNVAYC-------SNSVAGPSGTP----------------S
            +T++ R  ++    R  +N N   +    + +      R P P L    +  V   +   C       S +    S +P                +
Subjt:  AQTHSTSSPRLPSSGNFERGRSNFNCPLHQGSWAWTNYHFQGRHPPPQL----VAMVASQNVAYC-------SNSVAGPSGTP----------------S

Query:  FSNWLDDSGCNAHLTADLNHLSLSSEYAGDDQISVGSGQFLPISHTG
         +NWL DSG   H+T+D N+LS    Y G D + +  G  +PI+HTG
Subjt:  FSNWLDDSGCNAHLTADLNHLSLSSEYAGDDQISVGSGQFLPISHTG

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGCCTCATCAGAAAAGGACTCACATTCCCCAATTTTTCTCTTCTCCAACATTTGTAACCTAATCTCCATTCTGCTAGACTCCACAAATTTTGTTCTATGGAAATT
TCAGCTTCAATCTATCCTGAAGGCCCATAAACATTTTGGCTTCATCGATGGTTCCATTGTTTGTCCACCTAAAACGATTCCTGCATCGTCTTCGACTTCCACAGAAACGC
CACTTGAACCTGCGCTGGCTTCGACCACTCCTTCTATCAACCCTTTCTATGAAGACTGGATTGCAAAAGATCAAGCTCTCATGACCTTGATCAATGCCACGTTGTCTCTT
GCTGCACTGGCCTATGTCGTTGGATGCACCTCATCGAAACAAGCCTGGGAGGTTTTAGAGAAGCACTACTCTTCAAGCTCGAGGACCAACATTGTTAATCTGAAATCTGA
CCTTCAGTCCATATCCAAGAAATCTGTTATTATTGATGAGGAGGATCTACTAATTTATACTCTAAATGGTTTACCTTCTGACTATAATGCTTTTCGCACATCCATGAGGA
CTCGATCCCAAACCACCACATTTGATGAACTTCACGTCTTACTTAAATCTGAAGAATCTGCTATAGAGAAACAGACAAAATCAGAAGAGCCCCTCACCCAATCATCCGCT
ATGCTAGTTGCTCAGACTCATTCGACCTCTTCTCCGCGTTTGCCTTCTTCTGGGAATTTTGAGCGTGGTCGTTCCAACTTCAATTGTCCTCTCCATCAGGGGTCGTGGGC
GTGGACGAACTATCACTTCCAAGGTCGACACCCACCCCCTCAACTTGTAGCTATGGTTGCCTCACAGAATGTTGCTTATTGTAGCAATAGCGTTGCTGGGCCTAGTGGTA
CACCGAGTTTCTCAAATTGGTTGGATGATTCTGGATGCAATGCCCACCTAACAGCAGACCTCAATCATCTGTCTCTTTCTTCAGAATATGCAGGTGATGATCAAATCTCA
GTAGGCAGTGGACAATTTCTCCCTATATCCCACACTGGTTTTGATTGGTTCTCAGTATGTCAATAA
mRNA sequenceShow/hide mRNA sequence
ATGGCAGCCTCATCAGAAAAGGACTCACATTCCCCAATTTTTCTCTTCTCCAACATTTGTAACCTAATCTCCATTCTGCTAGACTCCACAAATTTTGTTCTATGGAAATT
TCAGCTTCAATCTATCCTGAAGGCCCATAAACATTTTGGCTTCATCGATGGTTCCATTGTTTGTCCACCTAAAACGATTCCTGCATCGTCTTCGACTTCCACAGAAACGC
CACTTGAACCTGCGCTGGCTTCGACCACTCCTTCTATCAACCCTTTCTATGAAGACTGGATTGCAAAAGATCAAGCTCTCATGACCTTGATCAATGCCACGTTGTCTCTT
GCTGCACTGGCCTATGTCGTTGGATGCACCTCATCGAAACAAGCCTGGGAGGTTTTAGAGAAGCACTACTCTTCAAGCTCGAGGACCAACATTGTTAATCTGAAATCTGA
CCTTCAGTCCATATCCAAGAAATCTGTTATTATTGATGAGGAGGATCTACTAATTTATACTCTAAATGGTTTACCTTCTGACTATAATGCTTTTCGCACATCCATGAGGA
CTCGATCCCAAACCACCACATTTGATGAACTTCACGTCTTACTTAAATCTGAAGAATCTGCTATAGAGAAACAGACAAAATCAGAAGAGCCCCTCACCCAATCATCCGCT
ATGCTAGTTGCTCAGACTCATTCGACCTCTTCTCCGCGTTTGCCTTCTTCTGGGAATTTTGAGCGTGGTCGTTCCAACTTCAATTGTCCTCTCCATCAGGGGTCGTGGGC
GTGGACGAACTATCACTTCCAAGGTCGACACCCACCCCCTCAACTTGTAGCTATGGTTGCCTCACAGAATGTTGCTTATTGTAGCAATAGCGTTGCTGGGCCTAGTGGTA
CACCGAGTTTCTCAAATTGGTTGGATGATTCTGGATGCAATGCCCACCTAACAGCAGACCTCAATCATCTGTCTCTTTCTTCAGAATATGCAGGTGATGATCAAATCTCA
GTAGGCAGTGGACAATTTCTCCCTATATCCCACACTGGTTTTGATTGGTTCTCAGTATGTCAATAA
Protein sequenceShow/hide protein sequence
MAASSEKDSHSPIFLFSNICNLISILLDSTNFVLWKFQLQSILKAHKHFGFIDGSIVCPPKTIPASSSTSTETPLEPALASTTPSINPFYEDWIAKDQALMTLINATLSL
AALAYVVGCTSSKQAWEVLEKHYSSSSRTNIVNLKSDLQSISKKSVIIDEEDLLIYTLNGLPSDYNAFRTSMRTRSQTTTFDELHVLLKSEESAIEKQTKSEEPLTQSSA
MLVAQTHSTSSPRLPSSGNFERGRSNFNCPLHQGSWAWTNYHFQGRHPPPQLVAMVASQNVAYCSNSVAGPSGTPSFSNWLDDSGCNAHLTADLNHLSLSSEYAGDDQIS
VGSGQFLPISHTGFDWFSVCQ