; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0004522 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0004522
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotran_gag_3 domain-containing protein
Genome locationchr6:4629284..4630570
RNA-Seq ExpressionLag0004522
SyntenyLag0004522
Gene Ontology termsNA
InterPro domainsIPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8645659.1 hypothetical protein Csa_020439 [Cucumis sativus]5.9e-11657.67Show/hide
Query:  ASDLQKDLTSPIFLLSNICNLVSIRLDSSNFVLWKFQLTSILKAHKLFGFVDGSLPAPSKVLPRESSVDSATSSAAAAGSAPSLPNPQFDDWLAKDHALM
        +S  +KD  SPIFLLSNICNL+S+RLDS+NFVLWKFQLT+ILKAHKLFGFVDG+ P P              +S +   + P   NP ++DW+AKD ALM
Subjt:  ASDLQKDLTSPIFLLSNICNLVSIRLDSSNFVLWKFQLTSILKAHKLFGFVDGSLPAPSKVLPRESSVDSATSSAAAAGSAPSLPNPQFDDWLAKDHALM

Query:  TLINATLSPAALAYVVGCSTSKEVWDALEKHYSSTSRTNIVNLKSDLQSITKKSSESIDDYVKRIKELKDKLANVSSIVNDEDLFIYTLNGLPSDYNTFK
        T+INATLSP ALAYVVG ++SK+VWD L K YSS SR+N+VNLKSDLQ+I KK  ESID Y+KRIKE+KDKLANVS+ +N+EDL IY LNGLP++YNTF+
Subjt:  TLINATLSPAALAYVVGCSTSKEVWDALEKHYSSTSRTNIVNLKSDLQSITKKSSESIDDYVKRIKELKDKLANVSSIVNDEDLFIYTLNGLPSDYNTFK

Query:  TSLRTRAQPPSFAELHVLLKSEESALEKQNRCDDSSAPPTALLANAHSGAPRGQSTNNFFRGRSQGRGRTQGRGGRSTFFSSGRGRGSPFPSAPILSPPS
        TS+RTR+QP +F ELHVLL++EESAL KQ++CDDS   PT LL+++ S      + NN F  R  G G+  G  GR +F +  RG G      P+     
Subjt:  TSLRTRAQPPSFAELHVLLKSEESALEKQNRCDDSSAPPTALLANAHSGAPRGQSTNNFFRGRSQGRGRTQGRGGRSTFFSSGRGRGSPFPSAPILSPPS

Query:  PTSGVGRVVCQICLRPGHSALDCYNNMNYSFQGRHPPAQLAAMVASHNSSQSTTNPVSSTWLTDSGCNAHITADLSNLSAASEYNGDEQVSIGSGQSLPI
                 CQIC R GH+ALDC+N MNY+FQGRHPP QLAAMVAS N+  +  + V+S+ LTDSGCN HIT+D++ +S A EYNG+EQV +G+GQ+ PI
Subjt:  PTSGVGRVVCQICLRPGHSALDCYNNMNYSFQGRHPPAQLAAMVASHNSSQSTTNPVSSTWLTDSGCNAHITADLSNLSAASEYNGDEQVSIGSGQSLPI

Query:  THQG
        +H G
Subjt:  THQG

XP_008448007.1 PREDICTED: uncharacterized protein LOC103490319 isoform X2 [Cucumis melo]1.1e-11457.78Show/hide
Query:  ASDLQKDLTSPIFLLSNICNLVSIRLDSSNFVLWKFQLTSILKAHKLFGFVDGSLPAPSKVLPRESSVDSATSSAAAAGSAPSLPNPQFDDWLAKDHALM
        +S  +KD  SPIFLLSNICNL+S+RLDS+NFVLWKFQLT+ILKAHKL+GF+DG+ P P    PR       T+++++  + P   NP ++DW+AKD ALM
Subjt:  ASDLQKDLTSPIFLLSNICNLVSIRLDSSNFVLWKFQLTSILKAHKLFGFVDGSLPAPSKVLPRESSVDSATSSAAAAGSAPSLPNPQFDDWLAKDHALM

Query:  TLINATLSPAALAYVVGCSTSKEVWDALEKHYSSTSRTNIVNLKSDLQSITKKSSESIDDYVKRIKELKDKLANVSSIVNDEDLFIYTLNGLPSDYNTFK
        T+INATLSP ALAYVVG ++SK+VWD L K YSS SR+N+VNLKSDLQ+I KK  ESID Y+KRIKE+KDKLANVS+ +N+EDL IY LNGLP++YNTF+
Subjt:  TLINATLSPAALAYVVGCSTSKEVWDALEKHYSSTSRTNIVNLKSDLQSITKKSSESIDDYVKRIKELKDKLANVSSIVNDEDLFIYTLNGLPSDYNTFK

Query:  TSLRTRAQPPSFAELHVLLKSEESALEKQNRCDDSSAPPTALLANAHSGAPRGQS-TNNFFRGRSQGRGRTQGRGGRSTFFSSGRGRGSPFPSAPILSPP
        TS+RTR+QP +F ELHVLL++EESAL KQ++ DDS   PT LL+++ S      +  NNF RG   G G+  G  GR +F +  RG GS        SP 
Subjt:  TSLRTRAQPPSFAELHVLLKSEESALEKQNRCDDSSAPPTALLANAHSGAPRGQS-TNNFFRGRSQGRGRTQGRGGRSTFFSSGRGRGSPFPSAPILSPP

Query:  SPTSGVGRVVCQICLRPGHSALDCYNNMNYSFQGRHPPAQLAAMVASHNSSQSTTNPVSSTWLTDSGCNAHITADLSNLSAASEYNGDEQVSIGSGQSLP
          +       CQIC R GH+ALDC+N MNY+FQGRHPP QLAAMVAS N+  +  + V+S+ LTDSGCN  IT+D++ +S A EYNG+EQV IG+GQ+ P
Subjt:  SPTSGVGRVVCQICLRPGHSALDCYNNMNYSFQGRHPPAQLAAMVASHNSSQSTTNPVSSTWLTDSGCNAHITADLSNLSAASEYNGDEQVSIGSGQSLP

Query:  ITHQG
        ++H G
Subjt:  ITHQG

XP_008448008.1 PREDICTED: uncharacterized protein LOC103490319 isoform X3 [Cucumis melo]1.1e-11457.78Show/hide
Query:  ASDLQKDLTSPIFLLSNICNLVSIRLDSSNFVLWKFQLTSILKAHKLFGFVDGSLPAPSKVLPRESSVDSATSSAAAAGSAPSLPNPQFDDWLAKDHALM
        +S  +KD  SPIFLLSNICNL+S+RLDS+NFVLWKFQLT+ILKAHKL+GF+DG+ P P    PR       T+++++  + P   NP ++DW+AKD ALM
Subjt:  ASDLQKDLTSPIFLLSNICNLVSIRLDSSNFVLWKFQLTSILKAHKLFGFVDGSLPAPSKVLPRESSVDSATSSAAAAGSAPSLPNPQFDDWLAKDHALM

Query:  TLINATLSPAALAYVVGCSTSKEVWDALEKHYSSTSRTNIVNLKSDLQSITKKSSESIDDYVKRIKELKDKLANVSSIVNDEDLFIYTLNGLPSDYNTFK
        T+INATLSP ALAYVVG ++SK+VWD L K YSS SR+N+VNLKSDLQ+I KK  ESID Y+KRIKE+KDKLANVS+ +N+EDL IY LNGLP++YNTF+
Subjt:  TLINATLSPAALAYVVGCSTSKEVWDALEKHYSSTSRTNIVNLKSDLQSITKKSSESIDDYVKRIKELKDKLANVSSIVNDEDLFIYTLNGLPSDYNTFK

Query:  TSLRTRAQPPSFAELHVLLKSEESALEKQNRCDDSSAPPTALLANAHSGAPRGQS-TNNFFRGRSQGRGRTQGRGGRSTFFSSGRGRGSPFPSAPILSPP
        TS+RTR+QP +F ELHVLL++EESAL KQ++ DDS   PT LL+++ S      +  NNF RG   G G+  G  GR +F +  RG GS        SP 
Subjt:  TSLRTRAQPPSFAELHVLLKSEESALEKQNRCDDSSAPPTALLANAHSGAPRGQS-TNNFFRGRSQGRGRTQGRGGRSTFFSSGRGRGSPFPSAPILSPP

Query:  SPTSGVGRVVCQICLRPGHSALDCYNNMNYSFQGRHPPAQLAAMVASHNSSQSTTNPVSSTWLTDSGCNAHITADLSNLSAASEYNGDEQVSIGSGQSLP
          +       CQIC R GH+ALDC+N MNY+FQGRHPP QLAAMVAS N+  +  + V+S+ LTDSGCN  IT+D++ +S A EYNG+EQV IG+GQ+ P
Subjt:  SPTSGVGRVVCQICLRPGHSALDCYNNMNYSFQGRHPPAQLAAMVASHNSSQSTTNPVSSTWLTDSGCNAHITADLSNLSAASEYNGDEQVSIGSGQSLP

Query:  ITHQG
        ++H G
Subjt:  ITHQG

XP_011658579.1 uncharacterized protein LOC105436058 [Cucumis sativus]5.9e-11657.67Show/hide
Query:  ASDLQKDLTSPIFLLSNICNLVSIRLDSSNFVLWKFQLTSILKAHKLFGFVDGSLPAPSKVLPRESSVDSATSSAAAAGSAPSLPNPQFDDWLAKDHALM
        +S  +KD  SPIFLLSNICNL+S+RLDS+NFVLWKFQLT+ILKAHKLFGFVDG+ P P              +S +   + P   NP ++DW+AKD ALM
Subjt:  ASDLQKDLTSPIFLLSNICNLVSIRLDSSNFVLWKFQLTSILKAHKLFGFVDGSLPAPSKVLPRESSVDSATSSAAAAGSAPSLPNPQFDDWLAKDHALM

Query:  TLINATLSPAALAYVVGCSTSKEVWDALEKHYSSTSRTNIVNLKSDLQSITKKSSESIDDYVKRIKELKDKLANVSSIVNDEDLFIYTLNGLPSDYNTFK
        T+INATLSP ALAYVVG ++SK+VWD L K YSS SR+N+VNLKSDLQ+I KK  ESID Y+KRIKE+KDKLANVS+ +N+EDL IY LNGLP++YNTF+
Subjt:  TLINATLSPAALAYVVGCSTSKEVWDALEKHYSSTSRTNIVNLKSDLQSITKKSSESIDDYVKRIKELKDKLANVSSIVNDEDLFIYTLNGLPSDYNTFK

Query:  TSLRTRAQPPSFAELHVLLKSEESALEKQNRCDDSSAPPTALLANAHSGAPRGQSTNNFFRGRSQGRGRTQGRGGRSTFFSSGRGRGSPFPSAPILSPPS
        TS+RTR+QP +F ELHVLL++EESAL KQ++CDDS   PT LL+++ S      + NN F  R  G G+  G  GR +F +  RG G      P+     
Subjt:  TSLRTRAQPPSFAELHVLLKSEESALEKQNRCDDSSAPPTALLANAHSGAPRGQSTNNFFRGRSQGRGRTQGRGGRSTFFSSGRGRGSPFPSAPILSPPS

Query:  PTSGVGRVVCQICLRPGHSALDCYNNMNYSFQGRHPPAQLAAMVASHNSSQSTTNPVSSTWLTDSGCNAHITADLSNLSAASEYNGDEQVSIGSGQSLPI
                 CQIC R GH+ALDC+N MNY+FQGRHPP QLAAMVAS N+  +  + V+S+ LTDSGCN HIT+D++ +S A EYNG+EQV +G+GQ+ PI
Subjt:  PTSGVGRVVCQICLRPGHSALDCYNNMNYSFQGRHPPAQLAAMVASHNSSQSTTNPVSSTWLTDSGCNAHITADLSNLSAASEYNGDEQVSIGSGQSLPI

Query:  THQG
        +H G
Subjt:  THQG

XP_022150845.1 uncharacterized protein LOC111018892 [Momordica charantia]4.1e-11759.05Show/hide
Query:  ASDLQKDLTSPIFLLSNICNLVSIRLDSSNFVLWKFQLTSILKAHKLFGFVDGSLPAPSKVLPRESSVDSATSSAAAAGSAPSLP--NPQFDDWLAKDHA
        +++ +KDL SPIFLLSNICNLVSIRLDS++F+LWKFQLT+ILKAHKLFGF+DGS+ APS+ L   S  +S  ++        SLP  NP F+DW+AKD A
Subjt:  ASDLQKDLTSPIFLLSNICNLVSIRLDSSNFVLWKFQLTSILKAHKLFGFVDGSLPAPSKVLPRESSVDSATSSAAAAGSAPSLP--NPQFDDWLAKDHA

Query:  LMTLINATLSPAALAYVVGCSTSKEVWDALEKHYSSTSRTNIVNLKSDLQSITKKSSESIDDYVKRIKELKDKLANVSSIVNDEDLFIYTLNGLPSDYNT
        LMTLINATLS  ALAYVV   TSK+VW+ LEKHYSS SRTN+VNLKSDLQSI KK+ ESID YVKRIKE+KDK ANVS  +NDE L IY LNGL ++YNT
Subjt:  LMTLINATLSPAALAYVVGCSTSKEVWDALEKHYSSTSRTNIVNLKSDLQSITKKSSESIDDYVKRIKELKDKLANVSSIVNDEDLFIYTLNGLPSDYNT

Query:  FKTSLRTRAQPPSFAELHVLLKSEESALEKQNRCDDSSAPPTALLANAHSGAPRGQSTNNFF---RGRSQGRGRTQGRGGRS---TFFSSGRGR--GSPF
          TS+RTRAQ  SF ELHV +KSEESA+EKQ + +D    P AL A+    +P+ Q+  + F   +   +GRG+  GRG  +   TF + GRGR  G+ F
Subjt:  FKTSLRTRAQPPSFAELHVLLKSEESALEKQNRCDDSSAPPTALLANAHSGAPRGQSTNNFF---RGRSQGRGRTQGRGGRS---TFFSSGRGR--GSPF

Query:  PSAPILSPPSPTSGVGRVVCQICLRPGHSALDCYNNMNYSFQGRHPPAQLAAMVASHNSSQ-STTNPVSSTWLTDSGCNAHITADLSNL---SAASEYNG
         S              R  CQIC + GH+ALDCYN MN+ FQGRHPP QLAAMVA  N+S  +  N   +TWL DS CN H+TADLSNL   S AS+YNG
Subjt:  PSAPILSPPSPTSGVGRVVCQICLRPGHSALDCYNNMNYSFQGRHPPAQLAAMVASHNSSQ-STTNPVSSTWLTDSGCNAHITADLSNL---SAASEYNG

Query:  DEQVSIGSGQSLPITHQGCG
        +E +S+GSGQS PITH GCG
Subjt:  DEQVSIGSGQSLPITHQGCG

TrEMBL top hitse value%identityAlignment
A0A1S3BI58 uncharacterized protein LOC103490319 isoform X25.3e-11557.78Show/hide
Query:  ASDLQKDLTSPIFLLSNICNLVSIRLDSSNFVLWKFQLTSILKAHKLFGFVDGSLPAPSKVLPRESSVDSATSSAAAAGSAPSLPNPQFDDWLAKDHALM
        +S  +KD  SPIFLLSNICNL+S+RLDS+NFVLWKFQLT+ILKAHKL+GF+DG+ P P    PR       T+++++  + P   NP ++DW+AKD ALM
Subjt:  ASDLQKDLTSPIFLLSNICNLVSIRLDSSNFVLWKFQLTSILKAHKLFGFVDGSLPAPSKVLPRESSVDSATSSAAAAGSAPSLPNPQFDDWLAKDHALM

Query:  TLINATLSPAALAYVVGCSTSKEVWDALEKHYSSTSRTNIVNLKSDLQSITKKSSESIDDYVKRIKELKDKLANVSSIVNDEDLFIYTLNGLPSDYNTFK
        T+INATLSP ALAYVVG ++SK+VWD L K YSS SR+N+VNLKSDLQ+I KK  ESID Y+KRIKE+KDKLANVS+ +N+EDL IY LNGLP++YNTF+
Subjt:  TLINATLSPAALAYVVGCSTSKEVWDALEKHYSSTSRTNIVNLKSDLQSITKKSSESIDDYVKRIKELKDKLANVSSIVNDEDLFIYTLNGLPSDYNTFK

Query:  TSLRTRAQPPSFAELHVLLKSEESALEKQNRCDDSSAPPTALLANAHSGAPRGQS-TNNFFRGRSQGRGRTQGRGGRSTFFSSGRGRGSPFPSAPILSPP
        TS+RTR+QP +F ELHVLL++EESAL KQ++ DDS   PT LL+++ S      +  NNF RG   G G+  G  GR +F +  RG GS        SP 
Subjt:  TSLRTRAQPPSFAELHVLLKSEESALEKQNRCDDSSAPPTALLANAHSGAPRGQS-TNNFFRGRSQGRGRTQGRGGRSTFFSSGRGRGSPFPSAPILSPP

Query:  SPTSGVGRVVCQICLRPGHSALDCYNNMNYSFQGRHPPAQLAAMVASHNSSQSTTNPVSSTWLTDSGCNAHITADLSNLSAASEYNGDEQVSIGSGQSLP
          +       CQIC R GH+ALDC+N MNY+FQGRHPP QLAAMVAS N+  +  + V+S+ LTDSGCN  IT+D++ +S A EYNG+EQV IG+GQ+ P
Subjt:  SPTSGVGRVVCQICLRPGHSALDCYNNMNYSFQGRHPPAQLAAMVASHNSSQSTTNPVSSTWLTDSGCNAHITADLSNLSAASEYNGDEQVSIGSGQSLP

Query:  ITHQG
        ++H G
Subjt:  ITHQG

A0A1S3BIR3 uncharacterized protein LOC103490319 isoform X35.3e-11557.78Show/hide
Query:  ASDLQKDLTSPIFLLSNICNLVSIRLDSSNFVLWKFQLTSILKAHKLFGFVDGSLPAPSKVLPRESSVDSATSSAAAAGSAPSLPNPQFDDWLAKDHALM
        +S  +KD  SPIFLLSNICNL+S+RLDS+NFVLWKFQLT+ILKAHKL+GF+DG+ P P    PR       T+++++  + P   NP ++DW+AKD ALM
Subjt:  ASDLQKDLTSPIFLLSNICNLVSIRLDSSNFVLWKFQLTSILKAHKLFGFVDGSLPAPSKVLPRESSVDSATSSAAAAGSAPSLPNPQFDDWLAKDHALM

Query:  TLINATLSPAALAYVVGCSTSKEVWDALEKHYSSTSRTNIVNLKSDLQSITKKSSESIDDYVKRIKELKDKLANVSSIVNDEDLFIYTLNGLPSDYNTFK
        T+INATLSP ALAYVVG ++SK+VWD L K YSS SR+N+VNLKSDLQ+I KK  ESID Y+KRIKE+KDKLANVS+ +N+EDL IY LNGLP++YNTF+
Subjt:  TLINATLSPAALAYVVGCSTSKEVWDALEKHYSSTSRTNIVNLKSDLQSITKKSSESIDDYVKRIKELKDKLANVSSIVNDEDLFIYTLNGLPSDYNTFK

Query:  TSLRTRAQPPSFAELHVLLKSEESALEKQNRCDDSSAPPTALLANAHSGAPRGQS-TNNFFRGRSQGRGRTQGRGGRSTFFSSGRGRGSPFPSAPILSPP
        TS+RTR+QP +F ELHVLL++EESAL KQ++ DDS   PT LL+++ S      +  NNF RG   G G+  G  GR +F +  RG GS        SP 
Subjt:  TSLRTRAQPPSFAELHVLLKSEESALEKQNRCDDSSAPPTALLANAHSGAPRGQS-TNNFFRGRSQGRGRTQGRGGRSTFFSSGRGRGSPFPSAPILSPP

Query:  SPTSGVGRVVCQICLRPGHSALDCYNNMNYSFQGRHPPAQLAAMVASHNSSQSTTNPVSSTWLTDSGCNAHITADLSNLSAASEYNGDEQVSIGSGQSLP
          +       CQIC R GH+ALDC+N MNY+FQGRHPP QLAAMVAS N+  +  + V+S+ LTDSGCN  IT+D++ +S A EYNG+EQV IG+GQ+ P
Subjt:  SPTSGVGRVVCQICLRPGHSALDCYNNMNYSFQGRHPPAQLAAMVASHNSSQSTTNPVSSTWLTDSGCNAHITADLSNLSAASEYNGDEQVSIGSGQSLP

Query:  ITHQG
        ++H G
Subjt:  ITHQG

A0A1S4DWT9 uncharacterized protein LOC103490319 isoform X15.3e-11557.78Show/hide
Query:  ASDLQKDLTSPIFLLSNICNLVSIRLDSSNFVLWKFQLTSILKAHKLFGFVDGSLPAPSKVLPRESSVDSATSSAAAAGSAPSLPNPQFDDWLAKDHALM
        +S  +KD  SPIFLLSNICNL+S+RLDS+NFVLWKFQLT+ILKAHKL+GF+DG+ P P    PR       T+++++  + P   NP ++DW+AKD ALM
Subjt:  ASDLQKDLTSPIFLLSNICNLVSIRLDSSNFVLWKFQLTSILKAHKLFGFVDGSLPAPSKVLPRESSVDSATSSAAAAGSAPSLPNPQFDDWLAKDHALM

Query:  TLINATLSPAALAYVVGCSTSKEVWDALEKHYSSTSRTNIVNLKSDLQSITKKSSESIDDYVKRIKELKDKLANVSSIVNDEDLFIYTLNGLPSDYNTFK
        T+INATLSP ALAYVVG ++SK+VWD L K YSS SR+N+VNLKSDLQ+I KK  ESID Y+KRIKE+KDKLANVS+ +N+EDL IY LNGLP++YNTF+
Subjt:  TLINATLSPAALAYVVGCSTSKEVWDALEKHYSSTSRTNIVNLKSDLQSITKKSSESIDDYVKRIKELKDKLANVSSIVNDEDLFIYTLNGLPSDYNTFK

Query:  TSLRTRAQPPSFAELHVLLKSEESALEKQNRCDDSSAPPTALLANAHSGAPRGQS-TNNFFRGRSQGRGRTQGRGGRSTFFSSGRGRGSPFPSAPILSPP
        TS+RTR+QP +F ELHVLL++EESAL KQ++ DDS   PT LL+++ S      +  NNF RG   G G+  G  GR +F +  RG GS        SP 
Subjt:  TSLRTRAQPPSFAELHVLLKSEESALEKQNRCDDSSAPPTALLANAHSGAPRGQS-TNNFFRGRSQGRGRTQGRGGRSTFFSSGRGRGSPFPSAPILSPP

Query:  SPTSGVGRVVCQICLRPGHSALDCYNNMNYSFQGRHPPAQLAAMVASHNSSQSTTNPVSSTWLTDSGCNAHITADLSNLSAASEYNGDEQVSIGSGQSLP
          +       CQIC R GH+ALDC+N MNY+FQGRHPP QLAAMVAS N+  +  + V+S+ LTDSGCN  IT+D++ +S A EYNG+EQV IG+GQ+ P
Subjt:  SPTSGVGRVVCQICLRPGHSALDCYNNMNYSFQGRHPPAQLAAMVASHNSSQSTTNPVSSTWLTDSGCNAHITADLSNLSAASEYNGDEQVSIGSGQSLP

Query:  ITHQG
        ++H G
Subjt:  ITHQG

A0A5D3CLI6 T4.52.7e-11457.82Show/hide
Query:  ASDLQKDLTSPIFLLSNICNLVSIRLDSSNFVLWKFQLTSILKAHKLFGFVDGSLPAPSKVLPRESSVDSATSSAAAAGSAPSLPNPQFDDWLAKDHALM
        +S  +KD  SPIFLLSNICNL+S+RLDS+NFVLWKFQLT+ILKAHKL+GF+DG+ P P    PR       T+++++  + P   NP ++DW+AKD ALM
Subjt:  ASDLQKDLTSPIFLLSNICNLVSIRLDSSNFVLWKFQLTSILKAHKLFGFVDGSLPAPSKVLPRESSVDSATSSAAAAGSAPSLPNPQFDDWLAKDHALM

Query:  TLINATLSPAALAYVVGCSTSKEVWDALEKHYSSTSRTNIVNLKSDLQSITKKSSESIDDYVKRIKELKDKLANVSSIVNDEDLFIYTLNGLPSDYNTFK
        T+INATLSP ALAYVVG ++SK+VWD L K YSS SR+N+VNLKSDLQ+I KK  ESID Y+KRIKE+KDKLANVS+ +N+EDL IY LNGLP++YNTF+
Subjt:  TLINATLSPAALAYVVGCSTSKEVWDALEKHYSSTSRTNIVNLKSDLQSITKKSSESIDDYVKRIKELKDKLANVSSIVNDEDLFIYTLNGLPSDYNTFK

Query:  TSLRTRAQPPSFAELHVLLKSEESALEKQNRCDDSSAPPTALLANAHSGAPRGQS-TNNFFRGRSQGRGRTQGRGGRSTFFSSGRGRGSPFPSAPILSPP
        TS+RTR+QP +F ELHVLL++EESAL KQ++ DDS   PT LL+++ S      +  NNF RG   G G+  G  GR +F +  RG GS        SP 
Subjt:  TSLRTRAQPPSFAELHVLLKSEESALEKQNRCDDSSAPPTALLANAHSGAPRGQS-TNNFFRGRSQGRGRTQGRGGRSTFFSSGRGRGSPFPSAPILSPP

Query:  SPTSGVGRVVCQICLRPGHSALDCYNNMNYSFQGRHPPAQLAAMVASHNSSQSTTNPVSSTWLTDSGCNAHITADLSNLSAASEYNGDEQVSIGSGQSLP
          +       CQIC R GH+ALDC+N MNY+FQGRHPP QLAAMVAS N+  +  + V+S+ LTDSGCN  IT+D++ +S A EYNG+EQV IG+GQ+ P
Subjt:  SPTSGVGRVVCQICLRPGHSALDCYNNMNYSFQGRHPPAQLAAMVASHNSSQSTTNPVSSTWLTDSGCNAHITADLSNLSAASEYNGDEQVSIGSGQSLP

Query:  ITH
        ++H
Subjt:  ITH

A0A6J1D9L6 uncharacterized protein LOC1110188922.0e-11759.05Show/hide
Query:  ASDLQKDLTSPIFLLSNICNLVSIRLDSSNFVLWKFQLTSILKAHKLFGFVDGSLPAPSKVLPRESSVDSATSSAAAAGSAPSLP--NPQFDDWLAKDHA
        +++ +KDL SPIFLLSNICNLVSIRLDS++F+LWKFQLT+ILKAHKLFGF+DGS+ APS+ L   S  +S  ++        SLP  NP F+DW+AKD A
Subjt:  ASDLQKDLTSPIFLLSNICNLVSIRLDSSNFVLWKFQLTSILKAHKLFGFVDGSLPAPSKVLPRESSVDSATSSAAAAGSAPSLP--NPQFDDWLAKDHA

Query:  LMTLINATLSPAALAYVVGCSTSKEVWDALEKHYSSTSRTNIVNLKSDLQSITKKSSESIDDYVKRIKELKDKLANVSSIVNDEDLFIYTLNGLPSDYNT
        LMTLINATLS  ALAYVV   TSK+VW+ LEKHYSS SRTN+VNLKSDLQSI KK+ ESID YVKRIKE+KDK ANVS  +NDE L IY LNGL ++YNT
Subjt:  LMTLINATLSPAALAYVVGCSTSKEVWDALEKHYSSTSRTNIVNLKSDLQSITKKSSESIDDYVKRIKELKDKLANVSSIVNDEDLFIYTLNGLPSDYNT

Query:  FKTSLRTRAQPPSFAELHVLLKSEESALEKQNRCDDSSAPPTALLANAHSGAPRGQSTNNFF---RGRSQGRGRTQGRGGRS---TFFSSGRGR--GSPF
          TS+RTRAQ  SF ELHV +KSEESA+EKQ + +D    P AL A+    +P+ Q+  + F   +   +GRG+  GRG  +   TF + GRGR  G+ F
Subjt:  FKTSLRTRAQPPSFAELHVLLKSEESALEKQNRCDDSSAPPTALLANAHSGAPRGQSTNNFF---RGRSQGRGRTQGRGGRS---TFFSSGRGR--GSPF

Query:  PSAPILSPPSPTSGVGRVVCQICLRPGHSALDCYNNMNYSFQGRHPPAQLAAMVASHNSSQ-STTNPVSSTWLTDSGCNAHITADLSNL---SAASEYNG
         S              R  CQIC + GH+ALDCYN MN+ FQGRHPP QLAAMVA  N+S  +  N   +TWL DS CN H+TADLSNL   S AS+YNG
Subjt:  PSAPILSPPSPTSGVGRVVCQICLRPGHSALDCYNNMNYSFQGRHPPAQLAAMVASHNSSQ-STTNPVSSTWLTDSGCNAHITADLSNL---SAASEYNG

Query:  DEQVSIGSGQSLPITHQGCG
        +E +S+GSGQS PITH GCG
Subjt:  DEQVSIGSGQSLPITHQGCG

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.3e-0523.47Show/hide
Query:  DDWLAKDHALMTLINATLSPAALAYVVGCSTSKEVWDALEKHYSSTSRTNIVNLKSDLQSITKKSSESIDDYVKRIKELKDKLANVSSIVNDEDLFIYTL
        +DW   D    + I   LS   +  ++   T++ +W  LE  Y S + TN + LK  L ++      +   ++     L  +LAN+   + +ED  I  L
Subjt:  DDWLAKDHALMTLINATLSPAALAYVVGCSTSKEVWDALEKHYSSTSRTNIVNLKSDLQSITKKSSESIDDYVKRIKELKDKLANVSSIVNDEDLFIYTL

Query:  NGLPSDYNTFKTSLRTRAQPPSFAELHVLLKSEESALEKQNRCDDSSAPPTALLANAHSGAPRGQSTNNFFRGRSQGRGRTQGRGGRSTFFSSGRGRGSP
        N LPS Y+   T++          +  + LK   SAL    +           L     G    +S+NN+  GRS  RG+++ R       S  R R   
Subjt:  NGLPSDYNTFKTSLRTRAQPPSFAELHVLLKSEESALEKQNRCDDSSAPPTALLANAHSGAPRGQSTNNFFRGRSQGRGRTQGRGGRSTFFSSGRGRGSP

Query:  FPSAPILSPPSPTSGVGRVVCQICLRPGHSALDCYNNMNYSFQ--GRHPPAQLAAMVASHNSSQSTTNP---------VSSTWLTDSGCNAHIT
                            C  C +PGH   DC N      +  G+      AAMV ++++     N            S W+ D+  + H T
Subjt:  FPSAPILSPPSPTSGVGRVVCQICLRPGHSALDCYNNMNYSFQ--GRHPPAQLAAMVASHNSSQSTTNP---------VSSTWLTDSGCNAHIT

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE17.6e-2625.83Show/hide
Query:  LTSPIFLLSNICNLVSIRLDSSNFVLWKFQLTSILKAHKLFGFVDGSLPAPSKVLPRESSVDSATSSAAAAGSAPSLPNPQFDDWLAKDHALMTLINATL
        L +   L  N+ N+   +L S+N+++W  Q+ ++   ++L GF+DGS   P   +                 +AP + NP +  W  +D  + + +   +
Subjt:  LTSPIFLLSNICNLVSIRLDSSNFVLWKFQLTSILKAHKLFGFVDGSLPAPSKVLPRESSVDSATSSAAAAGSAPSLPNPQFDDWLAKDHALMTLINATL

Query:  SPAALAYVVGCSTSKEVWDALEKHYSSTSRTNIVNLKSDLQSITKKSSESIDDYVKRIKELKDKLANVSSIVNDEDLFIYTLNGLPSDYNTFKTSLRTRA
        S +    V   +T+ ++W+ L K Y++ S  ++  L++ L+  T K +++IDDY++ +    D+LA +   ++ ++     L  LP +Y      +  + 
Subjt:  SPAALAYVVGCSTSKEVWDALEKHYSSTSRTNIVNLKSDLQSITKKSSESIDDYVKRIKELKDKLANVSSIVNDEDLFIYTLNGLPSDYNTFKTSLRTRA

Query:  QPPSFAELHVLLKSEESALEKQNRCDDSSAPPTALLANAHSGAPRGQSTNNFFRGRSQGR--GRTQGRGGRSTFFSSGRGRGSPFPSAPILSPPSPTSGV
         PP+  E+H  L + ES +        SSA    + ANA S      +TNN   G    R   R      +    SS     +   S P L         
Subjt:  QPPSFAELHVLLKSEESALEKQNRCDDSSAPPTALLANAHSGAPRGQSTNNFFRGRSQGR--GRTQGRGGRSTFFSSGRGRGSPFPSAPILSPPSPTSGV

Query:  GRVVCQICLRPGHSALDCYNNMNY--SFQGRHPPAQLAAMVASHNSSQSTTNPVSSTWLTDSGCNAHITADLSNLSAASEYNGDEQVSIGSGQSLPITHQ
            CQIC   GHSA  C    ++  S   + PP+         N +  +    S+ WL DSG   HIT+D +NLS    Y G + V +  G ++PI+H 
Subjt:  GRVVCQICLRPGHSALDCYNNMNY--SFQGRHPPAQLAAMVASHNSSQSTTNPVSSTWLTDSGCNAHITADLSNLSAASEYNGDEQVSIGSGQSLPITHQ

Query:  GCGTLNTPHSTLSLNTFVNAPH
        G  +L+T    L+L+  +  P+
Subjt:  GCGTLNTPHSTLSLNTFVNAPH

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE27.9e-2326.72Show/hide
Query:  RLDSSNFVLWKFQLTSILKAHKLFGFVDGSLPAPSKVLPRESSVDSATSSAAAAGSAPSLPNPQFDDWLAKDHALMTLINATLSPAALAYVVGCSTSKEV
        +L S+N+++W  Q+ ++   ++L GF+DGS P P                A     A    NP +  W  +D  + + I   +S +    V   +T+ ++
Subjt:  RLDSSNFVLWKFQLTSILKAHKLFGFVDGSLPAPSKVLPRESSVDSATSSAAAAGSAPSLPNPQFDDWLAKDHALMTLINATLSPAALAYVVGCSTSKEV

Query:  WDALEKHYSSTSRTNIVNLKSDLQSITKKSSESIDDYVKRIKELKDKLANVSSIVNDEDLFIYTLNGLPSDYNTFKTSLRTRAQPPSFAELHVLLKSEES
        W+ L K Y++ S  ++  L+                ++ R     D+LA +   ++ ++     L  LP DY      +  +  PPS  E+H  L + ES
Subjt:  WDALEKHYSSTSRTNIVNLKSDLQSITKKSSESIDDYVKRIKELKDKLANVSSIVNDEDLFIYTLNGLPSDYNTFKTSLRTRAQPPSFAELHVLLKSEES

Query:  ALEKQNRCDDSSAPPTALLANAHSGAPRGQSTNNFFRGRSQGRGRTQGRGGRSTFFSSGRGRGSPFPSAPILSPPSPTSGVGRVVCQICLRPGHSALDC-
         L   N     SA    + AN  +   R  +TN     R   R         +++  S  G  S            P   +GR  CQIC   GHSA  C 
Subjt:  ALEKQNRCDDSSAPPTALLANAHSGAPRGQSTNNFFRGRSQGRGRTQGRGGRSTFFSSGRGRGSPFPSAPILSPPSPTSGVGRVVCQICLRPGHSALDC-

Query:  ----YNNMNYSFQGRHP--PAQLAAMVASHNSSQSTTNPVSSTWLTDSGCNAHITADLSNLSAASEYNGDEQVSIGSGQSLPITHQGCGTLNTPHSTLSL
            + +     Q   P  P Q  A +A  NS  +  N     WL DSG   HIT+D +NLS    Y G + V I  G ++PITH G  +L T   +L L
Subjt:  ----YNNMNYSFQGRHP--PAQLAAMVASHNSSQSTTNPVSSTWLTDSGCNAHITADLSNLSAASEYNGDEQVSIGSGQSLPITHQGCGTLNTPHSTLSL

Query:  NTFVNAPH
        N  +  P+
Subjt:  NTFVNAPH

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).5.8e-0519.2Show/hide
Query:  DLTSPIFLLSNI-----CNLVSIRLDSSNFVLWKFQLTSILKAHKLFGFVDGSLPAPSKVLPRESSVDSATSSAAAAGSAPSLPNPQFDDWLAKDHALMT
        D  SP +L  +I      ++  +  D  N+V WK +  S L+  K FGF+DG+LP P                           +P +  W   +  +M 
Subjt:  DLTSPIFLLSNI-----CNLVSIRLDSSNFVLWKFQLTSILKAHKLFGFVDGSLPAPSKVLPRESSVDSATSSAAAAGSAPSLPNPQFDDWLAKDHALMT

Query:  LINATLSPAALAYVVGCSTSKEVWDALEKHYSSTSRTNIVNLKSDLQSITKKSSESIDDYVKRIKELKDKLANVSSI------------------VNDED
         +  +++   L  V+   T+ ++W+ L + +       I  L+  L ++ ++  +S+++Y  ++ ++  +L+  + I                    +++
Subjt:  LINATLSPAALAYVVGCSTSKEVWDALEKHYSSTSRTNIVNLKSDLQSITKKSSESIDDYVKRIKELKDKLANVSSI------------------VNDED

Query:  LFIYTLNGLP--SDYNTFKTSLRTRAQPPSFAELHVLLKSEESALEKQNR
             L GL     +    T +  +  PPS  E   ++K  ES ++  +R
Subjt:  LFIYTLNGLP--SDYNTFKTSLRTRAQPPSFAELHVLLKSEESALEKQNR

AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)1.4e-1125.7Show/hide
Query:  IFLLSNICNLVSIRLD--SSNFVLWKFQLTSILKAHKLFGFVDGSLPAPSKVLPRESSVDSATSSAAAAGSAPSLPNPQFDDWLAKDHALMTLINATLSP
        I+ +SNI + + + LD   SN+  W+    +   +  + G +DG+L      LP                      N    +W  +D  +   +  TL+P
Subjt:  IFLLSNICNLVSIRLD--SSNFVLWKFQLTSILKAHKLFGFVDGSLPAPSKVLPRESSVDSATSSAAAAGSAPSLPNPQFDDWLAKDHALMTLINATLSP

Query:  AAL-AYVVGCSTSKEVWDALEKHYSSTSRTNIVNLKSDLQSITKKSSE-SIDDYVKRIKELKDKLANVSSIVNDEDLFIYTLNGLPSDYNTFKTSLRTRA
               V  STS+++W  ++  + +      + L S+L+  TK   +  + DY +++K+L D L NV   V D +L +Y LNGL   ++     ++ R 
Subjt:  AAL-AYVVGCSTSKEVWDALEKHYSSTSRTNIVNLKSDLQSITKKSSE-SIDDYVKRIKELKDKLANVSSIVNDEDLFIYTLNGLPSDYNTFKTSLRTRA

Query:  QPPSFAELHVLLKSEESALEKQNRCD----DSSAPPTALLANAHSGAP------RGQSTNNFFRGRSQGRGRTQGRGGRSTFFS
          PSF +   +L+ EE  L++  + +    D S+  T L   A S AP      R       +RGR +G    +GRGGR ++++
Subjt:  QPPSFAELHVLLKSEESALEKQNRCD----DSSAPPTALLANAHSGAP------RGQSTNNFFRGRSQGRGRTQGRGGRSTFFS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCTGATTTACAGAAAGATCTTACCTCACCTATTTTTCTTCTTTCTAATATCTGCAACCTGGTCTCGATTCGGCTTGACTCCTCTAATTTTGTTCTCTGGAAGTT
CCAATTAACGTCAATTCTGAAAGCCCACAAGCTCTTCGGTTTTGTCGATGGCTCCTTGCCGGCCCCTTCGAAAGTTCTTCCACGAGAATCGTCGGTGGATTCTGCTACTT
CCTCGGCTGCTGCTGCTGGTTCCGCTCCATCTCTGCCGAATCCTCAGTTCGATGACTGGCTTGCTAAAGACCATGCCCTTATGACTCTCATCAATGCCACCTTATCACCG
GCAGCTCTCGCATATGTGGTTGGTTGCTCAACTTCAAAGGAGGTATGGGATGCCCTTGAGAAACATTATTCCTCAACTTCTCGAACCAATATTGTTAATCTAAAGTCTGA
TTTACAATCTATTACTAAGAAATCCTCTGAATCCATTGATGACTATGTTAAGCGTATCAAGGAACTCAAAGATAAGTTGGCAAATGTTTCATCTATTGTGAATGATGAAG
ATCTGTTTATATATACTTTGAATGGCCTACCATCTGATTACAATACTTTCAAAACCTCCTTGCGTACTCGAGCTCAACCTCCTTCTTTTGCTGAACTACATGTTTTACTA
AAATCTGAGGAATCAGCTCTTGAAAAGCAAAATCGATGTGATGATTCTTCTGCACCCCCCACTGCCTTACTTGCCAATGCCCATTCTGGGGCTCCTCGTGGTCAATCAAC
CAATAATTTCTTCAGAGGTCGATCTCAGGGCAGAGGCCGCACTCAGGGTCGTGGTGGTCGATCCACCTTTTTCTCTTCAGGTCGTGGTCGAGGTTCCCCTTTTCCTTCTG
CTCCAATTTTGTCTCCTCCGTCACCTACTTCAGGCGTTGGTCGGGTCGTCTGCCAAATATGTCTTCGCCCTGGTCATTCAGCCTTGGATTGCTATAATAATATGAACTAT
AGCTTTCAAGGCCGACACCCTCCTGCACAGCTTGCTGCCATGGTTGCTTCTCACAATTCCTCTCAATCTACTACTAATCCAGTTTCTTCAACTTGGCTGACAGATTCTGG
GTGTAATGCTCACATTACGGCTGATTTATCAAACTTATCTGCTGCTTCTGAGTATAATGGGGATGAACAAGTTTCGATTGGTAGTGGTCAGTCACTCCCTATAACACATC
AAGGCTGTGGTACTCTTAATACACCCCATTCTACCCTTTCTCTTAACACATTTGTGAATGCCCCACATTTTTCGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCGTCTGATTTACAGAAAGATCTTACCTCACCTATTTTTCTTCTTTCTAATATCTGCAACCTGGTCTCGATTCGGCTTGACTCCTCTAATTTTGTTCTCTGGAAGTT
CCAATTAACGTCAATTCTGAAAGCCCACAAGCTCTTCGGTTTTGTCGATGGCTCCTTGCCGGCCCCTTCGAAAGTTCTTCCACGAGAATCGTCGGTGGATTCTGCTACTT
CCTCGGCTGCTGCTGCTGGTTCCGCTCCATCTCTGCCGAATCCTCAGTTCGATGACTGGCTTGCTAAAGACCATGCCCTTATGACTCTCATCAATGCCACCTTATCACCG
GCAGCTCTCGCATATGTGGTTGGTTGCTCAACTTCAAAGGAGGTATGGGATGCCCTTGAGAAACATTATTCCTCAACTTCTCGAACCAATATTGTTAATCTAAAGTCTGA
TTTACAATCTATTACTAAGAAATCCTCTGAATCCATTGATGACTATGTTAAGCGTATCAAGGAACTCAAAGATAAGTTGGCAAATGTTTCATCTATTGTGAATGATGAAG
ATCTGTTTATATATACTTTGAATGGCCTACCATCTGATTACAATACTTTCAAAACCTCCTTGCGTACTCGAGCTCAACCTCCTTCTTTTGCTGAACTACATGTTTTACTA
AAATCTGAGGAATCAGCTCTTGAAAAGCAAAATCGATGTGATGATTCTTCTGCACCCCCCACTGCCTTACTTGCCAATGCCCATTCTGGGGCTCCTCGTGGTCAATCAAC
CAATAATTTCTTCAGAGGTCGATCTCAGGGCAGAGGCCGCACTCAGGGTCGTGGTGGTCGATCCACCTTTTTCTCTTCAGGTCGTGGTCGAGGTTCCCCTTTTCCTTCTG
CTCCAATTTTGTCTCCTCCGTCACCTACTTCAGGCGTTGGTCGGGTCGTCTGCCAAATATGTCTTCGCCCTGGTCATTCAGCCTTGGATTGCTATAATAATATGAACTAT
AGCTTTCAAGGCCGACACCCTCCTGCACAGCTTGCTGCCATGGTTGCTTCTCACAATTCCTCTCAATCTACTACTAATCCAGTTTCTTCAACTTGGCTGACAGATTCTGG
GTGTAATGCTCACATTACGGCTGATTTATCAAACTTATCTGCTGCTTCTGAGTATAATGGGGATGAACAAGTTTCGATTGGTAGTGGTCAGTCACTCCCTATAACACATC
AAGGCTGTGGTACTCTTAATACACCCCATTCTACCCTTTCTCTTAACACATTTGTGAATGCCCCACATTTTTCGTAG
Protein sequenceShow/hide protein sequence
MASDLQKDLTSPIFLLSNICNLVSIRLDSSNFVLWKFQLTSILKAHKLFGFVDGSLPAPSKVLPRESSVDSATSSAAAAGSAPSLPNPQFDDWLAKDHALMTLINATLSP
AALAYVVGCSTSKEVWDALEKHYSSTSRTNIVNLKSDLQSITKKSSESIDDYVKRIKELKDKLANVSSIVNDEDLFIYTLNGLPSDYNTFKTSLRTRAQPPSFAELHVLL
KSEESALEKQNRCDDSSAPPTALLANAHSGAPRGQSTNNFFRGRSQGRGRTQGRGGRSTFFSSGRGRGSPFPSAPILSPPSPTSGVGRVVCQICLRPGHSALDCYNNMNY
SFQGRHPPAQLAAMVASHNSSQSTTNPVSSTWLTDSGCNAHITADLSNLSAASEYNGDEQVSIGSGQSLPITHQGCGTLNTPHSTLSLNTFVNAPHFS