; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022098 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022098
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr7:18331557..18341317
RNA-Seq ExpressionLag0022098
SyntenyLag0022098
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004518 - nuclease activity (molecular function)
GO:0008233 - peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035073.1 uncharacterized protein E6C27_scaffold57G001380 [Cucumis melo var. makuwa]1.6e-13279.81Show/hide
Query:  HAVFRAKLASGPGGGVMPPRTSRQRRQNQDGTQDPTQSQSERGSSTPRGQNEAGSDQFARSAQEIGRPERAGPSDLEKTYGIERLKKLGATVFEGSTDPA
        HA+FR+K+A     GVMPPRT RQRRQNQDG Q PTQ  S   SST   +  AG++QFAR+ QEIGR +RA PSD EK YGIERLKKLGATVFEGSTD A
Subjt:  HAVFRAKLASGPGGGVMPPRTSRQRRQNQDGTQDPTQSQSERGSSTPRGQNEAGSDQFARSAQEIGRPERAGPSDLEKTYGIERLKKLGATVFEGSTDPA

Query:  DVEVWLNMLEKCFDVMSCPEERKVKLATFLLQKEAEGWWKSIIARRSDARTLDWQTFRGIFEEKYYPATYCEAKRDEFLELKQGSLSVAEYERKYTELSR
        D E WLNMLEKCFDVM+CPEERKV+LATFLLQKEA+GWWKSI+ARRSDAR LDWQTFRGIFE+KYYP+TYCEAKRDEFL LKQGSLSVAEYERKYTELSR
Subjt:  DVEVWLNMLEKCFDVMSCPEERKVKLATFLLQKEAEGWWKSIIARRSDARTLDWQTFRGIFEEKYYPATYCEAKRDEFLELKQGSLSVAEYERKYTELSR

Query:  YAEVIVASESDRCRRFERGLRFEIRTPVTAIAKWTDFSQLVETALRVEQSIIEERSVAEPSRGAPTASGFRGREQRRFAPGVNVSGRQDFKRRSGGQLSR
        YA+VI+ASESDRCRRFERGLRFEIRTPVTAIAKWT+FSQLVETALRVEQSI EE+S  E SRG  TASGFRGREQRRF PG+N S RQDFK RSGGQ SR
Subjt:  YAEVIVASESDRCRRFERGLRFEIRTPVTAIAKWTDFSQLVETALRVEQSIIEERSVAEPSRGAPTASGFRGREQRRFAPGVNVSGRQDFKRRSGGQLSR

Query:  QMSSGSAYQRQSQRAPS
         +S GS +QRQSQR PS
Subjt:  QMSSGSAYQRQSQRAPS

KAA0037581.1 reverse transcriptase [Cucumis melo var. makuwa]2.6e-13570.05Show/hide
Query:  VEIELPVPDTLPTSAESSGSSLS-----YFQTEICHGDTSCILICFRGFRCVVVHMEYFISMIRSTGIVRGDDVCWFHAVFRAKLASGPGGGVMPPRTSR
        VEIELPVPD LPTSAESS S+ S     YF++         IL   R   C+          + S  IVRGDDVCW HA+FRAK A GPGGGV       
Subjt:  VEIELPVPDTLPTSAESSGSSLS-----YFQTEICHGDTSCILICFRGFRCVVVHMEYFISMIRSTGIVRGDDVCWFHAVFRAKLASGPGGGVMPPRTSR

Query:  QRRQNQDGTQDPTQSQSERGSSTPRGQNEAGSDQFARSAQEIGRPERAGPSDLEKTYGIERLKKLGATVFEGSTDPADVEVWLNMLEKCFDVMSCPEERK
                T   +Q       +   G+   G     +   EIGRPE+AGPSDLEK YGIERLKKLGATVFEGSTDPAD EVWLNMLEKCFDVMSCP+ERK
Subjt:  QRRQNQDGTQDPTQSQSERGSSTPRGQNEAGSDQFARSAQEIGRPERAGPSDLEKTYGIERLKKLGATVFEGSTDPADVEVWLNMLEKCFDVMSCPEERK

Query:  VKLATFLLQKEAEGWWKSIIARRSDARTLDWQTFRGIFEEKYYPATYCEAKRDEFLELKQGSLSVAEYERKYTELSRYAEVIVASESDRCRRFERGLRFE
        VKLATFLL KEAEGWWKSIIARR+DARTLDWQTFRGIFEEKYYP TYCEAKRDEFLELKQ SLSVA+YERKYTELSRYAE+IVASESDRC RFERGLRFE
Subjt:  VKLATFLLQKEAEGWWKSIIARRSDARTLDWQTFRGIFEEKYYPATYCEAKRDEFLELKQGSLSVAEYERKYTELSRYAEVIVASESDRCRRFERGLRFE

Query:  IRTPVTAIAKWTDFSQLVETALRVEQSIIEERSVAEPSRGAPTASGFRGREQRRFAPGVNVSGRQDFKRRSGGQLSRQMSSGSAYQRQSQRAPS
        IRTPVTAIAKW +FSQLVETALRV+QSI+EE+S  E SRG  T SG RGREQRRF PGVNVSG QDFKRRSGG+  RQMSSGSAYQRQS+RA S
Subjt:  IRTPVTAIAKWTDFSQLVETALRVEQSIIEERSVAEPSRGAPTASGFRGREQRRFAPGVNVSGRQDFKRRSGGQLSRQMSSGSAYQRQSQRAPS

KAA0056684.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]2.9e-13466.1Show/hide
Query:  KLVAIYLHHVYVGTNRHYSLDVEIELPVPDTLPTSAESSGSSLSYFQTEICHGDTSCILICFRGFRCVVVHMEYFISMIRSTGIVRGDDVCWFHAVFRAK
        + + + +  + +   R+    VEI+ PVPD LP           Y  +++  G  + +       R      E F    +    V+ + V   HA+FR+K
Subjt:  KLVAIYLHHVYVGTNRHYSLDVEIELPVPDTLPTSAESSGSSLSYFQTEICHGDTSCILICFRGFRCVVVHMEYFISMIRSTGIVRGDDVCWFHAVFRAK

Query:  LASGPGGGVMPPRTSRQRRQNQDGTQDPTQSQSERGSSTPRGQNEAGSDQFARSAQEIGRPERAGPSDLEKTYGIERLKKLGATVFEGSTDPADVEVWLN
        +A     GVMPPRT R+RRQNQDG Q PTQ  S   SST   +  AG++QFAR+ QEIGR +RA PSD EK YGIERLKKLGATVFEGSTDPAD E WLN
Subjt:  LASGPGGGVMPPRTSRQRRQNQDGTQDPTQSQSERGSSTPRGQNEAGSDQFARSAQEIGRPERAGPSDLEKTYGIERLKKLGATVFEGSTDPADVEVWLN

Query:  MLEKCFDVMSCPEERKVKLATFLLQKEAEGWWKSIIARRSDARTLDWQTFRGIFEEKYYPATYCEAKRDEFLELKQGSLSVAEYERKYTELSRYAEVIVA
        MLEKCFDVM+CPEERKV+LATFLLQKEAEGWWKSI+ARRSDAR LDWQTFRGIFE+KYYP+TYCEAKRDEFL LKQGSLSVAEYERKYTELSRYA+VI+A
Subjt:  MLEKCFDVMSCPEERKVKLATFLLQKEAEGWWKSIIARRSDARTLDWQTFRGIFEEKYYPATYCEAKRDEFLELKQGSLSVAEYERKYTELSRYAEVIVA

Query:  SESDRCRRFERGLRFEIRTPVTAIAKWTDFSQLVETALRVEQSIIEERSVAEPSRGAPTASGFRGREQRRFAPGVNVSGRQDFKRRSGGQLSRQMSSGSA
        SESDRCRRFERGLRFEIRTPVTAIAKWT+FSQLVETALRVEQSI EE+S  E SRG  TASGFRGREQRRF PG+N+S RQDFK RSGGQ SR +S GS 
Subjt:  SESDRCRRFERGLRFEIRTPVTAIAKWTDFSQLVETALRVEQSIIEERSVAEPSRGAPTASGFRGREQRRFAPGVNVSGRQDFKRRSGGQLSRQMSSGSA

Query:  YQRQSQRAPS
        +QRQSQR PS
Subjt:  YQRQSQRAPS

TYK03091.1 reverse transcriptase [Cucumis melo var. makuwa]4.1e-13670.3Show/hide
Query:  VEIELPVPDTLPTSAESSGSSLS-----YFQTEICHGDTSCILICFRGFRCVVVHMEYFISMIRSTGIVRGDDVCWFHAVFRAKLASGPGGGVMPPRTSR
        VEIELPVPD LPTSAESS S+ S     YF++         IL   R   C+          + S  IVRGDDVCW HA+FRAK A GPGGGV       
Subjt:  VEIELPVPDTLPTSAESSGSSLS-----YFQTEICHGDTSCILICFRGFRCVVVHMEYFISMIRSTGIVRGDDVCWFHAVFRAKLASGPGGGVMPPRTSR

Query:  QRRQNQDGTQDPTQSQSERGSSTPRGQNEAGSDQFARSAQEIGRPERAGPSDLEKTYGIERLKKLGATVFEGSTDPADVEVWLNMLEKCFDVMSCPEERK
                T   +Q       +   G+   G     +   EIGRPE+AGPSDLEK YGIERLKKLGATVFEGSTDPAD EVWLNMLEKCFDVMSCP+ERK
Subjt:  QRRQNQDGTQDPTQSQSERGSSTPRGQNEAGSDQFARSAQEIGRPERAGPSDLEKTYGIERLKKLGATVFEGSTDPADVEVWLNMLEKCFDVMSCPEERK

Query:  VKLATFLLQKEAEGWWKSIIARRSDARTLDWQTFRGIFEEKYYPATYCEAKRDEFLELKQGSLSVAEYERKYTELSRYAEVIVASESDRCRRFERGLRFE
        VKLATFLLQKEAEGWWKSIIARR+DARTLDWQTFRGIFEEKYYP TYCEAKRDEFLELKQ SLSVA+YERKYTELSRYAE+IVASESDRC RFERGLRFE
Subjt:  VKLATFLLQKEAEGWWKSIIARRSDARTLDWQTFRGIFEEKYYPATYCEAKRDEFLELKQGSLSVAEYERKYTELSRYAEVIVASESDRCRRFERGLRFE

Query:  IRTPVTAIAKWTDFSQLVETALRVEQSIIEERSVAEPSRGAPTASGFRGREQRRFAPGVNVSGRQDFKRRSGGQLSRQMSSGSAYQRQSQRAPS
        IRTPVTAIAKW +FSQLVETALRV+QSI+EE+S  E SRG  T SG RGREQRRF PGVNVSG QDFKRRSGG+  RQMSSGSAYQRQS+RA S
Subjt:  IRTPVTAIAKWTDFSQLVETALRVEQSIIEERSVAEPSRGAPTASGFRGREQRRFAPGVNVSGRQDFKRRSGGQLSRQMSSGSAYQRQSQRAPS

TYK14494.1 uncharacterized protein E5676_scaffold15G00050 [Cucumis melo var. makuwa]1.1e-14975.31Show/hide
Query:  VEIELPVPDTLPTSAESSGSSLSYFQTEICHGDTSCILICFRGFRCVVVHMEYFISMIRSTGIVRGDDVCWFHAVFRAKLASGPGGG--------VMPPR
        VEIELPV DTLPTSAESS S+                                     RS GIVR DDVCW H+VFRAK ASGPGGG        VMP R
Subjt:  VEIELPVPDTLPTSAESSGSSLSYFQTEICHGDTSCILICFRGFRCVVVHMEYFISMIRSTGIVRGDDVCWFHAVFRAKLASGPGGG--------VMPPR

Query:  TSRQRRQNQDGTQDPTQSQSERGSSTPRGQNEAGSDQFARSAQEIGRPERAGPSDLEKTYGIERLKKLGATVFEGSTDPADVEVWLNMLEKCFDVMSCPE
        TSR+RRQNQDG Q PTQ QSERGSS PR QNE GS++FARSAQEIG PER  PSD EK Y IERLKKLGATVFEGSTD AD EVWLNMLEKCFDVMSCP+
Subjt:  TSRQRRQNQDGTQDPTQSQSERGSSTPRGQNEAGSDQFARSAQEIGRPERAGPSDLEKTYGIERLKKLGATVFEGSTDPADVEVWLNMLEKCFDVMSCPE

Query:  ERKVKLATFLLQKEAEGWWKSIIARRSDARTLDWQTFRGIFEEKYYPATYCEAKRDEFLELKQGSLSVAEYERKYTELSRYAEVIVASESDRCRRFERGL
        ERKV+LATFLLQKEAEGWWKSIIARR+DA TLD QTFRGIF EKYYP TYCEAKRDEFLELKQGSL VAEYERKYTELSRY E+IVASESDRCRRFERGL
Subjt:  ERKVKLATFLLQKEAEGWWKSIIARRSDARTLDWQTFRGIFEEKYYPATYCEAKRDEFLELKQGSLSVAEYERKYTELSRYAEVIVASESDRCRRFERGL

Query:  RFEIRTPVTAIAKWTDFSQLVETALRVEQSIIEERSVAEPSRGAPTASGFRGREQRRFAPGVNVSGRQDFKRRSGGQLSRQMSSGSAYQRQSQRAPS
        RFEI TPVTAIAKWT+FSQLVETALRVEQSI+EE+SV E SRG  T SG RGREQ RF PGVNVSG QDFKRRSGG+  RQMSSGSAYQRQSQRA S
Subjt:  RFEIRTPVTAIAKWTDFSQLVETALRVEQSIIEERSVAEPSRGAPTASGFRGREQRRFAPGVNVSGRQDFKRRSGGQLSRQMSSGSAYQRQSQRAPS

TrEMBL top hitse value%identityAlignment
A0A5A7SX02 CCHC-type domain-containing protein7.8e-13379.81Show/hide
Query:  HAVFRAKLASGPGGGVMPPRTSRQRRQNQDGTQDPTQSQSERGSSTPRGQNEAGSDQFARSAQEIGRPERAGPSDLEKTYGIERLKKLGATVFEGSTDPA
        HA+FR+K+A     GVMPPRT RQRRQNQDG Q PTQ  S   SST   +  AG++QFAR+ QEIGR +RA PSD EK YGIERLKKLGATVFEGSTD A
Subjt:  HAVFRAKLASGPGGGVMPPRTSRQRRQNQDGTQDPTQSQSERGSSTPRGQNEAGSDQFARSAQEIGRPERAGPSDLEKTYGIERLKKLGATVFEGSTDPA

Query:  DVEVWLNMLEKCFDVMSCPEERKVKLATFLLQKEAEGWWKSIIARRSDARTLDWQTFRGIFEEKYYPATYCEAKRDEFLELKQGSLSVAEYERKYTELSR
        D E WLNMLEKCFDVM+CPEERKV+LATFLLQKEA+GWWKSI+ARRSDAR LDWQTFRGIFE+KYYP+TYCEAKRDEFL LKQGSLSVAEYERKYTELSR
Subjt:  DVEVWLNMLEKCFDVMSCPEERKVKLATFLLQKEAEGWWKSIIARRSDARTLDWQTFRGIFEEKYYPATYCEAKRDEFLELKQGSLSVAEYERKYTELSR

Query:  YAEVIVASESDRCRRFERGLRFEIRTPVTAIAKWTDFSQLVETALRVEQSIIEERSVAEPSRGAPTASGFRGREQRRFAPGVNVSGRQDFKRRSGGQLSR
        YA+VI+ASESDRCRRFERGLRFEIRTPVTAIAKWT+FSQLVETALRVEQSI EE+S  E SRG  TASGFRGREQRRF PG+N S RQDFK RSGGQ SR
Subjt:  YAEVIVASESDRCRRFERGLRFEIRTPVTAIAKWTDFSQLVETALRVEQSIIEERSVAEPSRGAPTASGFRGREQRRFAPGVNVSGRQDFKRRSGGQLSR

Query:  QMSSGSAYQRQSQRAPS
         +S GS +QRQSQR PS
Subjt:  QMSSGSAYQRQSQRAPS

A0A5A7T7M6 Reverse transcriptase1.3e-13570.05Show/hide
Query:  VEIELPVPDTLPTSAESSGSSLS-----YFQTEICHGDTSCILICFRGFRCVVVHMEYFISMIRSTGIVRGDDVCWFHAVFRAKLASGPGGGVMPPRTSR
        VEIELPVPD LPTSAESS S+ S     YF++         IL   R   C+          + S  IVRGDDVCW HA+FRAK A GPGGGV       
Subjt:  VEIELPVPDTLPTSAESSGSSLS-----YFQTEICHGDTSCILICFRGFRCVVVHMEYFISMIRSTGIVRGDDVCWFHAVFRAKLASGPGGGVMPPRTSR

Query:  QRRQNQDGTQDPTQSQSERGSSTPRGQNEAGSDQFARSAQEIGRPERAGPSDLEKTYGIERLKKLGATVFEGSTDPADVEVWLNMLEKCFDVMSCPEERK
                T   +Q       +   G+   G     +   EIGRPE+AGPSDLEK YGIERLKKLGATVFEGSTDPAD EVWLNMLEKCFDVMSCP+ERK
Subjt:  QRRQNQDGTQDPTQSQSERGSSTPRGQNEAGSDQFARSAQEIGRPERAGPSDLEKTYGIERLKKLGATVFEGSTDPADVEVWLNMLEKCFDVMSCPEERK

Query:  VKLATFLLQKEAEGWWKSIIARRSDARTLDWQTFRGIFEEKYYPATYCEAKRDEFLELKQGSLSVAEYERKYTELSRYAEVIVASESDRCRRFERGLRFE
        VKLATFLL KEAEGWWKSIIARR+DARTLDWQTFRGIFEEKYYP TYCEAKRDEFLELKQ SLSVA+YERKYTELSRYAE+IVASESDRC RFERGLRFE
Subjt:  VKLATFLLQKEAEGWWKSIIARRSDARTLDWQTFRGIFEEKYYPATYCEAKRDEFLELKQGSLSVAEYERKYTELSRYAEVIVASESDRCRRFERGLRFE

Query:  IRTPVTAIAKWTDFSQLVETALRVEQSIIEERSVAEPSRGAPTASGFRGREQRRFAPGVNVSGRQDFKRRSGGQLSRQMSSGSAYQRQSQRAPS
        IRTPVTAIAKW +FSQLVETALRV+QSI+EE+S  E SRG  T SG RGREQRRF PGVNVSG QDFKRRSGG+  RQMSSGSAYQRQS+RA S
Subjt:  IRTPVTAIAKWTDFSQLVETALRVEQSIIEERSVAEPSRGAPTASGFRGREQRRFAPGVNVSGRQDFKRRSGGQLSRQMSSGSAYQRQSQRAPS

A0A5A7UNA3 Reverse transcriptase1.4e-13466.1Show/hide
Query:  KLVAIYLHHVYVGTNRHYSLDVEIELPVPDTLPTSAESSGSSLSYFQTEICHGDTSCILICFRGFRCVVVHMEYFISMIRSTGIVRGDDVCWFHAVFRAK
        + + + +  + +   R+    VEI+ PVPD LP           Y  +++  G  + +       R      E F    +    V+ + V   HA+FR+K
Subjt:  KLVAIYLHHVYVGTNRHYSLDVEIELPVPDTLPTSAESSGSSLSYFQTEICHGDTSCILICFRGFRCVVVHMEYFISMIRSTGIVRGDDVCWFHAVFRAK

Query:  LASGPGGGVMPPRTSRQRRQNQDGTQDPTQSQSERGSSTPRGQNEAGSDQFARSAQEIGRPERAGPSDLEKTYGIERLKKLGATVFEGSTDPADVEVWLN
        +A     GVMPPRT R+RRQNQDG Q PTQ  S   SST   +  AG++QFAR+ QEIGR +RA PSD EK YGIERLKKLGATVFEGSTDPAD E WLN
Subjt:  LASGPGGGVMPPRTSRQRRQNQDGTQDPTQSQSERGSSTPRGQNEAGSDQFARSAQEIGRPERAGPSDLEKTYGIERLKKLGATVFEGSTDPADVEVWLN

Query:  MLEKCFDVMSCPEERKVKLATFLLQKEAEGWWKSIIARRSDARTLDWQTFRGIFEEKYYPATYCEAKRDEFLELKQGSLSVAEYERKYTELSRYAEVIVA
        MLEKCFDVM+CPEERKV+LATFLLQKEAEGWWKSI+ARRSDAR LDWQTFRGIFE+KYYP+TYCEAKRDEFL LKQGSLSVAEYERKYTELSRYA+VI+A
Subjt:  MLEKCFDVMSCPEERKVKLATFLLQKEAEGWWKSIIARRSDARTLDWQTFRGIFEEKYYPATYCEAKRDEFLELKQGSLSVAEYERKYTELSRYAEVIVA

Query:  SESDRCRRFERGLRFEIRTPVTAIAKWTDFSQLVETALRVEQSIIEERSVAEPSRGAPTASGFRGREQRRFAPGVNVSGRQDFKRRSGGQLSRQMSSGSA
        SESDRCRRFERGLRFEIRTPVTAIAKWT+FSQLVETALRVEQSI EE+S  E SRG  TASGFRGREQRRF PG+N+S RQDFK RSGGQ SR +S GS 
Subjt:  SESDRCRRFERGLRFEIRTPVTAIAKWTDFSQLVETALRVEQSIIEERSVAEPSRGAPTASGFRGREQRRFAPGVNVSGRQDFKRRSGGQLSRQMSSGSA

Query:  YQRQSQRAPS
        +QRQSQR PS
Subjt:  YQRQSQRAPS

A0A5D3BTP3 Reverse transcriptase2.0e-13670.3Show/hide
Query:  VEIELPVPDTLPTSAESSGSSLS-----YFQTEICHGDTSCILICFRGFRCVVVHMEYFISMIRSTGIVRGDDVCWFHAVFRAKLASGPGGGVMPPRTSR
        VEIELPVPD LPTSAESS S+ S     YF++         IL   R   C+          + S  IVRGDDVCW HA+FRAK A GPGGGV       
Subjt:  VEIELPVPDTLPTSAESSGSSLS-----YFQTEICHGDTSCILICFRGFRCVVVHMEYFISMIRSTGIVRGDDVCWFHAVFRAKLASGPGGGVMPPRTSR

Query:  QRRQNQDGTQDPTQSQSERGSSTPRGQNEAGSDQFARSAQEIGRPERAGPSDLEKTYGIERLKKLGATVFEGSTDPADVEVWLNMLEKCFDVMSCPEERK
                T   +Q       +   G+   G     +   EIGRPE+AGPSDLEK YGIERLKKLGATVFEGSTDPAD EVWLNMLEKCFDVMSCP+ERK
Subjt:  QRRQNQDGTQDPTQSQSERGSSTPRGQNEAGSDQFARSAQEIGRPERAGPSDLEKTYGIERLKKLGATVFEGSTDPADVEVWLNMLEKCFDVMSCPEERK

Query:  VKLATFLLQKEAEGWWKSIIARRSDARTLDWQTFRGIFEEKYYPATYCEAKRDEFLELKQGSLSVAEYERKYTELSRYAEVIVASESDRCRRFERGLRFE
        VKLATFLLQKEAEGWWKSIIARR+DARTLDWQTFRGIFEEKYYP TYCEAKRDEFLELKQ SLSVA+YERKYTELSRYAE+IVASESDRC RFERGLRFE
Subjt:  VKLATFLLQKEAEGWWKSIIARRSDARTLDWQTFRGIFEEKYYPATYCEAKRDEFLELKQGSLSVAEYERKYTELSRYAEVIVASESDRCRRFERGLRFE

Query:  IRTPVTAIAKWTDFSQLVETALRVEQSIIEERSVAEPSRGAPTASGFRGREQRRFAPGVNVSGRQDFKRRSGGQLSRQMSSGSAYQRQSQRAPS
        IRTPVTAIAKW +FSQLVETALRV+QSI+EE+S  E SRG  T SG RGREQRRF PGVNVSG QDFKRRSGG+  RQMSSGSAYQRQS+RA S
Subjt:  IRTPVTAIAKWTDFSQLVETALRVEQSIIEERSVAEPSRGAPTASGFRGREQRRFAPGVNVSGRQDFKRRSGGQLSRQMSSGSAYQRQSQRAPS

A0A5D3CU23 Retrotrans_gag domain-containing protein5.4e-15075.31Show/hide
Query:  VEIELPVPDTLPTSAESSGSSLSYFQTEICHGDTSCILICFRGFRCVVVHMEYFISMIRSTGIVRGDDVCWFHAVFRAKLASGPGGG--------VMPPR
        VEIELPV DTLPTSAESS S+                                     RS GIVR DDVCW H+VFRAK ASGPGGG        VMP R
Subjt:  VEIELPVPDTLPTSAESSGSSLSYFQTEICHGDTSCILICFRGFRCVVVHMEYFISMIRSTGIVRGDDVCWFHAVFRAKLASGPGGG--------VMPPR

Query:  TSRQRRQNQDGTQDPTQSQSERGSSTPRGQNEAGSDQFARSAQEIGRPERAGPSDLEKTYGIERLKKLGATVFEGSTDPADVEVWLNMLEKCFDVMSCPE
        TSR+RRQNQDG Q PTQ QSERGSS PR QNE GS++FARSAQEIG PER  PSD EK Y IERLKKLGATVFEGSTD AD EVWLNMLEKCFDVMSCP+
Subjt:  TSRQRRQNQDGTQDPTQSQSERGSSTPRGQNEAGSDQFARSAQEIGRPERAGPSDLEKTYGIERLKKLGATVFEGSTDPADVEVWLNMLEKCFDVMSCPE

Query:  ERKVKLATFLLQKEAEGWWKSIIARRSDARTLDWQTFRGIFEEKYYPATYCEAKRDEFLELKQGSLSVAEYERKYTELSRYAEVIVASESDRCRRFERGL
        ERKV+LATFLLQKEAEGWWKSIIARR+DA TLD QTFRGIF EKYYP TYCEAKRDEFLELKQGSL VAEYERKYTELSRY E+IVASESDRCRRFERGL
Subjt:  ERKVKLATFLLQKEAEGWWKSIIARRSDARTLDWQTFRGIFEEKYYPATYCEAKRDEFLELKQGSLSVAEYERKYTELSRYAEVIVASESDRCRRFERGL

Query:  RFEIRTPVTAIAKWTDFSQLVETALRVEQSIIEERSVAEPSRGAPTASGFRGREQRRFAPGVNVSGRQDFKRRSGGQLSRQMSSGSAYQRQSQRAPS
        RFEI TPVTAIAKWT+FSQLVETALRVEQSI+EE+SV E SRG  T SG RGREQ RF PGVNVSG QDFKRRSGG+  RQMSSGSAYQRQSQRA S
Subjt:  RFEIRTPVTAIAKWTDFSQLVETALRVEQSIIEERSVAEPSRGAPTASGFRGREQRRFAPGVNVSGRQDFKRRSGGQLSRQMSSGSAYQRQSQRAPS

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.1e-3428.1Show/hide
Query:  LVLAGDVIIEVANVDTAAGLWSKLESLYMTKSLTKKLLLKQRLFSLRMQEGTSLRDHDTNRCVLIKLQEPKMDHSRSVDQTTCGVIRISSLETSFSTVDL
        L L+ DV+  + + DTA G+W++LESLYM+K+LT KL LK++L++L M EGT+                                               
Subjt:  LVLAGDVIIEVANVDTAAGLWSKLESLYMTKSLTKKLLLKQRLFSLRMQEGTSLRDHDTNRCVLIKLQEPKMDHSRSVDQTTCGVIRISSLETSFSTVDL

Query:  DHQDHIDQLNSIMLDLRNLEIKVDDEDATLILLVSLSLSYENFVDSFISGNDKLSLEEVKIVLLTREVRHKV---AGSVIVNQAFG-LVASSSKGHRKSG
            H++  N ++  L NL +K+++ED  ++LL SL  SY+N   + + G   + L++V   LL  E   K     G  ++ +  G     SS  + +SG
Subjt:  DHQDHIDQLNSIMLDLRNLEIKVDDEDATLILLVSLSLSYENFVDSFISGNDKLSLEEVKIVLLTREVRHKV---AGSVIVNQAFG-LVASSSKGHRKSG

Query:  KNSKSNETNPRANDICNYCKEKGHWKYDC---KKKEGSAAVAK------VNTDSEDDLALVVN-EEPCL-----KDLWVLDSGVSCHMCPNREWFSTYQH
           KS   +      C  C + GH+K DC   +K +G  +  K          + D++ L +N EE C+     +  WV+D+  S H  P R+ F  Y  
Subjt:  KNSKSNETNPRANDICNYCKEKGHWKYDC---KKKEGSAAVAK------VNTDSEDDLALVVN-EEPCL-----KDLWVLDSGVSCHMCPNREWFSTYQH

Query:  VDGGNVTMANNAVCKVVGIGSVRIQTHDGFFCTLDEVRHVPLMTKCMISLSVLDSKGFS---------------------FRGSTLTSFAAVAS---STV
         D G V M N +  K+ GIG + I+T+ G    L +VRHVP +   +IS   LD  G+                       RG+   + A +     +  
Subjt:  VDGGNVTMANNAVCKVVGIGSVRIQTHDGFFCTLDEVRHVPLMTKCMISLSVLDSKGFS---------------------FRGSTLTSFAAVAS---STV

Query:  RKEDMTKLWHMRLGHMMKVG
        + E    LWH R+GHM + G
Subjt:  RKEDMTKLWHMRLGHMMKVG

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTAGTTCTGGCTGGTGACGTCATCATTGAGGTCGCAAATGTGGATACTGCAGCTGGTCTTTGGTCTAAGCTCGAGAGTCTCTACATGACGAAGTCTTTGACTAAGAA
GTTGCTCTTGAAGCAACGTTTGTTCAGTCTGCGAATGCAGGAAGGTACGTCCCTTAGAGATCATGATACGAACCGATGTGTCTTAATCAAGCTACAAGAACCCAAAATGG
ATCATTCACGTAGCGTTGATCAAACTACTTGTGGAGTGATTCGAATCTCGAGTTTAGAAACGAGTTTCTCTACTGTTGATCTTGATCATCAAGATCACATAGACCAGTTG
AACTCTATTATGTTAGATCTACGTAACCTTGAGATTAAGGTAGATGATGAGGATGCTACTTTGATTCTGTTAGTATCTTTGTCGTTGTCGTATGAGAATTTTGTAGATTC
TTTTATTAGTGGTAATGATAAGTTGTCTCTAGAAGAAGTAAAAATTGTCCTTCTGACTAGAGAAGTCCGTCATAAAGTAGCAGGTTCGGTTATAGTAAATCAGGCGTTTG
GGTTGGTTGCTTCGAGTAGTAAAGGACATAGGAAATCTGGTAAAAATTCGAAGTCTAATGAAACTAATCCTAGAGCGAATGACATCTGTAATTACTGTAAAGAAAAGGGA
CATTGGAAATATGACTGTAAAAAGAAGGAAGGCTCTGCCGCTGTGGCAAAAGTTAATACTGATTCAGAAGATGATTTAGCCCTAGTTGTAAATGAAGAACCATGTCTTAA
AGATCTGTGGGTTCTTGATTCTGGGGTGTCGTGTCATATGTGTCCAAATAGGGAGTGGTTTTCAACTTATCAACACGTAGATGGTGGTAATGTCACTATGGCTAACAACG
CTGTCTGTAAGGTAGTTGGGATAGGCTCAGTCAGGATACAAACACATGATGGTTTTTTCTGCACTTTGGATGAGGTCAGGCATGTTCCACTAATGACCAAGTGTATGATA
TCTTTGAGTGTTTTAGACAGTAAGGGTTTCAGTTTCAGAGGTTCTACGCTGACTAGTTTTGCTGCTGTTGCATCCTCTACAGTTCGCAAAGAAGATATGACTAAGTTGTG
GCATATGAGACTTGGTCATATGATGAAAGTTGGGACAACTGATAACCTCGCTAATATGTTCGCCAAGCCAGTTCCAAGTGGCAAGTTTCAACATTGTTTGGACTTGCTAA
ATGTTCTGAATTGCAAATTGGTTCCTATGGGACCTTATGAAGCAGAGGGAGAATCTGGTGACTTTTTCGAGTTGCATCAGTTATATGAGAGAGAGTTCAAGTCAAGCTTG
GCGCATGCTTTGAGTAGGGACCAAGTTTGTCCTCCTTTGGCTCCAAGTTTGTCCTCCCTTTGGCTCCAAGCTAGATATGCTTTTAACTGGATAATGATTCTTCACGCGAC
AAAGAGAGCATTTAGAGACGAAGACATGGAGCAAACCTGTGTGATGAGATTCAAAGCTGATGGAGTAGAATTTGAGAGTCCATATTTGTGTTTGAAATGCGCCCACAAAT
GTAGAAGGAAAGTTTCGCAGAGCAATGAAGTTGAAGATCTGCGCTCACACTTGGAAACGAAAAAATATGGCAGTGGTTTACTATTTCATGGTCCGGTCTTATGCAAACTC
ATTGCACAGGATACCCCTGCTCACATGTCTACTACATGGACGCTTTGGATCAATACGTCTGTATCAAATACAAAGTGGGCTGTATCTCATAGTGTTACCAGGATAAGGCG
CAAATTGGTAGCCATCTATCTGCACCATGTCTATGTGGGCACTAATAGGCACTATTCTCTAGACGTAGAGATCGAGCTCCCGGTGCCTGATACACTGCCAACGTCTGCTG
AAAGTTCCGGATCAAGCTTGAGTTATTTCCAAACCGAGATATGTCATGGCGACACCTCATGTATTTTGATATGTTTTCGGGGTTTCCGTTGTGTTGTGGTTCATATGGAA
TATTTTATATCCATGATTAGGTCAACAGGGATCGTTAGAGGTGACGATGTCTGTTGGTTTCACGCCGTCTTTCGGGCTAAGCTAGCAAGTGGTCCGGGAGGGGGAGTCAT
GCCACCACGTACCAGCAGACAACGCAGGCAGAATCAGGACGGGACGCAGGATCCTACCCAAAGTCAATCTGAAAGGGGATCCAGTACCCCGAGAGGTCAGAATGAGGCAG
GGAGTGATCAATTTGCTAGATCTGCACAGGAGATCGGTAGACCAGAGAGAGCAGGGCCTAGTGATCTGGAAAAGACGTATGGGATTGAACGGTTGAAGAAGTTGGGAGCC
ACAGTGTTTGAGGGTTCCACAGATCCAGCTGACGTCGAGGTCTGGTTGAATATGTTGGAGAAATGCTTCGACGTGATGAGTTGTCCTGAGGAGCGAAAAGTCAAATTAGC
CACATTCCTTTTGCAGAAGGAGGCAGAGGGATGGTGGAAGTCTATTATAGCCAGGCGCAGTGATGCACGTACGTTAGATTGGCAGACATTCAGAGGCATATTTGAGGAAA
AGTACTATCCCGCCACATATTGTGAGGCAAAGAGAGATGAGTTTCTGGAGCTGAAACAAGGGTCACTTTCAGTGGCTGAGTACGAGAGGAAGTATACCGAGCTTTCACGG
TATGCTGAAGTGATTGTGGCATCTGAGAGTGACAGGTGTCGCAGGTTTGAGAGAGGGCTACGTTTTGAGATACGTACCCCAGTTACTGCGATTGCAAAGTGGACGGATTT
TTCTCAGCTAGTAGAGACTGCCTTACGTGTGGAGCAGAGTATTATAGAGGAAAGGTCGGTTGCGGAGCCTAGTCGTGGAGCTCCAACAGCTAGTGGTTTTCGAGGTCGTG
AGCAGCGGAGGTTTGCACCTGGAGTGAATGTTTCAGGCCGTCAAGACTTCAAGCGTCGATCTGGTGGCCAATTATCGAGGCAAATGAGTTCGGGTAGTGCCTATCAGAGG
CAGAGTCAGAGAGCCCCCAGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTAGTTCTGGCTGGTGACGTCATCATTGAGGTCGCAAATGTGGATACTGCAGCTGGTCTTTGGTCTAAGCTCGAGAGTCTCTACATGACGAAGTCTTTGACTAAGAA
GTTGCTCTTGAAGCAACGTTTGTTCAGTCTGCGAATGCAGGAAGGTACGTCCCTTAGAGATCATGATACGAACCGATGTGTCTTAATCAAGCTACAAGAACCCAAAATGG
ATCATTCACGTAGCGTTGATCAAACTACTTGTGGAGTGATTCGAATCTCGAGTTTAGAAACGAGTTTCTCTACTGTTGATCTTGATCATCAAGATCACATAGACCAGTTG
AACTCTATTATGTTAGATCTACGTAACCTTGAGATTAAGGTAGATGATGAGGATGCTACTTTGATTCTGTTAGTATCTTTGTCGTTGTCGTATGAGAATTTTGTAGATTC
TTTTATTAGTGGTAATGATAAGTTGTCTCTAGAAGAAGTAAAAATTGTCCTTCTGACTAGAGAAGTCCGTCATAAAGTAGCAGGTTCGGTTATAGTAAATCAGGCGTTTG
GGTTGGTTGCTTCGAGTAGTAAAGGACATAGGAAATCTGGTAAAAATTCGAAGTCTAATGAAACTAATCCTAGAGCGAATGACATCTGTAATTACTGTAAAGAAAAGGGA
CATTGGAAATATGACTGTAAAAAGAAGGAAGGCTCTGCCGCTGTGGCAAAAGTTAATACTGATTCAGAAGATGATTTAGCCCTAGTTGTAAATGAAGAACCATGTCTTAA
AGATCTGTGGGTTCTTGATTCTGGGGTGTCGTGTCATATGTGTCCAAATAGGGAGTGGTTTTCAACTTATCAACACGTAGATGGTGGTAATGTCACTATGGCTAACAACG
CTGTCTGTAAGGTAGTTGGGATAGGCTCAGTCAGGATACAAACACATGATGGTTTTTTCTGCACTTTGGATGAGGTCAGGCATGTTCCACTAATGACCAAGTGTATGATA
TCTTTGAGTGTTTTAGACAGTAAGGGTTTCAGTTTCAGAGGTTCTACGCTGACTAGTTTTGCTGCTGTTGCATCCTCTACAGTTCGCAAAGAAGATATGACTAAGTTGTG
GCATATGAGACTTGGTCATATGATGAAAGTTGGGACAACTGATAACCTCGCTAATATGTTCGCCAAGCCAGTTCCAAGTGGCAAGTTTCAACATTGTTTGGACTTGCTAA
ATGTTCTGAATTGCAAATTGGTTCCTATGGGACCTTATGAAGCAGAGGGAGAATCTGGTGACTTTTTCGAGTTGCATCAGTTATATGAGAGAGAGTTCAAGTCAAGCTTG
GCGCATGCTTTGAGTAGGGACCAAGTTTGTCCTCCTTTGGCTCCAAGTTTGTCCTCCCTTTGGCTCCAAGCTAGATATGCTTTTAACTGGATAATGATTCTTCACGCGAC
AAAGAGAGCATTTAGAGACGAAGACATGGAGCAAACCTGTGTGATGAGATTCAAAGCTGATGGAGTAGAATTTGAGAGTCCATATTTGTGTTTGAAATGCGCCCACAAAT
GTAGAAGGAAAGTTTCGCAGAGCAATGAAGTTGAAGATCTGCGCTCACACTTGGAAACGAAAAAATATGGCAGTGGTTTACTATTTCATGGTCCGGTCTTATGCAAACTC
ATTGCACAGGATACCCCTGCTCACATGTCTACTACATGGACGCTTTGGATCAATACGTCTGTATCAAATACAAAGTGGGCTGTATCTCATAGTGTTACCAGGATAAGGCG
CAAATTGGTAGCCATCTATCTGCACCATGTCTATGTGGGCACTAATAGGCACTATTCTCTAGACGTAGAGATCGAGCTCCCGGTGCCTGATACACTGCCAACGTCTGCTG
AAAGTTCCGGATCAAGCTTGAGTTATTTCCAAACCGAGATATGTCATGGCGACACCTCATGTATTTTGATATGTTTTCGGGGTTTCCGTTGTGTTGTGGTTCATATGGAA
TATTTTATATCCATGATTAGGTCAACAGGGATCGTTAGAGGTGACGATGTCTGTTGGTTTCACGCCGTCTTTCGGGCTAAGCTAGCAAGTGGTCCGGGAGGGGGAGTCAT
GCCACCACGTACCAGCAGACAACGCAGGCAGAATCAGGACGGGACGCAGGATCCTACCCAAAGTCAATCTGAAAGGGGATCCAGTACCCCGAGAGGTCAGAATGAGGCAG
GGAGTGATCAATTTGCTAGATCTGCACAGGAGATCGGTAGACCAGAGAGAGCAGGGCCTAGTGATCTGGAAAAGACGTATGGGATTGAACGGTTGAAGAAGTTGGGAGCC
ACAGTGTTTGAGGGTTCCACAGATCCAGCTGACGTCGAGGTCTGGTTGAATATGTTGGAGAAATGCTTCGACGTGATGAGTTGTCCTGAGGAGCGAAAAGTCAAATTAGC
CACATTCCTTTTGCAGAAGGAGGCAGAGGGATGGTGGAAGTCTATTATAGCCAGGCGCAGTGATGCACGTACGTTAGATTGGCAGACATTCAGAGGCATATTTGAGGAAA
AGTACTATCCCGCCACATATTGTGAGGCAAAGAGAGATGAGTTTCTGGAGCTGAAACAAGGGTCACTTTCAGTGGCTGAGTACGAGAGGAAGTATACCGAGCTTTCACGG
TATGCTGAAGTGATTGTGGCATCTGAGAGTGACAGGTGTCGCAGGTTTGAGAGAGGGCTACGTTTTGAGATACGTACCCCAGTTACTGCGATTGCAAAGTGGACGGATTT
TTCTCAGCTAGTAGAGACTGCCTTACGTGTGGAGCAGAGTATTATAGAGGAAAGGTCGGTTGCGGAGCCTAGTCGTGGAGCTCCAACAGCTAGTGGTTTTCGAGGTCGTG
AGCAGCGGAGGTTTGCACCTGGAGTGAATGTTTCAGGCCGTCAAGACTTCAAGCGTCGATCTGGTGGCCAATTATCGAGGCAAATGAGTTCGGGTAGTGCCTATCAGAGG
CAGAGTCAGAGAGCCCCCAGTTAG
Protein sequenceShow/hide protein sequence
MLVLAGDVIIEVANVDTAAGLWSKLESLYMTKSLTKKLLLKQRLFSLRMQEGTSLRDHDTNRCVLIKLQEPKMDHSRSVDQTTCGVIRISSLETSFSTVDLDHQDHIDQL
NSIMLDLRNLEIKVDDEDATLILLVSLSLSYENFVDSFISGNDKLSLEEVKIVLLTREVRHKVAGSVIVNQAFGLVASSSKGHRKSGKNSKSNETNPRANDICNYCKEKG
HWKYDCKKKEGSAAVAKVNTDSEDDLALVVNEEPCLKDLWVLDSGVSCHMCPNREWFSTYQHVDGGNVTMANNAVCKVVGIGSVRIQTHDGFFCTLDEVRHVPLMTKCMI
SLSVLDSKGFSFRGSTLTSFAAVASSTVRKEDMTKLWHMRLGHMMKVGTTDNLANMFAKPVPSGKFQHCLDLLNVLNCKLVPMGPYEAEGESGDFFELHQLYEREFKSSL
AHALSRDQVCPPLAPSLSSLWLQARYAFNWIMILHATKRAFRDEDMEQTCVMRFKADGVEFESPYLCLKCAHKCRRKVSQSNEVEDLRSHLETKKYGSGLLFHGPVLCKL
IAQDTPAHMSTTWTLWINTSVSNTKWAVSHSVTRIRRKLVAIYLHHVYVGTNRHYSLDVEIELPVPDTLPTSAESSGSSLSYFQTEICHGDTSCILICFRGFRCVVVHME
YFISMIRSTGIVRGDDVCWFHAVFRAKLASGPGGGVMPPRTSRQRRQNQDGTQDPTQSQSERGSSTPRGQNEAGSDQFARSAQEIGRPERAGPSDLEKTYGIERLKKLGA
TVFEGSTDPADVEVWLNMLEKCFDVMSCPEERKVKLATFLLQKEAEGWWKSIIARRSDARTLDWQTFRGIFEEKYYPATYCEAKRDEFLELKQGSLSVAEYERKYTELSR
YAEVIVASESDRCRRFERGLRFEIRTPVTAIAKWTDFSQLVETALRVEQSIIEERSVAEPSRGAPTASGFRGREQRRFAPGVNVSGRQDFKRRSGGQLSRQMSSGSAYQR
QSQRAPS