; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0027389 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0027389
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr8:63236..64489
RNA-Seq ExpressionLag0027389
SyntenyLag0027389
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]1.5e-9253.37Show/hide
Query:  MASNGSMNSTLDTSIATAA-QIFSPGNKISIVKLTDDKFLLWKFQIITALEGFDLHHHL--TDDPPSEFLTVTTEASTNEAQSVKSPNSAYLQWKRQDKI
        M+S  S+    +T  ++   QIF  GNKIS+VKL DD FLLWKFQI+TALE +DL + L    +PPS++L ++TE+S+  A +  +PN AY  WKRQD++
Subjt:  MASNGSMNSTLDTSIATAA-QIFSPGNKISIVKLTDDKFLLWKFQIITALEGFDLHHHL--TDDPPSEFLTVTTEASTNEAQSVKSPNSAYLQWKRQDKI

Query:  LSSWLVGPMSEDILHQMIHCSSAKAIWSCLEQIFTTRNLAQMMKIKTNLQTIQNGGMSLKEYFSKIQQYVDALYVVGKPVDVEDHILFILSGLGSDYDSM
        +SSWL+G MSE+IL+QM+HC SAK IW  L+ IF++R LAQ M+ K  L  I+ G M LKEYF KI Q VDAL  + KPV  +DHIL+IL+GLGSDY SM
Subjt:  LSSWLVGPMSEDILHQMIHCSSAKAIWSCLEQIFTTRNLAQMMKIKTNLQTIQNGGMSLKEYFSKIQQYVDALYVVGKPVDVEDHILFILSGLGSDYDSM

Query:  VSIISAKIGPQTVQEVMSLLLTQENRIESKLSSSKTALPSVNLTVSQKPPESES-PKPNPNPYAQSFTGGNR-GRGGGRYGSNRGGRTWNNRNRIQCQVY
        +S+ISA+    +VQEVMSLLLTQE++ ESKL  S+TALPSVN+        +ES  + N N Y  + +   R GRG GR  SNRG R   NRN+ QCQ+ 
Subjt:  VSIISAKIGPQTVQEVMSLLLTQENRIESKLSSSKTALPSVNLTVSQKPPESES-PKPNPNPYAQSFTGGNR-GRGGGRYGSNRGGRTWNNRNRIQCQVY

Query:  GKFGHTAQRCYFRYAPSGPSNNPGSFSPHFNQSS--RPHKFPQMAAMLTAPDINQDTSWYPDSGATNHLTHSFGNLSAGTEYGGGN
         K G++A RC+FRY    P +N   +SP+ + +S    +  PQM+AM+ A D+N D++WYPDSGATNHLTHS  NLS G+EYGGGN
Subjt:  GKFGHTAQRCYFRYAPSGPSNNPGSFSPHFNQSS--RPHKFPQMAAMLTAPDINQDTSWYPDSGATNHLTHSFGNLSAGTEYGGGN

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]1.5e-9253.37Show/hide
Query:  MASNGSMNSTLDTSIATAA-QIFSPGNKISIVKLTDDKFLLWKFQIITALEGFDLHHHL--TDDPPSEFLTVTTEASTNEAQSVKSPNSAYLQWKRQDKI
        M+S  S+    +T  ++   QIF  GNKIS+VKL DD FLLWKFQI+TALE +DL + L    +PPS++L ++TE+S+  A +  +PN AY  WKRQD++
Subjt:  MASNGSMNSTLDTSIATAA-QIFSPGNKISIVKLTDDKFLLWKFQIITALEGFDLHHHL--TDDPPSEFLTVTTEASTNEAQSVKSPNSAYLQWKRQDKI

Query:  LSSWLVGPMSEDILHQMIHCSSAKAIWSCLEQIFTTRNLAQMMKIKTNLQTIQNGGMSLKEYFSKIQQYVDALYVVGKPVDVEDHILFILSGLGSDYDSM
        +SSWL+G MSE+IL+QM+HC SAK IW  L+ IF++R LAQ M+ K  L  I+ G M LKEYF KI Q VDAL  + KPV  +DHIL+IL+GLGSDY SM
Subjt:  LSSWLVGPMSEDILHQMIHCSSAKAIWSCLEQIFTTRNLAQMMKIKTNLQTIQNGGMSLKEYFSKIQQYVDALYVVGKPVDVEDHILFILSGLGSDYDSM

Query:  VSIISAKIGPQTVQEVMSLLLTQENRIESKLSSSKTALPSVNLTVSQKPPESES-PKPNPNPYAQSFTGGNR-GRGGGRYGSNRGGRTWNNRNRIQCQVY
        +S+ISA+    +VQEVMSLLLTQE++ ESKL  S+TALPSVN+        +ES  + N N Y  + +   R GRG GR  SNRG R   NRN+ QCQ+ 
Subjt:  VSIISAKIGPQTVQEVMSLLLTQENRIESKLSSSKTALPSVNLTVSQKPPESES-PKPNPNPYAQSFTGGNR-GRGGGRYGSNRGGRTWNNRNRIQCQVY

Query:  GKFGHTAQRCYFRYAPSGPSNNPGSFSPHFNQSS--RPHKFPQMAAMLTAPDINQDTSWYPDSGATNHLTHSFGNLSAGTEYGGGN
         K G++A RC+FRY    P +N   +SP+ + +S    +  PQM+AM+ A D+N D++WYPDSGATNHLTHS  NLS G+EYGGGN
Subjt:  GKFGHTAQRCYFRYAPSGPSNNPGSFSPHFNQSS--RPHKFPQMAAMLTAPDINQDTSWYPDSGATNHLTHSFGNLSAGTEYGGGN

XP_022136882.1 dr1-associated corepressor homolog isoform X1 [Momordica charantia]7.6e-6044.19Show/hide
Query:  RQDKILSSWLVGPMSEDILHQMIHCSSAKAIWSCLEQIFTTRNLAQMMKIKTNLQTIQNGGMSLKEYFSKIQQYVDALYVVGKPVDVEDHILFILSGLGS
        +QDK+++SWL   M E+IL +MIHC++A+ +W  LE ++T+RNLA++M++K+ L+ I+ G + LK+YF K++  VD+L   GK V VEDHI+ IL+GL S
Subjt:  RQDKILSSWLVGPMSEDILHQMIHCSSAKAIWSCLEQIFTTRNLAQMMKIKTNLQTIQNGGMSLKEYFSKIQQYVDALYVVGKPVDVEDHILFILSGLGS

Query:  DYDSMVSIISAKIGPQTVQEVMSLLLTQENRIESKLSSSKTALPSVNLTVSQKPPESESPKPNPNPYAQSFTGGNRGRGGGRYGSNRGGRTWNNRNRIQC
        +++S VS+ISA+   QT+QEV SLLL+ E R E    ++   LPSVNLT   K   S        PY Q+    N G    R       R WN+ NR QC
Subjt:  DYDSMVSIISAKIGPQTVQEVMSLLLTQENRIESKLSSSKTALPSVNLTVSQKPPESESPKPNPNPYAQSFTGGNRGRGGGRYGSNRGGRTWNNRNRIQC

Query:  QVYGKFGHTAQRCYFRY-----APSGPSNNPGSFSPHFNQS------------------SRPHKFPQMAAMLTAPDINQDTSWYPDSGATNHLTHSFGNL
        Q+YGKFGHTA RCY R+      P+G S+    FS   N S                   +P+    MAA L   D N+DT+WYPDSGATNH+T +F NL
Subjt:  QVYGKFGHTAQRCYFRY-----APSGPSNNPGSFSPHFNQS------------------SRPHKFPQMAAMLTAPDINQDTSWYPDSGATNHLTHSFGNL

Query:  SAGTEYGGGN
        +  TEY G N
Subjt:  SAGTEYGGGN

XP_022136883.1 dr1-associated corepressor homolog isoform X2 [Momordica charantia]7.6e-6044.19Show/hide
Query:  RQDKILSSWLVGPMSEDILHQMIHCSSAKAIWSCLEQIFTTRNLAQMMKIKTNLQTIQNGGMSLKEYFSKIQQYVDALYVVGKPVDVEDHILFILSGLGS
        +QDK+++SWL   M E+IL +MIHC++A+ +W  LE ++T+RNLA++M++K+ L+ I+ G + LK+YF K++  VD+L   GK V VEDHI+ IL+GL S
Subjt:  RQDKILSSWLVGPMSEDILHQMIHCSSAKAIWSCLEQIFTTRNLAQMMKIKTNLQTIQNGGMSLKEYFSKIQQYVDALYVVGKPVDVEDHILFILSGLGS

Query:  DYDSMVSIISAKIGPQTVQEVMSLLLTQENRIESKLSSSKTALPSVNLTVSQKPPESESPKPNPNPYAQSFTGGNRGRGGGRYGSNRGGRTWNNRNRIQC
        +++S VS+ISA+   QT+QEV SLLL+ E R E    ++   LPSVNLT   K   S        PY Q+    N G    R       R WN+ NR QC
Subjt:  DYDSMVSIISAKIGPQTVQEVMSLLLTQENRIESKLSSSKTALPSVNLTVSQKPPESESPKPNPNPYAQSFTGGNRGRGGGRYGSNRGGRTWNNRNRIQC

Query:  QVYGKFGHTAQRCYFRY-----APSGPSNNPGSFSPHFNQS------------------SRPHKFPQMAAMLTAPDINQDTSWYPDSGATNHLTHSFGNL
        Q+YGKFGHTA RCY R+      P+G S+    FS   N S                   +P+    MAA L   D N+DT+WYPDSGATNH+T +F NL
Subjt:  QVYGKFGHTAQRCYFRY-----APSGPSNNPGSFSPHFNQS------------------SRPHKFPQMAAMLTAPDINQDTSWYPDSGATNHLTHSFGNL

Query:  SAGTEYGGGN
        +  TEY G N
Subjt:  SAGTEYGGGN

XP_022154487.1 uncharacterized protein LOC111021757 [Momordica charantia]4.7e-7841.97Show/hide
Query:  MASNGSMNSTLDTSIATAAQIFSPGNKISIVKLTDDKFLLWKFQIITALEGFDLHHHL--TDDPPSEFLTVTTEASTNEAQSVKSPNSAYLQWKRQDKIL
        MAS  S+ ++    I  A++  +PG+K+SIV+L DD  LLWKFQI TAL+G  L  ++   +D P++F+  T + S++   S    N AY +W +QDK++
Subjt:  MASNGSMNSTLDTSIATAAQIFSPGNKISIVKLTDDKFLLWKFQIITALEGFDLHHHL--TDDPPSEFLTVTTEASTNEAQSVKSPNSAYLQWKRQDKIL

Query:  SSWLVGPMSEDILHQMIHCSSAKAIWSCLEQIFTTRNLAQMMKIKTNLQTIQNGGMSLKEYFSKIQQYVDALYVVGKPVDVEDHILFILSGLGSDYDSMV
        S+WL+G M+EDIL QM+ C SA+ IW+ LE +F +R LA++M++K  L+  + G +SLK+YF KI+  VD+L + GK +  EDHI+ IL+GLG ++D+++
Subjt:  SSWLVGPMSEDILHQMIHCSSAKAIWSCLEQIFTTRNLAQMMKIKTNLQTIQNGGMSLKEYFSKIQQYVDALYVVGKPVDVEDHILFILSGLGSDYDSMV

Query:  SIISAKIGPQTVQEVMSLLLTQENRIESKLSSSKTALPSVNLTVSQKPPESESPKPNP-NPYAQSFTGGNRGRGGGRYGSNRGGRTWNNRNRIQCQVYGK
        S+I+A+  PQT+QEV SLLL QE R E  L +S  +LPSVNLT++    ++   +    NP+  +++   RGRG     SNR  R W   N+ QCQ+ G+
Subjt:  SIISAKIGPQTVQEVMSLLLTQENRIESKLSSSKTALPSVNLTVSQKPPESESPKPNP-NPYAQSFTGGNRGRGGGRYGSNRGGRTWNNRNRIQCQVYGK

Query:  FGHTAQRCYFRYAPS--GPSNNPGSFSPHFNQSSRPHKFP-----------------------QMAAMLTAPDINQDTSWYPDSGATNHLTHSFGNLSAG
        FGHTA RCY R+  +  GP+ NP +FSP    S  P   P                       QM A++ A D N+D++WY DSG TNH+T+ FGN S G
Subjt:  FGHTAQRCYFRYAPS--GPSNNPGSFSPHFNQSSRPHKFP-----------------------QMAAMLTAPDINQDTSWYPDSGATNHLTHSFGNLSAG

Query:  TEYGGGNCCLVQSDSKY
        +EY G     V + + Y
Subjt:  TEYGGGNCCLVQSDSKY

TrEMBL top hitse value%identityAlignment
A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-947.3e-9353.37Show/hide
Query:  MASNGSMNSTLDTSIATAA-QIFSPGNKISIVKLTDDKFLLWKFQIITALEGFDLHHHL--TDDPPSEFLTVTTEASTNEAQSVKSPNSAYLQWKRQDKI
        M+S  S+    +T  ++   QIF  GNKIS+VKL DD FLLWKFQI+TALE +DL + L    +PPS++L ++TE+S+  A +  +PN AY  WKRQD++
Subjt:  MASNGSMNSTLDTSIATAA-QIFSPGNKISIVKLTDDKFLLWKFQIITALEGFDLHHHL--TDDPPSEFLTVTTEASTNEAQSVKSPNSAYLQWKRQDKI

Query:  LSSWLVGPMSEDILHQMIHCSSAKAIWSCLEQIFTTRNLAQMMKIKTNLQTIQNGGMSLKEYFSKIQQYVDALYVVGKPVDVEDHILFILSGLGSDYDSM
        +SSWL+G MSE+IL+QM+HC SAK IW  L+ IF++R LAQ M+ K  L  I+ G M LKEYF KI Q VDAL  + KPV  +DHIL+IL+GLGSDY SM
Subjt:  LSSWLVGPMSEDILHQMIHCSSAKAIWSCLEQIFTTRNLAQMMKIKTNLQTIQNGGMSLKEYFSKIQQYVDALYVVGKPVDVEDHILFILSGLGSDYDSM

Query:  VSIISAKIGPQTVQEVMSLLLTQENRIESKLSSSKTALPSVNLTVSQKPPESES-PKPNPNPYAQSFTGGNR-GRGGGRYGSNRGGRTWNNRNRIQCQVY
        +S+ISA+    +VQEVMSLLLTQE++ ESKL  S+TALPSVN+        +ES  + N N Y  + +   R GRG GR  SNRG R   NRN+ QCQ+ 
Subjt:  VSIISAKIGPQTVQEVMSLLLTQENRIESKLSSSKTALPSVNLTVSQKPPESES-PKPNPNPYAQSFTGGNR-GRGGGRYGSNRGGRTWNNRNRIQCQVY

Query:  GKFGHTAQRCYFRYAPSGPSNNPGSFSPHFNQSS--RPHKFPQMAAMLTAPDINQDTSWYPDSGATNHLTHSFGNLSAGTEYGGGN
         K G++A RC+FRY    P +N   +SP+ + +S    +  PQM+AM+ A D+N D++WYPDSGATNHLTHS  NLS G+EYGGGN
Subjt:  GKFGHTAQRCYFRYAPSGPSNNPGSFSPHFNQSS--RPHKFPQMAAMLTAPDINQDTSWYPDSGATNHLTHSFGNLSAGTEYGGGN

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-947.3e-9353.37Show/hide
Query:  MASNGSMNSTLDTSIATAA-QIFSPGNKISIVKLTDDKFLLWKFQIITALEGFDLHHHL--TDDPPSEFLTVTTEASTNEAQSVKSPNSAYLQWKRQDKI
        M+S  S+    +T  ++   QIF  GNKIS+VKL DD FLLWKFQI+TALE +DL + L    +PPS++L ++TE+S+  A +  +PN AY  WKRQD++
Subjt:  MASNGSMNSTLDTSIATAA-QIFSPGNKISIVKLTDDKFLLWKFQIITALEGFDLHHHL--TDDPPSEFLTVTTEASTNEAQSVKSPNSAYLQWKRQDKI

Query:  LSSWLVGPMSEDILHQMIHCSSAKAIWSCLEQIFTTRNLAQMMKIKTNLQTIQNGGMSLKEYFSKIQQYVDALYVVGKPVDVEDHILFILSGLGSDYDSM
        +SSWL+G MSE+IL+QM+HC SAK IW  L+ IF++R LAQ M+ K  L  I+ G M LKEYF KI Q VDAL  + KPV  +DHIL+IL+GLGSDY SM
Subjt:  LSSWLVGPMSEDILHQMIHCSSAKAIWSCLEQIFTTRNLAQMMKIKTNLQTIQNGGMSLKEYFSKIQQYVDALYVVGKPVDVEDHILFILSGLGSDYDSM

Query:  VSIISAKIGPQTVQEVMSLLLTQENRIESKLSSSKTALPSVNLTVSQKPPESES-PKPNPNPYAQSFTGGNR-GRGGGRYGSNRGGRTWNNRNRIQCQVY
        +S+ISA+    +VQEVMSLLLTQE++ ESKL  S+TALPSVN+        +ES  + N N Y  + +   R GRG GR  SNRG R   NRN+ QCQ+ 
Subjt:  VSIISAKIGPQTVQEVMSLLLTQENRIESKLSSSKTALPSVNLTVSQKPPESES-PKPNPNPYAQSFTGGNR-GRGGGRYGSNRGGRTWNNRNRIQCQVY

Query:  GKFGHTAQRCYFRYAPSGPSNNPGSFSPHFNQSS--RPHKFPQMAAMLTAPDINQDTSWYPDSGATNHLTHSFGNLSAGTEYGGGN
         K G++A RC+FRY    P +N   +SP+ + +S    +  PQM+AM+ A D+N D++WYPDSGATNHLTHS  NLS G+EYGGGN
Subjt:  GKFGHTAQRCYFRYAPSGPSNNPGSFSPHFNQSS--RPHKFPQMAAMLTAPDINQDTSWYPDSGATNHLTHSFGNLSAGTEYGGGN

A0A6J1C6N9 dr1-associated corepressor homolog isoform X13.7e-6044.19Show/hide
Query:  RQDKILSSWLVGPMSEDILHQMIHCSSAKAIWSCLEQIFTTRNLAQMMKIKTNLQTIQNGGMSLKEYFSKIQQYVDALYVVGKPVDVEDHILFILSGLGS
        +QDK+++SWL   M E+IL +MIHC++A+ +W  LE ++T+RNLA++M++K+ L+ I+ G + LK+YF K++  VD+L   GK V VEDHI+ IL+GL S
Subjt:  RQDKILSSWLVGPMSEDILHQMIHCSSAKAIWSCLEQIFTTRNLAQMMKIKTNLQTIQNGGMSLKEYFSKIQQYVDALYVVGKPVDVEDHILFILSGLGS

Query:  DYDSMVSIISAKIGPQTVQEVMSLLLTQENRIESKLSSSKTALPSVNLTVSQKPPESESPKPNPNPYAQSFTGGNRGRGGGRYGSNRGGRTWNNRNRIQC
        +++S VS+ISA+   QT+QEV SLLL+ E R E    ++   LPSVNLT   K   S        PY Q+    N G    R       R WN+ NR QC
Subjt:  DYDSMVSIISAKIGPQTVQEVMSLLLTQENRIESKLSSSKTALPSVNLTVSQKPPESESPKPNPNPYAQSFTGGNRGRGGGRYGSNRGGRTWNNRNRIQC

Query:  QVYGKFGHTAQRCYFRY-----APSGPSNNPGSFSPHFNQS------------------SRPHKFPQMAAMLTAPDINQDTSWYPDSGATNHLTHSFGNL
        Q+YGKFGHTA RCY R+      P+G S+    FS   N S                   +P+    MAA L   D N+DT+WYPDSGATNH+T +F NL
Subjt:  QVYGKFGHTAQRCYFRY-----APSGPSNNPGSFSPHFNQS------------------SRPHKFPQMAAMLTAPDINQDTSWYPDSGATNHLTHSFGNL

Query:  SAGTEYGGGN
        +  TEY G N
Subjt:  SAGTEYGGGN

A0A6J1C8R2 dr1-associated corepressor homolog isoform X23.7e-6044.19Show/hide
Query:  RQDKILSSWLVGPMSEDILHQMIHCSSAKAIWSCLEQIFTTRNLAQMMKIKTNLQTIQNGGMSLKEYFSKIQQYVDALYVVGKPVDVEDHILFILSGLGS
        +QDK+++SWL   M E+IL +MIHC++A+ +W  LE ++T+RNLA++M++K+ L+ I+ G + LK+YF K++  VD+L   GK V VEDHI+ IL+GL S
Subjt:  RQDKILSSWLVGPMSEDILHQMIHCSSAKAIWSCLEQIFTTRNLAQMMKIKTNLQTIQNGGMSLKEYFSKIQQYVDALYVVGKPVDVEDHILFILSGLGS

Query:  DYDSMVSIISAKIGPQTVQEVMSLLLTQENRIESKLSSSKTALPSVNLTVSQKPPESESPKPNPNPYAQSFTGGNRGRGGGRYGSNRGGRTWNNRNRIQC
        +++S VS+ISA+   QT+QEV SLLL+ E R E    ++   LPSVNLT   K   S        PY Q+    N G    R       R WN+ NR QC
Subjt:  DYDSMVSIISAKIGPQTVQEVMSLLLTQENRIESKLSSSKTALPSVNLTVSQKPPESESPKPNPNPYAQSFTGGNRGRGGGRYGSNRGGRTWNNRNRIQC

Query:  QVYGKFGHTAQRCYFRY-----APSGPSNNPGSFSPHFNQS------------------SRPHKFPQMAAMLTAPDINQDTSWYPDSGATNHLTHSFGNL
        Q+YGKFGHTA RCY R+      P+G S+    FS   N S                   +P+    MAA L   D N+DT+WYPDSGATNH+T +F NL
Subjt:  QVYGKFGHTAQRCYFRY-----APSGPSNNPGSFSPHFNQS------------------SRPHKFPQMAAMLTAPDINQDTSWYPDSGATNHLTHSFGNL

Query:  SAGTEYGGGN
        +  TEY G N
Subjt:  SAGTEYGGGN

A0A6J1DLT9 uncharacterized protein LOC1110217572.3e-7841.97Show/hide
Query:  MASNGSMNSTLDTSIATAAQIFSPGNKISIVKLTDDKFLLWKFQIITALEGFDLHHHL--TDDPPSEFLTVTTEASTNEAQSVKSPNSAYLQWKRQDKIL
        MAS  S+ ++    I  A++  +PG+K+SIV+L DD  LLWKFQI TAL+G  L  ++   +D P++F+  T + S++   S    N AY +W +QDK++
Subjt:  MASNGSMNSTLDTSIATAAQIFSPGNKISIVKLTDDKFLLWKFQIITALEGFDLHHHL--TDDPPSEFLTVTTEASTNEAQSVKSPNSAYLQWKRQDKIL

Query:  SSWLVGPMSEDILHQMIHCSSAKAIWSCLEQIFTTRNLAQMMKIKTNLQTIQNGGMSLKEYFSKIQQYVDALYVVGKPVDVEDHILFILSGLGSDYDSMV
        S+WL+G M+EDIL QM+ C SA+ IW+ LE +F +R LA++M++K  L+  + G +SLK+YF KI+  VD+L + GK +  EDHI+ IL+GLG ++D+++
Subjt:  SSWLVGPMSEDILHQMIHCSSAKAIWSCLEQIFTTRNLAQMMKIKTNLQTIQNGGMSLKEYFSKIQQYVDALYVVGKPVDVEDHILFILSGLGSDYDSMV

Query:  SIISAKIGPQTVQEVMSLLLTQENRIESKLSSSKTALPSVNLTVSQKPPESESPKPNP-NPYAQSFTGGNRGRGGGRYGSNRGGRTWNNRNRIQCQVYGK
        S+I+A+  PQT+QEV SLLL QE R E  L +S  +LPSVNLT++    ++   +    NP+  +++   RGRG     SNR  R W   N+ QCQ+ G+
Subjt:  SIISAKIGPQTVQEVMSLLLTQENRIESKLSSSKTALPSVNLTVSQKPPESESPKPNP-NPYAQSFTGGNRGRGGGRYGSNRGGRTWNNRNRIQCQVYGK

Query:  FGHTAQRCYFRYAPS--GPSNNPGSFSPHFNQSSRPHKFP-----------------------QMAAMLTAPDINQDTSWYPDSGATNHLTHSFGNLSAG
        FGHTA RCY R+  +  GP+ NP +FSP    S  P   P                       QM A++ A D N+D++WY DSG TNH+T+ FGN S G
Subjt:  FGHTAQRCYFRYAPS--GPSNNPGSFSPHFNQSSRPHKFP-----------------------QMAAMLTAPDINQDTSWYPDSGATNHLTHSFGNLSAG

Query:  TEYGGGNCCLVQSDSKY
        +EY G     V + + Y
Subjt:  TEYGGGNCCLVQSDSKY

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.7e-2525.99Show/hide
Query:  NKISIVKLTDDKFLLWKFQIITALEGFDLHHHLTDDPPSEFLTVTTEASTNEAQSVKSPNSAYLQWKRQDKILSSWLVGPMSEDILHQMIHCSSAKAIWS
        N  ++ KLT   +L+W  Q+    +G++L   L         T+ T+A+          N  Y +WKRQDK++ S ++G +S  +   +   ++A  IW 
Subjt:  NKISIVKLTDDKFLLWKFQIITALEGFDLHHHLTDDPPSEFLTVTTEASTNEAQSVKSPNSAYLQWKRQDKILSSWLVGPMSEDILHQMIHCSSAKAIWS

Query:  CLEQIFTTRNLAQMMKIKTNLQTIQNGGMSLKEYFSKIQQYVDALYVVGKPVDVEDHILFILSGLGSDYDSMVSIISAKIGPQTVQEVMSLLLTQENRIE
         L +I+   +   + +++T L+    G  ++ +Y   +    D L ++GKP+D ++ +  +L  L  +Y  ++  I+AK  P T+ E+   LL  E++I 
Subjt:  CLEQIFTTRNLAQMMKIKTNLQTIQNGGMSLKEYFSKIQQYVDALYVVGKPVDVEDHILFILSGLGSDYDSMVSIISAKIGPQTVQEVMSLLLTQENRIE

Query:  SKLSSSKTALPSVNLTVSQKPPESESPKPNPNPYAQSFTGGNRGRGGGRYGSNRGGRTW----------NNRNRI---QCQVYGKFGHTAQRCYFRYAPS
        +   SS T +P     VS +   + +   N          GNR        +N   + W          NN+++    +CQ+ G  GH+A+RC       
Subjt:  SKLSSSKTALPSVNLTVSQKPPESESPKPNPNPYAQSFTGGNRGRGGGRYGSNRGGRTW----------NNRNRI---QCQVYGKFGHTAQRCYFRYAPS

Query:  GPSNNPGSFSPHFNQSSRPHKFP--QMAAMLTAPDINQDTSWYPDSGATNHLTHSFGNLSAGTEYGGGNCCLVQSDS
           +    F    N    P  F   Q  A L         +W  DSGAT+H+T  F NLS    Y GG+  +V   S
Subjt:  GPSNNPGSFSPHFNQSSRPHKFP--QMAAMLTAPDINQDTSWYPDSGATNHLTHSFGNLSAGTEYGGGNCCLVQSDS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.8e-1924.43Show/hide
Query:  NKISIVKLTDDKFLLWKFQIITALEGFDLHHHLTDDPPSEFLTVTTEASTNEAQSVKSPNSAYLQWKRQDKILSSWLVGPMSEDILHQMIHCSSAKAIWS
        N  ++ KLT   +L+W  Q+    +G++L   L    P    T+ T+A       V   N  Y +W+RQDK++ S ++G +S  +   +   ++A  IW 
Subjt:  NKISIVKLTDDKFLLWKFQIITALEGFDLHHHLTDDPPSEFLTVTTEASTNEAQSVKSPNSAYLQWKRQDKILSSWLVGPMSEDILHQMIHCSSAKAIWS

Query:  CLEQIFTTRNLAQMMKIKTNLQTIQNGGMSLKEYFSKIQQYVDALYVVGKPVDVEDHILFILSGLGSDYDSMVSIISAKIGPQTVQEVMSLLLTQENRIE
         L +I+   +   +    T L+ I                  D L ++GKP+D ++ +  +L  L  DY  ++  I+AK  P ++ E+   L+ +E+++ 
Subjt:  CLEQIFTTRNLAQMMKIKTNLQTIQNGGMSLKEYFSKIQQYVDALYVVGKPVDVEDHILFILSGLGSDYDSMVSIISAKIGPQTVQEVMSLLLTQENRIE

Query:  SKLSSSKTALPSVNLTVSQKPPESESPKPNPNPYAQSFTGGNRGRGGGRYGSNRGGRTWNNRNRI---QCQVYGKFGHTAQRCYFRYAPSGPSNNPGSFS
        + L+S++    + N+   +    + + +   N         N  R      S+ G R+ N + +    +CQ+    GH+A+RC        P  +    +
Subjt:  SKLSSSKTALPSVNLTVSQKPPESESPKPNPNPYAQSFTGGNRGRGGGRYGSNRGGRTWNNRNRI---QCQVYGKFGHTAQRCYFRYAPSGPSNNPGSFS

Query:  PHFNQSSRPHKFPQMAAMLTAPDINQDTSWYPDSGATNHLTHSFGNLSAGTEYGGGNCCLVQSDSKYARKTH---KHMCKCTRSVASDKYFYI
         +  QS+ P    Q  A L         +W  DSGAT+H+T  F NLS    Y GG+  ++ +D      TH     +   +RS+  +K  Y+
Subjt:  PHFNQSSRPHKFPQMAAMLTAPDINQDTSWYPDSGATNHLTHSFGNLSAGTEYGGGNCCLVQSDSKYARKTH---KHMCKCTRSVASDKYFYI

Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)4.2e-0823.39Show/hide
Query:  NSAYLQWKRQDKILSSWLVGPMS-EDILHQMIHCSSAKAIWSCLEQIFTTRNLAQMMKIKTNLQTIQNGGMSLKEYFSKIQQYVDALYVVGKPVDVEDHI
        N+  + W+++D I+   L G ++ +      +  S+++ IW  ++  F     A+ +++ + L+T   G M + +Y+ K+++  D+L  V  PV   + +
Subjt:  NSAYLQWKRQDKILSSWLVGPMS-EDILHQMIHCSSAKAIWSCLEQIFTTRNLAQMMKIKTNLQTIQNGGMSLKEYFSKIQQYVDALYVVGKPVDVEDHI

Query:  LFILSGLGSDYDSMVSIISAKIGPQTVQEVMSLLLTQENRIESKLSSSKT----ALPSVNLTVSQKPPESESPKPNPNPYAQSFTGGNRGRG-GGRYGSN
        +++L+GL   +D+++++I  +    +  +  ++L  +E+R++  +  + T    +  S  L  S+ PP +   +   N        G RGRG G      
Subjt:  LFILSGLGSDYDSMVSIISAKIGPQTVQEVMSLLLTQENRIESKLSSSKT----ALPSVNLTVSQKPPESESPKPNPNPYAQSFTGGNRGRG-GGRYGSN

Query:  RGGR-------TWNNRNR
        RGGR       T+N+ NR
Subjt:  RGGR-------TWNNRNR

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)9.7e-1328.02Show/hide
Query:  QWKRQDKILSSWLVGPMSEDILHQMIHCS-SAKAIWSCLEQIFTTRNLAQMMKIKTNLQTIQNGGMSLKEYFSKIQQYVDALYVVGKPVDVEDHILFILS
        +WK +D ++  W+ G +++ +L  +I    +A+ +W  LE +F     A+ ++ +  L+T     +S+ EY  K++   D L  V  P+     ++ +L+
Subjt:  QWKRQDKILSSWLVGPMSEDILHQMIHCS-SAKAIWSCLEQIFTTRNLAQMMKIKTNLQTIQNGGMSLKEYFSKIQQYVDALYVVGKPVDVEDHILFILS

Query:  GLGSDYDSMVSIISAKIGPQTVQEVMSLLLTQENRI--ESKLSSSKTALPSVNLTVSQKPPESESPKPNPNPYAQSFTGGNRGRGGGR-YGSNRGGRT--
        GL   YD ++++I  K    +  E  S+LL +E+R+  +SK S S T  PS++  +   P + E        Y Q +   N   G GR    NRGG +  
Subjt:  GLGSDYDSMVSIISAKIGPQTVQEVMSLLLTQENRI--ESKLSSSKTALPSVNLTVSQKPPESESPKPNPNPYAQSFTGGNRGRGGGR-YGSNRGGRT--

Query:  --WNNRN
          +NN N
Subjt:  --WNNRN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCTAATGGTTCTATGAACTCCACTCTGGATACTTCCATTGCTACGGCTGCGCAAATCTTTAGTCCGGGTAATAAAATTTCAATTGTCAAGCTTACTGATGACAA
ATTTTTGTTATGGAAATTTCAAATTATTACTGCTCTTGAGGGTTTTGACCTTCACCATCATCTTACAGATGATCCTCCTTCTGAGTTTCTTACTGTCACTACTGAAGCGT
CTACAAATGAGGCTCAATCTGTCAAATCCCCAAATTCGGCTTATCTTCAATGGAAGCGTCAGGATAAAATTCTCTCTTCATGGCTCGTTGGGCCGATGTCCGAAGACATC
CTTCATCAGATGATCCATTGCTCCTCAGCCAAGGCTATATGGTCCTGTCTTGAGCAAATCTTCACTACCCGCAACTTGGCTCAGATGATGAAAATTAAGACCAATCTGCA
AACTATCCAGAATGGAGGTATGTCACTCAAAGAATACTTTTCGAAAATTCAGCAGTATGTAGATGCTTTATATGTTGTGGGGAAACCGGTGGATGTTGAGGACCATATAT
TGTTTATCTTATCTGGTTTGGGTTCTGATTATGATTCTATGGTGTCTATCATATCTGCTAAAATTGGTCCTCAGACCGTTCAAGAGGTTATGTCGCTTTTATTAACCCAG
GAAAATCGTATTGAAAGTAAGTTATCGAGTTCTAAGACTGCCCTTCCCTCTGTAAACCTCACGGTTAGCCAGAAACCCCCTGAATCTGAGTCCCCGAAGCCTAATCCGAA
TCCTTATGCTCAATCCTTCACTGGTGGGAATCGAGGGCGTGGTGGTGGTCGTTATGGTTCCAACCGTGGAGGTCGTACCTGGAACAACAGAAACAGAATCCAATGTCAGG
TGTATGGAAAGTTTGGTCACACTGCTCAGCGTTGTTATTTTCGATATGCTCCATCTGGTCCTTCTAACAATCCTGGTTCATTCTCTCCTCACTTTAATCAGTCTAGTCGC
CCACATAAATTTCCACAGATGGCTGCCATGCTCACGGCTCCTGATATCAATCAAGACACCAGCTGGTACCCTGATTCCGGTGCAACCAATCATCTCACTCACTCTTTTGG
CAATCTCTCGGCAGGTACCGAGTATGGTGGCGGTAATTGTTGTCTTGTGCAAAGTGATTCTAAATATGCAAGAAAAACTCACAAACACATGTGCAAGTGCACACGATCAG
TAGCAAGTGATAAGTATTTTTATATCGTTCTCCACAGGGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGTCTAATGGTTCTATGAACTCCACTCTGGATACTTCCATTGCTACGGCTGCGCAAATCTTTAGTCCGGGTAATAAAATTTCAATTGTCAAGCTTACTGATGACAA
ATTTTTGTTATGGAAATTTCAAATTATTACTGCTCTTGAGGGTTTTGACCTTCACCATCATCTTACAGATGATCCTCCTTCTGAGTTTCTTACTGTCACTACTGAAGCGT
CTACAAATGAGGCTCAATCTGTCAAATCCCCAAATTCGGCTTATCTTCAATGGAAGCGTCAGGATAAAATTCTCTCTTCATGGCTCGTTGGGCCGATGTCCGAAGACATC
CTTCATCAGATGATCCATTGCTCCTCAGCCAAGGCTATATGGTCCTGTCTTGAGCAAATCTTCACTACCCGCAACTTGGCTCAGATGATGAAAATTAAGACCAATCTGCA
AACTATCCAGAATGGAGGTATGTCACTCAAAGAATACTTTTCGAAAATTCAGCAGTATGTAGATGCTTTATATGTTGTGGGGAAACCGGTGGATGTTGAGGACCATATAT
TGTTTATCTTATCTGGTTTGGGTTCTGATTATGATTCTATGGTGTCTATCATATCTGCTAAAATTGGTCCTCAGACCGTTCAAGAGGTTATGTCGCTTTTATTAACCCAG
GAAAATCGTATTGAAAGTAAGTTATCGAGTTCTAAGACTGCCCTTCCCTCTGTAAACCTCACGGTTAGCCAGAAACCCCCTGAATCTGAGTCCCCGAAGCCTAATCCGAA
TCCTTATGCTCAATCCTTCACTGGTGGGAATCGAGGGCGTGGTGGTGGTCGTTATGGTTCCAACCGTGGAGGTCGTACCTGGAACAACAGAAACAGAATCCAATGTCAGG
TGTATGGAAAGTTTGGTCACACTGCTCAGCGTTGTTATTTTCGATATGCTCCATCTGGTCCTTCTAACAATCCTGGTTCATTCTCTCCTCACTTTAATCAGTCTAGTCGC
CCACATAAATTTCCACAGATGGCTGCCATGCTCACGGCTCCTGATATCAATCAAGACACCAGCTGGTACCCTGATTCCGGTGCAACCAATCATCTCACTCACTCTTTTGG
CAATCTCTCGGCAGGTACCGAGTATGGTGGCGGTAATTGTTGTCTTGTGCAAAGTGATTCTAAATATGCAAGAAAAACTCACAAACACATGTGCAAGTGCACACGATCAG
TAGCAAGTGATAAGTATTTTTATATCGTTCTCCACAGGGATTGA
Protein sequenceShow/hide protein sequence
MASNGSMNSTLDTSIATAAQIFSPGNKISIVKLTDDKFLLWKFQIITALEGFDLHHHLTDDPPSEFLTVTTEASTNEAQSVKSPNSAYLQWKRQDKILSSWLVGPMSEDI
LHQMIHCSSAKAIWSCLEQIFTTRNLAQMMKIKTNLQTIQNGGMSLKEYFSKIQQYVDALYVVGKPVDVEDHILFILSGLGSDYDSMVSIISAKIGPQTVQEVMSLLLTQ
ENRIESKLSSSKTALPSVNLTVSQKPPESESPKPNPNPYAQSFTGGNRGRGGGRYGSNRGGRTWNNRNRIQCQVYGKFGHTAQRCYFRYAPSGPSNNPGSFSPHFNQSSR
PHKFPQMAAMLTAPDINQDTSWYPDSGATNHLTHSFGNLSAGTEYGGGNCCLVQSDSKYARKTHKHMCKCTRSVASDKYFYIVLHRD