; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0001600 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0001600
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr4:33378011..33378424
RNA-Seq ExpressionLag0001600
SyntenyLag0001600
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022151683.1 uncharacterized protein LOC111019598 [Momordica charantia]1.4e-3462.31Show/hide
Query:  MGFENSQELWATIQDLFGLQSRAEEDYLRQVFQQTRKEGMKMAEYLRLMKTHADSLGQAGSPVSVRTLVSQVLLGLDEEYNAVVAMIQGPAIITWSEMHV
        MGF  S+ELW  +Q+LFG+QSRAE DYL+QVFQQT K  ++M EYL+LMK+HAD+L  AGS VSVR LVSQVL GLDEEYN +V  +QG   ++WSEMH 
Subjt:  MGFENSQELWATIQDLFGLQSRAEEDYLRQVFQQTRKEGMKMAEYLRLMKTHADSLGQAGSPVSVRTLVSQVLLGLDEEYNAVVAMIQGPAIITWSEMHV

Query:  ELLVFEKRLELQNTRKSA--VNFSHNTSVN
        ELL +EKRLE QN+ KS   +N +   SVN
Subjt:  ELLVFEKRLELQNTRKSA--VNFSHNTSVN

XP_038896600.1 uncharacterized protein LOC120084860 [Benincasa hispida]2.1e-3563.04Show/hide
Query:  MGFENSQELWATIQDLFGLQSRAEEDYLRQVFQQTRKEGMKMAEYLRLMKTHADSLGQAGSPVSVRTLVSQVLLGLDEEYNAVVAMIQGPAIITWSEMHV
        MG+E ++ELW  IQ+LFG+QSRAEEDYLRQ+FQQTRK G KM+ YL+LMK H+D+L Q  SPVS RTL+SQVLLGLDEEYN VV  IQG   I+W +M  
Subjt:  MGFENSQELWATIQDLFGLQSRAEEDYLRQVFQQTRKEGMKMAEYLRLMKTHADSLGQAGSPVSVRTLVSQVLLGLDEEYNAVVAMIQGPAIITWSEMHV

Query:  ELLVFEKRLELQNTRKSAVNFSHNTSVNMANSSN-NGG
        ELL +EKRLE QN+ K    F+    VN+A S N NGG
Subjt:  ELLVFEKRLELQNTRKSAVNFSHNTSVNMANSSN-NGG

XP_038902487.1 uncharacterized protein LOC120089143 [Benincasa hispida]2.9e-3266.38Show/hide
Query:  MGFENSQELWATIQDLFGLQSRAEEDYLRQVFQQTRKEGMKMAEYLRLMKTHADSLGQAGSPVSVRTLVSQVLLGLDEEYNAVVAMIQGPAIITWSEMHV
        MG+EN + LWA IQ+LFGLQSRA EDYLRQVFQQT K  MKM EYLR+MKTH+D+LG  GSPV  R LVSQVLLGLDEE+N  VA IQG + I+W+ M  
Subjt:  MGFENSQELWATIQDLFGLQSRAEEDYLRQVFQQTRKEGMKMAEYLRLMKTHADSLGQAGSPVSVRTLVSQVLLGLDEEYNAVVAMIQGPAIITWSEMHV

Query:  ELLVFEKRLELQNTRK
        ELL FEKR    N ++
Subjt:  ELLVFEKRLELQNTRK

XP_038905161.1 uncharacterized protein LOC120091275 isoform X1 [Benincasa hispida]5.8e-3359.7Show/hide
Query:  MGFENSQELWATIQDLFGLQSRAEEDYLRQVFQQTRKEGMKMAEYLRLMKTHADSLGQAGSPVSVRTLVSQVLLGLDEEYNAVVAMIQGPAIITWSEMHV
        MG E +++LW +I  LFG+QSR EEDYLR VFQ TRK  +KM EYL+ MK + D+L QAGSP+  RTLVSQVLLGLDEEYNA+VAMIQG   ++W +M  
Subjt:  MGFENSQELWATIQDLFGLQSRAEEDYLRQVFQQTRKEGMKMAEYLRLMKTHADSLGQAGSPVSVRTLVSQVLLGLDEEYNAVVAMIQGPAIITWSEMHV

Query:  ELLVFEKRLELQNTRKSAVNFSH--NTSVNMANS
        ELL++E+RLE Q+ +K+ V F+   N SVNM N+
Subjt:  ELLVFEKRLELQNTRKSAVNFSH--NTSVNMANS

XP_038905164.1 uncharacterized protein LOC120091275 isoform X4 [Benincasa hispida]5.8e-3359.7Show/hide
Query:  MGFENSQELWATIQDLFGLQSRAEEDYLRQVFQQTRKEGMKMAEYLRLMKTHADSLGQAGSPVSVRTLVSQVLLGLDEEYNAVVAMIQGPAIITWSEMHV
        MG E +++LW +I  LFG+QSR EEDYLR VFQ TRK  +KM EYL+ MK + D+L QAGSP+  RTLVSQVLLGLDEEYNA+VAMIQG   ++W +M  
Subjt:  MGFENSQELWATIQDLFGLQSRAEEDYLRQVFQQTRKEGMKMAEYLRLMKTHADSLGQAGSPVSVRTLVSQVLLGLDEEYNAVVAMIQGPAIITWSEMHV

Query:  ELLVFEKRLELQNTRKSAVNFSH--NTSVNMANS
        ELL++E+RLE Q+ +K+ V F+   N SVNM N+
Subjt:  ELLVFEKRLELQNTRKSAVNFSH--NTSVNMANS

TrEMBL top hitse value%identityAlignment
A0A0A0LXB7 Uncharacterized protein1.8e-3254.81Show/hide
Query:  MGFENSQELWATIQDLFGLQSRAEEDYLRQVFQQTRKEGMKMAEYLRLMKTHADSLGQAGSPVSVRTLVSQVLLGLDEEYNAVVAMIQGPAIITWSEMHV
        +GF N++++W    D FG++SRAEED+LRQ FQ TRK    M +YLR+MKT+AD+LGQA SP+  R L+SQVLLGLDE YN V+ +IQG   I+W +M  
Subjt:  MGFENSQELWATIQDLFGLQSRAEEDYLRQVFQQTRKEGMKMAEYLRLMKTHADSLGQAGSPVSVRTLVSQVLLGLDEEYNAVVAMIQGPAIITWSEMHV

Query:  ELLVFEKRLELQNTRKSAVNFSHNTSVNMANSSNN
        +LL+FEKRL+ QN++K+  N   N ++NMA S NN
Subjt:  ELLVFEKRLELQNTRKSAVNFSHNTSVNMANSSNN

A0A5A7SIT7 Uncharacterized protein6.4e-3055.64Show/hide
Query:  MGFENSQELWATIQDLFGLQSRAEEDYLRQVFQQTRKEGMKMAEYLRLMKTHADSLGQAGSPVSVRTLVSQVLLGLDEEYNAVVAMIQGPAIITWSEMHV
        MGF N ++LW   QD FG+QSRAEED+LRQ+ Q TRK   KM EYL +MKT+ D+LGQ GSPV  R L+SQVLLGLDE YN V+ +IQG   I+W +M  
Subjt:  MGFENSQELWATIQDLFGLQSRAEEDYLRQVFQQTRKEGMKMAEYLRLMKTHADSLGQAGSPVSVRTLVSQVLLGLDEEYNAVVAMIQGPAIITWSEMHV

Query:  ELLVFEKRLELQNT---RKSAVNFSHNTSVNMA
        +LL+FEK L+ QNT   +K   N + + ++NMA
Subjt:  ELLVFEKRLELQNT---RKSAVNFSHNTSVNMA

A0A5D3C373 Retrovirus-related Pol polyprotein from transposon TNT 1-944.0e-3257.78Show/hide
Query:  MGFENSQELWATIQDLFGLQSRAEEDYLRQVFQQTRKEGMKMAEYLRLMKTHADSLGQAGSPVSVRTLVSQVLLGLDEEYNAVVAMIQGPAIITWSEMHV
        MGF N+++LW   QDLFG+QSRAEED+LRQ+FQ TRK      +YLR+MKT++D LGQAGSPV  R  +SQ LLGLDE YN V+A+IQG   I+W +M  
Subjt:  MGFENSQELWATIQDLFGLQSRAEEDYLRQVFQQTRKEGMKMAEYLRLMKTHADSLGQAGSPVSVRTLVSQVLLGLDEEYNAVVAMIQGPAIITWSEMHV

Query:  ELLVFEKRLELQNTRKSAVNFSHNTSVNMANSSNN
        ELL FEKRLE Q+T+K+  N   N  VN+A + N+
Subjt:  ELLVFEKRLELQNTRKSAVNFSHNTSVNMANSSNN

A0A6J1D5J0 uncharacterized protein LOC1110175012.6e-3161.29Show/hide
Query:  MGFENSQELWATIQDLFGLQSRAEEDYLRQVFQQTRKEGMKMAEYLRLMKTHADSLGQAGSPVSVRTLVSQVLLGLDEEYNAVVAMIQGPAIITWSEMHV
        MG+EN+ +LWA IQ+LFG+QS+AEEDYLRQVFQQTRK  +KM ++LR+MK+HAD+LGQAGSPV  R+L+SQVLLGLDEEYN VVA IQG   I+W EM  
Subjt:  MGFENSQELWATIQDLFGLQSRAEEDYLRQVFQQTRKEGMKMAEYLRLMKTHADSLGQAGSPVSVRTLVSQVLLGLDEEYNAVVAMIQGPAIITWSEMHV

Query:  ELLVFEKRLELQNTRKSAVNFSHN
        E        + QN + S   F++N
Subjt:  ELLVFEKRLELQNTRKSAVNFSHN

A0A6J1DCW4 uncharacterized protein LOC1110195986.7e-3562.31Show/hide
Query:  MGFENSQELWATIQDLFGLQSRAEEDYLRQVFQQTRKEGMKMAEYLRLMKTHADSLGQAGSPVSVRTLVSQVLLGLDEEYNAVVAMIQGPAIITWSEMHV
        MGF  S+ELW  +Q+LFG+QSRAE DYL+QVFQQT K  ++M EYL+LMK+HAD+L  AGS VSVR LVSQVL GLDEEYN +V  +QG   ++WSEMH 
Subjt:  MGFENSQELWATIQDLFGLQSRAEEDYLRQVFQQTRKEGMKMAEYLRLMKTHADSLGQAGSPVSVRTLVSQVLLGLDEEYNAVVAMIQGPAIITWSEMHV

Query:  ELLVFEKRLELQNTRKSA--VNFSHNTSVN
        ELL +EKRLE QN+ KS   +N +   SVN
Subjt:  ELLVFEKRLELQNTRKSA--VNFSHNTSVN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)1.4e-0526.77Show/hide
Query:  SQELWATIQDLFGLQSRAEEDYLRQVFQQTRKEGMKMAEYLRLMKTHADSLGQAGSPVSVRTLVSQVLLGLDEEYNAVVAMIQGPA-IITWSEMHVELLV
        +++LW ++++LF     A         + T  + + + EY + +K+ +D L    SP+S R LV  +L GL E+Y+ ++ +I+  +   +++E    LL+
Subjt:  SQELWATIQDLFGLQSRAEEDYLRQVFQQTRKEGMKMAEYLRLMKTHADSLGQAGSPVSVRTLVSQVLLGLDEEYNAVVAMIQGPA-IITWSEMHVELLV

Query:  FEKRLELQNTRKSAVNFSHNTSVNMAN
         E RL    + KS  + SH    +++N
Subjt:  FEKRLELQNTRKSAVNFSHNTSVNMAN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCTTCGAAAACTCTCAAGAGCTGTGGGCAACAATACAAGACCTATTTGGTCTTCAATCCCGTGCTGAGGAAGACTACCTCAGACAAGTTTTTCAACAGACTCGCAA
AGAAGGTATGAAAATGGCTGAATACTTGCGTTTAATGAAAACTCACGCTGATAGTCTTGGTCAAGCAGGAAGTCCAGTATCGGTAAGAACACTGGTATCTCAAGTTCTTT
TAGGACTTGACGAGGAGTATAATGCAGTGGTAGCAATGATTCAAGGACCAGCAATCATTACATGGTCGGAGATGCATGTTGAGCTCTTGGTGTTTGAGAAGAGACTTGAG
TTACAAAACACCCGGAAATCAGCTGTTAATTTCAGTCACAACACCTCAGTAAACATGGCTAACAGTAGTAACAATGGAGGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGCTTCGAAAACTCTCAAGAGCTGTGGGCAACAATACAAGACCTATTTGGTCTTCAATCCCGTGCTGAGGAAGACTACCTCAGACAAGTTTTTCAACAGACTCGCAA
AGAAGGTATGAAAATGGCTGAATACTTGCGTTTAATGAAAACTCACGCTGATAGTCTTGGTCAAGCAGGAAGTCCAGTATCGGTAAGAACACTGGTATCTCAAGTTCTTT
TAGGACTTGACGAGGAGTATAATGCAGTGGTAGCAATGATTCAAGGACCAGCAATCATTACATGGTCGGAGATGCATGTTGAGCTCTTGGTGTTTGAGAAGAGACTTGAG
TTACAAAACACCCGGAAATCAGCTGTTAATTTCAGTCACAACACCTCAGTAAACATGGCTAACAGTAGTAACAATGGAGGTTAG
Protein sequenceShow/hide protein sequence
MGFENSQELWATIQDLFGLQSRAEEDYLRQVFQQTRKEGMKMAEYLRLMKTHADSLGQAGSPVSVRTLVSQVLLGLDEEYNAVVAMIQGPAIITWSEMHVELLVFEKRLE
LQNTRKSAVNFSHNTSVNMANSSNNGG