; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0017786 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0017786
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionIntegrase core domain containing protein
Genome locationchr5:8809307..8810023
RNA-Seq ExpressionLag0017786
SyntenyLag0017786
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026100.1 uncharacterized protein E6C27_scaffold19G00360 [Cucumis melo var. makuwa]2.2e-5050.67Show/hide
Query:  SANFSNPPLNQLLNQITSI-------------------NYKLEGHLTGVKPCPPKFVCAAAARTEPSPSSSGGNATIREAAVVSEASETASSEEINPLFE
        SA FSNPPLNQ+LNQ+ ++                    YKLEGHLTG  PCP  FV +A++ +  + +  G +ATI        AS + +   +N LFE
Subjt:  SANFSNPPLNQLLNQITSI-------------------NYKLEGHLTGVKPCPPKFVCAAAARTEPSPSSSGGNATIREAAVVSEASETASSEEINPLFE

Query:  SWVAVDQLLLGWLYNSMTPEVATQVMGFESAQGLWEVVQELFGVQSRAEEDYLRQVFQQCRKESMKMSDYLRVMKSHADNLGQTGSSVTNRALVSQVLLG
         WV  D LLLGWLYNSMTP+VA Q+MGF + + LW+  Q+ FGVQSRAEED+LRQ+ Q  RK + KM +YL VMK++ DNLGQ GS V  RAL+SQVLLG
Subjt:  SWVAVDQLLLGWLYNSMTPEVATQVMGFESAQGLWEVVQELFGVQSRAEEDYLRQVFQQCRKESMKMSDYLRVMKSHADNLGQTGSSVTNRALVSQVLLG

Query:  LDEEFNPVVAMIQGRLGITWSEM
        LDE +N V+ +IQG+  I+W +M
Subjt:  LDEEFNPVVAMIQGRLGITWSEM

XP_022148963.1 uncharacterized protein LOC111017501 [Momordica charantia]3.7e-5068.83Show/hide
Query:  SSGGNATIREAAVVSEASETASSEEINPLFESWVAVDQLLLGWLYNSMTPEVATQVMGFESAQGLWEVVQELFGVQSRAEEDYLRQVFQQCRKESMKMSD
        S G   T +       +S  A+   INPL+ESWV  DQLLLGWLYNSMTPEVATQVMG+E+A  LW  +QELFGVQS+AEEDYLRQVFQQ RK S+KM+D
Subjt:  SSGGNATIREAAVVSEASETASSEEINPLFESWVAVDQLLLGWLYNSMTPEVATQVMGFESAQGLWEVVQELFGVQSRAEEDYLRQVFQQCRKESMKMSD

Query:  YLRVMKSHADNLGQTGSSVTNRALVSQVLLGLDEEFNPVVAMIQGRLGITWSEM
        +LRVMKSHADNLGQ GS V  R+L+SQVLLGLDEE+NPVVA IQG+ GI+W EM
Subjt:  YLRVMKSHADNLGQTGSSVTNRALVSQVLLGLDEEFNPVVAMIQGRLGITWSEM

XP_022151683.1 uncharacterized protein LOC111019598 [Momordica charantia]1.1e-4950.88Show/hide
Query:  VPGSANFSNPPLNQLLNQITSI-------------------NYKLEGHLTGVKPCPPKFVCAAAARTEPSPSSSGGNATIREAAVVSEASETASSEEINP
        V   A F++PPLNQLLNQITSI                   +YKL  +LTG KPCPP         T   P+ +  N        +  ++ + SS  +NP
Subjt:  VPGSANFSNPPLNQLLNQITSI-------------------NYKLEGHLTGVKPCPPKFVCAAAARTEPSPSSSGGNATIREAAVVSEASETASSEEINP

Query:  LFESWVAVDQLLLGWLYNSMTPEVATQVMGFESAQGLWEVVQELFGVQSRAEEDYLRQVFQQCRKESMKMSDYLRVMKSHADNLGQTGSSVTNRALVSQV
         +E+W+ VD+LLLGWLYNSM  +VA QVMGF +++ LW  VQELFGVQSRAE DYL+QVFQQ  K S++M +YL++MKSHADNL   GSSV+ R LVSQV
Subjt:  LFESWVAVDQLLLGWLYNSMTPEVATQVMGFESAQGLWEVVQELFGVQSRAEEDYLRQVFQQCRKESMKMSDYLRVMKSHADNLGQTGSSVTNRALVSQV

Query:  LLGLDEEFNPVVAMIQGRLGITWSEM
        L GLDEE+NP+V  +QG++ ++WSEM
Subjt:  LLGLDEEFNPVVAMIQGRLGITWSEM

XP_038902487.1 uncharacterized protein LOC120089143 [Benincasa hispida]1.4e-4957.51Show/hide
Query:  NYKLEGHLTGVKPCPPKFVCAAAARTEPSPSSS--------GGNATIREAAVVSEASETASSEEINPLFESWVAVDQLLLGWLYNSMTPEVATQVMGFES
        +Y+LEGHLTG  PCPP+F  A    T   P            G A++     ++ AS ++   ++NP +ES   VDQLLLGWLYN MT EVA QVMG+E+
Subjt:  NYKLEGHLTGVKPCPPKFVCAAAARTEPSPSSS--------GGNATIREAAVVSEASETASSEEINPLFESWVAVDQLLLGWLYNSMTPEVATQVMGFES

Query:  AQGLWEVVQELFGVQSRAEEDYLRQVFQQCRKESMKMSDYLRVMKSHADNLGQTGSSVTNRALVSQVLLGLDEEFNPVVAMIQGRLGITWSEM
         + LW  +QELFG+QSRA EDYLRQVFQQ  K +MKM +YLRVMK+H+DNLG TGS V  RALVSQVLLGLDEEFNP VA IQGR  I+W+ M
Subjt:  AQGLWEVVQELFGVQSRAEEDYLRQVFQQCRKESMKMSDYLRVMKSHADNLGQTGSSVTNRALVSQVLLGLDEEFNPVVAMIQGRLGITWSEM

XP_038904321.1 uncharacterized protein LOC120090675 [Benincasa hispida]4.1e-4156.21Show/hide
Query:  FVCAAAARTEPSPSSSGGNATI-REAAVVSEASETASSEEINPLFESWVAVDQLLLGWLYNSMTPEVATQVMGFESAQGLWEVVQELFGVQSRAEEDYLR
        F+  A   +  + S+  G A +  E +  S AS ++++ E+NP + +W+AVDQLLLGWLYNSMTP++A QVMGFE A+ LW  +Q+LFG+QSRAEEDYLR
Subjt:  FVCAAAARTEPSPSSSGGNATI-REAAVVSEASETASSEEINPLFESWVAVDQLLLGWLYNSMTPEVATQVMGFESAQGLWEVVQELFGVQSRAEEDYLR

Query:  QVFQQCRKESMKMSDYLRVMKSHADNLGQTGSSVTNRALVSQVLLGLDEEFNPVVAMIQGRLGITWSEM
         VFQ  RK ++KM DYLR MK + DNL Q GS V  RALV QVLLGLDEE+N +VA IQGR  ++W +M
Subjt:  QVFQQCRKESMKMSDYLRVMKSHADNLGQTGSSVTNRALVSQVLLGLDEEFNPVVAMIQGRLGITWSEM

TrEMBL top hitse value%identityAlignment
A0A5A7SIT7 Uncharacterized protein1.1e-5050.67Show/hide
Query:  SANFSNPPLNQLLNQITSI-------------------NYKLEGHLTGVKPCPPKFVCAAAARTEPSPSSSGGNATIREAAVVSEASETASSEEINPLFE
        SA FSNPPLNQ+LNQ+ ++                    YKLEGHLTG  PCP  FV +A++ +  + +  G +ATI        AS + +   +N LFE
Subjt:  SANFSNPPLNQLLNQITSI-------------------NYKLEGHLTGVKPCPPKFVCAAAARTEPSPSSSGGNATIREAAVVSEASETASSEEINPLFE

Query:  SWVAVDQLLLGWLYNSMTPEVATQVMGFESAQGLWEVVQELFGVQSRAEEDYLRQVFQQCRKESMKMSDYLRVMKSHADNLGQTGSSVTNRALVSQVLLG
         WV  D LLLGWLYNSMTP+VA Q+MGF + + LW+  Q+ FGVQSRAEED+LRQ+ Q  RK + KM +YL VMK++ DNLGQ GS V  RAL+SQVLLG
Subjt:  SWVAVDQLLLGWLYNSMTPEVATQVMGFESAQGLWEVVQELFGVQSRAEEDYLRQVFQQCRKESMKMSDYLRVMKSHADNLGQTGSSVTNRALVSQVLLG

Query:  LDEEFNPVVAMIQGRLGITWSEM
        LDE +N V+ +IQG+  I+W +M
Subjt:  LDEEFNPVVAMIQGRLGITWSEM

A0A5D3BCH9 Uncharacterized protein2.6e-4150Show/hide
Query:  SANFSNPPLNQLLNQITSI-------------------NYKLEGHLTGVKPCPPKFVCAAAARTEPSPSSSGGNATIREAAVVSEASETASSEEINPLFE
        SA FSNPPLNQ+LNQ+ ++                    YKLEGHLTG  PCP  FV +A++ +  + +  G +ATI        AS + +   +N LFE
Subjt:  SANFSNPPLNQLLNQITSI-------------------NYKLEGHLTGVKPCPPKFVCAAAARTEPSPSSSGGNATIREAAVVSEASETASSEEINPLFE

Query:  SWVAVDQLLLGWLYNSMTPEVATQVMGFESAQGLWEVVQELFGVQSRAEEDYLRQVFQQCRKESMKMSDYLRVMKSHADNLGQTGSSVTNRALVSQ
         WV  D LLLGWLYNSMTP+VA Q+MGF + + LW+  Q+ FGVQSRAEED+LRQ+ Q  RK + KM +YL VMK++ DNLGQ GS V  RAL+SQ
Subjt:  SWVAVDQLLLGWLYNSMTPEVATQVMGFESAQGLWEVVQELFGVQSRAEEDYLRQVFQQCRKESMKMSDYLRVMKSHADNLGQTGSSVTNRALVSQ

A0A5D3E3L7 Uncharacterized protein6.0e-3847.54Show/hide
Query:  KLEGHLTGVKPCPPKFVCAAAARTEPSPSSSGGNATIREAAVVSEASETASSEEINPLFESWVAVDQLLLGWLYNSMTPEVATQVMGFESAQGLWEVVQE
        ++ G   G+   P KF+ + A   E + +     +++     V+ +S + +S+ +NP +E WV  D LLLG +YNSM P+VA Q+MGF +A+ LWE +Q 
Subjt:  KLEGHLTGVKPCPPKFVCAAAARTEPSPSSSGGNATIREAAVVSEASETASSEEINPLFESWVAVDQLLLGWLYNSMTPEVATQVMGFESAQGLWEVVQE

Query:  LFGVQSRAEEDYLRQVFQQCRKESMKMSDYLRVMKSHADNLGQTGSSVTNRALVSQVLLGLDEEFNPVVAMIQGRLGITWSEM
        LFG++SRAEE +LR  FQ  R+ + KM DYLR+MK +ADNLGQ GS V +R L+SQVLLGLDE +NPV A+IQG+  I+W +M
Subjt:  LFGVQSRAEEDYLRQVFQQCRKESMKMSDYLRVMKSHADNLGQTGSSVTNRALVSQVLLGLDEEFNPVVAMIQGRLGITWSEM

A0A6J1D5J0 uncharacterized protein LOC1110175011.8e-5068.83Show/hide
Query:  SSGGNATIREAAVVSEASETASSEEINPLFESWVAVDQLLLGWLYNSMTPEVATQVMGFESAQGLWEVVQELFGVQSRAEEDYLRQVFQQCRKESMKMSD
        S G   T +       +S  A+   INPL+ESWV  DQLLLGWLYNSMTPEVATQVMG+E+A  LW  +QELFGVQS+AEEDYLRQVFQQ RK S+KM+D
Subjt:  SSGGNATIREAAVVSEASETASSEEINPLFESWVAVDQLLLGWLYNSMTPEVATQVMGFESAQGLWEVVQELFGVQSRAEEDYLRQVFQQCRKESMKMSD

Query:  YLRVMKSHADNLGQTGSSVTNRALVSQVLLGLDEEFNPVVAMIQGRLGITWSEM
        +LRVMKSHADNLGQ GS V  R+L+SQVLLGLDEE+NPVVA IQG+ GI+W EM
Subjt:  YLRVMKSHADNLGQTGSSVTNRALVSQVLLGLDEEFNPVVAMIQGRLGITWSEM

A0A6J1DCW4 uncharacterized protein LOC1110195985.3e-5050.88Show/hide
Query:  VPGSANFSNPPLNQLLNQITSI-------------------NYKLEGHLTGVKPCPPKFVCAAAARTEPSPSSSGGNATIREAAVVSEASETASSEEINP
        V   A F++PPLNQLLNQITSI                   +YKL  +LTG KPCPP         T   P+ +  N        +  ++ + SS  +NP
Subjt:  VPGSANFSNPPLNQLLNQITSI-------------------NYKLEGHLTGVKPCPPKFVCAAAARTEPSPSSSGGNATIREAAVVSEASETASSEEINP

Query:  LFESWVAVDQLLLGWLYNSMTPEVATQVMGFESAQGLWEVVQELFGVQSRAEEDYLRQVFQQCRKESMKMSDYLRVMKSHADNLGQTGSSVTNRALVSQV
         +E+W+ VD+LLLGWLYNSM  +VA QVMGF +++ LW  VQELFGVQSRAE DYL+QVFQQ  K S++M +YL++MKSHADNL   GSSV+ R LVSQV
Subjt:  LFESWVAVDQLLLGWLYNSMTPEVATQVMGFESAQGLWEVVQELFGVQSRAEEDYLRQVFQQCRKESMKMSDYLRVMKSHADNLGQTGSSVTNRALVSQV

Query:  LLGLDEEFNPVVAMIQGRLGITWSEM
        L GLDEE+NP+V  +QG++ ++WSEM
Subjt:  LLGLDEEFNPVVAMIQGRLGITWSEM

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)9.6e-0426.72Show/hide
Query:  SWVAVDQLLLGWLYNSMTP-EVATQVMGFESAQGLWEVVQELFGVQSRAEEDYLRQVFQQCRKESMKMSDYLRVMKSHADNLGQTGSSVTNRALVSQVLL
        +W   D ++   LY ++TP +     +   +++ +W  ++  F     A    L    +      M+++DY R MK  AD+L      VT+R LV  VL 
Subjt:  SWVAVDQLLLGWLYNSMTP-EVATQVMGFESAQGLWEVVQELFGVQSRAEEDYLRQVFQQCRKESMKMSDYLRVMKSHADNLGQTGSSVTNRALVSQVLL

Query:  GLDEEFNPVVAMIQGR
        GL+ +F+ ++ +I+ R
Subjt:  GLDEEFNPVVAMIQGR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTAGCGTCAATTCCACCGTGATCTCAGCCTTAGTTCCTGGATCGGCTAATTTTAGCAATCCACCGTTGAACCAGCTACTGAATCAGATAACCTCTATAAATTATAA
GTTGGAAGGGCATCTCACGGGCGTTAAACCCTGTCCTCCGAAATTTGTCTGTGCCGCTGCAGCAAGGACTGAACCTTCTCCCTCATCGTCGGGAGGAAATGCGACAATTC
GAGAAGCTGCTGTTGTGAGTGAAGCTTCTGAGACTGCATCGTCTGAGGAAATTAACCCACTGTTTGAATCATGGGTTGCTGTGGATCAATTACTCCTAGGTTGGCTCTAC
AATTCAATGACGCCGGAGGTGGCTACTCAAGTAATGGGGTTTGAAAGTGCTCAAGGACTGTGGGAAGTAGTGCAGGAGCTGTTTGGAGTCCAATCTCGTGCTGAAGAAGA
CTACCTACGCCAGGTATTTCAGCAATGTAGAAAAGAAAGTATGAAAATGTCTGACTATTTGCGTGTAATGAAATCCCATGCAGATAACCTAGGGCAGACCGGCAGTTCGG
TCACTAATCGAGCGTTAGTTTCACAAGTCCTTTTAGGACTAGATGAAGAATTTAATCCAGTCGTGGCAATGATTCAAGGTCGATTAGGCATCACATGGTCTGAGATGTAA
mRNA sequenceShow/hide mRNA sequence
ATGACTAGCGTCAATTCCACCGTGATCTCAGCCTTAGTTCCTGGATCGGCTAATTTTAGCAATCCACCGTTGAACCAGCTACTGAATCAGATAACCTCTATAAATTATAA
GTTGGAAGGGCATCTCACGGGCGTTAAACCCTGTCCTCCGAAATTTGTCTGTGCCGCTGCAGCAAGGACTGAACCTTCTCCCTCATCGTCGGGAGGAAATGCGACAATTC
GAGAAGCTGCTGTTGTGAGTGAAGCTTCTGAGACTGCATCGTCTGAGGAAATTAACCCACTGTTTGAATCATGGGTTGCTGTGGATCAATTACTCCTAGGTTGGCTCTAC
AATTCAATGACGCCGGAGGTGGCTACTCAAGTAATGGGGTTTGAAAGTGCTCAAGGACTGTGGGAAGTAGTGCAGGAGCTGTTTGGAGTCCAATCTCGTGCTGAAGAAGA
CTACCTACGCCAGGTATTTCAGCAATGTAGAAAAGAAAGTATGAAAATGTCTGACTATTTGCGTGTAATGAAATCCCATGCAGATAACCTAGGGCAGACCGGCAGTTCGG
TCACTAATCGAGCGTTAGTTTCACAAGTCCTTTTAGGACTAGATGAAGAATTTAATCCAGTCGTGGCAATGATTCAAGGTCGATTAGGCATCACATGGTCTGAGATGTAA
Protein sequenceShow/hide protein sequence
MTSVNSTVISALVPGSANFSNPPLNQLLNQITSINYKLEGHLTGVKPCPPKFVCAAAARTEPSPSSSGGNATIREAAVVSEASETASSEEINPLFESWVAVDQLLLGWLY
NSMTPEVATQVMGFESAQGLWEVVQELFGVQSRAEEDYLRQVFQQCRKESMKMSDYLRVMKSHADNLGQTGSSVTNRALVSQVLLGLDEEFNPVVAMIQGRLGITWSEM