; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0025987 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0025987
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionGag/pol protein
Genome locationchr10:26220028..26226400
RNA-Seq ExpressionLag0025987
SyntenyLag0025987
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0043583.1 putative Integrase core domain [Cucumis melo var. makuwa]3.5e-4972.22Show/hide
Query:  AHVLVTNLKKLEPRSRLCQFVGYPKEPRDGLFYDPRENKVLVSTNPTFLKEDHMRNHKPQSKLILGEATKTQQELLMKMVLHQELIK--ELVLQVVILDD
        AHVLVTN KKLEPRSRLC+FVGYPKE R GLF+DP+EN+V VSTN TFL+EDHMR+HKPQ KL+L +    QQ LLMK+V HQEL+K    V  +++  D
Subjt:  AHVLVTNLKKLEPRSRLCQFVGYPKEPRDGLFYDPRENKVLVSTNPTFLKEDHMRNHKPQSKLILGEATKTQQELLMKMVLHQELIK--ELVLQVVILDD

Query:  GVEDPLFYRQAMNDVDKDQWVKAMDLEMESTYFNSVWELVDLPE
        GVEDPL Y+QAMNDVDKDQW+KAM+LEMES YFN VWELVDLPE
Subjt:  GVEDPLFYRQAMNDVDKDQWVKAMDLEMESTYFNSVWELVDLPE

KAA0047456.1 gag/pol protein [Cucumis melo var. makuwa]4.2e-4767.76Show/hide
Query:  AHVLVTNLKKLEPRSRLCQFVGYPKEPRDGLFYDPRENKVLVSTNPTFLKEDHMRNHKPQSKLILGEATKTQQELLMKMVLHQELIKELV-----LQVVI
        AHVLV N KKLEPRSRLCQFV YPKE R GLF+DP+EN+V VSTN TFL+EDHMR+HKP+SKL+L EAT     ++ ++ L   + + +       QVVI
Subjt:  AHVLVTNLKKLEPRSRLCQFVGYPKEPRDGLFYDPRENKVLVSTNPTFLKEDHMRNHKPQSKLILGEATKTQQELLMKMVLHQELIKELV-----LQVVI

Query:  LDDGVEDPLFYRQAMNDVDKDQWVKAMDLEMESTYFNSVWELVDLPERVKLI
         DDGVE PL Y+Q MNDVDKDQWVKAMDLEMES YFNSVWELVDLPE VK I
Subjt:  LDDGVEDPLFYRQAMNDVDKDQWVKAMDLEMESTYFNSVWELVDLPERVKLI

KAA0059678.1 gag/pol protein [Cucumis melo var. makuwa]6.3e-5174.15Show/hide
Query:  AHVLVTNLKKLEPRSRLCQFVGYPKEPRDGLFYDPRENKVLVSTNPTFLKEDHMRNHKPQSKLILGEATKTQQELLMKMVLHQELIKELVLQVVILDDGV
        AHVLVTN KKLEPRS LCQFVGYPKE RDGLF+DP+EN+V VSTN TFL+EDHMR+HK +SKL+L EAT     ++ ++  HQEL+K L  QVVI DDGV
Subjt:  AHVLVTNLKKLEPRSRLCQFVGYPKEPRDGLFYDPRENKVLVSTNPTFLKEDHMRNHKPQSKLILGEATKTQQELLMKMVLHQELIKELVLQVVILDDGV

Query:  EDPLFYRQAMNDVDKDQWVKAMDLEMESTYFNSVWELVDLPERVKLI
        EDPL Y+QAMNDVDK+QWVKAMDLE+ES YFNSVWELVDLPE VK I
Subjt:  EDPLFYRQAMNDVDKDQWVKAMDLEMESTYFNSVWELVDLPERVKLI

KAA0064189.1 gag/pol protein [Cucumis melo var. makuwa]3.1e-5072.79Show/hide
Query:  AHVLVTNLKKLEPRSRLCQFVGYPKEPRDGLFYDPRENKVLVSTNPTFLKEDHMRNHKPQSKLILGEATKTQQELLMKMVLHQELIKELVLQVVILDDGV
        AHVLVTN KKLEPRSRLCQFVGYPKE R G+F+DP+EN+V VSTN TFL+EDHMRNHKP+ KL+L EAT     ++ ++  HQEL+K    QVV  +DGV
Subjt:  AHVLVTNLKKLEPRSRLCQFVGYPKEPRDGLFYDPRENKVLVSTNPTFLKEDHMRNHKPQSKLILGEATKTQQELLMKMVLHQELIKELVLQVVILDDGV

Query:  EDPLFYRQAMNDVDKDQWVKAMDLEMESTYFNSVWELVDLPERVKLI
        EDPL Y+QAMNDVDKDQWVKAMDLEM+S YFNSVWELVDLPE VK I
Subjt:  EDPLFYRQAMNDVDKDQWVKAMDLEMESTYFNSVWELVDLPERVKLI

TYJ98650.1 gag/pol protein [Cucumis melo var. makuwa]4.2e-4767.76Show/hide
Query:  AHVLVTNLKKLEPRSRLCQFVGYPKEPRDGLFYDPRENKVLVSTNPTFLKEDHMRNHKPQSKLILGEATKTQQELLMKMVLHQELIKELV-----LQVVI
        AHVLV N KKLEPRSRLCQFV YPKE R GLF+DP+EN+V VSTN TFL+EDHMR+HKP+SKL+L EAT     ++ ++ L   + + +       QVVI
Subjt:  AHVLVTNLKKLEPRSRLCQFVGYPKEPRDGLFYDPRENKVLVSTNPTFLKEDHMRNHKPQSKLILGEATKTQQELLMKMVLHQELIKELV-----LQVVI

Query:  LDDGVEDPLFYRQAMNDVDKDQWVKAMDLEMESTYFNSVWELVDLPERVKLI
         DDGVE PL Y+Q MNDVDKDQWVKAMDLEMES YFNSVWELVDLPE VK I
Subjt:  LDDGVEDPLFYRQAMNDVDKDQWVKAMDLEMESTYFNSVWELVDLPERVKLI

TrEMBL top hitse value%identityAlignment
A0A5A7TJH9 Putative Integrase core domain1.7e-4972.22Show/hide
Query:  AHVLVTNLKKLEPRSRLCQFVGYPKEPRDGLFYDPRENKVLVSTNPTFLKEDHMRNHKPQSKLILGEATKTQQELLMKMVLHQELIK--ELVLQVVILDD
        AHVLVTN KKLEPRSRLC+FVGYPKE R GLF+DP+EN+V VSTN TFL+EDHMR+HKPQ KL+L +    QQ LLMK+V HQEL+K    V  +++  D
Subjt:  AHVLVTNLKKLEPRSRLCQFVGYPKEPRDGLFYDPRENKVLVSTNPTFLKEDHMRNHKPQSKLILGEATKTQQELLMKMVLHQELIK--ELVLQVVILDD

Query:  GVEDPLFYRQAMNDVDKDQWVKAMDLEMESTYFNSVWELVDLPE
        GVEDPL Y+QAMNDVDKDQW+KAM+LEMES YFN VWELVDLPE
Subjt:  GVEDPLFYRQAMNDVDKDQWVKAMDLEMESTYFNSVWELVDLPE

A0A5A7TZQ3 Gag/pol protein2.0e-4767.76Show/hide
Query:  AHVLVTNLKKLEPRSRLCQFVGYPKEPRDGLFYDPRENKVLVSTNPTFLKEDHMRNHKPQSKLILGEATKTQQELLMKMVLHQELIKELV-----LQVVI
        AHVLV N KKLEPRSRLCQFV YPKE R GLF+DP+EN+V VSTN TFL+EDHMR+HKP+SKL+L EAT     ++ ++ L   + + +       QVVI
Subjt:  AHVLVTNLKKLEPRSRLCQFVGYPKEPRDGLFYDPRENKVLVSTNPTFLKEDHMRNHKPQSKLILGEATKTQQELLMKMVLHQELIKELV-----LQVVI

Query:  LDDGVEDPLFYRQAMNDVDKDQWVKAMDLEMESTYFNSVWELVDLPERVKLI
         DDGVE PL Y+Q MNDVDKDQWVKAMDLEMES YFNSVWELVDLPE VK I
Subjt:  LDDGVEDPLFYRQAMNDVDKDQWVKAMDLEMESTYFNSVWELVDLPERVKLI

A0A5A7UWW4 Gag/pol protein3.0e-5174.15Show/hide
Query:  AHVLVTNLKKLEPRSRLCQFVGYPKEPRDGLFYDPRENKVLVSTNPTFLKEDHMRNHKPQSKLILGEATKTQQELLMKMVLHQELIKELVLQVVILDDGV
        AHVLVTN KKLEPRS LCQFVGYPKE RDGLF+DP+EN+V VSTN TFL+EDHMR+HK +SKL+L EAT     ++ ++  HQEL+K L  QVVI DDGV
Subjt:  AHVLVTNLKKLEPRSRLCQFVGYPKEPRDGLFYDPRENKVLVSTNPTFLKEDHMRNHKPQSKLILGEATKTQQELLMKMVLHQELIKELVLQVVILDDGV

Query:  EDPLFYRQAMNDVDKDQWVKAMDLEMESTYFNSVWELVDLPERVKLI
        EDPL Y+QAMNDVDK+QWVKAMDLE+ES YFNSVWELVDLPE VK I
Subjt:  EDPLFYRQAMNDVDKDQWVKAMDLEMESTYFNSVWELVDLPERVKLI

A0A5A7VF82 Gag/pol protein1.5e-5072.79Show/hide
Query:  AHVLVTNLKKLEPRSRLCQFVGYPKEPRDGLFYDPRENKVLVSTNPTFLKEDHMRNHKPQSKLILGEATKTQQELLMKMVLHQELIKELVLQVVILDDGV
        AHVLVTN KKLEPRSRLCQFVGYPKE R G+F+DP+EN+V VSTN TFL+EDHMRNHKP+ KL+L EAT     ++ ++  HQEL+K    QVV  +DGV
Subjt:  AHVLVTNLKKLEPRSRLCQFVGYPKEPRDGLFYDPRENKVLVSTNPTFLKEDHMRNHKPQSKLILGEATKTQQELLMKMVLHQELIKELVLQVVILDDGV

Query:  EDPLFYRQAMNDVDKDQWVKAMDLEMESTYFNSVWELVDLPERVKLI
        EDPL Y+QAMNDVDKDQWVKAMDLEM+S YFNSVWELVDLPE VK I
Subjt:  EDPLFYRQAMNDVDKDQWVKAMDLEMESTYFNSVWELVDLPERVKLI

A0A5D3BFS9 Gag/pol protein2.0e-4767.76Show/hide
Query:  AHVLVTNLKKLEPRSRLCQFVGYPKEPRDGLFYDPRENKVLVSTNPTFLKEDHMRNHKPQSKLILGEATKTQQELLMKMVLHQELIKELV-----LQVVI
        AHVLV N KKLEPRSRLCQFV YPKE R GLF+DP+EN+V VSTN TFL+EDHMR+HKP+SKL+L EAT     ++ ++ L   + + +       QVVI
Subjt:  AHVLVTNLKKLEPRSRLCQFVGYPKEPRDGLFYDPRENKVLVSTNPTFLKEDHMRNHKPQSKLILGEATKTQQELLMKMVLHQELIKELV-----LQVVI

Query:  LDDGVEDPLFYRQAMNDVDKDQWVKAMDLEMESTYFNSVWELVDLPERVKLI
         DDGVE PL Y+Q MNDVDKDQWVKAMDLEMES YFNSVWELVDLPE VK I
Subjt:  LDDGVEDPLFYRQAMNDVDKDQWVKAMDLEMESTYFNSVWELVDLPERVKLI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCACGCGTTCTTCACAGGGCAACGATCTTAGGTTTGAACGAGAGATCACAAGGGCGGAGACGAGACTTAGAAGAGAAAATAATGTTGAGCGACTTGAGGGA
GCAAATTCCATGATCCAACAAAGCACAGAGCAAAAGTGGCCACGATCTAATAGCACCTCGAATGAGATGAAGAAGCAGTTATTGGAAAAATTATTTCAAGCGAGC
AAAGCTGCAAATATCAGGAAGGAGATCTACGGGATCTCACAGATCACTGGGGAGATGCTCCATGAATACTGGGAGAGTTCGGTACACGGGGACTTGTCGCGCCAA
TTCAAGCTCCTACCAAGAGCAAAAGCTTGTGGACTTTGCTCGATGACTGGTCGTACTACAAATGCTTGCCCCCAGCTTCAGAAGGGATGTGAAGTGAACGTCATT
GGAGGTATGTCTTTTAATGATGCATTAAAGAAACTTGTTGAAAGTTCCCTACAGTTTCAGCAGGAGACAAGAAATGGCATCCAAAACCTAAGGAACCAGATAACC
CAATTGGCGATGCAGGTCAGTAAAATGGATAGCAAGAGCTTCGTAAAGCTTCCCACCCAACCTGAATTGAATCCAAAGGCGAACGTAAGTGCCGTGTGGAGTGGA
CTGACCACACCAAATTTTCTTCCAGGATTTTCTTCTCTATCTGATAAAAATTTTTCTTCTGTTATTGATGAGGACGATCTTGATGTTTTTAGGAGAGTGAAGGTG
AACATTCCCTTACTTGAGGCGATACTAAAAATTCGAGCGTACGCCGAGTTCCTTAGAACTTGGTATGAAGGTAAAGAAAGATCCTTAGGTAAAGAAGAGGTTAGT
GGAAACGTGAGTGTTCTGTTATCTGGTGAATTGCCTACTAAATGTTCAGACCCTGACTCTAACCCTTTGTATGATCGAGCTGATAATGTTAAGTCTATCATATTT
TGTGATGACGAGTTGCATACTTCTTTTGTTAGGTGCCTCGAGATTGAGGGGGATGTTTATGACGTAGTCGACGAGCACTGCGTGTCACTTATCAAGAAATCGGGA
GCACACGTGCTTGTGACAAATCTTAAGAAGTTGGAACCTCGTTCAAGGTTGTGCCAGTTTGTTGGTTACCCAAAAGAACCGAGAGATGGCCTTTTCTATGACCCA
CGAGAAAACAAAGTGTTGGTATCGACAAACCCTACTTTCTTGAAGGAAGACCACATGAGAAATCACAAACCACAGAGCAAGTTAATATTAGGAGAAGCTACGAAG
ACTCAACAAGAGTTGTTGATGAAGATGGTTCTTCATCAAGAGTTAATAAAGGAGCTAGTTCTTCAGGTTGTCATACTTGATGATGGCGTAGAGGATCCATTGTTC
TATAGACAGGCAATGAATGATGTAGACAAGGACCAATGGGTCAAAGCCATGGACCTTGAAATGGAGTCTACGTACTTCAATTCAGTGTGGGAACTTGTAGATCTA
CCTGAAAGGGTAAAACTCATAAGGTTTTTGGGGGATTCAATTTTGGGACCTTTTGTAGCCGTAAAACAGAGGATTGAGGCTGAACAAGAGGAGGAAAAGCGGAAA
TCAACTCATCGTTCGTGGGGATCGTGA
mRNA sequenceShow/hide mRNA sequence
ATGATCACGCGTTCTTCACAGGGCAACGATCTTAGGTTTGAACGAGAGATCACAAGGGCGGAGACGAGACTTAGAAGAGAAAATAATGTTGAGCGACTTGAGGGA
GCAAATTCCATGATCCAACAAAGCACAGAGCAAAAGTGGCCACGATCTAATAGCACCTCGAATGAGATGAAGAAGCAGTTATTGGAAAAATTATTTCAAGCGAGC
AAAGCTGCAAATATCAGGAAGGAGATCTACGGGATCTCACAGATCACTGGGGAGATGCTCCATGAATACTGGGAGAGTTCGGTACACGGGGACTTGTCGCGCCAA
TTCAAGCTCCTACCAAGAGCAAAAGCTTGTGGACTTTGCTCGATGACTGGTCGTACTACAAATGCTTGCCCCCAGCTTCAGAAGGGATGTGAAGTGAACGTCATT
GGAGGTATGTCTTTTAATGATGCATTAAAGAAACTTGTTGAAAGTTCCCTACAGTTTCAGCAGGAGACAAGAAATGGCATCCAAAACCTAAGGAACCAGATAACC
CAATTGGCGATGCAGGTCAGTAAAATGGATAGCAAGAGCTTCGTAAAGCTTCCCACCCAACCTGAATTGAATCCAAAGGCGAACGTAAGTGCCGTGTGGAGTGGA
CTGACCACACCAAATTTTCTTCCAGGATTTTCTTCTCTATCTGATAAAAATTTTTCTTCTGTTATTGATGAGGACGATCTTGATGTTTTTAGGAGAGTGAAGGTG
AACATTCCCTTACTTGAGGCGATACTAAAAATTCGAGCGTACGCCGAGTTCCTTAGAACTTGGTATGAAGGTAAAGAAAGATCCTTAGGTAAAGAAGAGGTTAGT
GGAAACGTGAGTGTTCTGTTATCTGGTGAATTGCCTACTAAATGTTCAGACCCTGACTCTAACCCTTTGTATGATCGAGCTGATAATGTTAAGTCTATCATATTT
TGTGATGACGAGTTGCATACTTCTTTTGTTAGGTGCCTCGAGATTGAGGGGGATGTTTATGACGTAGTCGACGAGCACTGCGTGTCACTTATCAAGAAATCGGGA
GCACACGTGCTTGTGACAAATCTTAAGAAGTTGGAACCTCGTTCAAGGTTGTGCCAGTTTGTTGGTTACCCAAAAGAACCGAGAGATGGCCTTTTCTATGACCCA
CGAGAAAACAAAGTGTTGGTATCGACAAACCCTACTTTCTTGAAGGAAGACCACATGAGAAATCACAAACCACAGAGCAAGTTAATATTAGGAGAAGCTACGAAG
ACTCAACAAGAGTTGTTGATGAAGATGGTTCTTCATCAAGAGTTAATAAAGGAGCTAGTTCTTCAGGTTGTCATACTTGATGATGGCGTAGAGGATCCATTGTTC
TATAGACAGGCAATGAATGATGTAGACAAGGACCAATGGGTCAAAGCCATGGACCTTGAAATGGAGTCTACGTACTTCAATTCAGTGTGGGAACTTGTAGATCTA
CCTGAAAGGGTAAAACTCATAAGGTTTTTGGGGGATTCAATTTTGGGACCTTTTGTAGCCGTAAAACAGAGGATTGAGGCTGAACAAGAGGAGGAAAAGCGGAAA
TCAACTCATCGTTCGTGGGGATCGTGA
Protein sequenceShow/hide protein sequence
MITRSSQGNDLRFEREITRAETRLRRENNVERLEGANSMIQQSTEQKWPRSNSTSNEMKKQLLEKLFQASKAANIRKEIYGISQITGEMLHEYWESSVHGDLSRQ
FKLLPRAKACGLCSMTGRTTNACPQLQKGCEVNVIGGMSFNDALKKLVESSLQFQQETRNGIQNLRNQITQLAMQVSKMDSKSFVKLPTQPELNPKANVSAVWSG
LTTPNFLPGFSSLSDKNFSSVIDEDDLDVFRRVKVNIPLLEAILKIRAYAEFLRTWYEGKERSLGKEEVSGNVSVLLSGELPTKCSDPDSNPLYDRADNVKSIIF
CDDELHTSFVRCLEIEGDVYDVVDEHCVSLIKKSGAHVLVTNLKKLEPRSRLCQFVGYPKEPRDGLFYDPRENKVLVSTNPTFLKEDHMRNHKPQSKLILGEATK
TQQELLMKMVLHQELIKELVLQVVILDDGVEDPLFYRQAMNDVDKDQWVKAMDLEMESTYFNSVWELVDLPERVKLIRFLGDSILGPFVAVKQRIEAEQEEEKRK
STHRSWGS