; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0020186 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0020186
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationchr5:48653433..48657238
RNA-Seq ExpressionLag0020186
SyntenyLag0020186
Gene Ontology termsGO:0090304 - nucleic acid metabolic process (biological process)
GO:0000166 - nucleotide binding (molecular function)
GO:0003677 - DNA binding (molecular function)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR026960 - Reverse transcriptase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6576609.1 hypothetical protein SDJN03_24183, partial [Cucurbita argyrosperma subsp. sororia]4.9e-1461.84Show/hide
Query:  VHL-SHESMDWKKEVEVEVVTISINLYKWG-EAQLWSLNRTYEESLRLLDETNAAVEMHKHGGCCLDLSGVDLHLI
        +HL S  +++W K  +        +L +   +AQLWSLNRTYEESLRLLDETNAAVEMHKHGGC LDLSGVDL L+
Subjt:  VHL-SHESMDWKKEVEVEVVTISINLYKWG-EAQLWSLNRTYEESLRLLDETNAAVEMHKHGGCCLDLSGVDLHLI

KAG7014662.1 mutS2 [Cucurbita argyrosperma subsp. argyrosperma]4.9e-1461.84Show/hide
Query:  VHL-SHESMDWKKEVEVEVVTISINLYKWG-EAQLWSLNRTYEESLRLLDETNAAVEMHKHGGCCLDLSGVDLHLI
        +HL S  +++W K  +        +L +   +AQLWSLNRTYEESLRLLDETNAAVEMHKHGGC LDLSGVDL L+
Subjt:  VHL-SHESMDWKKEVEVEVVTISINLYKWG-EAQLWSLNRTYEESLRLLDETNAAVEMHKHGGCCLDLSGVDLHLI

XP_022922841.1 uncharacterized protein LOC111430703 isoform X2 [Cucurbita moschata]4.9e-1461.84Show/hide
Query:  VHL-SHESMDWKKEVEVEVVTISINLYKWG-EAQLWSLNRTYEESLRLLDETNAAVEMHKHGGCCLDLSGVDLHLI
        +HL S  +++W K  +        +L +   +AQLWSLNRTYEESLRLLDETNAAVEMHKHGGC LDLSGVDL L+
Subjt:  VHL-SHESMDWKKEVEVEVVTISINLYKWG-EAQLWSLNRTYEESLRLLDETNAAVEMHKHGGCCLDLSGVDLHLI

XP_022985051.1 uncharacterized protein LOC111483140 isoform X2 [Cucurbita maxima]4.9e-1461.84Show/hide
Query:  VHL-SHESMDWKKEVEVEVVTISINLYKWG-EAQLWSLNRTYEESLRLLDETNAAVEMHKHGGCCLDLSGVDLHLI
        +HL S  +++W K  +        +L +   +AQLWSLNRTYEESLRLLDETNAAVEMHKHGGC LDLSGVDL L+
Subjt:  VHL-SHESMDWKKEVEVEVVTISINLYKWG-EAQLWSLNRTYEESLRLLDETNAAVEMHKHGGCCLDLSGVDLHLI

XP_022985054.1 uncharacterized protein LOC111483140 isoform X5 [Cucurbita maxima]4.9e-1461.84Show/hide
Query:  VHL-SHESMDWKKEVEVEVVTISINLYKWG-EAQLWSLNRTYEESLRLLDETNAAVEMHKHGGCCLDLSGVDLHLI
        +HL S  +++W K  +        +L +   +AQLWSLNRTYEESLRLLDETNAAVEMHKHGGC LDLSGVDL L+
Subjt:  VHL-SHESMDWKKEVEVEVVTISINLYKWG-EAQLWSLNRTYEESLRLLDETNAAVEMHKHGGCCLDLSGVDLHLI

TrEMBL top hitse value%identityAlignment
A0A6J1J3U0 uncharacterized protein LOC111483140 isoform X32.4e-1461.84Show/hide
Query:  VHL-SHESMDWKKEVEVEVVTISINLYKWG-EAQLWSLNRTYEESLRLLDETNAAVEMHKHGGCCLDLSGVDLHLI
        +HL S  +++W K  +        +L +   +AQLWSLNRTYEESLRLLDETNAAVEMHKHGGC LDLSGVDL L+
Subjt:  VHL-SHESMDWKKEVEVEVVTISINLYKWG-EAQLWSLNRTYEESLRLLDETNAAVEMHKHGGCCLDLSGVDLHLI

A0A6J1J709 uncharacterized protein LOC111483140 isoform X22.4e-1461.84Show/hide
Query:  VHL-SHESMDWKKEVEVEVVTISINLYKWG-EAQLWSLNRTYEESLRLLDETNAAVEMHKHGGCCLDLSGVDLHLI
        +HL S  +++W K  +        +L +   +AQLWSLNRTYEESLRLLDETNAAVEMHKHGGC LDLSGVDL L+
Subjt:  VHL-SHESMDWKKEVEVEVVTISINLYKWG-EAQLWSLNRTYEESLRLLDETNAAVEMHKHGGCCLDLSGVDLHLI

A0A6J1JA95 uncharacterized protein LOC111483140 isoform X12.4e-1461.84Show/hide
Query:  VHL-SHESMDWKKEVEVEVVTISINLYKWG-EAQLWSLNRTYEESLRLLDETNAAVEMHKHGGCCLDLSGVDLHLI
        +HL S  +++W K  +        +L +   +AQLWSLNRTYEESLRLLDETNAAVEMHKHGGC LDLSGVDL L+
Subjt:  VHL-SHESMDWKKEVEVEVVTISINLYKWG-EAQLWSLNRTYEESLRLLDETNAAVEMHKHGGCCLDLSGVDLHLI

A0A6J1JC76 uncharacterized protein LOC111483140 isoform X52.4e-1461.84Show/hide
Query:  VHL-SHESMDWKKEVEVEVVTISINLYKWG-EAQLWSLNRTYEESLRLLDETNAAVEMHKHGGCCLDLSGVDLHLI
        +HL S  +++W K  +        +L +   +AQLWSLNRTYEESLRLLDETNAAVEMHKHGGC LDLSGVDL L+
Subjt:  VHL-SHESMDWKKEVEVEVVTISINLYKWG-EAQLWSLNRTYEESLRLLDETNAAVEMHKHGGCCLDLSGVDLHLI

A0A6J1JCF6 uncharacterized protein LOC111483140 isoform X42.4e-1461.84Show/hide
Query:  VHL-SHESMDWKKEVEVEVVTISINLYKWG-EAQLWSLNRTYEESLRLLDETNAAVEMHKHGGCCLDLSGVDLHLI
        +HL S  +++W K  +        +L +   +AQLWSLNRTYEESLRLLDETNAAVEMHKHGGC LDLSGVDL L+
Subjt:  VHL-SHESMDWKKEVEVEVVTISINLYKWG-EAQLWSLNRTYEESLRLLDETNAAVEMHKHGGCCLDLSGVDLHLI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G54090.1 DNA mismatch repair protein MutS, type 21.5e-0835.63Show/hide
Query:  SPCCLVHLSHES-------MDWKKEVEVEVVTISINLYKWG-EAQLWSLNRTYEESLRLLDETNAAVEMHKHGGCCLDLSGVDLHLI
        SP  + H   +S       ++W K  +V       +L +   + +LWSL++++ ESL+LLDET+AA++M +HG  CLDLS + + L+
Subjt:  SPCCLVHLSHES-------MDWKKEVEVEVVTISINLYKWG-EAQLWSLNRTYEESLRLLDETNAAVEMHKHGGCCLDLSGVDLHLI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAAACAGCTTTGATCAAAGATGTTTGGAAGGAGGAAAACAGTTTTTGGACCTACCGATTAGAAGAAACTTAAAAGACAAGGAAGTTGCTGAATGGACTGACTTAAG
TCTCAATCTCACTCCTATTGTTCTTTCCTCGAAGGAAGACCCTTCATTAGCCAAAGGAATCTGGAAAGACAAGTACCCAAAAAAGGTAAAATTCTTCCTCTGGGAGGTGG
TTCACAAAGCCATCAGAACAAAAGAAAATCTTCAAAGAAGAATGCCTTATTTGGCTTTGTCTCCAAGCTGGTGCACTTTGTGTAAAGCTAACTATGAATCTCAAAACCAC
CTTTTCATCCACTGGCCCTACAATATTACTTCTGGAACAAAATTTTGCAAGCTTTTAGATGGTATTTACCCTTTCCTGGGGAGGAAGTTGGGCATGGTTTTGTACCATCA
ACATAACCAGTTAGATCATAGGTCACAAAAAGTGCGTGAAACTGGAGCTTTCATGAGACAACTAATGGAGACATGTTTGATTGGGCTTGAGTTCAAACATTTGTGGCAAT
CACCGTGTTGTTTGGTCCATTTATCTCATGAATCAATGGATTGGAAAAAGGAGGTTGAGGTTGAAGTGGTGACCATCTCGATAAATCTGTACAAGTGGGGAGAGGCCCAA
CTTTGGTCTTTGAACCGGACATATGAAGAAAGCTTGAGACTTTTGGATGAGACTAATGCTGCAGTAGAAATGCACAAGCATGGTGGCTGCTGCTTGGATTTAAGTGGCGT
TGACCTTCATCTGATTGCCCCATTAGGTTACAACTTGGTTCCTCCTTGTTTCCCATCTTTCCAAGGATATTGCAAAAGTTCAACCAATGAGTTTTCAAGCCAACCTAAGA
GGGATAGAGAAAGATACAACAACTCATCTCTTCCAGCACTTCCATTGAACCATAAACCAGAACTCAACCACAAACCAAAGGGACCCACAAATAAGGAGCAGCTGTCGGCC
AAGTGGCTAACAGACCGGACTGGACAACGAAACGACCGGCGAAGAATGGCAAGTGAACGACGAATAGAAGGCGTCGAACTGACTGGACGGACGACAGCCAACAAAATGCC
AAGAACAAATATCAACGATGCCTCTTTAAGAGAGAATAGAGAACAAAGAGATATTTGGGACAATAAAATGGGAAAGAAACCCAGAGATGAATTTCCGACGGCGATGGCGA
ATTTCGGTAGCACAAGCAAGTTGCAGTGGCGGCGGGAGTGGAAGAGTCGGCCGGCAGCGGCGACGTCGACTTACCTATAG
mRNA sequenceShow/hide mRNA sequence
ATGCAAAACAGCTTTGATCAAAGATGTTTGGAAGGAGGAAAACAGTTTTTGGACCTACCGATTAGAAGAAACTTAAAAGACAAGGAAGTTGCTGAATGGACTGACTTAAG
TCTCAATCTCACTCCTATTGTTCTTTCCTCGAAGGAAGACCCTTCATTAGCCAAAGGAATCTGGAAAGACAAGTACCCAAAAAAGGTAAAATTCTTCCTCTGGGAGGTGG
TTCACAAAGCCATCAGAACAAAAGAAAATCTTCAAAGAAGAATGCCTTATTTGGCTTTGTCTCCAAGCTGGTGCACTTTGTGTAAAGCTAACTATGAATCTCAAAACCAC
CTTTTCATCCACTGGCCCTACAATATTACTTCTGGAACAAAATTTTGCAAGCTTTTAGATGGTATTTACCCTTTCCTGGGGAGGAAGTTGGGCATGGTTTTGTACCATCA
ACATAACCAGTTAGATCATAGGTCACAAAAAGTGCGTGAAACTGGAGCTTTCATGAGACAACTAATGGAGACATGTTTGATTGGGCTTGAGTTCAAACATTTGTGGCAAT
CACCGTGTTGTTTGGTCCATTTATCTCATGAATCAATGGATTGGAAAAAGGAGGTTGAGGTTGAAGTGGTGACCATCTCGATAAATCTGTACAAGTGGGGAGAGGCCCAA
CTTTGGTCTTTGAACCGGACATATGAAGAAAGCTTGAGACTTTTGGATGAGACTAATGCTGCAGTAGAAATGCACAAGCATGGTGGCTGCTGCTTGGATTTAAGTGGCGT
TGACCTTCATCTGATTGCCCCATTAGGTTACAACTTGGTTCCTCCTTGTTTCCCATCTTTCCAAGGATATTGCAAAAGTTCAACCAATGAGTTTTCAAGCCAACCTAAGA
GGGATAGAGAAAGATACAACAACTCATCTCTTCCAGCACTTCCATTGAACCATAAACCAGAACTCAACCACAAACCAAAGGGACCCACAAATAAGGAGCAGCTGTCGGCC
AAGTGGCTAACAGACCGGACTGGACAACGAAACGACCGGCGAAGAATGGCAAGTGAACGACGAATAGAAGGCGTCGAACTGACTGGACGGACGACAGCCAACAAAATGCC
AAGAACAAATATCAACGATGCCTCTTTAAGAGAGAATAGAGAACAAAGAGATATTTGGGACAATAAAATGGGAAAGAAACCCAGAGATGAATTTCCGACGGCGATGGCGA
ATTTCGGTAGCACAAGCAAGTTGCAGTGGCGGCGGGAGTGGAAGAGTCGGCCGGCAGCGGCGACGTCGACTTACCTATAG
Protein sequenceShow/hide protein sequence
MQNSFDQRCLEGGKQFLDLPIRRNLKDKEVAEWTDLSLNLTPIVLSSKEDPSLAKGIWKDKYPKKVKFFLWEVVHKAIRTKENLQRRMPYLALSPSWCTLCKANYESQNH
LFIHWPYNITSGTKFCKLLDGIYPFLGRKLGMVLYHQHNQLDHRSQKVRETGAFMRQLMETCLIGLEFKHLWQSPCCLVHLSHESMDWKKEVEVEVVTISINLYKWGEAQ
LWSLNRTYEESLRLLDETNAAVEMHKHGGCCLDLSGVDLHLIAPLGYNLVPPCFPSFQGYCKSSTNEFSSQPKRDRERYNNSSLPALPLNHKPELNHKPKGPTNKEQLSA
KWLTDRTGQRNDRRRMASERRIEGVELTGRTTANKMPRTNINDASLRENREQRDIWDNKMGKKPRDEFPTAMANFGSTSKLQWRREWKSRPAAATSTYL