; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0025277 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0025277
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr10:10721325..10731456
RNA-Seq ExpressionLag0025277
SyntenyLag0025277
Gene Ontology termsGO:0090304 - nucleic acid metabolic process (biological process)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
RYE04563.1 hypothetical protein EOP33_09135, partial [Rickettsiaceae bacterium]2.7e-0638.46Show/hide
Query:  DIEGLIGQMRPPFTDEIMGGECWLFGPEVSHKFKVPNFPEYDGKKDLKQHLDTYLTWMDFHGANEATRCRAFALTLTG
        D  G++ +   PFT +I+   C       +  FK+PN P Y+GK D   H+  Y TWM   G +    C+AF+LTLTG
Subjt:  DIEGLIGQMRPPFTDEIMGGECWLFGPEVSHKFKVPNFPEYDGKKDLKQHLDTYLTWMDFHGANEATRCRAFALTLTG

TYK05792.1 gag/pol protein [Cucumis melo var. makuwa]5.3e-0778.95Show/hide
Query:  GESGTHVDSIGLPFWRQDRVGSWEHNHTRWNSLIPDSR
        GESG  VDSI L FW QDRVGSWEHNHTRWNS IP  R
Subjt:  GESGTHVDSIGLPFWRQDRVGSWEHNHTRWNSLIPDSR

XP_022150035.1 uncharacterized protein LOC111018307 [Momordica charantia]2.2e-0549.02Show/hide
Query:  EVSHKFKVPNFPEYDGKKDLKQHLDTYLTWMDFHGANEATRCRAFALTLTG
        +V  KFK+P   ++DG  DL  HLD Y  WMD +G +EA +CR F+ TL+G
Subjt:  EVSHKFKVPNFPEYDGKKDLKQHLDTYLTWMDFHGANEATRCRAFALTLTG

XP_022158830.1 uncharacterized protein LOC111025293 [Momordica charantia]4.1e-0730.27Show/hide
Query:  PQKGKGIADEEVGDS-ESVTSRMHHPEDDQIRKEAGPTHKKVRRNSPLRPAPGMYTKNNDRKRLEAQAGSRAEQGQKGQERELSKWLKEEDNHRDSQRRT
        P+KGKG  + +  +S  SV S++         +  G T ++ R   P         K   + +  A  G +++   +  E       K  D    S++R 
Subjt:  PQKGKGIADEEVGDS-ESVTSRMHHPEDDQIRKEAGPTHKKVRRNSPLRPAPGMYTKNNDRKRLEAQAGSRAEQGQKGQERELSKWLKEEDNHRDSQRRT

Query:  EKK----DIEGLIGQMRPPFTDEIMGGECWLFGPEVSHKFKVPNFPEYDGKKDLKQHLDTYLTWMDFHGANEATRCRAFALTLTG
          K    D+E L+ Q   PFT+EIM         +V  KFK+P   ++D   D   HLD Y  WMD +G +EA RCR F+ TL G
Subjt:  EKK----DIEGLIGQMRPPFTDEIMGGECWLFGPEVSHKFKVPNFPEYDGKKDLKQHLDTYLTWMDFHGANEATRCRAFALTLTG

XP_022159109.1 uncharacterized protein LOC111025548 [Momordica charantia]1.6e-0635.04Show/hide
Query:  TKNNDRKRLEAQAGSRAEQGQK-------GQERELSKWLKEEDNHRDSQRRTEKK----DIEGLIGQMRPPFTDEIMGGECWLFGPEVSHKFKVPNFPEY
        T+  D ++++ Q  S A  G K        +   L+K  K  D    S++R  +K    D+E L+GQ   PFT+EIM         +V  KFK+P    +
Subjt:  TKNNDRKRLEAQAGSRAEQGQK-------GQERELSKWLKEEDNHRDSQRRTEKK----DIEGLIGQMRPPFTDEIMGGECWLFGPEVSHKFKVPNFPEY

Query:  DGKKDLKQHLDTYLTWMDFHGANEATRCRAFALTLTG
        DG  +   HLD Y  WMD +G ++A RCR F+ TL G
Subjt:  DGKKDLKQHLDTYLTWMDFHGANEATRCRAFALTLTG

TrEMBL top hitse value%identityAlignment
A0A4Q3DBC0 Retrotrans_gag domain-containing protein (Fragment)1.9e-0536.9Show/hide
Query:  RRTEKKDIEGLIGQMRPPFTDEIMGGECWLFGPEVSHKFKVPNFPEYDGKKDLKQHLDTYLTWMDFHGANEATRCRAFALTLTG
        R + ++   G I +   PF  +I+   C          FK+P+ P YDG+ D   H+  Y TWM   G ++AT C+AF+LTLTG
Subjt:  RRTEKKDIEGLIGQMRPPFTDEIMGGECWLFGPEVSHKFKVPNFPEYDGKKDLKQHLDTYLTWMDFHGANEATRCRAFALTLTG

A0A4Q3DG65 Retrotrans_gag domain-containing protein (Fragment)1.3e-0638.46Show/hide
Query:  DIEGLIGQMRPPFTDEIMGGECWLFGPEVSHKFKVPNFPEYDGKKDLKQHLDTYLTWMDFHGANEATRCRAFALTLTG
        D  G++ +   PFT +I+   C       +  FK+PN P Y+GK D   H+  Y TWM   G +    C+AF+LTLTG
Subjt:  DIEGLIGQMRPPFTDEIMGGECWLFGPEVSHKFKVPNFPEYDGKKDLKQHLDTYLTWMDFHGANEATRCRAFALTLTG

A0A5D3C3J6 Gag/pol protein2.6e-0778.95Show/hide
Query:  GESGTHVDSIGLPFWRQDRVGSWEHNHTRWNSLIPDSR
        GESG  VDSI L FW QDRVGSWEHNHTRWNS IP  R
Subjt:  GESGTHVDSIGLPFWRQDRVGSWEHNHTRWNSLIPDSR

A0A6J1DWY0 uncharacterized protein LOC1110252932.0e-0730.27Show/hide
Query:  PQKGKGIADEEVGDS-ESVTSRMHHPEDDQIRKEAGPTHKKVRRNSPLRPAPGMYTKNNDRKRLEAQAGSRAEQGQKGQERELSKWLKEEDNHRDSQRRT
        P+KGKG  + +  +S  SV S++         +  G T ++ R   P         K   + +  A  G +++   +  E       K  D    S++R 
Subjt:  PQKGKGIADEEVGDS-ESVTSRMHHPEDDQIRKEAGPTHKKVRRNSPLRPAPGMYTKNNDRKRLEAQAGSRAEQGQKGQERELSKWLKEEDNHRDSQRRT

Query:  EKK----DIEGLIGQMRPPFTDEIMGGECWLFGPEVSHKFKVPNFPEYDGKKDLKQHLDTYLTWMDFHGANEATRCRAFALTLTG
          K    D+E L+ Q   PFT+EIM         +V  KFK+P   ++D   D   HLD Y  WMD +G +EA RCR F+ TL G
Subjt:  EKK----DIEGLIGQMRPPFTDEIMGGECWLFGPEVSHKFKVPNFPEYDGKKDLKQHLDTYLTWMDFHGANEATRCRAFALTLTG

A0A6J1E1E7 uncharacterized protein LOC1110255487.5e-0735.04Show/hide
Query:  TKNNDRKRLEAQAGSRAEQGQK-------GQERELSKWLKEEDNHRDSQRRTEKK----DIEGLIGQMRPPFTDEIMGGECWLFGPEVSHKFKVPNFPEY
        T+  D ++++ Q  S A  G K        +   L+K  K  D    S++R  +K    D+E L+GQ   PFT+EIM         +V  KFK+P    +
Subjt:  TKNNDRKRLEAQAGSRAEQGQK-------GQERELSKWLKEEDNHRDSQRRTEKK----DIEGLIGQMRPPFTDEIMGGECWLFGPEVSHKFKVPNFPEY

Query:  DGKKDLKQHLDTYLTWMDFHGANEATRCRAFALTLTG
        DG  +   HLD Y  WMD +G ++A RCR F+ TL G
Subjt:  DGKKDLKQHLDTYLTWMDFHGANEATRCRAFALTLTG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCAACTCACTGGTGGGTTAAGGTGGGAATCCTAAGAAACTCTACTCCTACGGATGATAGTAGTCACCCAATCAGTGGAGCCAAGTCTCACCCATCAAGGCGCCTCGA
TCTAAGTCTACATAGTACTCTAGCTACCAACTCACTGGTGGTGAATAGGAGACGCACAAGAGAGGAAGAGACAGGCGGCGCTCGCGGCTGGGATTCGGGCTGCACGGTTC
GTGCTGCTGCAGGGGATAGACGGGGGAGACAGTGGCGGAACGGCGCGGCGCCAGGGAGAAGACGACGGACGAGGGGGAGAGTGATCGGAGGGGGCGGCGGTGCTATCTGG
AACACCCTCGCTGCCCAAAACCCATTTTTCCATTCAGATCTGAAATGCGAAGCCGCCGCCCAAAACCCTCACCACTTGATCCTCTTTTGCATCGTTTTTCTCATCCCTTC
GCCACTTGATCTCTCTCTCTCTTTCTTTTCCTTACTCGCTGCTCAGACTGATTCCAACCTTCAGACTGATTTTTCCCTTGAATCCAGCGATGGCGCGGGGACGAAGGTGC
GAATACATGATCGGCGACATGATGATAGTTGGACGGAACGAATGTCGGACGAAAAGCGAGCAGCGACGGCGGTCGGAGCTGAGTACGCACGCGGGGGAGAGAGAGGAAGA
GGTGAGAGTGGCACGCACGTCGACTCAATAGGCCTACCATTTTGGAGACAAGACCGAGTGGGGAGCTGGGAACATAATCACACAAGATGGAATTCACTCATTCCCGACTC
TAGGGTAAGTAAGGTGTGTTCCCTTAAGTGGTGTCTCCGGGTCTTGAACAATGGGCCTTGCCCTCTCTATGGCACGAGAGGGACTTCTGTTTGTTGGACCTCAAACAAGT
TGTTCATTAGAGGAACACTTGAACTTAAGGATAAAGAGAACAACGGAAAATCTCTTTCAACCTCTCTCACTCTCTTGTGCAACATAGAGAAAGAAAATTCCTTAGAACTT
TACCTTGCTCTCACAAGCAATCAACGTGATCCTAGGAGAATACCAGTGTTGCCCGTTGGTAGTGTCTGTGGTGTTTTCCAGAAAGAAAAGAAAGGATTCGCTGGATTCGT
GGTTGTTTTTCTACAAACATTGGAGAAAAGGCGAGTTTTGATCAAAACTTCTCTAGTATTGGCGCCATCTGTGGGGAAGACGCTGACTAGCAAATGTTGTATTCGGCAGG
AAAGCGCAGCAGCGATGGAGCACCAAGATCAACCAGTGACAGACGAGGCGAGCCCCTTGCCTGTTAAGGAAGCTCGAGACTGTAAGGCACGAGGAAGAGTATTTACACAG
AGACCCCAAAAAGGTAAAGGAATAGCAGACGAAGAGGTAGGGGATTCAGAAAGTGTAACCAGCCGAATGCACCATCCGGAGGATGATCAGATCCGGAAGGAAGCTGGGCC
TACCCACAAAAAGGTTCGCAGGAATTCGCCGCTGCGGCCAGCACCAGGTATGTATACCAAGAATAATGACAGGAAAAGGTTGGAGGCTCAAGCAGGGTCCAGGGCCGAGC
AGGGCCAAAAAGGGCAAGAGCGAGAGCTATCCAAGTGGCTGAAAGAGGAAGACAACCATCGTGACTCCCAAAGAAGAACTGAGAAAAAAGACATAGAAGGGCTAATCGGA
CAGATGAGACCACCCTTCACTGATGAAATAATGGGAGGAGAGTGTTGGCTTTTTGGCCCAGAGGTGTCGCATAAATTCAAGGTACCAAACTTCCCGGAGTATGATGGAAA
GAAAGATCTGAAACAGCACTTAGACACATACTTAACCTGGATGGATTTCCACGGGGCGAACGAAGCGACAAGGTGCCGAGCCTTCGCGTTAACACTTACAGGTCAGCAGG
ATGAAAGATTGCTCAACTCGATCGGTTTTGCTGCAGCAACTGATTTCACAAGCAGAAGGCAGAAAGTCTGGAATAAAACTGCACGTGCTAAACCGTGGGATCTCTCATTA
ATTGAGCTGGCAGCCATGTGGAGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCAACTCACTGGTGGGTTAAGGTGGGAATCCTAAGAAACTCTACTCCTACGGATGATAGTAGTCACCCAATCAGTGGAGCCAAGTCTCACCCATCAAGGCGCCTCGA
TCTAAGTCTACATAGTACTCTAGCTACCAACTCACTGGTGGTGAATAGGAGACGCACAAGAGAGGAAGAGACAGGCGGCGCTCGCGGCTGGGATTCGGGCTGCACGGTTC
GTGCTGCTGCAGGGGATAGACGGGGGAGACAGTGGCGGAACGGCGCGGCGCCAGGGAGAAGACGACGGACGAGGGGGAGAGTGATCGGAGGGGGCGGCGGTGCTATCTGG
AACACCCTCGCTGCCCAAAACCCATTTTTCCATTCAGATCTGAAATGCGAAGCCGCCGCCCAAAACCCTCACCACTTGATCCTCTTTTGCATCGTTTTTCTCATCCCTTC
GCCACTTGATCTCTCTCTCTCTTTCTTTTCCTTACTCGCTGCTCAGACTGATTCCAACCTTCAGACTGATTTTTCCCTTGAATCCAGCGATGGCGCGGGGACGAAGGTGC
GAATACATGATCGGCGACATGATGATAGTTGGACGGAACGAATGTCGGACGAAAAGCGAGCAGCGACGGCGGTCGGAGCTGAGTACGCACGCGGGGGAGAGAGAGGAAGA
GGTGAGAGTGGCACGCACGTCGACTCAATAGGCCTACCATTTTGGAGACAAGACCGAGTGGGGAGCTGGGAACATAATCACACAAGATGGAATTCACTCATTCCCGACTC
TAGGGTAAGTAAGGTGTGTTCCCTTAAGTGGTGTCTCCGGGTCTTGAACAATGGGCCTTGCCCTCTCTATGGCACGAGAGGGACTTCTGTTTGTTGGACCTCAAACAAGT
TGTTCATTAGAGGAACACTTGAACTTAAGGATAAAGAGAACAACGGAAAATCTCTTTCAACCTCTCTCACTCTCTTGTGCAACATAGAGAAAGAAAATTCCTTAGAACTT
TACCTTGCTCTCACAAGCAATCAACGTGATCCTAGGAGAATACCAGTGTTGCCCGTTGGTAGTGTCTGTGGTGTTTTCCAGAAAGAAAAGAAAGGATTCGCTGGATTCGT
GGTTGTTTTTCTACAAACATTGGAGAAAAGGCGAGTTTTGATCAAAACTTCTCTAGTATTGGCGCCATCTGTGGGGAAGACGCTGACTAGCAAATGTTGTATTCGGCAGG
AAAGCGCAGCAGCGATGGAGCACCAAGATCAACCAGTGACAGACGAGGCGAGCCCCTTGCCTGTTAAGGAAGCTCGAGACTGTAAGGCACGAGGAAGAGTATTTACACAG
AGACCCCAAAAAGGTAAAGGAATAGCAGACGAAGAGGTAGGGGATTCAGAAAGTGTAACCAGCCGAATGCACCATCCGGAGGATGATCAGATCCGGAAGGAAGCTGGGCC
TACCCACAAAAAGGTTCGCAGGAATTCGCCGCTGCGGCCAGCACCAGGTATGTATACCAAGAATAATGACAGGAAAAGGTTGGAGGCTCAAGCAGGGTCCAGGGCCGAGC
AGGGCCAAAAAGGGCAAGAGCGAGAGCTATCCAAGTGGCTGAAAGAGGAAGACAACCATCGTGACTCCCAAAGAAGAACTGAGAAAAAAGACATAGAAGGGCTAATCGGA
CAGATGAGACCACCCTTCACTGATGAAATAATGGGAGGAGAGTGTTGGCTTTTTGGCCCAGAGGTGTCGCATAAATTCAAGGTACCAAACTTCCCGGAGTATGATGGAAA
GAAAGATCTGAAACAGCACTTAGACACATACTTAACCTGGATGGATTTCCACGGGGCGAACGAAGCGACAAGGTGCCGAGCCTTCGCGTTAACACTTACAGGTCAGCAGG
ATGAAAGATTGCTCAACTCGATCGGTTTTGCTGCAGCAACTGATTTCACAAGCAGAAGGCAGAAAGTCTGGAATAAAACTGCACGTGCTAAACCGTGGGATCTCTCATTA
ATTGAGCTGGCAGCCATGTGGAGCTGA
Protein sequenceShow/hide protein sequence
MPTHWWVKVGILRNSTPTDDSSHPISGAKSHPSRRLDLSLHSTLATNSLVVNRRRTREEETGGARGWDSGCTVRAAAGDRRGRQWRNGAAPGRRRRTRGRVIGGGGGAIW
NTLAAQNPFFHSDLKCEAAAQNPHHLILFCIVFLIPSPLDLSLSFFSLLAAQTDSNLQTDFSLESSDGAGTKVRIHDRRHDDSWTERMSDEKRAATAVGAEYARGGERGR
GESGTHVDSIGLPFWRQDRVGSWEHNHTRWNSLIPDSRVSKVCSLKWCLRVLNNGPCPLYGTRGTSVCWTSNKLFIRGTLELKDKENNGKSLSTSLTLLCNIEKENSLEL
YLALTSNQRDPRRIPVLPVGSVCGVFQKEKKGFAGFVVVFLQTLEKRRVLIKTSLVLAPSVGKTLTSKCCIRQESAAAMEHQDQPVTDEASPLPVKEARDCKARGRVFTQ
RPQKGKGIADEEVGDSESVTSRMHHPEDDQIRKEAGPTHKKVRRNSPLRPAPGMYTKNNDRKRLEAQAGSRAEQGQKGQERELSKWLKEEDNHRDSQRRTEKKDIEGLIG
QMRPPFTDEIMGGECWLFGPEVSHKFKVPNFPEYDGKKDLKQHLDTYLTWMDFHGANEATRCRAFALTLTGQQDERLLNSIGFAAATDFTSRRQKVWNKTARAKPWDLSL
IELAAMWS