; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0008636 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0008636
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr9:26953610..26955386
RNA-Seq ExpressionLag0008636
SyntenyLag0008636
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022157414.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111024115 [Momordica charantia]4.3e-2454.62Show/hide
Query:  WW--------RNLEHPIDSWNDLKQIMREHFVPKLFY-----NLRTLRQGSKSVEAYYMEMQTLLEELDFYEDEMTTMTRFFRGLNKEIATLLDLQLYGD
        WW        RNLE PIDSW + K+++R  FVP+ F+      L+ LRQGSKSVE YY EM TL+  LD  ED    M RF  GLNKEIA  +DLQ Y +
Subjt:  WW--------RNLEHPIDSWNDLKQIMREHFVPKLFY-----NLRTLRQGSKSVEAYYMEMQTLLEELDFYEDEMTTMTRFFRGLNKEIATLLDLQLYGD

Query:  LDEMVHLAIKIEKHLQRKS
        ++EM+HLAIKIEK LQR+S
Subjt:  LDEMVHLAIKIEKHLQRKS

XP_022158803.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111025268 [Momordica charantia]1.5e-2454.55Show/hide
Query:  WW--------RNLEHPIDSWNDLKQIMREHFVPKLFY-----NLRTLRQGSKSVEAYYMEMQTLLEELDFYEDEMTTMTRFFRGLNKEIATLLDLQLYGD
        WW        RNLE PIDSW + K+++R  FVP+ F+      L+ LRQGSKSVE YY EM TL+  LD  ED    M RF  GLNKEIA  +DLQ Y +
Subjt:  WW--------RNLEHPIDSWNDLKQIMREHFVPKLFY-----NLRTLRQGSKSVEAYYMEMQTLLEELDFYEDEMTTMTRFFRGLNKEIATLLDLQLYGD

Query:  LDEMVHLAIKIEKHLQRKSTR
        ++EM+HLAIKIEK LQR+S R
Subjt:  LDEMVHLAIKIEKHLQRKSTR

XP_023520835.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111784339 [Cucurbita pepo subsp. pepo]9.0e-2249.59Show/hide
Query:  WW--------RNLEHPIDSWNDLKQIMREHFVPKLF-----YNLRTLRQGSKSVEAYYMEMQTLLEELDFYEDEMTTMTRFFRGLNKEIATLLDLQLYGD
        WW        RNLE PIDSW + K+ MR+ FVP+ F       L+ L+QG KSVE YY EM TL++ LD  ED    M RF  GLN EIA   DLQ Y +
Subjt:  WW--------RNLEHPIDSWNDLKQIMREHFVPKLF-----YNLRTLRQGSKSVEAYYMEMQTLLEELDFYEDEMTTMTRFFRGLNKEIATLLDLQLYGD

Query:  LDEMVHLAIKIEKHLQRKSTR
        ++E++H+AIKIE+ +QR+S R
Subjt:  LDEMVHLAIKIEKHLQRKSTR

XP_023553652.1 uncharacterized protein LOC111811140 [Cucurbita pepo subsp. pepo]9.0e-2249.59Show/hide
Query:  WW--------RNLEHPIDSWNDLKQIMREHFVPKLF-----YNLRTLRQGSKSVEAYYMEMQTLLEELDFYEDEMTTMTRFFRGLNKEIATLLDLQLYGD
        WW        RNLE PIDSW + K+ MR+ FVP+ F       L+ L+QG KSVE YY EM TL++ LD  ED    M RF  GLN EIA   DLQ Y +
Subjt:  WW--------RNLEHPIDSWNDLKQIMREHFVPKLF-----YNLRTLRQGSKSVEAYYMEMQTLLEELDFYEDEMTTMTRFFRGLNKEIATLLDLQLYGD

Query:  LDEMVHLAIKIEKHLQRKSTR
        ++E++H+AIKIE+ +QR+S R
Subjt:  LDEMVHLAIKIEKHLQRKSTR

XP_038887118.1 uncharacterized protein K02A2.6-like [Benincasa hispida]1.3e-2350.75Show/hide
Query:  WW--------RNLEHPIDSWNDLKQIMREHFVPKLF-----YNLRTLRQGSKSVEAYYMEMQTLLEELDFYEDEMTTMTRFFRGLNKEIATLLDLQLYGD
        WW        RNLE PI SW + K  MR+HFVP  F       L+ LRQG+KSVE YY EM  L++ LD  ED  T M RF  GLNKEIA  +DLQ Y D
Subjt:  WW--------RNLEHPIDSWNDLKQIMREHFVPKLF-----YNLRTLRQGSKSVEAYYMEMQTLLEELDFYEDEMTTMTRFFRGLNKEIATLLDLQLYGD

Query:  LDEMVHLAIKIEKHLQRKSTRPLDCKDLFDCNPS
        ++EM+HLAIK+EKHL  K  R    K     N S
Subjt:  LDEMVHLAIKIEKHLQRKSTRPLDCKDLFDCNPS

TrEMBL top hitse value%identityAlignment
A0A2N9GNR6 Uncharacterized protein2.2e-2629.53Show/hide
Query:  WW--------RNLEHPIDSWNDLKQIMREHFVPKLFY-----NLRTLRQGSKSVEAYYMEMQTLLEELDFYEDEMTTMTRFFRGLNKEIATLLDLQLYGD
        WW        RN E P+++W +LK +MR  FVP  FY      L+ L QGS+SVE Y+ EM+  +   +  ED   TM RF  GLN++IA +++LQ Y +
Subjt:  WW--------RNLEHPIDSWNDLKQIMREHFVPKLFY-----NLRTLRQGSKSVEAYYMEMQTLLEELDFYEDEMTTMTRFFRGLNKEIATLLDLQLYGD

Query:  LDEMVHLAIKIEKHLQRKSTR------------PLDCKDLFDCNPSTIVIPTHDEVHVDVCELGKKAQKAKTKACELIEKESCETMSEREKHEISLWPKE
        +++MVH+A+K+E+ L+RK T               + +D  D  P  +     D V   V   G            LI + + +    R        P+E
Subjt:  LDEMVHLAIKIEKHLQRKSTR------------PLDCKDLFDCNPSTIVIPTHDEVHVDVCELGKKAQKAKTKACELIEKESCETMSEREKHEISLWPKE

Query:  KNERKAKNMSLCETVCEEAKTI---IVKLDSKFPSKVNNGNQHMNINANAFLERQNSKFASKFIHGSTHKVFKPGDWVWKHYWKDPYSFNKKPKWRSKGG
          E + +     E   E+   +    VK   ++P+  +N                             H+    GDWVW H  K+ +  ++K K   +G 
Subjt:  KNERKAKNMSLCETVCEEAKTI---IVKLDSKFPSKVNNGNQHMNINANAFLERQNSKFASKFIHGSTHKVFKPGDWVWKHYWKDPYSFNKKPKWRSKGG

Query:  LP------ITSHDYKIALQGERDVSYIFNAADFRPFDVGDPFDLRTNPLEEGENDMNSS
         P      I  + +K+ L GE  VS  F  +D  PFDVG+  D R+NP EE  ND N S
Subjt:  LP------ITSHDYKIALQGERDVSYIFNAADFRPFDVGDPFDLRTNPLEEGENDMNSS

A0A2N9H3X9 CCHC-type domain-containing protein8.5e-2628.77Show/hide
Query:  WW--------RNLEHPIDSWNDLKQIMREHFVPKLFY-----NLRTLRQGSKSVEAYYMEMQTLLEELDFYEDEMTTMTRFFRGLNKEIATLLDLQLYGD
        WW        RN E P+++W +LK +MR  FVP  FY      L+ L QGS+SVE Y+ EM+  +   +  ED   TM RF  GLN++IA +++LQ Y +
Subjt:  WW--------RNLEHPIDSWNDLKQIMREHFVPKLFY-----NLRTLRQGSKSVEAYYMEMQTLLEELDFYEDEMTTMTRFFRGLNKEIATLLDLQLYGD

Query:  LDEMVHLAIKIEKHLQRKSTRPLDC------KDLFDCNPSTIVIPTHDEVHVDVCELGKKAQKAKTKACELIEKESCETMSEREKHEISLWPKEKNERKA
        +++MVH+A+K+E+ L+RK T           K  +D N              D  E  +K +  K K      K   E+   R + +I  +       K 
Subjt:  LDEMVHLAIKIEKHLQRKSTRPLDC------KDLFDCNPSTIVIPTHDEVHVDVCELGKKAQKAKTKACELIEKESCETMSEREKHEISLWPKEKNERKA

Query:  KNMSLCETVCEEAKTIIVKLDSKFPSKVNNGNQHM--------------NINANAFLERQNSKFASKFIHGSTHKV---FKPGDWVWKHYWKDPYSFNKK
               + C   + +I++ + +  ++  + +  M               +   + + R+        I     KV    K   WVW H  K+ +  ++K
Subjt:  KNMSLCETVCEEAKTIIVKLDSKFPSKVNNGNQHM--------------NINANAFLERQNSKFASKFIHGSTHKV---FKPGDWVWKHYWKDPYSFNKK

Query:  PKWRSKGGLP------ITSHDYKIALQGERDVSYIFNAADFRPFDVGDPFDLRTNPLEEGENDMN
         K   +G  P      I  + YK+ L GE  VS  FN +D  PFDVG+  D  +NP EE  ND N
Subjt:  PKWRSKGGLP------ITSHDYKIALQGERDVSYIFNAADFRPFDVGDPFDLRTNPLEEGENDMN

A0A2N9ICK7 Reverse transcriptase7.7e-2726.4Show/hide
Query:  WW--------RNLEHPIDSWNDLKQIMREHFVPKLFY-----NLRTLRQGSKSVEAYYMEMQTLLEELDFYEDEMTTMTRFFRGLNKEIATLLDLQLYGD
        WW        RN E P+++W +LK IMR  FVP  FY      L+ L QGS+SVE Y+ EM+  +   +  ED   TM RF  GLN++IA +++LQ Y +
Subjt:  WW--------RNLEHPIDSWNDLKQIMREHFVPKLFY-----NLRTLRQGSKSVEAYYMEMQTLLEELDFYEDEMTTMTRFFRGLNKEIATLLDLQLYGD

Query:  LDEMVHLAIKIEKHLQRKST--RPLDCKD-----------------LFDCNPST-IVIPTHDE---------VHVDVCELGKKAQKAKTKACELIEKE--
        +++MVH+A+K+E+ L+RK T  R +  +D                 L D +    +V P   E          H+ V +  ++ +      C +  KE  
Subjt:  LDEMVHLAIKIEKHLQRKST--RPLDCKD-----------------LFDCNPST-IVIPTHDE---------VHVDVCELGKKAQKAKTKACELIEKE--

Query:  -----------------------------------------------------SCETMSEREKHEISLWPKEKNERKAKNMSLCETVCEEAKTIIVKLDS
                                                             S  ++ E  +H   + P  + E+   N+  C    ++   +   +  
Subjt:  -----------------------------------------------------SCETMSEREKHEISLWPKEKNERKAKNMSLCETVCEEAKTIIVKLDS

Query:  KFPSKVNNGNQHMNINANAFLERQNSKFASKFIHGSTHKVFKPGDWVWKHYWKDPYSFNKKPKWRSKGGLP------ITSHDYKIALQGERDVSYIFNAA
        K  +      + +          +N + AS+   G    +F+PGDWVW H  K+ +  ++K K   +G  P      I  + +K+ L GE  VS  F  +
Subjt:  KFPSKVNNGNQHMNINANAFLERQNSKFASKFIHGSTHKVFKPGDWVWKHYWKDPYSFNKKPKWRSKGGLP------ITSHDYKIALQGERDVSYIFNAA

Query:  DFRPFDVGDPFDLRTNPLEEGENDMNSS
        D  PFDVG+  D R+NP EE  ND N S
Subjt:  DFRPFDVGDPFDLRTNPLEEGENDMNSS

A0A2N9JA32 Integrase catalytic domain-containing protein1.8e-2827.79Show/hide
Query:  WW--------RNLEHPIDSWNDLKQIMREHFVPKLFY-----NLRTLRQGSKSVEAYYMEMQTLLEELDFYEDEMTTMTRFFRGLNKEIATLLDLQLYGD
        WW        RN E P+++W +LK IMR  FVP  FY      L+ L QGS+SVE Y+ EM+  +   +  ED   TM RF  GLN++IA +++LQ Y +
Subjt:  WW--------RNLEHPIDSWNDLKQIMREHFVPKLFY-----NLRTLRQGSKSVEAYYMEMQTLLEELDFYEDEMTTMTRFFRGLNKEIATLLDLQLYGD

Query:  LDEMVHLAIKIEKHLQRKST--RPLDCKD-----------------LFDCNPST-IVIPTHDE---------VHVDVCELGKKAQKAKTKACELIEKESC
        +++MVH+A+K+E+ L+RK T  R +  +D                 L D +    +V P   E          H+ V +  ++ +      C +  KE  
Subjt:  LDEMVHLAIKIEKHLQRKST--RPLDCKD-----------------LFDCNPST-IVIPTHDE---------VHVDVCELGKKAQKAKTKACELIEKESC

Query:  E--------------------------TMSEREKHEISLWPKEKNE-------RKAKNMSLCETVCEEAKTIIVKLDSKFPSKVNNGNQHMNINA--NAF
        +                          T+  R  + + L PK+          R   N++      +E K   +K +   P  +        + +    F
Subjt:  E--------------------------TMSEREKHEISLWPKEKNE-------RKAKNMSLCETVCEEAKTIIVKLDSKFPSKVNNGNQHMNINA--NAF

Query:  LE-------------RQNSKFASKFIHGSTHKVFKPGDWVWKHYWKDPYSFNKKPKWRSKGGLP------ITSHDYKIALQGERDVSYIFNAADFRPFDV
        ++             ++N + AS+   G    +F+PGDWVW H  K+ +  ++K K   +G  P      I  + +K+ L GE  VS  F  +D  PFDV
Subjt:  LE-------------RQNSKFASKFIHGSTHKVFKPGDWVWKHYWKDPYSFNKKPKWRSKGGLP------ITSHDYKIALQGERDVSYIFNAADFRPFDV

Query:  GDPFDLRTNPLEEGENDMNSS
        G+  D R+NP EE  ND N S
Subjt:  GDPFDLRTNPLEEGENDMNSS

A0A6J1DX46 LOW QUALITY PROTEIN: uncharacterized protein LOC1110252687.2e-2554.55Show/hide
Query:  WW--------RNLEHPIDSWNDLKQIMREHFVPKLFY-----NLRTLRQGSKSVEAYYMEMQTLLEELDFYEDEMTTMTRFFRGLNKEIATLLDLQLYGD
        WW        RNLE PIDSW + K+++R  FVP+ F+      L+ LRQGSKSVE YY EM TL+  LD  ED    M RF  GLNKEIA  +DLQ Y +
Subjt:  WW--------RNLEHPIDSWNDLKQIMREHFVPKLFY-----NLRTLRQGSKSVEAYYMEMQTLLEELDFYEDEMTTMTRFFRGLNKEIATLLDLQLYGD

Query:  LDEMVHLAIKIEKHLQRKSTR
        ++EM+HLAIKIEK LQR+S R
Subjt:  LDEMVHLAIKIEKHLQRKSTR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGCTGGTGGCGGAAATGGCAGCAAACGGCGGTGGTGGTGAAGACGGACGGCGCCGGCGGTGGTGGAGGAACCTTGAACATCCAATAGATTCATGGAATGATTTAAA
GCAAATAATGCGAGAGCACTTTGTTCCTAAGCTTTTCTATAACTTGCGGACTTTGAGACAAGGGAGCAAAAGTGTGGAGGCTTACTACATGGAGATGCAAACATTGCTTG
AGGAACTTGATTTTTATGAGGATGAGATGACTACCATGACTCGTTTCTTTAGAGGACTTAATAAGGAGATTGCCACCCTACTTGATCTTCAACTTTATGGGGATTTAGAC
GAGATGGTACACTTAGCCATAAAGATTGAAAAACATCTCCAAAGGAAGTCTACAAGGCCATTAGATTGCAAAGACTTGTTTGATTGTAACCCTTCTACTATTGTTATACC
TACTCATGATGAAGTACATGTTGATGTGTGTGAGTTAGGGAAAAAGGCACAAAAGGCCAAGACAAAAGCATGTGAATTGATTGAAAAAGAATCTTGTGAGACAATGAGTG
AAAGAGAAAAACATGAGATTAGTCTATGGCCTAAAGAGAAAAATGAGAGAAAAGCCAAGAATATGAGTTTGTGTGAAACAGTTTGTGAGGAAGCCAAAACAATCATTGTG
AAGCTTGATTCAAAGTTCCCTTCTAAAGTCAACAATGGCAACCAACACATGAACATTAATGCCAATGCATTTCTTGAGAGGCAAAACTCCAAATTTGCTTCTAAGTTCAT
CCATGGCAGCACGCATAAGGTGTTTAAACCGGGTGATTGGGTTTGGAAACACTATTGGAAGGATCCTTATTCTTTTAATAAGAAACCCAAGTGGAGATCCAAAGGCGGTT
TACCAATCACATCCCATGACTACAAAATTGCTCTACAAGGCGAGAGGGATGTAAGTTACATTTTCAATGCCGCTGATTTTCGTCCATTTGATGTAGGAGACCCTTTTGAT
TTGAGGACAAATCCTCTTGAAGAAGGGGAGAATGATATGAATTCTTCCAATAAAGAATTCCAATAG
mRNA sequenceShow/hide mRNA sequence
ATGGTGCTGGTGGCGGAAATGGCAGCAAACGGCGGTGGTGGTGAAGACGGACGGCGCCGGCGGTGGTGGAGGAACCTTGAACATCCAATAGATTCATGGAATGATTTAAA
GCAAATAATGCGAGAGCACTTTGTTCCTAAGCTTTTCTATAACTTGCGGACTTTGAGACAAGGGAGCAAAAGTGTGGAGGCTTACTACATGGAGATGCAAACATTGCTTG
AGGAACTTGATTTTTATGAGGATGAGATGACTACCATGACTCGTTTCTTTAGAGGACTTAATAAGGAGATTGCCACCCTACTTGATCTTCAACTTTATGGGGATTTAGAC
GAGATGGTACACTTAGCCATAAAGATTGAAAAACATCTCCAAAGGAAGTCTACAAGGCCATTAGATTGCAAAGACTTGTTTGATTGTAACCCTTCTACTATTGTTATACC
TACTCATGATGAAGTACATGTTGATGTGTGTGAGTTAGGGAAAAAGGCACAAAAGGCCAAGACAAAAGCATGTGAATTGATTGAAAAAGAATCTTGTGAGACAATGAGTG
AAAGAGAAAAACATGAGATTAGTCTATGGCCTAAAGAGAAAAATGAGAGAAAAGCCAAGAATATGAGTTTGTGTGAAACAGTTTGTGAGGAAGCCAAAACAATCATTGTG
AAGCTTGATTCAAAGTTCCCTTCTAAAGTCAACAATGGCAACCAACACATGAACATTAATGCCAATGCATTTCTTGAGAGGCAAAACTCCAAATTTGCTTCTAAGTTCAT
CCATGGCAGCACGCATAAGGTGTTTAAACCGGGTGATTGGGTTTGGAAACACTATTGGAAGGATCCTTATTCTTTTAATAAGAAACCCAAGTGGAGATCCAAAGGCGGTT
TACCAATCACATCCCATGACTACAAAATTGCTCTACAAGGCGAGAGGGATGTAAGTTACATTTTCAATGCCGCTGATTTTCGTCCATTTGATGTAGGAGACCCTTTTGAT
TTGAGGACAAATCCTCTTGAAGAAGGGGAGAATGATATGAATTCTTCCAATAAAGAATTCCAATAG
Protein sequenceShow/hide protein sequence
MVLVAEMAANGGGGEDGRRRRWWRNLEHPIDSWNDLKQIMREHFVPKLFYNLRTLRQGSKSVEAYYMEMQTLLEELDFYEDEMTTMTRFFRGLNKEIATLLDLQLYGDLD
EMVHLAIKIEKHLQRKSTRPLDCKDLFDCNPSTIVIPTHDEVHVDVCELGKKAQKAKTKACELIEKESCETMSEREKHEISLWPKEKNERKAKNMSLCETVCEEAKTIIV
KLDSKFPSKVNNGNQHMNINANAFLERQNSKFASKFIHGSTHKVFKPGDWVWKHYWKDPYSFNKKPKWRSKGGLPITSHDYKIALQGERDVSYIFNAADFRPFDVGDPFD
LRTNPLEEGENDMNSSNKEFQ