; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0019397 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0019397
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr5:41789767..41790558
RNA-Seq ExpressionLag0019397
SyntenyLag0019397
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK16471.1 uncharacterized protein E5676_scaffold21G002740 [Cucumis melo var. makuwa]6.5e-4143.59Show/hide
Query:  QMKKILSKRK-HSIRKSQKEAATLKDVCNGIYNLANKMTTVAERMNEIIEYESYDSKIEQFFSFRPPFFGQYTDPLLATDWIYILDSIFDLIGCFSDEQK
        Q+++++S  K ++   + + +  L +V + I NL +++  +A +  +    E   +K   FF+  PP FG+ TDPL+A  WI IL++IFDLI C SD+QK
Subjt:  QMKKILSKRK-HSIRKSQKEAATLKDVCNGIYNLANKMTTVAERMNEIIEYESYDSKIEQFFSFRPPFFGQYTDPLLATDWIYILDSIFDLIGCFSDEQK

Query:  VSLAVLRLMDDALCWWRMEERGLEADGIAVTWLKFKTLFYERYFPKSLQFDNFIELTNVVQGDRTVVEYDAEFMKLCSLAQDQKYVLEDVWKAAFFERGL
        VS A LRL D A  WW +    LEADG+ VTW KFK LF +RY P+ L+ + F EL N+ QGD TV EYD +F+KL SLA +  ++ ++ W+A  F  GL
Subjt:  VSLAVLRLMDDALCWWRMEERGLEADGIAVTWLKFKTLFYERYFPKSLQFDNFIELTNVVQGDRTVVEYDAEFMKLCSLAQDQKYVLEDVWKAAFFERGL

Query:  RREIRTRLDFLDDMSYVEVRNEALRLERLQQRLN
        R +IR ++ F +++SY EVRN AL  E   QR+N
Subjt:  RREIRTRLDFLDDMSYVEVRNEALRLERLQQRLN

XP_022136421.1 uncharacterized protein LOC111008132 [Momordica charantia]7.7e-4248.42Show/hide
Query:  KEAATLKDVCNGIYNLANKMTTVAERMNEIIEYESYDSKIEQFFSFRPPFFGQYTDPLLATDWIYILDSIFDLIGCFSDEQKVSLAVLRLMDDALCWWRM
        KEAA   DV N I NL  +M  VA R  ++     + SKI  FF   PP FGQ TDP  A  WI  L++IFD I C SDEQ+VS AVL+L D AL WW +
Subjt:  KEAATLKDVCNGIYNLANKMTTVAERMNEIIEYESYDSKIEQFFSFRPPFFGQYTDPLLATDWIYILDSIFDLIGCFSDEQKVSLAVLRLMDDALCWWRM

Query:  EERGLEADGIAVTWLKFKTLFYERYFPKSLQFDNFIELTNVVQGDRTVVEYDAEFMKLCSLAQDQKYVLEDVWKAAFFERGLRREIRTRLDFLDDMSYVE
         ER L+ D + VTW KFK LF+ RYFP+ +Q   + EL ++ QGDRTV EYD EF+KLCSL  D  ++  + W+ +        +IR ++  L D +Y +
Subjt:  EERGLEADGIAVTWLKFKTLFYERYFPKSLQFDNFIELTNVVQGDRTVVEYDAEFMKLCSLAQDQKYVLEDVWKAAFFERGLRREIRTRLDFLDDMSYVE

Query:  VRNEALRLERLQQRLNAQRDQ
        VR+ AL LER   RLNAQ DQ
Subjt:  VRNEALRLERLQQRLNAQRDQ

XP_022931358.1 uncharacterized protein LOC111437567 isoform X1 [Cucurbita moschata]2.9e-4147.89Show/hide
Query:  AATLKDVCNGIYNLANK-MTTVAERMNEIIEYESYDSKIEQFFSFRPPFFGQYTDPLLATDWIYILDSIFDLIGCFSDEQKVSLAVLRLMDDALCWWRME
        A  L DVC+ I +L  + +  VA R       E ++SK  QFF  +PP+FG+ TDPL+A  W+  L++IFD IGC SDEQKVS A LRL D AL WW + 
Subjt:  AATLKDVCNGIYNLANK-MTTVAERMNEIIEYESYDSKIEQFFSFRPPFFGQYTDPLLATDWIYILDSIFDLIGCFSDEQKVSLAVLRLMDDALCWWRME

Query:  ERGLEADGIAVTWLKFKTLFYERYFPKSLQFDNFIELTNVVQGDRTVVEYDAEFMKLCSLAQDQKYVLEDVWKAAFFERGLRREIRTRLDFLDDMSYVEV
        +R L ADG AVTW KFK LFY+RYFP  L+ +   EL  + QG+RTV+EYD EF+ L SL  +     +D  +A  F  GLR +I  +L F D++SY ++
Subjt:  ERGLEADGIAVTWLKFKTLFYERYFPKSLQFDNFIELTNVVQGDRTVVEYDAEFMKLCSLAQDQKYVLEDVWKAAFFERGLRREIRTRLDFLDDMSYVEV

Query:  RNEALRLERLQQR
        RN AL +E+   R
Subjt:  RNEALRLERLQQR

XP_022931360.1 uncharacterized protein LOC111437567 isoform X2 [Cucurbita moschata]6.5e-4148.1Show/hide
Query:  LKDVCNGIYNLANK-MTTVAERMNEIIEYESYDSKIEQFFSFRPPFFGQYTDPLLATDWIYILDSIFDLIGCFSDEQKVSLAVLRLMDDALCWWRMEERG
        L DVC+ I +L  + +  VA R       E ++SK  QFF  +PP+FG+ TDPL+A  W+  L++IFD IGC SDEQKVS A LRL D AL WW + +R 
Subjt:  LKDVCNGIYNLANK-MTTVAERMNEIIEYESYDSKIEQFFSFRPPFFGQYTDPLLATDWIYILDSIFDLIGCFSDEQKVSLAVLRLMDDALCWWRMEERG

Query:  LEADGIAVTWLKFKTLFYERYFPKSLQFDNFIELTNVVQGDRTVVEYDAEFMKLCSLAQDQKYVLEDVWKAAFFERGLRREIRTRLDFLDDMSYVEVRNE
        L ADG AVTW KFK LFY+RYFP  L+ +   EL  + QG+RTV+EYD EF+ L SL  +     +D  +A  F  GLR +I  +L F D++SY ++RN 
Subjt:  LEADGIAVTWLKFKTLFYERYFPKSLQFDNFIELTNVVQGDRTVVEYDAEFMKLCSLAQDQKYVLEDVWKAAFFERGLRREIRTRLDFLDDMSYVEVRNE

Query:  ALRLERLQQR
        AL +E+   R
Subjt:  ALRLERLQQR

XP_023522569.1 uncharacterized protein LOC111786568 [Cucurbita pepo subsp. pepo]1.1e-4047.42Show/hide
Query:  AATLKDVCNGIYNLANK-MTTVAERMNEIIEYESYDSKIEQFFSFRPPFFGQYTDPLLATDWIYILDSIFDLIGCFSDEQKVSLAVLRLMDDALCWWRME
        A  L DVC+ I +L  + +  VA R       E ++SK  QFF  +PP+FG+ TDPL+A  W+  L++IFD IGC SDEQ VS A LRL D AL WW + 
Subjt:  AATLKDVCNGIYNLANK-MTTVAERMNEIIEYESYDSKIEQFFSFRPPFFGQYTDPLLATDWIYILDSIFDLIGCFSDEQKVSLAVLRLMDDALCWWRME

Query:  ERGLEADGIAVTWLKFKTLFYERYFPKSLQFDNFIELTNVVQGDRTVVEYDAEFMKLCSLAQDQKYVLEDVWKAAFFERGLRREIRTRLDFLDDMSYVEV
        +R L ADG AVTW KFK LFY+RYFP  L+ +   EL  + QG+RTV+EYD EF+ L SL  +     +D  +A  F  GLR +I  +L F D++SY ++
Subjt:  ERGLEADGIAVTWLKFKTLFYERYFPKSLQFDNFIELTNVVQGDRTVVEYDAEFMKLCSLAQDQKYVLEDVWKAAFFERGLRREIRTRLDFLDDMSYVEV

Query:  RNEALRLERLQQR
        RN AL +E+   R
Subjt:  RNEALRLERLQQR

TrEMBL top hitse value%identityAlignment
A0A0A0L042 Retrotrans_gag domain-containing protein9.1e-4144.02Show/hide
Query:  QMKKILSKRKHSIRKSQKE-AATLKDVCNGIYNLANKMTTVAERMNEIIEYESYDSKIEQFFSFRPPFFGQYTDPLLATDWIYILDSIFDLIGCFSDEQK
        Q+++++S  K + R    E +  L +V + I NL ++M  +A +  +    E   +K   FF+  PP FG+ TDPL+A  WI IL++IFDLI C SDE K
Subjt:  QMKKILSKRKHSIRKSQKE-AATLKDVCNGIYNLANKMTTVAERMNEIIEYESYDSKIEQFFSFRPPFFGQYTDPLLATDWIYILDSIFDLIGCFSDEQK

Query:  VSLAVLRLMDDALCWWRMEERGLEADGIAVTWLKFKTLFYERYFPKSLQFDNFIELTNVVQGDRTVVEYDAEFMKLCSLAQDQKYVLEDVWKAAFFERGL
        VS A LRL D A  WW +    LEADG+ VTW KFK LF +RY P  L+ + F EL N+ QGD TV EYD +F+KL SLA +  ++ ++ W+A  F  GL
Subjt:  VSLAVLRLMDDALCWWRMEERGLEADGIAVTWLKFKTLFYERYFPKSLQFDNFIELTNVVQGDRTVVEYDAEFMKLCSLAQDQKYVLEDVWKAAFFERGL

Query:  RREIRTRLDFLDDMSYVEVRNEALRLERLQQRLN
        R +IR ++  L++ S  E+RN AL +E   QR+N
Subjt:  RREIRTRLDFLDDMSYVEVRNEALRLERLQQRLN

A0A5D3CX20 Retrotrans_gag domain-containing protein3.1e-4143.59Show/hide
Query:  QMKKILSKRK-HSIRKSQKEAATLKDVCNGIYNLANKMTTVAERMNEIIEYESYDSKIEQFFSFRPPFFGQYTDPLLATDWIYILDSIFDLIGCFSDEQK
        Q+++++S  K ++   + + +  L +V + I NL +++  +A +  +    E   +K   FF+  PP FG+ TDPL+A  WI IL++IFDLI C SD+QK
Subjt:  QMKKILSKRK-HSIRKSQKEAATLKDVCNGIYNLANKMTTVAERMNEIIEYESYDSKIEQFFSFRPPFFGQYTDPLLATDWIYILDSIFDLIGCFSDEQK

Query:  VSLAVLRLMDDALCWWRMEERGLEADGIAVTWLKFKTLFYERYFPKSLQFDNFIELTNVVQGDRTVVEYDAEFMKLCSLAQDQKYVLEDVWKAAFFERGL
        VS A LRL D A  WW +    LEADG+ VTW KFK LF +RY P+ L+ + F EL N+ QGD TV EYD +F+KL SLA +  ++ ++ W+A  F  GL
Subjt:  VSLAVLRLMDDALCWWRMEERGLEADGIAVTWLKFKTLFYERYFPKSLQFDNFIELTNVVQGDRTVVEYDAEFMKLCSLAQDQKYVLEDVWKAAFFERGL

Query:  RREIRTRLDFLDDMSYVEVRNEALRLERLQQRLN
        R +IR ++ F +++SY EVRN AL  E   QR+N
Subjt:  RREIRTRLDFLDDMSYVEVRNEALRLERLQQRLN

A0A6J1C495 uncharacterized protein LOC1110081323.7e-4248.42Show/hide
Query:  KEAATLKDVCNGIYNLANKMTTVAERMNEIIEYESYDSKIEQFFSFRPPFFGQYTDPLLATDWIYILDSIFDLIGCFSDEQKVSLAVLRLMDDALCWWRM
        KEAA   DV N I NL  +M  VA R  ++     + SKI  FF   PP FGQ TDP  A  WI  L++IFD I C SDEQ+VS AVL+L D AL WW +
Subjt:  KEAATLKDVCNGIYNLANKMTTVAERMNEIIEYESYDSKIEQFFSFRPPFFGQYTDPLLATDWIYILDSIFDLIGCFSDEQKVSLAVLRLMDDALCWWRM

Query:  EERGLEADGIAVTWLKFKTLFYERYFPKSLQFDNFIELTNVVQGDRTVVEYDAEFMKLCSLAQDQKYVLEDVWKAAFFERGLRREIRTRLDFLDDMSYVE
         ER L+ D + VTW KFK LF+ RYFP+ +Q   + EL ++ QGDRTV EYD EF+KLCSL  D  ++  + W+ +        +IR ++  L D +Y +
Subjt:  EERGLEADGIAVTWLKFKTLFYERYFPKSLQFDNFIELTNVVQGDRTVVEYDAEFMKLCSLAQDQKYVLEDVWKAAFFERGLRREIRTRLDFLDDMSYVE

Query:  VRNEALRLERLQQRLNAQRDQ
        VR+ AL LER   RLNAQ DQ
Subjt:  VRNEALRLERLQQRLNAQRDQ

A0A6J1EYB6 uncharacterized protein LOC111437567 isoform X11.4e-4147.89Show/hide
Query:  AATLKDVCNGIYNLANK-MTTVAERMNEIIEYESYDSKIEQFFSFRPPFFGQYTDPLLATDWIYILDSIFDLIGCFSDEQKVSLAVLRLMDDALCWWRME
        A  L DVC+ I +L  + +  VA R       E ++SK  QFF  +PP+FG+ TDPL+A  W+  L++IFD IGC SDEQKVS A LRL D AL WW + 
Subjt:  AATLKDVCNGIYNLANK-MTTVAERMNEIIEYESYDSKIEQFFSFRPPFFGQYTDPLLATDWIYILDSIFDLIGCFSDEQKVSLAVLRLMDDALCWWRME

Query:  ERGLEADGIAVTWLKFKTLFYERYFPKSLQFDNFIELTNVVQGDRTVVEYDAEFMKLCSLAQDQKYVLEDVWKAAFFERGLRREIRTRLDFLDDMSYVEV
        +R L ADG AVTW KFK LFY+RYFP  L+ +   EL  + QG+RTV+EYD EF+ L SL  +     +D  +A  F  GLR +I  +L F D++SY ++
Subjt:  ERGLEADGIAVTWLKFKTLFYERYFPKSLQFDNFIELTNVVQGDRTVVEYDAEFMKLCSLAQDQKYVLEDVWKAAFFERGLRREIRTRLDFLDDMSYVEV

Query:  RNEALRLERLQQR
        RN AL +E+   R
Subjt:  RNEALRLERLQQR

A0A6J1EZ79 uncharacterized protein LOC111437567 isoform X23.1e-4148.1Show/hide
Query:  LKDVCNGIYNLANK-MTTVAERMNEIIEYESYDSKIEQFFSFRPPFFGQYTDPLLATDWIYILDSIFDLIGCFSDEQKVSLAVLRLMDDALCWWRMEERG
        L DVC+ I +L  + +  VA R       E ++SK  QFF  +PP+FG+ TDPL+A  W+  L++IFD IGC SDEQKVS A LRL D AL WW + +R 
Subjt:  LKDVCNGIYNLANK-MTTVAERMNEIIEYESYDSKIEQFFSFRPPFFGQYTDPLLATDWIYILDSIFDLIGCFSDEQKVSLAVLRLMDDALCWWRMEERG

Query:  LEADGIAVTWLKFKTLFYERYFPKSLQFDNFIELTNVVQGDRTVVEYDAEFMKLCSLAQDQKYVLEDVWKAAFFERGLRREIRTRLDFLDDMSYVEVRNE
        L ADG AVTW KFK LFY+RYFP  L+ +   EL  + QG+RTV+EYD EF+ L SL  +     +D  +A  F  GLR +I  +L F D++SY ++RN 
Subjt:  LEADGIAVTWLKFKTLFYERYFPKSLQFDNFIELTNVVQGDRTVVEYDAEFMKLCSLAQDQKYVLEDVWKAAFFERGLRREIRTRLDFLDDMSYVEVRNE

Query:  ALRLERLQQR
        AL +E+   R
Subjt:  ALRLERLQQR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGAAAACCAACTCCAACAGCAGCAGCAACCACCCAACTACAACAGCAGCCAAGGAAGTTGCAAACCAAATGAAAAAAATCCTATCAAAAAGAAAACATTCCATCAG
AAAATCCCAGAAAGAAGCTGCAACTCTGAAAGATGTTTGCAACGGAATCTACAATTTAGCCAACAAAATGACCACGGTTGCTGAGAGAATGAATGAAATTATTGAATATG
AGTCATATGACAGCAAAATCGAACAGTTCTTTAGCTTCCGTCCTCCTTTCTTTGGACAGTATACTGACCCTTTACTTGCCACAGACTGGATTTACATTCTGGACAGTATC
TTTGATCTCATAGGTTGTTTTTCAGATGAGCAAAAAGTTTCTTTAGCTGTTCTAAGGCTAATGGATGATGCACTTTGTTGGTGGAGAATGGAGGAAAGAGGATTGGAAGC
TGATGGAATTGCAGTGACATGGCTGAAGTTTAAGACTTTATTCTATGAGAGATATTTCCCAAAGTCGTTGCAGTTTGATAATTTTATTGAACTTACTAATGTGGTACAAG
GGGACAGAACAGTAGTAGAGTACGATGCAGAGTTCATGAAGTTGTGTTCTCTTGCTCAAGATCAAAAGTATGTTTTAGAAGATGTCTGGAAAGCTGCCTTTTTCGAGAGA
GGTTTGAGACGAGAAATCAGGACACGACTTGACTTCTTGGACGACATGTCGTACGTCGAGGTTAGGAACGAGGCGTTGAGGCTAGAACGGCTGCAACAGCGCCTCAACGC
TCAAAGGGATCAAGGGGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTGAAAACCAACTCCAACAGCAGCAGCAACCACCCAACTACAACAGCAGCCAAGGAAGTTGCAAACCAAATGAAAAAAATCCTATCAAAAAGAAAACATTCCATCAG
AAAATCCCAGAAAGAAGCTGCAACTCTGAAAGATGTTTGCAACGGAATCTACAATTTAGCCAACAAAATGACCACGGTTGCTGAGAGAATGAATGAAATTATTGAATATG
AGTCATATGACAGCAAAATCGAACAGTTCTTTAGCTTCCGTCCTCCTTTCTTTGGACAGTATACTGACCCTTTACTTGCCACAGACTGGATTTACATTCTGGACAGTATC
TTTGATCTCATAGGTTGTTTTTCAGATGAGCAAAAAGTTTCTTTAGCTGTTCTAAGGCTAATGGATGATGCACTTTGTTGGTGGAGAATGGAGGAAAGAGGATTGGAAGC
TGATGGAATTGCAGTGACATGGCTGAAGTTTAAGACTTTATTCTATGAGAGATATTTCCCAAAGTCGTTGCAGTTTGATAATTTTATTGAACTTACTAATGTGGTACAAG
GGGACAGAACAGTAGTAGAGTACGATGCAGAGTTCATGAAGTTGTGTTCTCTTGCTCAAGATCAAAAGTATGTTTTAGAAGATGTCTGGAAAGCTGCCTTTTTCGAGAGA
GGTTTGAGACGAGAAATCAGGACACGACTTGACTTCTTGGACGACATGTCGTACGTCGAGGTTAGGAACGAGGCGTTGAGGCTAGAACGGCTGCAACAGCGCCTCAACGC
TCAAAGGGATCAAGGGGATTGA
Protein sequenceShow/hide protein sequence
MVKTNSNSSSNHPTTTAAKEVANQMKKILSKRKHSIRKSQKEAATLKDVCNGIYNLANKMTTVAERMNEIIEYESYDSKIEQFFSFRPPFFGQYTDPLLATDWIYILDSI
FDLIGCFSDEQKVSLAVLRLMDDALCWWRMEERGLEADGIAVTWLKFKTLFYERYFPKSLQFDNFIELTNVVQGDRTVVEYDAEFMKLCSLAQDQKYVLEDVWKAAFFER
GLRREIRTRLDFLDDMSYVEVRNEALRLERLQQRLNAQRDQGD