; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0028153 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0028153
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr8:14641064..14641849
RNA-Seq ExpressionLag0028153
SyntenyLag0028153
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7990634.1 hypothetical protein I3843_02G035100 [Carya illinoinensis]2.1e-5253.37Show/hide
Query:  MRRNKVVNLFPLDLEIDRTLRSIRREKRLAEAMVHQDEAPKAIRDFLQPVLPTENSGIVYAPVQATNFELKTELIQTARDNSFKGHPSEDAHSHLRSFLE
        MRR +  ++ P+D EI+RTLRS+RR K LA A   ++  P+ ++D+++PV+    S I+  P+ A NFELK  LI   +   F G P +D + HL  FLE
Subjt:  MRRNKVVNLFPLDLEIDRTLRSIRREKRLAEAMVHQDEAPKAIRDFLQPVLPTENSGIVYAPVQATNFELKTELIQTARDNSFKGHPSEDAHSHLRSFLE

Query:  ICRTVKMNGVPTDAIRLRLFPFSLQDKAKDWLESVEMGNISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKERLRRCP
        IC TVK+NGV  D IRLRLFPFSL+DKA+ WL+S++ G+I +W ++A+ FL KFFPPAKT +LR+EIG F+Q D E LYEAWERYK+ +RRCP
Subjt:  ICRTVKMNGVPTDAIRLRLFPFSLQDKAKDWLESVEMGNISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKERLRRCP

RWR83368.1 hypothetical protein CKAN_01212200 [Cinnamomum micranthum f. kanehirae]2.1e-4750.51Show/hide
Query:  MRRNKVVNLFPLDLEIDRTLRSIRREKRLA---EAMVHQDEAPKAIRDFLQPVLPTENSGIVYAPVQATNFELKTELIQ-TARDNSFKGHPSEDAHSHLR
        MRRN+ +NL PLD EI+RTLR +++EK+     E    +++A +++ D+  P++    S I    +QA NFE+K  +IQ  A    F G P +D ++H+ 
Subjt:  MRRNKVVNLFPLDLEIDRTLRSIRREKRLA---EAMVHQDEAPKAIRDFLQPVLPTENSGIVYAPVQATNFELKTELIQ-TARDNSFKGHPSEDAHSHLR

Query:  SFLEICRTVKMNGVPTDAIRLRLFPFSLQDKAKDWLESVEMGNISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKERLRRCPN
        +FLE+C T K NGV  DA+RLRL PFSL+DKAK WL S+    I+TWDELA+ FL KFFPP KT K+R +I TF Q + E LYEAWERYKE LR+CP+
Subjt:  SFLEICRTVKMNGVPTDAIRLRLFPFSLQDKAKDWLESVEMGNISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKERLRRCPN

WP_217833153.1 retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 7002]2.8e-6063.98Show/hide
Query:  NLFPLDLEIDRTLRSIRREKRLAEAMVHQDEAPKAIRDFLQPVLPTENSGIVYAPVQATNFELKTELIQTARDNSFKGHPSEDAHSHLRSFLEICRTVKM
        NL PLD EIDRT R   R   L +     +E PKAIRD+ QP LP    GI+  P+   NFELK  LIQ AR+ +F+G  +ED H HLRSFLEIC TVKM
Subjt:  NLFPLDLEIDRTLRSIRREKRLAEAMVHQDEAPKAIRDFLQPVLPTENSGIVYAPVQATNFELKTELIQTARDNSFKGHPSEDAHSHLRSFLEICRTVKM

Query:  NGVPTDAIRLRLFPFSLQDKAKDWLESVEMGNISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKERLRRCP
        NGV  DAI+LRLFPFSLQD+AKDWLE++   +I+TW+ LAQAFL K+FPPAK+ +LRTEIGTFRQL++EQLYEAWERYK+ LRRCP
Subjt:  NGVPTDAIRLRLFPFSLQDKAKDWLESVEMGNISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKERLRRCP

XP_022843226.1 uncharacterized protein LOC111366761 [Olea europaea var. sylvestris]2.4e-4853.23Show/hide
Query:  MRRNKVVNLFPLDLEIDRT---LRSIRREKRLAEA-----MVHQDEAPKAIRDFLQPVLPTENSGIVYAPVQATNFELKTELIQTARDNSFKGHPSEDAH
        MRR + ++L  +D E +RT   LR I+R +R A A       ++D   +AIRD+++PV+    SGI    + A NFELK  LI   + N F G   ED +
Subjt:  MRRNKVVNLFPLDLEIDRT---LRSIRREKRLAEA-----MVHQDEAPKAIRDFLQPVLPTENSGIVYAPVQATNFELKTELIQTARDNSFKGHPSEDAH

Query:  SHLRSFLEICRTVKMNGVPTDAIRLRLFPFSLQDKAKDWLESVEMGNISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKERLRRC
        +HL SFLEIC TVKMNGV  DAIRLRLF FSL+DKAK W +S+  G+I+TWD+LAQ FLTK+FPP+K+ +LR EI  F+QLD E  YEAWER+K+ LRRC
Subjt:  SHLRSFLEICRTVKMNGVPTDAIRLRLFPFSLQDKAKDWLESVEMGNISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKERLRRC

Query:  P
        P
Subjt:  P

XP_022860306.1 uncharacterized protein LOC111380876 [Olea europaea var. sylvestris]5.1e-4657.86Show/hide
Query:  HQDEAPKAIRDFLQPVLPTENSGIVYAPVQATNFELKTELIQTARDNSFKGHPSEDAHSHLRSFLEICRTVKMNGVPTDAIRLRLFPFSLQDKAKDWLES
        ++D   +AIRD+++PV+    SGI +  + A NFELK  LI   + N F G   ED ++HL SFLEIC TVKMNGV  DAIRLRLF FSL+DKAK W +S
Subjt:  HQDEAPKAIRDFLQPVLPTENSGIVYAPVQATNFELKTELIQTARDNSFKGHPSEDAHSHLRSFLEICRTVKMNGVPTDAIRLRLFPFSLQDKAKDWLES

Query:  VEMGNISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKERLRRCP
        +  G+I+TWD+LAQ FLTK+FPP+K+T+L +EI  F+QLD E  YEAWER+K+ LRRCP
Subjt:  VEMGNISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKERLRRCP

TrEMBL top hitse value%identityAlignment
A0A1S3UKD4 uncharacterized protein LOC1067662671.0e-4449.25Show/hide
Query:  DLEIDRTLRSIR-------REKRLAEAMVHQDEAP----------KAIRDFLQPVLPTENSGIVYAPVQATNFELKTELIQTARDNSFKGHPSEDAHSHL
        D  I+RT RS R       RE+R  +  + Q+E            K IRD+  P        IV  P+QA NFE+K  L+Q  + N F G  SED +SHL
Subjt:  DLEIDRTLRSIR-------REKRLAEAMVHQDEAP----------KAIRDFLQPVLPTENSGIVYAPVQATNFELKTELIQTARDNSFKGHPSEDAHSHL

Query:  RSFLEICRTVKMNGVPTDAIRLRLFPFSLQDKAKDWLESVEMGNISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKERLRRCPN
         +FL IC T+K NGV  DAI LRLFPFSL+DKAK+WL+S+  G+ISTW+++A  F+TK+FPP+K+ K+R EI +F Q D E LYEAWERYKE +R+CP+
Subjt:  RSFLEICRTVKMNGVPTDAIRLRLFPFSLQDKAKDWLESVEMGNISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKERLRRCPN

A0A3S3N117 Retrotrans_gag domain-containing protein1.0e-4750.51Show/hide
Query:  MRRNKVVNLFPLDLEIDRTLRSIRREKRLA---EAMVHQDEAPKAIRDFLQPVLPTENSGIVYAPVQATNFELKTELIQ-TARDNSFKGHPSEDAHSHLR
        MRRN+ +NL PLD EI+RTLR +++EK+     E    +++A +++ D+  P++    S I    +QA NFE+K  +IQ  A    F G P +D ++H+ 
Subjt:  MRRNKVVNLFPLDLEIDRTLRSIRREKRLA---EAMVHQDEAPKAIRDFLQPVLPTENSGIVYAPVQATNFELKTELIQ-TARDNSFKGHPSEDAHSHLR

Query:  SFLEICRTVKMNGVPTDAIRLRLFPFSLQDKAKDWLESVEMGNISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKERLRRCPN
        +FLE+C T K NGV  DA+RLRL PFSL+DKAK WL S+    I+TWDELA+ FL KFFPP KT K+R +I TF Q + E LYEAWERYKE LR+CP+
Subjt:  SFLEICRTVKMNGVPTDAIRLRLFPFSLQDKAKDWLESVEMGNISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKERLRRCPN

A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129453.2e-4649.07Show/hide
Query:  MRRNKVVNLFPLDLEIDRTLRSIRREK----RLAEAMVHQD------------EAPKAIRDFLQPVLPTENSGIVYAPVQATNFELKTELIQTARDN-SF
        M+R   +NL P D +I+RT R  RRE      L + M   +            EA +A+RD++ P++   +  I    + A NFE+K   IQ  + +  F
Subjt:  MRRNKVVNLFPLDLEIDRTLRSIRREK----RLAEAMVHQD------------EAPKAIRDFLQPVLPTENSGIVYAPVQATNFELKTELIQTARDN-SF

Query:  KGHPSEDAHSHLRSFLEICRTVKMNGVPTDAIRLRLFPFSLQDKAKDWLESVEMGNISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWE
         G PS+D +SHL +FLEIC T K NGV  DAIRLRLFPFSL+DKAK WL S+  G+I+TW++LAQ FL KFFPPAKT K+R +I +F Q D E LYEAWE
Subjt:  KGHPSEDAHSHLRSFLEICRTVKMNGVPTDAIRLRLFPFSLQDKAKDWLESVEMGNISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWE

Query:  RYKERLRRCPNMDI
        R+KE LRRCP+  I
Subjt:  RYKERLRRCPNMDI

A0A6J0ZYV0 uncharacterized protein LOC1104134135.5e-4649.07Show/hide
Query:  MRRNKVVNLFPLDLEIDRTLRSIRREK----RLAEAMVHQD------------EAPKAIRDFLQPVLPTENSGIVYAPVQATNFELKTELIQTARDN-SF
        M+R   +NL P D +I+RT R  RRE      L + M   +            EA +A+RD+  P++   +  I    + A NFE+K   IQ  + +  F
Subjt:  MRRNKVVNLFPLDLEIDRTLRSIRREK----RLAEAMVHQD------------EAPKAIRDFLQPVLPTENSGIVYAPVQATNFELKTELIQTARDN-SF

Query:  KGHPSEDAHSHLRSFLEICRTVKMNGVPTDAIRLRLFPFSLQDKAKDWLESVEMGNISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWE
         G PS+D +SHL +FLEIC T K NGV  DAIRLRLFPFSL+DKAK WL S+  G+I+TW++LAQ FL KFFPPAKT K+R +I +F Q D E LYEAWE
Subjt:  KGHPSEDAHSHLRSFLEICRTVKMNGVPTDAIRLRLFPFSLQDKAKDWLESVEMGNISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWE

Query:  RYKERLRRCPNMDI
        R+KE LRRCP+  I
Subjt:  RYKERLRRCPNMDI

A0A7C9A3K2 Retrotrans_gag domain-containing protein (Fragment)5.1e-4450.26Show/hide
Query:  LFPLDLEIDRTLR---SIRREKRLAEAMVHQDEAPKAIRDFLQPVLPTENSGIVYAPVQATNFELKTELIQTARDNSFKGHPSEDAHSHLRSFLEICRTV
        L P D EI+   R   S  R KR AEA + QD   + +RD++ P      S IV   ++A NFEL   LI       F GHPSE+ ++H+R FL  C T+
Subjt:  LFPLDLEIDRTLR---SIRREKRLAEAMVHQDEAPKAIRDFLQPVLPTENSGIVYAPVQATNFELKTELIQTARDNSFKGHPSEDAHSHLRSFLEICRTV

Query:  KMNGVPTDAIRLRLFPFSLQDKAKDWLESVEMGNISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKERLRRCPN
        K+NG  +DAIRLRLFPFSL+D+A DWL++ E  + +TWD L++AFL+K+FPP KT KLR EI +F Q D E LYEAWERYK+  R+CP+
Subjt:  KMNGVPTDAIRLRLFPFSLQDKAKDWLESVEMGNISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKERLRRCPN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGAAGGAACAAGGTGGTTAATTTGTTTCCGCTAGATCTCGAAATTGACAGGACTCTTAGATCCATTCGGAGAGAGAAAAGATTAGCAGAAGCAATGGTCCATCAAGA
TGAAGCTCCCAAGGCAATCAGAGACTTCTTACAGCCAGTTCTTCCCACCGAGAATTCTGGAATTGTCTACGCCCCAGTCCAAGCTACCAATTTTGAGTTAAAGACAGAAT
TGATTCAGACGGCGCGCGATAACTCTTTTAAGGGACATCCTTCTGAGGACGCACACTCACATCTGCGATCATTCTTGGAAATATGTAGGACGGTGAAGATGAACGGAGTT
CCGACAGACGCTATAAGATTGAGGCTGTTTCCATTTTCTCTACAAGACAAAGCAAAGGATTGGCTCGAATCAGTTGAGATGGGCAACATTAGTACATGGGACGAGCTTGC
CCAGGCTTTTCTGACGAAATTTTTCCCGCCTGCCAAGACTACCAAGCTTCGGACTGAAATCGGAACGTTTAGGCAGCTTGATGAAGAGCAATTGTACGAGGCATGGGAAA
GATACAAGGAAAGGCTTAGACGGTGCCCCAACATGGATATCCTGATTGGCTCCAAGTGCAGTTGTTTTACAATGATTGAATCTCTCCACCAAGACAGTCCTAGACACATC
AGCAGGAGGAAGTTTTCTTTCCAAAACAGTAACGGAAGCCAAAGATTTGTTGGAGGAAATGGCGGCAACCAGTTATCAGTGGCCGACCGAGAGGGGAGCAATTTCAAAGA
AGGTTGGAATTTATGA
mRNA sequenceShow/hide mRNA sequence
ATGCGAAGGAACAAGGTGGTTAATTTGTTTCCGCTAGATCTCGAAATTGACAGGACTCTTAGATCCATTCGGAGAGAGAAAAGATTAGCAGAAGCAATGGTCCATCAAGA
TGAAGCTCCCAAGGCAATCAGAGACTTCTTACAGCCAGTTCTTCCCACCGAGAATTCTGGAATTGTCTACGCCCCAGTCCAAGCTACCAATTTTGAGTTAAAGACAGAAT
TGATTCAGACGGCGCGCGATAACTCTTTTAAGGGACATCCTTCTGAGGACGCACACTCACATCTGCGATCATTCTTGGAAATATGTAGGACGGTGAAGATGAACGGAGTT
CCGACAGACGCTATAAGATTGAGGCTGTTTCCATTTTCTCTACAAGACAAAGCAAAGGATTGGCTCGAATCAGTTGAGATGGGCAACATTAGTACATGGGACGAGCTTGC
CCAGGCTTTTCTGACGAAATTTTTCCCGCCTGCCAAGACTACCAAGCTTCGGACTGAAATCGGAACGTTTAGGCAGCTTGATGAAGAGCAATTGTACGAGGCATGGGAAA
GATACAAGGAAAGGCTTAGACGGTGCCCCAACATGGATATCCTGATTGGCTCCAAGTGCAGTTGTTTTACAATGATTGAATCTCTCCACCAAGACAGTCCTAGACACATC
AGCAGGAGGAAGTTTTCTTTCCAAAACAGTAACGGAAGCCAAAGATTTGTTGGAGGAAATGGCGGCAACCAGTTATCAGTGGCCGACCGAGAGGGGAGCAATTTCAAAGA
AGGTTGGAATTTATGA
Protein sequenceShow/hide protein sequence
MRRNKVVNLFPLDLEIDRTLRSIRREKRLAEAMVHQDEAPKAIRDFLQPVLPTENSGIVYAPVQATNFELKTELIQTARDNSFKGHPSEDAHSHLRSFLEICRTVKMNGV
PTDAIRLRLFPFSLQDKAKDWLESVEMGNISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKERLRRCPNMDILIGSKCSCFTMIESLHQDSPRHI
SRRKFSFQNSNGSQRFVGGNGGNQLSVADREGSNFKEGWNL