; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0008418 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0008418
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr9:21012050..21013134
RNA-Seq ExpressionLag0008418
SyntenyLag0008418
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6734747.1 hypothetical protein I3842_01G285500 [Carya illinoinensis]3.8e-5157.23Show/hide
Query:  MVRENAFKGHPSEDPRHHLRSFLNTCGTVKMGGVSPDSIRLQLFPFSLQDKARDWLDSLTSNSILGWNDLVKAFLAKFFPLEKITKLRAEIGKFQQLEEE
        MV++  F G P +DP  HL  FL  C TVK+ GV+ D+IRL+LFPFSL+DKAR WL SL   SI+ W D+ + FLAKFFP  K  +LR+EIG+F+Q + E
Subjt:  MVRENAFKGHPSEDPRHHLRSFLNTCGTVKMGGVSPDSIRLQLFPFSLQDKARDWLDSLTSNSILGWNDLVKAFLAKFFPLEKITKLRAEIGKFQQLEEE

Query:  QLYEAWERYKDLLRKCPQHGYPDWLQIQLFYSGLNRNLKTILDTTAGGNLLAKTVKEARTIIEELA
         LYEAWERYKDL+R+CPQHG PDWLQ+Q+FY+GLN   +TI+D  +GG L++KT + A  ++EE+A
Subjt:  QLYEAWERYKDLLRKCPQHGYPDWLQIQLFYSGLNRNLKTILDTTAGGNLLAKTVKEARTIIEELA

KAG7947748.1 hypothetical protein I3843_14G109500 [Carya illinoinensis]3.8e-5157.23Show/hide
Query:  MVRENAFKGHPSEDPRHHLRSFLNTCGTVKMGGVSPDSIRLQLFPFSLQDKARDWLDSLTSNSILGWNDLVKAFLAKFFPLEKITKLRAEIGKFQQLEEE
        MV++  F G P +DP  HL  FL  C TVK+ GV+ D+IRL+LFPFSL+DKAR WL SL   SI+ W D+ + FLAKFFP  K  +LR+EIG+F+Q + E
Subjt:  MVRENAFKGHPSEDPRHHLRSFLNTCGTVKMGGVSPDSIRLQLFPFSLQDKARDWLDSLTSNSILGWNDLVKAFLAKFFPLEKITKLRAEIGKFQQLEEE

Query:  QLYEAWERYKDLLRKCPQHGYPDWLQIQLFYSGLNRNLKTILDTTAGGNLLAKTVKEARTIIEELA
         LYEAWERYKDL+R+CPQHG PDWLQ+Q+FY+GLN   +TI+D  +GG L++KT + A  ++EE+A
Subjt:  QLYEAWERYKDLLRKCPQHGYPDWLQIQLFYSGLNRNLKTILDTTAGGNLLAKTVKEARTIIEELA

KAG7990634.1 hypothetical protein I3843_02G035100 [Carya illinoinensis]3.8e-5157.23Show/hide
Query:  MVRENAFKGHPSEDPRHHLRSFLNTCGTVKMGGVSPDSIRLQLFPFSLQDKARDWLDSLTSNSILGWNDLVKAFLAKFFPLEKITKLRAEIGKFQQLEEE
        MV++  F G P +DP  HL  FL  C TVK+ GV+ D+IRL+LFPFSL+DKAR WL SL   SI+ W D+ + FLAKFFP  K  +LR+EIG+F+Q + E
Subjt:  MVRENAFKGHPSEDPRHHLRSFLNTCGTVKMGGVSPDSIRLQLFPFSLQDKARDWLDSLTSNSILGWNDLVKAFLAKFFPLEKITKLRAEIGKFQQLEEE

Query:  QLYEAWERYKDLLRKCPQHGYPDWLQIQLFYSGLNRNLKTILDTTAGGNLLAKTVKEARTIIEELA
         LYEAWERYKDL+R+CPQHG PDWLQ+Q+FY+GLN   +TI+D  +GG L++KT + A  ++EE+A
Subjt:  QLYEAWERYKDLLRKCPQHGYPDWLQIQLFYSGLNRNLKTILDTTAGGNLLAKTVKEARTIIEELA

WP_217833153.1 retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 7002]1.1e-5861.96Show/hide
Query:  MVRENAFKGHPSEDPRHHLRSFLNTCGTVKMGGVSPDSIRLQLFPFSLQDKARDWLDSLTSNSILGWNDLVKAFLAKFFPLEKITKLRAEIGKFQQLEEE
        M RE AF+G  +EDP  HLRSFL  CGTVKM GVS D+I+L+LFPFSLQD+A+DWL+++  +SI  W  L +AFL K+FP  K  +LR EIG F+QLE+E
Subjt:  MVRENAFKGHPSEDPRHHLRSFLNTCGTVKMGGVSPDSIRLQLFPFSLQDKARDWLDSLTSNSILGWNDLVKAFLAKFFPLEKITKLRAEIGKFQQLEEE

Query:  QLYEAWERYKDLLRKCPQHGYPDWLQIQLFYSGLNRNLKTILDTTAGGNLLAKTVKEARTIIEELALTKI--------PNAPKA
        QLYEAWERYKDLLR+CPQHGYPDWLQIQLFY+GL  + K+ILD TAGG++ +K  +EA TI+E+LA T          PN PKA
Subjt:  QLYEAWERYKDLLRKCPQHGYPDWLQIQLFYSGLNRNLKTILDTTAGGNLLAKTVKEARTIIEELALTKI--------PNAPKA

XP_023903214.1 uncharacterized protein LOC112015077 [Quercus suber]6.5e-5157.83Show/hide
Query:  MVRENAFKGHPSEDPRHHLRSFLNTCGTVKMGGVSPDSIRLQLFPFSLQDKARDWLDSLTSNSILGWNDLVKAFLAKFFPLEKITKLRAEIGKFQQLEEE
        MV++  F G P +DP  HL  FL  C TVKM GV+ D+IRL+LFPFSL+DKAR WL SL   SI  W D+ + FLAKFFP  K  +LR+EIG+F+Q + E
Subjt:  MVRENAFKGHPSEDPRHHLRSFLNTCGTVKMGGVSPDSIRLQLFPFSLQDKARDWLDSLTSNSILGWNDLVKAFLAKFFPLEKITKLRAEIGKFQQLEEE

Query:  QLYEAWERYKDLLRKCPQHGYPDWLQIQLFYSGLNRNLKTILDTTAGGNLLAKTVKEARTIIEELA
         LYEAWERYKDL+R CPQHG PDWLQ+Q+FY+GLN   +TI+D  +GG L++KT + A +++EE+A
Subjt:  QLYEAWERYKDLLRKCPQHGYPDWLQIQLFYSGLNRNLKTILDTTAGGNLLAKTVKEARTIIEELA

TrEMBL top hitse value%identityAlignment
A0A2I4G4Q3 uncharacterized protein LOC1090047123.5e-5056.63Show/hide
Query:  MVRENAFKGHPSEDPRHHLRSFLNTCGTVKMGGVSPDSIRLQLFPFSLQDKARDWLDSLTSNSILGWNDLVKAFLAKFFPLEKITKLRAEIGKFQQLEEE
        MV++  F G P +DP  HL  FL  C TVK+ GV+ D+IRL+LFPFSL+D+AR WL SL   SI  W D+ + F AKFFP  K T+LR+EIG+F+Q + E
Subjt:  MVRENAFKGHPSEDPRHHLRSFLNTCGTVKMGGVSPDSIRLQLFPFSLQDKARDWLDSLTSNSILGWNDLVKAFLAKFFPLEKITKLRAEIGKFQQLEEE

Query:  QLYEAWERYKDLLRKCPQHGYPDWLQIQLFYSGLNRNLKTILDTTAGGNLLAKTVKEARTIIEELA
         LYEAWE YKDL+R+CPQHG PDWLQ+Q+FY+GLN + +TI+D T+GG L+ KT++ A  ++EE+A
Subjt:  QLYEAWERYKDLLRKCPQHGYPDWLQIQLFYSGLNRNLKTILDTTAGGNLLAKTVKEARTIIEELA

A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129453.0e-4655.62Show/hide
Query:  FKGHPSEDPRHHLRSFLNTCGTVKMGGVSPDSIRLQLFPFSLQDKARDWLDSLTSNSILGWNDLVKAFLAKFFPLEKITKLRAEIGKFQQLEEEQLYEAW
        F G PS+DP  HL +FL  C T K  GV+ D+IRL+LFPFSL+DKA+ WL+SL + SI  W DL + FLAKFFP  K  K+R +I  F Q + E LYEAW
Subjt:  FKGHPSEDPRHHLRSFLNTCGTVKMGGVSPDSIRLQLFPFSLQDKARDWLDSLTSNSILGWNDLVKAFLAKFFPLEKITKLRAEIGKFQQLEEEQLYEAW

Query:  ERYKDLLRKCPQHGYPDWLQIQLFYSGLNRNLKTILDTTAGGNLLAKTVKEARTIIEELA
        ER+K+LLR+CP HG PDWLQ+Q FY+GL  ++KTI+D  AGG L++K   +A  ++EE+A
Subjt:  ERYKDLLRKCPQHGYPDWLQIQLFYSGLNRNLKTILDTTAGGNLLAKTVKEARTIIEELA

A0A6J0ZYV0 uncharacterized protein LOC1104134133.0e-4655.62Show/hide
Query:  FKGHPSEDPRHHLRSFLNTCGTVKMGGVSPDSIRLQLFPFSLQDKARDWLDSLTSNSILGWNDLVKAFLAKFFPLEKITKLRAEIGKFQQLEEEQLYEAW
        F G PS+DP  HL +FL  C T K  GV+ D+IRL+LFPFSL+DKA+ WL+SL + SI  W DL + FLAKFFP  K  K+R +I  F Q + E LYEAW
Subjt:  FKGHPSEDPRHHLRSFLNTCGTVKMGGVSPDSIRLQLFPFSLQDKARDWLDSLTSNSILGWNDLVKAFLAKFFPLEKITKLRAEIGKFQQLEEEQLYEAW

Query:  ERYKDLLRKCPQHGYPDWLQIQLFYSGLNRNLKTILDTTAGGNLLAKTVKEARTIIEELA
        ER+K+LLR+CP HG PDWLQ+Q FY+GL  ++KTI+D  AGG L++K   +A  ++EE+A
Subjt:  ERYKDLLRKCPQHGYPDWLQIQLFYSGLNRNLKTILDTTAGGNLLAKTVKEARTIIEELA

A0A6P6XAQ1 Reverse transcriptase1.5e-4553.61Show/hide
Query:  MVRENAFKGHPSEDPRHHLRSFLNTCGTVKMGGVSPDSIRLQLFPFSLQDKARDWLDSLTSNSILGWNDLVKAFLAKFFPLEKITKLRAEIGKFQQLEEE
        MV+++ + G+ +EDP  HL +FL  C T+K  GVS D+I+L+LFPFSL+DKA+ WL S   N+   W++L KAFL KFFP  K  KLR +I  F Q E E
Subjt:  MVRENAFKGHPSEDPRHHLRSFLNTCGTVKMGGVSPDSIRLQLFPFSLQDKARDWLDSLTSNSILGWNDLVKAFLAKFFPLEKITKLRAEIGKFQQLEEE

Query:  QLYEAWERYKDLLRKCPQHGYPDWLQIQLFYSGLNRNLKTILDTTAGGNLLAKTVKEARTIIEELA
         LYEAWERY++L R+CP HG PDWL +Q FY+GL    KT +D  AGG L+ KT +EA+ +IEE+A
Subjt:  QLYEAWERYKDLLRKCPQHGYPDWLQIQLFYSGLNRNLKTILDTTAGGNLLAKTVKEARTIIEELA

A0A7C9A3K2 Retrotrans_gag domain-containing protein (Fragment)2.0e-4552.73Show/hide
Query:  VRENAFKGHPSEDPRHHLRSFLNTCGTVKMGGVSPDSIRLQLFPFSLQDKARDWLDSLTSNSILGWNDLVKAFLAKFFPLEKITKLRAEIGKFQQLEEEQ
        V    F GHPSE+P  H+R FL  C T+K+ G S D+IRL+LFPFSL+D+A DWL +   NS   W+ L +AFL+K+FP  K  KLRAEI  F Q + E 
Subjt:  VRENAFKGHPSEDPRHHLRSFLNTCGTVKMGGVSPDSIRLQLFPFSLQDKARDWLDSLTSNSILGWNDLVKAFLAKFFPLEKITKLRAEIGKFQQLEEEQ

Query:  LYEAWERYKDLLRKCPQHGYPDWLQIQLFYSGLNRNLKTILDTTAGGNLLAKTVKEARTIIEELA
        LYEAWERYKDL R+CP H  PDWL IQ FY+GL ++++  +D  AGG L+ K++  A+ ++EE+A
Subjt:  LYEAWERYKDLLRKCPQHGYPDWLQIQLFYSGLNRNLKTILDTTAGGNLLAKTVKEARTIIEELA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTAAGAGAGAATGCTTTCAAAGGCCATCCATCAGAAGATCCACGTCATCACCTAAGATCATTTCTGAATACATGTGGAACTGTAAAAATGGGAGGAGTTAGCCCTGA
CTCAATTCGGTTGCAGTTATTCCCATTCTCTTTACAAGACAAGGCTAGAGATTGGTTAGACTCGTTAACCTCAAATAGCATTCTTGGTTGGAATGATTTAGTCAAAGCTT
TCCTTGCAAAATTTTTCCCATTAGAAAAGATCACCAAGCTTAGAGCTGAAATTGGAAAATTTCAACAATTAGAGGAAGAACAATTATATGAAGCTTGGGAAAGATATAAG
GATTTGTTAAGGAAATGCCCACAACATGGATATCCTGATTGGCTTCAAATTCAACTTTTCTACAGTGGGTTAAATAGGAATTTAAAGACAATTCTTGACACCACGGCTGG
AGGAAACTTGTTAGCCAAAACTGTCAAGGAGGCACGAACCATAATAGAAGAGTTGGCTTTGACAAAAATTCCAAATGCACCGAAAGCCGAAGTAAAAGAAGAACCTAAAC
CTGAAGAAAAGCAAAGCCCTTTGGAAGAAATGCTAAGAGACTTCATGAAGGAGACTAGAAATTTGAATGAAAATGTTCAAGCCAGAGTAATATTGTTATTGATAAATGGA
AGAAAAATGGATCAAATGACAATATTTATGAAGGAAATCCAGCAAGGAAATTTATCTAATGTCATCGCCATGGACACACAAGAACAATGCCTGGCCATAACTCGGAGAAG
TGGAAAGCAGGAAGTAGAAAGAAAAGAAGTCAAACAGACCAGCACTAGGAGGATTCTGGATGAAGATGATGATCAAAACACAGAAGAGGAGGAGAAAGCGCCGCTAGCTC
CACCTGAAAAGACGTCCCATGAGTTCGTGGACTGCTCCAGTGATATTGCATGGTACCTTATACATCACTACCCGAGGAGACTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTAAGAGAGAATGCTTTCAAAGGCCATCCATCAGAAGATCCACGTCATCACCTAAGATCATTTCTGAATACATGTGGAACTGTAAAAATGGGAGGAGTTAGCCCTGA
CTCAATTCGGTTGCAGTTATTCCCATTCTCTTTACAAGACAAGGCTAGAGATTGGTTAGACTCGTTAACCTCAAATAGCATTCTTGGTTGGAATGATTTAGTCAAAGCTT
TCCTTGCAAAATTTTTCCCATTAGAAAAGATCACCAAGCTTAGAGCTGAAATTGGAAAATTTCAACAATTAGAGGAAGAACAATTATATGAAGCTTGGGAAAGATATAAG
GATTTGTTAAGGAAATGCCCACAACATGGATATCCTGATTGGCTTCAAATTCAACTTTTCTACAGTGGGTTAAATAGGAATTTAAAGACAATTCTTGACACCACGGCTGG
AGGAAACTTGTTAGCCAAAACTGTCAAGGAGGCACGAACCATAATAGAAGAGTTGGCTTTGACAAAAATTCCAAATGCACCGAAAGCCGAAGTAAAAGAAGAACCTAAAC
CTGAAGAAAAGCAAAGCCCTTTGGAAGAAATGCTAAGAGACTTCATGAAGGAGACTAGAAATTTGAATGAAAATGTTCAAGCCAGAGTAATATTGTTATTGATAAATGGA
AGAAAAATGGATCAAATGACAATATTTATGAAGGAAATCCAGCAAGGAAATTTATCTAATGTCATCGCCATGGACACACAAGAACAATGCCTGGCCATAACTCGGAGAAG
TGGAAAGCAGGAAGTAGAAAGAAAAGAAGTCAAACAGACCAGCACTAGGAGGATTCTGGATGAAGATGATGATCAAAACACAGAAGAGGAGGAGAAAGCGCCGCTAGCTC
CACCTGAAAAGACGTCCCATGAGTTCGTGGACTGCTCCAGTGATATTGCATGGTACCTTATACATCACTACCCGAGGAGACTTTGA
Protein sequenceShow/hide protein sequence
MVRENAFKGHPSEDPRHHLRSFLNTCGTVKMGGVSPDSIRLQLFPFSLQDKARDWLDSLTSNSILGWNDLVKAFLAKFFPLEKITKLRAEIGKFQQLEEEQLYEAWERYK
DLLRKCPQHGYPDWLQIQLFYSGLNRNLKTILDTTAGGNLLAKTVKEARTIIEELALTKIPNAPKAEVKEEPKPEEKQSPLEEMLRDFMKETRNLNENVQARVILLLING
RKMDQMTIFMKEIQQGNLSNVIAMDTQEQCLAITRRSGKQEVERKEVKQTSTRRILDEDDDQNTEEEEKAPLAPPEKTSHEFVDCSSDIAWYLIHHYPRRL