; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0038866 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0038866
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr2:29449552..29450154
RNA-Seq ExpressionLag0038866
SyntenyLag0038866
Gene Ontology termsNA
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_009759808.1 PREDICTED: uncharacterized protein LOC104212287 [Nicotiana sylvestris]6.4e-4152.38Show/hide
Query:  MPQYNRFMKEWLEKKRKEKKVDTVYLASTCCTRVQQKVPEKVADPGSFSVPCSF-GTYSFRALCELGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSV
        MP Y +F+KE L  KRK ++V  V L   C   +Q K+P+KV DPGSF+VPC+  G Y  +ALC+ GASIN++P+S+ KKLD+GEIK T + LQ ADQS 
Subjt:  MPQYNRFMKEWLEKKRKEKKVDTVYLASTCCTRVQQKVPEKVADPGSFSVPCSF-GTYSFRALCELGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSV

Query:  VRPVGIVENVLIRLGRFFLPIDLYVMNMMENPTMPAILGRPFLATGRVIIDIERRELTVRVRNDKEIF
         +P GI+ENVL+R+ +F  P+D  V+ M E P  P ILGR FLATGR IID+ + +L +RV  ++ IF
Subjt:  VRPVGIVENVLIRLGRFFLPIDLYVMNMMENPTMPAILGRPFLATGRVIIDIERRELTVRVRNDKEIF

XP_019231228.1 PREDICTED: uncharacterized protein LOC109212063, partial [Nicotiana attenuata]4.9e-4151.45Show/hide
Query:  MPQYNRFMKEWLEKKRKEKKVDTVYLASTCCTRVQQKVPEKVADPGSFSVPCSF-GTYSFRALCELGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSV
        MP Y +F+KE L  KRK ++V  V L   C   +Q K+P+K+ DPGSF++PC+  G Y  +ALC+ GASIN++P S+ KKLD+GE+K T V LQ ADQS 
Subjt:  MPQYNRFMKEWLEKKRKEKKVDTVYLASTCCTRVQQKVPEKVADPGSFSVPCSF-GTYSFRALCELGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSV

Query:  VRPVGIVENVLIRLGRFFLPIDLYVMNMMENPTMPAILGRPFLATGRVIIDIERRELTVRVRNDKEIFKAVKD
         RP GI+ENVL+R+ +F  P+D  V+ M E    P ILGRPFLATGR IID+ + +L +RV  ++ IF   KD
Subjt:  VRPVGIVENVLIRLGRFFLPIDLYVMNMMENPTMPAILGRPFLATGRVIIDIERRELTVRVRNDKEIFKAVKD

XP_021678975.1 uncharacterized protein LOC110663841 [Hevea brasiliensis]5.8e-4248.09Show/hide
Query:  MPQYNRFMKEWLEKKRKEKKVDTVYLASTCCTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCELGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSV
        MP Y +F+KE L  K+K +  +TV L   C T +Q K+P K+ DPGSFS+PC  G  S  +ALC+LGAS++++PLS+C+KL +G++K TP+ LQLAD+S+
Subjt:  MPQYNRFMKEWLEKKRKEKKVDTVYLASTCCTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCELGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSV

Query:  VRPVGIVENVLIRLGRFFLPIDLYVMNMMENPTMPAILGRPFLATGRVIIDIERRELTVRVRNDKEIFKAVKDSKEEVLFMGY
          PVGI+ENV +++G FF+P+D  V+ M E+   P ILGRPFLAT R  ID++  +L ++VR ++  F   +DSKE  +   Y
Subjt:  VRPVGIVENVLIRLGRFFLPIDLYVMNMMENPTMPAILGRPFLATGRVIIDIERRELTVRVRNDKEIFKAVKDSKEEVLFMGY

XP_022889504.1 uncharacterized protein LOC111405046 [Olea europaea var. sylvestris]8.4e-4149.72Show/hide
Query:  MPQYNRFMKEWLEKKRKEKKVDTVYLASTCCTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCELGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSV
        MP Y +FMKE + KKRK ++ +TV L   C   +Q+K+P+K  DPGSF++PC+ G+ SF +ALC+LGASIN++PLS+ KKL +GE+K T V LQLAD+S+
Subjt:  MPQYNRFMKEWLEKKRKEKKVDTVYLASTCCTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCELGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSV

Query:  VRPVGIVENVLIRLGRFFLPIDLYVMNMMENPTMPAILGRPFLATGRVIIDIERRELTVRVRNDK---EIFKAVKDSKE
          P GI+E+VL+++ +F  P D  V++M E+  +P ILGRPFLATGR +ID++  +LT+RV  ++    I++A+K S +
Subjt:  VRPVGIVENVLIRLGRFFLPIDLYVMNMMENPTMPAILGRPFLATGRVIIDIERRELTVRVRNDK---EIFKAVKDSKE

XP_030497486.1 uncharacterized protein LOC115713139 [Cannabis sativa]4.9e-4150.3Show/hide
Query:  MPQYNRFMKEWLEKKRKEKKVDTVYLASTCCTRVQQKVPEKVADPGSFSVPCSF-GTYSFRALCELGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSV
        MP Y +FMKE L KKRK ++ + V L   C   +Q+K+P K+ DPGSF++PCS  G+   +ALC+LGASIN++PLS+ K+L +GE K T V LQ+AD+S+
Subjt:  MPQYNRFMKEWLEKKRKEKKVDTVYLASTCCTRVQQKVPEKVADPGSFSVPCSF-GTYSFRALCELGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSV

Query:  VRPVGIVENVLIRLGRFFLPIDLYVMNMMENPTMPAILGRPFLATGRVIIDIERRELTVRVRNDKEIFK
          P GI+E+VL+++G+F  P D  +++M E+  +P ILGRPFLATGR +ID+++ EL +RV+ ++E FK
Subjt:  VRPVGIVENVLIRLGRFFLPIDLYVMNMMENPTMPAILGRPFLATGRVIIDIERRELTVRVRNDKEIFK

TrEMBL top hitse value%identityAlignment
A0A0S3QWS7 Uncharacterized protein5.3e-4148.88Show/hide
Query:  MPQYNRFMKEWLEKKRKEKKVDTVYLASTCCTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCELGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSV
        MP Y +FMK+ L KKRK +  +T+ L   C   +QQK+P K+ DPGSF +PC  G  +  +ALC+LGASIN++PLS+ K+L IGE+K T + LQLAD+S+
Subjt:  MPQYNRFMKEWLEKKRKEKKVDTVYLASTCCTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCELGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSV

Query:  VRPVGIVENVLIRLGRFFLPIDLYVMNMMENPTMPAILGRPFLATGRVIIDIERRELTVRVRNDKEIFKAVKDSKEEV
          P GIVE+VL+++ +F  P D  V++M E+  +P ILGRPFLATGR +ID+E+ +L +RV N+K  F   +  K ++
Subjt:  VRPVGIVENVLIRLGRFFLPIDLYVMNMMENPTMPAILGRPFLATGRVIIDIERRELTVRVRNDKEIFKAVKDSKEEV

A0A1U7V076 uncharacterized protein LOC1042122873.1e-4152.38Show/hide
Query:  MPQYNRFMKEWLEKKRKEKKVDTVYLASTCCTRVQQKVPEKVADPGSFSVPCSF-GTYSFRALCELGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSV
        MP Y +F+KE L  KRK ++V  V L   C   +Q K+P+KV DPGSF+VPC+  G Y  +ALC+ GASIN++P+S+ KKLD+GEIK T + LQ ADQS 
Subjt:  MPQYNRFMKEWLEKKRKEKKVDTVYLASTCCTRVQQKVPEKVADPGSFSVPCSF-GTYSFRALCELGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSV

Query:  VRPVGIVENVLIRLGRFFLPIDLYVMNMMENPTMPAILGRPFLATGRVIIDIERRELTVRVRNDKEIF
         +P GI+ENVL+R+ +F  P+D  V+ M E P  P ILGR FLATGR IID+ + +L +RV  ++ IF
Subjt:  VRPVGIVENVLIRLGRFFLPIDLYVMNMMENPTMPAILGRPFLATGRVIIDIERRELTVRVRNDKEIF

A0A445BJN7 Asp_protease domain-containing protein1.5e-4047.78Show/hide
Query:  MPQYNRFMKEWLEKKRKEKKVDTVYLASTCCTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCELGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSV
        MP Y +FMKE L KKR  K+  TV +   C   +Q+K+P+K+ DPGSF +PC+ G     RA C+LGASIN++PLSL +KL I E+K T + LQ+AD+S+
Subjt:  MPQYNRFMKEWLEKKRKEKKVDTVYLASTCCTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCELGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSV

Query:  VRPVGIVENVLIRLGRFFLPIDLYVMNMMENPTMPAILGRPFLATGRVIIDIERRELTVRVRNDK---EIFKAVKDSKEE
         + +G+VENVL+++ +FFLP+D  ++++ E+   P ILGRPFLAT R +ID+E+ EL +RV ++     +FK ++DS +E
Subjt:  VRPVGIVENVLIRLGRFFLPIDLYVMNMMENPTMPAILGRPFLATGRVIIDIERRELTVRVRNDK---EIFKAVKDSKEE

A0A6P4BCZ7 uncharacterized protein LOC1074658171.2e-4047.78Show/hide
Query:  MPQYNRFMKEWLEKKRKEKKVDTVYLASTCCTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCELGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSV
        MP Y +FMKE L KKR  K+  TV +   C   +Q+K+P+K+ DPGSF +PC+ G     RA C+LGASIN++PLSL +KL I E+K T + LQ+AD+S+
Subjt:  MPQYNRFMKEWLEKKRKEKKVDTVYLASTCCTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCELGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSV

Query:  VRPVGIVENVLIRLGRFFLPIDLYVMNMMENPTMPAILGRPFLATGRVIIDIERRELTVRVRNDK---EIFKAVKDSKEE
         + +G+VENVL+++ +FFLP+D  ++++ E+   P ILGRPFLAT R +ID+E+ EL +RV ++     +FK ++DS +E
Subjt:  VRPVGIVENVLIRLGRFFLPIDLYVMNMMENPTMPAILGRPFLATGRVIIDIERRELTVRVRNDK---EIFKAVKDSKEE

A0A6P4DC57 uncharacterized protein LOC1074895951.2e-4047.78Show/hide
Query:  MPQYNRFMKEWLEKKRKEKKVDTVYLASTCCTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCELGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSV
        MP Y +FMKE L KKR  K+  TV +   C   +Q+K+P+K+ DPGSF +PC+ G     RA C+LGASIN++PLSL +KL I E+K T + LQ+AD+S+
Subjt:  MPQYNRFMKEWLEKKRKEKKVDTVYLASTCCTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCELGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSV

Query:  VRPVGIVENVLIRLGRFFLPIDLYVMNMMENPTMPAILGRPFLATGRVIIDIERRELTVRVRNDK---EIFKAVKDSKEE
         + +G+VENVL+++ +FFLP+D  ++++ E+   P ILGRPFLAT R +ID+E+ EL +RV ++     +FK ++DS +E
Subjt:  VRPVGIVENVLIRLGRFFLPIDLYVMNMMENPTMPAILGRPFLATGRVIIDIERRELTVRVRNDK---EIFKAVKDSKEE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCCCAATACAACAGGTTTATGAAGGAGTGGTTAGAAAAGAAGCGAAAGGAAAAGAAGGTTGACACTGTATATCTTGCTTCCACATGCTGCACCAGAGTACAACAGAA
GGTACCTGAAAAAGTAGCAGACCCAGGGAGTTTTTCTGTTCCTTGCAGTTTTGGTACTTACTCTTTTAGAGCATTATGTGAATTAGGTGCTAGCATTAATATTATTCCAT
TATCCTTGTGTAAAAAGTTAGACATAGGTGAGATTAAATCTACTCCTGTTAAGCTCCAATTGGCTGATCAGTCTGTGGTTAGACCAGTTGGCATTGTAGAAAATGTTTTA
ATCAGATTAGGTAGATTTTTCCTCCCTATTGATTTGTATGTTATGAATATGATGGAAAACCCTACAATGCCTGCTATACTAGGAAGACCATTCCTCGCCACGGGGCGAGT
GATTATTGATATTGAGCGCAGGGAGCTCACTGTGAGAGTCAGGAATGACAAAGAAATATTTAAAGCAGTTAAAGACTCTAAAGAGGAAGTGCTTTTCATGGGTTACAAGA
AAGGTGCAAGAAAGAGCACCTCTGTTGGATTCACAGAAAAGAAGCCTCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCCCAATACAACAGGTTTATGAAGGAGTGGTTAGAAAAGAAGCGAAAGGAAAAGAAGGTTGACACTGTATATCTTGCTTCCACATGCTGCACCAGAGTACAACAGAA
GGTACCTGAAAAAGTAGCAGACCCAGGGAGTTTTTCTGTTCCTTGCAGTTTTGGTACTTACTCTTTTAGAGCATTATGTGAATTAGGTGCTAGCATTAATATTATTCCAT
TATCCTTGTGTAAAAAGTTAGACATAGGTGAGATTAAATCTACTCCTGTTAAGCTCCAATTGGCTGATCAGTCTGTGGTTAGACCAGTTGGCATTGTAGAAAATGTTTTA
ATCAGATTAGGTAGATTTTTCCTCCCTATTGATTTGTATGTTATGAATATGATGGAAAACCCTACAATGCCTGCTATACTAGGAAGACCATTCCTCGCCACGGGGCGAGT
GATTATTGATATTGAGCGCAGGGAGCTCACTGTGAGAGTCAGGAATGACAAAGAAATATTTAAAGCAGTTAAAGACTCTAAAGAGGAAGTGCTTTTCATGGGTTACAAGA
AAGGTGCAAGAAAGAGCACCTCTGTTGGATTCACAGAAAAGAAGCCTCCTTGA
Protein sequenceShow/hide protein sequence
MPQYNRFMKEWLEKKRKEKKVDTVYLASTCCTRVQQKVPEKVADPGSFSVPCSFGTYSFRALCELGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSVVRPVGIVENVL
IRLGRFFLPIDLYVMNMMENPTMPAILGRPFLATGRVIIDIERRELTVRVRNDKEIFKAVKDSKEEVLFMGYKKGARKSTSVGFTEKKPP