; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0010514 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0010514
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRNA-directed DNA polymerase (reverse transcriptase)-related family protein
Genome locationchr1:327951..329755
RNA-Seq ExpressionLag0010514
SyntenyLag0010514
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_013617205.1 PREDICTED: uncharacterized protein LOC106323665 [Brassica oleracea var. oleracea]1.4e-3139.66Show/hide
Query:  STMARFWWGSTSTKKKIQWKKWADLCLPKELGGLNFRDLASFNKALLAKQVWRLFTILQLLASRVIHGRYAHQNQLLQAPIKTNCSVFWWGLVWARDLLL
        S M  FWW S + KKKI W  W  LC  KELGGL F+D+  FN+ALLAKQ WR+++  + L +R++  RY  +   L+  I T  S  W  ++  R+LL 
Subjt:  STMARFWWGSTSTKKKIQWKKWADLCLPKELGGLNFRDLASFNKALLAKQVWRLFTILQLLASRVIHGRYAHQNQLLQAPIKTNCSVFWWGLVWARDLLL

Query:  AGMRHRVGNGTTTDFFNDLWIPKEVTLMPMQIHEAIVPEGLEVADFITPSLG-WDLSKLRSFVVEEDTRLIATIPIIVA
         G+ H++G+G TT  ++D W+       PM   +AIV   L V+D I   +G W + ++R  +VEED   +   PI +A
Subjt:  AGMRHRVGNGTTTDFFNDLWIPKEVTLMPMQIHEAIVPEGLEVADFITPSLG-WDLSKLRSFVVEEDTRLIATIPIIVA

XP_022131662.1 uncharacterized protein LOC111004787 [Momordica charantia]3.3e-4148.26Show/hide
Query:  ARFWWGSTSTKKKIQWKKWADLCLPKELGGLNFRDLASFNKALLAKQVWRLFTILQLLASRVIHGRYAHQNQLLQAPIKTNCSVFWWGLVWARDLLLAGM
        ARFWWGS++  KK+ W  W  +CLPKELGGLNFRDL  FN+AL+AKQVWR+     +L SRV+  +Y H + +LQA    N S FW G +W RDLL+ G+
Subjt:  ARFWWGSTSTKKKIQWKKWADLCLPKELGGLNFRDLASFNKALLAKQVWRLFTILQLLASRVIHGRYAHQNQLLQAPIKTNCSVFWWGLVWARDLLLAGM

Query:  RHRVGNGTTTDFFNDLWIPKEVTLMPMQIHEAIVPEGLEVADFITPSLGWDLSKLRSFVVEEDTRLIATIPI
        R RVGNG+T + F+D WIP+  +  P  I     P  ++VAD I P+  WD+  +     EED  LI ++P+
Subjt:  RHRVGNGTTTDFFNDLWIPKEVTLMPMQIHEAIVPEGLEVADFITPSLGWDLSKLRSFVVEEDTRLIATIPI

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]3.7e-3243.48Show/hide
Query:  KKIQWKKWADLCLPKELGGLNFRDLASFNKALLAKQVWRLFTILQLLASRVIHGRYAHQNQLLQAPIKTNCSVFWWGLVWARDLLLAGMRHRVGNGTTTD
        +K+ W KW  +C PKE GGLNFRDL  FN+AL+AK VWR      LL S+V+  +Y     LLQA   +  S FW G +W RDLL+ G+R RVGNG+T  
Subjt:  KKIQWKKWADLCLPKELGGLNFRDLASFNKALLAKQVWRLFTILQLLASRVIHGRYAHQNQLLQAPIKTNCSVFWWGLVWARDLLLAGMRHRVGNGTTTD

Query:  FFNDLWIPKEVTLMPMQIHEAIVPEGLEVADFITPSLGWDLSKLRSFVVEEDTRLIATIPI
         F+D W+P+  T  P++ +   +     VA FIT    WD++ +      ED  LI ++PI
Subjt:  FFNDLWIPKEVTLMPMQIHEAIVPEGLEVADFITPSLGWDLSKLRSFVVEEDTRLIATIPI

XP_022568650.1 uncharacterized protein LOC106442391 [Brassica napus]8.2e-3239.23Show/hide
Query:  STMARFWWGSTSTKKKIQWKKWADLCLPKELGGLNFRDLASFNKALLAKQVWRLFTILQLLASRVIHGRYAHQNQLLQAPIKTNCSVFWWGLVWARDLLL
        S M  FWW S + KKKI W  W  LC  KELGGL F+D+  FN+ALLAKQ WR+++  + L +R++  RY  +   L+  I T  S  W  ++  R+LL 
Subjt:  STMARFWWGSTSTKKKIQWKKWADLCLPKELGGLNFRDLASFNKALLAKQVWRLFTILQLLASRVIHGRYAHQNQLLQAPIKTNCSVFWWGLVWARDLLL

Query:  AGMRHRVGNGTTTDFFNDLWIPKEVTLMPMQIHEAIVPEGLEVADFITPSLG-WDLSKLRSFVVEEDTRLIATIPIIVAEE
         G+ H++G+G TT  ++D W+       PM   +AIV   L V+D I   +G W + ++R  +VEED   +   PI +A +
Subjt:  AGMRHRVGNGTTTDFFNDLWIPKEVTLMPMQIHEAIVPEGLEVADFITPSLG-WDLSKLRSFVVEEDTRLIATIPIIVAEE

XP_030497600.1 uncharacterized protein LOC115713257 [Cannabis sativa]1.7e-3242.77Show/hide
Query:  MARFWWGSTSTKKKIQWKKWADLCLPKELGGLNFRDLASFNKALLAKQVWRLFTILQLLASRVIHGRYAHQNQLLQAPIKTNCSVFWWGLVWARDLLLAG
        MARFWWGS++  KKI WK W  LC  K  GGL FR    FN+A LAKQ WR+F     L SRV+ GRY HQN  + A +    S+ W G+VW R+LL  G
Subjt:  MARFWWGSTSTKKKIQWKKWADLCLPKELGGLNFRDLASFNKALLAKQVWRLFTILQLLASRVIHGRYAHQNQLLQAPIKTNCSVFWWGLVWARDLLLAG

Query:  MRHRVGNGTTTDFFNDLWIPKEVTLMPMQIHEAIVPEGLEVADFITPSLGWDLSKLRSFVVEEDTRLIATIPI
        +  ++G+GT  +  +D WIP      P++   +       VAD+IT +  WDL  L +     D   I TIP+
Subjt:  MRHRVGNGTTTDFFNDLWIPKEVTLMPMQIHEAIVPEGLEVADFITPSLGWDLSKLRSFVVEEDTRLIATIPI

TrEMBL top hitse value%identityAlignment
A0A6J1BRN0 uncharacterized protein LOC1110047871.6e-4148.26Show/hide
Query:  ARFWWGSTSTKKKIQWKKWADLCLPKELGGLNFRDLASFNKALLAKQVWRLFTILQLLASRVIHGRYAHQNQLLQAPIKTNCSVFWWGLVWARDLLLAGM
        ARFWWGS++  KK+ W  W  +CLPKELGGLNFRDL  FN+AL+AKQVWR+     +L SRV+  +Y H + +LQA    N S FW G +W RDLL+ G+
Subjt:  ARFWWGSTSTKKKIQWKKWADLCLPKELGGLNFRDLASFNKALLAKQVWRLFTILQLLASRVIHGRYAHQNQLLQAPIKTNCSVFWWGLVWARDLLLAGM

Query:  RHRVGNGTTTDFFNDLWIPKEVTLMPMQIHEAIVPEGLEVADFITPSLGWDLSKLRSFVVEEDTRLIATIPI
        R RVGNG+T + F+D WIP+  +  P  I     P  ++VAD I P+  WD+  +     EED  LI ++P+
Subjt:  RHRVGNGTTTDFFNDLWIPKEVTLMPMQIHEAIVPEGLEVADFITPSLGWDLSKLRSFVVEEDTRLIATIPI

A0A6J1DX30 uncharacterized protein LOC1110248741.8e-3243.48Show/hide
Query:  KKIQWKKWADLCLPKELGGLNFRDLASFNKALLAKQVWRLFTILQLLASRVIHGRYAHQNQLLQAPIKTNCSVFWWGLVWARDLLLAGMRHRVGNGTTTD
        +K+ W KW  +C PKE GGLNFRDL  FN+AL+AK VWR      LL S+V+  +Y     LLQA   +  S FW G +W RDLL+ G+R RVGNG+T  
Subjt:  KKIQWKKWADLCLPKELGGLNFRDLASFNKALLAKQVWRLFTILQLLASRVIHGRYAHQNQLLQAPIKTNCSVFWWGLVWARDLLLAGMRHRVGNGTTTD

Query:  FFNDLWIPKEVTLMPMQIHEAIVPEGLEVADFITPSLGWDLSKLRSFVVEEDTRLIATIPI
         F+D W+P+  T  P++ +   +     VA FIT    WD++ +      ED  LI ++PI
Subjt:  FFNDLWIPKEVTLMPMQIHEAIVPEGLEVADFITPSLGWDLSKLRSFVVEEDTRLIATIPI

A0A803PIB6 Uncharacterized protein8.0e-3342.77Show/hide
Query:  MARFWWGSTSTKKKIQWKKWADLCLPKELGGLNFRDLASFNKALLAKQVWRLFTILQLLASRVIHGRYAHQNQLLQAPIKTNCSVFWWGLVWARDLLLAG
        MARFWWGS++  KKI WK W  LC  K  GGL FR    FN+A LAKQ WR+F     L SRV+ GRY HQN  + A +    S+ W G+VW R+LL  G
Subjt:  MARFWWGSTSTKKKIQWKKWADLCLPKELGGLNFRDLASFNKALLAKQVWRLFTILQLLASRVIHGRYAHQNQLLQAPIKTNCSVFWWGLVWARDLLLAG

Query:  MRHRVGNGTTTDFFNDLWIPKEVTLMPMQIHEAIVPEGLEVADFITPSLGWDLSKLRSFVVEEDTRLIATIPI
        +  ++G+GT  +  +D WIP      P++   +       VAD+IT +  WDL  L +     D   I TIP+
Subjt:  MRHRVGNGTTTDFFNDLWIPKEVTLMPMQIHEAIVPEGLEVADFITPSLGWDLSKLRSFVVEEDTRLIATIPI

A0A803PKJ2 Uncharacterized protein4.0e-3242.86Show/hide
Query:  STMARFWWGSTSTKKKIQWKKWADLCLPKELGGLNFRDLASFNKALLAKQVWRLFTILQLLASRVIHGRYAHQNQLLQAPIKTNCSVFWWGLVWARDLLL
        S MARFWWGS++  KKI WK W  LC  K  GGL FR    FN+A LAKQ WR+F     L SRV+ GRY H N  L A      S+ W G++W R+LL 
Subjt:  STMARFWWGSTSTKKKIQWKKWADLCLPKELGGLNFRDLASFNKALLAKQVWRLFTILQLLASRVIHGRYAHQNQLLQAPIKTNCSVFWWGLVWARDLLL

Query:  AGMRHRVGNGTTTDFFNDLWIPKEVTLMPMQIHEAIVPEGLEVADFITPSLGWDLSKLRSFVVEEDTRLIATIPI
         G+R ++G GT     ND WIP      P Q      P    VA +IT +  W+   L       D   I TIP+
Subjt:  AGMRHRVGNGTTTDFFNDLWIPKEVTLMPMQIHEAIVPEGLEVADFITPSLGWDLSKLRSFVVEEDTRLIATIPI

A0A803PTB0 Uncharacterized protein9.4e-3443.53Show/hide
Query:  ARFWWGSTSTKKKIQWKKWADLCLPKELGGLNFRDLASFNKALLAKQVWRLFTILQLLASRVIHGRYAHQNQLLQAPIKTNCSVFWWGLVWARDLLLAGM
        ARFWWGS+  KKK  W  W+ LCLPKE GGL F+DL  FNKALLAKQVWR+      L  +V+   Y   + +L A   +  S  W GL+W R+++ AG 
Subjt:  ARFWWGSTSTKKKIQWKKWADLCLPKELGGLNFRDLASFNKALLAKQVWRLFTILQLLASRVIHGRYAHQNQLLQAPIKTNCSVFWWGLVWARDLLLAGM

Query:  RHRVGNGTTTDFFNDLWIPKEVTLMPMQIHEAIVPEGLEVADFITPSLGWDLSKLRSFVVEEDTRLIATI
        R RVG+G   D  ND W+P+ V   P  I    +PEG +V D    +  WD   ++    E+D  LI +I
Subjt:  RHRVGNGTTTDFFNDLWIPKEVTLMPMQIHEAIVPEGLEVADFITPSLGWDLSKLRSFVVEEDTRLIATI

SwissProt top hitse value%identityAlignment
P93295 Uncharacterized mitochondrial protein AtMg003104.7e-2240Show/hide
Query:  SILLCN--ISTMARFWWGSTSTKKKIQWKKWADLCLPKE-LGGLNFRDLASFNKALLAKQVWRLFTILQLLASRVIHGRYAHQNQLLQAPIKTNCSVFWW
        S LLC    S M  FWW S   K+KI W  W  LC  KE  GGL FRDL  FN+ALLAKQ +R+      L SR++  RY   + +++  + T  S  W 
Subjt:  SILLCN--ISTMARFWWGSTSTKKKIQWKKWADLCLPKE-LGGLNFRDLASFNKALLAKQVWRLFTILQLLASRVIHGRYAHQNQLLQAPIKTNCSVFWW

Query:  GLVWARDLLLAGMRHRVGNGTTTDFFNDLWIPKEVTLMPM
         ++  R+LL  G+   +G+G  T  + D WI  E  L P+
Subjt:  GLVWARDLLLAGMRHRVGNGTTTDFFNDLWIPKEVTLMPM

Arabidopsis top hitse value%identityAlignment
AT4G29090.1 Ribonuclease H-like superfamily protein2.0e-2030.94Show/hide
Query:  ISTMARFWWGSTSTKKKIQWKKWADLCLPKELGGLNFRDLASFNKALLAKQVWRLFTILQLLASRVIHGRYAHQNQLLQAPIKTNCSVFWWGLVWARDLL
        IS +A FWW +    K + WK W  L   K  GG+ F+D+ +FN ALL KQ+WR+ +  + L ++V   RY H++  L AP+ +  S  W  +  ++++L
Subjt:  ISTMARFWWGSTSTKKKIQWKKWADLCLPKELGGLNFRDLASFNKALLAKQVWRLFTILQLLASRVIHGRYAHQNQLLQAPIKTNCSVFWWGLVWARDLL

Query:  LAGMRHRVGNGTTTDFFNDLWI---PKEVTLMPMQI---HEAIVPEGLEVADFITPS-LGWDLSKLRSFVVEEDTRLIATI
          G R  VGNG     +   W+   P    L   ++     A V   L+V+D I  S   W    +     E + +LI  +
Subjt:  LAGMRHRVGNGTTTDFFNDLWI---PKEVTLMPMQI---HEAIVPEGLEVADFITPS-LGWDLSKLRSFVVEEDTRLIATI

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein3.3e-2340Show/hide
Query:  SILLCN--ISTMARFWWGSTSTKKKIQWKKWADLCLPKE-LGGLNFRDLASFNKALLAKQVWRLFTILQLLASRVIHGRYAHQNQLLQAPIKTNCSVFWW
        S LLC    S M  FWW S   K+KI W  W  LC  KE  GGL FRDL  FN+ALLAKQ +R+      L SR++  RY   + +++  + T  S  W 
Subjt:  SILLCN--ISTMARFWWGSTSTKKKIQWKKWADLCLPKE-LGGLNFRDLASFNKALLAKQVWRLFTILQLLASRVIHGRYAHQNQLLQAPIKTNCSVFWW

Query:  GLVWARDLLLAGMRHRVGNGTTTDFFNDLWIPKEVTLMPM
         ++  R+LL  G+   +G+G  T  + D WI  E  L P+
Subjt:  GLVWARDLLLAGMRHRVGNGTTTDFFNDLWIPKEVTLMPM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGCCTTTTGGAGTGAACCAAAAGCAAAGCCATGAGGGTTATGCCCAAAGTGGACAATATCATACGATTGTGGAGATATGTGTAGTGTCCCGTAATTCCTTATCATC
TCTTATCAGAGCATACTCTTATTCTCTCAAAAACTCACTGCCGCACGATCAAGTGGCTCGAATGCTTAGAATTAAGGAGAAAGTTGAAGATGACCTTCGAAAATTTCATA
ATTTGTATCCGAGAGCCAAAGAGATGGGATTTGGTCGATTTTTTTTATCTCCAACTCTTTTTGAAGATGCGGTGAAGTCCATCCTTCTATGCAATATCTCAACGATGGCA
CGCTTTTGGTGGGGATCCACCTCTACCAAAAAGAAAATACAATGGAAGAAATGGGCGGATTTATGCCTTCCTAAAGAATTAGGGGGGTTAAATTTTAGAGATCTGGCAAG
CTTTAACAAGGCACTATTAGCTAAACAGGTGTGGCGACTTTTCACTATCCTCCAATTATTGGCGTCGAGAGTCATTCATGGTAGATATGCACACCAAAACCAGTTGTTGC
AAGCTCCAATCAAAACAAATTGTTCCGTCTTTTGGTGGGGTTTGGTGTGGGCTCGAGATCTATTACTAGCTGGCATGCGACATCGAGTGGGAAATGGTACAACTACAGAT
TTCTTTAACGATCTCTGGATCCCAAAAGAAGTAACACTCATGCCTATGCAAATTCATGAAGCGATAGTACCGGAAGGGTTGGAGGTGGCCGACTTTATCACCCCGTCTTT
AGGATGGGATTTGAGCAAACTTAGAAGTTTCGTGGTTGAAGAGGACACAAGGCTTATAGCAACAATTCCAATAATTGTTGCAGAAGAAATGATAAATGGATATGGCACTA
TACGTCTACAGGAGAGTATATAG
mRNA sequenceShow/hide mRNA sequence
ATGAGGCCTTTTGGAGTGAACCAAAAGCAAAGCCATGAGGGTTATGCCCAAAGTGGACAATATCATACGATTGTGGAGATATGTGTAGTGTCCCGTAATTCCTTATCATC
TCTTATCAGAGCATACTCTTATTCTCTCAAAAACTCACTGCCGCACGATCAAGTGGCTCGAATGCTTAGAATTAAGGAGAAAGTTGAAGATGACCTTCGAAAATTTCATA
ATTTGTATCCGAGAGCCAAAGAGATGGGATTTGGTCGATTTTTTTTATCTCCAACTCTTTTTGAAGATGCGGTGAAGTCCATCCTTCTATGCAATATCTCAACGATGGCA
CGCTTTTGGTGGGGATCCACCTCTACCAAAAAGAAAATACAATGGAAGAAATGGGCGGATTTATGCCTTCCTAAAGAATTAGGGGGGTTAAATTTTAGAGATCTGGCAAG
CTTTAACAAGGCACTATTAGCTAAACAGGTGTGGCGACTTTTCACTATCCTCCAATTATTGGCGTCGAGAGTCATTCATGGTAGATATGCACACCAAAACCAGTTGTTGC
AAGCTCCAATCAAAACAAATTGTTCCGTCTTTTGGTGGGGTTTGGTGTGGGCTCGAGATCTATTACTAGCTGGCATGCGACATCGAGTGGGAAATGGTACAACTACAGAT
TTCTTTAACGATCTCTGGATCCCAAAAGAAGTAACACTCATGCCTATGCAAATTCATGAAGCGATAGTACCGGAAGGGTTGGAGGTGGCCGACTTTATCACCCCGTCTTT
AGGATGGGATTTGAGCAAACTTAGAAGTTTCGTGGTTGAAGAGGACACAAGGCTTATAGCAACAATTCCAATAATTGTTGCAGAAGAAATGATAAATGGATATGGCACTA
TACGTCTACAGGAGAGTATATAG
Protein sequenceShow/hide protein sequence
MRPFGVNQKQSHEGYAQSGQYHTIVEICVVSRNSLSSLIRAYSYSLKNSLPHDQVARMLRIKEKVEDDLRKFHNLYPRAKEMGFGRFFLSPTLFEDAVKSILLCNISTMA
RFWWGSTSTKKKIQWKKWADLCLPKELGGLNFRDLASFNKALLAKQVWRLFTILQLLASRVIHGRYAHQNQLLQAPIKTNCSVFWWGLVWARDLLLAGMRHRVGNGTTTD
FFNDLWIPKEVTLMPMQIHEAIVPEGLEVADFITPSLGWDLSKLRSFVVEEDTRLIATIPIIVAEEMINGYGTIRLQESI