; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022512 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022512
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr7:31375080..31376342
RNA-Seq ExpressionLag0022512
SyntenyLag0022512
Gene Ontology termsNA
InterPro domainsIPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF5449841.1 hypothetical protein F2P56_030246 [Juglans regia]9.2e-2648.72Show/hide
Query:  RETKVHSSRFDGLKVRLGFANCFCVESNGRSGGLVLLWSMEVGFSLLSFSPNHIDGWI-DWDFKQWRFTGIYGHPQAELKSRTWSLLKSIRGYGSTPWLV
        +ETK+ +      K + GF NCF V+  GRSGGL LLW  E+  S+ SFS  HID  I + D  +WRFTG+YGHP A  +  TW+L++++    S PWLV
Subjt:  RETKVHSSRFDGLKVRLGFANCFCVESNGRSGGLVLLWSMEVGFSLLSFSPNHIDGWI-DWDFKQWRFTGIYGHPQAELKSRTWSLLKSIRGYGSTPWLV

Query:  GGDFNAIMFQYEKEGGR
        GGDFN +++ +EK GGR
Subjt:  GGDFNAIMFQYEKEGGR

KAG6636592.1 hypothetical protein CIPAW_11G121200 [Carya illinoinensis]4.6e-2548.36Show/hide
Query:  DGLKVRLGFANCFCVESNGRSGGLVLLWSMEVGFSLLSFSPNHIDGWIDW-DFKQWRFTGIYGHPQAELKSRTWSLLKSIRGYGSTPWLVGGDFNAIMFQ
        D LK +LGF NCF V+S GRSGGL LLW+ ++G  L SFS  HID  I   D   WRFTG+YGHP    ++ TW+L++++      PWLVGGD N ++  
Subjt:  DGLKVRLGFANCFCVESNGRSGGLVLLWSMEVGFSLLSFSPNHIDGWIDW-DFKQWRFTGIYGHPQAELKSRTWSLLKSIRGYGSTPWLVGGDFNAIMFQ

Query:  YEKEGGRRSLMLRGEASGKRLT
        +EK GGR   + + EA  + LT
Subjt:  YEKEGGRRSLMLRGEASGKRLT

XP_042950313.1 uncharacterized protein LOC122282426 [Carya illinoinensis]6.4e-2746.97Show/hide
Query:  RETKVHSSRFDGLKVRLGFANCFCVESNGRSGGLVLLWSMEVGFSLLSFSPNHIDGWIDW-DFKQWRFTGIYGHPQAELKSRTWSLLKSIRGYGSTPWLV
        +ETK+ +   D LK +LGF NCF V+S GRSGGL LLW+ ++G  L SFS  HID  I   D   WRFTG+YGHP    ++ TW+L++++      PWLV
Subjt:  RETKVHSSRFDGLKVRLGFANCFCVESNGRSGGLVLLWSMEVGFSLLSFSPNHIDGWIDW-DFKQWRFTGIYGHPQAELKSRTWSLLKSIRGYGSTPWLV

Query:  GGDFNAIMFQYEKEGGRRSLMLRGEASGKRLT
        GGD N ++  +EK GGR   + + EA  + LT
Subjt:  GGDFNAIMFQYEKEGGRRSLMLRGEASGKRLT

XP_042980077.1 uncharacterized protein LOC122310261 [Carya illinoinensis]1.2e-2546.97Show/hide
Query:  RETKVHSSRFDGLKVRLGFANCFCVESNGRSGGLVLLWSMEVGFSLLSFSPNHIDGWIDW-DFKQWRFTGIYGHPQAELKSRTWSLLKSIRGYGSTPWLV
        +ETK+ +   D LK +LGF NCF V+S GRSGGL LLW+ ++   L SFS  HID  I   D   WRFTG+YGHP A  ++ TW+L++++      PWLV
Subjt:  RETKVHSSRFDGLKVRLGFANCFCVESNGRSGGLVLLWSMEVGFSLLSFSPNHIDGWIDW-DFKQWRFTGIYGHPQAELKSRTWSLLKSIRGYGSTPWLV

Query:  GGDFNAIMFQYEKEGGRRSLMLRGEASGKRLT
        GGD N ++  +EK GGR   + + EA  + LT
Subjt:  GGDFNAIMFQYEKEGGRRSLMLRGEASGKRLT

XP_042988712.1 uncharacterized protein LOC122316247 [Carya illinoinensis]7.1e-2648.74Show/hide
Query:  RETKVHSSRFDGLKVRLGFANCFCVESNGRSGGLVLLWSMEVGFSLLSFSPNHIDGWI---DWDFKQWRFTGIYGHPQAELKSRTWSLLKSIRGYGSTPW
        +ETK+H ++ + +K  LG+  CF V S GRSGGL L+W  E   ++ S+S NHID  I   + D  QW+FTG+YGHP  EL+  TW+ ++S+RG  S PW
Subjt:  RETKVHSSRFDGLKVRLGFANCFCVESNGRSGGLVLLWSMEVGFSLLSFSPNHIDGWI---DWDFKQWRFTGIYGHPQAELKSRTWSLLKSIRGYGSTPW

Query:  LVGGDFNAIMFQYEKEGGR
        LV GDFN ++   EK GGR
Subjt:  LVGGDFNAIMFQYEKEGGR

TrEMBL top hitse value%identityAlignment
A0A2I4EB36 uncharacterized protein LOC1089879802.7e-2348.6Show/hide
Query:  KVRLGFANCFCVESNGRSGGLVLLWSMEVGFSLLSFSPNHIDGWIDWDFKQWRFTGIYGHPQAELKSRTWSLLKSIRGYGSTPWLVGGDFNAIMFQYEKE
        +V+L FANC  V+++G SGGL LLW  +V   ++S+S +HID W++ D KQW  T IYG P+A  +  TW+L++ +    S PWLV GDFN I  Q EK 
Subjt:  KVRLGFANCFCVESNGRSGGLVLLWSMEVGFSLLSFSPNHIDGWIDWDFKQWRFTGIYGHPQAELKSRTWSLLKSIRGYGSTPWLVGGDFNAIMFQYEKE

Query:  GGRRSLM
        GGR  L+
Subjt:  GGRRSLM

A0A803PHH5 Uncharacterized protein2.4e-2447.83Show/hide
Query:  ETKVHSSRFDGLKVRLGFANCFCVESNGRSGGLVLLWSMEVGFSLLSFSPNHIDGWIDWDFKQ-WRFTGIYGHPQAELKSRTWSLLKSIRGYGSTPWLVG
        E++++  R + L+V LGF  CF VE+ G+SGGLVLLWS ++  S+LSFS  HID +I  +  Q WRFTG YG P    +  +W+LL  +    + PW++G
Subjt:  ETKVHSSRFDGLKVRLGFANCFCVESNGRSGGLVLLWSMEVGFSLLSFSPNHIDGWIDWDFKQ-WRFTGIYGHPQAELKSRTWSLLKSIRGYGSTPWLVG

Query:  GDFNAIMFQYEKEGG
        GDFN I+ Q EK+GG
Subjt:  GDFNAIMFQYEKEGG

A0A803PRV5 Uncharacterized protein4.2e-2448.7Show/hide
Query:  ETKVHSSRFDGLKVRLGFANCFCVESNGRSGGLVLLWSMEVGFSLLSFSPNHIDGWIDWDFKQ-WRFTGIYGHPQAELKSRTWSLLKSIRGYGSTPWLVG
        E+K ++ R + L+V+LGF  CF VE+ G+SGGL+LLWSM++   +LS+S  HID +I  +  Q WRFTG YG P    +  +W LLK +    + PW+VG
Subjt:  ETKVHSSRFDGLKVRLGFANCFCVESNGRSGGLVLLWSMEVGFSLLSFSPNHIDGWIDWDFKQ-WRFTGIYGHPQAELKSRTWSLLKSIRGYGSTPWLVG

Query:  GDFNAIMFQYEKEGG
        GDFN I+ Q EK GG
Subjt:  GDFNAIMFQYEKEGG

A0A803PTM0 Uncharacterized protein1.3e-2551.3Show/hide
Query:  ETKVHSSRFDGLKVRLGFANCFCVESNGRSGGLVLLWSMEVGFSLLSFSPNHIDGWIDWDFKQ-WRFTGIYGHPQAELKSRTWSLLKSIRGYGSTPWLVG
        E+K+   R + L+V+LGF  CF VE+ G+SGGL+LLWSM+V   +LSFSP HID ++  +  Q WRFTG YG P    +  +W LL+ I    S PW VG
Subjt:  ETKVHSSRFDGLKVRLGFANCFCVESNGRSGGLVLLWSMEVGFSLLSFSPNHIDGWIDWDFKQ-WRFTGIYGHPQAELKSRTWSLLKSIRGYGSTPWLVG

Query:  GDFNAIMFQYEKEGG
        GDFN I+ Q EK GG
Subjt:  GDFNAIMFQYEKEGG

F4NCJ0 Reverse transcriptase domain-containing protein5.5e-2452.63Show/hide
Query:  ETKVHSSRFDGLKVRLGFANCFCVESNGRSGGLVLLWSMEVGFSLLSFSPNHIDGWIDWDFKQWRFTGIYGHPQAELKSRTWSLLKSIRGYGSTPWLVGG
        ET ++ +  + LK RLGFAN F V S GR+GGL + W  E+ FSL+SFS +HI G ID   K+WRF GIYG  + E K  TWSL++ +    S P L+GG
Subjt:  ETKVHSSRFDGLKVRLGFANCFCVESNGRSGGLVLLWSMEVGFSLLSFSPNHIDGWIDWDFKQWRFTGIYGHPQAELKSRTWSLLKSIRGYGSTPWLVGG

Query:  DFNAIMFQYEKEGG
        DFN IM   EKEGG
Subjt:  DFNAIMFQYEKEGG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCAGTATGGGGTGGTGTTGGGAGAAGCCAACTGACGTGGGCTGGGGGGGGAAGTCGACCGATGAAGGAGAGGGAGCAGAGTGGGCAGGAGGAGGGGAAGTCCTCTC
GCAAGCTGGAGTTTCAGGTGGGACGGGGTTAGCTGGGGTTGAGATCACATCCGAGGGGAAGAGTGAGGGGAAAGAGAGCGGGGGGAAGAGTGATGGGAAAGAGAGCGGGG
GAAGAGTGGAGGGATGGAGAATAAGTGTTGGGTTGAGGGAAGGGGTGGAGCGGGCGAGTGGCCAGAAAGGGAGGAGTTGGAAGAGGCGGGCACGGGTCGCTTTAGCTGAC
TTAACCAACGATGAGGGACAGGTGGGTAGGTTGAAGGGTAAGAGAAAGGCTGGGGGTGACTCTGGGGCCCAAGAAGGCAAAGTCGGGGAGTGGCATTGTGCTGATGGAGG
GCAACGAGAAACAAAGGTGCACTCCTCGCGGTTTGATGGTTTGAAGGTGAGGTTGGGCTTTGCCAACTGCTTCTGTGTTGAAAGTAATGGTAGGAGTGGGGGCCTTGTTT
TGCTATGGAGTATGGAGGTGGGTTTCAGTTTACTATCTTTCTCTCCGAACCACATCGATGGATGGATCGACTGGGATTTTAAGCAGTGGAGGTTCACGGGCATTTACGGG
CATCCTCAAGCAGAGCTTAAATCCAGGACATGGTCTTTGTTGAAATCTATCCGTGGCTATGGGTCTACGCCTTGGCTGGTGGGTGGTGATTTTAATGCAATCATGTTCCA
GTATGAGAAGGAAGGAGGTAGGAGAAGCCTGATGCTAAGAGGAGAGGCTTCAGGGAAGCGGCTGACGTGTGTGGTCTGGGTGATCTTGGTTTTATCGAGGATCTGTTTAC
TTGGTGGAATGGCCGATCGGGGAATGAGACGGTCTGGGAGAGAATTGATAGATATTTTGGAAGCGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGCAGTATGGGGTGGTGTTGGGAGAAGCCAACTGACGTGGGCTGGGGGGGGAAGTCGACCGATGAAGGAGAGGGAGCAGAGTGGGCAGGAGGAGGGGAAGTCCTCTC
GCAAGCTGGAGTTTCAGGTGGGACGGGGTTAGCTGGGGTTGAGATCACATCCGAGGGGAAGAGTGAGGGGAAAGAGAGCGGGGGGAAGAGTGATGGGAAAGAGAGCGGGG
GAAGAGTGGAGGGATGGAGAATAAGTGTTGGGTTGAGGGAAGGGGTGGAGCGGGCGAGTGGCCAGAAAGGGAGGAGTTGGAAGAGGCGGGCACGGGTCGCTTTAGCTGAC
TTAACCAACGATGAGGGACAGGTGGGTAGGTTGAAGGGTAAGAGAAAGGCTGGGGGTGACTCTGGGGCCCAAGAAGGCAAAGTCGGGGAGTGGCATTGTGCTGATGGAGG
GCAACGAGAAACAAAGGTGCACTCCTCGCGGTTTGATGGTTTGAAGGTGAGGTTGGGCTTTGCCAACTGCTTCTGTGTTGAAAGTAATGGTAGGAGTGGGGGCCTTGTTT
TGCTATGGAGTATGGAGGTGGGTTTCAGTTTACTATCTTTCTCTCCGAACCACATCGATGGATGGATCGACTGGGATTTTAAGCAGTGGAGGTTCACGGGCATTTACGGG
CATCCTCAAGCAGAGCTTAAATCCAGGACATGGTCTTTGTTGAAATCTATCCGTGGCTATGGGTCTACGCCTTGGCTGGTGGGTGGTGATTTTAATGCAATCATGTTCCA
GTATGAGAAGGAAGGAGGTAGGAGAAGCCTGATGCTAAGAGGAGAGGCTTCAGGGAAGCGGCTGACGTGTGTGGTCTGGGTGATCTTGGTTTTATCGAGGATCTGTTTAC
TTGGTGGAATGGCCGATCGGGGAATGAGACGGTCTGGGAGAGAATTGATAGATATTTTGGAAGCGTAG
Protein sequenceShow/hide protein sequence
MGSMGWCWEKPTDVGWGGKSTDEGEGAEWAGGGEVLSQAGVSGGTGLAGVEITSEGKSEGKESGGKSDGKESGGRVEGWRISVGLREGVERASGQKGRSWKRRARVALAD
LTNDEGQVGRLKGKRKAGGDSGAQEGKVGEWHCADGGQRETKVHSSRFDGLKVRLGFANCFCVESNGRSGGLVLLWSMEVGFSLLSFSPNHIDGWIDWDFKQWRFTGIYG
HPQAELKSRTWSLLKSIRGYGSTPWLVGGDFNAIMFQYEKEGGRRSLMLRGEASGKRLTCVVWVILVLSRICLLGGMADRGMRRSGRELIDILEA