CuGenDBv2

Lag0018783 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0018783
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr5:34271729..34272312
RNA-Seq Expression: Lag0018783
Synteny: Lag0018783
Gene Ontology terms: GO:0003824 - catalytic activity (molecular function)
InterPro domains: IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
VFQ69070.1 unnamed protein product [Cuscuta campestris]; e value 1.2e-29; identity 45.75%
Query:  MSILIWNVRGLGSTRAFRRLCKLVQQHRPQMVFLTETKVHSKRFDLLKIRLGFANCFSVDSRGRSGGLALLWGLDIRFNLLSYSPNHIDGWVDEE-VGPW
        MS+L WN RGLG+ +  + L  LV   RP +VFL ETK   K+ + +K +LGF + F++DS GRSGGLAL W  + + +L+SYS NH+D  VD E  GPW
Subjt:  MSILIWNVRGLGSTRAFRRLCKLVQQHRPQMVFLTETKVHSKRFDLLKIRLGFANCFSVDSRGRSGGLALLWGLDIRFNLLSYSPNHIDGWVDEE-VGPW

Query:  RFSGIYGHPQAECKAMTWSLLKHLRGSSDMPWLAGATLMGSY--ITMRRRAEG
        RF+G YGHP+   +  +W+LLK L+  S +PW+    +MG +  I + R  +G
Subjt:  RFSGIYGHPQAECKAMTWSLLKHLRGSSDMPWLAGATLMGSY--ITMRRRAEG

XP_006487889.1 uncharacterized protein LOC102617714 [Citrus sinensis]; e value 1.5e-30; identity 50.38%
Query:  MSILIWNVRGLGSTRAFRRLCKLVQQHRPQMVFLTETKVHSKRFDLLKIRLGFANCFSVDSRGRSGGLALLWGLDIRFNLLSYSPNHIDGWVDEEVG-PW
        M I+ WNV+GLG +R FR   KL+Q+ RPQ++FL+ETK+ +K+ +  +  L F NCF VD  G  GGLALLW LD+   + SYS +HID  +  E G  W
Subjt:  MSILIWNVRGLGSTRAFRRLCKLVQQHRPQMVFLTETKVHSKRFDLLKIRLGFANCFSVDSRGRSGGLALLWGLDIRFNLLSYSPNHIDGWVDEEVG-PW

Query:  RFSGIYGHPQAECKAMTWSLLKHLRGSSDMPWL
        R + +YGHP++E K  TWSLL+ L G S +PWL
Subjt:  RFSGIYGHPQAECKAMTWSLLKHLRGSSDMPWL

XP_042950313.1 uncharacterized protein LOC122282426 [Carya illinoinensis]; e value 3.6e-29; identity 45.65%
Query:  MSILIWNVRGLGSTRAFRRLCKLVQQHRPQMVFLTETKVHSKRFDLLKIRLGFANCFSVDSRGRSGGLALLWGLDIRFNLLSYSPNHIDGWVD-EEVGPW
        M  + WN RGLG+    R L  L+ +  P ++FL ETK+ +K  D LK +LGF NCFSVDS GRSGGL LLW  D+   L S+S  HID  +  +++  W
Subjt:  MSILIWNVRGLGSTRAFRRLCKLVQQHRPQMVFLTETKVHSKRFDLLKIRLGFANCFSVDSRGRSGGLALLWGLDIRFNLLSYSPNHIDGWVD-EEVGPW

Query:  RFSGIYGHPQAECKAMTWSLLKHLRGSSDMPWLAGATL
        RF+G+YGHP    +  TW+L++ L     +PWL G  L
Subjt:  RFSGIYGHPQAECKAMTWSLLKHLRGSSDMPWLAGATL

XP_042980077.1 uncharacterized protein LOC122310261 [Carya illinoinensis]; e value 1.4e-28; identity 46.38%
Query:  MSILIWNVRGLGSTRAFRRLCKLVQQHRPQMVFLTETKVHSKRFDLLKIRLGFANCFSVDSRGRSGGLALLWGLDIRFNLLSYSPNHIDGWVD-EEVGPW
        M  + WN  GLG+    R L  L+ +  P ++FL ETK+ +K  D LK +LGF NCFSVDS GRSGGL LLW  D+R  L S+S  HID  +  +++  W
Subjt:  MSILIWNVRGLGSTRAFRRLCKLVQQHRPQMVFLTETKVHSKRFDLLKIRLGFANCFSVDSRGRSGGLALLWGLDIRFNLLSYSPNHIDGWVD-EEVGPW

Query:  RFSGIYGHPQAECKAMTWSLLKHLRGSSDMPWLAGATL
        RF+G+YGHP A  +  TW+L++ L     +PWL G  L
Subjt:  RFSGIYGHPQAECKAMTWSLLKHLRGSSDMPWLAGATL

XP_042988712.1 uncharacterized protein LOC122316247 [Carya illinoinensis]; e value 2.3e-28; identity 38.92%
Query:  MSILIWNVRGLGSTRAFRRLCKLVQQHRPQMVFLTETKVHSKRFDLLKIRLGFANCFSVDSRGRSGGLALLWGLDIRFNLLSYSPNHIDGWVD--EEVGP
        M  + WN RGLG+ R  R L  LV++  P ++FL ETK+H  + + +K  LG+  CF+V S GRSGGLAL+W  +   N+ SYS NHID  +   E  G 
Subjt:  MSILIWNVRGLGSTRAFRRLCKLVQQHRPQMVFLTETKVHSKRFDLLKIRLGFANCFSVDSRGRSGGLALLWGLDIRFNLLSYSPNHIDGWVD--EEVGP

Query:  WRFSGIYGHPQAECKAMTWSLLKHLRGSSDMPWLAGATLMGSYITMRRRAEGQKVSLNWRSGRKLFE
        W+F+G+YGHP  E +  TW+ ++ LRG+  +PWL             +R    +     R+ R+L +
Subjt:  WRFSGIYGHPQAECKAMTWSLLKHLRGSSDMPWLAGATLMGSYITMRRRAEGQKVSLNWRSGRKLFE

TrEMBL top hits (e value, % identity, alignment)
A0A2I4EA22 uncharacterized protein LOC108987722; e value 1.4e-26; identity 40.74%
Query:  MSILIWNVRGLGSTRAFRRLCKLVQQHRPQMVFLTETKVHSKRFDLLKIRLGFANCFSVDSRGRSGGLALLWGLDIRFNLLSYSPNHIDGWVDE---EVG
        M I  WN RGLG+ R  R LC L+Q+  P ++FL ET++ ++  +  K +LGF NC ++ S GR GG+ALLW ++I  ++++YS NH+D  + +     G
Subjt:  MSILIWNVRGLGSTRAFRRLCKLVQQHRPQMVFLTETKVHSKRFDLLKIRLGFANCFSVDSRGRSGGLALLWGLDIRFNLLSYSPNHIDGWVDE---EVG

Query:  PWRFSGIYGHPQAECKAMTWSLLKHLRGSSDMPWL
         W  + +YG P+   +  +WSLLK L  + D PWL
Subjt:  PWRFSGIYGHPQAECKAMTWSLLKHLRGSSDMPWL

A0A2N9EH45 Reverse transcriptase domain-containing protein; e value 1.4e-26; identity 43.21%
Query:  MSILIWNVRGLGSTRAFRRLCKLVQQHRPQMVFLTETKVHSKRFDLLKIRLGFANCFSVDSRGRSGGLALLWGLDIRFNLLSYSPNHIDGWVDEEVG-PW
        M+IL WN RGLG+  A R L  LV+   P ++FL ETK+ S++ ++L+++L F  CF+V S GRSGGLALLW    +  + ++S NHID  V  + G  W
Subjt:  MSILIWNVRGLGSTRAFRRLCKLVQQHRPQMVFLTETKVHSKRFDLLKIRLGFANCFSVDSRGRSGGLALLWGLDIRFNLLSYSPNHIDGWVDEEVG-PW

Query:  RFSGIYGHPQAECKAMTWSLLKHLRGSSDMPWLAGATLMGSYITMRRRAEGQKVSLNWRSGR
        RF+G YG P+   K  +W+L+  L G S  PWL+    MG Y         + VSL  RSG+
Subjt:  RFSGIYGHPQAECKAMTWSLLKHLRGSSDMPWLAGATLMGSYITMRRRAEGQKVSLNWRSGR

A0A484KUT5 Uncharacterized protein; e value 6.0e-30; identity 45.75%
Query:  MSILIWNVRGLGSTRAFRRLCKLVQQHRPQMVFLTETKVHSKRFDLLKIRLGFANCFSVDSRGRSGGLALLWGLDIRFNLLSYSPNHIDGWVDEE-VGPW
        MS+L WN RGLG+ +  + L  LV   RP +VFL ETK   K+ + +K +LGF + F++DS GRSGGLAL W  + + +L+SYS NH+D  VD E  GPW
Subjt:  MSILIWNVRGLGSTRAFRRLCKLVQQHRPQMVFLTETKVHSKRFDLLKIRLGFANCFSVDSRGRSGGLALLWGLDIRFNLLSYSPNHIDGWVDEE-VGPW

Query:  RFSGIYGHPQAECKAMTWSLLKHLRGSSDMPWLAGATLMGSY--ITMRRRAEG
        RF+G YGHP+   +  +W+LLK L+  S +PW+    +MG +  I + R  +G
Subjt:  RFSGIYGHPQAECKAMTWSLLKHLRGSSDMPWLAGATLMGSY--ITMRRRAEG

A0A5C7IHU5 Uncharacterized protein; e value 3.7e-27; identity 42.86%
Query:  LIWNVRGLGSTRAFRRLCKLVQQHRPQMVFLTETKVHSKRFDLLKIRLGFANCFSVDSRGRSGGLALLWGLDIRFNLLSYSPNHIDGWVDEEVG-PWRFS
        L WNV GLG+ RAF  L +L+++H P +VFL++TK +  + D +K+ LGF NCFSVDS G SGGL LLW   I  ++LS+S  HID  +    G  W FS
Subjt:  LIWNVRGLGSTRAFRRLCKLVQQHRPQMVFLTETKVHSKRFDLLKIRLGFANCFSVDSRGRSGGLALLWGLDIRFNLLSYSPNHIDGWVDEEVG-PWRFS

Query:  GIYGHPQAECKAMTWSLLKHLRGSSDMPWLAGATLMGSYITMRRRAEGQKVSLN
        G YG P    +  +WSLL+ LR ++ + W+ G       ++M  +A G   S +
Subjt:  GIYGHPQAECKAMTWSLLKHLRGSSDMPWLAGATLMGSYITMRRRAEGQKVSLN

A0A803PTM0 Uncharacterized protein; e value 2.1e-27; identity 43.7%
Query:  MSILIWNVRGLGSTRAFRRLCKLVQQHRPQMVFLTETKVHSKRFDLLKIRLGFANCFSVDSRGRSGGLALLWGLDIRFNLLSYSPNHIDGWVDEEVGP-W
        M++LIWN +GLG+    R L  LV    PQM+F+ E+K+   R + L+++LGF  CF V++RG+SGGL LLW +D+   +LS+SP HID +V  E G  W
Subjt:  MSILIWNVRGLGSTRAFRRLCKLVQQHRPQMVFLTETKVHSKRFDLLKIRLGFANCFSVDSRGRSGGLALLWGLDIRFNLLSYSPNHIDGWVDEEVGP-W

Query:  RFSGIYGHPQAECKAMTWSLLKHLRGSSDMPWLAG
        RF+G YG P    +  +W LL+ +      PW  G
Subjt:  RFSGIYGHPQAECKAMTWSLLKHLRGSSDMPWLAG

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found
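
The % identity values in the hit tables appear consistent with the usual BLAST convention: identical columns divided by total alignment columns, gaps included. The sketch below checks this for the Citrus sinensis hit (XP_006487889.1), using the Query and midline rows copied from that alignment. The function name percent_identity and the variable names are illustrative and not part of CuGenDB. It assumes that letters on the midline mark identical residues, while "+" and blanks mark non-identical columns.

```python
# Query rows and midline rows copied from the XP_006487889.1 alignment.
# The column count is taken from the Query rows because midline rows may
# lose trailing blanks when the page is saved as text.
query_rows = [
    "MSILIWNVRGLGSTRAFRRLCKLVQQHRPQMVFLTETKVHSKRFDLLKIRLGFANCFSVDSRGRSGGLALLWGLDIRFNLLSYSPNHIDGWVDEEVG-PW",
    "RFSGIYGHPQAECKAMTWSLLKHLRGSSDMPWL",
]
midline_rows = [
    "M I+ WNV+GLG +R FR   KL+Q+ RPQ++FL+ETK+ +K+ +  +  L F NCF VD  G  GGLALLW LD+   + SYS +HID  +  E G  W",
    "R + +YGHP++E K  TWSLL+ L G S +PWL",
]


def percent_identity(query_rows, midline_rows):
    """Return (identical columns, total columns, % identity rounded to 2 places)."""
    columns = sum(len(row) for row in query_rows)
    # A letter on the midline means query and subject share that residue.
    identical = sum(ch.isalpha() for row in midline_rows for ch in row)
    return identical, columns, round(100 * identical / columns, 2)


identical, columns, pct = percent_identity(query_rows, midline_rows)
print(f"{identical}/{columns} identical columns = {pct}%")
```

For this hit the count is 67 identical columns out of 133, which rounds to the 50.38 listed in the table.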

Sequences
CDS sequence
ATGAGTATCCTGATTTGGAATGTGCGGGGTTTGGGGTCGACCCGTGCATTTCGACGGTTGTGCAAGTTAGTACAACAACATAGACCCCAAATGGTGTTCCTAACTGAGAC
AAAGGTACACTCGAAGCGTTTTGATCTCCTGAAAATTAGATTGGGTTTTGCAAACTGCTTTTCTGTTGACAGTAGGGGGAGGAGTGGTGGTCTTGCGTTGTTATGGGGCT
TAGACATCAGGTTCAATCTACTTTCTTACTCCCCTAATCATATTGATGGGTGGGTGGATGAAGAAGTGGGCCCATGGCGTTTCTCTGGGATTTATGGGCATCCTCAAGCG
GAGTGTAAAGCTATGACATGGTCTCTGTTGAAACATCTTCGAGGGAGTTCTGATATGCCATGGCTGGCAGGGGCGACTTTAATGGGATCCTATATCACCATGAGAAGGAG
GGCGGAAGGGCAAAAGGTGAGTCTGAACTGGAGGTCGGGGAGGAAATTGTTTGAGAAAGGCTAG
mRNA sequence
ATGAGTATCCTGATTTGGAATGTGCGGGGTTTGGGGTCGACCCGTGCATTTCGACGGTTGTGCAAGTTAGTACAACAACATAGACCCCAAATGGTGTTCCTAACTGAGAC
AAAGGTACACTCGAAGCGTTTTGATCTCCTGAAAATTAGATTGGGTTTTGCAAACTGCTTTTCTGTTGACAGTAGGGGGAGGAGTGGTGGTCTTGCGTTGTTATGGGGCT
TAGACATCAGGTTCAATCTACTTTCTTACTCCCCTAATCATATTGATGGGTGGGTGGATGAAGAAGTGGGCCCATGGCGTTTCTCTGGGATTTATGGGCATCCTCAAGCG
GAGTGTAAAGCTATGACATGGTCTCTGTTGAAACATCTTCGAGGGAGTTCTGATATGCCATGGCTGGCAGGGGCGACTTTAATGGGATCCTATATCACCATGAGAAGGAG
GGCGGAAGGGCAAAAGGTGAGTCTGAACTGGAGGTCGGGGAGGAAATTGTTTGAGAAAGGCTAG
Protein sequence
MSILIWNVRGLGSTRAFRRLCKLVQQHRPQMVFLTETKVHSKRFDLLKIRLGFANCFSVDSRGRSGGLALLWGLDIRFNLLSYSPNHIDGWVDEEVGPWRFSGIYGHPQA
ECKAMTWSLLKHLRGSSDMPWLAGATLMGSYITMRRRAEGQKVSLNWRSGRKLFEKG
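
As a consistency check on the three sequence fields, the sketch below translates the CDS with the standard genetic code and compares the result with the listed protein. It assumes the standard nuclear code (NCBI translation table 1) applies. The helper names translate and span_length are illustrative and not part of CuGenDB.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))

# CDS and protein copied from the record (line breaks removed).
CDS = (
    "ATGAGTATCCTGATTTGGAATGTGCGGGGTTTGGGGTCGACCCGTGCATTTCGACGGTTGTGCAAGTTAGTACAACAACATAGACCCCAAATGGTGTTCCTAACTGAGAC"
    "AAAGGTACACTCGAAGCGTTTTGATCTCCTGAAAATTAGATTGGGTTTTGCAAACTGCTTTTCTGTTGACAGTAGGGGGAGGAGTGGTGGTCTTGCGTTGTTATGGGGCT"
    "TAGACATCAGGTTCAATCTACTTTCTTACTCCCCTAATCATATTGATGGGTGGGTGGATGAAGAAGTGGGCCCATGGCGTTTCTCTGGGATTTATGGGCATCCTCAAGCG"
    "GAGTGTAAAGCTATGACATGGTCTCTGTTGAAACATCTTCGAGGGAGTTCTGATATGCCATGGCTGGCAGGGGCGACTTTAATGGGATCCTATATCACCATGAGAAGGAG"
    "GGCGGAAGGGCAAAAGGTGAGTCTGAACTGGAGGTCGGGGAGGAAATTGTTTGAGAAAGGCTAG"
)
PROTEIN = (
    "MSILIWNVRGLGSTRAFRRLCKLVQQHRPQMVFLTETKVHSKRFDLLKIRLGFANCFSVDSRGRSGGLALLWGLDIRFNLLSYSPNHIDGWVDEEVGPWRFSGIYGHPQA"
    "ECKAMTWSLLKHLRGSSDMPWLAGATLMGSYITMRRRAEGQKVSLNWRSGRKLFEKG"
)


def translate(cds):
    """Translate complete codons in frame 1; stop codons are written as '*'."""
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


def span_length(location):
    """Length in bp of a 'chrom:start..end' interval with inclusive coordinates."""
    start, end = location.split(":")[1].split("..")
    return int(end) - int(start) + 1


print(len(CDS), len(PROTEIN))               # CDS length in nt, protein length in aa
print(translate(CDS) == PROTEIN + "*")      # does the CDS encode the listed protein?
print(span_length("chr5:34271729..34272312"))
```

The CDS is 504 nt and translates to the 167-residue protein followed by a stop codon. The genome interval spans 584 bp, which is 80 bp longer than the CDS. An intron would be one possible explanation, though the record itself does not state the gene structure.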