; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0039507 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0039507
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr2:45312854..45313411
RNA-Seq ExpressionLag0039507
SyntenyLag0039507
Gene Ontology termsGO:0090304 - nucleic acid metabolic process (biological process)
GO:0140640 - catalytic activity, acting on a nucleic acid (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022157437.1 uncharacterized protein LOC111024135 [Momordica charantia]1.0e-5360.92Show/hide
Query:  GKDGVVVLKLDMTKAYDCVEWVFIRKTMGKMDFSNSWIDLVMGCVESVAFQVLLNGKPCREFTPKWGLRQDDPLFPYLFLICAEGLLGLLNQADNKRELT
        GK G V +KLDM+KAYD VEW ++R  M KM F+N W+DL+M CVESV F VL+NG P  EFTP  GLRQ DPL PYLF++CAEGL  L+N  + K  +T
Subjt:  GKDGVVVLKLDMTKAYDCVEWVFIRKTMGKMDFSNSWIDLVMGCVESVAFQVLLNGKPCREFTPKWGLRQDDPLFPYLFLICAEGLLGLLNQADNKRELT

Query:  RLRINKYCPSITHLFYADDSLLFFKASNTDCRTIRDIIHTYEKA-SGQTINFDKSNFMVSSNTSEAMVMSIKED
        RL+IN+ CP I+HLFYADD LLFFKAS  +CR+I+ I+ +YEKA SGQ IN DKS F+VS NT E MV  I+++
Subjt:  RLRINKYCPSITHLFYADDSLLFFKASNTDCRTIRDIIHTYEKA-SGQTINFDKSNFMVSSNTSEAMVMSIKED

XP_023878301.1 uncharacterized protein LOC111990748 [Quercus suber]4.9e-4352.44Show/hide
Query:  GKDGVVVLKLDMTKAYDCVEWVFIRKTMGKMDFSNSWIDLVMGCVESVAFQVLLNGKPCREFTPKWGLRQDDPLFPYLFLICAEGLLGLLNQADNKRELT
        GK+G + +KLDM+KA+D VEW FI K M +M F N W DLVM C+ SV++ +L+NG       P  GLRQ DPL P LFL+CAEGL  L+NQA   + +T
Subjt:  GKDGVVVLKLDMTKAYDCVEWVFIRKTMGKMDFSNSWIDLVMGCVESVAFQVLLNGKPCREFTPKWGLRQDDPLFPYLFLICAEGLLGLLNQADNKRELT

Query:  RLRINKYCPSITHLFYADDSLLFFKASNTDCRTIRDIIHTYEKASGQTINFDKSNFMVSSNTSE
         + IN+ CP +THLF+ADDS+LF KA+  +C  +R I+  YE+ASGQ IN DKS+   S NT++
Subjt:  RLRINKYCPSITHLFYADDSLLFFKASNTDCRTIRDIIHTYEKASGQTINFDKSNFMVSSNTSE

XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber]6.4e-4351.83Show/hide
Query:  GKDGVVVLKLDMTKAYDCVEWVFIRKTMGKMDFSNSWIDLVMGCVESVAFQVLLNGKPCREFTPKWGLRQDDPLFPYLFLICAEGLLGLLNQADNKRELT
        GK+G   +KLDM+KAYD VEW FI++ M KM F   WI LVM C+ SV++ +L+NG      TP  GLRQ DP+ PY+FL+CA+G   LLN    K  ++
Subjt:  GKDGVVVLKLDMTKAYDCVEWVFIRKTMGKMDFSNSWIDLVMGCVESVAFQVLLNGKPCREFTPKWGLRQDDPLFPYLFLICAEGLLGLLNQADNKRELT

Query:  RLRINKYCPSITHLFYADDSLLFFKASNTDCRTIRDIIHTYEKASGQTINFDKSNFMVSSNTSE
         + I + CP ITHLF+ADDSLLF KA++ +C+T+ DI+  YE ASGQ IN DKS+   S+NT +
Subjt:  RLRINKYCPSITHLFYADDSLLFFKASNTDCRTIRDIIHTYEKASGQTINFDKSNFMVSSNTSE

XP_023902041.1 uncharacterized protein LOC112013897 [Quercus suber]3.7e-4351.83Show/hide
Query:  GKDGVVVLKLDMTKAYDCVEWVFIRKTMGKMDFSNSWIDLVMGCVESVAFQVLLNGKPCREFTPKWGLRQDDPLFPYLFLICAEGLLGLLNQADNKRELT
        GK+G   +KLDM+KAYD VEW FI++ M KM F   WI LVM C+ SV++ +L+NG      TP  GLRQ DP+ PY+FL+CA+G   LLN A  K  ++
Subjt:  GKDGVVVLKLDMTKAYDCVEWVFIRKTMGKMDFSNSWIDLVMGCVESVAFQVLLNGKPCREFTPKWGLRQDDPLFPYLFLICAEGLLGLLNQADNKRELT

Query:  RLRINKYCPSITHLFYADDSLLFFKASNTDCRTIRDIIHTYEKASGQTINFDKSNFMVSSNTSE
         + I + CP ITHLF+ADD+LLF KA++ +C+T+ DI+  YE ASGQ IN DKS+   S+NT +
Subjt:  RLRINKYCPSITHLFYADDSLLFFKASNTDCRTIRDIIHTYEKASGQTINFDKSNFMVSSNTSE

XP_030939568.1 uncharacterized protein LOC115964386 [Quercus lobata]4.9e-4351.46Show/hide
Query:  GKDGVVVLKLDMTKAYDCVEWVFIRKTMGKMDFSNSWIDLVMGCVESVAFQVLLNGKPCREFTPKWGLRQDDPLFPYLFLICAEGLLGLLNQADNKRELT
        GK G + LKLDM+KAYD VEW+F+ K M K+ F + WI L+  C+ +V+F VL+NG+P   FTP  GLRQ DPL PYLFL+CAEGL  L+ QA+    L 
Subjt:  GKDGVVVLKLDMTKAYDCVEWVFIRKTMGKMDFSNSWIDLVMGCVESVAFQVLLNGKPCREFTPKWGLRQDDPLFPYLFLICAEGLLGLLNQADNKRELT

Query:  RLRINKYCPSITHLFYADDSLLFFKASNTDCRTIRDIIHTYEKASGQTINFDKSNFMVSSNTSEAMVMSIK
         + + +  P ++HLF+ADDSLLF +A+  D  TI +I+H YE+ASGQ IN +K+    S NT  AM   IK
Subjt:  RLRINKYCPSITHLFYADDSLLFFKASNTDCRTIRDIIHTYEKASGQTINFDKSNFMVSSNTSEAMVMSIK

TrEMBL top hitse value%identityAlignment
A0A2N9ENX3 Reverse transcriptase domain-containing protein3.6e-4452.91Show/hide
Query:  GKDGVVVLKLDMTKAYDCVEWVFIRKTMGKMDFSNSWIDLVMGCVESVAFQVLLNGKPCREFTPKWGLRQDDPLFPYLFLICAEGLLGLLNQADNKRELT
        GK G + LKLDM+KA D VEW F+ K M KM F N W+ LVM CV SV++ VL+NG+P   F P  GLRQ DP+ PYLFL+CAEGL  LL +A   R++ 
Subjt:  GKDGVVVLKLDMTKAYDCVEWVFIRKTMGKMDFSNSWIDLVMGCVESVAFQVLLNGKPCREFTPKWGLRQDDPLFPYLFLICAEGLLGLLNQADNKRELT

Query:  RLRINKYCPSITHLFYADDSLLFFKASNTDCRTIRDIIHTYEKASGQTINFDKSNFMVSSNTSEAMVMSIKE
         L I++  P +THLF+ADDS+LF +A+  DC TI +I+  YE+ASGQ IN DK+    S++TS A+   I+E
Subjt:  RLRINKYCPSITHLFYADDSLLFFKASNTDCRTIRDIIHTYEKASGQTINFDKSNFMVSSNTSEAMVMSIKE

A0A2N9FBK3 Reverse transcriptase domain-containing protein3.6e-4452.91Show/hide
Query:  GKDGVVVLKLDMTKAYDCVEWVFIRKTMGKMDFSNSWIDLVMGCVESVAFQVLLNGKPCREFTPKWGLRQDDPLFPYLFLICAEGLLGLLNQADNKRELT
        GK G + LKLDM+KA D VEW F+ K M KM F N W+ LVM CV SV++ VL+NG+P   F P  GLRQ DP+ PYLFL+CAEGL  LL +A   R++ 
Subjt:  GKDGVVVLKLDMTKAYDCVEWVFIRKTMGKMDFSNSWIDLVMGCVESVAFQVLLNGKPCREFTPKWGLRQDDPLFPYLFLICAEGLLGLLNQADNKRELT

Query:  RLRINKYCPSITHLFYADDSLLFFKASNTDCRTIRDIIHTYEKASGQTINFDKSNFMVSSNTSEAMVMSIKE
         L I++  P +THLF+ADDS+LF +A+  DC TI +I+  YE+ASGQ IN DK+    S++TS A+   I+E
Subjt:  RLRINKYCPSITHLFYADDSLLFFKASNTDCRTIRDIIHTYEKASGQTINFDKSNFMVSSNTSEAMVMSIKE

A0A2N9FT83 Reverse transcriptase domain-containing protein5.1e-4651.46Show/hide
Query:  GKDGVVVLKLDMTKAYDCVEWVFIRKTMGKMDFSNSWIDLVMGCVESVAFQVLLNGKPCREFTPKWGLRQDDPLFPYLFLICAEGLLGLLNQADNKRELT
        GK+G + LKLDM+KAYD VEW F+R+ M KM F + W+ L+M C+ +V++ +L+NG+P    TP  GLRQ DP+ PYLFLICAEGL GLLN+A ++ E+ 
Subjt:  GKDGVVVLKLDMTKAYDCVEWVFIRKTMGKMDFSNSWIDLVMGCVESVAFQVLLNGKPCREFTPKWGLRQDDPLFPYLFLICAEGLLGLLNQADNKRELT

Query:  RLRINKYCPSITHLFYADDSLLFFKASNTDCRTIRDIIHTYEKASGQTINFDKSNFMVSSNTSEAMVMSIK
         + I++  P +THLF+ADDSLLF +A+  +C+ I+DI+H YEKASGQ +N  K+    S NT++     IK
Subjt:  RLRINKYCPSITHLFYADDSLLFFKASNTDCRTIRDIIHTYEKASGQTINFDKSNFMVSSNTSEAMVMSIK

A0A2N9GM07 Reverse transcriptase domain-containing protein1.1e-4351.16Show/hide
Query:  NGKDGVVVLKLDMTKAYDCVEWVFIRKTMGKMDFSNSWIDLVMGCVESVAFQVLLNGKPCREFTPKWGLRQDDPLFPYLFLICAEGLLGLLNQADNKREL
        +GKD  + LKLDM+KAYD VEWVFI + M K+ F   WI L+M C+ +V + V +NG  C    P  GLRQ DPL PYLFL+CAEG   LL +A++   +
Subjt:  NGKDGVVVLKLDMTKAYDCVEWVFIRKTMGKMDFSNSWIDLVMGCVESVAFQVLLNGKPCREFTPKWGLRQDDPLFPYLFLICAEGLLGLLNQADNKREL

Query:  TRLRINKYCPSITHLFYADDSLLFFKASNTDCRTIRDIIHTYEKASGQTINFDKSNFMVSSNTSEAMVMSIK
          + + +  P +THLF+ADDS+LF KA+NTDC+ + +I  TYEKASGQ IN DKS+   S NTS      IK
Subjt:  TRLRINKYCPSITHLFYADDSLLFFKASNTDCRTIRDIIHTYEKASGQTINFDKSNFMVSSNTSEAMVMSIK

A0A6J1DUG8 uncharacterized protein LOC1110241355.1e-5460.92Show/hide
Query:  GKDGVVVLKLDMTKAYDCVEWVFIRKTMGKMDFSNSWIDLVMGCVESVAFQVLLNGKPCREFTPKWGLRQDDPLFPYLFLICAEGLLGLLNQADNKRELT
        GK G V +KLDM+KAYD VEW ++R  M KM F+N W+DL+M CVESV F VL+NG P  EFTP  GLRQ DPL PYLF++CAEGL  L+N  + K  +T
Subjt:  GKDGVVVLKLDMTKAYDCVEWVFIRKTMGKMDFSNSWIDLVMGCVESVAFQVLLNGKPCREFTPKWGLRQDDPLFPYLFLICAEGLLGLLNQADNKRELT

Query:  RLRINKYCPSITHLFYADDSLLFFKASNTDCRTIRDIIHTYEKA-SGQTINFDKSNFMVSSNTSEAMVMSIKED
        RL+IN+ CP I+HLFYADD LLFFKAS  +CR+I+ I+ +YEKA SGQ IN DKS F+VS NT E MV  I+++
Subjt:  RLRINKYCPSITHLFYADDSLLFFKASNTDCRTIRDIIHTYEKA-SGQTINFDKSNFMVSSNTSEAMVMSIKED

SwissProt top hitse value%identityAlignment
P92555 Uncharacterized mitochondrial protein AtMg012501.0e-1145.59Show/hide
Query:  LLNGKPCREFTPKWGLRQDDPLFPYLFLICAEGLLGLLNQADNKRELTRLRINKYCPSITHLFYADDS
        ++NG P    TP  GLRQ DPL PYLF++C E L GL  +A  +  L  +R++   P I HL +ADD+
Subjt:  LLNGKPCREFTPKWGLRQDDPLFPYLFLICAEGLLGLLNQADNKRELTRLRINKYCPSITHLFYADDS

Q05118 Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment)4.5e-0725.85Show/hide
Query:  VLKLDMTKAYDCVEWVFIRKTMGKMDFSNSWIDLVMGCVESVAFQVLLNGKPCREFTPKWGLRQDDPLFPYLFLICAEGLLGLLNQADNKRELTRLRINK
        V+ LD+ KA+D V    I + M      +   D +M  +      +++ G+   +   + G++Q DPL P LF I  + L+  LN       +T      
Subjt:  VLKLDMTKAYDCVEWVFIRKTMGKMDFSNSWIDLVMGCVESVAFQVLLNGKPCREFTPKWGLRQDDPLFPYLFLICAEGLLGLLNQADNKRELTRLRINK

Query:  YCPSITHLFYADDSLLFFKASNTDCRTIRDIIHTYEKASGQTINFDK
            I  L +ADD LL  +  + D          Y +  G T+N +K
Subjt:  YCPSITHLFYADDSLLFFKASNTDCRTIRDIIHTYEKASGQTINFDK

Arabidopsis top hitse value%identityAlignment
AT4G20520.1 RNA binding;RNA-directed DNA polymerases2.1e-0430.77Show/hide
Query:  GKDGVVVLKLDMTKAYDCVEWVFIRKTMGKMDFSNSWIDLVMGC---VESVAFQV--LLNGKPCREFTPKWGLRQDDPLFPYL--FLICAE
        G  G ++LKLD+ KAYD + W ++  T+    F   W+  +         VA +V      K  R    +WG R DD   P+    + CAE
Subjt:  GKDGVVVLKLDMTKAYDCVEWVFIRKTMGKMDFSNSWIDLVMGC---VESVAFQV--LLNGKPCREFTPKWGLRQDDPLFPYL--FLICAE

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)7.3e-1345.59Show/hide
Query:  LLNGKPCREFTPKWGLRQDDPLFPYLFLICAEGLLGLLNQADNKRELTRLRINKYCPSITHLFYADDS
        ++NG P    TP  GLRQ DPL PYLF++C E L GL  +A  +  L  +R++   P I HL +ADD+
Subjt:  LLNGKPCREFTPKWGLRQDDPLFPYLFLICAEGLLGLLNQADNKRELTRLRINKYCPSITHLFYADDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTGGTAACTCTACATCATTGAATGGCAAGGACGGGGTTGTGGTGCTAAAACTTGATATGACCAAGGCATACGATTGTGTAGAATGGGTTTTCATTCGAAAGACCAT
GGGTAAAATGGATTTTAGCAACAGCTGGATTGATCTTGTTATGGGTTGTGTGGAGTCGGTAGCTTTCCAGGTTTTACTGAATGGTAAACCGTGTCGAGAGTTCACCCCCA
AGTGGGGACTTCGACAGGACGATCCTCTGTTTCCATACTTATTCCTCATTTGTGCAGAAGGTTTGTTAGGGCTTTTGAATCAAGCTGATAACAAAAGGGAGTTGACAAGG
TTAAGAATTAATAAATATTGTCCATCTATCACTCATCTTTTTTATGCTGATGATAGCTTGTTGTTCTTTAAAGCTTCTAACACAGATTGCAGGACTATTCGAGATATCAT
ACACACTTATGAGAAAGCTTCAGGTCAAACAATCAATTTTGATAAGTCTAACTTCATGGTTAGTTCTAATACAAGTGAAGCTATGGTGATGTCTATCAAGGAGGATCGGA
GTTTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTTGGTAACTCTACATCATTGAATGGCAAGGACGGGGTTGTGGTGCTAAAACTTGATATGACCAAGGCATACGATTGTGTAGAATGGGTTTTCATTCGAAAGACCAT
GGGTAAAATGGATTTTAGCAACAGCTGGATTGATCTTGTTATGGGTTGTGTGGAGTCGGTAGCTTTCCAGGTTTTACTGAATGGTAAACCGTGTCGAGAGTTCACCCCCA
AGTGGGGACTTCGACAGGACGATCCTCTGTTTCCATACTTATTCCTCATTTGTGCAGAAGGTTTGTTAGGGCTTTTGAATCAAGCTGATAACAAAAGGGAGTTGACAAGG
TTAAGAATTAATAAATATTGTCCATCTATCACTCATCTTTTTTATGCTGATGATAGCTTGTTGTTCTTTAAAGCTTCTAACACAGATTGCAGGACTATTCGAGATATCAT
ACACACTTATGAGAAAGCTTCAGGTCAAACAATCAATTTTGATAAGTCTAACTTCATGGTTAGTTCTAATACAAGTGAAGCTATGGTGATGTCTATCAAGGAGGATCGGA
GTTTCTAG
Protein sequenceShow/hide protein sequence
MVGNSTSLNGKDGVVVLKLDMTKAYDCVEWVFIRKTMGKMDFSNSWIDLVMGCVESVAFQVLLNGKPCREFTPKWGLRQDDPLFPYLFLICAEGLLGLLNQADNKRELTR
LRINKYCPSITHLFYADDSLLFFKASNTDCRTIRDIIHTYEKASGQTINFDKSNFMVSSNTSEAMVMSIKEDRSF