CuGenDBv2

Lag0008526 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0008526
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr9:24522221..24522778
RNA-Seq Expression: Lag0008526
Synteny: Lag0008526
Gene Ontology terms: NA
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily
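
The genome location above can be turned into a span length with a few lines of Python. This is a minimal sketch, not part of the database record: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive on both ends, which the page itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into its three parts."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr9:24522221..24522778")
print(chrom, start, end, span_length(start, end))  # chr9 24522221 24522778 558
```

Under that assumption the locus spans 558 bases.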


Homology
GenBank top hits (e value, % identity, alignment)
KAG7567709.1 Reverse transcriptase domain [Arabidopsis thaliana x Arabidopsis arenosa] (e value: 5.6e-47; % identity: 52.05)
Query:  MAVKLDMGKAYDKVEWLFLERVMRVMGFDARWIGWIMECVTLVSYSLRVSNKPYGMIRSERGLRQGDPLSPYLFVLCSEVLTFLLNEVIGCNQLSRYKVS
        +A+K D+ KAYD+VEW FLERVM  MGFD++W+ WIMECV  VSY + ++  PYG I+  RGLRQGDPLSPYLF+ C+EVL+ +L+  +   Q+   K+ 
Subjt:  MAVKLDMGKAYDKVEWLFLERVMRVMGFDARWIGWIMECVTLVSYSLRVSNKPYGMIRSERGLRQGDPLSPYLFVLCSEVLTFLLNEVIGCNQLSRYKVS

Query:  RNGPTISNIFFADDSLIFCSATKEECVKLLVVLARYGELSGQLVNFDKSNVIFSRNTPTEVRERLAGLLGI
        R  P IS++ FADDSL FC AT+  C  +  +  RY E+SGQ++N+ KS+VIF    P   R RL  +LGI
Subjt:  RNGPTISNIFFADDSLIFCSATKEECVKLLVVLARYGELSGQLVNFDKSNVIFSRNTPTEVRERLAGLLGI

KAG7586949.1 Reverse transcriptase domain [Arabidopsis thaliana x Arabidopsis arenosa] (e value: 9.5e-47; % identity: 48.37)
Query:  MNRRRKGRKYSMAVKLDMGKAYDKVEWLFLERVMRVMGFDARWIGWIMECVTLVSYSLRVSNKPYGMIRSERGLRQGDPLSPYLFVLCSEVLTFLLNEVI
        +  +++ +   +A+K D+ KAYD+VEW FLER+M  MGFD +W+ WIMECV  VSY + ++  PYG I+  RGLRQGDPLSPYLF+ C+EVL+ +L++ +
Subjt:  MNRRRKGRKYSMAVKLDMGKAYDKVEWLFLERVMRVMGFDARWIGWIMECVTLVSYSLRVSNKPYGMIRSERGLRQGDPLSPYLFVLCSEVLTFLLNEVI

Query:  GCNQLSRYKVSRNGPTISNIFFADDSLIFCSATKEECVKLLVVLARYGELSGQLVNFDKSNVIFSRNTPTEVRERLAGLLGIRH
           Q+   K+ R  P IS++ FADDSL FC AT+  C  +  +  RY E+SGQ++N+ KS+VIF    P   R RL  +LGI H
Subjt:  GCNQLSRYKVSRNGPTISNIFFADDSLIFCSATKEECVKLLVVLARYGELSGQLVNFDKSNVIFSRNTPTEVRERLAGLLGIRH

KAG7595246.1 Reverse transcriptase domain [Arabidopsis thaliana x Arabidopsis arenosa] (e value: 1.4e-45; % identity: 48.09)
Query:  RRRKGRKYSMAVKLDMGKAYDKVEWLFLERVMRVMGFDARWIGWIMECVTLVSYSLRVSNKPYGMIRSERGLRQGDPLSPYLFVLCSEVLTFLLNEVIGC
        R +K +   MA+KLD+ KA+DKVEW ++E V++ +GF  +W  W+M+C+  VSYS+ ++  P   I   RGLRQGDPLSPYL++LC+E L+ LL+  +  
Subjt:  RRRKGRKYSMAVKLDMGKAYDKVEWLFLERVMRVMGFDARWIGWIMECVTLVSYSLRVSNKPYGMIRSERGLRQGDPLSPYLFVLCSEVLTFLLNEVIGC

Query:  NQLSRYKVSRNGPTISNIFFADDSLIFCSATKEECVKLLVVLARYGELSGQLVNFDKSNVIFSRNTPTEVRERLAGLLGIRHT
        NQ+  +K SR+GP+IS++FFADDSL+FC A + EC KLL +L  Y + SGQ +NF KS +IF +  P+EV+  +  + GI  T
Subjt:  NQLSRYKVSRNGPTISNIFFADDSLIFCSATKEECVKLLVVLARYGELSGQLVNFDKSNVIFSRNTPTEVRERLAGLLGIRHT

XP_023920446.1 uncharacterized protein LOC112031976 [Quercus suber] (e value: 2.5e-47; % identity: 51.65)
Query:  MNRRRKGRKYSMAVKLDMGKAYDKVEWLFLERVMRVMGFDARWIGWIMECVTLVSYSLRVSNKPYGMIRSERGLRQGDPLSPYLFVLCSEVLTFLLNEVI
        M R+R G+   MA+KLDM KAYD++EW+FL+R+M  MGF ++WIGWIMECV  V YS+ V+ +P G I   RG+RQGDPLSPYLF+LCSE L  L+   +
Subjt:  MNRRRKGRKYSMAVKLDMGKAYDKVEWLFLERVMRVMGFDARWIGWIMECVTLVSYSLRVSNKPYGMIRSERGLRQGDPLSPYLFVLCSEVLTFLLNEVI

Query:  GCNQLSRYKVSRNGPTISNIFFADDSLIFCSATKEECVKLLVVLARYGELSGQLVNFDKSNVIFSRNTPTEVRERLAGLLGI
         C  +    + RNGP IS+IFFADDSL+FC A   +  K+   L +Y   SGQ +N DK+ + FS+NT    +E L  LLG+
Subjt:  GCNQLSRYKVSRNGPTISNIFFADDSLIFCSATKEECVKLLVVLARYGELSGQLVNFDKSNVIFSRNTPTEVRERLAGLLGI

XP_027118380.1 uncharacterized protein LOC113735584 [Coffea arabica] (e value: 1.2e-46; % identity: 49.72)
Query:  MNRRRKGRKYSMAVKLDMGKAYDKVEWLFLERVMRVMGFDARWIGWIMECVTLVSYSLRVSNKPYGMIRSERGLRQGDPLSPYLFVLCSEVLTFLLNEVI
        +N +R+G+   MAVKLDM KAYD+VEW FLE +M+ MGF  RWI WIMECV  VSYS  ++ +    +   RG+RQGDPLSPYLF+LCSE  + L+ +  
Subjt:  MNRRRKGRKYSMAVKLDMGKAYDKVEWLFLERVMRVMGFDARWIGWIMECVTLVSYSLRVSNKPYGMIRSERGLRQGDPLSPYLFVLCSEVLTFLLNEVI

Query:  GCNQLSRYKVSRNGPTISNIFFADDSLIFCSATKEECVKLLVVLARYGELSGQLVNFDKSNVIFSRNTPTEVRERLAGLLG
           ++S  K+SR GP+I+++FFADDSLIFC A K++  +L+ VL  Y + SGQL+N DKS+++FS+N    ++  +  ++G
Subjt:  GCNQLSRYKVSRNGPTISNIFFADDSLIFCSATKEECVKLLVVLARYGELSGQLVNFDKSNVIFSRNTPTEVRERLAGLLG

TrEMBL top hits (e value, % identity, alignment)
A0A2N9EV43 Reverse transcriptase domain-containing protein (e value: 2.7e-47; % identity: 51.37)
Query:  MNRRRKGRKYSMAVKLDMGKAYDKVEWLFLERVMRVMGFDARWIGWIMECVTLVSYSLRVSNKPYGMIRSERGLRQGDPLSPYLFVLCSEVLTFLLNEVI
        +  RRKG+   MA+KLDM KAYD+VEW FLERVM+ MGF++RWI  +M CV   SYS+ ++ +P G I+  RG+RQGDPLSPYLF+ C+E LT LL +  
Subjt:  MNRRRKGRKYSMAVKLDMGKAYDKVEWLFLERVMRVMGFDARWIGWIMECVTLVSYSLRVSNKPYGMIRSERGLRQGDPLSPYLFVLCSEVLTFLLNEVI

Query:  GCNQLSRYKVSRNGPTISNIFFADDSLIFCSATKEECVKLLVVLARYGELSGQLVNFDKSNVIFSRNTPTEVRERLAGLLGIR
           +++   + R GP IS++ FADDSL+FC A+  EC +LL VL  Y + SGQ+VN DK+ + FS+NTP  +R+ +  L G+R
Subjt:  GCNQLSRYKVSRNGPTISNIFFADDSLIFCSATKEECVKLLVVLARYGELSGQLVNFDKSNVIFSRNTPTEVRERLAGLLGIR

A0A2N9FDP4 Reverse transcriptase domain-containing protein (e value: 2.1e-47; % identity: 52.43)
Query:  MNRRRKGRKYSMAVKLDMGKAYDKVEWLFLERVMRVMGFDARWIGWIMECVTLVSYSLRVSNKPYGMIRSERGLRQGDPLSPYLFVLCSEVLTFLLNEVI
        +  RRKG+   MA+KLDM KAYD+VEW+FLERVM  MGF +RWI  +M CV   SYS+ ++ +P G I+  RG+RQGDPLSPYLF+LC+E L+ LL    
Subjt:  MNRRRKGRKYSMAVKLDMGKAYDKVEWLFLERVMRVMGFDARWIGWIMECVTLVSYSLRVSNKPYGMIRSERGLRQGDPLSPYLFVLCSEVLTFLLNEVI

Query:  GCNQLSRYKVSRNGPTISNIFFADDSLIFCSATKEECVKLLVVLARYGELSGQLVNFDKSNVIFSRNTPTEVRERLAGLLGIRHT
           Q+    + RNGP IS++ FADDSL+FC AT+EEC +L+ VLA+Y   S Q+VN +K+ + FS+NTP  VR  +  L G++ T
Subjt:  GCNQLSRYKVSRNGPTISNIFFADDSLIFCSATKEECVKLLVVLARYGELSGQLVNFDKSNVIFSRNTPTEVRERLAGLLGIRHT

A0A2N9HN71 Reverse transcriptase domain-containing protein (e value: 2.7e-47; % identity: 51.37)
Query:  MNRRRKGRKYSMAVKLDMGKAYDKVEWLFLERVMRVMGFDARWIGWIMECVTLVSYSLRVSNKPYGMIRSERGLRQGDPLSPYLFVLCSEVLTFLLNEVI
        +  RRKG+   MA+KLDM KAYD+VEW FLERVM+ MGF++RWI  +M CV   SYS+ ++ +P G I+  RG+RQGDPLSPYLF+ C+E LT LL +  
Subjt:  MNRRRKGRKYSMAVKLDMGKAYDKVEWLFLERVMRVMGFDARWIGWIMECVTLVSYSLRVSNKPYGMIRSERGLRQGDPLSPYLFVLCSEVLTFLLNEVI

Query:  GCNQLSRYKVSRNGPTISNIFFADDSLIFCSATKEECVKLLVVLARYGELSGQLVNFDKSNVIFSRNTPTEVRERLAGLLGIR
           +++   + R GP IS++ FADDSL+FC A+  EC +LL VL  Y + SGQ+VN DK+ + FS+NTP  +R+ +  L G+R
Subjt:  GCNQLSRYKVSRNGPTISNIFFADDSLIFCSATKEECVKLLVVLARYGELSGQLVNFDKSNVIFSRNTPTEVRERLAGLLGIR

A0A2N9HPA9 Reverse transcriptase domain-containing protein (e value: 2.1e-47; % identity: 51.35)
Query:  MNRRRKGRKYSMAVKLDMGKAYDKVEWLFLERVMRVMGFDARWIGWIMECVTLVSYSLRVSNKPYGMIRSERGLRQGDPLSPYLFVLCSEVLTFLLNEVI
        +  RRKG+   MA+KLDM KAYD+VEW FLERVM+ MGF  RWI  +M CV   SYS+ V+ +P G I+  RG+RQGDPLSPYLF+LC+E L+ +L +V 
Subjt:  MNRRRKGRKYSMAVKLDMGKAYDKVEWLFLERVMRVMGFDARWIGWIMECVTLVSYSLRVSNKPYGMIRSERGLRQGDPLSPYLFVLCSEVLTFLLNEVI

Query:  GCNQLSRYKVSRNGPTISNIFFADDSLIFCSATKEECVKLLVVLARYGELSGQLVNFDKSNVIFSRNTPTEVRERLAGLLGIRHT
           Q+S   + R  P IS++ FADDS++FC A+ EEC +LL +LARY + SGQ+VN  K+ + FS+NTP  V+  +  + G++ T
Subjt:  GCNQLSRYKVSRNGPTISNIFFADDSLIFCSATKEECVKLLVVLARYGELSGQLVNFDKSNVIFSRNTPTEVRERLAGLLGIRHT

A0A6P6WSS6 uncharacterized protein LOC113735584 (e value: 6.0e-47; % identity: 49.72)
Query:  MNRRRKGRKYSMAVKLDMGKAYDKVEWLFLERVMRVMGFDARWIGWIMECVTLVSYSLRVSNKPYGMIRSERGLRQGDPLSPYLFVLCSEVLTFLLNEVI
        +N +R+G+   MAVKLDM KAYD+VEW FLE +M+ MGF  RWI WIMECV  VSYS  ++ +    +   RG+RQGDPLSPYLF+LCSE  + L+ +  
Subjt:  MNRRRKGRKYSMAVKLDMGKAYDKVEWLFLERVMRVMGFDARWIGWIMECVTLVSYSLRVSNKPYGMIRSERGLRQGDPLSPYLFVLCSEVLTFLLNEVI

Query:  GCNQLSRYKVSRNGPTISNIFFADDSLIFCSATKEECVKLLVVLARYGELSGQLVNFDKSNVIFSRNTPTEVRERLAGLLG
           ++S  K+SR GP+I+++FFADDSLIFC A K++  +L+ VL  Y + SGQL+N DKS+++FS+N    ++  +  ++G
Subjt:  GCNQLSRYKVSRNGPTISNIFFADDSLIFCSATKEECVKLLVVLARYGELSGQLVNFDKSNVIFSRNTPTEVRERLAGLLG

SwissProt top hits (e value, % identity, alignment)
P11369 LINE-1 retrotransposable element ORF2 protein (e value: 4.6e-12; % identity: 27.44)
Query:  KYSMAVKLDMGKAYDKVEWLFLERVMRVMGFDARWIGWIMECVTLVSYSLRVSNKPYGMIRSERGLRQGDPLSPYLFVLCSEVLTFLLNEVIGCNQLSRY
        K  M + LD  KA+DK++  F+ +V+   G    ++  I    +    +++V+ +    I  + G RQG PLSPYLF +  EVL   + +     ++   
Subjt:  KYSMAVKLDMGKAYDKVEWLFLERVMRVMGFDARWIGWIMECVTLVSYSLRVSNKPYGMIRSERGLRQGDPLSPYLFVLCSEVLTFLLNEVIGCNQLSRY

Query:  KVSRNGPTISNIFFADDSLIFCSATKEECVKLLVVLARYGELSGQLVNFDKSNV-IFSRNTPTE
        ++ +    IS    ADD +++ S  K    +LL ++  +GE+ G  +N +KS   ++++N   E
Subjt:  KVSRNGPTISNIFFADDSLIFCSATKEECVKLLVVLARYGELSGQLVNFDKSNV-IFSRNTPTE

P16423 Retrovirus-related Pol polyprotein from type-2 retrotransposable element R2DM (e value: 6.1e-04; % identity: 24.36)
Query:  LDMGKAYDKVEWLFLERVMRVMGFDARWIGWIMECVTLVSYSLRVSNKPYGMIRSERGLRQGDPLSPYLFVLCSEVLTFLLNEVIGCNQLSRYKVSRNGP
        LD+ KA+D +    +   +R  G    ++ ++         SL             RG++QGDPLSP LF L  + L   L   IG         ++ G 
Subjt:  LDMGKAYDKVEWLFLERVMRVMGFDARWIGWIMECVTLVSYSLRVSNKPYGMIRSERGLRQGDPLSPYLFVLCSEVLTFLLNEVIGCNQLSRYKVSRNGP

Query:  TISNIFFADDSLIFCSATKEECVKLLVVLARYGELSGQLVNFDKSNVIFSRNTPTE
         I+N     D L+  + T+     LL     +  + G  +N DK   +  +  P +
Subjt:  TISNIFFADDSLIFCSATKEECVKLLVVLARYGELSGQLVNFDKSNVIFSRNTPTE

P92555 Uncharacterized mitochondrial protein AtMg01250 (e value: 2.7e-12; % identity: 47.76)
Query:  VSNKPYGMIRSERGLRQGDPLSPYLFVLCSEVLTFLLNEVIGCNQLSRYKVSRNGPTISNIFFADDS
        ++  P G++   RGLRQGDPLSPYLF+LC+EVL+ L        +L   +VS N P I+++ FADD+
Subjt:  VSNKPYGMIRSERGLRQGDPLSPYLFVLCSEVLTFLLNEVIGCNQLSRYKVSRNGPTISNIFFADDS

Q03274 Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment) (e value: 2.5e-05; % identity: 29.41)
Query:  MNRRRKGRKYSMAVKLDMGKAYDKVEWLFLERVMRVMGFDARWIGWIMECVTLVSYSLRVS-NKPYGMIRSERGLRQGDPLSPYLF------VLCSEVLT
        ++ RR+ RK    V LD+ KA+D V    + R ++ +G D     +I   ++  + ++RV        I   RG++QGDPLSP+LF      +LCS   T
Subjt:  MNRRRKGRKYSMAVKLDMGKAYDKVEWLFLERVMRVMGFDARWIGWIMECVTLVSYSLRVS-NKPYGMIRSERGLRQGDPLSPYLF------VLCSEVLT

Query:  FLLNEVIGCNQLSRYKVSRNGPTISNIFFADDSLIFCSATKEECVKL---LVVLARYGELSGQLVNFDKS
          +   IG  ++         P ++   FADD L+     ++  V L   L  +A +  L G  +N  KS
Subjt:  FLLNEVIGCNQLSRYKVSRNGPTISNIFFADDSLIFCSATKEECVKL---LVVLARYGELSGQLVNFDKS

Q05118 Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment) (e value: 2.8e-09; % identity: 34.65)
Query:  RRRKGRKYSMAVKLDMGKAYDKVEWLFLERVMRVMGFDARWIGWIMECVTLVSYSLRVSNKPYGMIRSERGLRQGDPLSPYLF-VLCSEVLTFLLNEVIG
        RR KG+ Y++ V LD+ KA+D V    + R MR  G D     +IM  +T    ++ V  +    I    G++QGDPLSP LF ++  E++T L +E  G
Subjt:  RRRKGRKYSMAVKLDMGKAYDKVEWLFLERVMRVMGFDARWIGWIMECVTLVSYSLRVSNKPYGMIRSERGLRQGDPLSPYLF-VLCSEVLTFLLNEVIG

Query:  CNQLSRYKVSRNGPTISNIFFADDSLI
         +     K       I+++ FADD L+
Subjt:  CNQLSRYKVSRNGPTISNIFFADDSLI

Arabidopsis top hits (e value, % identity, alignment)
ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase) (e value: 1.9e-13; % identity: 47.76)
Query:  VSNKPYGMIRSERGLRQGDPLSPYLFVLCSEVLTFLLNEVIGCNQLSRYKVSRNGPTISNIFFADDS
        ++  P G++   RGLRQGDPLSPYLF+LC+EVL+ L        +L   +VS N P I+++ FADD+
Subjt:  VSNKPYGMIRSERGLRQGDPLSPYLFVLCSEVLTFLLNEVIGCNQLSRYKVSRNGPTISNIFFADDS


Sequences
CDS sequence
ATGAATCGGAGGAGAAAAGGGAGGAAGTATTCCATGGCGGTCAAGCTTGATATGGGAAAAGCTTATGACAAAGTTGAATGGTTGTTCTTGGAGAGAGTGATGAGGGTAAT
GGGCTTTGATGCGAGATGGATCGGGTGGATAATGGAGTGTGTGACTTTAGTATCGTATAGTCTGAGGGTGAGTAATAAGCCATATGGGATGATTAGGTCGGAGAGAGGGT
TGAGGCAGGGAGATCCCTTGTCACCATATCTGTTCGTTTTGTGTTCGGAGGTGCTTACTTTCTTGCTGAATGAGGTGATTGGCTGTAACCAGCTCTCGAGATATAAGGTG
TCGAGGAATGGGCCTACCATTTCAAATATTTTCTTTGCTGATGATTCTTTGATATTTTGTAGTGCAACCAAAGAGGAGTGTGTTAAGCTACTTGTTGTGTTGGCTAGATA
TGGTGAGCTGTCTGGACAGTTAGTGAATTTTGATAAGAGTAATGTAATATTTAGTAGAAATACACCAACGGAAGTGAGAGAACGATTGGCGGGGCTGTTGGGAATTCGTC
ATACTTAG
mRNA sequence
ATGAATCGGAGGAGAAAAGGGAGGAAGTATTCCATGGCGGTCAAGCTTGATATGGGAAAAGCTTATGACAAAGTTGAATGGTTGTTCTTGGAGAGAGTGATGAGGGTAAT
GGGCTTTGATGCGAGATGGATCGGGTGGATAATGGAGTGTGTGACTTTAGTATCGTATAGTCTGAGGGTGAGTAATAAGCCATATGGGATGATTAGGTCGGAGAGAGGGT
TGAGGCAGGGAGATCCCTTGTCACCATATCTGTTCGTTTTGTGTTCGGAGGTGCTTACTTTCTTGCTGAATGAGGTGATTGGCTGTAACCAGCTCTCGAGATATAAGGTG
TCGAGGAATGGGCCTACCATTTCAAATATTTTCTTTGCTGATGATTCTTTGATATTTTGTAGTGCAACCAAAGAGGAGTGTGTTAAGCTACTTGTTGTGTTGGCTAGATA
TGGTGAGCTGTCTGGACAGTTAGTGAATTTTGATAAGAGTAATGTAATATTTAGTAGAAATACACCAACGGAAGTGAGAGAACGATTGGCGGGGCTGTTGGGAATTCGTC
ATACTTAG
Protein sequence
MNRRRKGRKYSMAVKLDMGKAYDKVEWLFLERVMRVMGFDARWIGWIMECVTLVSYSLRVSNKPYGMIRSERGLRQGDPLSPYLFVLCSEVLTFLLNEVIGCNQLSRYKV
SRNGPTISNIFFADDSLIFCSATKEECVKLLVVLARYGELSGQLVNFDKSNVIFSRNTPTEVRERLAGLLGIRHT