; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0006051 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0006051
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr6:36662992..36665967
RNA-Seq ExpressionLag0006051
SyntenyLag0006051
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_015382715.1 uncharacterized protein LOC107175626 [Citrus sinensis]8.2e-2727.75Show/hide
Query:  KVFSGCGKEILIKSVVQAIPTYAMGCFHLPKSLVLKISSLCAKFWWGSSEERWRIHWMRWEKLCQPKELGGLNFRDIEGFNQVMLAKTNWKAI-------
        K+FS  GKEILIK+V QA+P YAM  F L K L   I    AKFWWGS E+R+ IHW RW+K+ Q K  GGL FRD+  FNQ ++AK  W+ +       
Subjt:  KVFSGCGKEILIKSVVQAIPTYAMGCFHLPKSLVLKISSLCAKFWWGSSEERWRIHWMRWEKLCQPKELGGLNFRDIEGFNQVMLAKTNWKAI-------

Query:  ----------------TCIRSKVLGGIRALRVGERESEEQSKAYGFSSHHHVDTAIICPICGKHPKSTDHVLFR-----CSRAKQPIPDTHVRCKWIAEY
                        T + S +    +++  G +  E+  K +            + P   +H       +F      CS +K+   +  +   W   Y
Subjt:  ----------------TCIRSKVLGGIRALRVGERESEEQSKAYGFSSHHHVDTAIICPICGKHPKSTDHVLFR-----CSRAKQPIPDTHVRCKWIAEY

Query:  LREYQVQNGLMLKLM--------------------------TRNI-QPFNIGPTSNMLVIHVDATWDERRKVSDFGVVIRNYEGEVIVALKGNCPFLITS
         R   +     ++ M                          TR + Q     P  N+L ++VDA  + + +    G V+R+  G V+ A      F    
Subjt:  LREYQVQNGLMLKLM--------------------------TRNI-QPFNIGPTSNMLVIHVDATWDERRKVSDFGVVIRNYEGEVIVALKGNCPFLITS

Query:  SCAEARAILERLRLAENMGAQQIKIFSDCLTVVSMIQGTEIRAMEV
        S AEA AI   L++A+   A  + + +DC  V  ++  T+    E+
Subjt:  SCAEARAILERLRLAENMGAQQIKIFSDCLTVVSMIQGTEIRAMEV

XP_022131662.1 uncharacterized protein LOC111004787 [Momordica charantia]6.9e-2656.7Show/hide
Query:  GLEGKVFSGCGKEILIKSVVQAIPTYAMGCFHLPKSLVLKISSLCAKFWWGSSEERWRIHWMRWEKLCQPKELGGLNFRDIEGFNQVMLAKTNWKAI
        G +   FS  GKE+LIKSV QAIP YAM  F LPK    ++S + A+FWWGSS +  ++HWM WE++C PKELGGLNFRD+EGFNQ ++AK  W+ +
Subjt:  GLEGKVFSGCGKEILIKSVVQAIPTYAMGCFHLPKSLVLKISSLCAKFWWGSSEERWRIHWMRWEKLCQPKELGGLNFRDIEGFNQVMLAKTNWKAI

XP_030495108.1 uncharacterized protein LOC115710895 [Cannabis sativa]5.3e-2653.4Show/hide
Query:  VFSGCGKEILIKSVVQAIPTYAMGCFHLPKSLVLKISSLCAKFWWGSSEERWRIHWMRWEKLCQPKELGGLNFRDIEGFNQVMLAKTNWKAIT---CIRS
        +FS  GKEILIK+V+QA+P Y M CF + K ++ +I SL A+FWWGS++ + ++HW  W+KLC+ KE GG+ FRD+E FNQ +LAK  WK IT   C+ S
Subjt:  VFSGCGKEILIKSVVQAIPTYAMGCFHLPKSLVLKISSLCAKFWWGSSEERWRIHWMRWEKLCQPKELGGLNFRDIEGFNQVMLAKTNWKAIT---CIRS

Query:  KVL
        K+L
Subjt:  KVL

XP_030495126.1 uncharacterized protein LOC115710915 [Cannabis sativa]5.3e-2645.38Show/hide
Query:  GLEGKVFSGCGKEILIKSVVQAIPTYAMGCFHLPKSLVLKISSLCAKFWWGSSEERWRIHWMRWEKLCQPKELGGLNFRDIEGFNQVMLAKTNWKAITCI
        G +  +FS  G+EIL+K+++QAIPTY M CF LPK L+  I ++ A+FWWGSS+ + +IHW  W KLC+PKE GG+ F+++E FNQ +LAK  WK I   
Subjt:  GLEGKVFSGCGKEILIKSVVQAIPTYAMGCFHLPKSLVLKISSLCAKFWWGSSEERWRIHWMRWEKLCQPKELGGLNFRDIEGFNQVMLAKTNWKAITCI

Query:  RSKVLGGIRALRVGERESEEQSKAYGFSSH
         S +   ++A         E +K  GF S+
Subjt:  RSKVLGGIRALRVGERESEEQSKAYGFSSH

XP_041003981.1 uncharacterized protein LOC121249338 [Juglans microcarpa x Juglans regia]1.4e-2630.19Show/hide
Query:  TGLEGKVFSGCGKEILIKSVVQAIPTYAMGCFHLPKSLVLKISSLCAKFWWGSSEERWRIHWMRWEKLCQPKELGGLNFRDIEGFNQVMLAKTNWKAITC
        T  +  + S  GKE L+KSV+Q+IPTY+MG F +PK+++ K++ L   FWWG S  R +IHW+  +KL + K+ GGL FRD E FN  +LAK  WK I  
Subjt:  TGLEGKVFSGCGKEILIKSVVQAIPTYAMGCFHLPKSLVLKISSLCAKFWWGSSEERWRIHWMRWEKLCQPKELGGLNFRDIEGFNQVMLAKTNWKAITC

Query:  IRSKVLGGIRALRVGERESEEQSKAYGFSSHHHVDTAIICPICGKHPKSTDH----VLFRCSRAKQPIPDTHVRCKWIAE--YLREYQ------------
          S      +  R  + E  EQ K             I C  C    +  D      +F      + +  +  R  W+ E  +   YQ            
Subjt:  IRSKVLGGIRALRVGERESEEQSKAYGFSSHHHVDTAIICPICGKHPKSTDH----VLFRCSRAKQPIPDTHVRCKWIAE--YLREYQ------------

Query:  ---VQNGLMLKLMTRNIQPFNIGPTSNMLVIHVDATWDERRKVSDFGVVIRNYEGEVIVALKGNCPFLITSSCAEARAILERLRLAENMGAQQIKIFSDC
           +QNG   K+         I P  +M   + DA  D+       GV++RN EG V+ +L  +    +     EA A         ++G Q + +  D 
Subjt:  ---VQNGLMLKLMTRNIQPFNIGPTSNMLVIHVDATWDERRKVSDFGVVIRNYEGEVIVALKGNCPFLITSSCAEARAILERLRLAENMGAQQIKIFSDC

Query:  LTVVSMIQ
        L VV  IQ
Subjt:  LTVVSMIQ

TrEMBL top hitse value%identityAlignment
A0A803NUL5 Uncharacterized protein3.3e-2931Show/hide
Query:  GLEGKVFSGCGKEILIKSVVQAIPTYAMGCFHLPKSLVLKISSLCAKFWWGSSEERWRIHWMRWEKLCQPKELGGLNFRDIEGFNQVMLAKTNWKAI---
        G +  +FS  GKEILIK+V+QA+P Y M CF + K ++  I SL A+FWWGS+    +IHW +W KLC+ KE GG+ FRD+E FNQ +LAK  WK +   
Subjt:  GLEGKVFSGCGKEILIKSVVQAIPTYAMGCFHLPKSLVLKISSLCAKFWWGSSEERWRIHWMRWEKLCQPKELGGLNFRDIEGFNQVMLAKTNWKAI---

Query:  TCIRSKVLGGIRALRVGERESEEQSKAYGFSSHHHV--DTAIICPICGKHP------------KSTDHVLFRCSRAKQPIPDTHVRCKWIAEYLREYQVQ
         C+ ++VL  +      E   E +   +G S    +     +I    GK              +  ++ +F     +Q  P  H+   W  +++  YQ+ 
Subjt:  TCIRSKVLGGIRALRVGERESEEQSKAYGFSSHHHV--DTAIICPICGKHP------------KSTDHVLFRCSRAKQPIPDTHVRCKWIAEYLREYQVQ

Query:  NGLMLKLMTRNIQPFN-IGPTSNMLVIHVDATWDERRKVSDFGVVIRNYEGEVIVALKGNCPFLITSSCAEARAILERLRLAENMGAQQIKIFSDCLTVV
            L L+  N    +   P S+  +I+ DA+            +IRN  G+++VA        +T   AEA AI   ++LA      +  + SDC+ ++
Subjt:  NGLMLKLMTRNIQPFN-IGPTSNMLVIHVDATWDERRKVSDFGVVIRNYEGEVIVALKGNCPFLITSSCAEARAILERLRLAENMGAQQIKIFSDCLTVV

A0A803P8L6 Uncharacterized protein1.4e-2727.4Show/hide
Query:  VFSGCGKEILIKSVVQAIPTYAMGCFHLPKSLVLKISSLCAKFWWGSSEERWRIHWMRWEKLCQPKELGGLNFRDIEGFNQVMLAKTNWKAIT---CIRS
        +FS  GKEILIK++VQA+P YAM CF + K ++ +I S+ A FWWG+S ++ +IHW  WEK+C+ KE GG+ FR++E FNQ +LAK  WK +T   C+ +
Subjt:  VFSGCGKEILIKSVVQAIPTYAMGCFHLPKSLVLKISSLCAKFWWGSSEERWRIHWMRWEKLCQPKELGGLNFRDIEGFNQVMLAKTNWKAIT---CIRS

Query:  KVL-----------------------------------------GGIRALRVGERESEEQSKAYGFSSHHHVDTAIICPICGKHPKSTDHVLFRCSRAK-
        KVL                                         G    +R+ E     +   +   S  +V   +       H  S     ++C     
Subjt:  KVL-----------------------------------------GGIRALRVGERESEEQSKAYGFSSHHHVDTAIICPICGKHPKSTDHVLFRCSRAK-

Query:  -------------QPIPDTHVRCKWIAEYLREYQVQNGLMLKLMTRNIQPFNIGPTSNM--------------LVIHVDATWDERRKVSDFGVVIRNYEG
                      P  +      W      +Y V +G  L+     +Q  +  P  +M               +I+ DA+    +     G+VIR++ G
Subjt:  -------------QPIPDTHVRCKWIAEYLREYQVQNGLMLKLMTRNIQPFNIGPTSNM--------------LVIHVDATWDERRKVSDFGVVIRNYEG

Query:  EVIVALKGNCPFLITSSCAEARAILERLRLAENMGAQQIKIFSDCLTVVSMIQG
         V+VA     P  ++ + AE+ AI   L+LA+      I I SDC +V   + G
Subjt:  EVIVALKGNCPFLITSSCAEARAILERLRLAENMGAQQIKIFSDCLTVVSMIQG

A0A803PAN7 Uncharacterized protein2.8e-2854.46Show/hide
Query:  EETTGLEGKVFSGCGKEILIKSVVQAIPTYAMGCFHLPKSLVLKISSLCAKFWWGSSEERWRIHWMRWEKLCQPKELGGLNFRDIEGFNQVMLAKTNWKA
        ++  G +  +FS  G+E+L+K+V+QAIPTY M CF LPK L+  I ++ A+FWWGSSE + +IHW RWEKLC+PK+ GG+ F+D+E FNQ +LAK  WK 
Subjt:  EETTGLEGKVFSGCGKEILIKSVVQAIPTYAMGCFHLPKSLVLKISSLCAKFWWGSSEERWRIHWMRWEKLCQPKELGGLNFRDIEGFNQVMLAKTNWKA

Query:  I
        I
Subjt:  I

A0A803QSM5 Uncharacterized protein8.0e-2854.46Show/hide
Query:  EETTGLEGKVFSGCGKEILIKSVVQAIPTYAMGCFHLPKSLVLKISSLCAKFWWGSSEERWRIHWMRWEKLCQPKELGGLNFRDIEGFNQVMLAKTNWKA
        ++  G +G +FS  G+E+L+K+V+QAIPTY M CF LPK L+  I ++ A+FWWGSSE + +IHW RWEKLC+PK+  G+ F+D+E FNQ +LAK  WK 
Subjt:  EETTGLEGKVFSGCGKEILIKSVVQAIPTYAMGCFHLPKSLVLKISSLCAKFWWGSSEERWRIHWMRWEKLCQPKELGGLNFRDIEGFNQVMLAKTNWKA

Query:  I
        I
Subjt:  I

A0A803QSN3 Uncharacterized protein3.6e-2854.46Show/hide
Query:  EETTGLEGKVFSGCGKEILIKSVVQAIPTYAMGCFHLPKSLVLKISSLCAKFWWGSSEERWRIHWMRWEKLCQPKELGGLNFRDIEGFNQVMLAKTNWKA
        ++  G +  +FS  G+E+L+K+V+QAIPTY M CF LPK L+  I ++ A+FWWGSSE + +IHW RWEKLC+PK+ GG+ F+D+E FNQ +LAK  WK 
Subjt:  EETTGLEGKVFSGCGKEILIKSVVQAIPTYAMGCFHLPKSLVLKISSLCAKFWWGSSEERWRIHWMRWEKLCQPKELGGLNFRDIEGFNQVMLAKTNWKA

Query:  I
        I
Subjt:  I

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657503.1e-1331.63Show/hide
Query:  TGLEGKVFSGCGKEILIKSVVQAIPTYAMGCFHLPKSLVLKISSLCAKFWWGSSEERWRIHWMRWEKLCQPKELGGLNFRDIEGFNQVMLAKTNWKAI
        +G   K  S  G+  L K+V+ ++P ++M    LP+S++ ++  L   F WGS+ E+ + H ++W K+C PK+ GGL  R  +  N+ +++K  W+ +
Subjt:  TGLEGKVFSGCGKEILIKSVVQAIPTYAMGCFHLPKSLVLKISSLCAKFWWGSSEERWRIHWMRWEKLCQPKELGGLNFRDIEGFNQVMLAKTNWKAI

P93295 Uncharacterized mitochondrial protein AtMg003106.8e-1649.35Show/hide
Query:  AIPTYAMGCFHLPKSLVLKISSLCAKFWWGSSEERWRIHWMRWEKLCQPKE-LGGLNFRDIEGFNQVMLAKTNWKAI
        A+P YAM CF L K L  K++S   +FWW S E + +I W+ W+KLC+ KE  GGL FRD+  FNQ +LAK +++ I
Subjt:  AIPTYAMGCFHLPKSLVLKISSLCAKFWWGSSEERWRIHWMRWEKLCQPKE-LGGLNFRDIEGFNQVMLAKTNWKAI

Arabidopsis top hitse value%identityAlignment
AT4G29090.1 Ribonuclease H-like superfamily protein1.5e-1542.86Show/hide
Query:  AIPTYAMGCFHLPKSLVLKISSLCAKFWWGSSEERWRIHWMRWEKLCQPKELGGLNFRDIEGFNQVMLAKTNWKAIT
        A+PTY M CF LPK++  +I S+ A FWW + +E   +HW  W+ L   K  GG+ F+DIE FN  +L K  W+ ++
Subjt:  AIPTYAMGCFHLPKSLVLKISSLCAKFWWGSSEERWRIHWMRWEKLCQPKELGGLNFRDIEGFNQVMLAKTNWKAIT

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein4.8e-1749.35Show/hide
Query:  AIPTYAMGCFHLPKSLVLKISSLCAKFWWGSSEERWRIHWMRWEKLCQPKE-LGGLNFRDIEGFNQVMLAKTNWKAI
        A+P YAM CF L K L  K++S   +FWW S E + +I W+ W+KLC+ KE  GGL FRD+  FNQ +LAK +++ I
Subjt:  AIPTYAMGCFHLPKSLVLKISSLCAKFWWGSSEERWRIHWMRWEKLCQPKE-LGGLNFRDIEGFNQVMLAKTNWKAI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTAGAAGATCGAGTTTGGAAGAAACTACAGGGTTGGAAGGAAAAGTTTTCTCTGGATGTGGAAAGGAGATTCTTATCAAAAGTGTTGTCCAAGCCATCCCAACATA
TGCTATGGGATGTTTCCATTTGCCGAAGAGCCTTGTCTTAAAAATATCTTCCCTTTGTGCAAAATTTTGGTGGGGGAGTAGTGAGGAACGTTGGAGAATTCATTGGATGC
GGTGGGAGAAGCTTTGCCAACCAAAAGAGTTAGGTGGCCTAAACTTCCGTGATATTGAGGGGTTCAATCAGGTCATGTTGGCAAAAACAAACTGGAAGGCAATCACCTGC
ATTCGTTCAAAAGTCCTCGGAGGAATCAGAGCCTTGAGGGTCGGGGAAAGAGAAAGCGAAGAACAAAGCAAAGCATATGGATTCAGTTCGCATCACCATGTGGACACCGC
TATCATTTGTCCAATCTGTGGTAAACATCCTAAATCAACTGATCATGTCTTATTCAGATGCTCCCGAGCCAAGCAGCCAATTCCAGATACTCATGTGAGATGCAAATGGA
TTGCAGAATACTTACGGGAGTACCAAGTTCAGAATGGGTTAATGTTGAAACTGATGACTCGCAATATACAACCTTTCAATATTGGTCCTACATCTAATATGTTGGTAATT
CACGTGGATGCAACTTGGGATGAAAGACGGAAGGTTTCGGATTTTGGAGTGGTAATTCGAAATTATGAAGGGGAGGTAATTGTTGCTCTGAAAGGAAATTGCCCTTTTTT
GATTACTTCAAGTTGTGCAGAGGCGAGAGCAATATTAGAGAGACTTCGACTTGCAGAAAATATGGGAGCTCAACAGATTAAGATTTTTTCTGACTGCCTAACTGTAGTGT
CAATGATACAAGGGACGGAGATAAGAGCAATGGAAGTGTCTTCTTTGTTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTTAGAAGATCGAGTTTGGAAGAAACTACAGGGTTGGAAGGAAAAGTTTTCTCTGGATGTGGAAAGGAGATTCTTATCAAAAGTGTTGTCCAAGCCATCCCAACATA
TGCTATGGGATGTTTCCATTTGCCGAAGAGCCTTGTCTTAAAAATATCTTCCCTTTGTGCAAAATTTTGGTGGGGGAGTAGTGAGGAACGTTGGAGAATTCATTGGATGC
GGTGGGAGAAGCTTTGCCAACCAAAAGAGTTAGGTGGCCTAAACTTCCGTGATATTGAGGGGTTCAATCAGGTCATGTTGGCAAAAACAAACTGGAAGGCAATCACCTGC
ATTCGTTCAAAAGTCCTCGGAGGAATCAGAGCCTTGAGGGTCGGGGAAAGAGAAAGCGAAGAACAAAGCAAAGCATATGGATTCAGTTCGCATCACCATGTGGACACCGC
TATCATTTGTCCAATCTGTGGTAAACATCCTAAATCAACTGATCATGTCTTATTCAGATGCTCCCGAGCCAAGCAGCCAATTCCAGATACTCATGTGAGATGCAAATGGA
TTGCAGAATACTTACGGGAGTACCAAGTTCAGAATGGGTTAATGTTGAAACTGATGACTCGCAATATACAACCTTTCAATATTGGTCCTACATCTAATATGTTGGTAATT
CACGTGGATGCAACTTGGGATGAAAGACGGAAGGTTTCGGATTTTGGAGTGGTAATTCGAAATTATGAAGGGGAGGTAATTGTTGCTCTGAAAGGAAATTGCCCTTTTTT
GATTACTTCAAGTTGTGCAGAGGCGAGAGCAATATTAGAGAGACTTCGACTTGCAGAAAATATGGGAGCTCAACAGATTAAGATTTTTTCTGACTGCCTAACTGTAGTGT
CAATGATACAAGGGACGGAGATAAGAGCAATGGAAGTGTCTTCTTTGTTGTGA
Protein sequenceShow/hide protein sequence
MFRRSSLEETTGLEGKVFSGCGKEILIKSVVQAIPTYAMGCFHLPKSLVLKISSLCAKFWWGSSEERWRIHWMRWEKLCQPKELGGLNFRDIEGFNQVMLAKTNWKAITC
IRSKVLGGIRALRVGERESEEQSKAYGFSSHHHVDTAIICPICGKHPKSTDHVLFRCSRAKQPIPDTHVRCKWIAEYLREYQVQNGLMLKLMTRNIQPFNIGPTSNMLVI
HVDATWDERRKVSDFGVVIRNYEGEVIVALKGNCPFLITSSCAEARAILERLRLAENMGAQQIKIFSDCLTVVSMIQGTEIRAMEVSSLL