; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0041278 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0041278
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr13:14888997..14889740
RNA-Seq ExpressionLag0041278
SyntenyLag0041278
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAB4274760.1 unnamed protein product [Prunus armeniaca]2.5e-1728.29Show/hide
Query:  FSLNAARAFWFGSQLQLDSMSVRGRSFLESWEKVICRLSIGDQAEERISFFAFGLWRLWKARNGNTFADKRVDVQLCIQMAEADVRDYKKACEEKGRAEV
        F  + ARAFWF S LQLD   + G  F+ +W+ ++  L+  + A E I +F FGLWR+WKARN   F   + D    + +    V +++ A  +      
Subjt:  FSLNAARAFWFGSQLQLDSMSVRGRSFLESWEKVICRLSIGDQAEERISFFAFGLWRLWKARNGNTFADKRVDVQLCIQMAEADVRDYKKACEEKGRAEV

Query:  VQDALMAPILADDAEKPVIW-----------VDAAWCARKGGNSL-----------------------NSLHSEGKAFKWGL-ECAWNMGLREIWAKSDS
        +QD    P+    +  P+ W            DAAW A++ G  +                       +++  E  A +  L ECA N  L +I  +SDS
Subjt:  VQDALMAPILADDAEKPVIW-----------VDAAWCARKGGNSL-----------------------NSLHSEGKAFKWGL-ECAWNMGLREIWAKSDS

Query:  LSLVNIVNGKECCPVDFDPIRDDIQVLYGRFSFCNLSFVRREQNGIAHCLA
           + ++NG+     D + I  DI+ L    S  +  F  R  N  AH +A
Subjt:  LSLVNIVNGKECCPVDFDPIRDDIQVLYGRFSFCNLSFVRREQNGIAHCLA

CAB4274761.1 unnamed protein product [Prunus armeniaca]2.5e-1728.29Show/hide
Query:  FSLNAARAFWFGSQLQLDSMSVRGRSFLESWEKVICRLSIGDQAEERISFFAFGLWRLWKARNGNTFADKRVDVQLCIQMAEADVRDYKKACEEKGRAEV
        F  + ARAFWF S LQLD   + G  F+ +W+ ++  L+  + A E I +F FGLWR+WKARN   F   + D    + +    V +++ A  +      
Subjt:  FSLNAARAFWFGSQLQLDSMSVRGRSFLESWEKVICRLSIGDQAEERISFFAFGLWRLWKARNGNTFADKRVDVQLCIQMAEADVRDYKKACEEKGRAEV

Query:  VQDALMAPILADDAEKPVIW-----------VDAAWCARKGGNSL-----------------------NSLHSEGKAFKWGL-ECAWNMGLREIWAKSDS
        +QD    P+    +  P+ W            DAAW A++ G  +                       +++  E  A +  L ECA N  L +I  +SDS
Subjt:  VQDALMAPILADDAEKPVIW-----------VDAAWCARKGGNSL-----------------------NSLHSEGKAFKWGL-ECAWNMGLREIWAKSDS

Query:  LSLVNIVNGKECCPVDFDPIRDDIQVLYGRFSFCNLSFVRREQNGIAHCLA
           + ++NG+     D + I  DI+ L    S  +  F  R  N  AH +A
Subjt:  LSLVNIVNGKECCPVDFDPIRDDIQVLYGRFSFCNLSFVRREQNGIAHCLA

CAB4303756.1 unnamed protein product [Prunus armeniaca]7.3e-1728.29Show/hide
Query:  FSLNAARAFWFGSQLQLDSMSVRGRSFLESWEKVICRLSIGDQAEERISFFAFGLWRLWKARNGNTFADKRVDVQLCIQMAEADVRDYKKACEEKGRAEV
        F  + ARAFWF S LQLD   + G  F+ +W+ ++  L+  + A E I +F FGLWR+WKARN   F   + D    + +    V +++ A         
Subjt:  FSLNAARAFWFGSQLQLDSMSVRGRSFLESWEKVICRLSIGDQAEERISFFAFGLWRLWKARNGNTFADKRVDVQLCIQMAEADVRDYKKACEEKGRAEV

Query:  VQDALMAPILADDAEKPVIW-----------VDAAWCARKGGNSL-----------------------NSLHSEGKAFKWGL-ECAWNMGLREIWAKSDS
         QD    P+    +  P+ W            DAAW A++ G  +                       +++  E  A +  L ECA N  L +I  +SDS
Subjt:  VQDALMAPILADDAEKPVIW-----------VDAAWCARKGGNSL-----------------------NSLHSEGKAFKWGL-ECAWNMGLREIWAKSDS

Query:  LSLVNIVNGKECCPVDFDPIRDDIQVLYGRFSFCNLSFVRREQNGIAHCLA
           + ++NG+     D + I  DI+ L    S  +  F  R  N  AH +A
Subjt:  LSLVNIVNGKECCPVDFDPIRDDIQVLYGRFSFCNLSFVRREQNGIAHCLA

CAB4321714.1 unnamed protein product [Prunus armeniaca]1.2e-1628.63Show/hide
Query:  FSLNAARAFWFGSQLQLDSMSVRGRSFLESWEKVICRLSIGDQAEERISFFAFGLWRLWKARNGNTFADKRVDVQLCIQMAEADVRDYKKACEEKGRAEV
        F  + ARAFWF S LQLD   + G  F+ +W+ ++  L+  + A E I +F FGLWR+WKARN   F   + D    + +    V +++ A         
Subjt:  FSLNAARAFWFGSQLQLDSMSVRGRSFLESWEKVICRLSIGDQAEERISFFAFGLWRLWKARNGNTFADKRVDVQLCIQMAEADVRDYKKACEEKGRAEV

Query:  VQDALMAPILADDAEKPVIW-----------VDAAWCARKGGNSL-----------------------NSLHSEGKAFKWGL-ECAWNMGLREIWAKSDS
         QD    P+    +  P+ W            DAAW A++ G  +                       ++L  E  A +  L ECA N  L +I  +SDS
Subjt:  VQDALMAPILADDAEKPVIW-----------VDAAWCARKGGNSL-----------------------NSLHSEGKAFKWGL-ECAWNMGLREIWAKSDS

Query:  LSLVNIVNGKECCPVDFDPIRDDIQVLYGRFSFCNLSFVRREQNGIAH
           + ++NG+     D + I  DI+ L    S  +  F  R  N  AH
Subjt:  LSLVNIVNGKECCPVDFDPIRDDIQVLYGRFSFCNLSFVRREQNGIAH

XP_008386432.2 uncharacterized protein LOC103448950 [Malus domestica]1.5e-1726.29Show/hide
Query:  FSLNAARAFWFGSQLQLDSMSVRGRSFLESWEKVICRLSIGDQAEERISFFAFGLWRLWKARNGNTFADKRVDVQLCIQMAEADVRDYKKACE----EKG
        F  + +   WF S L L+S+ + G  FL+SW      +   + AE  +  FAFGLWR+WK RN   F          + +   ++ +Y++A      ++G
Subjt:  FSLNAARAFWFGSQLQLDSMSVRGRSFLESWEKVICRLSIGDQAEERISFFAFGLWRLWKARNGNTFADKRVDVQLCIQMAEADVRDYKKACE----EKG

Query:  RAEVVQDALMAPILADDAEKP-----VIWVDAAWCARK-------------------GGNSLNSLH----SEGKAFKWGLECAWNMGLREIWAKSDSLSL
            ++   M P+L    EKP      +  DA+WC+                     GG+  ++ H    +E  A + GL+     G   ++ ++D+ ++
Subjt:  RAEVVQDALMAPILADDAEKP-----VIWVDAAWCARK-------------------GGNSLNSLH----SEGKAFKWGLECAWNMGLREIWAKSDSLSL

Query:  VNIVNGKECCPVDFDPIRDDIQVLYGRFSFCNLSFVRREQNGIAHCLAKSV
        + ++N        F+ I  DI+VL  R +     FV RE+N  AH +AK V
Subjt:  VNIVNGKECCPVDFDPIRDDIQVLYGRFSFCNLSFVRREQNGIAHCLAKSV

TrEMBL top hitse value%identityAlignment
A0A6J5UE59 Reverse transcriptase domain-containing protein1.2e-1728.29Show/hide
Query:  FSLNAARAFWFGSQLQLDSMSVRGRSFLESWEKVICRLSIGDQAEERISFFAFGLWRLWKARNGNTFADKRVDVQLCIQMAEADVRDYKKACEEKGRAEV
        F  + ARAFWF S LQLD   + G  F+ +W+ ++  L+  + A E I +F FGLWR+WKARN   F   + D    + +    V +++ A  +      
Subjt:  FSLNAARAFWFGSQLQLDSMSVRGRSFLESWEKVICRLSIGDQAEERISFFAFGLWRLWKARNGNTFADKRVDVQLCIQMAEADVRDYKKACEEKGRAEV

Query:  VQDALMAPILADDAEKPVIW-----------VDAAWCARKGGNSL-----------------------NSLHSEGKAFKWGL-ECAWNMGLREIWAKSDS
        +QD    P+    +  P+ W            DAAW A++ G  +                       +++  E  A +  L ECA N  L +I  +SDS
Subjt:  VQDALMAPILADDAEKPVIW-----------VDAAWCARKGGNSL-----------------------NSLHSEGKAFKWGL-ECAWNMGLREIWAKSDS

Query:  LSLVNIVNGKECCPVDFDPIRDDIQVLYGRFSFCNLSFVRREQNGIAHCLA
           + ++NG+     D + I  DI+ L    S  +  F  R  N  AH +A
Subjt:  LSLVNIVNGKECCPVDFDPIRDDIQVLYGRFSFCNLSFVRREQNGIAHCLA

A0A6J5UF50 Uncharacterized protein1.2e-1728.29Show/hide
Query:  FSLNAARAFWFGSQLQLDSMSVRGRSFLESWEKVICRLSIGDQAEERISFFAFGLWRLWKARNGNTFADKRVDVQLCIQMAEADVRDYKKACEEKGRAEV
        F  + ARAFWF S LQLD   + G  F+ +W+ ++  L+  + A E I +F FGLWR+WKARN   F   + D    + +    V +++ A  +      
Subjt:  FSLNAARAFWFGSQLQLDSMSVRGRSFLESWEKVICRLSIGDQAEERISFFAFGLWRLWKARNGNTFADKRVDVQLCIQMAEADVRDYKKACEEKGRAEV

Query:  VQDALMAPILADDAEKPVIW-----------VDAAWCARKGGNSL-----------------------NSLHSEGKAFKWGL-ECAWNMGLREIWAKSDS
        +QD    P+    +  P+ W            DAAW A++ G  +                       +++  E  A +  L ECA N  L +I  +SDS
Subjt:  VQDALMAPILADDAEKPVIW-----------VDAAWCARKGGNSL-----------------------NSLHSEGKAFKWGL-ECAWNMGLREIWAKSDS

Query:  LSLVNIVNGKECCPVDFDPIRDDIQVLYGRFSFCNLSFVRREQNGIAHCLA
           + ++NG+     D + I  DI+ L    S  +  F  R  N  AH +A
Subjt:  LSLVNIVNGKECCPVDFDPIRDDIQVLYGRFSFCNLSFVRREQNGIAHCLA

A0A6J5VSP2 Uncharacterized protein3.0e-1627.89Show/hide
Query:  FSLNAARAFWFGSQLQLDSMSVRGRSFLESWEKVICRLSIGDQAEERISFFAFGLWRLWKARNGNTFADKRVDVQLCIQMAEADVRDYKKACEEKGRAEV
        F  + ARAFWF S LQLD   + G  F+ +W+ ++  L+  + A E I +F FGLWR+WKARN   F   + D    + +    V +++ A         
Subjt:  FSLNAARAFWFGSQLQLDSMSVRGRSFLESWEKVICRLSIGDQAEERISFFAFGLWRLWKARNGNTFADKRVDVQLCIQMAEADVRDYKKACEEKGRAEV

Query:  VQDALMAPILADDAEKPVIWV-----------DAAWCARKGGNSL-----------------------NSLHSEGKAFKWGL-ECAWNMGLREIWAKSDS
         QD    P+    +  P+ W            DAAW A++ G  +                       ++L  E  A +  L ECA N  L +I  +SDS
Subjt:  VQDALMAPILADDAEKPVIWV-----------DAAWCARKGGNSL-----------------------NSLHSEGKAFKWGL-ECAWNMGLREIWAKSDS

Query:  LSLVNIVNGKECCPVDFDPIRDDIQVLYGRFSFCNLSFVRREQNGIAHCLA
           + ++NG+     D + I  DI+ L    S  +  F     N   H +A
Subjt:  LSLVNIVNGKECCPVDFDPIRDDIQVLYGRFSFCNLSFVRREQNGIAHCLA

A0A6J5WPU6 Reverse transcriptase domain-containing protein3.5e-1728.29Show/hide
Query:  FSLNAARAFWFGSQLQLDSMSVRGRSFLESWEKVICRLSIGDQAEERISFFAFGLWRLWKARNGNTFADKRVDVQLCIQMAEADVRDYKKACEEKGRAEV
        F  + ARAFWF S LQLD   + G  F+ +W+ ++  L+  + A E I +F FGLWR+WKARN   F   + D    + +    V +++ A         
Subjt:  FSLNAARAFWFGSQLQLDSMSVRGRSFLESWEKVICRLSIGDQAEERISFFAFGLWRLWKARNGNTFADKRVDVQLCIQMAEADVRDYKKACEEKGRAEV

Query:  VQDALMAPILADDAEKPVIW-----------VDAAWCARKGGNSL-----------------------NSLHSEGKAFKWGL-ECAWNMGLREIWAKSDS
         QD    P+    +  P+ W            DAAW A++ G  +                       +++  E  A +  L ECA N  L +I  +SDS
Subjt:  VQDALMAPILADDAEKPVIW-----------VDAAWCARKGGNSL-----------------------NSLHSEGKAFKWGL-ECAWNMGLREIWAKSDS

Query:  LSLVNIVNGKECCPVDFDPIRDDIQVLYGRFSFCNLSFVRREQNGIAHCLA
           + ++NG+     D + I  DI+ L    S  +  F  R  N  AH +A
Subjt:  LSLVNIVNGKECCPVDFDPIRDDIQVLYGRFSFCNLSFVRREQNGIAHCLA

A0A6J5YDN0 Uncharacterized protein6.0e-1728.63Show/hide
Query:  FSLNAARAFWFGSQLQLDSMSVRGRSFLESWEKVICRLSIGDQAEERISFFAFGLWRLWKARNGNTFADKRVDVQLCIQMAEADVRDYKKACEEKGRAEV
        F  + ARAFWF S LQLD   + G  F+ +W+ ++  L+  + A E I +F FGLWR+WKARN   F   + D    + +    V +++ A         
Subjt:  FSLNAARAFWFGSQLQLDSMSVRGRSFLESWEKVICRLSIGDQAEERISFFAFGLWRLWKARNGNTFADKRVDVQLCIQMAEADVRDYKKACEEKGRAEV

Query:  VQDALMAPILADDAEKPVIW-----------VDAAWCARKGGNSL-----------------------NSLHSEGKAFKWGL-ECAWNMGLREIWAKSDS
         QD    P+    +  P+ W            DAAW A++ G  +                       ++L  E  A +  L ECA N  L +I  +SDS
Subjt:  VQDALMAPILADDAEKPVIW-----------VDAAWCARKGGNSL-----------------------NSLHSEGKAFKWGL-ECAWNMGLREIWAKSDS

Query:  LSLVNIVNGKECCPVDFDPIRDDIQVLYGRFSFCNLSFVRREQNGIAH
           + ++NG+     D + I  DI+ L    S  +  F  R  N  AH
Subjt:  LSLVNIVNGKECCPVDFDPIRDDIQVLYGRFSFCNLSFVRREQNGIAH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G02650.1 Ribonuclease H-like superfamily protein2.0e-0937.5Show/hide
Query:  NSLNSLHSEGKAFKWGLECAWNMGLREIWAKSDSLSLVNIVNGKECCPVDFDPIRDDIQVLYGRFSFCNLSFVRREQNGIAHCLAKSV
        +S  SLH+E   F   L+  W  GLR +W +SDS SLV ++N  E   +    +  DI+    +  +C+L FV RE+N  A  LA  V
Subjt:  NSLNSLHSEGKAFKWGLECAWNMGLREIWAKSDSLSLVNIVNGKECCPVDFDPIRDDIQVLYGRFSFCNLSFVRREQNGIAHCLAKSV

AT2G13980.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein4.7e-0632.26Show/hide
Query:  WCARKGGNSLNSLHSEGKAFKWGLECAWNMGLREIWAKSDSLSLVNIVNGKECCPVDFDPIRDDIQVLYGRFSFCNLSFVRREQNGIAHCLAK
        W + +  N+   L +E KA    L+  W  G   +  + D  +L N+V+G          + DDI++   +FS    SFVRR  N +AH LAK
Subjt:  WCARKGGNSLNSLHSEGKAFKWGLECAWNMGLREIWAKSDSLSLVNIVNGKECCPVDFDPIRDDIQVLYGRFSFCNLSFVRREQNGIAHCLAK

AT3G09510.1 Ribonuclease H-like superfamily protein2.0e-0923.83Show/hide
Query:  LWRLWKARNGNTFADKRVDVQLCIQMAEADVRDYKKACEEKGR-------------------AEVVQDALMAPILADDAEKPVIWV-------DAAWCAR
        +WR+WKARN   F   R      +  A+A+  D+  A +   +                   A  V+    A       E    W+         +W + 
Subjt:  LWRLWKARNGNTFADKRVDVQLCIQMAEADVRDYKKACEEKGR-------------------AEVVQDALMAPILADDAEKPVIWV-------DAAWCAR

Query:  KGGNSLNSLHSEGKAFKWGLECAWNMGLREIWAKSDSLSLVNIVNGKECCPVDFDPI----RDDIQVLYGRFSFCNLSFVRREQNGIAHCLAK
        K  ++ N L +E KA    L+  W  G  +++ + D  +L+N++NG     + F        +DI     +F+     F+RR+ N +AH LAK
Subjt:  KGGNSLNSLHSEGKAFKWGLECAWNMGLREIWAKSDSLSLVNIVNGKECCPVDFDPI----RDDIQVLYGRFSFCNLSFVRREQNGIAHCLAK

AT3G23320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.2e-0427.71Show/hide
Query:  SEGKAFKWGLECAWNMGLREIWAKSDSLSLVNIVNGKECCPVDFDPIRDDIQVLYGRFSFCNLSFVRREQNGIAHCLAKSVMS
        +E  A  W ++CAW++G R +  + D++++  ++  KE  P       + IQ     F+    +F  REQN     LAK  ++
Subjt:  SEGKAFKWGLECAWNMGLREIWAKSDSLSLVNIVNGKECCPVDFDPIRDDIQVLYGRFSFCNLSFVRREQNGIAHCLAKSVMS

AT4G03292.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.1e-0733.75Show/hide
Query:  SLNSLHSEGKAFKWGLECAWNMGLREIWAKSDSLSLVNIVNGKECCPVDFDPIRDDIQVLYGRFSFCNLSFVRREQNGIA
        S +SLH+E   F   L+  W  GLR +W ++D+  LV ++N  E   +   P+  DI+    +  +C++ FV RE+N  A
Subjt:  SLNSLHSEGKAFKWGLECAWNMGLREIWAKSDSLSLVNIVNGKECCPVDFDPIRDDIQVLYGRFSFCNLSFVRREQNGIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTTCTTTGAATGCGGCACGAGCATTCTGGTTCGGGTCGCAGCTGCAGCTTGATTCCATGAGTGTAAGAGGTCGGTCGTTCCTAGAAAGTTGGGAAAAAGTCATATG
CCGGCTTTCAATAGGGGACCAGGCTGAAGAGAGGATAAGCTTCTTCGCCTTCGGGCTATGGAGGCTGTGGAAAGCACGAAACGGAAATACCTTTGCAGATAAACGAGTGG
ATGTTCAGCTGTGTATTCAAATGGCAGAGGCGGATGTCCGGGACTACAAAAAGGCCTGTGAGGAGAAAGGGCGAGCTGAGGTAGTGCAAGACGCATTGATGGCCCCTATT
TTAGCTGATGATGCTGAAAAACCAGTGATATGGGTGGATGCTGCCTGGTGTGCACGGAAAGGGGGTAACAGCCTGAATAGCCTGCACTCAGAAGGTAAGGCCTTCAAATG
GGGCCTGGAATGCGCATGGAATATGGGACTTAGAGAAATTTGGGCTAAGTCTGACTCTCTTAGTCTTGTTAATATTGTAAATGGGAAAGAGTGCTGCCCAGTGGACTTCG
ATCCAATCCGAGATGACATTCAGGTGTTGTATGGGAGGTTTAGCTTTTGTAATCTCTCATTTGTGAGGAGAGAACAGAATGGGATAGCACATTGCTTGGCCAAGTCTGTA
ATGTCTCCCTTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTTTCTTTGAATGCGGCACGAGCATTCTGGTTCGGGTCGCAGCTGCAGCTTGATTCCATGAGTGTAAGAGGTCGGTCGTTCCTAGAAAGTTGGGAAAAAGTCATATG
CCGGCTTTCAATAGGGGACCAGGCTGAAGAGAGGATAAGCTTCTTCGCCTTCGGGCTATGGAGGCTGTGGAAAGCACGAAACGGAAATACCTTTGCAGATAAACGAGTGG
ATGTTCAGCTGTGTATTCAAATGGCAGAGGCGGATGTCCGGGACTACAAAAAGGCCTGTGAGGAGAAAGGGCGAGCTGAGGTAGTGCAAGACGCATTGATGGCCCCTATT
TTAGCTGATGATGCTGAAAAACCAGTGATATGGGTGGATGCTGCCTGGTGTGCACGGAAAGGGGGTAACAGCCTGAATAGCCTGCACTCAGAAGGTAAGGCCTTCAAATG
GGGCCTGGAATGCGCATGGAATATGGGACTTAGAGAAATTTGGGCTAAGTCTGACTCTCTTAGTCTTGTTAATATTGTAAATGGGAAAGAGTGCTGCCCAGTGGACTTCG
ATCCAATCCGAGATGACATTCAGGTGTTGTATGGGAGGTTTAGCTTTTGTAATCTCTCATTTGTGAGGAGAGAACAGAATGGGATAGCACATTGCTTGGCCAAGTCTGTA
ATGTCTCCCTTCTGA
Protein sequenceShow/hide protein sequence
MFSLNAARAFWFGSQLQLDSMSVRGRSFLESWEKVICRLSIGDQAEERISFFAFGLWRLWKARNGNTFADKRVDVQLCIQMAEADVRDYKKACEEKGRAEVVQDALMAPI
LADDAEKPVIWVDAAWCARKGGNSLNSLHSEGKAFKWGLECAWNMGLREIWAKSDSLSLVNIVNGKECCPVDFDPIRDDIQVLYGRFSFCNLSFVRREQNGIAHCLAKSV
MSPF