CuGenDBv2

Spg014073 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg014073
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: scaffold3:37938393..37941282
RNA-Seq Expression: Spg014073
Synteny: Spg014073
Gene Ontology terms:
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
    IPR002156 - Ribonuclease H domain
    IPR012337 - Ribonuclease H-like superfamily
    IPR036397 - Ribonuclease H superfamily
    IPR044730 - Ribonuclease H-like domain, plant type
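
The genome location string in the record packs a sequence ID and a coordinate range into one field. As a minimal sketch of how such a field could be unpacked, the following assumes the coordinates are 1-based and inclusive (the record does not state this), and the helper names `parse_location` and `span_length` are hypothetical, not part of any database API.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into its three parts."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


# Location copied from the record above.
seqid, start, end = parse_location("scaffold3:37938393..37941282")
locus_length = span_length(start, end)
print(seqid, start, end, locus_length)
```

Under that coordinate assumption the locus spans 2890 bp.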


Homology
GenBank top hits
KAG6693248.1 hypothetical protein I3842_10G159900, partial [Carya illinoinensis] (e value: 9.3e-14; % identity: 25.37)
Query:  LEMANHLFWECKMVRGLWLKFCPFTNEIFFGDRSGWTPLDYCEGIWKADRGKELEEDRMARSLVVCWQIWNHRNEVLHNKKQPDIQQLEQKITNYCSEFL
        +E A H  + C  VR +W+ +CP   EI   D S W   +        DRG     D +   LVV W++WN RN+ ++     +I            E+ 
Subjt:  LEMANHLFWECKMVRGLWLKFCPFTNEIFFGDRSGWTPLDYCEGIWKADRGKELEEDRMARSLVVCWQIWNHRNEVLHNKKQPDIQQLEQKITNYCSEFL

Query:  R-----------NEIPY--------LELNCDTTWSETQQRGGIGWILRSPNGTPLCASYKCVRKRWKVSWLEVAAICEGLKI-IPRDSPLVRVESDAMNV
        +           N + Y        L+LN D          G+G ILR  NG  + A+ K  R+     ++E  A+  GL++ +    P + +ESD++ +
Subjt:  R-----------NEIPY--------LELNCDTTWSETQQRGGIGWILRSPNGTPLCASYKCVRKRWKVSWLEVAAICEGLKI-IPRDSPLVRVESDAMNV

Query:  VRLLNDEVQDISELAIFVMEAKSLISAIIPVEKVAYALRFHNGVAHYLAQQAVSLNSSEFWSNSFPNW
        V  LN++ + ++E    + + + L+       ++ +  R  NG AH LA+ A  +     W +S P++
Subjt:  VRLLNDEVQDISELAIFVMEAKSLISAIIPVEKVAYALRFHNGVAHYLAQQAVSLNSSEFWSNSFPNW

KAG6693249.1 hypothetical protein I3842_10G159900 [Carya illinoinensis] (e value: 9.3e-14; % identity: 25.37)
Query:  LEMANHLFWECKMVRGLWLKFCPFTNEIFFGDRSGWTPLDYCEGIWKADRGKELEEDRMARSLVVCWQIWNHRNEVLHNKKQPDIQQLEQKITNYCSEFL
        +E A H  + C  VR +W+ +CP   EI   D S W   +        DRG     D +   LVV W++WN RN+ ++     +I            E+ 
Subjt:  LEMANHLFWECKMVRGLWLKFCPFTNEIFFGDRSGWTPLDYCEGIWKADRGKELEEDRMARSLVVCWQIWNHRNEVLHNKKQPDIQQLEQKITNYCSEFL

Query:  R-----------NEIPY--------LELNCDTTWSETQQRGGIGWILRSPNGTPLCASYKCVRKRWKVSWLEVAAICEGLKI-IPRDSPLVRVESDAMNV
        +           N + Y        L+LN D          G+G ILR  NG  + A+ K  R+     ++E  A+  GL++ +    P + +ESD++ +
Subjt:  R-----------NEIPY--------LELNCDTTWSETQQRGGIGWILRSPNGTPLCASYKCVRKRWKVSWLEVAAICEGLKI-IPRDSPLVRVESDAMNV

Query:  VRLLNDEVQDISELAIFVMEAKSLISAIIPVEKVAYALRFHNGVAHYLAQQAVSLNSSEFWSNSFPNW
        V  LN++ + ++E    + + + L+       ++ +  R  NG AH LA+ A  +     W +S P++
Subjt:  VRLLNDEVQDISELAIFVMEAKSLISAIIPVEKVAYALRFHNGVAHYLAQQAVSLNSSEFWSNSFPNW

XP_022154990.1 uncharacterized protein LOC111022134 isoform X1 [Momordica charantia] (e value: 9.6e-19; % identity: 28.18)
Query:  KKLEMANHLFWECKMVRGLWLKFCPFTNEIFFGDRSGWTPLDYCEGIWKADRGKELEEDRMARSLVVCWQIWNHRNEVLHNKKQPDIQQLEQKITNYC--
        KK E   H+ WECK+++ +W+   P     F+ DR+ WT  +Y E  W  D+  E EE R  RS+++  QIW  RN+ +      + + ++  I  Y   
Subjt:  KKLEMANHLFWECKMVRGLWLKFCPFTNEIFFGDRSGWTPLDYCEGIWKADRGKELEEDRMARSLVVCWQIWNHRNEVLHNKKQPDIQQLEQKITNYC--

Query:  -----SEFLRNEIPY---------------------LELNCDTTWSETQQRGGIGWILRSPNGTPLCASYKCVRKRWKVSWLEVAAICEGLKIIPRD-SP
             +   R    +                      +LN D  W       GIGWILR   G  +    + +R    +++LEV AICEGL+ I ++   
Subjt:  -----SEFLRNEIPY---------------------LELNCDTTWSETQQRGGIGWILRSPNGTPLCASYKCVRKRWKVSWLEVAAICEGLKIIPRD-SP

Query:  LVRVESDAMNVVRLLNDEVQ
         + +ESD++  + LL+  V+
Subjt:  LVRVESDAMNVVRLLNDEVQ

XP_022155262.1 uncharacterized protein LOC111022403 [Momordica charantia] (e value: 8.1e-18; % identity: 35.32)
Query:  LVVCWQIWNHRNEVL----HNKKQPDIQQLEQKIT--NYCSE--------FLRNEIPY-------LELNCDTTWSETQQRGGIGWILRSPNGTPLCASYK
        L+  W IWNHRN V+    H+     IQQL + +T  +Y SE         L N++ +         LN D +WS++  RGGIGWI+RS +G  + A  +
Subjt:  LVVCWQIWNHRNEVL----HNKKQPDIQQLEQKIT--NYCSE--------FLRNEIPY-------LELNCDTTWSETQQRGGIGWILRSPNGTPLCASYK

Query:  CVRKRWKVSWLEVAAICEGLKIIPRDSPL--VRVESDAMNVVRLLNDEVQDISELAIFVMEAKSLISA--IIPVEKVAYALRFHNGVAHYLAQQAVSLNS
         V     V  LE +AI EGL+ +     L  + +E+D+  V  LLN + +D+++    V E  +L  +  I+   KV    R  NG AH LAQ+A  L  
Subjt:  CVRKRWKVSWLEVAAICEGLKIIPRDSPL--VRVESDAMNVVRLLNDEVQDISELAIFVMEAKSLISA--IIPVEKVAYALRFHNGVAHYLAQQAVSLNS

Query:  SEFWSNSFPNWFLVLNAS
        S  W + FPNW  +L  S
Subjt:  SEFWSNSFPNWFLVLNAS

XP_024046691.1 uncharacterized protein LOC112101027 [Citrus clementina] (e value: 5.5e-14; % identity: 27.59)
Query:  VDTKWLESTVLVKKLEMANHLFWECKMVRGLWLKFCPFTNEIFFGDRSGWTPLDYCEGIWKADRGKELEEDRMARSLVVCWQIWNHRNEVLHNKKQPDIQ
        V   W +   L K  E   H  +ECK  + +W +  PF  +I F        L   +G+    R  E+E       +VVCW IW+ RN  L   K+ D Q
Subjt:  VDTKWLESTVLVKKLEMANHLFWECKMVRGLWLKFCPFTNEIFFGDRSGWTPLDYCEGIWKADRGKELEEDRMARSLVVCWQIWNHRNEVLHNKKQPDIQ

Query:  QLEQKITNYCSEFLRNEIP----------------------YLELNCDTTWSETQQRGGIGWILRSPNGTPLCASYKCVRKRWKVSWLEVAAICEGLKII
                    + R ++P                      + ++N D      QQR G+G ++R+  G  + A+ K  +   KV + +  AI  GL+I 
Subjt:  QLEQKITNYCSEFLRNEIP----------------------YLELNCDTTWSETQQRGGIGWILRSPNGTPLCASYKCVRKRWKVSWLEVAAICEGLKII

Query:  P--RDSPLVRVESDAMNVVRLLNDEVQDISELAIFVMEAKSLISAIIPVEKVAYALRFHNGVAHYLAQQAVSLNSSEFWSNSFPNWFLVL
           R  PL+ VESD+  VV L+N +    +E+     E +  +  +  V KV +  R  N +AH LA+ A+      FW ++ P+ FL L
Subjt:  P--RDSPLVRVESDAMNVVRLLNDEVQDISELAIFVMEAKSLISAIIPVEKVAYALRFHNGVAHYLAQQAVSLNSSEFWSNSFPNWFLVL

TrEMBL top hits
A0A6J1CP26 uncharacterized protein LOC111013412 (e value: 3.8e-13; % identity: 26.79)
Query:  EDRMARSLVVCWQIWNHRNEVLHNKKQPDIQQLEQKI-----------TNYCSEFLRNEIPYL-------------------ELNCDTTWSETQQRGGIG
        E+   RS+++ WQIW  RN+ +     P+ + ++  I           TN   +    ++  +                   +LN +  W      GGIG
Subjt:  EDRMARSLVVCWQIWNHRNEVLHNKKQPDIQQLEQKI-----------TNYCSEFLRNEIPYL-------------------ELNCDTTWSETQQRGGIG

Query:  WILRSPNGTPLCASYKCVRKRWKVSWLEVAAICEGLKIIPRD-SPLVRVESDAMNVVRLLNDEVQDISELAIFVMEAKSLISAIIPVEKVAYALRFHNGV
        WILR   G  + AS + +R    +++LEV AICEGL+ I ++    + +ESD++  + LL+ + QD +E+ I+++E    +   + +  + +  R  N V
Subjt:  WILRSPNGTPLCASYKCVRKRWKVSWLEVAAICEGLKIIPRD-SPLVRVESDAMNVVRLLNDEVQDISELAIFVMEAKSLISAIIPVEKVAYALRFHNGV

Query:  AHYLAQQAV
        AH LA++A+
Subjt:  AHYLAQQAV

A0A6J1DL64 uncharacterized protein LOC111022134 isoform X1 (e value: 4.7e-19; % identity: 28.18)
Query:  KKLEMANHLFWECKMVRGLWLKFCPFTNEIFFGDRSGWTPLDYCEGIWKADRGKELEEDRMARSLVVCWQIWNHRNEVLHNKKQPDIQQLEQKITNYC--
        KK E   H+ WECK+++ +W+   P     F+ DR+ WT  +Y E  W  D+  E EE R  RS+++  QIW  RN+ +      + + ++  I  Y   
Subjt:  KKLEMANHLFWECKMVRGLWLKFCPFTNEIFFGDRSGWTPLDYCEGIWKADRGKELEEDRMARSLVVCWQIWNHRNEVLHNKKQPDIQQLEQKITNYC--

Query:  -----SEFLRNEIPY---------------------LELNCDTTWSETQQRGGIGWILRSPNGTPLCASYKCVRKRWKVSWLEVAAICEGLKIIPRD-SP
             +   R    +                      +LN D  W       GIGWILR   G  +    + +R    +++LEV AICEGL+ I ++   
Subjt:  -----SEFLRNEIPY---------------------LELNCDTTWSETQQRGGIGWILRSPNGTPLCASYKCVRKRWKVSWLEVAAICEGLKIIPRD-SP

Query:  LVRVESDAMNVVRLLNDEVQ
         + +ESD++  + LL+  V+
Subjt:  LVRVESDAMNVVRLLNDEVQ

A0A6J1DNV9 uncharacterized protein LOC111022403 (e value: 3.9e-18; % identity: 35.32)
Query:  LVVCWQIWNHRNEVL----HNKKQPDIQQLEQKIT--NYCSE--------FLRNEIPY-------LELNCDTTWSETQQRGGIGWILRSPNGTPLCASYK
        L+  W IWNHRN V+    H+     IQQL + +T  +Y SE         L N++ +         LN D +WS++  RGGIGWI+RS +G  + A  +
Subjt:  LVVCWQIWNHRNEVL----HNKKQPDIQQLEQKIT--NYCSE--------FLRNEIPY-------LELNCDTTWSETQQRGGIGWILRSPNGTPLCASYK

Query:  CVRKRWKVSWLEVAAICEGLKIIPRDSPL--VRVESDAMNVVRLLNDEVQDISELAIFVMEAKSLISA--IIPVEKVAYALRFHNGVAHYLAQQAVSLNS
         V     V  LE +AI EGL+ +     L  + +E+D+  V  LLN + +D+++    V E  +L  +  I+   KV    R  NG AH LAQ+A  L  
Subjt:  CVRKRWKVSWLEVAAICEGLKIIPRDSPL--VRVESDAMNVVRLLNDEVQDISELAIFVMEAKSLISA--IIPVEKVAYALRFHNGVAHYLAQQAVSLNS

Query:  SEFWSNSFPNWFLVLNAS
        S  W + FPNW  +L  S
Subjt:  SEFWSNSFPNWFLVLNAS

A0A6J1DSV1 uncharacterized protein LOC111023608 (e value: 1.2e-11; % identity: 25.81)
Query:  EDRMARSLVVCWQIWNHRNEVLHNKKQPDIQQLEQKI-----------TNYCSEFLRNEIPYL-------------------ELNCDTTWSETQQRGGIG
        E+   RS+++ WQIW  RN+ +      + + ++  I           TN   +    ++  +                   +LN D  W      GGIG
Subjt:  EDRMARSLVVCWQIWNHRNEVLHNKKQPDIQQLEQKI-----------TNYCSEFLRNEIPYL-------------------ELNCDTTWSETQQRGGIG

Query:  WILRSPNGTPLCASYKCVRKRWKVSWLEVAAICEGLKIIPRD--SPL-------VRVESDAMNVVRLLNDEVQDISELAIFVMEAKSLISAIIPVEKVAY
        WILR   G  + A  + +R    +++LEV AICEGL+ I ++   P+       + +ESD++  + LL+ + QD +E+ I+++E    +   + +  + +
Subjt:  WILRSPNGTPLCASYKCVRKRWKVSWLEVAAICEGLKIIPRD--SPL-------VRVESDAMNVVRLLNDEVQDISELAIFVMEAKSLISAIIPVEKVAY

Query:  ALRFHNGVAHYLAQQAV
          R  N VAH LA++A+
Subjt:  ALRFHNGVAHYLAQQAV

B8AFN2 RNase H domain-containing protein (e value: 8.5e-13; % identity: 26.74)
Query:  WKADRGKELEEDRMARSLVVCWQIWNHRNEVLHNKKQPDIQQLEQKITNYCS-------------------------------------------EFLRN
        W  D  ++L        L+  W+ W+ RNE+ H+K  P ++  ++ + +Y +                                            +++ 
Subjt:  WKADRGKELEEDRMARSLVVCWQIWNHRNEVLHNKKQPDIQQLEQKITNYCS-------------------------------------------EFLRN

Query:  EIPYLELNCDTTWSETQQRGGIGWILRSPNGTPLCASYKCVRKRWKVSWLEVAAICEGLKIIPRDSPL-VRVESDAMNVVRLLNDEVQDISELAIFVMEA
        ++ +++LN D ++     +GGIG +LR  +G  + AS K + +       E+ A  EGL ++   + L + VE+D M+VV+LLND  +D SELA  V EA
Subjt:  EIPYLELNCDTTWSETQQRGGIGWILRSPNGTPLCASYKCVRKRWKVSWLEVAAICEGLKIIPRDSPL-VRVESDAMNVVRLLNDEVQDISELAIFVMEA

Query:  KSLISAI--IPVEKVAYALRFHNGVAHYLAQQAVSLNSSEFWSNSFPNWFLVLNASDI
        K L++    I V K+    R  N V+H LA +A   + S  W     N+ L L   DI
Subjt:  KSLISAI--IPVEKVAYALRFHNGVAHYLAQQAVSLNSSEFWSNSFPNWFLVLNASDI

SwissProt top hits
No hits found

Arabidopsis top hits
AT4G29090.1 Ribonuclease H-like superfamily protein (e value: 1.5e-09; % identity: 22.26)
Query:  EMANHLFWECKMVRGLWLKFCPFTNEIFFGDRSGWTPLDYCEGIWKADRGKELEEDRMARSLV--VCWQIWNHRNEVLHNKKQPDIQQLEQKITNYCSEF
        E  NHL ++C   R  W       + I       W    Y    W  + G    +   A  LV  + W++W +RNE++   ++ + Q++ ++  +   E+
Subjt:  EMANHLFWECKMVRGLWLKFCPFTNEIFFGDRSGWTPLDYCEGIWKADRGKELEEDRMARSLV--VCWQIWNHRNEVLHNKKQPDIQQLEQKITNYCSEF

Query:  -LRNEI----------------------PYLELNCDTTWSETQQRGGIGWILRSPNGTPLCASYKCVRKRWKVSWLEVAAICEGLKIIPR-DSPLVRVES
         +R E                        +++ N D TW+   +R GIGW+LR+  G       + + K   V   E+ A+   +  + R     V  ES
Subjt:  -LRNEI----------------------PYLELNCDTTWSETQQRGGIGWILRSPNGTPLCASYKCVRKRWKVSWLEVAAICEGLKIIPR-DSPLVRVES

Query:  DAMNVVRLLNDEVQDISELAIFVMEAKSLISAIIPVEKVAYALRFHNGVAHYLAQQAVS-LNSSEFWSNSFPNW
        D+  ++ +LN++ +    L   + + + L+S    V K  +  R  N +A  +A++++S LN      +  P+W
Subjt:  DAMNVVRLLNDEVQDISELAIFVMEAKSLISAIIPVEKVAYALRFHNGVAHYLAQQAVS-LNSSEFWSNSFPNW


Sequences
CDS sequence
ATGCTAGCGAAATGCCGACACCACTGGATGACATCGAAAATTCCTAGCAACACCAACATCAACCATGTGACACCAGAAGTTCTTCGCAATGTCAATGCCACACTTGTAGC
ATTGGCATTGCTAGGAACTGCCGACACCGACCAATCTAGCATCGACGTTTCTTCGAATTGTCGACATCACACTTTAACGTCGACAATTTTTGGAAATGTTGACACCAAAT
GGTTGGAGTCGACAGTACTTGTGAAAAAACTTGAGATGGCAAATCATCTGTTCTGGGAATGTAAAATGGTAAGAGGTCTTTGGTTGAAATTCTGCCCTTTTACTAATGAA
ATCTTTTTTGGTGACAGATCTGGATGGACTCCTTTGGATTACTGTGAAGGAATTTGGAAGGCAGACCGAGGAAAAGAACTAGAGGAAGACAGAATGGCTAGATCTCTTGT
GGTGTGTTGGCAAATTTGGAATCACAGAAATGAAGTTCTTCACAACAAAAAACAGCCAGACATTCAGCAATTGGAGCAGAAGATCACCAATTATTGTTCAGAGTTCCTAA
GGAATGAAATTCCTTACCTGGAGCTGAACTGCGACACCACTTGGTCTGAAACACAACAGCGAGGTGGTATTGGGTGGATTCTTCGATCTCCGAATGGCACCCCTCTCTGC
GCGAGTTACAAATGTGTGAGAAAACGGTGGAAGGTCAGCTGGTTAGAGGTTGCGGCAATTTGTGAAGGTCTGAAAATCATCCCCCGTGATTCTCCTCTGGTTCGTGTTGA
GTCGGATGCGATGAATGTGGTTCGGCTGCTCAACGATGAAGTTCAGGACATTTCAGAGCTGGCGATTTTCGTGATGGAGGCTAAATCCCTCATCTCTGCTATTATTCCCG
TGGAAAAGGTTGCATATGCCCTAAGATTCCATAATGGTGTGGCACATTATCTGGCCCAACAAGCTGTTTCTTTAAATTCTTCTGAGTTTTGGTCCAATTCATTTCCTAAT
TGGTTTTTAGTCTTAAATGCATCTGATATTGGATATGTAGTTAATACTTACGGGGGAGCCTGTCCCATAGGTAATTTATCTTTGGGAGCTGTTTCTTAA
mRNA sequence
ATGCTAGCGAAATGCCGACACCACTGGATGACATCGAAAATTCCTAGCAACACCAACATCAACCATGTGACACCAGAAGTTCTTCGCAATGTCAATGCCACACTTGTAGC
ATTGGCATTGCTAGGAACTGCCGACACCGACCAATCTAGCATCGACGTTTCTTCGAATTGTCGACATCACACTTTAACGTCGACAATTTTTGGAAATGTTGACACCAAAT
GGTTGGAGTCGACAGTACTTGTGAAAAAACTTGAGATGGCAAATCATCTGTTCTGGGAATGTAAAATGGTAAGAGGTCTTTGGTTGAAATTCTGCCCTTTTACTAATGAA
ATCTTTTTTGGTGACAGATCTGGATGGACTCCTTTGGATTACTGTGAAGGAATTTGGAAGGCAGACCGAGGAAAAGAACTAGAGGAAGACAGAATGGCTAGATCTCTTGT
GGTGTGTTGGCAAATTTGGAATCACAGAAATGAAGTTCTTCACAACAAAAAACAGCCAGACATTCAGCAATTGGAGCAGAAGATCACCAATTATTGTTCAGAGTTCCTAA
GGAATGAAATTCCTTACCTGGAGCTGAACTGCGACACCACTTGGTCTGAAACACAACAGCGAGGTGGTATTGGGTGGATTCTTCGATCTCCGAATGGCACCCCTCTCTGC
GCGAGTTACAAATGTGTGAGAAAACGGTGGAAGGTCAGCTGGTTAGAGGTTGCGGCAATTTGTGAAGGTCTGAAAATCATCCCCCGTGATTCTCCTCTGGTTCGTGTTGA
GTCGGATGCGATGAATGTGGTTCGGCTGCTCAACGATGAAGTTCAGGACATTTCAGAGCTGGCGATTTTCGTGATGGAGGCTAAATCCCTCATCTCTGCTATTATTCCCG
TGGAAAAGGTTGCATATGCCCTAAGATTCCATAATGGTGTGGCACATTATCTGGCCCAACAAGCTGTTTCTTTAAATTCTTCTGAGTTTTGGTCCAATTCATTTCCTAAT
TGGTTTTTAGTCTTAAATGCATCTGATATTGGATATGTAGTTAATACTTACGGGGGAGCCTGTCCCATAGGTAATTTATCTTTGGGAGCTGTTTCTTAA
Protein sequence
MLAKCRHHWMTSKIPSNTNINHVTPEVLRNVNATLVALALLGTADTDQSSIDVSSNCRHHTLTSTIFGNVDTKWLESTVLVKKLEMANHLFWECKMVRGLWLKFCPFTNE
IFFGDRSGWTPLDYCEGIWKADRGKELEEDRMARSLVVCWQIWNHRNEVLHNKKQPDIQQLEQKITNYCSEFLRNEIPYLELNCDTTWSETQQRGGIGWILRSPNGTPLC
ASYKCVRKRWKVSWLEVAAICEGLKIIPRDSPLVRVESDAMNVVRLLNDEVQDISELAIFVMEAKSLISAIIPVEKVAYALRFHNGVAHYLAQQAVSLNSSEFWSNSFPN
WFLVLNASDIGYVVNTYGGACPIGNLSLGAVS
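
The protein sequence listed above corresponds to the CDS read three bases at a time. As a minimal sketch of that relationship, the following translates the first 30 and the last 12 nucleotides of the CDS, assuming the standard genetic code (the record does not name a translation table). The `translate` helper is hypothetical and uses no external library.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (third base varying fastest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence into one-letter amino acids."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped sequence
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 and last 12 nucleotides of the CDS, copied from the record.
cds_start = "ATGCTAGCGAAATGCCGACACCACTGGATG"
cds_end = "GCTGTTTCTTAA"

print(translate(cds_start))  # N-terminal residues
print(translate(cds_end))    # C-terminal residues plus the stop codon
```

The two fragments translate to `MLAKCRHHWM` and `AVS*`, which agree with the start and end of the protein sequence listed above. The stop codon `TAA` is not part of the protein.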