; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0021679 (gene) of Snake gourd v1 genome

Gene IDTan0021679
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRNase H domain-containing protein
Genome locationLG01:68040638..68041710
RNA-Seq ExpressionTan0021679
SyntenyTan0021679
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EEE57524.1 hypothetical protein OsJ_07834 [Oryza sativa Japonica Group]1.4e-0829.21Show/hide
Query:  QLREVVHQSTPHPQRGVMSTTPLHWLAAPQGWWKLNTDASWNEAHLYGDLGWILRDCNGHIIGAGHKFLSSRWPVLICKLAAILLGLLNVVDFSI-PLMV
        QLR    Q      +G    T   W+  P GW KLN D S+N +     +G ILR+  G  I  G + +++    L  +L A   GL   + +SI P+++
Subjt:  QLREVVHQSTPHPQRGVMSTTPLHWLAAPQGWWKLNTDASWNEAHLYGDLGWILRDCNGHIIGAGHKFLSSRWPVLICKLAAILLGLLNVVDFSI-PLMV

Query:  ESDLMNTISTINGTLVDLTEVQDYVQEVKDKSKCWDYIHFVYCHCSTNVMAHMLAKRAQTYQCTQVWTGESCNSVSGL
        E+D +  +S ING   D +     ++E++   K    +     H   N ++H+LA  A+    +  W G   N VS L
Subjt:  ESDLMNTISTINGTLVDLTEVQDYVQEVKDKSKCWDYIHFVYCHCSTNVMAHMLAKRAQTYQCTQVWTGESCNSVSGL

KAA8530045.1 hypothetical protein F0562_004754 [Nyssa sinensis]1.4e-0825Show/hide
Query:  WKNEVISTNFMVEDVDIIHNIPIARARGNNEIIWRLTPTGILERLLTFSKMLATTETQLREVVHQSTPHPQRGVMSTTPLHWLAAPQGWWKLNTDASWNE
        W  + +++ F+  +V+ I  IP+      +++IW  +   I      FS         L +V+   +              W A   G +KLN D SW++
Subjt:  WKNEVISTNFMVEDVDIIHNIPIARARGNNEIIWRLTPTGILERLLTFSKMLATTETQLREVVHQSTPHPQRGVMSTTPLHWLAAPQGWWKLNTDASWNE

Query:  AHLYGDLGWILRDCNGHIIGAGHKFLSSRWPVLICKLAAILLGLLNVVDFSI-PLMVESDLMNTISTINGTLVDLTEVQDYVQEVKDKSKCWDYIHFVYC
        +   G +G ++RDC G +I    K L         ++ AIL GL+   +  I  +++E D +N++  I   L D + +  YV+EV      +++I + + 
Subjt:  AHLYGDLGWILRDCNGHIIGAGHKFLSSRWPVLICKLAAILLGLLNVVDFSI-PLMVESDLMNTISTINGTLVDLTEVQDYVQEVKDKSKCWDYIHFVYC

Query:  HCSTNVMAHMLA
            N +AH+LA
Subjt:  HCSTNVMAHMLA

XP_022155262.1 uncharacterized protein LOC111022403 [Momordica charantia]3.5e-1232.88Show/hide
Query:  LHWLAAPQGWWKLNTDASWNEAHLYGDLGWILRDCNGHIIGAGHKFLSSRWPVLICKLAAILLGLLNVVDFSI--PLMVESDLMNTISTINGTLVDLTEV
        L W   P   W LN DASW+++   G +GWI+R  +G I+ AG++F+ +   V + + +AIL GL N+ +  +  PL +E+D     S +N    DLT+ 
Subjt:  LHWLAAPQGWWKLNTDASWNEAHLYGDLGWILRDCNGHIIGAGHKFLSSRWPVLICKLAAILLGLLNVVDFSI--PLMVESDLMNTISTINGTLVDLTEV

Query:  QDYVQEVKDKSKCWDYIHFVYCHCSTNVMAHMLAKRAQTYQCTQVW
           V+E+ +     + + F      TN  AH LA+RA   + + +W
Subjt:  QDYVQEVKDKSKCWDYIHFVYCHCSTNVMAHMLAKRAQTYQCTQVW

XP_025877173.1 uncharacterized protein LOC107278050 isoform X1 [Oryza sativa Japonica Group]2.2e-1131.25Show/hide
Query:  STTPLHWLAAPQGWWKLNTDASWNEAHLYGDLGWILRDCNGHIIGAGHKFLSSRWPVLICKLAAILLGLLNVVDFSI-PLMVESDLMNTISTINGTLVDL
        ++T LHW     GW KLN D S++     G +G ILRD +G  + A  K L S    L  ++ A + GL+  + +++ P+++E+D ++ ++ +     DL
Subjt:  STTPLHWLAAPQGWWKLNTDASWNEAHLYGDLGWILRDCNGHIIGAGHKFLSSRWPVLICKLAAILLGLLNVVDFSI-PLMVESDLMNTISTINGTLVDL

Query:  TEVQDYVQEVKDKSKCWDYIHFVYCHCSTNVMAHMLAKRAQTYQCTQVWTGESCNSVSGL
        +E+ + VQE+K        +     H S N ++H+LA RA+    + +W   SCN +S L
Subjt:  TEVQDYVQEVKDKSKCWDYIHFVYCHCSTNVMAHMLAKRAQTYQCTQVWTGESCNSVSGL

XP_030964220.1 uncharacterized protein LOC115985421 [Quercus lobata]1.4e-0830.77Show/hide
Query:  WLAAPQGWWKLNTDASWNEAHLYGDLGWILRDCNGHIIGAGHKFLSSRWPVLICKLAAILLGLLNVVDFSIP-LMVESDLMNTISTINGTLVDLTEVQDY
        W A P G++K+NTDA+ +       +G ++R C G I+ A  K L + +   I +   +L G+L VV   +  +++ESD ++ I  IN  +    E+   
Subjt:  WLAAPQGWWKLNTDASWNEAHLYGDLGWILRDCNGHIIGAGHKFLSSRWPVLICKLAAILLGLLNVVDFSIP-LMVESDLMNTISTINGTLVDLTEVQDY

Query:  VQEVKDKSKCWDYIHFVYCHCSTNVMAHMLAKRAQTYQCTQVW
        VQ + + S C+++  F +     N +AH LAK  +    +QVW
Subjt:  VQEVKDKSKCWDYIHFVYCHCSTNVMAHMLAKRAQTYQCTQVW

TrEMBL top hitse value%identityAlignment
A0A5J5AG70 RNase H domain-containing protein6.6e-0925Show/hide
Query:  WKNEVISTNFMVEDVDIIHNIPIARARGNNEIIWRLTPTGILERLLTFSKMLATTETQLREVVHQSTPHPQRGVMSTTPLHWLAAPQGWWKLNTDASWNE
        W  + +++ F+  +V+ I  IP+      +++IW  +   I      FS         L +V+   +              W A   G +KLN D SW++
Subjt:  WKNEVISTNFMVEDVDIIHNIPIARARGNNEIIWRLTPTGILERLLTFSKMLATTETQLREVVHQSTPHPQRGVMSTTPLHWLAAPQGWWKLNTDASWNE

Query:  AHLYGDLGWILRDCNGHIIGAGHKFLSSRWPVLICKLAAILLGLLNVVDFSI-PLMVESDLMNTISTINGTLVDLTEVQDYVQEVKDKSKCWDYIHFVYC
        +   G +G ++RDC G +I    K L         ++ AIL GL+   +  I  +++E D +N++  I   L D + +  YV+EV      +++I + + 
Subjt:  AHLYGDLGWILRDCNGHIIGAGHKFLSSRWPVLICKLAAILLGLLNVVDFSI-PLMVESDLMNTISTINGTLVDLTEVQDYVQEVKDKSKCWDYIHFVYC

Query:  HCSTNVMAHMLA
            N +AH+LA
Subjt:  HCSTNVMAHMLA

A0A6J1DNV9 uncharacterized protein LOC1110224031.7e-1232.88Show/hide
Query:  LHWLAAPQGWWKLNTDASWNEAHLYGDLGWILRDCNGHIIGAGHKFLSSRWPVLICKLAAILLGLLNVVDFSI--PLMVESDLMNTISTINGTLVDLTEV
        L W   P   W LN DASW+++   G +GWI+R  +G I+ AG++F+ +   V + + +AIL GL N+ +  +  PL +E+D     S +N    DLT+ 
Subjt:  LHWLAAPQGWWKLNTDASWNEAHLYGDLGWILRDCNGHIIGAGHKFLSSRWPVLICKLAAILLGLLNVVDFSI--PLMVESDLMNTISTINGTLVDLTEV

Query:  QDYVQEVKDKSKCWDYIHFVYCHCSTNVMAHMLAKRAQTYQCTQVW
           V+E+ +     + + F      TN  AH LA+RA   + + +W
Subjt:  QDYVQEVKDKSKCWDYIHFVYCHCSTNVMAHMLAKRAQTYQCTQVW

A0A6J5VSP2 Uncharacterized protein1.9e-0826.44Show/hide
Query:  MLATTETQLREVVHQSTPHPQRGVMSTTPLHWLAAPQGWWKLNTDASWNEAHLYGDLGWILRDCNGHIIGAGHKFLSSRWPVLICKLAAILLGLLNVVDF
        +L    ++ R       P P + V S  PL W   P G  K+N+DA+W+   + G +GW++RD  G ++ AG +        L+ +L AI   L    +F
Subjt:  MLATTETQLREVVHQSTPHPQRGVMSTTPLHWLAAPQGWWKLNTDASWNEAHLYGDLGWILRDCNGHIIGAGHKFLSSRWPVLICKLAAILLGLLNVVDF

Query:  SI-PLMVESDLMNTISTINGTLVDLTEVQDYVQEVKDKSKCWDYIHFVYCHCSTNVMAHMLAKRAQTYQCTQVW
         +  ++VESD    I  +NG     ++++  V +++        + FV+   S N   H +A          VW
Subjt:  SI-PLMVESDLMNTISTINGTLVDLTEVQDYVQEVKDKSKCWDYIHFVYCHCSTNVMAHMLAKRAQTYQCTQVW

A0A6P5S2T0 uncharacterized protein LOC1107519131.1e-0825Show/hide
Query:  IWRLTPTGILERL----LTFSKMLATTETQLREVVHQSTPHPQRGVMSTTPLHWLAAPQGWWKLNTDASWNEAHLYGDLGWILRDCNGHIIGAGHKFLSS
        IW+     + E      +  + +L    T+ R    +  P P + + S+    W   P    K+N D +W    L G +GW++RD +G ++ AG +    
Subjt:  IWRLTPTGILERL----LTFSKMLATTETQLREVVHQSTPHPQRGVMSTTPLHWLAAPQGWWKLNTDASWNEAHLYGDLGWILRDCNGHIIGAGHKFLSS

Query:  RWPVLICKLAAILLGLLNVVDFSIP-LMVESDLMNTISTINGTLVDLTEVQDYVQEVKDKSKCWDYIHFVYCHCSTNVMAHMLAKRAQTYQCTQVW
            L+ +L AI   L     F +  +MVESD    IS +NG     ++++  V +++     +  + FV+   S N +AH +A     +    VW
Subjt:  RWPVLICKLAAILLGLLNVVDFSIP-LMVESDLMNTISTINGTLVDLTEVQDYVQEVKDKSKCWDYIHFVYCHCSTNVMAHMLAKRAQTYQCTQVW

B9F1I2 Uncharacterized protein6.6e-0929.21Show/hide
Query:  QLREVVHQSTPHPQRGVMSTTPLHWLAAPQGWWKLNTDASWNEAHLYGDLGWILRDCNGHIIGAGHKFLSSRWPVLICKLAAILLGLLNVVDFSI-PLMV
        QLR    Q      +G    T   W+  P GW KLN D S+N +     +G ILR+  G  I  G + +++    L  +L A   GL   + +SI P+++
Subjt:  QLREVVHQSTPHPQRGVMSTTPLHWLAAPQGWWKLNTDASWNEAHLYGDLGWILRDCNGHIIGAGHKFLSSRWPVLICKLAAILLGLLNVVDFSI-PLMV

Query:  ESDLMNTISTINGTLVDLTEVQDYVQEVKDKSKCWDYIHFVYCHCSTNVMAHMLAKRAQTYQCTQVWTGESCNSVSGL
        E+D +  +S ING   D +     ++E++   K    +     H   N ++H+LA  A+    +  W G   N VS L
Subjt:  ESDLMNTISTINGTLVDLTEVQDYVQEVKDKSKCWDYIHFVYCHCSTNVMAHMLAKRAQTYQCTQVWTGESCNSVSGL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G27870.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein8.2e-0424.24Show/hide
Query:  WLAAPQGWWKLNTDASWNEAHLYGDLGWILRDCNGHIIGAGHKFLSSRWPVLICKLAAILLGLLNVVDFSIP-LMVESDLMNTISTINGTLVDLTEVQDY
        W    +GW K N D S+    +    GW++RD NG  + AG          L  ++ A+++ + +        +  E D       ING+ V    V ++
Subjt:  WLAAPQGWWKLNTDASWNEAHLYGDLGWILRDCNGHIIGAGHKFLSSRWPVLICKLAAILLGLLNVVDFSIP-LMVESDLMNTISTINGTLVDLTEVQDY

Query:  VQEVKDKSKCWDYIHFVYCHCSTNVMAHMLAK
        ++++    + ++ IHF +     N  A +LAK
Subjt:  VQEVKDKSKCWDYIHFVYCHCSTNVMAHMLAK

AT1G52990.1 thioredoxin family protein8.2e-0424.8Show/hide
Query:  KLNTDASWNEAHLYGDLGWILRDCNGHIIGAGHKFLSSRWPVLICKLAAILLGLLNVVDFSIPLMV-ESDLMNTISTINGTLVDLTEVQDYVQEVKDKSK
        K N DAS +E  +   LGW++R+  G ++  G      R      + +A++  +     F    ++ E D  N    IN T  D   ++ Y+  +K    
Subjt:  KLNTDASWNEAHLYGDLGWILRDCNGHIIGAGHKFLSSRWPVLICKLAAILLGLLNVVDFSIPLMV-ESDLMNTISTINGTLVDLTEVQDYVQEVKDKSK

Query:  CWDYIHFVYCHCSTNVMAHMLAKRA
         +    F++ H   N  A  L K+A
Subjt:  CWDYIHFVYCHCSTNVMAHMLAKRA

AT2G02650.1 Ribonuclease H-like superfamily protein1.4e-0629.6Show/hide
Query:  SKMLATTETQLREVVHQSTPHPQRGVMSTTPLHWLAAPQGWWKLNTDASWNEAHLYGDLGWILRDCNGHIIGAGHKFLSSRWPVLICKLAAILLGLLNVV
        ++ L   ET     VH +T   Q     ++   W   P+GW K N D+ + +   Y   GW +R+CNGHI+  G+  L S      C L A  LG L+ +
Subjt:  SKMLATTETQLREVVHQSTPHPQRGVMSTTPLHWLAAPQGWWKLNTDASWNEAHLYGDLGWILRDCNGHIIGAGHKFLSSRWPVLICKLAAILLGLLNVV

Query:  DFSIP-----LMVESDLMNTISTIN
                  +  ESD  + ++ IN
Subjt:  DFSIP-----LMVESDLMNTISTIN

AT4G09775.1 BEST Arabidopsis thaliana protein match is: Ribonuclease H-like superfamily protein (TAIR:AT2G02650.1)4.8e-0425.84Show/hide
Query:  ETQLREVVHQSTPHPQRGVMSTTPLHWLAAPQGWWKLNTDASWNEAHLYGDLGWILRDCNGHIIGAGHKFLSSRWPVLICKLAAILLGL
        ET + +  +  +        S     W   P+G+ K N D+ + +   Y    WI+RD NGH+I +G   L   +  L  +    L  L
Subjt:  ETQLREVVHQSTPHPQRGVMSTTPLHWLAAPQGWWKLNTDASWNEAHLYGDLGWILRDCNGHIIGAGHKFLSSRWPVLICKLAAILLGL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTATAAATGATAATAGGACATGGAAGAATGAGGTCATAAGCACAAATTTTATGGTCGAGGATGTTGATATCATTCACAATATACCTATTGCACGGGCTAGAGGAAA
TAATGAGATCATTTGGAGACTTACTCCAACTGGAATTCTTGAGCGGTTGCTGACTTTTTCCAAGATGTTGGCCACTACCGAGACTCAGCTTCGTGAGGTGGTACATCAAA
GTACCCCTCATCCACAACGAGGGGTAATGAGTACGACACCTTTGCACTGGTTGGCTGCGCCACAAGGATGGTGGAAACTTAATACAGATGCATCTTGGAATGAAGCACAT
TTGTATGGAGATCTGGGTTGGATCCTTCGTGATTGTAATGGACATATTATCGGCGCGGGACACAAATTTCTTTCTTCAAGATGGCCGGTACTAATCTGCAAACTTGCTGC
TATTCTTCTTGGTTTATTGAACGTTGTGGACTTTTCCATTCCTTTAATGGTTGAGTCTGACTTAATGAATACCATTTCCACGATCAATGGCACACTGGTTGATTTAACGG
AGGTCCAAGACTATGTGCAAGAGGTGAAAGATAAATCGAAATGTTGGGACTACATACATTTTGTTTATTGTCATTGTAGCACTAATGTAATGGCTCATATGCTAGCTAAG
AGGGCTCAAACTTACCAGTGCACTCAAGTCTGGACGGGCGAGAGTTGTAATTCTGTGTCGGGATTGTTCTCTTCGTAA
mRNA sequenceShow/hide mRNA sequence
ATGCTTATAAATGATAATAGGACATGGAAGAATGAGGTCATAAGCACAAATTTTATGGTCGAGGATGTTGATATCATTCACAATATACCTATTGCACGGGCTAGAGGAAA
TAATGAGATCATTTGGAGACTTACTCCAACTGGAATTCTTGAGCGGTTGCTGACTTTTTCCAAGATGTTGGCCACTACCGAGACTCAGCTTCGTGAGGTGGTACATCAAA
GTACCCCTCATCCACAACGAGGGGTAATGAGTACGACACCTTTGCACTGGTTGGCTGCGCCACAAGGATGGTGGAAACTTAATACAGATGCATCTTGGAATGAAGCACAT
TTGTATGGAGATCTGGGTTGGATCCTTCGTGATTGTAATGGACATATTATCGGCGCGGGACACAAATTTCTTTCTTCAAGATGGCCGGTACTAATCTGCAAACTTGCTGC
TATTCTTCTTGGTTTATTGAACGTTGTGGACTTTTCCATTCCTTTAATGGTTGAGTCTGACTTAATGAATACCATTTCCACGATCAATGGCACACTGGTTGATTTAACGG
AGGTCCAAGACTATGTGCAAGAGGTGAAAGATAAATCGAAATGTTGGGACTACATACATTTTGTTTATTGTCATTGTAGCACTAATGTAATGGCTCATATGCTAGCTAAG
AGGGCTCAAACTTACCAGTGCACTCAAGTCTGGACGGGCGAGAGTTGTAATTCTGTGTCGGGATTGTTCTCTTCGTAA
Protein sequenceShow/hide protein sequence
MLINDNRTWKNEVISTNFMVEDVDIIHNIPIARARGNNEIIWRLTPTGILERLLTFSKMLATTETQLREVVHQSTPHPQRGVMSTTPLHWLAAPQGWWKLNTDASWNEAH
LYGDLGWILRDCNGHIIGAGHKFLSSRWPVLICKLAAILLGLLNVVDFSIPLMVESDLMNTISTINGTLVDLTEVQDYVQEVKDKSKCWDYIHFVYCHCSTNVMAHMLAK
RAQTYQCTQVWTGESCNSVSGLFSS