; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0022315 (gene) of Snake gourd v1 genome

Gene IDTan0022315
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRNase H domain-containing protein
Genome locationLG11:14868048..14868704
RNA-Seq ExpressionTan0022315
SyntenyTan0022315
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ONK81429.1 uncharacterized protein A4U43_C01F28990 [Asparagus officinalis]9.9e-1128.99Show/hide
Query:  WSLWTQRNFTRFNGVNLSTQELISKVVAQSHLYNVNQGCTSSLPPPLFLSK------GKRSLQAGKPWKLNVDVACSYNLQHEGMGWIVRDEKGQFEIGG
        W++W  RN   +   + +  + +   +A + L    Q    ++ PPL  S+       K S  A   +KLNVD A   N +  G+G+I+RD++G F    
Subjt:  WSLWTQRNFTRFNGVNLSTQELISKVVAQSHLYNVNQGCTSSLPPPLFLSK------GKRSLQAGKPWKLNVDVACSYNLQHEGMGWIVRDEKGQFEIGG

Query:  CNFIDKPLLIKSLEALAIRDGLKEFLAWRQVQFPS-FVVKSDSLEVISLHNRNDVDLSETGFIIEDILSIVKAIGSILFCKIPREENKITHCISVLASS-
           +   +     EAL    G++E L+W +  FP    ++ D++ V +  +R++ D S  G IIED LS+ + I +I F    R  NK  H ++  A + 
Subjt:  CNFIDKPLLIKSLEALAIRDGLKEFLAWRQVQFPS-FVVKSDSLEVISLHNRNDVDLSETGFIIEDILSIVKAIGSILFCKIPREENKITHCISVLASS-

Query:  -SDVLWS
         + V+WS
Subjt:  -SDVLWS

XP_022143535.1 uncharacterized protein LOC111013412 [Momordica charantia]3.3e-1430.88Show/hide
Query:  WSLWTQRNFTRFNGVNLSTQEL---ISKVVAQSHLYNVN-QGCTSSLPPPLFLSKGKRSLQAGKP-----WKLNVDVACSYNLQHEGMGWIVRDEKGQFE
        W +W  RN + F GV+  T+++   I + +  S   N N +G +++    L       +    KP     WKLN + A   +    G+GWI+RDEKG+  
Subjt:  WSLWTQRNFTRFNGVNLSTQEL---ISKVVAQSHLYNVN-QGCTSSLPPPLFLSKGKRSLQAGKP-----WKLNVDVACSYNLQHEGMGWIVRDEKGQFE

Query:  IGGCNFIDKPLLIKSLEALAIRDGLKEFLAWRQVQFPSFVVKSDSLEVISLHNRNDVDLSETGFIIEDILSIVKAIGSILFCKIPREENKITHCISVLAS
           C  I     I  LE +AI +GL+   A RQ       ++SDSLE I L +R   D +E  +++E+I  ++K +  +    I RE NK+ H ++  A 
Subjt:  IGGCNFIDKPLLIKSLEALAIRDGLKEFLAWRQVQFPSFVVKSDSLEVISLHNRNDVDLSETGFIIEDILSIVKAIGSILFCKIPREENKITHCISVLAS

Query:  SSDV
         +D+
Subjt:  SSDV

XP_022155262.1 uncharacterized protein LOC111022403 [Momordica charantia]3.0e-1527.31Show/hide
Query:  WSLWTQRNFTRFNGVNLSTQELISKVV---------AQSHLYNVNQGCTSSL---PPPLFLSKGKRSLQAGKPWKLNVDVACSYNLQHEGMGWIVRDEKG
        W +W  RN+  F G + S   +I ++          +++ L  +++   + L   PPP+ +            W LN D + S +    G+GWI+R   G
Subjt:  WSLWTQRNFTRFNGVNLSTQELISKVV---------AQSHLYNVNQGCTSSL---PPPLFLSKGKRSLQAGKPWKLNVDVACSYNLQHEGMGWIVRDEKG

Query:  QFEIGGCNFIDKPLLIKSLEALAIRDGLKEFLAWRQVQFPSFVVKSDSLEVISLHNRNDVDLSETGFIIEDILSIVKAIGSILFCKIPREENKITHCISV
           + G  F++    +K LEA AI +GL+       ++     +++DS EV SL NR   DL++TG+++E+IL++  +   + F K+ RE N   H ++ 
Subjt:  QFEIGGCNFIDKPLLIKSLEALAIRDGLKEFLAWRQVQFPSFVVKSDSLEVISLHNRNDVDLSETGFIIEDILSIVKAIGSILFCKIPREENKITHCISV

Query:  LAS--SSDVLWSEKNFPECLIPLLQEK
         AS     ++W +  FP  L  L + +
Subjt:  LAS--SSDVLWSEKNFPECLIPLLQEK

XP_022156777.1 uncharacterized protein LOC111023608 [Momordica charantia]4.7e-1329.19Show/hide
Query:  WSLWTQRNFTRFNGVNLSTQEL---ISKVVAQSHLYNVN-QGCTSSLPPPLFLSKGKRSLQAGKP-----WKLNVDVACSYNLQHEGMGWIVRDEKGQFE
        W +W  RN + F GV+  T+++   I + +  S   + N +G +++    L    G  +    KP     WKLN D A   +    G+GWI+RDEKG+  
Subjt:  WSLWTQRNFTRFNGVNLSTQEL---ISKVVAQSHLYNVN-QGCTSSLPPPLFLSKGKRSLQAGKP-----WKLNVDVACSYNLQHEGMGWIVRDEKGQFE

Query:  IGGCNFIDKPLLIKSLEALAIRDGLKEF-----LAWRQVQFPSFVVKSDSLEVISLHNRNDVDLSETGFIIEDILSIVKAIGSILFCKIPREENKITHCI
           C  I     I  LE +AI +GL+          +Q       ++SDSLE I L +R   D +E  +++E+I  +++ +  +    I RE NK+ H +
Subjt:  IGGCNFIDKPLLIKSLEALAIRDGLKEF-----LAWRQVQFPSFVVKSDSLEVISLHNRNDVDLSETGFIIEDILSIVKAIGSILFCKIPREENKITHCI

Query:  SVLASSSDV
        +  A  +D+
Subjt:  SVLASSSDV

XP_024190234.1 uncharacterized protein LOC112194221 [Rosa chinensis]6.2e-1329.11Show/hide
Query:  WSLWTQRNFTRFNGVNLSTQELISKV--------VAQSHLYNVNQGCTSSLPPPLFLSKGKRSLQAGKPWKLNVDVACSYNLQHEGMGWIVRDEKGQFEI
        W LW  RN  +  GV   ++ ++            AQ    NV +  T          K K         KLN D A  Y  +   +G +VRD +G+ + 
Subjt:  WSLWTQRNFTRFNGVNLSTQELISKV--------VAQSHLYNVNQGCTSSLPPPLFLSKGKRSLQAGKPWKLNVDVACSYNLQHEGMGWIVRDEKGQFEI

Query:  GGCNFIDKPLLIKSLEALAIRDGLKEFLAWRQVQFPSFVVKSDSLEVISLHNRNDVDLSETGFIIEDILSIVKAIGSILFCKIPREENKITHCIS--VLA
         G   +     I ++EALA+  GL   L +R+  F + VV+SDS  VI   N+ ++DLS  G +++DI  +     S+ + K+ RE N   H ++   L 
Subjt:  GGCNFIDKPLLIKSLEALAIRDGLKEFLAWRQVQFPSFVVKSDSLEVISLHNRNDVDLSETGFIIEDILSIVKAIGSILFCKIPREENKITHCIS--VLA

Query:  SSSDVLWSEKNFP
        +  D LW E   P
Subjt:  SSSDVLWSEKNFP

TrEMBL top hitse value%identityAlignment
A0A5P1EPS3 RNase H domain-containing protein1.4e-1028.5Show/hide
Query:  WSLWTQRNFTRFNGVNLSTQELISKVVAQSHLYNVNQGCTSSLPPPL------FLSKGKRSLQAGKPWKLNVDVACSYNLQHEGMGWIVRDEKGQFEIGG
        W++W  RN   +   + +  + +   +A + L    Q    ++ PPL        +  K S  A   +KLNVD A   N +  G+G+I+RD++G F    
Subjt:  WSLWTQRNFTRFNGVNLSTQELISKVVAQSHLYNVNQGCTSSLPPPL------FLSKGKRSLQAGKPWKLNVDVACSYNLQHEGMGWIVRDEKGQFEIGG

Query:  CNFIDKPLLIKSLEALAIRDGLKEFLAWRQVQFPS-FVVKSDSLEVISLHNRNDVDLSETGFIIEDILSIVKAIGSILFCKIPREENKITHCISVLASS-
           +   +     EAL    G++E L+W +  FP    ++ D++ V +  +R++ D S  G IIED LS+ + I +I F    R  NK  H ++  A + 
Subjt:  CNFIDKPLLIKSLEALAIRDGLKEFLAWRQVQFPS-FVVKSDSLEVISLHNRNDVDLSETGFIIEDILSIVKAIGSILFCKIPREENKITHCISVLASS-

Query:  -SDVLWS
         + V+WS
Subjt:  -SDVLWS

A0A5P1FW40 RNase H domain-containing protein4.8e-1128.99Show/hide
Query:  WSLWTQRNFTRFNGVNLSTQELISKVVAQSHLYNVNQGCTSSLPPPLFLSK------GKRSLQAGKPWKLNVDVACSYNLQHEGMGWIVRDEKGQFEIGG
        W++W  RN   +   + +  + +   +A + L    Q    ++ PPL  S+       K S  A   +KLNVD A   N +  G+G+I+RD++G F    
Subjt:  WSLWTQRNFTRFNGVNLSTQELISKVVAQSHLYNVNQGCTSSLPPPLFLSK------GKRSLQAGKPWKLNVDVACSYNLQHEGMGWIVRDEKGQFEIGG

Query:  CNFIDKPLLIKSLEALAIRDGLKEFLAWRQVQFPS-FVVKSDSLEVISLHNRNDVDLSETGFIIEDILSIVKAIGSILFCKIPREENKITHCISVLASS-
           +   +     EAL    G++E L+W +  FP    ++ D++ V +  +R++ D S  G IIED LS+ + I +I F    R  NK  H ++  A + 
Subjt:  CNFIDKPLLIKSLEALAIRDGLKEFLAWRQVQFPS-FVVKSDSLEVISLHNRNDVDLSETGFIIEDILSIVKAIGSILFCKIPREENKITHCISVLASS-

Query:  -SDVLWS
         + V+WS
Subjt:  -SDVLWS

A0A6J1CP26 uncharacterized protein LOC1110134121.6e-1430.88Show/hide
Query:  WSLWTQRNFTRFNGVNLSTQEL---ISKVVAQSHLYNVN-QGCTSSLPPPLFLSKGKRSLQAGKP-----WKLNVDVACSYNLQHEGMGWIVRDEKGQFE
        W +W  RN + F GV+  T+++   I + +  S   N N +G +++    L       +    KP     WKLN + A   +    G+GWI+RDEKG+  
Subjt:  WSLWTQRNFTRFNGVNLSTQEL---ISKVVAQSHLYNVN-QGCTSSLPPPLFLSKGKRSLQAGKP-----WKLNVDVACSYNLQHEGMGWIVRDEKGQFE

Query:  IGGCNFIDKPLLIKSLEALAIRDGLKEFLAWRQVQFPSFVVKSDSLEVISLHNRNDVDLSETGFIIEDILSIVKAIGSILFCKIPREENKITHCISVLAS
           C  I     I  LE +AI +GL+   A RQ       ++SDSLE I L +R   D +E  +++E+I  ++K +  +    I RE NK+ H ++  A 
Subjt:  IGGCNFIDKPLLIKSLEALAIRDGLKEFLAWRQVQFPSFVVKSDSLEVISLHNRNDVDLSETGFIIEDILSIVKAIGSILFCKIPREENKITHCISVLAS

Query:  SSDV
         +D+
Subjt:  SSDV

A0A6J1DNV9 uncharacterized protein LOC1110224031.4e-1527.31Show/hide
Query:  WSLWTQRNFTRFNGVNLSTQELISKVV---------AQSHLYNVNQGCTSSL---PPPLFLSKGKRSLQAGKPWKLNVDVACSYNLQHEGMGWIVRDEKG
        W +W  RN+  F G + S   +I ++          +++ L  +++   + L   PPP+ +            W LN D + S +    G+GWI+R   G
Subjt:  WSLWTQRNFTRFNGVNLSTQELISKVV---------AQSHLYNVNQGCTSSL---PPPLFLSKGKRSLQAGKPWKLNVDVACSYNLQHEGMGWIVRDEKG

Query:  QFEIGGCNFIDKPLLIKSLEALAIRDGLKEFLAWRQVQFPSFVVKSDSLEVISLHNRNDVDLSETGFIIEDILSIVKAIGSILFCKIPREENKITHCISV
           + G  F++    +K LEA AI +GL+       ++     +++DS EV SL NR   DL++TG+++E+IL++  +   + F K+ RE N   H ++ 
Subjt:  QFEIGGCNFIDKPLLIKSLEALAIRDGLKEFLAWRQVQFPSFVVKSDSLEVISLHNRNDVDLSETGFIIEDILSIVKAIGSILFCKIPREENKITHCISV

Query:  LAS--SSDVLWSEKNFPECLIPLLQEK
         AS     ++W +  FP  L  L + +
Subjt:  LAS--SSDVLWSEKNFPECLIPLLQEK

A0A6J1DSV1 uncharacterized protein LOC1110236082.3e-1329.19Show/hide
Query:  WSLWTQRNFTRFNGVNLSTQEL---ISKVVAQSHLYNVN-QGCTSSLPPPLFLSKGKRSLQAGKP-----WKLNVDVACSYNLQHEGMGWIVRDEKGQFE
        W +W  RN + F GV+  T+++   I + +  S   + N +G +++    L    G  +    KP     WKLN D A   +    G+GWI+RDEKG+  
Subjt:  WSLWTQRNFTRFNGVNLSTQEL---ISKVVAQSHLYNVN-QGCTSSLPPPLFLSKGKRSLQAGKP-----WKLNVDVACSYNLQHEGMGWIVRDEKGQFE

Query:  IGGCNFIDKPLLIKSLEALAIRDGLKEF-----LAWRQVQFPSFVVKSDSLEVISLHNRNDVDLSETGFIIEDILSIVKAIGSILFCKIPREENKITHCI
           C  I     I  LE +AI +GL+          +Q       ++SDSLE I L +R   D +E  +++E+I  +++ +  +    I RE NK+ H +
Subjt:  IGGCNFIDKPLLIKSLEALAIRDGLKEF-----LAWRQVQFPSFVVKSDSLEVISLHNRNDVDLSETGFIIEDILSIVKAIGSILFCKIPREENKITHCI

Query:  SVLASSSDV
        +  A  +D+
Subjt:  SVLASSSDV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G29090.1 Ribonuclease H-like superfamily protein5.2e-1023.96Show/hide
Query:  MWSLWTQRNFTRFNGVNLSTQELISKVVAQSHLYNVNQGCTS-SLPPPLFLSKGKRSLQAGKPW-KLNVDVACSYNLQHEGMGWIVRDEKGQFEIGGCNF
        +W LW  RN   F G   + QE++ +       + +     S    P +  S   R       W K N D   + + +  G+GW++R+EKG+ +  G   
Subjt:  MWSLWTQRNFTRFNGVNLSTQELISKVVAQSHLYNVNQGCTS-SLPPPLFLSKGKRSLQAGKPW-KLNVDVACSYNLQHEGMGWIVRDEKGQFEIGGCNF

Query:  IDKPLLIKSLEALAIRDGLKEFLAWRQVQFPSFVVKSDSLEVISLHNRNDV--DLSETGFIIEDILSIVKAIGSILFCKIPREENKITHCIS
        + K   +   E  A+R  +   L+  + Q+   + +SDS  +I + N +++   L  T   I+D+  ++     + F  IPRE N +   ++
Subjt:  IDKPLLIKSLEALAIRDGLKEFLAWRQVQFPSFVVKSDSLEVISLHNRNDV--DLSETGFIIEDILSIVKAIGSILFCKIPREENKITHCIS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGGTCATTGTGGACCCAACGAAACTTCACCCGGTTTAATGGGGTGAATCTCTCAACCCAAGAGCTCATTTCAAAAGTGGTTGCGCAGAGCCACCTTTATAATGTCAA
TCAGGGTTGTACCAGCTCCCTGCCTCCCCCTTTGTTCCTGTCAAAAGGCAAGAGATCCTTGCAAGCTGGTAAGCCTTGGAAACTCAACGTCGATGTTGCTTGCTCTTATA
ACTTACAGCATGAAGGTATGGGTTGGATTGTCAGAGACGAGAAGGGGCAGTTCGAGATTGGAGGTTGCAATTTCATTGATAAGCCTCTGCTAATCAAGTCGCTTGAAGCA
TTAGCCATTCGGGACGGGCTGAAAGAGTTTTTAGCTTGGAGACAGGTTCAGTTCCCAAGTTTTGTGGTGAAATCAGACTCGCTTGAGGTTATCTCCCTTCACAATAGGAA
TGATGTGGATTTATCTGAAACTGGCTTTATCATTGAGGATATCCTATCTATTGTAAAGGCTATTGGTTCAATTCTTTTTTGTAAAATCCCTAGAGAGGAAAACAAAATTA
CCCATTGTATATCAGTTTTGGCTTCATCTTCAGATGTTCTTTGGTCTGAAAAGAACTTTCCAGAGTGCTTAATTCCTCTCCTGCAGGAGAAAGAAGGTTGTATGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTGGTCATTGTGGACCCAACGAAACTTCACCCGGTTTAATGGGGTGAATCTCTCAACCCAAGAGCTCATTTCAAAAGTGGTTGCGCAGAGCCACCTTTATAATGTCAA
TCAGGGTTGTACCAGCTCCCTGCCTCCCCCTTTGTTCCTGTCAAAAGGCAAGAGATCCTTGCAAGCTGGTAAGCCTTGGAAACTCAACGTCGATGTTGCTTGCTCTTATA
ACTTACAGCATGAAGGTATGGGTTGGATTGTCAGAGACGAGAAGGGGCAGTTCGAGATTGGAGGTTGCAATTTCATTGATAAGCCTCTGCTAATCAAGTCGCTTGAAGCA
TTAGCCATTCGGGACGGGCTGAAAGAGTTTTTAGCTTGGAGACAGGTTCAGTTCCCAAGTTTTGTGGTGAAATCAGACTCGCTTGAGGTTATCTCCCTTCACAATAGGAA
TGATGTGGATTTATCTGAAACTGGCTTTATCATTGAGGATATCCTATCTATTGTAAAGGCTATTGGTTCAATTCTTTTTTGTAAAATCCCTAGAGAGGAAAACAAAATTA
CCCATTGTATATCAGTTTTGGCTTCATCTTCAGATGTTCTTTGGTCTGAAAAGAACTTTCCAGAGTGCTTAATTCCTCTCCTGCAGGAGAAAGAAGGTTGTATGTAG
Protein sequenceShow/hide protein sequence
MWSLWTQRNFTRFNGVNLSTQELISKVVAQSHLYNVNQGCTSSLPPPLFLSKGKRSLQAGKPWKLNVDVACSYNLQHEGMGWIVRDEKGQFEIGGCNFIDKPLLIKSLEA
LAIRDGLKEFLAWRQVQFPSFVVKSDSLEVISLHNRNDVDLSETGFIIEDILSIVKAIGSILFCKIPREENKITHCISVLASSSDVLWSEKNFPECLIPLLQEKEGCM