; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0011594 (gene) of Snake gourd v1 genome

Gene IDTan0011594
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRNase H domain-containing protein
Genome locationLG01:97849166..97854558
RNA-Seq ExpressionTan0011594
SyntenyTan0011594
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAY69209.1 hypothetical protein CUMW_270170 [Citrus unshiu]2.1e-1228.57Show/hide
Query:  FPNYIQKKILCKIARKPNLHQEAKVENLLTEEGNWNKEMIESLFDEEDSEVVLSIPRTRSKVPDRLIWHYEKDDIYTVKS-------------------D
        + N+I +    + A    L  +  V  LL E G WN+E+I + F   D+  +L I      + D L+WH+ +  +Y+VKS                   D
Subjt:  FPNYIQKKILCKIARKPNLHQEAKVENLLTEEGNWNKEMIESLFDEEDSEVVLSIPRTRSKVPDRLIWHYEKDDIYTVKS-------------------D

Query:  AALNKENQKSEIGVVIRNEKGEVMLTLAKSIDGIMEIDVIEALAIRNGLYMAKEMGFRQVEVESDSAKVIQLLQQNRQNLSDLENSLAHGIAKEAL
        AA++ E   + +G +IRN +GEVM T       + ++++ EA  I  G+ +A ++    + VESDS  V    +          N +AHG+AK A+
Subjt:  AALNKENQKSEIGVVIRNEKGEVMLTLAKSIDGIMEIDVIEALAIRNGLYMAKEMGFRQVEVESDSAKVIQLLQQNRQNLSDLENSLAHGIAKEAL

OIT02058.1 hypothetical protein A4A49_07674 [Nicotiana attenuata]1.6e-0927.13Show/hide
Query:  KVENLLTEEGN-WNKEMIESLFDEEDSEVVLSIPRTRSKVPDRLIWHYEKDDIYTVKSDAALNKENQKSEIGVVIRNEKGEVMLTLAKSIDGIMEIDVIE
        KV +L+ E+ + W    I + F+ ED +V+LSI  +   + DRL+WH+ K   Y VKSD+ L   ++   IGV   N  G+++      I  + +    E
Subjt:  KVENLLTEEGN-WNKEMIESLFDEEDSEVVLSIPRTRSKVPDRLIWHYEKDDIYTVKSDAALNKENQKSEIGVVIRNEKGEVMLTLAKSIDGIMEIDVIE

Query:  ALAIRNGLYMAKEMGFRQVEVESDSAKVIQLLQQNRQNLSDLE-------------------------NSLAHGIAKEALLLPEEMVW
        A+ IR  L  A  +G++ V + SD+  V+ ++Q+      ++E                         N +AH +AK ++ L +++ W
Subjt:  ALAIRNGLYMAKEMGFRQVEVESDSAKVIQLLQQNRQNLSDLE-------------------------NSLAHGIAKEALLLPEEMVW

XP_023924674.1 uncharacterized protein LOC112036081 [Quercus suber]6.6e-1129.79Show/hide
Query:  PNYIQKKILCKIARKPNLHQEAKVENLLTEEGN-WNKEMIESLFDEEDSEVVLSIPRTRSKVPDRLIWHYEKDDIYTVKSDAALNKENQKSEIGVVIRNE
        PN+   KIL     +     E +V +L+  E   WN+E+I S F +ED++ +L IP +R  V D ++W +   ++Y+VKS   +    Q S +G +I NE
Subjt:  PNYIQKKILCKIARKPNLHQEAKVENLLTEEGN-WNKEMIESLFDEEDSEVVLSIPRTRSKVPDRLIWHYEKDDIYTVKSDAALNKENQKSEIGVVIRNE

Query:  KGEVMLTLAKSIDGIMEIDVIEALAIRNGLYMAKEMGFRQVEVESDSAKVIQLLQQNRQNLSDLENSLAHGIAKEALLLPEEMVWIEE
         GEVM  ++     + +   +E LA R  L  A ++GF ++ VE      ++L++++        N +AH +A+ A  + E++ W+++
Subjt:  KGEVMLTLAKSIDGIMEIDVIEALAIRNGLYMAKEMGFRQVEVESDSAKVIQLLQQNRQNLSDLENSLAHGIAKEALLLPEEMVWIEE

XP_042962354.1 uncharacterized protein LOC122296618 [Carya illinoinensis]3.3e-1023.63Show/hide
Query:  NLHQEAKVENLL-TEEGNWNKEMIESLFDEEDSEVVLSIPRTRSKVPDRLIWHYEKDDIYTVKS------------------------------------
        +L  +++ + L+  ++  W++E I S+F ++++E +LSIP ++ K  D++IW   K  ++T+ S                                    
Subjt:  NLHQEAKVENLL-TEEGNWNKEMIESLFDEEDSEVVLSIPRTRSKVPDRLIWHYEKDDIYTVKS------------------------------------

Query:  ---DAALNKENQKSEIGVVIRNEKGEVMLTLAKSIDGIMEIDVIEALAIRNGLYMAKEMGFRQVEVESDSAKVIQLLQQNRQNLS-------DLE-----
           +A+L+   ++ E+G++IR+E+GE +  +      +    V E  A+   L +  ++  ++   E D+  VI  +   R++LS       D++     
Subjt:  ---DAALNKENQKSEIGVVIRNEKGEVMLTLAKSIDGIMEIDVIEALAIRNGLYMAKEMGFRQVEVESDSAKVIQLLQQNRQNLS-------DLE-----

Query:  -------------NSLAHGIAKEALLLPEEMVWIEEV
                     N++AH +AKEAL L E+ VWIEEV
Subjt:  -------------NSLAHGIAKEALLLPEEMVWIEEV

XP_042973059.1 uncharacterized protein LOC122304861 [Carya illinoinensis]1.5e-1025.36Show/hide
Query:  LHQEAKVENLLT-EEGNWNKEMIESLFDEEDSEVVLSIPRTRSKVPDRLIWHYEKDDIYTVKS-------------------------------------
        L+ EA+VE LL+ E G WN+ +I+ +FD+E++ +V ++P + SK+PD+ IW + K+ IY VKS                                     
Subjt:  LHQEAKVENLLT-EEGNWNKEMIESLFDEEDSEVVLSIPRTRSKVPDRLIWHYEKDDIYTVKS-------------------------------------

Query:  -----------------------------------------------DAALNKENQKSEIGVVIRNEKGEVMLTLAKSIDGIMEIDVIEALAIRNGLYMA
                                                       DAAL  +++K  +GVVIR+  G+V+ +L      +    V E  A+   + + 
Subjt:  -----------------------------------------------DAALNKENQKSEIGVVIRNEKGEVMLTLAKSIDGIMEIDVIEALAIRNGLYMA

Query:  KEMGFRQVEVESDSAKVI--------------QLLQ------QNRQNLSDLE-----NSLAHGIAKEALLLPEEMVWIEE
        +E+ F +V++E D+  VI              QL++      +NR+  S L      NS+AH +AK AL + EE VW+E+
Subjt:  KEMGFRQVEVESDSAKVI--------------QLLQ------QNRQNLSDLE-----NSLAHGIAKEALLLPEEMVWIEE

TrEMBL top hitse value%identityAlignment
A0A1J6IBV8 RNase H domain-containing protein7.9e-1027.13Show/hide
Query:  KVENLLTEEGN-WNKEMIESLFDEEDSEVVLSIPRTRSKVPDRLIWHYEKDDIYTVKSDAALNKENQKSEIGVVIRNEKGEVMLTLAKSIDGIMEIDVIE
        KV +L+ E+ + W    I + F+ ED +V+LSI  +   + DRL+WH+ K   Y VKSD+ L   ++   IGV   N  G+++      I  + +    E
Subjt:  KVENLLTEEGN-WNKEMIESLFDEEDSEVVLSIPRTRSKVPDRLIWHYEKDDIYTVKSDAALNKENQKSEIGVVIRNEKGEVMLTLAKSIDGIMEIDVIE

Query:  ALAIRNGLYMAKEMGFRQVEVESDSAKVIQLLQQNRQNLSDLE-------------------------NSLAHGIAKEALLLPEEMVW
        A+ IR  L  A  +G++ V + SD+  V+ ++Q+      ++E                         N +AH +AK ++ L +++ W
Subjt:  ALAIRNGLYMAKEMGFRQVEVESDSAKVIQLLQQNRQNLSDLE-------------------------NSLAHGIAKEALLLPEEMVW

A0A2H5QX83 RNase H domain-containing protein1.0e-1228.57Show/hide
Query:  FPNYIQKKILCKIARKPNLHQEAKVENLLTEEGNWNKEMIESLFDEEDSEVVLSIPRTRSKVPDRLIWHYEKDDIYTVKS-------------------D
        + N+I +    + A    L  +  V  LL E G WN+E+I + F   D+  +L I      + D L+WH+ +  +Y+VKS                   D
Subjt:  FPNYIQKKILCKIARKPNLHQEAKVENLLTEEGNWNKEMIESLFDEEDSEVVLSIPRTRSKVPDRLIWHYEKDDIYTVKS-------------------D

Query:  AALNKENQKSEIGVVIRNEKGEVMLTLAKSIDGIMEIDVIEALAIRNGLYMAKEMGFRQVEVESDSAKVIQLLQQNRQNLSDLENSLAHGIAKEAL
        AA++ E   + +G +IRN +GEVM T       + ++++ EA  I  G+ +A ++    + VESDS  V    +          N +AHG+AK A+
Subjt:  AALNKENQKSEIGVVIRNEKGEVMLTLAKSIDGIMEIDVIEALAIRNGLYMAKEMGFRQVEVESDSAKVIQLLQQNRQNLSDLENSLAHGIAKEAL

A0A6J1CIF1 uncharacterized protein LOC1110112372.3e-0927.7Show/hide
Query:  LIWHYEKDDIYTVKSDAALNKENQKSEIGVVIRNEKGEVMLTLAKSIDGIMEIDVIEALAIRNGLYMAKEMGFRQVEVESDSAKVIQLLQQNRQNLSDL-
        ++W      IY + +DA+    +Q + +G++IRN++G+VM +  K ++ I  +D+ EA+    GL +A ++G   V +E+DS+++  L  Q  ++LS+  
Subjt:  LIWHYEKDDIYTVKSDAALNKENQKSEIGVVIRNEKGEVMLTLAKSIDGIMEIDVIEALAIRNGLYMAKEMGFRQVEVESDSAKVIQLLQQNRQNLSDL-

Query:  ------------------------ENSLAHGIAKEALLLPEEMVWIEE
                                 N  AH +A+ ALLL E  +W+E+
Subjt:  ------------------------ENSLAHGIAKEALLLPEEMVWIEE

A0A803NVH0 Uncharacterized protein5.5e-1130.54Show/hide
Query:  VENLLTEEGNWNKEMIESLFDEEDSEVVLSIPRTRSKVPDRLIWHYEKDDIYTVKS----------------------DAALNKENQKSEIGVVIRNEKG
        V +LLT +  W+  ++ SLF   D E +L+IP +     D LIWH+E D  YTVKS                      DAA++K  Q +  G ++RN  G
Subjt:  VENLLTEEGNWNKEMIESLFDEEDSEVVLSIPRTRSKVPDRLIWHYEKDDIYTVKS----------------------DAALNKENQKSEIGVVIRNEKG

Query:  EVMLTLAKSIDGIMEIDVIEALAIRNGLYMAKEMGFRQVEVESDSAKVIQLLQQNRQNLSDLENSLA
        E +  ++    G  + + +EALA+ + L   +++      +ESDS  V+  LQ    ++S+    LA
Subjt:  EVMLTLAKSIDGIMEIDVIEALAIRNGLYMAKEMGFRQVEVESDSAKVIQLLQQNRQNLSDLENSLA

A0A803PY52 Uncharacterized protein7.9e-1027.92Show/hide
Query:  VENLLTEEGNWNKEMIESLFDEEDSEVVLSIPRTRSKVPDRLIWHYEKDDIYTV------------KSDAALNKENQKSEIGVVIRNEKGEVMLTLAKSI
        V + +TE+  WN  ++ + F   D + +++IP +     DRLIWH+     YTV              DA +N+E++   +G +IR+  G V+   +K +
Subjt:  VENLLTEEGNWNKEMIESLFDEEDSEVVLSIPRTRSKVPDRLIWHYEKDDIYTV------------KSDAALNKENQKSEIGVVIRNEKGEVMLTLAKSI

Query:  DGIMEIDVIEALAIRNGLYMAKEMGFRQVEVESDSAKVIQLLQQNRQNLSDLEN
         G    D +EA A+ + L  AK++  +   VE+D+ +V   +    +NLS  ++
Subjt:  DGIMEIDVIEALAIRNGLYMAKEMGFRQVEVESDSAKVIQLLQQNRQNLSDLEN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G29090.1 Ribonuclease H-like superfamily protein1.7e-0427.52Show/hide
Query:  SDAALNKENQKSEIGVVIRNEKGEVMLTLAKSIDGIMEIDVIEALAIRNGLYMAKEMGFRQVEVESDSAKVIQLLQQN------RQNLSDLENSLAHGIA
        +DA  N++N++  IG V+RNEKGEV    A+++  +  +   E  A+R  +       +  V  ESDS  +I++L  +      +  + DL+  L+    
Subjt:  SDAALNKENQKSEIGVVIRNEKGEVMLTLAKSIDGIMEIDVIEALAIRNGLYMAKEMGFRQVEVESDSAKVIQLLQQN------RQNLSDLENSLAHGIA

Query:  KEALLLPEE
         + + +P E
Subjt:  KEALLLPEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGATGAGTCTAAAGAAGATCATGATGTTCCTGGCTATGGTCCAAACCTTAGAGTAGCAAAGTTTAGAAGACAAGGACAAAATAAATTTGTACATTCAAAGGCTGA
TACCATCTTGACGAAAGAAAAATCGAAGGGTAGAAAAGAAAGCCTAGTTCCTCCTAATCCTACTCACAATAACGTTAAAAACCCTACAAGAGATGGAAGGTCCATCCAGA
TGAACTATGGTGATGGACAACCTGATGCTGATATGAACTTAGATGAAAATGTATCTTCAAAGGTTAGTAAGAATGTGATGAAGAATACTGATCTGGGAGGGGAATTTACG
ATTGATGGGGACCCCCCTAATTTCCCAAACTACATCCAAAAGAAAATATTATGCAAAATAGCAAGGAAACCAAATTTACATCAAGAGGCTAAAGTTGAGAACTTATTAAC
TGAAGAAGGTAATTGGAACAAAGAAATGATAGAGAGCCTGTTTGATGAGGAAGATAGTGAAGTTGTTCTAAGTATCCCGAGAACTAGATCAAAGGTCCCTGACAGATTGA
TATGGCACTATGAGAAAGACGACATTTATACGGTAAAGTCAGATGCTGCATTAAACAAAGAAAATCAGAAATCTGAGATAGGCGTGGTAATCAGAAATGAAAAAGGAGAG
GTGATGCTTACTCTAGCTAAATCGATCGATGGAATCATGGAAATTGATGTCATCGAAGCTCTGGCAATCCGAAATGGGTTGTATATGGCAAAAGAAATGGGATTCCGACA
GGTCGAAGTTGAGTCAGATTCGGCCAAAGTCATCCAACTCCTACAACAAAATCGCCAAAACTTATCAGATCTAGAAAACTCACTAGCCCATGGAATAGCAAAAGAGGCGC
TGCTTCTGCCGGAAGAAATGGTGTGGATCGAGGAAGTACTGGAGGCCACCATGGAAGTTTACCTTGAAGAACGAAAGGGGGAAACTCATTCTGCGAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGATGAGTCTAAAGAAGATCATGATGTTCCTGGCTATGGTCCAAACCTTAGAGTAGCAAAGTTTAGAAGACAAGGACAAAATAAATTTGTACATTCAAAGGCTGA
TACCATCTTGACGAAAGAAAAATCGAAGGGTAGAAAAGAAAGCCTAGTTCCTCCTAATCCTACTCACAATAACGTTAAAAACCCTACAAGAGATGGAAGGTCCATCCAGA
TGAACTATGGTGATGGACAACCTGATGCTGATATGAACTTAGATGAAAATGTATCTTCAAAGGTTAGTAAGAATGTGATGAAGAATACTGATCTGGGAGGGGAATTTACG
ATTGATGGGGACCCCCCTAATTTCCCAAACTACATCCAAAAGAAAATATTATGCAAAATAGCAAGGAAACCAAATTTACATCAAGAGGCTAAAGTTGAGAACTTATTAAC
TGAAGAAGGTAATTGGAACAAAGAAATGATAGAGAGCCTGTTTGATGAGGAAGATAGTGAAGTTGTTCTAAGTATCCCGAGAACTAGATCAAAGGTCCCTGACAGATTGA
TATGGCACTATGAGAAAGACGACATTTATACGGTAAAGTCAGATGCTGCATTAAACAAAGAAAATCAGAAATCTGAGATAGGCGTGGTAATCAGAAATGAAAAAGGAGAG
GTGATGCTTACTCTAGCTAAATCGATCGATGGAATCATGGAAATTGATGTCATCGAAGCTCTGGCAATCCGAAATGGGTTGTATATGGCAAAAGAAATGGGATTCCGACA
GGTCGAAGTTGAGTCAGATTCGGCCAAAGTCATCCAACTCCTACAACAAAATCGCCAAAACTTATCAGATCTAGAAAACTCACTAGCCCATGGAATAGCAAAAGAGGCGC
TGCTTCTGCCGGAAGAAATGGTGTGGATCGAGGAAGTACTGGAGGCCACCATGGAAGTTTACCTTGAAGAACGAAAGGGGGAAACTCATTCTGCGAAATGA
Protein sequenceShow/hide protein sequence
MEDESKEDHDVPGYGPNLRVAKFRRQGQNKFVHSKADTILTKEKSKGRKESLVPPNPTHNNVKNPTRDGRSIQMNYGDGQPDADMNLDENVSSKVSKNVMKNTDLGGEFT
IDGDPPNFPNYIQKKILCKIARKPNLHQEAKVENLLTEEGNWNKEMIESLFDEEDSEVVLSIPRTRSKVPDRLIWHYEKDDIYTVKSDAALNKENQKSEIGVVIRNEKGE
VMLTLAKSIDGIMEIDVIEALAIRNGLYMAKEMGFRQVEVESDSAKVIQLLQQNRQNLSDLENSLAHGIAKEALLLPEEMVWIEEVLEATMEVYLEERKGETHSAK