CuGenDBv2

Tan0011800 (gene) of Snake gourd v1 genome

Gene ID: Tan0011800
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Ribonuclease H-like domain containing protein
Genome location: LG01:34740168..34740768
RNA-Seq Expression: Tan0011800
Synteny: Tan0011800
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
  IPR002156 - Ribonuclease H domain
  IPR044730 - Ribonuclease H-like domain, plant type


Homology
GenBank top hits (e value, % identity, alignment)
TXG66206.1 hypothetical protein EZV62_007481 [Acer yangbiense] (e value: 5.1e-13; % identity: 29.87)
Query:  ESAEVEILCVSCWAIWSDRNEISQGKITPSLIRRVEWIQEYIQEIGKTSETGRSKVITKTLAVQKEKWKKPPEGILKLNVDVVCHPSLPITGLGAIIRDS
        +S   E+LCV CW +W  RN+   G +   + +  EW   ++ +     E  RSK+  +T +     W+    G  K+N D   H S  ++G+G +I+D+
Subjt:  ESAEVEILCVSCWAIWSDRNEISQGKITPSLIRRVEWIQEYIQEIGKTSETGRSKVITKTLAVQKEKWKKPPEGILKLNVDVVCHPSLPITGLGAIIRDS

Query:  NGHMLVARLKFCEGRLDPLSAEALVMLSGMKMAAQNGFTNLWISSNAQVLVNVI
        + H+  +  +       P  AEAL +L G+ +A  NGF    + S+A  +VN I
Subjt:  NGHMLVARLKFCEGRLDPLSAEALVMLSGMKMAAQNGFTNLWISSNAQVLVNVI

TXG69190.1 hypothetical protein EZV62_004125 [Acer yangbiense] (e value: 1.5e-12; % identity: 31.37)
Query:  EILCVSCWAIWSDRNEISQGKITPSL--IRRVEWIQEYIQEI--GKTSETGRSKVITKTLAVQKEKWKKPPEGILKLNVDVVCHPSLPITGLGAIIRDSN
        E LCV  W +W  RN++   K + ++     ++W   +IQ+    KT +TG   V+ + +A    KWK  P G  K+N D        +TG+G +IRD  
Subjt:  EILCVSCWAIWSDRNEISQGKITPSL--IRRVEWIQEYIQEI--GKTSETGRSKVITKTLAVQKEKWKKPPEGILKLNVDVVCHPSLPITGLGAIIRDSN

Query:  GHMLVARLKFCEGRLDPLSAEALVMLSGMKMAAQNGFTNLWISSNAQVLVNVI
        GH++ +  +   G L P + EA+ +L G ++A + G     I S++  +VN+I
Subjt:  GHMLVARLKFCEGRLDPLSAEALVMLSGMKMAAQNGFTNLWISSNAQVLVNVI

XP_022131661.1 uncharacterized protein LOC111004786 [Momordica charantia] (e value: 7.8e-14; % identity: 31.54)
Query:  EILCVSCWAIWSDRNEISQGKITPSLIRRVEWIQEYIQEIGKTSETGRSKVITKTLAVQKEKWKKPPEGILKLNVDVVCHPSLPITGLGAIIRDSNGHML
        ++   + WAIW DRN  + G    +   R  WI  Y Q   +  E  R      ++     +W  P +  +K+N D  C  +   TGLG IIRD  G +L
Subjt:  EILCVSCWAIWSDRNEISQGKITPSLIRRVEWIQEYIQEIGKTSETGRSKVITKTLAVQKEKWKKPPEGILKLNVDVVCHPSLPITGLGAIIRDSNGHML

Query:  VARLKFCEGRLDPLSAEALVMLSGMKMAAQNGFTNLWISSNAQVLVNVI
        VA+  F    L+PL AE   +L  +K+AA   +T L + S+ Q  + ++
Subjt:  VARLKFCEGRLDPLSAEALVMLSGMKMAAQNGFTNLWISSNAQVLVNVI

XP_023897447.1 uncharacterized protein LOC112009345 [Quercus suber] (e value: 1.7e-13; % identity: 27.81)
Query:  KDFESAEVEILCVSCWAIWSDRNEISQGKITPSLIRRVEWIQEYIQEIGKTSETGRSKVITKTLAVQKE---KWKKPPEGILKLNVDVVCHPSLPITGLG
        K F+  ++ ++    WA W +RNEI  G    S    V+W+  Y+ E    +E+          AV++E    W  PP  ILK+NVD     +L   G+G
Subjt:  KDFESAEVEILCVSCWAIWSDRNEISQGKITPSLIRRVEWIQEYIQEIGKTSETGRSKVITKTLAVQKE---KWKKPPEGILKLNVDVVCHPSLPITGLG

Query:  AIIRDSNGHMLVARLKFCEGRLDPLSAEALVMLSGMKMAAQNGFTNLWISSNAQVLVNVIYKNGFRSAT
        A++RD  G ++ A  +     L PL  E     +G+++A   G+ N+ +  ++ ++V  +      S+T
Subjt:  AIIRDSNGHMLVARLKFCEGRLDPLSAEALVMLSGMKMAAQNGFTNLWISSNAQVLVNVIYKNGFRSAT

XP_024043083.1 uncharacterized protein LOC112099827 [Citrus clementina] (e value: 1.0e-13; % identity: 38.97)
Query:  AEVEILCVSCWAIWSDRNE-ISQGKITPS--LIRRVEWIQEYIQEIGKTSETGRSKVITKTLAVQKEKWKKPPEGILKLNVDVVCHPSLPITGLGAIIRD
        AE E++ V CW IWS RN+ I +GK + S  L  + E + +  Q + K         +TK   V ++KWK PP+ +LKLNVD   +     TGLGAIIRD
Subjt:  AEVEILCVSCWAIWSDRNE-ISQGKITPS--LIRRVEWIQEYIQEIGKTSETGRSKVITKTLAVQKEKWKKPPEGILKLNVDVVCHPSLPITGLGAIIRD

Query:  SNGHMLVARLKFCEGRLDPLSAEALVMLSGMKMAAQ
        + G +L   +K  + R     AEA  +L G+++A Q
Subjt:  SNGHMLVARLKFCEGRLDPLSAEALVMLSGMKMAAQ

TrEMBL top hits (e value, % identity, alignment)
A0A2P5EX40 Ribonuclease H-like domain containing protein (e value: 1.0e-11; % identity: 30.68)
Query:  NMGVSMTKDFESAEVEILCVSCWAIWSDRNEISQG---KITPSLIRRV-EWIQEYIQEIGKTSETGRSKVITKTLAVQKEKWKKPPEGILKLNVDVVCHP
        N  +S+       + E+  V  W IW DRN I  G   ++  SL+     W++E+ + +G         + ++T+    +KWK P  G LKLNVD     
Subjt:  NMGVSMTKDFESAEVEILCVSCWAIWSDRNEISQG---KITPSLIRRV-EWIQEYIQEIGKTSETGRSKVITKTLAVQKEKWKKPPEGILKLNVDVVCHP

Query:  SLPITGLGAIIRDSNGHML-VARLKFCEGRLDPLSAEALVMLSGMKMAAQNGFTNLWISSNAQVLVNVIYKNGFRS
        S    G+G I+RD NG +L  + LKF  G L P  AE + +  G+K    +G     I ++AQ +V  + +  F +
Subjt:  SLPITGLGAIIRDSNGHML-VARLKFCEGRLDPLSAEALVMLSGMKMAAQNGFTNLWISSNAQVLVNVIYKNGFRS

A0A5C7ICQ3 RNase H domain-containing protein (e value: 2.5e-13; % identity: 29.87)
Query:  ESAEVEILCVSCWAIWSDRNEISQGKITPSLIRRVEWIQEYIQEIGKTSETGRSKVITKTLAVQKEKWKKPPEGILKLNVDVVCHPSLPITGLGAIIRDS
        +S   E+LCV CW +W  RN+   G +   + +  EW   ++ +     E  RSK+  +T +     W+    G  K+N D   H S  ++G+G +I+D+
Subjt:  ESAEVEILCVSCWAIWSDRNEISQGKITPSLIRRVEWIQEYIQEIGKTSETGRSKVITKTLAVQKEKWKKPPEGILKLNVDVVCHPSLPITGLGAIIRDS

Query:  NGHMLVARLKFCEGRLDPLSAEALVMLSGMKMAAQNGFTNLWISSNAQVLVNVI
        + H+  +  +       P  AEAL +L G+ +A  NGF    + S+A  +VN I
Subjt:  NGHMLVARLKFCEGRLDPLSAEALVMLSGMKMAAQNGFTNLWISSNAQVLVNVI

A0A5C7IIT4 Uncharacterized protein (e value: 7.2e-13; % identity: 31.37)
Query:  EILCVSCWAIWSDRNEISQGKITPSL--IRRVEWIQEYIQEI--GKTSETGRSKVITKTLAVQKEKWKKPPEGILKLNVDVVCHPSLPITGLGAIIRDSN
        E LCV  W +W  RN++   K + ++     ++W   +IQ+    KT +TG   V+ + +A    KWK  P G  K+N D        +TG+G +IRD  
Subjt:  EILCVSCWAIWSDRNEISQGKITPSL--IRRVEWIQEYIQEI--GKTSETGRSKVITKTLAVQKEKWKKPPEGILKLNVDVVCHPSLPITGLGAIIRDSN

Query:  GHMLVARLKFCEGRLDPLSAEALVMLSGMKMAAQNGFTNLWISSNAQVLVNVI
        GH++ +  +   G L P + EA+ +L G ++A + G     I S++  +VN+I
Subjt:  GHMLVARLKFCEGRLDPLSAEALVMLSGMKMAAQNGFTNLWISSNAQVLVNVI

A0A6J1BQ49 uncharacterized protein LOC111004786 (e value: 3.8e-14; % identity: 31.54)
Query:  EILCVSCWAIWSDRNEISQGKITPSLIRRVEWIQEYIQEIGKTSETGRSKVITKTLAVQKEKWKKPPEGILKLNVDVVCHPSLPITGLGAIIRDSNGHML
        ++   + WAIW DRN  + G    +   R  WI  Y Q   +  E  R      ++     +W  P +  +K+N D  C  +   TGLG IIRD  G +L
Subjt:  EILCVSCWAIWSDRNEISQGKITPSLIRRVEWIQEYIQEIGKTSETGRSKVITKTLAVQKEKWKKPPEGILKLNVDVVCHPSLPITGLGAIIRDSNGHML

Query:  VARLKFCEGRLDPLSAEALVMLSGMKMAAQNGFTNLWISSNAQVLVNVI
        VA+  F    L+PL AE   +L  +K+AA   +T L + S+ Q  + ++
Subjt:  VARLKFCEGRLDPLSAEALVMLSGMKMAAQNGFTNLWISSNAQVLVNVI

A0A803QGD0 Uncharacterized protein (e value: 4.6e-12; % identity: 29.03)
Query:  FESAEVEILCVSCWAIWSDRNEISQGKITPSLIRRVEWIQEYIQEIGKTSETGRSKVITKTLAVQKEKWKKPPEGILKLNVDVVCHPSLPITGLGAIIRD
        +   ++E +    W IWSDRN    GK+  + I+ +     Y+Q+    +   +    ++T      KW+ PPE   KLNVD     S    G+GAIIR+
Subjt:  FESAEVEILCVSCWAIWSDRNEISQGKITPSLIRRVEWIQEYIQEIGKTSETGRSKVITKTLAVQKEKWKKPPEGILKLNVDVVCHPSLPITGLGAIIRD

Query:  SNGHMLVARLKFCEGRLDPLSAEALVMLSGMKMAAQNGFTNLWISSNAQVLVNVI
        S G ++ A  K   G       EA  M  G+  A Q      ++ ++  +LVN +
Subjt:  SNGHMLVARLKFCEGRLDPLSAEALVMLSGMKMAAQNGFTNLWISSNAQVLVNVI

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT4G09775.1 BEST Arabidopsis thaliana protein match is: Ribonuclease H-like superfamily protein (TAIR:AT2G02650.1) (e value: 4.0e-08; % identity: 27.54)
Query:  WAIWSDRNEISQGKIT--PSLIRR------VEWIQEYIQEIGKTSETGRSKVITKTLAVQKEKWKKPPEGILKLNVDVVCHPSLPITGLGAIIRDSNGHM
        W +W  RNE    +I   P  + +       EW++  I +   +  T +        + + ++W  PPEG LK N D         T    IIRDSNGH+
Subjt:  WAIWSDRNEISQGKIT--PSLIRR------VEWIQEYIQEIGKTSETGRSKVITKTLAVQKEKWKKPPEGILKLNVDVVCHPSLPITGLGAIIRDSNGHM

Query:  LVARLKFCEGRLDPLSAEALVMLSGMKMAAQNGFTNLW
        + +     +     L AEAL  L  ++M    G+  +W
Subjt:  LVARLKFCEGRLDPLSAEALVMLSGMKMAAQNGFTNLW


Sequences
CDS sequence
ATGCAGACGTGCAGTGCAAATATGGGAGTCAGTATGACCAAAGACTTTGAAAGTGCTGAAGTAGAAATTTTGTGTGTAAGCTGTTGGGCAATCTGGTCTGATCGAAATGA
AATATCCCAAGGCAAAATTACACCAAGTTTAATCAGACGTGTTGAATGGATTCAGGAGTACATTCAAGAAATTGGTAAAACTAGTGAGACAGGAAGATCGAAAGTAATAA
CAAAAACCTTAGCTGTTCAGAAGGAGAAATGGAAGAAGCCACCTGAAGGAATTCTGAAGTTGAATGTTGATGTTGTCTGCCATCCATCTCTACCGATCACGGGATTAGGA
GCGATAATCAGGGATTCAAACGGACACATGCTGGTAGCTAGGTTGAAATTCTGTGAAGGACGTTTAGATCCTCTCTCAGCAGAGGCATTGGTAATGTTAAGTGGTATGAA
AATGGCTGCTCAGAATGGTTTTACGAATCTATGGATTTCGTCAAACGCGCAGGTGCTGGTTAATGTTATTTACAAAAATGGCTTTCGATCGGCTACTTAA
mRNA sequence
ATGCAGACGTGCAGTGCAAATATGGGAGTCAGTATGACCAAAGACTTTGAAAGTGCTGAAGTAGAAATTTTGTGTGTAAGCTGTTGGGCAATCTGGTCTGATCGAAATGA
AATATCCCAAGGCAAAATTACACCAAGTTTAATCAGACGTGTTGAATGGATTCAGGAGTACATTCAAGAAATTGGTAAAACTAGTGAGACAGGAAGATCGAAAGTAATAA
CAAAAACCTTAGCTGTTCAGAAGGAGAAATGGAAGAAGCCACCTGAAGGAATTCTGAAGTTGAATGTTGATGTTGTCTGCCATCCATCTCTACCGATCACGGGATTAGGA
GCGATAATCAGGGATTCAAACGGACACATGCTGGTAGCTAGGTTGAAATTCTGTGAAGGACGTTTAGATCCTCTCTCAGCAGAGGCATTGGTAATGTTAAGTGGTATGAA
AATGGCTGCTCAGAATGGTTTTACGAATCTATGGATTTCGTCAAACGCGCAGGTGCTGGTTAATGTTATTTACAAAAATGGCTTTCGATCGGCTACTTAA
Protein sequence
MQTCSANMGVSMTKDFESAEVEILCVSCWAIWSDRNEISQGKITPSLIRRVEWIQEYIQEIGKTSETGRSKVITKTLAVQKEKWKKPPEGILKLNVDVVCHPSLPITGLG
AIIRDSNGHMLVARLKFCEGRLDPLSAEALVMLSGMKMAAQNGFTNLWISSNAQVLVNVIYKNGFRSAT
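
The CDS and protein sequences listed above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1). The function name `translate` and the codon-table construction are illustrative and are not part of CuGenDBv2.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in TCAG order for each of the three positions.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE.get(cds[i:i + 3], "X")  # X = unknown codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 nucleotides of the Tan0011800 CDS shown above.
print(translate("ATGCAGACGTGCAGTGCAAATATGGGAGTC"))  # MQTCSANMGV
```

Passing the full CDS (with its line breaks joined) to `translate` allows the result to be compared with the listed protein sequence, which begins with MQTCSANMGV and ends with RSAT before the TAA stop codon.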