; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0020040 (gene) of Snake gourd v1 genome

Gene IDTan0020040
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRibonuclease H-like superfamily protein
Genome locationLG02:34921660..34923379
RNA-Seq ExpressionTan0020040
SyntenyTan0020040
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022143317.1 uncharacterized protein LOC111013216 [Momordica charantia]4.5e-0836.11Show/hide
Query:  VSWTLESILTFTYQPR-----RMIIARMCGPPTMGWKVNTDASWNNALGKGGLGWVIRDCKGELIGVGLLNINRNWGIKSMELKAIVESLRSILTEFQRL
        +S T  S     Y PR      M+  R   PPT  WK+NTDASW+     GG+GW++ DC+GE++  G   I     I ++EL  I+  L     +F  +
Subjt:  VSWTLESILTFTYQPR-----RMIIARMCGPPTMGWKVNTDASWNNALGKGGLGWVIRDCKGELIGVGLLNINRNWGIKSMELKAIVESLRSILTEFQRL

Query:  TRRSPLNL
          RSP+ L
Subjt:  TRRSPLNL

XP_022143535.1 uncharacterized protein LOC111013412 [Momordica charantia]1.0e-0442.86Show/hide
Query:  PPTM-GWKVNTDASWNNALGKGGLGWVIRDCKGELIGVGLLNINRNWGIKSMELKAIVESLRSILTEFQR
        PPT   WK+NT+A+W      GG+GW++RD KGE+I      I     I  +E+ AI E LR+I  E  R
Subjt:  PPTM-GWKVNTDASWNNALGKGGLGWVIRDCKGELIGVGLLNINRNWGIKSMELKAIVESLRSILTEFQR

XP_022154990.1 uncharacterized protein LOC111022134 isoform X1 [Momordica charantia]1.6e-0541.89Show/hide
Query:  ARMCGPPTMGWKVNTDASWNNALGKGGLGWVIRDCKGELIGVGLLNINRNWGIKSMELKAIVESLRSILTEFQR
        AR   P +  WK+NTDA+W       G+GW++RD KGE+I  G   I     I  +E+ AI E LR+I  E  R
Subjt:  ARMCGPPTMGWKVNTDASWNNALGKGGLGWVIRDCKGELIGVGLLNINRNWGIKSMELKAIVESLRSILTEFQR

XP_022154991.1 uncharacterized protein LOC111022134 isoform X2 [Momordica charantia]1.6e-0541.89Show/hide
Query:  ARMCGPPTMGWKVNTDASWNNALGKGGLGWVIRDCKGELIGVGLLNINRNWGIKSMELKAIVESLRSILTEFQR
        AR   P +  WK+NTDA+W       G+GW++RD KGE+I  G   I     I  +E+ AI E LR+I  E  R
Subjt:  ARMCGPPTMGWKVNTDASWNNALGKGGLGWVIRDCKGELIGVGLLNINRNWGIKSMELKAIVESLRSILTEFQR

XP_022156777.1 uncharacterized protein LOC111023608 [Momordica charantia]1.6e-0541.89Show/hide
Query:  ARMCGPPTMGWKVNTDASWNNALGKGGLGWVIRDCKGELIGVGLLNINRNWGIKSMELKAIVESLRSILTEFQR
        AR   P +  WK+NTDA+W      GG+GW++RD KGE+I      I     I  +E+ AI E LR+I  E  R
Subjt:  ARMCGPPTMGWKVNTDASWNNALGKGGLGWVIRDCKGELIGVGLLNINRNWGIKSMELKAIVESLRSILTEFQR

TrEMBL top hitse value%identityAlignment
A0A6J1CP26 uncharacterized protein LOC1110134125.1e-0542.86Show/hide
Query:  PPTM-GWKVNTDASWNNALGKGGLGWVIRDCKGELIGVGLLNINRNWGIKSMELKAIVESLRSILTEFQR
        PPT   WK+NT+A+W      GG+GW++RD KGE+I      I     I  +E+ AI E LR+I  E  R
Subjt:  PPTM-GWKVNTDASWNNALGKGGLGWVIRDCKGELIGVGLLNINRNWGIKSMELKAIVESLRSILTEFQR

A0A6J1CQG0 uncharacterized protein LOC1110132162.2e-0836.11Show/hide
Query:  VSWTLESILTFTYQPR-----RMIIARMCGPPTMGWKVNTDASWNNALGKGGLGWVIRDCKGELIGVGLLNINRNWGIKSMELKAIVESLRSILTEFQRL
        +S T  S     Y PR      M+  R   PPT  WK+NTDASW+     GG+GW++ DC+GE++  G   I     I ++EL  I+  L     +F  +
Subjt:  VSWTLESILTFTYQPR-----RMIIARMCGPPTMGWKVNTDASWNNALGKGGLGWVIRDCKGELIGVGLLNINRNWGIKSMELKAIVESLRSILTEFQRL

Query:  TRRSPLNL
          RSP+ L
Subjt:  TRRSPLNL

A0A6J1DL64 uncharacterized protein LOC111022134 isoform X17.8e-0641.89Show/hide
Query:  ARMCGPPTMGWKVNTDASWNNALGKGGLGWVIRDCKGELIGVGLLNINRNWGIKSMELKAIVESLRSILTEFQR
        AR   P +  WK+NTDA+W       G+GW++RD KGE+I  G   I     I  +E+ AI E LR+I  E  R
Subjt:  ARMCGPPTMGWKVNTDASWNNALGKGGLGWVIRDCKGELIGVGLLNINRNWGIKSMELKAIVESLRSILTEFQR

A0A6J1DQC9 uncharacterized protein LOC111022134 isoform X27.8e-0641.89Show/hide
Query:  ARMCGPPTMGWKVNTDASWNNALGKGGLGWVIRDCKGELIGVGLLNINRNWGIKSMELKAIVESLRSILTEFQR
        AR   P +  WK+NTDA+W       G+GW++RD KGE+I  G   I     I  +E+ AI E LR+I  E  R
Subjt:  ARMCGPPTMGWKVNTDASWNNALGKGGLGWVIRDCKGELIGVGLLNINRNWGIKSMELKAIVESLRSILTEFQR

A0A6J1DSV1 uncharacterized protein LOC1110236087.8e-0641.89Show/hide
Query:  ARMCGPPTMGWKVNTDASWNNALGKGGLGWVIRDCKGELIGVGLLNINRNWGIKSMELKAIVESLRSILTEFQR
        AR   P +  WK+NTDA+W      GG+GW++RD KGE+I      I     I  +E+ AI E LR+I  E  R
Subjt:  ARMCGPPTMGWKVNTDASWNNALGKGGLGWVIRDCKGELIGVGLLNINRNWGIKSMELKAIVESLRSILTEFQR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein7.2e-0429.41Show/hide
Query:  QPRRMIIARMCGPPTMGWKVNTDASWNNALGKGGLGWVIRDCKGELIGVGLLNINRNWGIKSMELKAI
        Q  R +  +   PP    K NTDA+W     + G+GW++R+  G ++ +G   + R   +   EL+A+
Subjt:  QPRRMIIARMCGPPTMGWKVNTDASWNNALGKGGLGWVIRDCKGELIGVGLLNINRNWGIKSMELKAI

AT4G29090.1 Ribonuclease H-like superfamily protein3.8e-0535Show/hide
Query:  QPRRMIIARMCGPPTMGWKVNTDASWNNALGKGGLGWVIRDCKGELIGVGLLNINRNWGIKSMELKAIVESLRSILTEFQ
        Q  R    R   PP    K NTDA+WN    + G+GWV+R+ KGE+  +G   + +   +   EL+A+  ++ S L+ FQ
Subjt:  QPRRMIIARMCGPPTMGWKVNTDASWNNALGKGGLGWVIRDCKGELIGVGLLNINRNWGIKSMELKAIVESLRSILTEFQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAAGTGACATTTTGGGAGTGAAGCTCGTGGAGTCTCTTGGTCATTATCTAGGGAAACCGTTATTTGTTGCTCCGAATTTGAAAGGCAGGAAAGTTGCTATGCTTAT
TGATGACAGGAATAGGTGGCAGAAGTCGGGAGAAGGAAAAGTCTATAAATCACCTGTTTTGAGGGTGCAAGCACACAAAAAAGGAACTTACTCTACCATGGGAAAATTCA
AAACAACACAAAAGTCCTATCCAATCACATCATCAGAAAAGCTGAAAGTTTCGTGGACACTAGAATCAATTCTCACCTTTACTTACCAGCCTCGAAGGATGATAATCGCA
AGAATGTGTGGTCCCCCTACTATGGGTTGGAAAGTCAACACTGATGCATCATGGAATAATGCGTTGGGCAAGGGAGGTCTGGGTTGGGTCATTCGTGACTGCAAAGGAGA
GCTAATAGGGGTAGGTCTACTTAACATAAACAGGAATTGGGGGATAAAATCGATGGAATTGAAAGCTATTGTTGAGAGCTTGAGAAGCATACTTACTGAATTTCAACGCC
TTACCCGCCGATCACCATTGAATCTGATTCAGTGGAAGCTATTAAAGCTCTCAAAAACCCAAGCAACGATTGTTCGACGACGAAAAGCTTCAGTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCAAGTGACATTTTGGGAGTGAAGCTCGTGGAGTCTCTTGGTCATTATCTAGGGAAACCGTTATTTGTTGCTCCGAATTTGAAAGGCAGGAAAGTTGCTATGCTTAT
TGATGACAGGAATAGGTGGCAGAAGTCGGGAGAAGGAAAAGTCTATAAATCACCTGTTTTGAGGGTGCAAGCACACAAAAAAGGAACTTACTCTACCATGGGAAAATTCA
AAACAACACAAAAGTCCTATCCAATCACATCATCAGAAAAGCTGAAAGTTTCGTGGACACTAGAATCAATTCTCACCTTTACTTACCAGCCTCGAAGGATGATAATCGCA
AGAATGTGTGGTCCCCCTACTATGGGTTGGAAAGTCAACACTGATGCATCATGGAATAATGCGTTGGGCAAGGGAGGTCTGGGTTGGGTCATTCGTGACTGCAAAGGAGA
GCTAATAGGGGTAGGTCTACTTAACATAAACAGGAATTGGGGGATAAAATCGATGGAATTGAAAGCTATTGTTGAGAGCTTGAGAAGCATACTTACTGAATTTCAACGCC
TTACCCGCCGATCACCATTGAATCTGATTCAGTGGAAGCTATTAAAGCTCTCAAAAACCCAAGCAACGATTGTTCGACGACGAAAAGCTTCAGTTTAG
Protein sequenceShow/hide protein sequence
MSSDILGVKLVESLGHYLGKPLFVAPNLKGRKVAMLIDDRNRWQKSGEGKVYKSPVLRVQAHKKGTYSTMGKFKTTQKSYPITSSEKLKVSWTLESILTFTYQPRRMIIA
RMCGPPTMGWKVNTDASWNNALGKGGLGWVIRDCKGELIGVGLLNINRNWGIKSMELKAIVESLRSILTEFQRLTRRSPLNLIQWKLLKLSKTQATIVRRRKASV