CuGenDBv2

Tan0003660 (gene) of Snake gourd v1 genome

Gene ID: Tan0003660
Organism: Trichosanthes anguina (Snake gourd v1)
Description: RNase H domain-containing protein
Genome location: LG07:12654603..12655403
RNA-Seq Expression: Tan0003660
Synteny: Tan0003660
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
  IPR002156 - Ribonuclease H domain
  IPR012337 - Ribonuclease H-like superfamily
  IPR036397 - Ribonuclease H superfamily
  IPR044730 - Ribonuclease H-like domain, plant type
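
The genome location listed in this record has the form sequence:start..end. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common annotation convention, but not stated on this page), the span can be parsed and measured in Python. The function name `parse_location` is hypothetical and not part of any CuGenDB tool.

```python
import re


def parse_location(location):
    """Split a string such as 'LG07:12654603..12655403' into (sequence, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seq_id, start, end = match.groups()
    return seq_id, int(start), int(end)


seq_id, start, end = parse_location("LG07:12654603..12655403")
# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_length = end - start + 1
print(seq_id, span_length)
```

Under that assumption the gene spans 801 bp on LG07.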


Homology
GenBank top hits (e value, % identity, alignment)
XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis] | e value: 5.8e-11 | % identity: 40.28
Query:  WKPPKSNAWKLNTDATWSNKLNRGGLGWIVWDSSKNPLFGGCKFISFNWSVKLLEAFAVWEGLKEVLAWGKSQVSSLIVETDSLEVTLLLNGITTDLTEI
        WKPP  N  KLN DA  S K  + GLG IV D+    L  G K   F   V L EA A+  GL+      +   SSLIVE+D  EV  LLN      TEI
Subjt:  WKPPKSNAWKLNTDATWSNKLNRGGLGWIVWDSSKNPLFGGCKFISFNWSVKLLEAFAVWEGLKEVLAWGKSQVSSLIVETDSLEVTLLLNGITTDLTEI

Query:  SFVIDDILLIVKDIRSIIFCKVPREENKATHSLA--ALTSSSGD
         +++ D+    K+ + + F  +PR  N   H+LA  AL +SS D
Subjt:  SFVIDDILLIVKDIRSIIFCKVPREENKATHSLA--ALTSSSGD

XP_022143535.1 uncharacterized protein LOC111013412 [Momordica charantia] | e value: 2.3e-15 | % identity: 36.57
Query:  WKPPKSNAWKLNTDATWSNKLNRGGLGWIVWDSSKNPLFGGCKFISFNWSVKLLEAFAVWEGLKEVLAWGKSQVSSLIVETDSLEVTLLLNGITTDLTEI
        WKPP SN+WKLNT+A W    N GG+GWI+ D     +   C+ I    ++  LE  A+ EGL+ +    +     + +E+DSLE   LL+    D TEI
Subjt:  WKPPKSNAWKLNTDATWSNKLNRGGLGWIVWDSSKNPLFGGCKFISFNWSVKLLEAFAVWEGLKEVLAWGKSQVSSLIVETDSLEVTLLLNGITTDLTEI

Query:  SFVIDDILLIVKDIRSIIFCKVPREENKATHSLA
         +++++I  ++KD+  +    + RE NK  H LA
Subjt:  SFVIDDILLIVKDIRSIIFCKVPREENKATHSLA

XP_022154990.1 uncharacterized protein LOC111022134 isoform X1 [Momordica charantia] | e value: 3.3e-14 | % identity: 27.49
Query:  ETTTHLFWECKLTKHMWNLLFPSSASVFPSNRDFRKSWDYWECFKDGKVQEVRSNIML---ALWSV----------------------------------
        ETT H+ WECK+ K +W    P  A+ F  +R    + +YWE   D   +E R   M+    +W +                                  
Subjt:  ETTTHLFWECKLTKHMWNLLFPSSASVFPSNRDFRKSWDYWECFKDGKVQEVRSNIML---ALWSV----------------------------------

Query:  ------------------GGWKPPKSNAWKLNTDATWSNKLNRGGLGWIVWDSSKNPLFGGCKFISFNWSVKLLEAFAVWEGLKEVLAWGKSQVSSLIVE
                            WKPP SN+WKLNTDA W    N  G+GWI+ D     +  GC+ I    ++  LE  A+ EGL+ +    +     + +E
Subjt:  ------------------GGWKPPKSNAWKLNTDATWSNKLNRGGLGWIVWDSSKNPLFGGCKFISFNWSVKLLEAFAVWEGLKEVLAWGKSQVSSLIVE

Query:  TDSLEVTLLLN
        +DSLE   LL+
Subjt:  TDSLEVTLLLN

XP_022155262.1 uncharacterized protein LOC111022403 [Momordica charantia] | e value: 7.1e-17 | % identity: 40
Query:  WKPPKSNAWKLNTDATWSNKLNRGGLGWIVWDSSKNPLFGGCKFISFNWSVKLLEAFAVWEGLKEVLAWGKSQVSSLIVETDSLEVTLLLNGITTDLTEI
        W+PP  + W LN DA+WS+  +RGG+GWI+     + +  G +F+    +VKLLEA A+ EGL+ +   G   +  L +ETDS EV  LLN    DLT+ 
Subjt:  WKPPKSNAWKLNTDATWSNKLNRGGLGWIVWDSSKNPLFGGCKFISFNWSVKLLEAFAVWEGLKEVLAWGKSQVSSLIVETDSLEVTLLLNGITTDLTEI

Query:  SFVIDDILLIVKDIRSIIFCKVPREENKATHSLAALTSSSGDSRI
         +V+++IL +      + F KV RE N   HSLA   S   +S I
Subjt:  SFVIDDILLIVKDIRSIIFCKVPREENKATHSLAALTSSSGDSRI

XP_022156777.1 uncharacterized protein LOC111023608 [Momordica charantia] | e value: 1.5e-14 | % identity: 35.25
Query:  WKPPKSNAWKLNTDATWSNKLNRGGLGWIVWDSSKNPLFGGCKFISFNWSVKLLEAFAVWEGLKEV-----LAWGKSQVSSLIVETDSLEVTLLLNGITT
        WKPP SN+WKLNTDA W    N GG+GWI+ D     +   C+ I    ++  LE  A+ EGL+ +         +     + +E+DSLE   LL+    
Subjt:  WKPPKSNAWKLNTDATWSNKLNRGGLGWIVWDSSKNPLFGGCKFISFNWSVKLLEAFAVWEGLKEV-----LAWGKSQVSSLIVETDSLEVTLLLNGITT

Query:  DLTEISFVIDDILLIVKDIRSIIFCKVPREENKATHSLA
        D TEI +++++I  +++D++ +    + RE NK  H LA
Subjt:  DLTEISFVIDDILLIVKDIRSIIFCKVPREENKATHSLA

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CP26 uncharacterized protein LOC111013412 | e value: 1.1e-15 | % identity: 36.57
Query:  WKPPKSNAWKLNTDATWSNKLNRGGLGWIVWDSSKNPLFGGCKFISFNWSVKLLEAFAVWEGLKEVLAWGKSQVSSLIVETDSLEVTLLLNGITTDLTEI
        WKPP SN+WKLNT+A W    N GG+GWI+ D     +   C+ I    ++  LE  A+ EGL+ +    +     + +E+DSLE   LL+    D TEI
Subjt:  WKPPKSNAWKLNTDATWSNKLNRGGLGWIVWDSSKNPLFGGCKFISFNWSVKLLEAFAVWEGLKEVLAWGKSQVSSLIVETDSLEVTLLLNGITTDLTEI

Query:  SFVIDDILLIVKDIRSIIFCKVPREENKATHSLA
         +++++I  ++KD+  +    + RE NK  H LA
Subjt:  SFVIDDILLIVKDIRSIIFCKVPREENKATHSLA

A0A6J1DL64 uncharacterized protein LOC111022134 isoform X1 | e value: 1.6e-14 | % identity: 27.49
Query:  ETTTHLFWECKLTKHMWNLLFPSSASVFPSNRDFRKSWDYWECFKDGKVQEVRSNIML---ALWSV----------------------------------
        ETT H+ WECK+ K +W    P  A+ F  +R    + +YWE   D   +E R   M+    +W +                                  
Subjt:  ETTTHLFWECKLTKHMWNLLFPSSASVFPSNRDFRKSWDYWECFKDGKVQEVRSNIML---ALWSV----------------------------------

Query:  ------------------GGWKPPKSNAWKLNTDATWSNKLNRGGLGWIVWDSSKNPLFGGCKFISFNWSVKLLEAFAVWEGLKEVLAWGKSQVSSLIVE
                            WKPP SN+WKLNTDA W    N  G+GWI+ D     +  GC+ I    ++  LE  A+ EGL+ +    +     + +E
Subjt:  ------------------GGWKPPKSNAWKLNTDATWSNKLNRGGLGWIVWDSSKNPLFGGCKFISFNWSVKLLEAFAVWEGLKEVLAWGKSQVSSLIVE

Query:  TDSLEVTLLLN
        +DSLE   LL+
Subjt:  TDSLEVTLLLN

A0A6J1DNV9 uncharacterized protein LOC111022403 | e value: 3.4e-17 | % identity: 40
Query:  WKPPKSNAWKLNTDATWSNKLNRGGLGWIVWDSSKNPLFGGCKFISFNWSVKLLEAFAVWEGLKEVLAWGKSQVSSLIVETDSLEVTLLLNGITTDLTEI
        W+PP  + W LN DA+WS+  +RGG+GWI+     + +  G +F+    +VKLLEA A+ EGL+ +   G   +  L +ETDS EV  LLN    DLT+ 
Subjt:  WKPPKSNAWKLNTDATWSNKLNRGGLGWIVWDSSKNPLFGGCKFISFNWSVKLLEAFAVWEGLKEVLAWGKSQVSSLIVETDSLEVTLLLNGITTDLTEI

Query:  SFVIDDILLIVKDIRSIIFCKVPREENKATHSLAALTSSSGDSRI
         +V+++IL +      + F KV RE N   HSLA   S   +S I
Subjt:  SFVIDDILLIVKDIRSIIFCKVPREENKATHSLAALTSSSGDSRI

A0A6J1DSV1 uncharacterized protein LOC111023608 | e value: 7.1e-15 | % identity: 35.25
Query:  WKPPKSNAWKLNTDATWSNKLNRGGLGWIVWDSSKNPLFGGCKFISFNWSVKLLEAFAVWEGLKEV-----LAWGKSQVSSLIVETDSLEVTLLLNGITT
        WKPP SN+WKLNTDA W    N GG+GWI+ D     +   C+ I    ++  LE  A+ EGL+ +         +     + +E+DSLE   LL+    
Subjt:  WKPPKSNAWKLNTDATWSNKLNRGGLGWIVWDSSKNPLFGGCKFISFNWSVKLLEAFAVWEGLKEV-----LAWGKSQVSSLIVETDSLEVTLLLNGITT

Query:  DLTEISFVIDDILLIVKDIRSIIFCKVPREENKATHSLA
        D TEI +++++I  +++D++ +    + RE NK  H LA
Subjt:  DLTEISFVIDDILLIVKDIRSIIFCKVPREENKATHSLA

A0A803P6E4 Uncharacterized protein | e value: 6.9e-10 | % identity: 25.81
Query:  ETTTHLFWECKLTKHMWNLL--------------FPSSASVFPSNRDFRKS---WDYWECFKDGKVQEVRSNIMLALWSVGGWKPPKSNAWKLNTDATWS
        ET  H+  +C  ++  W                 F S AS F S+     +   W  W+   D K Q   S++   + +   W PP+ N  K+N DA++ 
Subjt:  ETTTHLFWECKLTKHMWNLL--------------FPSSASVFPSNRDFRKS---WDYWECFKDGKVQEVRSNIMLALWSVGGWKPPKSNAWKLNTDATWS

Query:  NKLNRGGLGWIVWDSSKNPLFGGCKFISFNWSVKLLEAFAVWEGLKEVLAWGK-SQVSSLIVETDSLEVTLLLNGITTDLTEISFVIDDILLIVKDIRSI
            R G+GW+  D     ++     ++    V +    A   GLKEVL W K +    ++VE+D   +   +    T  +    ++DD + ++ D+RS+
Subjt:  NKLNRGGLGWIVWDSSKNPLFGGCKFISFNWSVKLLEAFAVWEGLKEVLAWGK-SQVSSLIVETDSLEVTLLLNGITTDLTEISFVIDDILLIVKDIRSI

Query:  IFCKVPREENKATHSLA
            V + ENKA + LA
Subjt:  IFCKVPREENKATHSLA

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT4G29090.1 Ribonuclease H-like superfamily protein | e value: 6.0e-06 | % identity: 28.06
Query:  SVGGWKPPKSNAWKLNTDATWSNKLNRGGLGWIVWDSSKNPLFGGCKFISFNWSVKLLEAFAV-WEGLKEVLAWGKSQVSSLIVETDSLEVTLLLNGITT
        S G W+PP     K NTDATW+    R G+GW++ +      + G + +    SV   E  A+ W     VL+  + Q + +I E+DS  +  +LN    
Subjt:  SVGGWKPPKSNAWKLNTDATWSNKLNRGGLGWIVWDSSKNPLFGGCKFISFNWSVKLLEAFAV-WEGLKEVLAWGKSQVSSLIVETDSLEVTLLLNGITT

Query:  DLTEISFVIDDILLIVKDIRSIIFCKVPREENKATHSLA
            +   I D+  ++     + F  +PRE N     +A
Subjt:  DLTEISFVIDDILLIVKDIRSIIFCKVPREENKATHSLA


Sequences
CDS sequence
ATGGAGACAACCACTCATTTGTTTTGGGAGTGTAAGTTAACTAAACATATGTGGAATCTTCTCTTCCCATCTTCTGCTTCTGTCTTTCCTTCTAACAGGGATTTTCGGAA
GTCTTGGGACTACTGGGAATGCTTCAAGGATGGCAAAGTTCAAGAAGTTCGTAGTAACATCATGTTGGCGTTGTGGAGTGTTGGTGGGTGGAAGCCTCCGAAGTCGAATG
CTTGGAAACTGAATACTGATGCAACATGGTCCAACAAGCTGAATCGAGGTGGGTTGGGTTGGATTGTTTGGGATTCCTCTAAGAACCCTCTCTTCGGTGGTTGTAAGTTC
ATCTCCTTTAATTGGTCTGTCAAGCTTCTTGAAGCTTTTGCAGTTTGGGAAGGCCTCAAGGAAGTCCTTGCTTGGGGAAAGAGCCAAGTTTCGTCTCTGATCGTAGAAAC
AGATTCATTGGAGGTTACATTGCTTTTGAATGGAATTACTACTGACTTAACTGAAATCTCTTTTGTTATTGATGATATTCTTTTGATTGTTAAGGATATTAGGTCCATTA
TTTTTTGTAAAGTCCCTAGAGAGGAGAACAAAGCAACTCACTCCTTAGCAGCTTTGACATCCTCTTCTGGAGATTCTAGGATCAGGAATGATGTCCTTGAGAAGTGA
mRNA sequence
ATGGAGACAACCACTCATTTGTTTTGGGAGTGTAAGTTAACTAAACATATGTGGAATCTTCTCTTCCCATCTTCTGCTTCTGTCTTTCCTTCTAACAGGGATTTTCGGAA
GTCTTGGGACTACTGGGAATGCTTCAAGGATGGCAAAGTTCAAGAAGTTCGTAGTAACATCATGTTGGCGTTGTGGAGTGTTGGTGGGTGGAAGCCTCCGAAGTCGAATG
CTTGGAAACTGAATACTGATGCAACATGGTCCAACAAGCTGAATCGAGGTGGGTTGGGTTGGATTGTTTGGGATTCCTCTAAGAACCCTCTCTTCGGTGGTTGTAAGTTC
ATCTCCTTTAATTGGTCTGTCAAGCTTCTTGAAGCTTTTGCAGTTTGGGAAGGCCTCAAGGAAGTCCTTGCTTGGGGAAAGAGCCAAGTTTCGTCTCTGATCGTAGAAAC
AGATTCATTGGAGGTTACATTGCTTTTGAATGGAATTACTACTGACTTAACTGAAATCTCTTTTGTTATTGATGATATTCTTTTGATTGTTAAGGATATTAGGTCCATTA
TTTTTTGTAAAGTCCCTAGAGAGGAGAACAAAGCAACTCACTCCTTAGCAGCTTTGACATCCTCTTCTGGAGATTCTAGGATCAGGAATGATGTCCTTGAGAAGTGA
Protein sequence
METTTHLFWECKLTKHMWNLLFPSSASVFPSNRDFRKSWDYWECFKDGKVQEVRSNIMLALWSVGGWKPPKSNAWKLNTDATWSNKLNRGGLGWIVWDSSKNPLFGGCKF
ISFNWSVKLLEAFAVWEGLKEVLAWGKSQVSSLIVETDSLEVTLLLNGITTDLTEISFVIDDILLIVKDIRSIIFCKVPREENKATHSLAALTSSSGDSRIRNDVLEK
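
The protein sequence in this record should correspond to a translation of the CDS sequence. The following is a minimal sketch of how that could be checked in Python, assuming the standard nuclear genetic code (NCBI translation table 1) applies. The helper name `translate` is hypothetical, and the example uses only the first and last few codons of the CDS shown in this record rather than the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI table 1), listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First six codons of the CDS in this record.
print(translate("ATGGAGACAACCACTCAT"))
# Last three codons of the CDS, ending in the TGA stop codon.
print(translate("GAGAAGTGA"))
```

The two calls give METTTH and EK, matching the start and end of the protein sequence in this record. Passing the full CDS, with line breaks removed, would allow the whole protein to be compared in the same way.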