; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0000100 (gene) of Snake gourd v1 genome

Gene IDTan0000100
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionDUF2431 domain-containing protein
Genome locationLG01:87029542..87030138
RNA-Seq ExpressionTan0000100
SyntenyTan0000100
Gene Ontology termsGO:0070475 - rRNA base methylation (biological process)
GO:0005737 - cytoplasm (cellular component)
GO:0070042 - rRNA (uridine-N3-)-methyltransferase activity (molecular function)
InterPro domainsIPR019446 - Domain of unknown function DUF2431
IPR029063 - S-adenosyl-L-methionine-dependent methyltransferase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8646034.1 hypothetical protein Csa_015629 [Cucumis sativus]7.4e-4561.84Show/hide
Query:  MSQHPRLRRKSFDRIVFNFPHAGFHHNFRESTNEQIELHQNLVTEFLMNAKEMLSGNGEIHITHKTSHPYSDWEIEKLGEEEGLFLKEEADFYISDFPGY
        M QHP L +  FDRI+FNFPHAGF ++ +E    QI+LHQNLV  F+ NAK++L+ NGEIHITHKTSHPYS+W+IE++GEEEGL+LKEE +F   D+PGY
Subjt:  MSQHPRLRRKSFDRIVFNFPHAGFHHNFRESTNEQIELHQNLVTEFLMNAKEMLSGNGEIHITHKTSHPYSDWEIEKLGEEEGLFLKEEADFYISDFPGY

Query:  VNKRGSGFNSNRSFPVGSCSTFIFVKSLSKNKKKKNKKKKFAMSLAAEFAHL
        VNK+GSG NSN++FPVG+ STF FVK+LS  KK+ N++   + SLA+EF  L
Subjt:  VNKRGSGFNSNRSFPVGSCSTFIFVKSLSKNKKKKNKKKKFAMSLAAEFAHL

KAG7024996.1 hypothetical protein SDJN02_13816, partial [Cucurbita argyrosperma subsp. argyrosperma]1.0e-4161.84Show/hide
Query:  MSQHPRLRRKSFDRIVFNFPHAGFHHNFRESTNEQIELHQNLVTEFLMNAKEMLSGNGEIHITHKTSHPYSDWEIEKLGEEEGLFLKEEADFYISDFPGY
        MSQHP L    FDRIVFNFPHAGF ++  E    QI+LHQNLV  FL NAKE+L+ NG+IHITHK S PYS+WEIE++ EEE LFL+E  +F I D+PGY
Subjt:  MSQHPRLRRKSFDRIVFNFPHAGFHHNFRESTNEQIELHQNLVTEFLMNAKEMLSGNGEIHITHKTSHPYSDWEIEKLGEEEGLFLKEEADFYISDFPGY

Query:  VNKRGSGFNSNRSFPVGSCSTFIFVKSLSKNKKKKNKKKKFAMSLAAEFAHL
        VNK+GSG NSN +FPVG CSTF FVK+LSK  ++K   K      AAEF+ L
Subjt:  VNKRGSGFNSNRSFPVGSCSTFIFVKSLSKNKKKKNKKKKFAMSLAAEFAHL

XP_004139853.1 uncharacterized protein At4g26485 [Cucumis sativus]7.4e-4561.84Show/hide
Query:  MSQHPRLRRKSFDRIVFNFPHAGFHHNFRESTNEQIELHQNLVTEFLMNAKEMLSGNGEIHITHKTSHPYSDWEIEKLGEEEGLFLKEEADFYISDFPGY
        M QHP L +  FDRI+FNFPHAGF ++ +E    QI+LHQNLV  F+ NAK++L+ NGEIHITHKTSHPYS+W+IE++GEEEGL+LKEE +F   D+PGY
Subjt:  MSQHPRLRRKSFDRIVFNFPHAGFHHNFRESTNEQIELHQNLVTEFLMNAKEMLSGNGEIHITHKTSHPYSDWEIEKLGEEEGLFLKEEADFYISDFPGY

Query:  VNKRGSGFNSNRSFPVGSCSTFIFVKSLSKNKKKKNKKKKFAMSLAAEFAHL
        VNK+GSG NSN++FPVG+ STF FVK+LS  KK+ N++   + SLA+EF  L
Subjt:  VNKRGSGFNSNRSFPVGSCSTFIFVKSLSKNKKKKNKKKKFAMSLAAEFAHL

XP_022139694.1 uncharacterized protein At4g26485-like [Momordica charantia]2.4e-4363.16Show/hide
Query:  MSQHPRLRRKSFDRIVFNFPHAGFHHNFRESTNEQIELHQNLVTEFLMNAKEMLSGNGEIHITHKTSHPYSDWEIEKLGEEEGLFLKEEADFYISDFPGY
        MSQHP LR + FDRIVFNFPHAGF   + E   +QI+LHQNLV  F+ NA EM+S NGEIHITHKTSHP+S+WEI ++ EEEGLFLKEEA+F   D+PGY
Subjt:  MSQHPRLRRKSFDRIVFNFPHAGFHHNFRESTNEQIELHQNLVTEFLMNAKEMLSGNGEIHITHKTSHPYSDWEIEKLGEEEGLFLKEEADFYISDFPGY

Query:  VNKRGSGFNSNRSFPVGSCSTFIFVKSLSKNKKKKNKKKKFAMSLAAEFAHL
        +NK+G G   NR+FPVG C TF F K+L      K+K  K + SLAAEFAHL
Subjt:  VNKRGSGFNSNRSFPVGSCSTFIFVKSLSKNKKKKNKKKKFAMSLAAEFAHL

XP_022936898.1 uncharacterized protein At4g26485 isoform X1 [Cucurbita moschata]5.0e-4161.84Show/hide
Query:  MSQHPRLRRKSFDRIVFNFPHAGFHHNFRESTNEQIELHQNLVTEFLMNAKEMLSGNGEIHITHKTSHPYSDWEIEKLGEEEGLFLKEEADFYISDFPGY
        MSQHP L    FDRIVFNFPHAGF +    S ++  +LHQNLV  FL NAKE+   NGEIHITHK S+PYS+WEIE++ EEE LFL+EE +F   D+PGY
Subjt:  MSQHPRLRRKSFDRIVFNFPHAGFHHNFRESTNEQIELHQNLVTEFLMNAKEMLSGNGEIHITHKTSHPYSDWEIEKLGEEEGLFLKEEADFYISDFPGY

Query:  VNKRGSGFNSNRSFPVGSCSTFIFVKSLSKNKKKKNKKKKFAMSLAAEFAHL
        VNK+GSG NSNR+F VG CSTF FVK+LSK  +KK   K      AAEFA L
Subjt:  VNKRGSGFNSNRSFPVGSCSTFIFVKSLSKNKKKKNKKKKFAMSLAAEFAHL

TrEMBL top hitse value%identityAlignment
A0A0A0K8J5 DUF2431 domain-containing protein3.6e-4561.84Show/hide
Query:  MSQHPRLRRKSFDRIVFNFPHAGFHHNFRESTNEQIELHQNLVTEFLMNAKEMLSGNGEIHITHKTSHPYSDWEIEKLGEEEGLFLKEEADFYISDFPGY
        M QHP L +  FDRI+FNFPHAGF ++ +E    QI+LHQNLV  F+ NAK++L+ NGEIHITHKTSHPYS+W+IE++GEEEGL+LKEE +F   D+PGY
Subjt:  MSQHPRLRRKSFDRIVFNFPHAGFHHNFRESTNEQIELHQNLVTEFLMNAKEMLSGNGEIHITHKTSHPYSDWEIEKLGEEEGLFLKEEADFYISDFPGY

Query:  VNKRGSGFNSNRSFPVGSCSTFIFVKSLSKNKKKKNKKKKFAMSLAAEFAHL
        VNK+GSG NSN++FPVG+ STF FVK+LS  KK+ N++   + SLA+EF  L
Subjt:  VNKRGSGFNSNRSFPVGSCSTFIFVKSLSKNKKKKNKKKKFAMSLAAEFAHL

A0A6J1CDH1 uncharacterized protein At4g26485-like2.9e-3965.32Show/hide
Query:  MSQHPRLRRKSFDRIVFNFPHAGFHHNFRESTNEQIELHQNLVTEFLMNAKEMLSGNGEIHITHKTSHPYSDWEIEKLGEEEGLFLKEEADFYISDFPGY
        MSQHP LR KSFDR+VFNFPHAGFH  F+ES   QIELH++LV  FL NAKE L   GEIHITHKT+HP+S WEI +L  EEGL LKEE +F++ ++P Y
Subjt:  MSQHPRLRRKSFDRIVFNFPHAGFHHNFRESTNEQIELHQNLVTEFLMNAKEMLSGNGEIHITHKTSHPYSDWEIEKLGEEEGLFLKEEADFYISDFPGY

Query:  VNKRGSGFNSNRSFPVGSCSTFIF
         NKRG G+NS+ +FPVG+C TF F
Subjt:  VNKRGSGFNSNRSFPVGSCSTFIF

A0A6J1CDR1 uncharacterized protein At4g26485-like1.2e-4363.16Show/hide
Query:  MSQHPRLRRKSFDRIVFNFPHAGFHHNFRESTNEQIELHQNLVTEFLMNAKEMLSGNGEIHITHKTSHPYSDWEIEKLGEEEGLFLKEEADFYISDFPGY
        MSQHP LR + FDRIVFNFPHAGF   + E   +QI+LHQNLV  F+ NA EM+S NGEIHITHKTSHP+S+WEI ++ EEEGLFLKEEA+F   D+PGY
Subjt:  MSQHPRLRRKSFDRIVFNFPHAGFHHNFRESTNEQIELHQNLVTEFLMNAKEMLSGNGEIHITHKTSHPYSDWEIEKLGEEEGLFLKEEADFYISDFPGY

Query:  VNKRGSGFNSNRSFPVGSCSTFIFVKSLSKNKKKKNKKKKFAMSLAAEFAHL
        +NK+G G   NR+FPVG C TF F K+L      K+K  K + SLAAEFAHL
Subjt:  VNKRGSGFNSNRSFPVGSCSTFIFVKSLSKNKKKKNKKKKFAMSLAAEFAHL

A0A6J1FEI5 uncharacterized protein At4g26485 isoform X12.4e-4161.84Show/hide
Query:  MSQHPRLRRKSFDRIVFNFPHAGFHHNFRESTNEQIELHQNLVTEFLMNAKEMLSGNGEIHITHKTSHPYSDWEIEKLGEEEGLFLKEEADFYISDFPGY
        MSQHP L    FDRIVFNFPHAGF +    S ++  +LHQNLV  FL NAKE+   NGEIHITHK S+PYS+WEIE++ EEE LFL+EE +F   D+PGY
Subjt:  MSQHPRLRRKSFDRIVFNFPHAGFHHNFRESTNEQIELHQNLVTEFLMNAKEMLSGNGEIHITHKTSHPYSDWEIEKLGEEEGLFLKEEADFYISDFPGY

Query:  VNKRGSGFNSNRSFPVGSCSTFIFVKSLSKNKKKKNKKKKFAMSLAAEFAHL
        VNK+GSG NSNR+F VG CSTF FVK+LSK  +KK   K      AAEFA L
Subjt:  VNKRGSGFNSNRSFPVGSCSTFIFVKSLSKNKKKKNKKKKFAMSLAAEFAHL

A0A6J1IKX9 uncharacterized protein At4g26485 isoform X11.2e-4059.74Show/hide
Query:  MSQHPRLRRKSFDRIVFNFPHAGFHHNFRESTNEQIELHQNLVTEFLMNAKEMLSGNGEIHITHKTSHPYSDWEIEKLGEEEGLFLKEEADFYISDFPGY
        MSQH  L    FDRIVFNFPHAGF ++ RE    QI+LHQNLV  FL NAKE+L+ NGEIHITHK S+PYS+W+IEK+ EEEGLFL+EE +F   D+P Y
Subjt:  MSQHPRLRRKSFDRIVFNFPHAGFHHNFRESTNEQIELHQNLVTEFLMNAKEMLSGNGEIHITHKTSHPYSDWEIEKLGEEEGLFLKEEADFYISDFPGY

Query:  VNKRGSGFNSNRSFPVGSCSTFIFVKSLSKNKKKKNKKKKFAMSLAAEFAHLGL
         NK+GSG NSNR+FPVG C TF FVK+LS+ +++        ++  AEFA L L
Subjt:  VNKRGSGFNSNRSFPVGSCSTFIFVKSLSKNKKKKNKKKKFAMSLAAEFAHLGL

SwissProt top hitse value%identityAlignment
F4I1X0 Heavy metal-associated isoprenylated plant protein 413.0e-2545.45Show/hide
Query:  HPRLRRKSFDRIVFNFPHAGFHHNFRESTNEQIELHQNLVTEFLMNAKEMLSGNGEIHITHKTSHPYSDWEIEKLGEEEGLFLKEEADFYISDFPGYVNK
        HP LR + FDR++FNFPHAGFH   RES +  I  H+ LV  F   A  +L  NGE+H++HK   P+S+W +E+L     L L +   F  +++PGY NK
Subjt:  HPRLRRKSFDRIVFNFPHAGFHHNFRESTNEQIELHQNLVTEFLMNAKEMLSGNGEIHITHKTSHPYSDWEIEKLGEEEGLFLKEEADFYISDFPGYVNK

Query:  RGSGFNSNRSFPVGSCSTFIF
        RG G   ++ F +G CSTF F
Subjt:  RGSGFNSNRSFPVGSCSTFIF

P0C8L4 Uncharacterized protein At4g264852.2e-3154.69Show/hide
Query:  MSQHPRLRRKSFDRIVFNFPHAGFHHNFRESTNEQIELHQNLVTEFLMNAKEMLSGNGEIHITHKTSHPYSDWEIEKLGEEEGLFLKEEADFYISDFPGY
        MS    L  + +DRIVFNFPHAG     RE ++  IE H+ LV  FL NAKEML  +GEIHITHKT++P+SDW I+KLG+ EGL L +++ F +S +PGY
Subjt:  MSQHPRLRRKSFDRIVFNFPHAGFHHNFRESTNEQIELHQNLVTEFLMNAKEMLSGNGEIHITHKTSHPYSDWEIEKLGEEEGLFLKEEADFYISDFPGY

Query:  VNKRGS-GFNSNRSFPVGSCSTFIFVKS
        + KRGS G  S+  FPVG CST++F +S
Subjt:  VNKRGS-GFNSNRSFPVGSCSTFIFVKS

Arabidopsis top hitse value%identityAlignment
AT1G55790.1 Domain of unknown function (DUF2431)2.2e-2645.45Show/hide
Query:  HPRLRRKSFDRIVFNFPHAGFHHNFRESTNEQIELHQNLVTEFLMNAKEMLSGNGEIHITHKTSHPYSDWEIEKLGEEEGLFLKEEADFYISDFPGYVNK
        HP LR + FDR++FNFPHAGFH   RES +  I  H+ LV  F   A  +L  NGE+H++HK   P+S+W +E+L     L L +   F  +++PGY NK
Subjt:  HPRLRRKSFDRIVFNFPHAGFHHNFRESTNEQIELHQNLVTEFLMNAKEMLSGNGEIHITHKTSHPYSDWEIEKLGEEEGLFLKEEADFYISDFPGYVNK

Query:  RGSGFNSNRSFPVGSCSTFIF
        RG G   ++ F +G CSTF F
Subjt:  RGSGFNSNRSFPVGSCSTFIF

AT1G55800.1 Domain of unknown function (DUF2431)1.1e-1133.79Show/hide
Query:  HPRLRRKSFDRIVFNFPHAGFHHNFRESTNEQIE----LHQNLVTEFLMNAKEMLSGNGEIHITHKTSHPYSDWEIEKLGEEEGLFLKEEADFYISDFPG
        HP LR + FDR++FNFPH GFH   +ES   QI+      +NL  +FL  A  ML  +GE+    K                              ++PG
Subjt:  HPRLRRKSFDRIVFNFPHAGFHHNFRESTNEQIE----LHQNLVTEFLMNAKEMLSGNGEIHITHKTSHPYSDWEIEKLGEEEGLFLKEEADFYISDFPG

Query:  YVNKRGSGFNSNRSFPVGSCSTFIF-----VKSLSKNKKKKNKKK
        Y NKRG G   ++ F +G CSTF F      K L   K K+ + K
Subjt:  YVNKRGSGFNSNRSFPVGSCSTFIF-----VKSLSKNKKKKNKKK

AT4G26485.1 Domain of unknown function (DUF2431)1.5e-3254.69Show/hide
Query:  MSQHPRLRRKSFDRIVFNFPHAGFHHNFRESTNEQIELHQNLVTEFLMNAKEMLSGNGEIHITHKTSHPYSDWEIEKLGEEEGLFLKEEADFYISDFPGY
        MS    L  + +DRIVFNFPHAG     RE ++  IE H+ LV  FL NAKEML  +GEIHITHKT++P+SDW I+KLG+ EGL L +++ F +S +PGY
Subjt:  MSQHPRLRRKSFDRIVFNFPHAGFHHNFRESTNEQIELHQNLVTEFLMNAKEMLSGNGEIHITHKTSHPYSDWEIEKLGEEEGLFLKEEADFYISDFPGY

Query:  VNKRGS-GFNSNRSFPVGSCSTFIFVKS
        + KRGS G  S+  FPVG CST++F +S
Subjt:  VNKRGS-GFNSNRSFPVGSCSTFIFVKS

AT5G56060.1 Domain of unknown function (DUF2431)3.0e-2048.7Show/hide
Query:  MSQHPRLRRKSFDRIVFNFPHAGFHHNFRESTNEQIELHQNLVTEFLMNAKEMLSG-NGEIHITHKTSHPYSDWEIEKLGEEEGLFLKEEADFYISDFPG
        MS   RL R  +DRI+FNFPH+G      E     I LHQ LV  FL +A++ML   +GEIH+THKT+ P++ W IE L  E+GL L  E +F+   FPG
Subjt:  MSQHPRLRRKSFDRIVFNFPHAGFHHNFRESTNEQIELHQNLVTEFLMNAKEMLSG-NGEIHITHKTSHPYSDWEIEKLGEEEGLFLKEEADFYISDFPG

Query:  YVNKRGSGFNSNRSF
        Y NK+G G N N +F
Subjt:  YVNKRGSGFNSNRSF

AT5G56075.1 Domain of unknown function (DUF2431)1.6e-2144.07Show/hide
Query:  FDRIVFNFPHAGFHHNFRESTNEQIELHQNLVTEFLMNAKEMLSG---NGEIHITHKTSHPYSDWEIEKLGEEEGLFLKEEADFYISDFPGYVNKRGSGF
        +DR++FNFP           T+E       LV  F+ +A+ ++      GEIH+ HKT +P+S+W+++ LGE+EGL L  E +F +S +PGY NKRGSG 
Subjt:  FDRIVFNFPHAGFHHNFRESTNEQIELHQNLVTEFLMNAKEMLSG---NGEIHITHKTSHPYSDWEIEKLGEEEGLFLKEEADFYISDFPGYVNKRGSGF

Query:  NSNRSFPVGSCSTFIFVK
         S+ SFPVG  STF+F K
Subjt:  NSNRSFPVGSCSTFIFVK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCCAACATCCACGTCTTCGACGCAAGTCGTTTGATCGTATCGTATTCAATTTCCCACATGCTGGGTTTCATCACAACTTTAGAGAATCAACTAACGAGCAAATTGA
ACTCCATCAGAATCTAGTGACAGAATTCCTAATGAATGCAAAGGAAATGTTGAGTGGAAATGGGGAAATTCACATCACCCACAAGACATCACATCCATACAGTGATTGGG
AAATTGAGAAACTTGGAGAAGAAGAAGGTTTATTTCTAAAAGAGGAAGCAGATTTCTATATAAGTGATTTTCCAGGTTATGTCAATAAGAGAGGCAGTGGCTTTAATAGC
AATCGAAGTTTTCCTGTGGGATCTTGCAGCACTTTCATCTTCGTTAAATCACTTTCCAAGAACAAGAAGAAGAAGAACAAGAAGAAGAAGTTTGCCATGTCCTTGGCTGC
TGAATTTGCCCACCTTGGACTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGCCAACATCCACGTCTTCGACGCAAGTCGTTTGATCGTATCGTATTCAATTTCCCACATGCTGGGTTTCATCACAACTTTAGAGAATCAACTAACGAGCAAATTGA
ACTCCATCAGAATCTAGTGACAGAATTCCTAATGAATGCAAAGGAAATGTTGAGTGGAAATGGGGAAATTCACATCACCCACAAGACATCACATCCATACAGTGATTGGG
AAATTGAGAAACTTGGAGAAGAAGAAGGTTTATTTCTAAAAGAGGAAGCAGATTTCTATATAAGTGATTTTCCAGGTTATGTCAATAAGAGAGGCAGTGGCTTTAATAGC
AATCGAAGTTTTCCTGTGGGATCTTGCAGCACTTTCATCTTCGTTAAATCACTTTCCAAGAACAAGAAGAAGAAGAACAAGAAGAAGAAGTTTGCCATGTCCTTGGCTGC
TGAATTTGCCCACCTTGGACTCTAA
Protein sequenceShow/hide protein sequence
MSQHPRLRRKSFDRIVFNFPHAGFHHNFRESTNEQIELHQNLVTEFLMNAKEMLSGNGEIHITHKTSHPYSDWEIEKLGEEEGLFLKEEADFYISDFPGYVNKRGSGFNS
NRSFPVGSCSTFIFVKSLSKNKKKKNKKKKFAMSLAAEFAHLGL