CuGenDBv2

Tan0000716 (gene) of Snake gourd v1 genome

Gene ID: Tan0000716
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Myb_DNA-bind_3 domain-containing protein
Genome location: LG01:75610133..75612081
RNA-Seq Expression: Tan0000716
Synteny: Tan0000716
Gene Ontology terms: NA
InterPro domains: NA


Homology

GenBank top hits (e-value, % identity, alignment)
KAA0033487.1 uncharacterized protein E6C27_scaffold261G00210 [Cucumis melo var. makuwa]; e-value 2.2e-45; identity 44.17%
Query:  MLKSTGGLVPTQNVDIEEMVAIFLHILAHDVKNRVIRSNLLRSVRAVSRHFNSVLNAVLQLHDLLLKKPEPITNNCTDERWKWFEM--------------
        ML++ GGL  TQ VD+EEMV IFLHI+AHDVKNRV R +  RS   VSRHFN VLN VL+LH++LLK+P+ +T++C+ E+W+WF+M              
Subjt:  MLKSTGGLVPTQNVDIEEMVAIFLHILAHDVKNRVIRSNLLRSVRAVSRHFNSVLNAVLQLHDLLLKKPEPITNNCTDERWKWFEM--------------

Query:  ------------LAEK---------------------LPNSCLEQN-----TIDCKVRTLKKQYHAIAEMLSNACSGFGWNEEFKCVEAEKEVFDAWVK-
                    L EK                     +    LE N      ++  V+ LKKQY  IAEM+   CSGF WN+E KC+EAEK V + WVK 
Subjt:  ------------LAEK---------------------LPNSCLEQN-----TIDCKVRTLKKQYHAIAEMLSNACSGFGWNEEFKCVEAEKEVFDAWVK-

Query:  -----WMRNKPFPHYDDLAYVFGKDRATGMGAETPMEMAS
             ++ NKPFP++ DL  VFG+DRATG   +TP+EM S
Subjt:  -----WMRNKPFPHYDDLAYVFGKDRATGMGAETPMEMAS

KAG6532280.1 hypothetical protein ZIOFF_006120 [Zingiber officinale]; e-value 3.0e-47; identity 46.08%
Query:  MDRRTFTILCTMLKSTGGLVPTQNVDIEEMVAIFLHILAHDVKNRVIRSNLLRSVRAVSRHFNSVLNAVLQLHDLLLKKPEPITNNCTDERWKWF-----
        MDRR+F+ LC +L     L   +N+ I E+V  FLHI+AH+VKNRV++    RS   +SR F+ +LN++L+LH++LLKKPEPI  +CTDERWKWF     
Subjt:  MDRRTFTILCTMLKSTGGLVPTQNVDIEEMVAIFLHILAHDVKNRVIRSNLLRSVRAVSRHFNSVLNAVLQLHDLLLKKPEPITNNCTDERWKWF-----

Query:  ----------EMLAEKLPNSCLEQNT-IDCKVRTLKKQYHAIAEMLSNACSGFGWNEEFKCVEAEKEVFDAWVK------WMRNKPFPHYDDLAYVFGKD
                  +++A KLP+S L+    ID + + LK+++HAI +ML N  SGFGWN+  KC+   K+VF+ WVK       +RNK FP++DDL +V+GKD
Subjt:  ----------EMLAEKLPNSCLEQNT-IDCKVRTLKKQYHAIAEMLSNACSGFGWNEEFKCVEAEKEVFDAWVK------WMRNKPFPHYDDLAYVFGKD

Query:  RATGMGAETPMEMASSV
         ATG  AETP++    +
Subjt:  RATGMGAETPMEMASSV

TYK05796.1 retrotransposon protein [Cucumis melo var. makuwa]; e-value 5.3e-44; identity 42.26%
Query:  MDRRTFTILCTMLKSTGGLVPTQNVDIEEMVAIFLHILAHDVKNRVIRSNLLRSVRAVSRHFNSVLNAVLQLHDLLLKKPEPITNNCTDERWKWFEMLAE
        MDRRTF ILC +L++  GL  T+ VD+EEMVA+FLH+LAHD+KN VI+   +RS   VSRHFN VL  V++L++ L+K+P P+TNNC D+RW+ FE    
Subjt:  MDRRTFTILCTMLKSTGGLVPTQNVDIEEMVAIFLHILAHDVKNRVIRSNLLRSVRAVSRHFNSVLNAVLQLHDLLLKKPEPITNNCTDERWKWFEMLAE

Query:  KLPNSCLEQN---TIDCKVRTLK----------KQY-----------------------------HAIAEMLSNACSGFGWNEEFKCVEAEKEVFDAWV-
         L  + ++ N   T     RT K          K Y                              AI EM   ACSGFGWN+E KC+  EKE+FD WV 
Subjt:  KLPNSCLEQN---TIDCKVRTLK----------KQY-----------------------------HAIAEMLSNACSGFGWNEEFKCVEAEKEVFDAWV-

Query:  -----KWMRNKPFPHYDDLAYVFGKDRATGMGAETPMEMASSVLQSKWRRRFGWDHKTSWGGNNE
             K + NKPFP+YD+L YVFG+DRATG   ET + + S+          G+D      GN E
Subjt:  -----KWMRNKPFPHYDDLAYVFGKDRATGMGAETPMEMASSVLQSKWRRRFGWDHKTSWGGNNE

TYK07921.1 hypothetical protein E5676_scaffold265G00330 [Cucumis melo var. makuwa]; e-value 2.6e-46; identity 42.75%
Query:  MDRRTFTILCTMLKSTGGLVPTQNVDIEEMVAIFLHILAHDVKNRVIRSNLLRSVRAVSRHFNSVLNAVLQLHDLLLKKPEPITNNC-------------
        MDRR FTILCTML++ GGL  TQ VD++EMV IFLHI+AHDVKNRV R +  RS   VSRHFN+VLNAVL+LH++LLK+P+P+T++C             
Subjt:  MDRRTFTILCTMLKSTGGLVPTQNVDIEEMVAIFLHILAHDVKNRVIRSNLLRSVRAVSRHFNSVLNAVLQLHDLLLKKPEPITNNC-------------

Query:  --------------TDERWKWFEMLAEKLPNSCL--------------------EQNTIDCKVRTLK----------------KQYHAIAEMLSNACSGF
                      T+    +  ++ + L +S L                    ++  ++C ++ ++                KQY AIAEM+  ACSGF
Subjt:  --------------TDERWKWFEMLAEKLPNSCL--------------------EQNTIDCKVRTLK----------------KQYHAIAEMLSNACSGF

Query:  GWNEEFKCVEAEKEVFDAWVK------WMRNKPFPHYDDLAYVFGKDRATGMGAETPMEMAS
        GWNE  KC+E EK VFD WVK       + NKPFP++ DL  VFG+DRATG   +TP+EM+S
Subjt:  GWNEEFKCVEAEKEVFDAWVK------WMRNKPFPHYDDLAYVFGKDRATGMGAETPMEMAS

TYK26842.1 uncharacterized protein E5676_scaffold260G00340 [Cucumis melo var. makuwa]; e-value 2.2e-45; identity 44.17%
Query:  MLKSTGGLVPTQNVDIEEMVAIFLHILAHDVKNRVIRSNLLRSVRAVSRHFNSVLNAVLQLHDLLLKKPEPITNNCTDERWKWFEM--------------
        ML++ GGL  TQ VD+EEMV IFLHI+AHDVKNRV R +  RS   VSRHFN VLN VL+LH++LLK+P+ +T++C+ E+W+WF+M              
Subjt:  MLKSTGGLVPTQNVDIEEMVAIFLHILAHDVKNRVIRSNLLRSVRAVSRHFNSVLNAVLQLHDLLLKKPEPITNNCTDERWKWFEM--------------

Query:  ------------LAEK---------------------LPNSCLEQN-----TIDCKVRTLKKQYHAIAEMLSNACSGFGWNEEFKCVEAEKEVFDAWVK-
                    L EK                     +    LE N      ++  V+ LKKQY  IAEM+   CSGF WN+E KC+EAEK V + WVK 
Subjt:  ------------LAEK---------------------LPNSCLEQN-----TIDCKVRTLKKQYHAIAEMLSNACSGFGWNEEFKCVEAEKEVFDAWVK-

Query:  -----WMRNKPFPHYDDLAYVFGKDRATGMGAETPMEMAS
             ++ NKPFP++ DL  VFG+DRATG   +TP+EM S
Subjt:  -----WMRNKPFPHYDDLAYVFGKDRATGMGAETPMEMAS

TrEMBL top hits (e-value, % identity, alignment)
A0A5A7SW62 Myb_DNA-bind_3 domain-containing protein; e-value 1.0e-45; identity 44.17%
Query:  MLKSTGGLVPTQNVDIEEMVAIFLHILAHDVKNRVIRSNLLRSVRAVSRHFNSVLNAVLQLHDLLLKKPEPITNNCTDERWKWFEM--------------
        ML++ GGL  TQ VD+EEMV IFLHI+AHDVKNRV R +  RS   VSRHFN VLN VL+LH++LLK+P+ +T++C+ E+W+WF+M              
Subjt:  MLKSTGGLVPTQNVDIEEMVAIFLHILAHDVKNRVIRSNLLRSVRAVSRHFNSVLNAVLQLHDLLLKKPEPITNNCTDERWKWFEM--------------

Query:  ------------LAEK---------------------LPNSCLEQN-----TIDCKVRTLKKQYHAIAEMLSNACSGFGWNEEFKCVEAEKEVFDAWVK-
                    L EK                     +    LE N      ++  V+ LKKQY  IAEM+   CSGF WN+E KC+EAEK V + WVK 
Subjt:  ------------LAEK---------------------LPNSCLEQN-----TIDCKVRTLKKQYHAIAEMLSNACSGFGWNEEFKCVEAEKEVFDAWVK-

Query:  -----WMRNKPFPHYDDLAYVFGKDRATGMGAETPMEMAS
             ++ NKPFP++ DL  VFG+DRATG   +TP+EM S
Subjt:  -----WMRNKPFPHYDDLAYVFGKDRATGMGAETPMEMAS

A0A5C7ICT5 Uncharacterized protein; e-value 1.5e-39; identity 42.08%
Query:  MDRRTFTILCTMLKSTGGLVPTQNVDIEEMVAIFLHILAHDVKNRVIRSNLLRSVRAVSRHFNSVLNAVLQLHDLLLKKPEPITNNCTDERWKWFE----
        MDRRTF +LC +L++TG L     V +EE V +FLHILAH VKNR IR+   R    +SR+FNSVL+AVLQLH+ LL  P+P+  NCTDERWK F+    
Subjt:  MDRRTFTILCTMLKSTGGLVPTQNVDIEEMVAIFLHILAHDVKNRVIRSNLLRSVRAVSRHFNSVLNAVLQLHDLLLKKPEPITNNCTDERWKWFE----

Query:  -----------------------------------MLAEKLPNSCLEQNT-IDCKVRTLKKQYHAIAEMLSNACSGFGWNEEFKCVEAEKEVFDAWVKWM
                                            LA  LPN  L     I+ K++T KKQY  I +M++   SGFGWN   KCVE +    +AW  ++
Subjt:  -----------------------------------MLAEKLPNSCLEQNT-IDCKVRTLKKQYHAIAEMLSNACSGFGWNEEFKCVEAEKEVFDAWVKWM

Query:  -----RNKPFPHYDDLAYVFGKDRATGMGAETPMEMASSV
             R+K FP Y+ L  +FGKDRATG  A TP  +A+++
Subjt:  -----RNKPFPHYDDLAYVFGKDRATGMGAETPMEMASSV

A0A5D3C620 Retrotransposon protein; e-value 2.6e-44; identity 42.26%
Query:  MDRRTFTILCTMLKSTGGLVPTQNVDIEEMVAIFLHILAHDVKNRVIRSNLLRSVRAVSRHFNSVLNAVLQLHDLLLKKPEPITNNCTDERWKWFEMLAE
        MDRRTF ILC +L++  GL  T+ VD+EEMVA+FLH+LAHD+KN VI+   +RS   VSRHFN VL  V++L++ L+K+P P+TNNC D+RW+ FE    
Subjt:  MDRRTFTILCTMLKSTGGLVPTQNVDIEEMVAIFLHILAHDVKNRVIRSNLLRSVRAVSRHFNSVLNAVLQLHDLLLKKPEPITNNCTDERWKWFEMLAE

Query:  KLPNSCLEQN---TIDCKVRTLK----------KQY-----------------------------HAIAEMLSNACSGFGWNEEFKCVEAEKEVFDAWV-
         L  + ++ N   T     RT K          K Y                              AI EM   ACSGFGWN+E KC+  EKE+FD WV 
Subjt:  KLPNSCLEQN---TIDCKVRTLK----------KQY-----------------------------HAIAEMLSNACSGFGWNEEFKCVEAEKEVFDAWV-

Query:  -----KWMRNKPFPHYDDLAYVFGKDRATGMGAETPMEMASSVLQSKWRRRFGWDHKTSWGGNNE
             K + NKPFP+YD+L YVFG+DRATG   ET + + S+          G+D      GN E
Subjt:  -----KWMRNKPFPHYDDLAYVFGKDRATGMGAETPMEMASSVLQSKWRRRFGWDHKTSWGGNNE

A0A5D3C7T4 Uncharacterized protein; e-value 1.2e-46; identity 42.75%
Query:  MDRRTFTILCTMLKSTGGLVPTQNVDIEEMVAIFLHILAHDVKNRVIRSNLLRSVRAVSRHFNSVLNAVLQLHDLLLKKPEPITNNC-------------
        MDRR FTILCTML++ GGL  TQ VD++EMV IFLHI+AHDVKNRV R +  RS   VSRHFN+VLNAVL+LH++LLK+P+P+T++C             
Subjt:  MDRRTFTILCTMLKSTGGLVPTQNVDIEEMVAIFLHILAHDVKNRVIRSNLLRSVRAVSRHFNSVLNAVLQLHDLLLKKPEPITNNC-------------

Query:  --------------TDERWKWFEMLAEKLPNSCL--------------------EQNTIDCKVRTLK----------------KQYHAIAEMLSNACSGF
                      T+    +  ++ + L +S L                    ++  ++C ++ ++                KQY AIAEM+  ACSGF
Subjt:  --------------TDERWKWFEMLAEKLPNSCL--------------------EQNTIDCKVRTLK----------------KQYHAIAEMLSNACSGF

Query:  GWNEEFKCVEAEKEVFDAWVK------WMRNKPFPHYDDLAYVFGKDRATGMGAETPMEMAS
        GWNE  KC+E EK VFD WVK       + NKPFP++ DL  VFG+DRATG   +TP+EM+S
Subjt:  GWNEEFKCVEAEKEVFDAWVK------WMRNKPFPHYDDLAYVFGKDRATGMGAETPMEMAS

A0A5D3DTL0 Myb_DNA-bind_3 domain-containing protein; e-value 1.0e-45; identity 44.17%
Query:  MLKSTGGLVPTQNVDIEEMVAIFLHILAHDVKNRVIRSNLLRSVRAVSRHFNSVLNAVLQLHDLLLKKPEPITNNCTDERWKWFEM--------------
        ML++ GGL  TQ VD+EEMV IFLHI+AHDVKNRV R +  RS   VSRHFN VLN VL+LH++LLK+P+ +T++C+ E+W+WF+M              
Subjt:  MLKSTGGLVPTQNVDIEEMVAIFLHILAHDVKNRVIRSNLLRSVRAVSRHFNSVLNAVLQLHDLLLKKPEPITNNCTDERWKWFEM--------------

Query:  ------------LAEK---------------------LPNSCLEQN-----TIDCKVRTLKKQYHAIAEMLSNACSGFGWNEEFKCVEAEKEVFDAWVK-
                    L EK                     +    LE N      ++  V+ LKKQY  IAEM+   CSGF WN+E KC+EAEK V + WVK 
Subjt:  ------------LAEK---------------------LPNSCLEQN-----TIDCKVRTLKKQYHAIAEMLSNACSGFGWNEEFKCVEAEKEVFDAWVK-

Query:  -----WMRNKPFPHYDDLAYVFGKDRATGMGAETPMEMAS
             ++ NKPFP++ DL  VFG+DRATG   +TP+EM S
Subjt:  -----WMRNKPFPHYDDLAYVFGKDRATGMGAETPMEMAS

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
AT1G43722.1 unknown protein; e-value 1.0e-05; identity 42.86%
Query:  FTILCTMLKSTGGLVPTQNVDIEEMVAIFLHILAHDVKNRVIRSNLLRSVRAVSRHFNSVLNA
        FT LC ML++   L PT N+ IEE VA+FL I  H+   R +     R+   V R F  VL A
Subjt:  FTILCTMLKSTGGLVPTQNVDIEEMVAIFLHILAHDVKNRVIRSNLLRSVRAVSRHFNSVLNA

AT4G02210.1 unknown protein; e-value 3.4e-04; identity 29.03%
Query:  WFEMLAEKLPNSCLEQN----TIDCKVRTLKKQYHAIAEMLSNACSGFGWNEEFKCVEAEKEVFDAWVKWMRN------KPFPHYDDLAYVFG
        W EM+   L N+  E N     +  + ++L++Q++AI  +L +   GF W+ E + V A+  V+  ++K  R+      +P P+Y DL  + G
Subjt:  WFEMLAEKLPNSCLEQN----TIDCKVRTLKKQYHAIAEMLSNACSGFGWNEEFKCVEAEKEVFDAWVKWMRN------KPFPHYDDLAYVFG

AT4G02210.2 unknown protein; e-value 3.4e-04; identity 29.03%
Query:  WFEMLAEKLPNSCLEQN----TIDCKVRTLKKQYHAIAEMLSNACSGFGWNEEFKCVEAEKEVFDAWVKWMRN------KPFPHYDDLAYVFG
        W EM+   L N+  E N     +  + ++L++Q++AI  +L +   GF W+ E + V A+  V+  ++K  R+      +P P+Y DL  + G
Subjt:  WFEMLAEKLPNSCLEQN----TIDCKVRTLKKQYHAIAEMLSNACSGFGWNEEFKCVEAEKEVFDAWVKWMRN------KPFPHYDDLAYVFG

AT5G41980.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912); e-value 4.6e-09; identity 40.28%
Query:  MDRRTFTILCTMLKSTGGLVPTQNVDIEEMVAIFLHILAHDVKNRVIRSNLLRSVRAVSRHFNSVLNAVLQL
        MD+  F  LC +L++ G L  T  + IE  +AIFL I+ H+++ R ++     S   +SRHFN+VLNAV+ +
Subjt:  MDRRTFTILCTMLKSTGGLVPTQNVDIEEMVAIFLHILAHDVKNRVIRSNLLRSVRAVSRHFNSVLNAVLQL


Sequences
CDS sequence
ATGGATAGAAGGACCTTCACCATCCTGTGTACTATGCTAAAGTCGACTGGTGGTTTAGTACCGACACAGAATGTTGATATCGAAGAAATGGTTGCTATATTCTTGCACAT
CCTGGCACACGATGTTAAGAATCGGGTGATTCGCAGCAATTTACTCCGGTCCGTGAGAGCGGTTTCTAGACACTTCAACTCGGTGTTAAACGCAGTTTTACAACTACACG
ACTTGTTGTTGAAAAAACCAGAACCAATCACCAACAACTGCACTGATGAGCGATGGAAATGGTTTGAGATGCTAGCTGAGAAATTACCAAACTCATGCCTAGAACAAAAC
ACAATTGATTGCAAGGTTAGAACTCTCAAGAAACAATACCATGCTATTGCAGAGATGCTTAGTAATGCATGTAGTGGCTTCGGCTGGAACGAAGAGTTCAAGTGTGTAGA
GGCAGAGAAGGAGGTGTTTGATGCATGGGTTAAGTGGATGAGGAATAAACCATTTCCACACTATGATGACCTCGCCTATGTCTTTGGAAAGGATAGAGCTACAGGAATGG
GTGCAGAGACCCCAATGGAAATGGCATCTAGCGTACTGCAGAGCAAATGGAGGAGGAGATTCGGTTGGGATCACAAGACTTCATGGGGGGGGAACAACGAACAATGGAGA
ATCCAGAATTGGTGA
mRNA sequence
ATGGATAGAAGGACCTTCACCATCCTGTGTACTATGCTAAAGTCGACTGGTGGTTTAGTACCGACACAGAATGTTGATATCGAAGAAATGGTTGCTATATTCTTGCACAT
CCTGGCACACGATGTTAAGAATCGGGTGATTCGCAGCAATTTACTCCGGTCCGTGAGAGCGGTTTCTAGACACTTCAACTCGGTGTTAAACGCAGTTTTACAACTACACG
ACTTGTTGTTGAAAAAACCAGAACCAATCACCAACAACTGCACTGATGAGCGATGGAAATGGTTTGAGATGCTAGCTGAGAAATTACCAAACTCATGCCTAGAACAAAAC
ACAATTGATTGCAAGGTTAGAACTCTCAAGAAACAATACCATGCTATTGCAGAGATGCTTAGTAATGCATGTAGTGGCTTCGGCTGGAACGAAGAGTTCAAGTGTGTAGA
GGCAGAGAAGGAGGTGTTTGATGCATGGGTTAAGTGGATGAGGAATAAACCATTTCCACACTATGATGACCTCGCCTATGTCTTTGGAAAGGATAGAGCTACAGGAATGG
GTGCAGAGACCCCAATGGAAATGGCATCTAGCGTACTGCAGAGCAAATGGAGGAGGAGATTCGGTTGGGATCACAAGACTTCATGGGGGGGGAACAACGAACAATGGAGA
ATCCAGAATTGGTGA
Protein sequence
MDRRTFTILCTMLKSTGGLVPTQNVDIEEMVAIFLHILAHDVKNRVIRSNLLRSVRAVSRHFNSVLNAVLQLHDLLLKKPEPITNNCTDERWKWFEMLAEKLPNSCLEQN
TIDCKVRTLKKQYHAIAEMLSNACSGFGWNEEFKCVEAEKEVFDAWVKWMRNKPFPHYDDLAYVFGKDRATGMGAETPMEMASSVLQSKWRRRFGWDHKTSWGGNNEQWR
IQNW
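
The CDS and protein sequences listed above can be cross-checked by translating the CDS in frame 1. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the `translate` helper and the codon table construction are illustrative and are not part of the database record.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: this table is appropriate for the gene shown in this record.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored; unknown codons become 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last 15 nucleotides of the CDS listed above.
print(translate("ATGGATAGAAGGACC"))  # MDRRT
print(translate("ATCCAGAATTGGTGA"))  # IQNW
```

Passing the full 675-nucleotide CDS to `translate` should yield a 224-residue protein, which can then be compared against the protein sequence in this record.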