CuGenDBv2

Tan0005597 (gene) of Snake gourd v1 genome

Gene ID: Tan0005597
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Myb_DNA-bind_3 domain-containing protein
Genome location: LG03:51139938..51141918
RNA-Seq Expression: Tan0005597
Synteny: Tan0005597
Gene Ontology terms: NA
InterPro domains: NA
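
The genome location field above has the form `sequence:start..end`. As a minimal sketch, the Python snippet below splits such a string into its parts and computes the span length. It assumes the coordinates are 1-based and inclusive, which this record does not state; the names `Location` and `parse_location` are hypothetical helpers introduced only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval (hypothetical helper type, not part of CuGenDB)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'LG03:51139938..51141918'."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return Location(seqid, start, end)


loc = parse_location("LG03:51139938..51141918")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the span for this gene works out to 1981 positions on LG03.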


Homology
GenBank top hits (e value, % identity, alignment)
KAA0033487.1 uncharacterized protein E6C27_scaffold261G00210 [Cucumis melo var. makuwa] | e value: 1.9e-61 | % identity: 54.47
Query:  MLKGIGVLVSTQYVDVEEMVAIFLHIIAHDVKNRVMRRYFARSGETVSRHFNAVLNAVLRLHSVLLKSPEPVTRTCQDEKWRWFE---------------
        ML+  G L +TQYVDVEEMV IFLHI+AHDVKNRV RR+FARSGETVSRHFN VLN VLRLH +LLK P+ VT +C  EKWRWF+               
Subjt:  MLKGIGVLVSTQYVDVEEMVAIFLHIIAHDVKNRVMRRYFARSGETVSRHFNAVLNAVLRLHSVLLKSPEPVTRTCQDEKWRWFE---------------

Query:  ---------------------------------------EKLPKSDIQVTPNLESRVKILKKQYNTIVEMIGLACSGFGWNDERKCIEAEKAIFDDWVKA
                                               EK+ +S+IQVTPNLES VKILKKQY TI EM+G  CSGF WN ERKCIEAEK++ +DWVK 
Subjt:  ---------------------------------------EKLPKSDIQVTPNLESRVKILKKQYNTIVEMIGLACSGFGWNDERKCIEAEKAIFDDWVKA

Query:  HPHARGLRNKPFPYFDELSITFGKDKATGASVETP
        H +AR L NKPFPYF +L I FG+D+ATG   +TP
Subjt:  HPHARGLRNKPFPYFDELSITFGKDKATGASVETP

KAG6532280.1 hypothetical protein ZIOFF_006120 [Zingiber officinale] | e value: 9.9e-50 | % identity: 47.62
Query:  MDRRTFAILYEMLKGIGVLVSTQYVDVEEMVAIFLHIIAHDVKNRVMRRYFARSGETVSRHFNAVLNAVLRLHSVLLKSPEPVTRTCQDEKWRWFEE---
        MDRR+F+ L  +L     L   + + + E+V  FLHIIAH+VKNRV++   ARSGET+SR F+ +LN++LRLH++LLK PEP+   C DE+W+WF+    
Subjt:  MDRRTFAILYEMLKGIGVLVSTQYVDVEEMVAIFLHIIAHDVKNRVMRRYFARSGETVSRHFNAVLNAVLRLHSVLLKSPEPVTRTCQDEKWRWFEE---

Query:  ---------------KLPKSDIQVTPNLESRVKILKKQYNTIVEMIGLACSGFGWNDERKCIEAEKAIFDDWVKAHPHARGLRNKPFPYFDELSITFGKD
                       KLP S ++ TP+++SR K+LK++++ I +M+  + SGFGWND  KCI   K +F+DWVK HP   GLRNK FPY D+L   +GKD
Subjt:  ---------------KLPKSDIQVTPNLESRVKILKKQYNTIVEMIGLACSGFGWNDERKCIEAEKAIFDDWVKAHPHARGLRNKPFPYFDELSITFGKD

Query:  KATGASVETP
         ATGA+ ETP
Subjt:  KATGASVETP

TYK05796.1 retrotransposon protein [Cucumis melo var. makuwa] | e value: 1.4e-48 | % identity: 46.81
Query:  MDRRTFAILYEMLKGIGVLVSTQYVDVEEMVAIFLHIIAHDVKNRVMRRYFARSGETVSRHFNAVLNAVLRLHSVLLKSPEPVTRTCQDEKWRWFEEKLP
        MDRRTFAIL  +L+ +  L ST+ VDVEEMVA+FLH++AHD+KN V++R F RS ETVSRHFN VL  V+RL+  L+K P PVT  C+D++W+ FE  L 
Subjt:  MDRRTFAILYEMLKGIGVLVSTQYVDVEEMVAIFLHIIAHDVKNRVMRRYFARSGETVSRHFNAVLNAVLRLHSVLLKSPEPVTRTCQDEKWRWFEEKLP

Query:  KSD---IQVT------PNLESRVKILKKQY-----------------------------------NTIVEMIGLACSGFGWNDERKCIEAEKAIFDDWVK
          D   I+V       P   +R   +   Y                                     I EM G ACSGFGWNDE KCI  EK +FD+WV+
Subjt:  KSD---IQVT------PNLESRVKILKKQY-----------------------------------NTIVEMIGLACSGFGWNDERKCIEAEKAIFDDWVK

Query:  AHPHARGLRNKPFPYFDELSITFGKDKATGASVET
        +HP  +GL NKPFPY+DEL+  FG+D+ATG  VET
Subjt:  AHPHARGLRNKPFPYFDELSITFGKDKATGASVET

TYK07921.1 hypothetical protein E5676_scaffold265G00330 [Cucumis melo var. makuwa] | e value: 3.1e-51 | % identity: 46.9
Query:  MDRRTFAILYEMLKGIGVLVSTQYVDVEEMVAIFLHIIAHDVKNRVMRRYFARSGETVSRHFNAVLNAVLRLHSVLLKSPEPVTRTC-------------
        MDRR F IL  ML+  G L +TQYVDV+EMV IFLHI+AHDVKNRV RR+ ARSGETVSRHFNAVLNAVLRLH +LLK P+PVT +C             
Subjt:  MDRRTFAILYEMLKGIGVLVSTQYVDVEEMVAIFLHIIAHDVKNRVMRRYFARSGETVSRHFNAVLNAVLRLHSVLLKSPEPVTRTC-------------

Query:  -------------------------------------------QDEKWRWF---EEKLPKSDIQV-------TPNLESRVKILKKQYNTIVEMIGLACSG
                                                   +  K RW    +E L +  +Q+         N   ++  L KQY  I EM+G ACSG
Subjt:  -------------------------------------------QDEKWRWF---EEKLPKSDIQV-------TPNLESRVKILKKQYNTIVEMIGLACSG

Query:  FGWNDERKCIEAEKAIFDDWVKAHPHARGLRNKPFPYFDELSITFGKDKATGASVETP
        FGWN+ +KCIE EK +FDDWVK HP+A+GL NKPFPYF +L + FG+D+ATG   +TP
Subjt:  FGWNDERKCIEAEKAIFDDWVKAHPHARGLRNKPFPYFDELSITFGKDKATGASVETP

TYK26842.1 uncharacterized protein E5676_scaffold260G00340 [Cucumis melo var. makuwa] | e value: 1.9e-61 | % identity: 54.47
Query:  MLKGIGVLVSTQYVDVEEMVAIFLHIIAHDVKNRVMRRYFARSGETVSRHFNAVLNAVLRLHSVLLKSPEPVTRTCQDEKWRWFE---------------
        ML+  G L +TQYVDVEEMV IFLHI+AHDVKNRV RR+FARSGETVSRHFN VLN VLRLH +LLK P+ VT +C  EKWRWF+               
Subjt:  MLKGIGVLVSTQYVDVEEMVAIFLHIIAHDVKNRVMRRYFARSGETVSRHFNAVLNAVLRLHSVLLKSPEPVTRTCQDEKWRWFE---------------

Query:  ---------------------------------------EKLPKSDIQVTPNLESRVKILKKQYNTIVEMIGLACSGFGWNDERKCIEAEKAIFDDWVKA
                                               EK+ +S+IQVTPNLES VKILKKQY TI EM+G  CSGF WN ERKCIEAEK++ +DWVK 
Subjt:  ---------------------------------------EKLPKSDIQVTPNLESRVKILKKQYNTIVEMIGLACSGFGWNDERKCIEAEKAIFDDWVKA

Query:  HPHARGLRNKPFPYFDELSITFGKDKATGASVETP
        H +AR L NKPFPYF +L I FG+D+ATG   +TP
Subjt:  HPHARGLRNKPFPYFDELSITFGKDKATGASVETP

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SW62 Myb_DNA-bind_3 domain-containing protein | e value: 9.3e-62 | % identity: 54.47
Query:  MLKGIGVLVSTQYVDVEEMVAIFLHIIAHDVKNRVMRRYFARSGETVSRHFNAVLNAVLRLHSVLLKSPEPVTRTCQDEKWRWFE---------------
        ML+  G L +TQYVDVEEMV IFLHI+AHDVKNRV RR+FARSGETVSRHFN VLN VLRLH +LLK P+ VT +C  EKWRWF+               
Subjt:  MLKGIGVLVSTQYVDVEEMVAIFLHIIAHDVKNRVMRRYFARSGETVSRHFNAVLNAVLRLHSVLLKSPEPVTRTCQDEKWRWFE---------------

Query:  ---------------------------------------EKLPKSDIQVTPNLESRVKILKKQYNTIVEMIGLACSGFGWNDERKCIEAEKAIFDDWVKA
                                               EK+ +S+IQVTPNLES VKILKKQY TI EM+G  CSGF WN ERKCIEAEK++ +DWVK 
Subjt:  ---------------------------------------EKLPKSDIQVTPNLESRVKILKKQYNTIVEMIGLACSGFGWNDERKCIEAEKAIFDDWVKA

Query:  HPHARGLRNKPFPYFDELSITFGKDKATGASVETP
        H +AR L NKPFPYF +L I FG+D+ATG   +TP
Subjt:  HPHARGLRNKPFPYFDELSITFGKDKATGASVETP

A0A5D3BTU3 Putative nuclease HARBI1 | e value: 5.7e-43 | % identity: 40.44
Query:  MDRRTFAILYEMLKGIGVLVSTQYVDVEEMVAIFLHIIAHDVKNRVMRRYFARSGETVSRHFNAVLNAVLRLHSVLLKSPEPVTRTCQDEKWRWFE----
        MD+R F IL  ML+  G L +TQYVDVEEMVAIF HI+AHDVKNRV RR+FARSGET+SRHFNAV NAVLRLH + LK P+PVT +C  EKW+WF+    
Subjt:  MDRRTFAILYEMLKGIGVLVSTQYVDVEEMVAIFLHIIAHDVKNRVMRRYFARSGETVSRHFNAVLNAVLRLHSVLLKSPEPVTRTCQDEKWRWFE----

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------EKLPKSDIQVTPNLESRVKILKKQYNTIVEMIGLACSGFGWNDERKCIEAEKAIFDDW
                      EK+P+S+IQVT NLESRV ILKKQY  I EM+G AC+GFG N++RKCIEAEK +FDDW
Subjt:  --------------EKLPKSDIQVTPNLESRVKILKKQYNTIVEMIGLACSGFGWNDERKCIEAEKAIFDDW

A0A5D3C620 Retrotransposon protein | e value: 6.9e-49 | % identity: 46.81
Query:  MDRRTFAILYEMLKGIGVLVSTQYVDVEEMVAIFLHIIAHDVKNRVMRRYFARSGETVSRHFNAVLNAVLRLHSVLLKSPEPVTRTCQDEKWRWFEEKLP
        MDRRTFAIL  +L+ +  L ST+ VDVEEMVA+FLH++AHD+KN V++R F RS ETVSRHFN VL  V+RL+  L+K P PVT  C+D++W+ FE  L 
Subjt:  MDRRTFAILYEMLKGIGVLVSTQYVDVEEMVAIFLHIIAHDVKNRVMRRYFARSGETVSRHFNAVLNAVLRLHSVLLKSPEPVTRTCQDEKWRWFEEKLP

Query:  KSD---IQVT------PNLESRVKILKKQY-----------------------------------NTIVEMIGLACSGFGWNDERKCIEAEKAIFDDWVK
          D   I+V       P   +R   +   Y                                     I EM G ACSGFGWNDE KCI  EK +FD+WV+
Subjt:  KSD---IQVT------PNLESRVKILKKQY-----------------------------------NTIVEMIGLACSGFGWNDERKCIEAEKAIFDDWVK

Query:  AHPHARGLRNKPFPYFDELSITFGKDKATGASVET
        +HP  +GL NKPFPY+DEL+  FG+D+ATG  VET
Subjt:  AHPHARGLRNKPFPYFDELSITFGKDKATGASVET

A0A5D3C7T4 Uncharacterized protein | e value: 1.5e-51 | % identity: 46.9
Query:  MDRRTFAILYEMLKGIGVLVSTQYVDVEEMVAIFLHIIAHDVKNRVMRRYFARSGETVSRHFNAVLNAVLRLHSVLLKSPEPVTRTC-------------
        MDRR F IL  ML+  G L +TQYVDV+EMV IFLHI+AHDVKNRV RR+ ARSGETVSRHFNAVLNAVLRLH +LLK P+PVT +C             
Subjt:  MDRRTFAILYEMLKGIGVLVSTQYVDVEEMVAIFLHIIAHDVKNRVMRRYFARSGETVSRHFNAVLNAVLRLHSVLLKSPEPVTRTC-------------

Query:  -------------------------------------------QDEKWRWF---EEKLPKSDIQV-------TPNLESRVKILKKQYNTIVEMIGLACSG
                                                   +  K RW    +E L +  +Q+         N   ++  L KQY  I EM+G ACSG
Subjt:  -------------------------------------------QDEKWRWF---EEKLPKSDIQV-------TPNLESRVKILKKQYNTIVEMIGLACSG

Query:  FGWNDERKCIEAEKAIFDDWVKAHPHARGLRNKPFPYFDELSITFGKDKATGASVETP
        FGWN+ +KCIE EK +FDDWVK HP+A+GL NKPFPYF +L + FG+D+ATG   +TP
Subjt:  FGWNDERKCIEAEKAIFDDWVKAHPHARGLRNKPFPYFDELSITFGKDKATGASVETP

A0A5D3DTL0 Myb_DNA-bind_3 domain-containing protein | e value: 9.3e-62 | % identity: 54.47
Query:  MLKGIGVLVSTQYVDVEEMVAIFLHIIAHDVKNRVMRRYFARSGETVSRHFNAVLNAVLRLHSVLLKSPEPVTRTCQDEKWRWFE---------------
        ML+  G L +TQYVDVEEMV IFLHI+AHDVKNRV RR+FARSGETVSRHFN VLN VLRLH +LLK P+ VT +C  EKWRWF+               
Subjt:  MLKGIGVLVSTQYVDVEEMVAIFLHIIAHDVKNRVMRRYFARSGETVSRHFNAVLNAVLRLHSVLLKSPEPVTRTCQDEKWRWFE---------------

Query:  ---------------------------------------EKLPKSDIQVTPNLESRVKILKKQYNTIVEMIGLACSGFGWNDERKCIEAEKAIFDDWVKA
                                               EK+ +S+IQVTPNLES VKILKKQY TI EM+G  CSGF WN ERKCIEAEK++ +DWVK 
Subjt:  ---------------------------------------EKLPKSDIQVTPNLESRVKILKKQYNTIVEMIGLACSGFGWNDERKCIEAEKAIFDDWVKA

Query:  HPHARGLRNKPFPYFDELSITFGKDKATGASVETP
        H +AR L NKPFPYF +L I FG+D+ATG   +TP
Subjt:  HPHARGLRNKPFPYFDELSITFGKDKATGASVETP

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT2G24960.2 unknown protein | e value: 4.5e-08 | % identity: 35.06
Query:  LESRVKILKKQYNTIVEMIGLACSGFGWNDERKCIEAEKAIFDDWVKAHPHARGLRNKPFPYFDELSITFGKDKATG
        L++R K L++ YN I  +  L  +GF W+  R  + A+  I++ +++AHP AR  R K  P +  L   FGK+ + G
Subjt:  LESRVKILKKQYNTIVEMIGLACSGFGWNDERKCIEAEKAIFDDWVKAHPHARGLRNKPFPYFDELSITFGKDKATG

AT4G02210.1 unknown protein | e value: 1.1e-09 | % identity: 35.21
Query:  LESRVKILKKQYNTIVEMIGLACSGFGWNDERKCIEAEKAIFDDWVKAHPHARGLRNKPFPYFDELSITFG
        L++R K L++Q+N I  +  L   GF W++ER+ + A+  ++ D++KAH  AR    +P PY+ +L +  G
Subjt:  LESRVKILKKQYNTIVEMIGLACSGFGWNDERKCIEAEKAIFDDWVKAHPHARGLRNKPFPYFDELSITFG

AT4G02210.2 unknown protein | e value: 1.1e-09 | % identity: 35.21
Query:  LESRVKILKKQYNTIVEMIGLACSGFGWNDERKCIEAEKAIFDDWVKAHPHARGLRNKPFPYFDELSITFG
        L++R K L++Q+N I  +  L   GF W++ER+ + A+  ++ D++KAH  AR    +P PY+ +L +  G
Subjt:  LESRVKILKKQYNTIVEMIGLACSGFGWNDERKCIEAEKAIFDDWVKAHPHARGLRNKPFPYFDELSITFG

AT5G27260.1 unknown protein | e value: 2.5e-11 | % identity: 42.67
Query:  SRVKILKKQYNTIVEMIGLACSGFGWNDERKCIEAEKAIFDDWVKAHPHARGLRNKPFPYFDELSITFGKDKATG
        SR+K LK QY + +++   + SGFGW+   K   A   ++ D++KAHP+ + LR   F +FDEL I FG+  ATG
Subjt:  SRVKILKKQYNTIVEMIGLACSGFGWNDERKCIEAEKAIFDDWVKAHPHARGLRNKPFPYFDELSITFGKDKATG

AT5G41980.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912) | e value: 4.3e-11 | % identity: 37.5
Query:  MDRRTFAILYEMLKGIGVLVSTQYVDVEEMVAIFLHIIAHDVKNRVMRRYFARSGETVSRHFNAVLNAVLRL---------HSVLLKSPEPVTRTC
        MD+  F  L ++L+  G+L  T  + +E  +AIFL II H+++ R ++  F  SGET+SRHFN VLNAV+ +         +S  L++ +P  + C
Subjt:  MDRRTFAILYEMLKGIGVLVSTQYVDVEEMVAIFLHIIAHDVKNRVMRRYFARSGETVSRHFNAVLNAVLRL---------HSVLLKSPEPVTRTC


Sequences
CDS sequence
ATGGATAGAAGAACATTTGCTATCCTCTATGAGATGTTGAAGGGGATTGGTGTATTAGTATCAACACAATATGTCGATGTAGAAGAGATGGTCGCAATATTCTTGCATAT
CATAGCACATGATGTGAAGAATCGTGTGATGCGACGATATTTTGCAAGATCTGGTGAGACCGTATCCAGACATTTTAATGCAGTGTTAAACGCGGTGTTAAGACTCCACT
CAGTCTTATTGAAATCACCAGAGCCAGTCACAAGAACGTGTCAGGATGAAAAGTGGCGATGGTTCGAGGAGAAACTCCCTAAAAGCGACATACAAGTGACCCCAAACCTA
GAATCTAGAGTGAAGATTCTGAAGAAGCAATACAATACTATAGTTGAGATGATAGGCCTAGCATGTAGTGGGTTTGGGTGGAATGACGAACGTAAGTGCATTGAGGCAGA
GAAAGCAATTTTCGATGACTGGGTTAAGGCACATCCTCATGCTCGAGGTCTTAGGAACAAGCCATTTCCATACTTCGACGAGTTATCAATTACATTCGGTAAAGACAAGG
CAACTGGTGCGAGTGTAGAGACTCCCTATAAACCCGAGATCTCTCTTTGA
mRNA sequence
ATGGATAGAAGAACATTTGCTATCCTCTATGAGATGTTGAAGGGGATTGGTGTATTAGTATCAACACAATATGTCGATGTAGAAGAGATGGTCGCAATATTCTTGCATAT
CATAGCACATGATGTGAAGAATCGTGTGATGCGACGATATTTTGCAAGATCTGGTGAGACCGTATCCAGACATTTTAATGCAGTGTTAAACGCGGTGTTAAGACTCCACT
CAGTCTTATTGAAATCACCAGAGCCAGTCACAAGAACGTGTCAGGATGAAAAGTGGCGATGGTTCGAGGAGAAACTCCCTAAAAGCGACATACAAGTGACCCCAAACCTA
GAATCTAGAGTGAAGATTCTGAAGAAGCAATACAATACTATAGTTGAGATGATAGGCCTAGCATGTAGTGGGTTTGGGTGGAATGACGAACGTAAGTGCATTGAGGCAGA
GAAAGCAATTTTCGATGACTGGGTTAAGGCACATCCTCATGCTCGAGGTCTTAGGAACAAGCCATTTCCATACTTCGACGAGTTATCAATTACATTCGGTAAAGACAAGG
CAACTGGTGCGAGTGTAGAGACTCCCTATAAACCCGAGATCTCTCTTTGA
Protein sequence
MDRRTFAILYEMLKGIGVLVSTQYVDVEEMVAIFLHIIAHDVKNRVMRRYFARSGETVSRHFNAVLNAVLRLHSVLLKSPEPVTRTCQDEKWRWFEEKLPKSDIQVTPNL
ESRVKILKKQYNTIVEMIGLACSGFGWNDERKCIEAEKAIFDDWVKAHPHARGLRNKPFPYFDELSITFGKDKATGASVETPYKPEISL
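
The CDS and protein sequences listed above can be cross-checked by translating the CDS codon by codon. The Python sketch below does this with the standard genetic code (NCBI translation table 1); that the database used this table is an assumption, and `translate` is a hypothetical helper rather than anything provided by CuGenDB. Unknown or ambiguous codons are reported as `X`, and translation stops at the first stop codon.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove line breaks and spaces so a wrapped sequence can be pasted as-is.
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last six codons of the CDS listed above.
print(translate("ATGGATAGAAGAACATTTGCT"))
print(translate("CCCGAGATCTCTCTTTGA"))
```

The two fragments correspond to the start (`MDRRTFA`) and end (`PEISL`) of the protein sequence in this record; passing the full CDS to `translate` allows the same comparison over the whole sequence.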