CuGenDBv2

Tan0008784 (gene) of Snake gourd v1 genome

Gene ID: Tan0008784
Organism: Trichosanthes anguina (Snake gourd v1)
Description: heme-binding protein 2-like
Genome location: LG04:12276745..12277485
RNA-Seq Expression: Tan0008784
Synteny: Tan0008784
Gene Ontology terms: GO:0110165 - cellular anatomical structure (cellular component)
InterPro domains: IPR006917 - SOUL haem-binding protein
                  IPR011256 - Regulatory factor, effector binding domain superfamily
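The genome location field above follows a `sequence:start..end` pattern. A minimal sketch of how such a string could be parsed is given below; the helper names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into its three parts."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return seqid, start, end


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("LG04:12276745..12277485")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene spans 741 bp on LG04.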


Homology
GenBank top hits (columns: e value, % identity, alignment)
XP_011655613.1 uncharacterized protein LOC101213086 [Cucumis sativus] | e value: 5.0e-97 | identity: 84.62%
Query:  MKGKLLINFVLTIC-FFCCSSGRVIESPHYKVIHLESDFEIRQYKQVSWMSAFVQGTASFEKSTQQGFHRLYQYIHGANSNSSHFLMTSPVTTTIMASTH
        MKGK+LINF LTIC FFCCSSGRVIESPHYKVIH+ESDFEIRQYKQ+SWMSA VQGTASFEKST+QGFHRLYQY+HGANSNS HFL TSPVTTTIM  T 
Subjt:  MKGKLLINFVLTIC-FFCCSSGRVIESPHYKVIHLESDFEIRQYKQVSWMSAFVQGTASFEKSTQQGFHRLYQYIHGANSNSSHFLMTSPVTTTIMASTH

Query:  GPERLVRYYLPAVYTESPPLPNSELDVQFEKWRSNCLAVRRFAGFAKDDNINKEIEALKSSLNKYLPKSSAISEYTIAQYNSSRHLSGRLNEVWLDVSAF
         PERLVRYYLP +  E+PPLPNSEL+V FEKWR+NCLAVRRF GFAKDDNINKEI+ALKSSL+KYLP+S+A+SEYTIAQYNSSR L GRLNEVWLDVS F
Subjt:  GPERLVRYYLPAVYTESPPLPNSELDVQFEKWRSNCLAVRRFAGFAKDDNINKEIEALKSSLNKYLPKSSAISEYTIAQYNSSRHLSGRLNEVWLDVSAF

Query:  TAEGCQPL
        T EGCQPL
Subjt:  TAEGCQPL

XP_022944900.1 heme-binding protein 2-like [Cucurbita moschata] | e value: 2.7e-103 | identity: 88.78%
Query:  MKGKLLINFVLTICFFCCSSGRVIESPHYKVIHLESDFEIRQYKQVSWMSAFVQGTASFEKSTQQGFHRLYQYIHGANSNSSHFLMTSPVTTTIMASTHG
        M GK++INF LTICFFCCSSGRVIESPHY VIH+E++FEIRQYKQVSW+SA VQGTASFEKSTQQGFHRLYQYIHGANSNSSHFL+TSPVTTTIMAST G
Subjt:  MKGKLLINFVLTICFFCCSSGRVIESPHYKVIHLESDFEIRQYKQVSWMSAFVQGTASFEKSTQQGFHRLYQYIHGANSNSSHFLMTSPVTTTIMASTHG

Query:  PERLVRYYLPAVYTESPPLPNSELDVQFEKWRSNCLAVRRFAGFAKDDNINKEIEALKSSLNKYLPKSSAISEYTIAQYNSSRHLSGRLNEVWLDVSAFT
        PERLVRYYLP++YTE+PPLPNSEL+VQFEKWRSNCLAVRRF+GFAKDDNINKE+EALKSSL KYLPKSSAISEYT+AQYNSSRHLSGRLNEVW+DVSA T
Subjt:  PERLVRYYLPAVYTESPPLPNSELDVQFEKWRSNCLAVRRFAGFAKDDNINKEIEALKSSLNKYLPKSSAISEYTIAQYNSSRHLSGRLNEVWLDVSAFT

Query:  AEGCQ
        +EGCQ
Subjt:  AEGCQ

XP_022967039.1 heme-binding protein 2-like [Cucurbita maxima] | e value: 4.7e-103 | identity: 89.32%
Query:  MKGKLLINFVLTICFF-CCSSGRVIESPHYKVIHLESDFEIRQYKQVSWMSAFVQGTASFEKSTQQGFHRLYQYIHGANSNSSHFLMTSPVTTTIMASTH
        M GK+L+NF LTICFF CCSSGRVIESPHY VIH+E++FEIRQYKQVSW+SA VQGTASFEKSTQQGFHRLYQYIHGAN NSSHFL+TSPVTTTIMASTH
Subjt:  MKGKLLINFVLTICFF-CCSSGRVIESPHYKVIHLESDFEIRQYKQVSWMSAFVQGTASFEKSTQQGFHRLYQYIHGANSNSSHFLMTSPVTTTIMASTH

Query:  GPERLVRYYLPAVYTESPPLPNSELDVQFEKWRSNCLAVRRFAGFAKDDNINKEIEALKSSLNKYLPKSSAISEYTIAQYNSSRHLSGRLNEVWLDVSAF
        GPERLVRYYLP++YTE+PPLPNSEL+VQFEKWRSNCLAVRRF+GFAKDDNINKE+EALKSSLNKYLPKSSAISEYT+AQYNSSRHLSGRLNEVWLDVSA 
Subjt:  GPERLVRYYLPAVYTESPPLPNSELDVQFEKWRSNCLAVRRFAGFAKDDNINKEIEALKSSLNKYLPKSSAISEYTIAQYNSSRHLSGRLNEVWLDVSAF

Query:  TAEGCQ
        T+EGCQ
Subjt:  TAEGCQ

XP_023542916.1 heme-binding protein 2-like [Cucurbita pepo subsp. pepo] | e value: 8.0e-103 | identity: 89.32%
Query:  MKGKLLINFVLTICFF-CCSSGRVIESPHYKVIHLESDFEIRQYKQVSWMSAFVQGTASFEKSTQQGFHRLYQYIHGANSNSSHFLMTSPVTTTIMASTH
        M GK+L+NF LTICFF CCSSGRVIESPHY VIH+E++FEIRQYKQVSW+SA VQGTASFEKSTQQGFHRLYQYIHGANSNSSHFL+TSPVTTTIMAST 
Subjt:  MKGKLLINFVLTICFF-CCSSGRVIESPHYKVIHLESDFEIRQYKQVSWMSAFVQGTASFEKSTQQGFHRLYQYIHGANSNSSHFLMTSPVTTTIMASTH

Query:  GPERLVRYYLPAVYTESPPLPNSELDVQFEKWRSNCLAVRRFAGFAKDDNINKEIEALKSSLNKYLPKSSAISEYTIAQYNSSRHLSGRLNEVWLDVSAF
        GPERLVRYYLP++YTE+PPLPNSEL+VQFEKWRSNCLAVRRF+GFAKDDNINKE+EALKSSLNKYLPKSSAISEYT+AQYNSSRHLSGRLNEVWLDVSA 
Subjt:  GPERLVRYYLPAVYTESPPLPNSELDVQFEKWRSNCLAVRRFAGFAKDDNINKEIEALKSSLNKYLPKSSAISEYTIAQYNSSRHLSGRLNEVWLDVSAF

Query:  TAEGCQ
        T+EGCQ
Subjt:  TAEGCQ

XP_038892072.1 heme-binding protein 2-like [Benincasa hispida] | e value: 7.5e-101 | identity: 87.44%
Query:  MKGKLLINFVLTICFFCCSSGRVIESPHYKVIHLESDFEIRQYKQVSWMSAFVQGTASFEKSTQQGFHRLYQYIHGANSNSSHFLMTSPVTTTIMASTHG
        MKGKLLINF L ICFFCCSSGRVIESPHYKVIH+ESDFEIRQYKQ+SWMSA VQGTASFEKSTQQGFHRLYQYIHGANSNSSH L+TSPVTTT++AS H 
Subjt:  MKGKLLINFVLTICFFCCSSGRVIESPHYKVIHLESDFEIRQYKQVSWMSAFVQGTASFEKSTQQGFHRLYQYIHGANSNSSHFLMTSPVTTTIMASTHG

Query:  PERLVRYYLPAVYTESPPLPNSELDVQFEKWRSNCLAVRRFAGFAKDDNINKEIEALKSSLNKYLPKSSAISEYTIAQYNSSRHLSGRLNEVWLDVSAFT
        PE L+RYYLP V  E PPLPNSEL+VQFEKWRSNCLAVRRF GFAKDDNINKEI+ALK SL+KYLP+S+AISEYTIAQYNSSR LSGRLNEVWLDVS FT
Subjt:  PERLVRYYLPAVYTESPPLPNSELDVQFEKWRSNCLAVRRFAGFAKDDNINKEIEALKSSLNKYLPKSSAISEYTIAQYNSSRHLSGRLNEVWLDVSAFT

Query:  AEGCQPL
        AEGCQPL
Subjt:  AEGCQPL

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0KVJ5 Uncharacterized protein | e value: 2.4e-97 | identity: 84.62%
Query:  MKGKLLINFVLTIC-FFCCSSGRVIESPHYKVIHLESDFEIRQYKQVSWMSAFVQGTASFEKSTQQGFHRLYQYIHGANSNSSHFLMTSPVTTTIMASTH
        MKGK+LINF LTIC FFCCSSGRVIESPHYKVIH+ESDFEIRQYKQ+SWMSA VQGTASFEKST+QGFHRLYQY+HGANSNS HFL TSPVTTTIM  T 
Subjt:  MKGKLLINFVLTIC-FFCCSSGRVIESPHYKVIHLESDFEIRQYKQVSWMSAFVQGTASFEKSTQQGFHRLYQYIHGANSNSSHFLMTSPVTTTIMASTH

Query:  GPERLVRYYLPAVYTESPPLPNSELDVQFEKWRSNCLAVRRFAGFAKDDNINKEIEALKSSLNKYLPKSSAISEYTIAQYNSSRHLSGRLNEVWLDVSAF
         PERLVRYYLP +  E+PPLPNSEL+V FEKWR+NCLAVRRF GFAKDDNINKEI+ALKSSL+KYLP+S+A+SEYTIAQYNSSR L GRLNEVWLDVS F
Subjt:  GPERLVRYYLPAVYTESPPLPNSELDVQFEKWRSNCLAVRRFAGFAKDDNINKEIEALKSSLNKYLPKSSAISEYTIAQYNSSRHLSGRLNEVWLDVSAF

Query:  TAEGCQPL
        T EGCQPL
Subjt:  TAEGCQPL

A0A1S3BEF7 uncharacterized protein LOC103488984 | e value: 1.7e-95 | identity: 82.69%
Query:  MKGKLLINFVLTIC-FFCCSSGRVIESPHYKVIHLESDFEIRQYKQVSWMSAFVQGTASFEKSTQQGFHRLYQYIHGANSNSSHFLMTSPVTTTIMASTH
        MKGK+LINF LTIC FFCCSSGRVIESPHYKVIH+ESDFEIRQYKQ+SWMSA VQGT+SFEKSTQQGFHRLYQY+HGANSNS  FL TSPVTTTIM ST 
Subjt:  MKGKLLINFVLTIC-FFCCSSGRVIESPHYKVIHLESDFEIRQYKQVSWMSAFVQGTASFEKSTQQGFHRLYQYIHGANSNSSHFLMTSPVTTTIMASTH

Query:  GPERLVRYYLPAVYTESPPLPNSELDVQFEKWRSNCLAVRRFAGFAKDDNINKEIEALKSSLNKYLPKSSAISEYTIAQYNSSRHLSGRLNEVWLDVSAF
         PE LVRYYLP +  E+PPLPNSEL++ FEKW++NCLAVRRF GFAKDDNINKEI+ALKS+L+K+LP+S+AISEYTIAQYNSSR L GRLNEVWLDVS+F
Subjt:  GPERLVRYYLPAVYTESPPLPNSELDVQFEKWRSNCLAVRRFAGFAKDDNINKEIEALKSSLNKYLPKSSAISEYTIAQYNSSRHLSGRLNEVWLDVSAF

Query:  TAEGCQPL
        T EGCQPL
Subjt:  TAEGCQPL

A0A5A7ST56 Heme-binding protein 2-like | e value: 1.7e-95 | identity: 82.69%
Query:  MKGKLLINFVLTIC-FFCCSSGRVIESPHYKVIHLESDFEIRQYKQVSWMSAFVQGTASFEKSTQQGFHRLYQYIHGANSNSSHFLMTSPVTTTIMASTH
        MKGK+LINF LTIC FFCCSSGRVIESPHYKVIH+ESDFEIRQYKQ+SWMSA VQGT+SFEKSTQQGFHRLYQY+HGANSNS  FL TSPVTTTIM ST 
Subjt:  MKGKLLINFVLTIC-FFCCSSGRVIESPHYKVIHLESDFEIRQYKQVSWMSAFVQGTASFEKSTQQGFHRLYQYIHGANSNSSHFLMTSPVTTTIMASTH

Query:  GPERLVRYYLPAVYTESPPLPNSELDVQFEKWRSNCLAVRRFAGFAKDDNINKEIEALKSSLNKYLPKSSAISEYTIAQYNSSRHLSGRLNEVWLDVSAF
         PE LVRYYLP +  E+PPLPNSEL++ FEKW++NCLAVRRF GFAKDDNINKEI+ALKS+L+K+LP+S+AISEYTIAQYNSSR L GRLNEVWLDVS+F
Subjt:  GPERLVRYYLPAVYTESPPLPNSELDVQFEKWRSNCLAVRRFAGFAKDDNINKEIEALKSSLNKYLPKSSAISEYTIAQYNSSRHLSGRLNEVWLDVSAF

Query:  TAEGCQPL
        T EGCQPL
Subjt:  TAEGCQPL

A0A6J1FZD2 heme-binding protein 2-like | e value: 1.3e-103 | identity: 88.78%
Query:  MKGKLLINFVLTICFFCCSSGRVIESPHYKVIHLESDFEIRQYKQVSWMSAFVQGTASFEKSTQQGFHRLYQYIHGANSNSSHFLMTSPVTTTIMASTHG
        M GK++INF LTICFFCCSSGRVIESPHY VIH+E++FEIRQYKQVSW+SA VQGTASFEKSTQQGFHRLYQYIHGANSNSSHFL+TSPVTTTIMAST G
Subjt:  MKGKLLINFVLTICFFCCSSGRVIESPHYKVIHLESDFEIRQYKQVSWMSAFVQGTASFEKSTQQGFHRLYQYIHGANSNSSHFLMTSPVTTTIMASTHG

Query:  PERLVRYYLPAVYTESPPLPNSELDVQFEKWRSNCLAVRRFAGFAKDDNINKEIEALKSSLNKYLPKSSAISEYTIAQYNSSRHLSGRLNEVWLDVSAFT
        PERLVRYYLP++YTE+PPLPNSEL+VQFEKWRSNCLAVRRF+GFAKDDNINKE+EALKSSL KYLPKSSAISEYT+AQYNSSRHLSGRLNEVW+DVSA T
Subjt:  PERLVRYYLPAVYTESPPLPNSELDVQFEKWRSNCLAVRRFAGFAKDDNINKEIEALKSSLNKYLPKSSAISEYTIAQYNSSRHLSGRLNEVWLDVSAFT

Query:  AEGCQ
        +EGCQ
Subjt:  AEGCQ

A0A6J1HVL2 heme-binding protein 2-like | e value: 2.3e-103 | identity: 89.32%
Query:  MKGKLLINFVLTICFF-CCSSGRVIESPHYKVIHLESDFEIRQYKQVSWMSAFVQGTASFEKSTQQGFHRLYQYIHGANSNSSHFLMTSPVTTTIMASTH
        M GK+L+NF LTICFF CCSSGRVIESPHY VIH+E++FEIRQYKQVSW+SA VQGTASFEKSTQQGFHRLYQYIHGAN NSSHFL+TSPVTTTIMASTH
Subjt:  MKGKLLINFVLTICFF-CCSSGRVIESPHYKVIHLESDFEIRQYKQVSWMSAFVQGTASFEKSTQQGFHRLYQYIHGANSNSSHFLMTSPVTTTIMASTH

Query:  GPERLVRYYLPAVYTESPPLPNSELDVQFEKWRSNCLAVRRFAGFAKDDNINKEIEALKSSLNKYLPKSSAISEYTIAQYNSSRHLSGRLNEVWLDVSAF
        GPERLVRYYLP++YTE+PPLPNSEL+VQFEKWRSNCLAVRRF+GFAKDDNINKE+EALKSSLNKYLPKSSAISEYT+AQYNSSRHLSGRLNEVWLDVSA 
Subjt:  GPERLVRYYLPAVYTESPPLPNSELDVQFEKWRSNCLAVRRFAGFAKDDNINKEIEALKSSLNKYLPKSSAISEYTIAQYNSSRHLSGRLNEVWLDVSAF

Query:  TAEGCQ
        T+EGCQ
Subjt:  TAEGCQ

SwissProt top hits (columns: e value, % identity, alignment)
No hits found
Arabidopsis top hits (columns: e value, % identity, alignment)
AT1G17100.1 SOUL heme-binding family protein | e value: 6.7e-23 | identity: 36.02%
Query:  IESPHYKVIHLESDFEIRQYKQVSWMSAFVQGTASFEKSTQQGFHRLYQYIHGANSNSSHFLMTSPVTTTIMASTHGP----ERLVRYYLPAVYTESPPL
        IE P Y+++H  + +EIR+Y    W+S       S   +T+  F +L+ YI G N       MT+PV + +  S  GP       V +Y+P    +  P 
Subjt:  IESPHYKVIHLESDFEIRQYKQVSWMSAFVQGTASFEKSTQQGFHRLYQYIHGANSNSSHFLMTSPVTTTIMASTHGP----ERLVRYYLPAVYTESPPL

Query:  PNSELDVQFEKWRSNCLAVRRFAGFAKDDNINKEIEALKSSLN-----KYLPKS------SAISEYTIAQYNSSRHLSGRLNEVWL
        P+  L +Q  KW S  +AVR+F+GF  DD+I ++  AL SSL        + KS       + S YT+AQYNS    SGR+NE+WL
Subjt:  PNSELDVQFEKWRSNCLAVRRFAGFAKDDNINKEIEALKSSLN-----KYLPKS------SAISEYTIAQYNSSRHLSGRLNEVWL
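In the alignment blocks above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch of how identity could be tallied from one block, the hypothetical helper `midline_stats` below counts those symbols. It assumes the query segment and the midline have been extracted with equal length, and it works per block, so it will not by itself reproduce the whole-alignment percentages listed for each hit.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Tally identities and positives for one aligned block.

    `query` is the aligned query segment (may contain '-' for gaps);
    `midline` is the match line printed beneath it, padded to equal length.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identities = sum(1 for symbol in midline if symbol.isalpha())
    positives = identities + midline.count("+")
    length = len(query)
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "query_gaps": query.count("-"),
        "percent_identity": 100.0 * identities / length if length else 0.0,
    }


# Final block of the first GenBank hit (query line and its midline).
stats = midline_stats("TAEGCQPL", "T EGCQPL")
print(stats["identities"], stats["length"], stats["percent_identity"])
```

For that eight-residue block the tally is 7 identities out of 8 positions, or 87.5% for the block alone.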


Sequences
CDS sequence
ATGAAGGGTAAGTTATTGATCAACTTTGTTCTAACAATATGCTTCTTCTGTTGTAGCTCAGGCAGAGTGATTGAATCTCCACATTATAAAGTGATTCATTTGGAATCAGA
TTTTGAGATCAGACAGTACAAACAAGTCTCATGGATGTCTGCTTTTGTCCAAGGAACAGCCTCCTTTGAAAAGTCAACCCAACAAGGCTTCCACAGATTGTATCAATACA
TTCATGGTGCTAATAGCAACTCTTCTCACTTTCTAATGACTTCTCCTGTCACAACTACCATTATGGCATCGACACATGGACCCGAGCGATTGGTTAGGTATTATCTGCCT
GCGGTTTATACCGAAAGCCCACCGCTGCCCAATTCTGAACTGGATGTTCAGTTTGAAAAGTGGAGAAGCAATTGCTTAGCAGTCAGGAGGTTTGCTGGGTTTGCTAAAGA
TGATAACATCAACAAAGAAATTGAAGCTCTAAAGAGCAGCTTGAACAAGTACCTACCTAAGAGTTCAGCCATTTCAGAATACACCATTGCTCAGTATAATTCTTCACGTC
ACCTGTCGGGGCGTTTGAACGAAGTTTGGCTCGACGTTTCAGCGTTTACTGCAGAGGGATGTCAACCCCTTTAA
mRNA sequence
ATGAAGGGTAAGTTATTGATCAACTTTGTTCTAACAATATGCTTCTTCTGTTGTAGCTCAGGCAGAGTGATTGAATCTCCACATTATAAAGTGATTCATTTGGAATCAGA
TTTTGAGATCAGACAGTACAAACAAGTCTCATGGATGTCTGCTTTTGTCCAAGGAACAGCCTCCTTTGAAAAGTCAACCCAACAAGGCTTCCACAGATTGTATCAATACA
TTCATGGTGCTAATAGCAACTCTTCTCACTTTCTAATGACTTCTCCTGTCACAACTACCATTATGGCATCGACACATGGACCCGAGCGATTGGTTAGGTATTATCTGCCT
GCGGTTTATACCGAAAGCCCACCGCTGCCCAATTCTGAACTGGATGTTCAGTTTGAAAAGTGGAGAAGCAATTGCTTAGCAGTCAGGAGGTTTGCTGGGTTTGCTAAAGA
TGATAACATCAACAAAGAAATTGAAGCTCTAAAGAGCAGCTTGAACAAGTACCTACCTAAGAGTTCAGCCATTTCAGAATACACCATTGCTCAGTATAATTCTTCACGTC
ACCTGTCGGGGCGTTTGAACGAAGTTTGGCTCGACGTTTCAGCGTTTACTGCAGAGGGATGTCAACCCCTTTAA
Protein sequence
MKGKLLINFVLTICFFCCSSGRVIESPHYKVIHLESDFEIRQYKQVSWMSAFVQGTASFEKSTQQGFHRLYQYIHGANSNSSHFLMTSPVTTTIMASTHGPERLVRYYLP
AVYTESPPLPNSELDVQFEKWRSNCLAVRRFAGFAKDDNINKEIEALKSSLNKYLPKSSAISEYTIAQYNSSRHLSGRLNEVWLDVSAFTAEGCQPL
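
The CDS and protein sequences above can be cross-checked by translating the CDS. The sketch below is one way to do that with only the Python standard library; it assumes the standard genetic code (NCBI translation table 1), and the function name `translate` is illustrative rather than anything provided by CuGenDB. Only the first 30 and last 12 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a coding sequence; unknown codons become 'X'."""
    cds = cds.upper().replace("U", "T")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE.get(cds[i:i + 3], "X")
        if amino_acid == "*" and to_stop:
            break  # stop codon reached
        protein.append(amino_acid)
    return "".join(protein)


# Start and end of the CDS listed above.
print(translate("ATGAAGGGTAAGTTATTGATCAACTTTGTT"))
print(translate("CAACCCCTTTAA"))
```

The two fragments translate to `MKGKLLINFV` and `QPL`, which agree with the start and end of the protein sequence listed above.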