; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0012140 (gene) of Snake gourd v1 genome

Gene IDTan0012140
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionSOUL heme-binding protein
Genome locationLG09:66116182..66117083
RNA-Seq ExpressionTan0012140
SyntenyTan0012140
Gene Ontology termsGO:0110165 - cellular anatomical structure (cellular component)
InterPro domainsIPR006917 - SOUL haem-binding protein
IPR011256 - Regulatory factor, effector binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022144957.1 uncharacterized protein LOC111014503 isoform X2 [Momordica charantia]1.0e-2177.46Show/hide
Query:  FSDPKQNTISLRKIEGGIAAVLKFSGNPTKDIAQEKAKELRYSLKKDGLKPINGFLLARYNNSSRTWSFVM
        F DPKQ+ ISLR I+GGIAAVLKFSG P++++ QEKAKELRYSL KDGLKPI G LLARYN+ SRTWSFVM
Subjt:  FSDPKQNTISLRKIEGGIAAVLKFSGNPTKDIAQEKAKELRYSLKKDGLKPINGFLLARYNNSSRTWSFVM

XP_022930671.1 heme-binding-like protein At3g10130, chloroplastic isoform X2 [Cucurbita moschata]5.1e-2171.62Show/hide
Query:  LFSFSDPKQNTISLRKIEGGIAAVLKFSGNPTKDIAQEKAKELRYSLKKDGLKPINGFLLARYNNSSRTWSFVM
        L+S  DP+Q+TI LRK+EGG AAVLKFSG PT++I QEKAKELR SL KDGLKP NG LLARYN+  RTW+F+M
Subjt:  LFSFSDPKQNTISLRKIEGGIAAVLKFSGNPTKDIAQEKAKELRYSLKKDGLKPINGFLLARYNNSSRTWSFVM

XP_022933414.1 uncharacterized protein LOC111440839 [Cucurbita moschata]3.0e-2176.71Show/hide
Query:  SFSDPKQ-NTISLRKIEGGIAAVLKFSGNPTKDIAQEKAKELRYSLKKDGLKPINGFLLARYNNSSRTWSFVM
        S SD KQ +T+S+R++EGGI AVLKFSG+PT+D+AQ+KAKELR SLKKDGLKPING LLARYN+S RTWSFVM
Subjt:  SFSDPKQ-NTISLRKIEGGIAAVLKFSGNPTKDIAQEKAKELRYSLKKDGLKPINGFLLARYNNSSRTWSFVM

XP_022965046.1 uncharacterized protein LOC111465022 [Cucurbita maxima]3.9e-2175.34Show/hide
Query:  SFSDPKQ-NTISLRKIEGGIAAVLKFSGNPTKDIAQEKAKELRYSLKKDGLKPINGFLLARYNNSSRTWSFVM
        SF D KQ +T+S+R++EGGI AVLKFSG+PT+D+AQ+KAKELR SLKKDGLKPING LLARYN+S RTW FVM
Subjt:  SFSDPKQ-NTISLRKIEGGIAAVLKFSGNPTKDIAQEKAKELRYSLKKDGLKPINGFLLARYNNSSRTWSFVM

XP_023531546.1 uncharacterized protein LOC111793749 [Cucurbita pepo subsp. pepo]7.1e-2379.45Show/hide
Query:  SFSDPKQ-NTISLRKIEGGIAAVLKFSGNPTKDIAQEKAKELRYSLKKDGLKPINGFLLARYNNSSRTWSFVM
        SFSD KQ +T+S+R++EGGI AVLKFSG+PT+D+AQ+KAKELR SLKKDGLKPING LLARYNNS+RTWSFVM
Subjt:  SFSDPKQ-NTISLRKIEGGIAAVLKFSGNPTKDIAQEKAKELRYSLKKDGLKPINGFLLARYNNSSRTWSFVM

TrEMBL top hitse value%identityAlignment
A0A6J1CV62 uncharacterized protein LOC111014503 isoform X25.0e-2277.46Show/hide
Query:  FSDPKQNTISLRKIEGGIAAVLKFSGNPTKDIAQEKAKELRYSLKKDGLKPINGFLLARYNNSSRTWSFVM
        F DPKQ+ ISLR I+GGIAAVLKFSG P++++ QEKAKELRYSL KDGLKPI G LLARYN+ SRTWSFVM
Subjt:  FSDPKQNTISLRKIEGGIAAVLKFSGNPTKDIAQEKAKELRYSLKKDGLKPINGFLLARYNNSSRTWSFVM

A0A6J1ER73 uncharacterized protein LOC111437064 isoform X12.5e-2171.62Show/hide
Query:  LFSFSDPKQNTISLRKIEGGIAAVLKFSGNPTKDIAQEKAKELRYSLKKDGLKPINGFLLARYNNSSRTWSFVM
        L+S  DP+Q+TI LRK+EGG AAVLKFSG PT++I QEKAKELR SL KDGLKP NG LLARYN+  RTW+F+M
Subjt:  LFSFSDPKQNTISLRKIEGGIAAVLKFSGNPTKDIAQEKAKELRYSLKKDGLKPINGFLLARYNNSSRTWSFVM

A0A6J1ERK9 heme-binding-like protein At3g10130, chloroplastic isoform X22.5e-2171.62Show/hide
Query:  LFSFSDPKQNTISLRKIEGGIAAVLKFSGNPTKDIAQEKAKELRYSLKKDGLKPINGFLLARYNNSSRTWSFVM
        L+S  DP+Q+TI LRK+EGG AAVLKFSG PT++I QEKAKELR SL KDGLKP NG LLARYN+  RTW+F+M
Subjt:  LFSFSDPKQNTISLRKIEGGIAAVLKFSGNPTKDIAQEKAKELRYSLKKDGLKPINGFLLARYNNSSRTWSFVM

A0A6J1EZQ2 uncharacterized protein LOC1114408391.4e-2176.71Show/hide
Query:  SFSDPKQ-NTISLRKIEGGIAAVLKFSGNPTKDIAQEKAKELRYSLKKDGLKPINGFLLARYNNSSRTWSFVM
        S SD KQ +T+S+R++EGGI AVLKFSG+PT+D+AQ+KAKELR SLKKDGLKPING LLARYN+S RTWSFVM
Subjt:  SFSDPKQ-NTISLRKIEGGIAAVLKFSGNPTKDIAQEKAKELRYSLKKDGLKPINGFLLARYNNSSRTWSFVM

A0A6J1HKM5 uncharacterized protein LOC1114650221.9e-2175.34Show/hide
Query:  SFSDPKQ-NTISLRKIEGGIAAVLKFSGNPTKDIAQEKAKELRYSLKKDGLKPINGFLLARYNNSSRTWSFVM
        SF D KQ +T+S+R++EGGI AVLKFSG+PT+D+AQ+KAKELR SLKKDGLKPING LLARYN+S RTW FVM
Subjt:  SFSDPKQ-NTISLRKIEGGIAAVLKFSGNPTKDIAQEKAKELRYSLKKDGLKPINGFLLARYNNSSRTWSFVM

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G20140.1 SOUL heme-binding family protein9.6e-1852.7Show/hide
Query:  LFSFSDPKQNTISLRKIEGGIAAVLKFSGNPTKDIAQEKAKELRYSLKKDGLKPINGFLLARYNNSSRTWSFVM
        L S   P +  ++L+K+EGG AA +KFSG PT+D+ Q K  ELR SL KDGL+   G +LARYN+  RTW+F+M
Subjt:  LFSFSDPKQNTISLRKIEGGIAAVLKFSGNPTKDIAQEKAKELRYSLKKDGLKPINGFLLARYNNSSRTWSFVM

AT5G20140.2 SOUL heme-binding family protein9.6e-1852.7Show/hide
Query:  LFSFSDPKQNTISLRKIEGGIAAVLKFSGNPTKDIAQEKAKELRYSLKKDGLKPINGFLLARYNNSSRTWSFVM
        L S   P +  ++L+K+EGG AA +KFSG PT+D+ Q K  ELR SL KDGL+   G +LARYN+  RTW+F+M
Subjt:  LFSFSDPKQNTISLRKIEGGIAAVLKFSGNPTKDIAQEKAKELRYSLKKDGLKPINGFLLARYNNSSRTWSFVM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCATTACATTTCTGTTCAGTTTCTCAGATCCTAAACAGAACACAATCAGTTTAAGAAAGATCGAAGGAGGCATTGCTGCAGTGTTGAAATTCAGTGGAAATCCCAC
TAAAGATATAGCTCAAGAAAAGGCAAAAGAATTACGATATAGTCTCAAAAAAGATGGCCTTAAACCCATTAATGGTTTTTTGCTAGCTCGCTACAACAATTCTTCTCGAA
CATGGAGCTTTGTAATGTTATTTGTCTTCATCGAGGTCCGCCAAACAAAAGAACTCTGTATTGGATACGGCACTTCACAAATTGTTCCAAGTTTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGACCATTACATTTCTGTTCAGTTTCTCAGATCCTAAACAGAACACAATCAGTTTAAGAAAGATCGAAGGAGGCATTGCTGCAGTGTTGAAATTCAGTGGAAATCCCAC
TAAAGATATAGCTCAAGAAAAGGCAAAAGAATTACGATATAGTCTCAAAAAAGATGGCCTTAAACCCATTAATGGTTTTTTGCTAGCTCGCTACAACAATTCTTCTCGAA
CATGGAGCTTTGTAATGTTATTTGTCTTCATCGAGGTCCGCCAAACAAAAGAACTCTGTATTGGATACGGCACTTCACAAATTGTTCCAAGTTTCTAG
Protein sequenceShow/hide protein sequence
MTITFLFSFSDPKQNTISLRKIEGGIAAVLKFSGNPTKDIAQEKAKELRYSLKKDGLKPINGFLLARYNNSSRTWSFVMLFVFIEVRQTKELCIGYGTSQIVPSF