; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0011932 (gene) of Snake gourd v1 genome

Gene IDTan0011932
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionSOUL heme-binding protein
Genome locationLG09:66118463..66122225
RNA-Seq ExpressionTan0011932
SyntenyTan0011932
Gene Ontology termsGO:0110165 - cellular anatomical structure (cellular component)
InterPro domainsIPR006917 - SOUL haem-binding protein
IPR018790 - Protein of unknown function DUF2358
IPR032710 - NTF2-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6587902.1 hypothetical protein SDJN03_16467, partial [Cucurbita argyrosperma subsp. sororia]7.0e-9183.84Show/hide
Query:  MAAAQVSIQNFLSIPTVGFGLRPRKSGGPTGVAQSRTVGQNSKWVIRSKLGDHQRPRKSTVDVDRLVDFLYEDLRHVFDEQGIDRTAYDEQVRFRDPVTK
        MA AQVS QNFLSIPTV FG+RPRKS GPT  AQSRT   N K  IRS LGD  R +K TVDVDRLVDF+Y+DLRHVFDEQGIDRTAYDE+VRFRDP+TK
Subjt:  MAAAQVSIQNFLSIPTVGFGLRPRKSGGPTGVAQSRTVGQNSKWVIRSKLGDHQRPRKSTVDVDRLVDFLYEDLRHVFDEQGIDRTAYDEQVRFRDPVTK

Query:  YNGITGYLLNIALLREFFRPEIIVHWVKKSGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINSQTGKFCSHVDLWDSVQNNDYFSLEGLWDVFKQ
        Y+GI+GY+LNIALLREFFRPEII+HWVKK+GPYEITTRWTAVMKFILLPWKPELVLTGTSIMGIN +TGKFCSHVDLWDS+QNNDYFSLE LWDVFKQ
Subjt:  YNGITGYLLNIALLREFFRPEIIVHWVKKSGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINSQTGKFCSHVDLWDSVQNNDYFSLEGLWDVFKQ

KAG7021789.1 hypothetical protein SDJN02_15516 [Cucurbita argyrosperma subsp. argyrosperma]2.4e-9184.34Show/hide
Query:  MAAAQVSIQNFLSIPTVGFGLRPRKSGGPTGVAQSRTVGQNSKWVIRSKLGDHQRPRKSTVDVDRLVDFLYEDLRHVFDEQGIDRTAYDEQVRFRDPVTK
        MA AQVS QNFLSIPTV FG+RPRKS GPT  AQSRT   N K  IRS LGD  R +K TVDVDRLVDF+Y+DLRHVFDEQGIDRTAYDE+VRFRDP+TK
Subjt:  MAAAQVSIQNFLSIPTVGFGLRPRKSGGPTGVAQSRTVGQNSKWVIRSKLGDHQRPRKSTVDVDRLVDFLYEDLRHVFDEQGIDRTAYDEQVRFRDPVTK

Query:  YNGITGYLLNIALLREFFRPEIIVHWVKKSGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINSQTGKFCSHVDLWDSVQNNDYFSLEGLWDVFKQ
        Y+GI+GY+LNIALLREFFRPEII+HWVKK+GPYEITTRWTAVMKFILLPWKPELVLTGTSIMGIN QTGKFCSHVDLWDS+QNNDYFSLE LWDVFKQ
Subjt:  YNGITGYLLNIALLREFFRPEIIVHWVKKSGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINSQTGKFCSHVDLWDSVQNNDYFSLEGLWDVFKQ

XP_022933414.1 uncharacterized protein LOC111440839 [Cucurbita moschata]6.6e-8981.82Show/hide
Query:  MAAAQVSIQNFLSIPTVGFGLRPRKSGGPTGVAQSRTVGQNSKWVIRSKLGDHQRPRKSTVDVDRLVDFLYEDLRHVFDEQGIDRTAYDEQVRFRDPVTK
        MA AQVS QNFLSIPTV  G+RPRKS GPT  AQSRT   N K  IRS L D  R +K TVDVDRLVDF+Y+DLRHVFDEQGIDRTAYD++VRFRDP+TK
Subjt:  MAAAQVSIQNFLSIPTVGFGLRPRKSGGPTGVAQSRTVGQNSKWVIRSKLGDHQRPRKSTVDVDRLVDFLYEDLRHVFDEQGIDRTAYDEQVRFRDPVTK

Query:  YNGITGYLLNIALLREFFRPEIIVHWVKKSGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINSQTGKFCSHVDLWDSVQNNDYFSLEGLWDVFKQ
        Y+GI+GY+LNIALLREFFRPEII+HWVKK+GPYEITTRWTA+MKFILLPWKPELVLTGTSIMGIN QTGKFCSHVDLWDS+QNNDYFS+E LWDVFKQ
Subjt:  YNGITGYLLNIALLREFFRPEIIVHWVKKSGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINSQTGKFCSHVDLWDSVQNNDYFSLEGLWDVFKQ

XP_022965046.1 uncharacterized protein LOC111465022 [Cucurbita maxima]9.8e-9384.34Show/hide
Query:  MAAAQVSIQNFLSIPTVGFGLRPRKSGGPTGVAQSRTVGQNSKWVIRSKLGDHQRPRKSTVDVDRLVDFLYEDLRHVFDEQGIDRTAYDEQVRFRDPVTK
        MA AQVS QNFLSIPTV FG+RPRKS GPT  AQSRT   N KW IRS L D QR +K TVDVDRLVDF+Y+DLRHVFDEQGIDRTAYDE+VRFRDP+TK
Subjt:  MAAAQVSIQNFLSIPTVGFGLRPRKSGGPTGVAQSRTVGQNSKWVIRSKLGDHQRPRKSTVDVDRLVDFLYEDLRHVFDEQGIDRTAYDEQVRFRDPVTK

Query:  YNGITGYLLNIALLREFFRPEIIVHWVKKSGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINSQTGKFCSHVDLWDSVQNNDYFSLEGLWDVFKQ
        Y+GI+GY+LNIALLREFFRPEII+HWVKK+GPYEITTRWTAVMKFILLPWKPELVLTGTSIMGIN QTGKFCSHVDLWDS+QNNDYFS+E LWDVFKQ
Subjt:  YNGITGYLLNIALLREFFRPEIIVHWVKKSGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINSQTGKFCSHVDLWDSVQNNDYFSLEGLWDVFKQ

XP_023531546.1 uncharacterized protein LOC111793749 [Cucurbita pepo subsp. pepo]4.6e-9083.33Show/hide
Query:  MAAAQVSIQNFLSIPTVGFGLRPRKSGGPTGVAQSRTVGQNSKWVIRSKLGDHQRPRKSTVDVDRLVDFLYEDLRHVFDEQGIDRTAYDEQVRFRDPVTK
        MA AQVS QNFLSIPTV FG+RPRKS GPT  AQSRT   N K  IRS L D  R +K TVDVDRLVDF+Y+DLRHVFDEQGIDRTAYDE+VRFRDP+TK
Subjt:  MAAAQVSIQNFLSIPTVGFGLRPRKSGGPTGVAQSRTVGQNSKWVIRSKLGDHQRPRKSTVDVDRLVDFLYEDLRHVFDEQGIDRTAYDEQVRFRDPVTK

Query:  YNGITGYLLNIALLREFFRPEIIVHWVKKSGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINSQTGKFCSHVDLWDSVQNNDYFSLEGLWDVFKQ
        Y+GI+GY+LNIALLREFFRPEII HWVKK+GPYEITTRWTAVMKFILLPWKPELVLTGTSIMGIN QTGKFCSHVD+WDS+QNNDYFSLE LWDVFKQ
Subjt:  YNGITGYLLNIALLREFFRPEIIVHWVKKSGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINSQTGKFCSHVDLWDSVQNNDYFSLEGLWDVFKQ

TrEMBL top hitse value%identityAlignment
A0A1S3CJ12 uncharacterized protein LOC103501513 isoform X11.3e-7972.46Show/hide
Query:  MAAAQVSIQNFLSIPTVGFGLRPRKSGGPTG----VAQSRTVG-----QNSKWVIRSKLGDHQRPRKSTVDVDRLVDFLYEDLRHVFDEQGIDRTAYDEQ
        MAA Q+S+QNFLS PT+   LRP KSG  T     + QSRT       QNSKWV+R  L D Q P KSTVDV RLVDFLYEDL H+FDEQGIDRTAYDEQ
Subjt:  MAAAQVSIQNFLSIPTVGFGLRPRKSGGPTG----VAQSRTVG-----QNSKWVIRSKLGDHQRPRKSTVDVDRLVDFLYEDLRHVFDEQGIDRTAYDEQ

Query:  VRFRDPVTKYNGITGYLLNIALLREFFRPEIIVHWVKKSGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINSQTGKFCSHVDLWDSVQNNDYFSLEG
        VRFRDP+TK++ I+GYL NI+LLRE FRPE  +HWVK++GPYEITTRWT +MKF LLPWKPEL+ TGTSIMGIN +TGKFCSHVDLWDS+QNNDYFS+EG
Subjt:  VRFRDPVTKYNGITGYLLNIALLREFFRPEIIVHWVKKSGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINSQTGKFCSHVDLWDSVQNNDYFSLEG

Query:  LWDVFKQ
        LWDVFKQ
Subjt:  LWDVFKQ

A0A6J1CUY2 uncharacterized protein LOC111014503 isoform X14.6e-8071.29Show/hide
Query:  MAAAQVSIQNFLSIPTVGFGLRPRKSGG------PTGVAQSRTV-----GQNSKWVIRSKLGDHQRPRKSTVDVDRLVDFLYEDLRHVFDEQGIDRTAYD
        MAA Q+S+QNFLS PT GFG RP KSGG      P  + +SRTV      +NSKW +R  L D Q P KS VDVDRLVDFLYEDLRH+FDEQGIDRTAYD
Subjt:  MAAAQVSIQNFLSIPTVGFGLRPRKSGG------PTGVAQSRTV-----GQNSKWVIRSKLGDHQRPRKSTVDVDRLVDFLYEDLRHVFDEQGIDRTAYD

Query:  EQVRFRDPVTKYNGITGYLLNIALLREFFRPEIIVHWVKKSGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINSQTGKFCSHVDLWDSVQNNDYFSL
        E VRFRDP+TK++ I+GY  NI+LLRE FRPE  +HWVK++GPYEITTRWT VMKF+LLPWKPE + TG SIMGIN +TGKFCSHVDLWDS+QNNDYFSL
Subjt:  EQVRFRDPVTKYNGITGYLLNIALLREFFRPEIIVHWVKKSGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINSQTGKFCSHVDLWDSVQNNDYFSL

Query:  EGLWDVFKQ
        EGL DVFKQ
Subjt:  EGLWDVFKQ

A0A6J1CV62 uncharacterized protein LOC111014503 isoform X23.8e-8275.25Show/hide
Query:  MAAAQVSIQNFLSIPTVGFGLRPRKSGGPTG----VAQSRTVGQNSKWVIRSKLGDHQRPRKSTVDVDRLVDFLYEDLRHVFDEQGIDRTAYDEQVRFRD
        M   QVS+QNFLSIPTVG G RP+KSG  TG    + +SRT  +  K V+RS+L D + P KSTVDVDRLVDFLYEDLRHVFD QGID TAYDE VRFRD
Subjt:  MAAAQVSIQNFLSIPTVGFGLRPRKSGGPTG----VAQSRTVGQNSKWVIRSKLGDHQRPRKSTVDVDRLVDFLYEDLRHVFDEQGIDRTAYDEQVRFRD

Query:  PVTKYNGITGYLLNIALLREFFRPEIIVHWVKKSGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINSQTGKFCSHVDLWDSVQNNDYFSLEGLWDVF
        P+TKYNGI GY+LNIALLR+ FRP+ ++HWVKK+GPYEITTRWTAVMKF+LLPWKPELVLTGTSIM I+ +TGKFC+HVDLWDSVQNN+YFSLEGLWD+F
Subjt:  PVTKYNGITGYLLNIALLREFFRPEIIVHWVKKSGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINSQTGKFCSHVDLWDSVQNNDYFSLEGLWDVF

Query:  KQ
        KQ
Subjt:  KQ

A0A6J1EZQ2 uncharacterized protein LOC1114408393.2e-8981.82Show/hide
Query:  MAAAQVSIQNFLSIPTVGFGLRPRKSGGPTGVAQSRTVGQNSKWVIRSKLGDHQRPRKSTVDVDRLVDFLYEDLRHVFDEQGIDRTAYDEQVRFRDPVTK
        MA AQVS QNFLSIPTV  G+RPRKS GPT  AQSRT   N K  IRS L D  R +K TVDVDRLVDF+Y+DLRHVFDEQGIDRTAYD++VRFRDP+TK
Subjt:  MAAAQVSIQNFLSIPTVGFGLRPRKSGGPTGVAQSRTVGQNSKWVIRSKLGDHQRPRKSTVDVDRLVDFLYEDLRHVFDEQGIDRTAYDEQVRFRDPVTK

Query:  YNGITGYLLNIALLREFFRPEIIVHWVKKSGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINSQTGKFCSHVDLWDSVQNNDYFSLEGLWDVFKQ
        Y+GI+GY+LNIALLREFFRPEII+HWVKK+GPYEITTRWTA+MKFILLPWKPELVLTGTSIMGIN QTGKFCSHVDLWDS+QNNDYFS+E LWDVFKQ
Subjt:  YNGITGYLLNIALLREFFRPEIIVHWVKKSGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINSQTGKFCSHVDLWDSVQNNDYFSLEGLWDVFKQ

A0A6J1HKM5 uncharacterized protein LOC1114650224.7e-9384.34Show/hide
Query:  MAAAQVSIQNFLSIPTVGFGLRPRKSGGPTGVAQSRTVGQNSKWVIRSKLGDHQRPRKSTVDVDRLVDFLYEDLRHVFDEQGIDRTAYDEQVRFRDPVTK
        MA AQVS QNFLSIPTV FG+RPRKS GPT  AQSRT   N KW IRS L D QR +K TVDVDRLVDF+Y+DLRHVFDEQGIDRTAYDE+VRFRDP+TK
Subjt:  MAAAQVSIQNFLSIPTVGFGLRPRKSGGPTGVAQSRTVGQNSKWVIRSKLGDHQRPRKSTVDVDRLVDFLYEDLRHVFDEQGIDRTAYDEQVRFRDPVTK

Query:  YNGITGYLLNIALLREFFRPEIIVHWVKKSGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINSQTGKFCSHVDLWDSVQNNDYFSLEGLWDVFKQ
        Y+GI+GY+LNIALLREFFRPEII+HWVKK+GPYEITTRWTAVMKFILLPWKPELVLTGTSIMGIN QTGKFCSHVDLWDS+QNNDYFS+E LWDVFKQ
Subjt:  YNGITGYLLNIALLREFFRPEIIVHWVKKSGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINSQTGKFCSHVDLWDSVQNNDYFSLEGLWDVFKQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G46100.1 Nuclear transport factor 2 (NTF2) family protein7.4e-0628.12Show/hide
Query:  LVDFLYEDL-RHVFDEQGIDRTAYDEQVRFRDPVTKYNGIT---------GYLL---NIALLR-EFFRPEIIVHWVKKSGPYEITTRWTAVMKFILLPWK
        +VD + +D  R  F    +    Y+E+  F DP   + G+          G L+   N+ L++ E F  + I HW           +++ VM F   PWK
Subjt:  LVDFLYEDL-RHVFDEQGIDRTAYDEQVRFRDPVTKYNGIT---------GYLL---NIALLR-EFFRPEIIVHWVKKSGPYEITTRWTAVMKFILLPWK

Query:  PELVLTGTSIMGINSQTGKFCSHVDLWD
        P L  TG +    ++++GK C HV+ W+
Subjt:  PELVLTGTSIMGINSQTGKFCSHVDLWD

AT5G20140.1 SOUL heme-binding family protein4.4e-5961.96Show/hide
Query:  RTVGQNSKWVIRSKLGDHQRPRKSTVDVDRLVDFLYEDLRHVFDEQGIDRTAYDEQVRFRDPVTKYNGITGYLLNIALLREFFRPEIIVHWVKKSGPYEI
        R V    + ++  ++G       STV+++ LV FLYEDL H+FD+QGID+TAYDE+V+FRDP+TK++ I+GYL NIA L+  F P+  +HW K++GPYEI
Subjt:  RTVGQNSKWVIRSKLGDHQRPRKSTVDVDRLVDFLYEDLRHVFDEQGIDRTAYDEQVRFRDPVTKYNGITGYLLNIALLREFFRPEIIVHWVKKSGPYEI

Query:  TTRWTAVMKFILLPWKPELVLTGTSIMGINSQTGKFCSHVDLWDSVQNNDYFSLEGLWDVFKQ
        TTRWT VMKFI LPWKPELV TG SIM +N +T KFCSH+DLWDS++NNDYFSLEGL DVFKQ
Subjt:  TTRWTAVMKFILLPWKPELVLTGTSIMGINSQTGKFCSHVDLWDSVQNNDYFSLEGLWDVFKQ

AT5G20140.2 SOUL heme-binding family protein4.4e-5961.96Show/hide
Query:  RTVGQNSKWVIRSKLGDHQRPRKSTVDVDRLVDFLYEDLRHVFDEQGIDRTAYDEQVRFRDPVTKYNGITGYLLNIALLREFFRPEIIVHWVKKSGPYEI
        R V    + ++  ++G       STV+++ LV FLYEDL H+FD+QGID+TAYDE+V+FRDP+TK++ I+GYL NIA L+  F P+  +HW K++GPYEI
Subjt:  RTVGQNSKWVIRSKLGDHQRPRKSTVDVDRLVDFLYEDLRHVFDEQGIDRTAYDEQVRFRDPVTKYNGITGYLLNIALLREFFRPEIIVHWVKKSGPYEI

Query:  TTRWTAVMKFILLPWKPELVLTGTSIMGINSQTGKFCSHVDLWDSVQNNDYFSLEGLWDVFKQ
        TTRWT VMKFI LPWKPELV TG SIM +N +T KFCSH+DLWDS++NNDYFSLEGL DVFKQ
Subjt:  TTRWTAVMKFILLPWKPELVLTGTSIMGINSQTGKFCSHVDLWDSVQNNDYFSLEGLWDVFKQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGCTGCCCAAGTTTCAATCCAAAACTTCCTCTCAATCCCAACCGTTGGTTTTGGTCTCCGGCCGAGGAAATCCGGCGGACCGACCGGCGTCGCACAAAGCAGAAC
CGTAGGCCAAAATTCGAAGTGGGTTATTCGATCAAAATTGGGAGATCATCAAAGGCCTCGGAAATCGACGGTGGACGTTGACCGATTGGTGGATTTCTTGTACGAGGATC
TCCGGCATGTGTTCGATGAGCAGGGAATTGATCGGACGGCGTACGATGAACAAGTGAGATTTCGAGACCCAGTTACAAAGTATAATGGCATTACAGGGTATTTGCTGAAT
ATTGCCCTGTTGCGAGAATTCTTCAGGCCTGAGATCATAGTGCACTGGGTCAAAAAGAGTGGACCATATGAAATAACTACAAGATGGACTGCAGTGATGAAGTTCATCCT
TCTACCATGGAAACCAGAATTAGTTTTGACTGGAACTTCCATTATGGGTATCAATTCACAGACGGGCAAGTTTTGTAGCCATGTGGATCTTTGGGATTCAGTTCAAAATA
ATGACTACTTTTCTCTAGAAGGCTTATGGGATGTATTTAAACAGAATTGGAATCACCCAAATATCAGATATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTGCTGCCCAAGTTTCAATCCAAAACTTCCTCTCAATCCCAACCGTTGGTTTTGGTCTCCGGCCGAGGAAATCCGGCGGACCGACCGGCGTCGCACAAAGCAGAAC
CGTAGGCCAAAATTCGAAGTGGGTTATTCGATCAAAATTGGGAGATCATCAAAGGCCTCGGAAATCGACGGTGGACGTTGACCGATTGGTGGATTTCTTGTACGAGGATC
TCCGGCATGTGTTCGATGAGCAGGGAATTGATCGGACGGCGTACGATGAACAAGTGAGATTTCGAGACCCAGTTACAAAGTATAATGGCATTACAGGGTATTTGCTGAAT
ATTGCCCTGTTGCGAGAATTCTTCAGGCCTGAGATCATAGTGCACTGGGTCAAAAAGAGTGGACCATATGAAATAACTACAAGATGGACTGCAGTGATGAAGTTCATCCT
TCTACCATGGAAACCAGAATTAGTTTTGACTGGAACTTCCATTATGGGTATCAATTCACAGACGGGCAAGTTTTGTAGCCATGTGGATCTTTGGGATTCAGTTCAAAATA
ATGACTACTTTTCTCTAGAAGGCTTATGGGATGTATTTAAACAGAATTGGAATCACCCAAATATCAGATATTGA
Protein sequenceShow/hide protein sequence
MAAAQVSIQNFLSIPTVGFGLRPRKSGGPTGVAQSRTVGQNSKWVIRSKLGDHQRPRKSTVDVDRLVDFLYEDLRHVFDEQGIDRTAYDEQVRFRDPVTKYNGITGYLLN
IALLREFFRPEIIVHWVKKSGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINSQTGKFCSHVDLWDSVQNNDYFSLEGLWDVFKQNWNHPNIRY