CuGenDBv2

Tan0014921 (gene) of Snake gourd v1 genome

Gene ID: Tan0014921
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Phytocyanin domain-containing protein
Genome location: LG05:20025899..20026840
RNA-Seq Expression: Tan0014921
Synteny: Tan0014921
Gene Ontology terms: GO:0022900 - electron transport chain (biological process)
                     GO:0009055 - electron transfer activity (molecular function)
InterPro domains: IPR003245 - Phytocyanin domain
                  IPR008972 - Cupredoxin
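
The genome location above is given as a sequence name followed by a start..end range. As a small illustration, the sketch below parses that string and computes the length of the span. It assumes the coordinates are 1-based and inclusive at both ends, which the record itself does not state; the function names `parse_location` and `span_length` are hypothetical helpers, not part of CuGenDB.

```python
import re


def parse_location(location):
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the span, ASSUMING 1-based coordinates inclusive at both ends."""
    return end - start + 1


seqid, start, end = parse_location("LG05:20025899..20026840")
print(seqid, span_length(start, end))  # LG05 942
```

Under that assumption the gene spans 942 bp on LG05.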


Homology
GenBank top hits
KAG6605126.1 hypothetical protein SDJN03_02443, partial [Cucurbita argyrosperma subsp. sororia] | e value: 8.8e-46 | % identity: 63.23
Query:  MASSSSSAILLLMVSCMLAVCSARSARDQLSPSPAPSPVQGPNKIVVGGSEHWRFGFNYSEWALKNGPFHVNDTLVFKYEPPNGTTFPHSVFLLSNLESF
        MASSSS AIL+L+VSCM  VCSAR                GP KIVVG  + W FG +YS+WALKNGPF+VND LVF+Y  PNG   PHSV+LL N ESF
Subjt:  MASSSSSAILLLMVSCMLAVCSARSARDQLSPSPAPSPVQGPNKIVVGGSEHWRFGFNYSEWALKNGPFHVNDTLVFKYEPPNGTTFPHSVFLLSNLESF

Query:  WKCDLRGAEKVANWTQGGGDGFELVLQQSKIYYFACGERNGFHCSGGNMKFSVLP
          CD   A +VAN +QG GDGFE VL+QSK YYFACGERNG HC  GNMKFSVLP
Subjt:  WKCDLRGAEKVANWTQGGGDGFELVLQQSKIYYFACGERNGFHCSGGNMKFSVLP

XP_022148560.1 uncharacterized protein LOC111017194 [Momordica charantia] | e value: 7.5e-53 | % identity: 60.54
Query:  MASSSSSAILLLMVSCMLAVCSARSARDQ-----------------------------LSPSPAPSPVQGPN-KIVVGGSEHWRFGFNYSEWALKNGPFH
        MA ++S AILL++V  MLAV SA S  DQ                             L PSP P  +   N KI+VGGSE+WRFGF+Y+ WALKNGPF+
Subjt:  MASSSSSAILLLMVSCMLAVCSARSARDQ-----------------------------LSPSPAPSPVQGPN-KIVVGGSEHWRFGFNYSEWALKNGPFH

Query:  VNDTLVFKYEPPNGTTFPHSVFLLSNLESFWKCDLRGAEKVANWTQGGGDGFELVLQQSKIYYFACGERNGFHCSGGNMKFSVLP
        +NDTLVFKY+PPN TTFPHSV+LL NL SF +CDLR A+KVANWTQGGGDGFE VLQQSK YYFACGERNGFHC  G MKFS+ P
Subjt:  VNDTLVFKYEPPNGTTFPHSVFLLSNLESFWKCDLRGAEKVANWTQGGGDGFELVLQQSKIYYFACGERNGFHCSGGNMKFSVLP

XP_022148634.1 leucine-rich repeat extensin-like protein 3 [Momordica charantia] | e value: 4.7e-47 | % identity: 69.6
Query:  PSPAPSPVQGPNKIVVGGSEHWRFGFNYSEWALKNGPFHVNDTLVFKYEPPNGTTFPHSVFLLSNLESFWKCDLRGAEKVANWTQGGGDGFELVLQQSKI
        PS  P P Q P KI+VGGSE+W  GF+YS WALKNGPF +ND LVFKY+PP G T PHSV+LLSN++SF  CDLR A K+ANWTQG GDGF+ VL+Q K 
Subjt:  PSPAPSPVQGPNKIVVGGSEHWRFGFNYSEWALKNGPFHVNDTLVFKYEPPNGTTFPHSVFLLSNLESFWKCDLRGAEKVANWTQGGGDGFELVLQQSKI

Query:  YYFACGERNGFHCSGGNMKFSVLPI
        YYFACGE NGFHC  G+MKFSV PI
Subjt:  YYFACGERNGFHCSGGNMKFSVLPI

XP_022947924.1 uncharacterized protein LOC111451661 [Cucurbita moschata] | e value: 8.0e-47 | % identity: 63.87
Query:  MASSSSSAILLLMVSCMLAVCSARSARDQLSPSPAPSPVQGPNKIVVGGSEHWRFGFNYSEWALKNGPFHVNDTLVFKYEPPNGTTFPHSVFLLSNLESF
        MASSSS+AIL+L+VSCM  VCSAR                GP KIVVGG + W FG +YS+WALKN PF+VND LVF+Y  PNG   PHSV+LL NLESF
Subjt:  MASSSSSAILLLMVSCMLAVCSARSARDQLSPSPAPSPVQGPNKIVVGGSEHWRFGFNYSEWALKNGPFHVNDTLVFKYEPPNGTTFPHSVFLLSNLESF

Query:  WKCDLRGAEKVANWTQGGGDGFELVLQQSKIYYFACGERNGFHCSGGNMKFSVLP
          CD   A +VAN +QG GDGFE VL+QSK YYFACGERNG HC  GNMKFSVLP
Subjt:  WKCDLRGAEKVANWTQGGGDGFELVLQQSKIYYFACGERNGFHCSGGNMKFSVLP

XP_023007536.1 uncharacterized protein LOC111499996 [Cucurbita maxima] | e value: 1.0e-46 | % identity: 63.87
Query:  MASSSSSAILLLMVSCMLAVCSARSARDQLSPSPAPSPVQGPNKIVVGGSEHWRFGFNYSEWALKNGPFHVNDTLVFKYEPPNGTTFPHSVFLLSNLESF
        MASSSS+AIL+L+VSCM  VCSAR  +                KIVVGG + W FG +YS+WALKNGPF+VND LVF+Y  PNG   PHSV+LL NLESF
Subjt:  MASSSSSAILLLMVSCMLAVCSARSARDQLSPSPAPSPVQGPNKIVVGGSEHWRFGFNYSEWALKNGPFHVNDTLVFKYEPPNGTTFPHSVFLLSNLESF

Query:  WKCDLRGAEKVANWTQGGGDGFELVLQQSKIYYFACGERNGFHCSGGNMKFSVLP
         KCD   A +VAN +QG GDGFE VL+QSK YYFACGERNG HC  GNMKFSVLP
Subjt:  WKCDLRGAEKVANWTQGGGDGFELVLQQSKIYYFACGERNGFHCSGGNMKFSVLP

TrEMBL top hits
A0A6J1D388 uncharacterized protein LOC111017194 | e value: 3.6e-53 | % identity: 60.54
Query:  MASSSSSAILLLMVSCMLAVCSARSARDQ-----------------------------LSPSPAPSPVQGPN-KIVVGGSEHWRFGFNYSEWALKNGPFH
        MA ++S AILL++V  MLAV SA S  DQ                             L PSP P  +   N KI+VGGSE+WRFGF+Y+ WALKNGPF+
Subjt:  MASSSSSAILLLMVSCMLAVCSARSARDQ-----------------------------LSPSPAPSPVQGPN-KIVVGGSEHWRFGFNYSEWALKNGPFH

Query:  VNDTLVFKYEPPNGTTFPHSVFLLSNLESFWKCDLRGAEKVANWTQGGGDGFELVLQQSKIYYFACGERNGFHCSGGNMKFSVLP
        +NDTLVFKY+PPN TTFPHSV+LL NL SF +CDLR A+KVANWTQGGGDGFE VLQQSK YYFACGERNGFHC  G MKFS+ P
Subjt:  VNDTLVFKYEPPNGTTFPHSVFLLSNLESFWKCDLRGAEKVANWTQGGGDGFELVLQQSKIYYFACGERNGFHCSGGNMKFSVLP

A0A6J1D4K0 leucine-rich repeat extensin-like protein 3 | e value: 2.3e-47 | % identity: 69.6
Query:  PSPAPSPVQGPNKIVVGGSEHWRFGFNYSEWALKNGPFHVNDTLVFKYEPPNGTTFPHSVFLLSNLESFWKCDLRGAEKVANWTQGGGDGFELVLQQSKI
        PS  P P Q P KI+VGGSE+W  GF+YS WALKNGPF +ND LVFKY+PP G T PHSV+LLSN++SF  CDLR A K+ANWTQG GDGF+ VL+Q K 
Subjt:  PSPAPSPVQGPNKIVVGGSEHWRFGFNYSEWALKNGPFHVNDTLVFKYEPPNGTTFPHSVFLLSNLESFWKCDLRGAEKVANWTQGGGDGFELVLQQSKI

Query:  YYFACGERNGFHCSGGNMKFSVLPI
        YYFACGE NGFHC  G+MKFSV PI
Subjt:  YYFACGERNGFHCSGGNMKFSVLPI

A0A6J1G8B8 uncharacterized protein LOC111451661 | e value: 3.9e-47 | % identity: 63.87
Query:  MASSSSSAILLLMVSCMLAVCSARSARDQLSPSPAPSPVQGPNKIVVGGSEHWRFGFNYSEWALKNGPFHVNDTLVFKYEPPNGTTFPHSVFLLSNLESF
        MASSSS+AIL+L+VSCM  VCSAR                GP KIVVGG + W FG +YS+WALKN PF+VND LVF+Y  PNG   PHSV+LL NLESF
Subjt:  MASSSSSAILLLMVSCMLAVCSARSARDQLSPSPAPSPVQGPNKIVVGGSEHWRFGFNYSEWALKNGPFHVNDTLVFKYEPPNGTTFPHSVFLLSNLESF

Query:  WKCDLRGAEKVANWTQGGGDGFELVLQQSKIYYFACGERNGFHCSGGNMKFSVLP
          CD   A +VAN +QG GDGFE VL+QSK YYFACGERNG HC  GNMKFSVLP
Subjt:  WKCDLRGAEKVANWTQGGGDGFELVLQQSKIYYFACGERNGFHCSGGNMKFSVLP

A0A6J1L0T6 uncharacterized protein LOC111499996 | e value: 5.1e-47 | % identity: 63.87
Query:  MASSSSSAILLLMVSCMLAVCSARSARDQLSPSPAPSPVQGPNKIVVGGSEHWRFGFNYSEWALKNGPFHVNDTLVFKYEPPNGTTFPHSVFLLSNLESF
        MASSSS+AIL+L+VSCM  VCSAR  +                KIVVGG + W FG +YS+WALKNGPF+VND LVF+Y  PNG   PHSV+LL NLESF
Subjt:  MASSSSSAILLLMVSCMLAVCSARSARDQLSPSPAPSPVQGPNKIVVGGSEHWRFGFNYSEWALKNGPFHVNDTLVFKYEPPNGTTFPHSVFLLSNLESF

Query:  WKCDLRGAEKVANWTQGGGDGFELVLQQSKIYYFACGERNGFHCSGGNMKFSVLP
         KCD   A +VAN +QG GDGFE VL+QSK YYFACGERNG HC  GNMKFSVLP
Subjt:  WKCDLRGAEKVANWTQGGGDGFELVLQQSKIYYFACGERNGFHCSGGNMKFSVLP

A0A6P6AEP5 uncharacterized protein LOC111308916 isoform X1 | e value: 7.3e-46 | % identity: 67.52
Query:  QGPNKIVVGGSEHWRFGFNYSEWALKNGPFHVNDTLVFKYEPPNGTTFPHSVFLLSNLESFWKCDLRGAEKVANWTQGGGDGFELVLQQSKIYYFACGER
        +GPNKI+VGGSE+WRFGFNYSEWA +N PF+ NDTLVFKY+PP+  TFPHSV+L  +L S+W CDL+ A+ +AN TQGGGDGFEL L + + YYFACGER
Subjt:  QGPNKIVVGGSEHWRFGFNYSEWALKNGPFHVNDTLVFKYEPPNGTTFPHSVFLLSNLESFWKCDLRGAEKVANWTQGGGDGFELVLQQSKIYYFACGER

Query:  NGFHCSGGNMKFSVLPI
        NGFHC  G M+F V+P+
Subjt:  NGFHCSGGNMKFSVLPI

SwissProt top hits
No hits found
Arabidopsis top hits
AT2G15770.1 Cupredoxin superfamily protein | e value: 3.2e-25 | % identity: 40
Query:  PNKIVVGGSEHWRFGFNYSEWALKNGPFHVNDTLVFKYEPPNGTTFPHSVFLLSNLESFWKCDLRGAEKVANWTQGGGDGFELVLQQSKIYYFACGERNG
        P KI+VGGS+ W+ G +Y +WA KN PF+VND LVFKY+        ++V+L  +  S+  CD++ A K+ +  +G  + F   L++ + Y+FA GE +G
Subjt:  PNKIVVGGSEHWRFGFNYSEWALKNGPFHVNDTLVFKYEPPNGTTFPHSVFLLSNLESFWKCDLRGAEKVANWTQGGGDGFELVLQQSKIYYFACGERNG

Query:  FHCSGGNMKFSVLPI
         +C   NMKF++ P+
Subjt:  FHCSGGNMKFSVLPI

AT2G15780.1 Cupredoxin superfamily protein | e value: 6.8e-36 | % identity: 54.31
Query:  GPNKIVVGGSEHWRFGFNYSEWALKNGPFHVNDTLVFKYEPPNGTTFPHSVFLLSNLESFWKCDLRGAEKVANWTQGGGDGFELVLQQSKIYYFACGERN
        GP KI+VGG + W +GFNY++WA K  PF +ND LVFKY PP    F HSV+LL N  S+ KCD++  + +A+  QG G GFE VL+Q K YY +CGE +
Subjt:  GPNKIVVGGSEHWRFGFNYSEWALKNGPFHVNDTLVFKYEPPNGTTFPHSVFLLSNLESFWKCDLRGAEKVANWTQGGGDGFELVLQQSKIYYFACGERN

Query:  GFHCSGGNMKFSVLPI
        G HCS G MKF+V+P+
Subjt:  GFHCSGGNMKFSVLPI

AT4G34300.1 Cupredoxin superfamily protein | e value: 3.9e-15 | % identity: 35.4
Query:  WRFGFNYSEWALKNGPFHVNDTLVFKYEPPNGTTF---------PHSVFLLSNLESFWKCDLRGAEKVANWTQGGGDGFELVLQQSKIYYFACGERNGFH
        W+ G+ Y+EW  K+ PF+VND LVF Y   + T            + V+LL +++SF +C++   +K+         GF+L+L++   YYF  G+ N   
Subjt:  WRFGFNYSEWALKNGPFHVNDTLVFKYEPPNGTTF---------PHSVFLLSNLESFWKCDLRGAEKVANWTQGGGDGFELVLQQSKIYYFACGERNGFH

Query:  CSGGNMKFSVLPI
            NMKFSV PI
Subjt:  CSGGNMKFSVLPI
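
In the alignments above, the middle line of each block follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below counts identities and positives for a single block from its query and middle lines. The helper name `midline_identity` is hypothetical, and the percentage it gives applies only to the block passed in, whereas the % identity reported for each hit covers the whole alignment.

```python
def midline_identity(query, midline):
    """Return (identities, positives, length) for one alignment block.

    Assumes the BLAST midline convention: letter = identical residue,
    '+' = conservative substitution, space = mismatch or gap.
    """
    # Trailing spaces of the midline are easily lost when copying; pad them back.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("midline is longer than the query line")
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# Last block of the XP_022148634.1 alignment from this record.
query = "YYFACGERNGFHCSGGNMKFSVLPI"
midline = "YYFACGE NGFHC  G+MKFSV PI"
ident, pos, length = midline_identity(query, midline)
print(ident, pos, length, round(100 * ident / length, 2))  # 20 21 25 80.0
```

Summing the identities and lengths over all blocks of one hit, rather than averaging the per-block percentages, gives a figure comparable to the reported % identity.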


Sequences
CDS sequence
ATGGCTTCCTCATCTTCATCAGCCATCCTCCTCCTCATGGTCTCTTGCATGTTGGCTGTCTGTTCAGCCCGCAGCGCTCGAGATCAGCTTTCGCCATCGCCAGCTCCATC
ACCGGTACAAGGTCCCAACAAGATCGTCGTCGGTGGTTCCGAGCATTGGCGTTTTGGCTTCAACTATAGCGAGTGGGCGTTAAAAAATGGCCCTTTTCACGTCAACGATA
CTCTTGTTTTCAAGTACGAGCCTCCAAACGGCACGACATTTCCTCATAGTGTATTCTTGCTATCAAATTTGGAGAGCTTCTGGAAGTGTGATTTGAGAGGGGCTGAAAAG
GTAGCAAATTGGACGCAAGGAGGAGGAGATGGATTCGAGTTGGTGCTCCAACAATCCAAAATTTACTACTTTGCTTGTGGAGAACGCAATGGCTTCCATTGCAGTGGTGG
AAACATGAAGTTCTCTGTGCTGCCAATATAA
mRNA sequence
ATGGCTTCCTCATCTTCATCAGCCATCCTCCTCCTCATGGTCTCTTGCATGTTGGCTGTCTGTTCAGCCCGCAGCGCTCGAGATCAGCTTTCGCCATCGCCAGCTCCATC
ACCGGTACAAGGTCCCAACAAGATCGTCGTCGGTGGTTCCGAGCATTGGCGTTTTGGCTTCAACTATAGCGAGTGGGCGTTAAAAAATGGCCCTTTTCACGTCAACGATA
CTCTTGTTTTCAAGTACGAGCCTCCAAACGGCACGACATTTCCTCATAGTGTATTCTTGCTATCAAATTTGGAGAGCTTCTGGAAGTGTGATTTGAGAGGGGCTGAAAAG
GTAGCAAATTGGACGCAAGGAGGAGGAGATGGATTCGAGTTGGTGCTCCAACAATCCAAAATTTACTACTTTGCTTGTGGAGAACGCAATGGCTTCCATTGCAGTGGTGG
AAACATGAAGTTCTCTGTGCTGCCAATATAA
Protein sequence
MASSSSSAILLLMVSCMLAVCSARSARDQLSPSPAPSPVQGPNKIVVGGSEHWRFGFNYSEWALKNGPFHVNDTLVFKYEPPNGTTFPHSVFLLSNLESFWKCDLRGAEK
VANWTQGGGDGFELVLQQSKIYYFACGERNGFHCSGGNMKFSVLPI
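
The CDS and protein sequences above can be cross-checked by translating the CDS. The sketch below is a minimal illustration that assumes the standard nuclear genetic code (NCBI translation table 1); the `translate` helper is hypothetical and handles only unambiguous A/C/G/T codons. For brevity it is applied to the first 30 and last 18 nucleotides of the CDS rather than the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 and last 18 nucleotides of the CDS listed in this record.
print(translate("ATGGCTTCCTCATCTTCATCAGCCATCCTC"))  # MASSSSSAIL
print(translate("TCTGTGCTGCCAATATAA"))              # SVLPI*
```

Both fragments agree with the start (MASSSSSAIL) and end (SVLPI, followed by the stop codon) of the protein sequence listed above.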