; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh15G000100 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh15G000100
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionGST N-terminal domain-containing protein
Genome locationCmo_Chr15:72829..75482
RNA-Seq ExpressionCmoCh15G000100
SyntenyCmoCh15G000100
Gene Ontology termsGO:0006749 - glutathione metabolic process (biological process)
GO:0004362 - glutathione-disulfide reductase activity (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR004045 - Glutathione S-transferase, N-terminal
IPR011767 - Glutaredoxin active site
IPR036249 - Thioredoxin-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6578307.1 hypothetical protein SDJN03_22755, partial [Cucurbita argyrosperma subsp. sororia]2.2e-10186.9Show/hide
Query:  MSSFSFTTPYYSSLQILLSDRSFNQATTSVLLNRNLFPISSKFLRISSRRHRFHANSVRSGAETEEPRAQHSPESNAVSNNGSKLSTSFLSYLCPLLKIF
        MSSFSFTTPYYSSLQILLSDRSFNQATTSVLLNRNLFPISS FLRISSRRHRFHANSVRSGAETEEPRAQHSPESNAVS+NGSKLSTSFLSYLCPLLKIF
Subjt:  MSSFSFTTPYYSSLQILLSDRSFNQATTSVLLNRNLFPISSKFLRISSRRHRFHANSVRSGAETEEPRAQHSPESNAVSNNGSKLSTSFLSYLCPLLKIF

Query:  AGGDPSRERNFTLEAIIFRSLLSGLFFSFNAFPYFCVSPVATSSLSTLARLPWGSRTLSDSSQSNRNINLEPLLPLQLYEFEACPFCRRVREALTELDLL
        AGGDPSRERNFTLE                         VATSSLSTLARLPWGSRTLSDSSQSNRNINLEPLLPLQLYEFEACPFCRRVREALTELDLL
Subjt:  AGGDPSRERNFTLEAIIFRSLLSGLFFSFNAFPYFCVSPVATSSLSTLARLPWGSRTLSDSSQSNRNINLEPLLPLQLYEFEACPFCRRVREALTELDLL

Query:  VEVYPCPKGSIRHRDIVKKCGGKEQYVVL
        VEVYPCPKGSIRHRDIVKKCGGKEQ+  L
Subjt:  VEVYPCPKGSIRHRDIVKKCGGKEQYVVL

KAG7015883.1 hypothetical protein SDJN02_20986 [Cucurbita argyrosperma subsp. argyrosperma]2.2e-11793.72Show/hide
Query:  MSSFSFTTPYYSSLQILLSDRSFNQATTSVLLNRNLFPISSKFLRISSRRHRFHANSVRSGAETEEPRAQHSPESNAVSNNGSKLSTSFLSYLCPLLKIF
        MSSFSFTTPYYSSLQI+LSDRSFNQATTSVLLNRNLFPISSKFLRISSRR RFHANSVRSGAETEEPRAQHSPESNAVSNNGSKLSTSFLSYLCPLLKIF
Subjt:  MSSFSFTTPYYSSLQILLSDRSFNQATTSVLLNRNLFPISSKFLRISSRRHRFHANSVRSGAETEEPRAQHSPESNAVSNNGSKLSTSFLSYLCPLLKIF

Query:  AGGDPSRERNFTLEAIIFRSLLSGLFFSFNAFPYFCVSPVATSSLSTLARLPWGSRTLSDSSQSNRNINLEPLLPLQLYEFEACPFCRRVREALTELDLL
        AGGDPSRERNFTLEAIIFRSLLSGLFFSFNAFPYFCVSPVATSSLSTLARLPWGSRTLSDSSQSNRNINLEPLLPLQLYEFEACPFCRRVREALTELDLL
Subjt:  AGGDPSRERNFTLEAIIFRSLLSGLFFSFNAFPYFCVSPVATSSLSTLARLPWGSRTLSDSSQSNRNINLEPLLPLQLYEFEACPFCRRVREALTELDLL

Query:  VE----------VYPCPKGSIRHRDIVKKCGGKEQYVVL
        VE          VYPCPKGSIRHRDIVKKCGGKEQ+  L
Subjt:  VE----------VYPCPKGSIRHRDIVKKCGGKEQYVVL

XP_022939328.1 uncharacterized protein LOC111445276 [Cucurbita moschata]1.5e-10287.77Show/hide
Query:  MSSFSFTTPYYSSLQILLSDRSFNQATTSVLLNRNLFPISSKFLRISSRRHRFHANSVRSGAETEEPRAQHSPESNAVSNNGSKLSTSFLSYLCPLLKIF
        MSSFSFTTPYYSSLQILLSDRSFNQATTSVLLNRNLFPISSKFLRISSRRHRFHANSVRSGAETEEPRAQHSPESNAVSNNGSKLSTSFLSYLCPLLKIF
Subjt:  MSSFSFTTPYYSSLQILLSDRSFNQATTSVLLNRNLFPISSKFLRISSRRHRFHANSVRSGAETEEPRAQHSPESNAVSNNGSKLSTSFLSYLCPLLKIF

Query:  AGGDPSRERNFTLEAIIFRSLLSGLFFSFNAFPYFCVSPVATSSLSTLARLPWGSRTLSDSSQSNRNINLEPLLPLQLYEFEACPFCRRVREALTELDLL
        AGGDPSRERNFTLE                         VATSSLSTLARLPWGSRTLSDSSQSNRNINLEPLLPLQLYEFEACPFCRRVREALTELDLL
Subjt:  AGGDPSRERNFTLEAIIFRSLLSGLFFSFNAFPYFCVSPVATSSLSTLARLPWGSRTLSDSSQSNRNINLEPLLPLQLYEFEACPFCRRVREALTELDLL

Query:  VEVYPCPKGSIRHRDIVKKCGGKEQYVVL
        VEVYPCPKGSIRHRDIVKKCGGKEQ+  L
Subjt:  VEVYPCPKGSIRHRDIVKKCGGKEQYVVL

XP_022992993.1 uncharacterized protein LOC111489153 [Cucurbita maxima]2.1e-9985.59Show/hide
Query:  MSSFSFTTPYYSSLQILLSDRSFNQATTSVLLNRNLFPISSKFLRISSRRHRFHANSVRSGAETEEPRAQHSPESNAVSNNGSKLSTSFLSYLCPLLKIF
        MSSFSFTTPYYSSLQI+LSDRSFNQATTSVLLNRNLFPISSKFLRISSRRHRFHANSVRSGAETEEPRAQHSPESNAVS+NGSKLSTSFLSYLCPLL  F
Subjt:  MSSFSFTTPYYSSLQILLSDRSFNQATTSVLLNRNLFPISSKFLRISSRRHRFHANSVRSGAETEEPRAQHSPESNAVSNNGSKLSTSFLSYLCPLLKIF

Query:  AGGDPSRERNFTLEAIIFRSLLSGLFFSFNAFPYFCVSPVATSSLSTLARLPWGSRTLSDSSQSNRNINLEPLLPLQLYEFEACPFCRRVREALTELDLL
        A GDPSRERNFTLE                         VATSSLSTLARLPWGSRTLSDSSQSNRNINLEPLLPLQLYEFEACPFCRRVREALTELDLL
Subjt:  AGGDPSRERNFTLEAIIFRSLLSGLFFSFNAFPYFCVSPVATSSLSTLARLPWGSRTLSDSSQSNRNINLEPLLPLQLYEFEACPFCRRVREALTELDLL

Query:  VEVYPCPKGSIRHRDIVKKCGGKEQYVVL
        VEVYPCPKGSIRHRDIVKKCGGKEQ+  L
Subjt:  VEVYPCPKGSIRHRDIVKKCGGKEQYVVL

XP_023550255.1 uncharacterized protein LOC111808485 [Cucurbita pepo subsp. pepo]5.8e-10287.34Show/hide
Query:  MSSFSFTTPYYSSLQILLSDRSFNQATTSVLLNRNLFPISSKFLRISSRRHRFHANSVRSGAETEEPRAQHSPESNAVSNNGSKLSTSFLSYLCPLLKIF
        MSSFSFTTPYYSSLQILLSDRSFNQATTSVLLNRNLFPISSKFLRISSRRHRFHANSVRSGAETEEPRAQHSPESNAVS+NGSKLSTSFLSYLCPLLKIF
Subjt:  MSSFSFTTPYYSSLQILLSDRSFNQATTSVLLNRNLFPISSKFLRISSRRHRFHANSVRSGAETEEPRAQHSPESNAVSNNGSKLSTSFLSYLCPLLKIF

Query:  AGGDPSRERNFTLEAIIFRSLLSGLFFSFNAFPYFCVSPVATSSLSTLARLPWGSRTLSDSSQSNRNINLEPLLPLQLYEFEACPFCRRVREALTELDLL
        AGGDPSRERNFTLE                         VATSSLSTLARLPWGSRTLSDSSQSNRNINLEPLLPLQLYEFEACPFCRRVREALTELDLL
Subjt:  AGGDPSRERNFTLEAIIFRSLLSGLFFSFNAFPYFCVSPVATSSLSTLARLPWGSRTLSDSSQSNRNINLEPLLPLQLYEFEACPFCRRVREALTELDLL

Query:  VEVYPCPKGSIRHRDIVKKCGGKEQYVVL
        VEVYPCPKGSIRHRDIVKKCGGKEQ+  L
Subjt:  VEVYPCPKGSIRHRDIVKKCGGKEQYVVL

TrEMBL top hitse value%identityAlignment
A0A0A0LQK6 Uncharacterized protein3.3e-7972.41Show/hide
Query:  MSSFSFTTPYYSSL---QILLSDRSFNQATTSVLLNRNLFPISSKFLRISSRRHRFHANSVRSGAETEEPRAQHSPESNAVSNNGSKLSTSFLSYLCPLL
        M+SF F TPYYSSL    IL S++SFNQA TSVL NRNLFPIS+K  RISS R RFHA+SVRS AETEE R + S ESN V +NGS LSTSFLSYLCPLL
Subjt:  MSSFSFTTPYYSSL---QILLSDRSFNQATTSVLLNRNLFPISSKFLRISSRRHRFHANSVRSGAETEEPRAQHSPESNAVSNNGSKLSTSFLSYLCPLL

Query:  KIFAGGDPSRERNFTLEAIIFRSLLSGLFFSFNAFPYFCVSPVATSSLSTLARLPWGSRTLSDSSQSNRNINLEPLLPLQLYEFEACPFCRRVREALTEL
        K+FAGGDPSRERNFTLE                         VATSSLS+LARLPWGSRTLSD+S SNRNI+LE LLPLQLYEFEACPFCRRVREALTEL
Subjt:  KIFAGGDPSRERNFTLEAIIFRSLLSGLFFSFNAFPYFCVSPVATSSLSTLARLPWGSRTLSDSSQSNRNINLEPLLPLQLYEFEACPFCRRVREALTEL

Query:  DLLVEVYPCPKGSIRHRDIVKKCGGKEQYVVL
        DL VEVYPCPKGSIRHRDIVKK GGKEQ+  L
Subjt:  DLLVEVYPCPKGSIRHRDIVKKCGGKEQYVVL

A0A6J1DF22 uncharacterized protein LOC111020220 isoform X39.7e-7971.91Show/hide
Query:  MSSFSFTTPYYSSLQ---ILLSDRSFNQATTSVLLNRNLFPISSKFLRISSRRHRFHANSVRSGAETEEPRAQHSPESNAVSNNGSKLSTSFLSYLCPLL
        MSSFS TTP YS+L    I+ ++RSFN+ATTSVL NRNLFPIS+K LRISSRR RFH NSV S AETEEPR Q+S E NAVS+ GS LSTSFLSYLCPLL
Subjt:  MSSFSFTTPYYSSLQ---ILLSDRSFNQATTSVLLNRNLFPISSKFLRISSRRHRFHANSVRSGAETEEPRAQHSPESNAVSNNGSKLSTSFLSYLCPLL

Query:  KIFAGGDPSRERNFTLEAIIFRSLLSGLFFSFNAFPYFCVSPVATSSLSTLARLPWGSRTLSDSSQS-NRNINLEPLLPLQLYEF-EACPFCRRVREALT
        K+FAGGDPS ERNFTLE                         VATSSLSTLARLPWGSRT+SD+S+S  RN NL PLLPLQLYEF EACPFCRRVREALT
Subjt:  KIFAGGDPSRERNFTLEAIIFRSLLSGLFFSFNAFPYFCVSPVATSSLSTLARLPWGSRTLSDSSQS-NRNINLEPLLPLQLYEF-EACPFCRRVREALT

Query:  ELDLLVEVYPCPKGSIRHRDIVKKCGGKEQYVVLS
        ELDL VEVYPCPKGSIRHRDIVKKCGGKEQ+  L+
Subjt:  ELDLLVEVYPCPKGSIRHRDIVKKCGGKEQYVVLS

A0A6J1DG84 uncharacterized protein LOC111020220 isoform X43.9e-8072.22Show/hide
Query:  MSSFSFTTPYYSSLQ---ILLSDRSFNQATTSVLLNRNLFPISSKFLRISSRRHRFHANSVRSGAETEEPRAQHSPESNAVSNNGSKLSTSFLSYLCPLL
        MSSFS TTP YS+L    I+ ++RSFN+ATTSVL NRNLFPIS+K LRISSRR RFH NSV S AETEEPR Q+S E NAVS+ GS LSTSFLSYLCPLL
Subjt:  MSSFSFTTPYYSSLQ---ILLSDRSFNQATTSVLLNRNLFPISSKFLRISSRRHRFHANSVRSGAETEEPRAQHSPESNAVSNNGSKLSTSFLSYLCPLL

Query:  KIFAGGDPSRERNFTLEAIIFRSLLSGLFFSFNAFPYFCVSPVATSSLSTLARLPWGSRTLSDSSQS-NRNINLEPLLPLQLYEFEACPFCRRVREALTE
        K+FAGGDPS ERNFTLE                         VATSSLSTLARLPWGSRT+SD+S+S  RN NL PLLPLQLYEFEACPFCRRVREALTE
Subjt:  KIFAGGDPSRERNFTLEAIIFRSLLSGLFFSFNAFPYFCVSPVATSSLSTLARLPWGSRTLSDSSQS-NRNINLEPLLPLQLYEFEACPFCRRVREALTE

Query:  LDLLVEVYPCPKGSIRHRDIVKKCGGKEQYVVLS
        LDL VEVYPCPKGSIRHRDIVKKCGGKEQ+  L+
Subjt:  LDLLVEVYPCPKGSIRHRDIVKKCGGKEQYVVLS

A0A6J1FLC5 uncharacterized protein LOC1114452767.4e-10387.77Show/hide
Query:  MSSFSFTTPYYSSLQILLSDRSFNQATTSVLLNRNLFPISSKFLRISSRRHRFHANSVRSGAETEEPRAQHSPESNAVSNNGSKLSTSFLSYLCPLLKIF
        MSSFSFTTPYYSSLQILLSDRSFNQATTSVLLNRNLFPISSKFLRISSRRHRFHANSVRSGAETEEPRAQHSPESNAVSNNGSKLSTSFLSYLCPLLKIF
Subjt:  MSSFSFTTPYYSSLQILLSDRSFNQATTSVLLNRNLFPISSKFLRISSRRHRFHANSVRSGAETEEPRAQHSPESNAVSNNGSKLSTSFLSYLCPLLKIF

Query:  AGGDPSRERNFTLEAIIFRSLLSGLFFSFNAFPYFCVSPVATSSLSTLARLPWGSRTLSDSSQSNRNINLEPLLPLQLYEFEACPFCRRVREALTELDLL
        AGGDPSRERNFTLE                         VATSSLSTLARLPWGSRTLSDSSQSNRNINLEPLLPLQLYEFEACPFCRRVREALTELDLL
Subjt:  AGGDPSRERNFTLEAIIFRSLLSGLFFSFNAFPYFCVSPVATSSLSTLARLPWGSRTLSDSSQSNRNINLEPLLPLQLYEFEACPFCRRVREALTELDLL

Query:  VEVYPCPKGSIRHRDIVKKCGGKEQYVVL
        VEVYPCPKGSIRHRDIVKKCGGKEQ+  L
Subjt:  VEVYPCPKGSIRHRDIVKKCGGKEQYVVL

A0A6J1K0W0 uncharacterized protein LOC1114891531.0e-9985.59Show/hide
Query:  MSSFSFTTPYYSSLQILLSDRSFNQATTSVLLNRNLFPISSKFLRISSRRHRFHANSVRSGAETEEPRAQHSPESNAVSNNGSKLSTSFLSYLCPLLKIF
        MSSFSFTTPYYSSLQI+LSDRSFNQATTSVLLNRNLFPISSKFLRISSRRHRFHANSVRSGAETEEPRAQHSPESNAVS+NGSKLSTSFLSYLCPLL  F
Subjt:  MSSFSFTTPYYSSLQILLSDRSFNQATTSVLLNRNLFPISSKFLRISSRRHRFHANSVRSGAETEEPRAQHSPESNAVSNNGSKLSTSFLSYLCPLLKIF

Query:  AGGDPSRERNFTLEAIIFRSLLSGLFFSFNAFPYFCVSPVATSSLSTLARLPWGSRTLSDSSQSNRNINLEPLLPLQLYEFEACPFCRRVREALTELDLL
        A GDPSRERNFTLE                         VATSSLSTLARLPWGSRTLSDSSQSNRNINLEPLLPLQLYEFEACPFCRRVREALTELDLL
Subjt:  AGGDPSRERNFTLEAIIFRSLLSGLFFSFNAFPYFCVSPVATSSLSTLARLPWGSRTLSDSSQSNRNINLEPLLPLQLYEFEACPFCRRVREALTELDLL

Query:  VEVYPCPKGSIRHRDIVKKCGGKEQYVVL
        VEVYPCPKGSIRHRDIVKKCGGKEQ+  L
Subjt:  VEVYPCPKGSIRHRDIVKKCGGKEQYVVL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G10000.1 Thioredoxin family protein7.9e-4149.44Show/hide
Query:  RHRFHANSVRSGAETEEPRAQHSPESNAVSNNGSKLSTSFLSYLCPLLKIFAGGDPSRERNFTLEAIIFRSLLSGLFFSFNAFPYFCVSPVATSSLSTLA
        R R +  S  S  + EE          A   + S  ++SFLS+LCPLLK+F+GGDPS++RN  LE                         VATSSL+++A
Subjt:  RHRFHANSVRSGAETEEPRAQHSPESNAVSNNGSKLSTSFLSYLCPLLKIFAGGDPSRERNFTLEAIIFRSLLSGLFFSFNAFPYFCVSPVATSSLSTLA

Query:  RLPWGSRTLSDSSQSNRNINLEPLLPLQLYEFEACPFCRRVREALTELDLLVEVYPCPKGSIRHRDIVKKCGGKEQYVVL
        RLPWGSR +S  S  N++++  P L LQL+EFEACPFCRRVREA+TELDL VEVYPCPKGSIRHR++V++ GGKE +  L
Subjt:  RLPWGSRTLSDSSQSNRNINLEPLLPLQLYEFEACPFCRRVREALTELDLLVEVYPCPKGSIRHRDIVKKCGGKEQYVVL

AT4G10000.2 Thioredoxin family protein7.9e-4149.44Show/hide
Query:  RHRFHANSVRSGAETEEPRAQHSPESNAVSNNGSKLSTSFLSYLCPLLKIFAGGDPSRERNFTLEAIIFRSLLSGLFFSFNAFPYFCVSPVATSSLSTLA
        R R +  S  S  + EE          A   + S  ++SFLS+LCPLLK+F+GGDPS++RN  LE                         VATSSL+++A
Subjt:  RHRFHANSVRSGAETEEPRAQHSPESNAVSNNGSKLSTSFLSYLCPLLKIFAGGDPSRERNFTLEAIIFRSLLSGLFFSFNAFPYFCVSPVATSSLSTLA

Query:  RLPWGSRTLSDSSQSNRNINLEPLLPLQLYEFEACPFCRRVREALTELDLLVEVYPCPKGSIRHRDIVKKCGGKEQYVVL
        RLPWGSR +S  S  N++++  P L LQL+EFEACPFCRRVREA+TELDL VEVYPCPKGSIRHR++V++ GGKE +  L
Subjt:  RLPWGSRTLSDSSQSNRNINLEPLLPLQLYEFEACPFCRRVREALTELDLLVEVYPCPKGSIRHRDIVKKCGGKEQYVVL

AT5G03880.1 Thioredoxin family protein4.1e-1328.14Show/hide
Query:  LLNRNLFPISSKFLRISSRRHRFHANSVRSGAETEEPRAQHSPESNAVSNNGSKLSTSFLSYLCPLLKIFAGGDPSRERNFTLEAIIFRSLLS---GLFF
        +L  N+ P       +SS   R   +S+R+    +   ++ S ES +VS   S  + + + +  P      G  P   + F +++     +L    GLFF
Subjt:  LLNRNLFPISSKFLRISSRRHRFHANSVRSGAETEEPRAQHSPESNAVSNNGSKLSTSFLSYLCPLLKIFAGGDPSRERNFTLEAIIFRSLLS---GLFF

Query:  SFNAFPYFCVSPVATSSLSTLARLPWGSRTLSDSSQSNRNINLEPLLPLQLYEFEACPFCRRVREALTELDLLVEVYPCPKGSIRHRDIVKKCGGKEQY
         F    +      +  S   +    +  R    + +    +   P  P+++YEFE CPFCR+VRE +  LDL +  YPCP+GS   R  VK+ GGK+Q+
Subjt:  SFNAFPYFCVSPVATSSLSTLARLPWGSRTLSDSSQSNRNINLEPLLPLQLYEFEACPFCRRVREALTELDLLVEVYPCPKGSIRHRDIVKKCGGKEQY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCATCGTTTTCTTTCACTACTCCATATTATTCCTCTCTGCAAATCCTTCTTTCCGATCGCAGCTTCAACCAAGCAACCACATCTGTCCTTCTGAATCGGAATCTCTT
TCCGATTTCATCAAAATTTTTGAGAATTTCTTCACGCAGACACAGGTTTCACGCAAACTCTGTTCGCTCAGGTGCTGAAACTGAAGAACCTCGAGCTCAGCATTCGCCCG
AAAGTAATGCAGTTTCGAACAATGGAAGCAAGCTTTCTACAAGTTTTCTATCTTATCTCTGTCCTTTACTCAAGATTTTCGCTGGAGGAGATCCTTCGAGAGAGAGGAAT
TTTACTTTGGAGGCAATAATTTTTCGCTCTTTACTCTCTGGATTGTTTTTTTCGTTCAATGCGTTTCCTTATTTTTGTGTTTCGCCGGTAGCCACATCTTCCTTATCTAC
ATTGGCTAGGCTTCCATGGGGCTCAAGAACGTTGTCGGATAGTTCTCAAAGCAATAGGAATATTAATTTGGAGCCTCTGTTGCCTTTGCAACTCTATGAATTTGAGGCAT
GCCCCTTTTGCAGGAGGGTTCGAGAGGCCTTAACTGAACTGGATCTTTTAGTAGAGGTTTATCCTTGTCCCAAGGGTTCTATTAGACATCGGGACATAGTTAAGAAATGT
GGTGGCAAAGAGCAGTACGTGGTTCTTTCTATTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCATCGTTTTCTTTCACTACTCCATATTATTCCTCTCTGCAAATCCTTCTTTCCGATCGCAGCTTCAACCAAGCAACCACATCTGTCCTTCTGAATCGGAATCTCTT
TCCGATTTCATCAAAATTTTTGAGAATTTCTTCACGCAGACACAGGTTTCACGCAAACTCTGTTCGCTCAGGTGCTGAAACTGAAGAACCTCGAGCTCAGCATTCGCCCG
AAAGTAATGCAGTTTCGAACAATGGAAGCAAGCTTTCTACAAGTTTTCTATCTTATCTCTGTCCTTTACTCAAGATTTTCGCTGGAGGAGATCCTTCGAGAGAGAGGAAT
TTTACTTTGGAGGCAATAATTTTTCGCTCTTTACTCTCTGGATTGTTTTTTTCGTTCAATGCGTTTCCTTATTTTTGTGTTTCGCCGGTAGCCACATCTTCCTTATCTAC
ATTGGCTAGGCTTCCATGGGGCTCAAGAACGTTGTCGGATAGTTCTCAAAGCAATAGGAATATTAATTTGGAGCCTCTGTTGCCTTTGCAACTCTATGAATTTGAGGCAT
GCCCCTTTTGCAGGAGGGTTCGAGAGGCCTTAACTGAACTGGATCTTTTAGTAGAGGTTTATCCTTGTCCCAAGGGTTCTATTAGACATCGGGACATAGTTAAGAAATGT
GGTGGCAAAGAGCAGTACGTGGTTCTTTCTATTTAG
Protein sequenceShow/hide protein sequence
MSSFSFTTPYYSSLQILLSDRSFNQATTSVLLNRNLFPISSKFLRISSRRHRFHANSVRSGAETEEPRAQHSPESNAVSNNGSKLSTSFLSYLCPLLKIFAGGDPSRERN
FTLEAIIFRSLLSGLFFSFNAFPYFCVSPVATSSLSTLARLPWGSRTLSDSSQSNRNINLEPLLPLQLYEFEACPFCRRVREALTELDLLVEVYPCPKGSIRHRDIVKKC
GGKEQYVVLSI