; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0003601 (gene) of Snake gourd v1 genome

Gene IDTan0003601
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProtein of unknown function (DUF707)
Genome locationLG05:84319378..84326158
RNA-Seq ExpressionTan0003601
SyntenyTan0003601
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR007877 - Protein of unknown function DUF707


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008442108.1 PREDICTED: uncharacterized protein LOC103486065 [Cucumis melo]7.9e-21187.56Show/hide
Query:  MKPPKSWDMLFRKRNLFSDGGKYGFKMKQLPFMGVICAVMLFIVYRTTNYQYHQTKIETTLQPFETTKDFGEESQTLIDLPRGIVEARSDLELRPLWVTS
        MKP KSWDML RKRNLFSDGGKYGFKMKQLPFM VIC+VMLFIVYRTTNYQY QTKIETTL PF+TTK+F EES  L  LPRGIVEARSDLELRPLW TS
Subjt:  MKPPKSWDMLFRKRNLFSDGGKYGFKMKQLPFMGVICAVMLFIVYRTTNYQYHQTKIETTLQPFETTKDFGEESQTLIDLPRGIVEARSDLELRPLWVTS

Query:  SSKLKADDYSNRNLLAIPAGIKQKHYVDSIVRKFIPENFTIILFHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLQPAIVSIYDYIFLWDEDLGV
        SS+LK  DYSNRNLLAIP GIKQK  V+SIV+KFIP NFTIILFHYDGNVDGWWDLDW NDAIHIA RNQTKWWYAKRFLQPA+VSIYDYIFLWDEDLGV
Subjt:  SSKLKADDYSNRNLLAIPAGIKQKHYVDSIVRKFIPENFTIILFHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLQPAIVSIYDYIFLWDEDLGV

Query:  EHFSPRRYLDIVKSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSVKCSDESDEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMD
        EHFSPRRYL+IVKSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRG+VKCSDES+EPPCTGFVEGMAPVFS+SAW+CTWHLIQNDLVHGWGMD
Subjt:  EHFSPRRYLDIVKSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSVKCSDESDEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMD

Query:  MKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRTSKASSKAAELAKKQSPVPPGDVRTEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQHSLNSD
        MKLGYCAQGDRTK VGVIDSQYIVHKGIQTLGGGG  SK  SKAA  AKK SP+ PGDVRTEIRRQSTWELQIFK+RWNKAVAED++W+DPFK +SL SD
Subjt:  MKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRTSKASSKAAELAKKQSPVPPGDVRTEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQHSLNSD

Query:  ERRRIRRHRH
        ERRR R+ RH
Subjt:  ERRRIRRHRH

XP_011653039.1 uncharacterized protein LOC101217607 isoform X1 [Cucumis sativus]3.9e-21086.86Show/hide
Query:  MKPPKSWDMLFRKRNLFSDGGKYGFKMKQLPFMGVICAVMLFIVYRTTNYQYHQTKIETTLQPFETTKDFGEESQTLIDLPRGIVEARSDLELRPLWVTS
        MKP KSWD+L RKRNLFSD GKYGFKMKQLPFMGVIC+VMLFI+YRTTNYQY QTKIET LQPF+T KD+ EESQ L  LPRGIVEARSDLELRPLW TS
Subjt:  MKPPKSWDMLFRKRNLFSDGGKYGFKMKQLPFMGVICAVMLFIVYRTTNYQYHQTKIETTLQPFETTKDFGEESQTLIDLPRGIVEARSDLELRPLWVTS

Query:  SSKLKADDYSNRNLLAIPAGIKQKHYVDSIVRKFIPENFTIILFHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLQPAIVSIYDYIFLWDEDLGV
        SS+LK  DYSNRNLLAIP GIKQK  V+SIV+KFIPENFTIILFHYDGNVDGWWDLDW NDAIHIA RNQTKWWYAKRFLQPA+VSIYDYIFLWDEDLGV
Subjt:  SSKLKADDYSNRNLLAIPAGIKQKHYVDSIVRKFIPENFTIILFHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLQPAIVSIYDYIFLWDEDLGV

Query:  EHFSPRRYLDIVKSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSVKCSDESDEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMD
        EHFSPRRYL+IVKSEGLEISQPALDPNSTDIHHRIT+RARTKKIHRRVYDLRG+VKCSDES+EPPCTGFVEGMAPVFS+SAW+CTWHLIQNDLVHGWGMD
Subjt:  EHFSPRRYLDIVKSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSVKCSDESDEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMD

Query:  MKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRTSKASSKAAELAKKQSPVPPGDVRTEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQHSLNSD
        MKLGYCAQGDRTK VGVIDSQYIVHKGIQTLGGGG  SK  SKAA  AKKQ+P+ P DVRTEIRRQSTWELQIFK+RWNKAVAED++W+DPFK +SL SD
Subjt:  MKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRTSKASSKAAELAKKQSPVPPGDVRTEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQHSLNSD

Query:  ERRRIRRH-RH
        ERRR RR  RH
Subjt:  ERRRIRRH-RH

XP_022139892.1 uncharacterized protein LOC111010696 [Momordica charantia]9.4e-20484.41Show/hide
Query:  MKPPKSWDMLFRKRNLFSDGGKYGFKMKQLPFMGVICAVMLFIVYRTTNYQYHQTKIETTLQPFETTK-DFGEESQTLIDLPRGIVEARSDLELRPLW--
        MKP KSW+MLFRK+NLFSDGG+YGFKMKQLPFMGVIC VMLFIVYRTTNYQY  T+IETTLQPF+TTK DFGE S  L  LPRGI+EARSDLELRPLW  
Subjt:  MKPPKSWDMLFRKRNLFSDGGKYGFKMKQLPFMGVICAVMLFIVYRTTNYQYHQTKIETTLQPFETTK-DFGEESQTLIDLPRGIVEARSDLELRPLW--

Query:  ---VTSSSKLKADDYSNRNLLAIPAGIKQKHYVDSIVRKFIPENFTIILFHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLQPAIVSIYDYIFLW
             SSSKLKADDYS RNLLAIPAGIKQK  VD+IV+KFIPENFTIILFHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLQPA+VS+YD+IFLW
Subjt:  ---VTSSSKLKADDYSNRNLLAIPAGIKQKHYVDSIVRKFIPENFTIILFHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLQPAIVSIYDYIFLW

Query:  DEDLGVEHFSPRRYLDIVKSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSVKCSDESDEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLV
        DEDLGVEHF PRRYL+IVKSEGLEISQPAL PNS+ IHHRIT+RARTKK+HRRVYD+RG+VKCSD+SDEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLV
Subjt:  DEDLGVEHFSPRRYLDIVKSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSVKCSDESDEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLV

Query:  HGWGMDMKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRTSKASSKAAELAKKQSPVPPGDVRTEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQ
        HGWGMDMKLGYCAQGDRTKKVGVIDSQYI HKGIQTLGG  R SK  SK A  AKKQ+PV   DVRTEIRRQSTWEL+IFKDRWNKAVAEDE W+DPFKQ
Subjt:  HGWGMDMKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRTSKASSKAAELAKKQSPVPPGDVRTEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQ

Query:  HSLNSDER-RRIRRHRH
        +S  SD+R R  RRH H
Subjt:  HSLNSDER-RRIRRHRH

XP_031741096.1 uncharacterized protein LOC101217607 isoform X2 [Cucumis sativus]1.4e-20786.37Show/hide
Query:  MKPPKSWDMLFRKRNLFSDGGKYGFKMKQLPFMGVICAVMLFIVYRTTNYQYHQTKIETTLQPFETTKDFGEESQTLIDLPRGIVEARSDLELRPLWVTS
        MKP KSWD+L RKRNLFSD GKYGFKMKQLPFMGVIC+VMLFI+YRTTNYQY QTKIET LQPF+T KD+ EESQ L  LPRGIVEARSDLELRPLW TS
Subjt:  MKPPKSWDMLFRKRNLFSDGGKYGFKMKQLPFMGVICAVMLFIVYRTTNYQYHQTKIETTLQPFETTKDFGEESQTLIDLPRGIVEARSDLELRPLWVTS

Query:  SSKLKADDYSNRNLLAIPAGIKQKHYVDSIVRKFIPENFTIILFHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLQPAIVSIYDYIFLWDEDLGV
        SS+LK  DYSNRNLLAIP GIKQK  V+SIV+KFIPENFTIILFHYDGNVDGWWDLDW NDAIHIA RNQTKWWYAKRFLQPA+VSIYDYIFLWDEDLGV
Subjt:  SSKLKADDYSNRNLLAIPAGIKQKHYVDSIVRKFIPENFTIILFHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLQPAIVSIYDYIFLWDEDLGV

Query:  EHFSPRRYLDIVKSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSVKCSDESDEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMD
        EHFSPRRYL+IVKSEGLEISQPALDPNSTDIHHRIT+RARTKKIHRRVYDLRG+VKCSDES+EPPCTGFVEGMAPVFS+SAW+CTWHLIQNDLVHGWGMD
Subjt:  EHFSPRRYLDIVKSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSVKCSDESDEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMD

Query:  MKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRTSKASSKAAELAKKQSPVPPGDVRTEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQHSLNSD
        MKLGYCAQGDRTK VGVIDSQYIVHKGIQTLGGGG  SK  SKAA  AK  +P+ P DVRTEIRRQSTWELQIFK+RWNKAVAED++W+DPFK +SL SD
Subjt:  MKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRTSKASSKAAELAKKQSPVPPGDVRTEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQHSLNSD

Query:  ERRRIRRH-RH
        ERRR RR  RH
Subjt:  ERRRIRRH-RH

XP_038894249.1 uncharacterized protein LOC120082912 [Benincasa hispida]5.8e-22291.22Show/hide
Query:  MKPPKSWDMLFRKRNLFSDGGKYGFKMKQLPFMGVICAVMLFIVYRTTNYQYHQTKIETTLQPFETTKDFGEESQTLIDLPRGIVEARSDLELRPLWVTS
        MKP KSWD LFRKRNLFSDGGKYGFKMKQLPFMGVIC+VMLFIVYRTTNYQYHQTKIETTLQPFETTKDFGEESQ L  LPRGIVEARSDLELRPLW TS
Subjt:  MKPPKSWDMLFRKRNLFSDGGKYGFKMKQLPFMGVICAVMLFIVYRTTNYQYHQTKIETTLQPFETTKDFGEESQTLIDLPRGIVEARSDLELRPLWVTS

Query:  SSKLKADDYSNRNLLAIPAGIKQKHYVDSIVRKFIPENFTIILFHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLQPAIVSIYDYIFLWDEDLGV
        SS+LKA DYSNR LLAIPAGIKQK  V+SIV+KFIP NFTIILFHYDGNVDGWWDLDW NDAIHIAARNQTKWWYAKRFLQPA+VSIYDYIFLWDEDLGV
Subjt:  SSKLKADDYSNRNLLAIPAGIKQKHYVDSIVRKFIPENFTIILFHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLQPAIVSIYDYIFLWDEDLGV

Query:  EHFSPRRYLDIVKSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSVKCSDESDEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMD
        EHFSPRRYL+I KSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSVKCSDES+EPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMD
Subjt:  EHFSPRRYLDIVKSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSVKCSDESDEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMD

Query:  MKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRTSKASSKAAELAKKQSPVPPGDVRTEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQHSLNSD
        MKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGR SK+SSKA+E AKK SP+ PGDVRTEIRRQSTWELQIFK RWNKAVAEDE+W+DPFK++SL SD
Subjt:  MKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRTSKASSKAAELAKKQSPVPPGDVRTEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQHSLNSD

Query:  ERRRIRRHRH
        +RRR RR RH
Subjt:  ERRRIRRHRH

TrEMBL top hitse value%identityAlignment
A0A0A0LXB6 Uncharacterized protein1.9e-21086.86Show/hide
Query:  MKPPKSWDMLFRKRNLFSDGGKYGFKMKQLPFMGVICAVMLFIVYRTTNYQYHQTKIETTLQPFETTKDFGEESQTLIDLPRGIVEARSDLELRPLWVTS
        MKP KSWD+L RKRNLFSD GKYGFKMKQLPFMGVIC+VMLFI+YRTTNYQY QTKIET LQPF+T KD+ EESQ L  LPRGIVEARSDLELRPLW TS
Subjt:  MKPPKSWDMLFRKRNLFSDGGKYGFKMKQLPFMGVICAVMLFIVYRTTNYQYHQTKIETTLQPFETTKDFGEESQTLIDLPRGIVEARSDLELRPLWVTS

Query:  SSKLKADDYSNRNLLAIPAGIKQKHYVDSIVRKFIPENFTIILFHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLQPAIVSIYDYIFLWDEDLGV
        SS+LK  DYSNRNLLAIP GIKQK  V+SIV+KFIPENFTIILFHYDGNVDGWWDLDW NDAIHIA RNQTKWWYAKRFLQPA+VSIYDYIFLWDEDLGV
Subjt:  SSKLKADDYSNRNLLAIPAGIKQKHYVDSIVRKFIPENFTIILFHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLQPAIVSIYDYIFLWDEDLGV

Query:  EHFSPRRYLDIVKSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSVKCSDESDEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMD
        EHFSPRRYL+IVKSEGLEISQPALDPNSTDIHHRIT+RARTKKIHRRVYDLRG+VKCSDES+EPPCTGFVEGMAPVFS+SAW+CTWHLIQNDLVHGWGMD
Subjt:  EHFSPRRYLDIVKSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSVKCSDESDEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMD

Query:  MKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRTSKASSKAAELAKKQSPVPPGDVRTEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQHSLNSD
        MKLGYCAQGDRTK VGVIDSQYIVHKGIQTLGGGG  SK  SKAA  AKKQ+P+ P DVRTEIRRQSTWELQIFK+RWNKAVAED++W+DPFK +SL SD
Subjt:  MKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRTSKASSKAAELAKKQSPVPPGDVRTEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQHSLNSD

Query:  ERRRIRRH-RH
        ERRR RR  RH
Subjt:  ERRRIRRH-RH

A0A1S3B5L5 uncharacterized protein LOC1034860653.8e-21187.56Show/hide
Query:  MKPPKSWDMLFRKRNLFSDGGKYGFKMKQLPFMGVICAVMLFIVYRTTNYQYHQTKIETTLQPFETTKDFGEESQTLIDLPRGIVEARSDLELRPLWVTS
        MKP KSWDML RKRNLFSDGGKYGFKMKQLPFM VIC+VMLFIVYRTTNYQY QTKIETTL PF+TTK+F EES  L  LPRGIVEARSDLELRPLW TS
Subjt:  MKPPKSWDMLFRKRNLFSDGGKYGFKMKQLPFMGVICAVMLFIVYRTTNYQYHQTKIETTLQPFETTKDFGEESQTLIDLPRGIVEARSDLELRPLWVTS

Query:  SSKLKADDYSNRNLLAIPAGIKQKHYVDSIVRKFIPENFTIILFHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLQPAIVSIYDYIFLWDEDLGV
        SS+LK  DYSNRNLLAIP GIKQK  V+SIV+KFIP NFTIILFHYDGNVDGWWDLDW NDAIHIA RNQTKWWYAKRFLQPA+VSIYDYIFLWDEDLGV
Subjt:  SSKLKADDYSNRNLLAIPAGIKQKHYVDSIVRKFIPENFTIILFHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLQPAIVSIYDYIFLWDEDLGV

Query:  EHFSPRRYLDIVKSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSVKCSDESDEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMD
        EHFSPRRYL+IVKSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRG+VKCSDES+EPPCTGFVEGMAPVFS+SAW+CTWHLIQNDLVHGWGMD
Subjt:  EHFSPRRYLDIVKSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSVKCSDESDEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMD

Query:  MKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRTSKASSKAAELAKKQSPVPPGDVRTEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQHSLNSD
        MKLGYCAQGDRTK VGVIDSQYIVHKGIQTLGGGG  SK  SKAA  AKK SP+ PGDVRTEIRRQSTWELQIFK+RWNKAVAED++W+DPFK +SL SD
Subjt:  MKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRTSKASSKAAELAKKQSPVPPGDVRTEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQHSLNSD

Query:  ERRRIRRHRH
        ERRR R+ RH
Subjt:  ERRRIRRHRH

A0A6J1CGR5 uncharacterized protein LOC1110106964.5e-20484.41Show/hide
Query:  MKPPKSWDMLFRKRNLFSDGGKYGFKMKQLPFMGVICAVMLFIVYRTTNYQYHQTKIETTLQPFETTK-DFGEESQTLIDLPRGIVEARSDLELRPLW--
        MKP KSW+MLFRK+NLFSDGG+YGFKMKQLPFMGVIC VMLFIVYRTTNYQY  T+IETTLQPF+TTK DFGE S  L  LPRGI+EARSDLELRPLW  
Subjt:  MKPPKSWDMLFRKRNLFSDGGKYGFKMKQLPFMGVICAVMLFIVYRTTNYQYHQTKIETTLQPFETTK-DFGEESQTLIDLPRGIVEARSDLELRPLW--

Query:  ---VTSSSKLKADDYSNRNLLAIPAGIKQKHYVDSIVRKFIPENFTIILFHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLQPAIVSIYDYIFLW
             SSSKLKADDYS RNLLAIPAGIKQK  VD+IV+KFIPENFTIILFHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLQPA+VS+YD+IFLW
Subjt:  ---VTSSSKLKADDYSNRNLLAIPAGIKQKHYVDSIVRKFIPENFTIILFHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLQPAIVSIYDYIFLW

Query:  DEDLGVEHFSPRRYLDIVKSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSVKCSDESDEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLV
        DEDLGVEHF PRRYL+IVKSEGLEISQPAL PNS+ IHHRIT+RARTKK+HRRVYD+RG+VKCSD+SDEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLV
Subjt:  DEDLGVEHFSPRRYLDIVKSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSVKCSDESDEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLV

Query:  HGWGMDMKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRTSKASSKAAELAKKQSPVPPGDVRTEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQ
        HGWGMDMKLGYCAQGDRTKKVGVIDSQYI HKGIQTLGG  R SK  SK A  AKKQ+PV   DVRTEIRRQSTWEL+IFKDRWNKAVAEDE W+DPFKQ
Subjt:  HGWGMDMKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRTSKASSKAAELAKKQSPVPPGDVRTEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQ

Query:  HSLNSDER-RRIRRHRH
        +S  SD+R R  RRH H
Subjt:  HSLNSDER-RRIRRHRH

A0A6J1EKI2 uncharacterized protein LOC111434159 isoform X17.2e-20285.39Show/hide
Query:  MKPPKSWDMLFRKRNLFSDGGKYGFKMKQLPFMGVICAVMLFIVYRTTNYQYHQTKIETTLQPFETTKDFGEESQTLIDLPRGIVEARSDLELRPLWVTS
        MKP KSW ML RK++LFSDGGK  F+MKQL FM VIC VMLFIVYRTTNYQY QTKIETTLQPF+ T+  GEE Q L  LP GIVEARSDLELRPLW TS
Subjt:  MKPPKSWDMLFRKRNLFSDGGKYGFKMKQLPFMGVICAVMLFIVYRTTNYQYHQTKIETTLQPFETTKDFGEESQTLIDLPRGIVEARSDLELRPLWVTS

Query:  SSKLKADDYSNRNLLAIPAGIKQKHYVDSIVRKFIPENFTIILFHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLQPAIVSIYDYIFLWDEDLGV
        +S+L+A+DYSNRNLLAIP GIKQK  VDSIV+KF+PENFTIILFHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLQPA+V IYDYIFLWDEDLGV
Subjt:  SSKLKADDYSNRNLLAIPAGIKQKHYVDSIVRKFIPENFTIILFHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLQPAIVSIYDYIFLWDEDLGV

Query:  EHFSPRRYLDIVKSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSVKCSDESDEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMD
        EHFSPRRYL+I KSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGS KC+D+S+ PPCTGFVEGMAPVFSRSAWYC WHLIQNDLVHGWGMD
Subjt:  EHFSPRRYLDIVKSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSVKCSDESDEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMD

Query:  MKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRTSKASSKAAELAKKQSPVPPGDVRTEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQHSL
        MKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGG  S+ S KAAE AKK S +PP DVRTEIRRQSTWELQIFK+RWNKAVAED++W+DPFK+ SL
Subjt:  MKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRTSKASSKAAELAKKQSPVPPGDVRTEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQHSL

A0A6J1KEU2 uncharacterized protein LOC1114951794.7e-20184.16Show/hide
Query:  MKPPKSWDMLFRKRNLFSDGGKYGFKMKQLPFMGVICAVMLFIVYRTTNYQYHQTKIETTLQPFETTKDFGEESQTLIDLPRGIVEARSDLELRPLWVTS
        MKP KSW ML RK++LFSDGGK  F+MKQL FM VIC VMLFIVYRTTNYQY QTKIETTLQPF+ T+  GEE Q L  LP GIVEARSDLELRPLW TS
Subjt:  MKPPKSWDMLFRKRNLFSDGGKYGFKMKQLPFMGVICAVMLFIVYRTTNYQYHQTKIETTLQPFETTKDFGEESQTLIDLPRGIVEARSDLELRPLWVTS

Query:  SSKLKADDYSNRNLLAIPAGIKQKHYVDSIVRKFIPENFTIILFHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLQPAIVSIYDYIFLWDEDLGV
        +S+L+A+DYSNRNLLAIP GIKQK  VDSIV+KF+PENFTIILFHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLQPA+V IYDYIFLWDEDLGV
Subjt:  SSKLKADDYSNRNLLAIPAGIKQKHYVDSIVRKFIPENFTIILFHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLQPAIVSIYDYIFLWDEDLGV

Query:  EHFSPRRYLDIVKSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSVKCSDESDEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMD
        EHFSPRRYL+I KSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGS KC+D+S+ PPCTGFVEGMAPVFSRSAWYC WHLIQNDLVHGWGMD
Subjt:  EHFSPRRYLDIVKSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSVKCSDESDEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMD

Query:  MKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRTSKASSKAAELAKKQSPVPPGDVRTEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQ---HSL
        MKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGG  S+ S KAAE AKK S +PP DVRTEIRRQSTWELQIFK+RWNKAVAED++W+DPFK+     L
Subjt:  MKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRTSKASSKAAELAKKQSPVPPGDVRTEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQ---HSL

Query:  NSDE
          DE
Subjt:  NSDE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G11170.1 Protein of unknown function (DUF707)1.1e-14660.1Show/hide
Query:  MKPPKSWDMLFRKRNLFSDGGKYGFKMKQLPFMGVICAVMLFIVYRTTNYQYHQTKIETTLQPFETTKDFGEESQTLIDLPRGIVEARSDLELRPLWVTS
        MK  KS   L +++    +  K G+KMK  PF+ ++C  +L   Y+TTN Q+ QT+IE T  PF+  K+    +  L  LPRGI+++RSDLEL+PLW   
Subjt:  MKPPKSWDMLFRKRNLFSDGGKYGFKMKQLPFMGVICAVMLFIVYRTTNYQYHQTKIETTLQPFETTKDFGEESQTLIDLPRGIVEARSDLELRPLWVTS

Query:  SSKLKADDYSNRNLLAIPAGIKQKHYVDSIVRKFIPENFTIILFHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLQPAIVSIYDYIFLWDEDLGV
        S + K  + +NRNLLAIP G+KQK  VD++V+KF+P NFTI+LFHYDGN+D WWDL+WS+ +IHI A+NQTKWW+AKRFL P +VSIYDYIFLWDEDLGV
Subjt:  SSKLKADDYSNRNLLAIPAGIKQKHYVDSIVRKFIPENFTIILFHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLQPAIVSIYDYIFLWDEDLGV

Query:  EHFSPRRYLDIVKSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSVKCSDESDEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMD
        E+F+P RYL IVKS GLEISQPALD NST+IHH+IT+R++TKK HRRVY  RG  +CS+ S +PPCTGFVEGMAPVFS++AW CTW+LIQNDLVHGWGMD
Subjt:  EHFSPRRYLDIVKSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSVKCSDESDEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMD

Query:  MKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRTSKASSKAAELAKKQSPVPPGDVRTEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQHSL---
        MKLGYCAQGDRTK VG++DS+YI+H+GIQTLG      K +  A ++  ++      D RTEIRRQSTWELQ FK+RW+KAV ED  WIDP    S+   
Subjt:  MKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRTSKASSKAAELAKKQSPVPPGDVRTEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQHSL---

Query:  -NSDERRRIRR
         ++   RR+RR
Subjt:  -NSDERRRIRR

AT1G61240.1 Protein of unknown function (DUF707)3.6e-15362.28Show/hide
Query:  KSWDMLFRKRNLFSDGGKYGFKMKQLPFMGVICAVMLFIVYRTTNYQYHQTKIETTLQPFETTKDFGEESQTLIDLPRGIVEARSDLELRPLWVTSSSKL
        KSW  L ++RN   +GGK  +KMK  P + ++C V+L   Y+TTN QY QT+IE T  PFE  K+    S+ L  LP GI++ +SDLEL+PLW +SS + 
Subjt:  KSWDMLFRKRNLFSDGGKYGFKMKQLPFMGVICAVMLFIVYRTTNYQYHQTKIETTLQPFETTKDFGEESQTLIDLPRGIVEARSDLELRPLWVTSSSKL

Query:  KADDYSNRNLLAIPAGIKQKHYVDSIVRKFIPENFTIILFHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLQPAIVSIYDYIFLWDEDLGVEHFS
        K+ + +NRNLLA+P G+KQK  VD++V+KF+P NFT+ILFHYDGN+D WWDL+WS+ AIHI A NQTKWW+AKRFL P IVSIYDY+FLWDEDLGVE+F+
Subjt:  KADDYSNRNLLAIPAGIKQKHYVDSIVRKFIPENFTIILFHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLQPAIVSIYDYIFLWDEDLGVEHFS

Query:  PRRYLDIVKSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSVKCSDESDEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMDMKLG
        P++YL IVK+ GLEISQPAL PNST++HHRIT+R+RTK  HRRVYD RG++KCS+ S+ PPCTGFVEGMAPVFSRSAW+CTW+LIQNDLVHGWGMDMKLG
Subjt:  PRRYLDIVKSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSVKCSDESDEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMDMKLG

Query:  YCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRTSKASSKAAELAKKQSPVPPGDVRTEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQHSLNSDERRR
        YCAQGDR+KKVG++DS+YI H+GIQTLGG G   K +S  + + +++      D RTEIRRQSTWELQ FK+RWN+AVAED+ W++            RR
Subjt:  YCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRTSKASSKAAELAKKQSPVPPGDVRTEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQHSLNSDERRR

Query:  IRR
        ++R
Subjt:  IRR

AT1G61240.2 Protein of unknown function (DUF707)3.6e-15362.28Show/hide
Query:  KSWDMLFRKRNLFSDGGKYGFKMKQLPFMGVICAVMLFIVYRTTNYQYHQTKIETTLQPFETTKDFGEESQTLIDLPRGIVEARSDLELRPLWVTSSSKL
        KSW  L ++RN   +GGK  +KMK  P + ++C V+L   Y+TTN QY QT+IE T  PFE  K+    S+ L  LP GI++ +SDLEL+PLW +SS + 
Subjt:  KSWDMLFRKRNLFSDGGKYGFKMKQLPFMGVICAVMLFIVYRTTNYQYHQTKIETTLQPFETTKDFGEESQTLIDLPRGIVEARSDLELRPLWVTSSSKL

Query:  KADDYSNRNLLAIPAGIKQKHYVDSIVRKFIPENFTIILFHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLQPAIVSIYDYIFLWDEDLGVEHFS
        K+ + +NRNLLA+P G+KQK  VD++V+KF+P NFT+ILFHYDGN+D WWDL+WS+ AIHI A NQTKWW+AKRFL P IVSIYDY+FLWDEDLGVE+F+
Subjt:  KADDYSNRNLLAIPAGIKQKHYVDSIVRKFIPENFTIILFHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLQPAIVSIYDYIFLWDEDLGVEHFS

Query:  PRRYLDIVKSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSVKCSDESDEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMDMKLG
        P++YL IVK+ GLEISQPAL PNST++HHRIT+R+RTK  HRRVYD RG++KCS+ S+ PPCTGFVEGMAPVFSRSAW+CTW+LIQNDLVHGWGMDMKLG
Subjt:  PRRYLDIVKSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSVKCSDESDEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMDMKLG

Query:  YCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRTSKASSKAAELAKKQSPVPPGDVRTEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQHSLNSDERRR
        YCAQGDR+KKVG++DS+YI H+GIQTLGG G   K +S  + + +++      D RTEIRRQSTWELQ FK+RWN+AVAED+ W++            RR
Subjt:  YCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRTSKASSKAAELAKKQSPVPPGDVRTEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQHSLNSDERRR

Query:  IRR
        ++R
Subjt:  IRR

AT1G61240.3 Protein of unknown function (DUF707)3.6e-15362.28Show/hide
Query:  KSWDMLFRKRNLFSDGGKYGFKMKQLPFMGVICAVMLFIVYRTTNYQYHQTKIETTLQPFETTKDFGEESQTLIDLPRGIVEARSDLELRPLWVTSSSKL
        KSW  L ++RN   +GGK  +KMK  P + ++C V+L   Y+TTN QY QT+IE T  PFE  K+    S+ L  LP GI++ +SDLEL+PLW +SS + 
Subjt:  KSWDMLFRKRNLFSDGGKYGFKMKQLPFMGVICAVMLFIVYRTTNYQYHQTKIETTLQPFETTKDFGEESQTLIDLPRGIVEARSDLELRPLWVTSSSKL

Query:  KADDYSNRNLLAIPAGIKQKHYVDSIVRKFIPENFTIILFHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLQPAIVSIYDYIFLWDEDLGVEHFS
        K+ + +NRNLLA+P G+KQK  VD++V+KF+P NFT+ILFHYDGN+D WWDL+WS+ AIHI A NQTKWW+AKRFL P IVSIYDY+FLWDEDLGVE+F+
Subjt:  KADDYSNRNLLAIPAGIKQKHYVDSIVRKFIPENFTIILFHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLQPAIVSIYDYIFLWDEDLGVEHFS

Query:  PRRYLDIVKSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSVKCSDESDEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMDMKLG
        P++YL IVK+ GLEISQPAL PNST++HHRIT+R+RTK  HRRVYD RG++KCS+ S+ PPCTGFVEGMAPVFSRSAW+CTW+LIQNDLVHGWGMDMKLG
Subjt:  PRRYLDIVKSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSVKCSDESDEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMDMKLG

Query:  YCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRTSKASSKAAELAKKQSPVPPGDVRTEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQHSLNSDERRR
        YCAQGDR+KKVG++DS+YI H+GIQTLGG G   K +S  + + +++      D RTEIRRQSTWELQ FK+RWN+AVAED+ W++            RR
Subjt:  YCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRTSKASSKAAELAKKQSPVPPGDVRTEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQHSLNSDERRR

Query:  IRR
        ++R
Subjt:  IRR

AT1G61240.4 Protein of unknown function (DUF707)3.6e-15362.28Show/hide
Query:  KSWDMLFRKRNLFSDGGKYGFKMKQLPFMGVICAVMLFIVYRTTNYQYHQTKIETTLQPFETTKDFGEESQTLIDLPRGIVEARSDLELRPLWVTSSSKL
        KSW  L ++RN   +GGK  +KMK  P + ++C V+L   Y+TTN QY QT+IE T  PFE  K+    S+ L  LP GI++ +SDLEL+PLW +SS + 
Subjt:  KSWDMLFRKRNLFSDGGKYGFKMKQLPFMGVICAVMLFIVYRTTNYQYHQTKIETTLQPFETTKDFGEESQTLIDLPRGIVEARSDLELRPLWVTSSSKL

Query:  KADDYSNRNLLAIPAGIKQKHYVDSIVRKFIPENFTIILFHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLQPAIVSIYDYIFLWDEDLGVEHFS
        K+ + +NRNLLA+P G+KQK  VD++V+KF+P NFT+ILFHYDGN+D WWDL+WS+ AIHI A NQTKWW+AKRFL P IVSIYDY+FLWDEDLGVE+F+
Subjt:  KADDYSNRNLLAIPAGIKQKHYVDSIVRKFIPENFTIILFHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLQPAIVSIYDYIFLWDEDLGVEHFS

Query:  PRRYLDIVKSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSVKCSDESDEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMDMKLG
        P++YL IVK+ GLEISQPAL PNST++HHRIT+R+RTK  HRRVYD RG++KCS+ S+ PPCTGFVEGMAPVFSRSAW+CTW+LIQNDLVHGWGMDMKLG
Subjt:  PRRYLDIVKSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSVKCSDESDEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMDMKLG

Query:  YCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRTSKASSKAAELAKKQSPVPPGDVRTEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQHSLNSDERRR
        YCAQGDR+KKVG++DS+YI H+GIQTLGG G   K +S  + + +++      D RTEIRRQSTWELQ FK+RWN+AVAED+ W++            RR
Subjt:  YCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRTSKASSKAAELAKKQSPVPPGDVRTEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQHSLNSDERRR

Query:  IRR
        ++R
Subjt:  IRR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGCCGCCTAAATCATGGGACATGCTGTTCAGGAAAAGGAATCTATTTTCTGATGGGGGAAAATACGGGTTCAAAATGAAGCAGCTTCCATTTATGGGTGTTATTTG
TGCTGTAATGCTGTTTATTGTATACAGAACTACAAATTACCAATATCATCAGACAAAGATCGAAACAACCTTGCAACCCTTTGAGACCACAAAGGACTTCGGAGAGGAGT
CTCAAACCTTGATTGATTTGCCACGTGGCATAGTAGAAGCTAGATCAGATTTGGAGTTGAGACCTCTATGGGTAACTAGTAGTTCAAAGTTAAAGGCTGATGATTACAGC
AACCGTAATTTGCTTGCAATTCCAGCTGGCATTAAACAAAAACATTATGTCGATTCTATTGTACGAAAATTTATTCCAGAGAACTTTACTATTATACTCTTTCATTATGA
TGGCAATGTGGATGGATGGTGGGATCTAGACTGGAGTAATGATGCCATACATATAGCTGCTCGAAACCAAACAAAGTGGTGGTATGCAAAGCGCTTTTTGCAACCGGCAA
TTGTGTCCATTTATGATTACATATTTCTTTGGGATGAAGATTTGGGGGTCGAACATTTCAGTCCAAGAAGATACCTGGACATTGTAAAGTCTGAAGGGCTGGAAATATCT
CAGCCTGCCTTGGACCCAAATTCGACTGACATACATCATAGAATTACTATTCGTGCTCGAACAAAGAAGATACACAGAAGAGTTTATGATCTCAGAGGCAGTGTGAAATG
TTCAGATGAAAGTGATGAGCCACCTTGCACTGGATTTGTAGAAGGGATGGCTCCGGTATTCTCAAGATCGGCCTGGTATTGTACTTGGCATCTGATACAGAATGATCTTG
TCCATGGATGGGGAATGGATATGAAACTTGGGTATTGTGCACAGGGTGATCGGACGAAGAAGGTGGGAGTAATCGATAGTCAGTATATTGTTCACAAGGGCATACAGACT
TTGGGTGGAGGCGGAAGAACGTCCAAGGCTTCTTCAAAAGCTGCAGAGTTGGCAAAGAAACAAAGCCCCGTACCACCTGGTGATGTTCGAACAGAGATAAGGAGGCAATC
AACATGGGAACTTCAGATCTTCAAAGATCGATGGAACAAAGCGGTAGCAGAGGACGAGAATTGGATCGATCCATTTAAACAACATTCATTAAATAGTGACGAAAGACGGA
GGATTCGAAGACACCGACACCGCTAA
mRNA sequenceShow/hide mRNA sequence
GTCAGGTTGAGAGTCTGAGACCAAAACTCGAAACGCTTGCTTTTGGTATGGAACGACATGCAGCGAAGAGAAGCTAAAGCTAAGTTGCAAAATGGGAAAGTGTCTTCTTC
CCTTAAAACGTGGCGCTTCCTACTATTTCGTTCTTGACTCTCTCCATTTCTTTCATCATCTTCTTTCACACCTTTTCTCTCTCTCTCTCTCTCCCCCTCCCTCTGTTTCT
CTGTCGTTTTCATTTTTTGCTCATCGATCACTTTCTTTTCCCCCCTTTTCATGTTTTCTGTGATTCTACACTGTTTTTGATCCACTGGGGTGTGACCCTTTGGTGGGTTT
AGCCAAGGATCTCTGTTCTCTGTATTTGACGTTTCAGCTGGACTCTACCGTTCTCTGTAACTGGGAATCTCGAAGGAATTCGGATCTTGTGGAAACTATCAAATGATTGG
TTCGGATTTTTAAGCTGAAGTGTCGCTTCTCTTGTCCCAAATGAGAATGGGAATATGTTATAGCGGCCTTTATGAACTCAAGATAGGAGGTTTTGCTGTCTATGAAGCCG
CCTAAATCATGGGACATGCTGTTCAGGAAAAGGAATCTATTTTCTGATGGGGGAAAATACGGGTTCAAAATGAAGCAGCTTCCATTTATGGGTGTTATTTGTGCTGTAAT
GCTGTTTATTGTATACAGAACTACAAATTACCAATATCATCAGACAAAGATCGAAACAACCTTGCAACCCTTTGAGACCACAAAGGACTTCGGAGAGGAGTCTCAAACCT
TGATTGATTTGCCACGTGGCATAGTAGAAGCTAGATCAGATTTGGAGTTGAGACCTCTATGGGTAACTAGTAGTTCAAAGTTAAAGGCTGATGATTACAGCAACCGTAAT
TTGCTTGCAATTCCAGCTGGCATTAAACAAAAACATTATGTCGATTCTATTGTACGAAAATTTATTCCAGAGAACTTTACTATTATACTCTTTCATTATGATGGCAATGT
GGATGGATGGTGGGATCTAGACTGGAGTAATGATGCCATACATATAGCTGCTCGAAACCAAACAAAGTGGTGGTATGCAAAGCGCTTTTTGCAACCGGCAATTGTGTCCA
TTTATGATTACATATTTCTTTGGGATGAAGATTTGGGGGTCGAACATTTCAGTCCAAGAAGATACCTGGACATTGTAAAGTCTGAAGGGCTGGAAATATCTCAGCCTGCC
TTGGACCCAAATTCGACTGACATACATCATAGAATTACTATTCGTGCTCGAACAAAGAAGATACACAGAAGAGTTTATGATCTCAGAGGCAGTGTGAAATGTTCAGATGA
AAGTGATGAGCCACCTTGCACTGGATTTGTAGAAGGGATGGCTCCGGTATTCTCAAGATCGGCCTGGTATTGTACTTGGCATCTGATACAGAATGATCTTGTCCATGGAT
GGGGAATGGATATGAAACTTGGGTATTGTGCACAGGGTGATCGGACGAAGAAGGTGGGAGTAATCGATAGTCAGTATATTGTTCACAAGGGCATACAGACTTTGGGTGGA
GGCGGAAGAACGTCCAAGGCTTCTTCAAAAGCTGCAGAGTTGGCAAAGAAACAAAGCCCCGTACCACCTGGTGATGTTCGAACAGAGATAAGGAGGCAATCAACATGGGA
ACTTCAGATCTTCAAAGATCGATGGAACAAAGCGGTAGCAGAGGACGAGAATTGGATCGATCCATTTAAACAACATTCATTAAATAGTGACGAAAGACGGAGGATTCGAA
GACACCGACACCGCTAATTCAGCAAGCAAGCATCCTAGCTGAGCGTCATCTACATGGGGGTTGTTACTTCCTTTGAACACATATTATCATTGCCATACTGCAATAGTTGT
AGTAGTCCATCCTAATTTTTTCATTATTTGAGCATTCAATTTTGTCCCCACAGATGGGTTTTCCTTTGTATGTAGCAGTGTAGCTTCCTCCCCCAGAGCAGAAGGAAGAA
AGAGAGAGAGAGGACAAGAAAATAAGGAAAAAGAAAGATAAAAAAATATTGAAAGAAAATTGTAACTATTGGTATAGTTTTACTGATCCCTTATGTACTTTTATGAGATA
ACATGATTAAATTCTTGGGGAACAGGAATTAAATCTGGCCAACTGGTCTGTTTTTTACATGATATATACGGTCAAGGGGTTTCTTTCTCACTTGTTACTTGGGCATCTTT
GAGGTGGGGACTAGGACGGGAATCTTTAACCTTTTGATCGAGAATATGCTTCAATCAGTTGAGTGGACGCAATTGGCTTTTCTTCCGTATGTATATGCATGTATTTTATG
AAA
Protein sequenceShow/hide protein sequence
MKPPKSWDMLFRKRNLFSDGGKYGFKMKQLPFMGVICAVMLFIVYRTTNYQYHQTKIETTLQPFETTKDFGEESQTLIDLPRGIVEARSDLELRPLWVTSSSKLKADDYS
NRNLLAIPAGIKQKHYVDSIVRKFIPENFTIILFHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLQPAIVSIYDYIFLWDEDLGVEHFSPRRYLDIVKSEGLEIS
QPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSVKCSDESDEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMDMKLGYCAQGDRTKKVGVIDSQYIVHKGIQT
LGGGGRTSKASSKAAELAKKQSPVPPGDVRTEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQHSLNSDERRRIRRHRHR