; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0005173 (gene) of Snake gourd v1 genome

Gene IDTan0005173
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionS-protein homolog
Genome locationLG06:16969943..16980581
RNA-Seq ExpressionTan0005173
SyntenyTan0005173
Gene Ontology termsGO:0060320 - rejection of self pollen (biological process)
GO:0005576 - extracellular region (cellular component)
GO:0016020 - membrane (cellular component)
InterPro domainsIPR010264 - Plant self-incompatibility S1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8646229.1 hypothetical protein Csa_016318 [Cucumis sativus]1.2e-5743.84Show/hide
Query:  AELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHN-ISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAFWIESRS-IWLFDKCF-ETNCIWTAKD
        A++  W + ++NGLS+   L VHC+SK+DDLG H+ +  G  + W F+ NFW TTLFWC L+K +A +VSFE FW ES S  WL D+C  E  CIW AKD
Subjt:  AELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHN-ISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAFWIESRS-IWLFDKCF-ETNCIWTAKD

Query:  DGIYLKDNPTQVDVLIHRWKYKIIIRKTYDILIFRTCQFFTMEMKSMKKHSLVFSLVLLLLWAIFNPTTKAAGALLK-RWQIHIVNGLSNDQILLVHCKS
         GIYLK+NP   D  +H+W                      +  K M ++ +V  L+   L A    + K    L   R+ IH+ N L+N Q +  HC+S
Subjt:  DGIYLKDNPTQVDVLIHRWKYKIIIRKTYDILIFRTCQFFTMEMKSMKKHSLVFSLVLLLLWAIFNPTTKAAGALLK-RWQIHIVNGLSNDQILLVHCKS

Query:  EDDDLG-EHNINVGTEFNWTFRVNIWNTTLFWCFLKKPNAQYVSFEAFWVEKTSIWLYYRCYNSN-CIWTAKDDGVYLKDNPVKRDVLVHKW
        +DDDLG +H ++ G EF W F+ N W TTLFWC L+K NA YVSF+ FW E+   WL  RC +   CIW+A+DDG+YL++ P + + +VHKW
Subjt:  EDDDLG-EHNINVGTEFNWTFRVNIWNTTLFWCFLKKPNAQYVSFEAFWVEKTSIWLYYRCYNSN-CIWTAKDDGVYLKDNPVKRDVLVHKW

XP_016902699.1 PREDICTED: uncharacterized protein LOC107991826 [Cucumis melo]1.2e-5467.13Show/hide
Query:  MKSMKKHFSV--LLIVLSLALLEPARAAELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAF
        M+SMK HF V   L VLSLA+++P +A  L++WHIH++NGLSN Q L VHC+SKDDDLG+  +SVG EF+WTF++NFW+TTLFWCYL+KPNA+ VSFEAF
Subjt:  MKSMKKHFSV--LLIVLSLALLEPARAAELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAF

Query:  WIESRSIWLFDKCFETNCIWTAKDDGIYLKDNPTQVDVLIHRW
        W+E +SIWLF +CF++NCIWTAKDDGIYLKDNP   D L+H W
Subjt:  WIESRSIWLFDKCFETNCIWTAKDDGIYLKDNPTQVDVLIHRW

XP_022143772.1 S-protein homolog 74-like [Momordica charantia]5.0e-5668.53Show/hide
Query:  MKSMKKHFSVLLIVLSLALLEPARA-AELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAFW
        ++S KKHF V L+VLSL +LEP  A +ELKKW IHV+NGLSNGQ L VHCKSKD+DLGEHN++ G EF+WTFRVN WNTTLFWCYL KP+ +  SF+ FW
Subjt:  MKSMKKHFSVLLIVLSLALLEPARA-AELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAFW

Query:  IESRSIWLFDKCFETNCIWTAKDDGIYLKDNPTQVDVLIHRWK
        +E +SIWLF +C+ +NCIWTAKDDGIYL+DNP Q D+L+H WK
Subjt:  IESRSIWLFDKCFETNCIWTAKDDGIYLKDNPTQVDVLIHRWK

XP_031745090.1 S-protein homolog 1-like [Cucumis sativus]1.2e-5471.43Show/hide
Query:  MKKHFSVLLIVLSLALLEPARAAELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAFWIESR
        MKK   VLL VL LA+LE  +A EL+KWHIHV+NGLSNGQ+LL HCKSKD+DLGE  +  G EF+W FRVNFWNTTLFWCYL+KPN Q  SFE+FWIESR
Subjt:  MKKHFSVLLIVLSLALLEPARAAELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAFWIESR

Query:  SIWLFDKCFETNCIWTAKDDGIYLKDN-PTQVDVLIHRWK
        S+WL+  CFE NCIWTAKDDGIYLKDN  T  D+LIH+W+
Subjt:  SIWLFDKCFETNCIWTAKDDGIYLKDN-PTQVDVLIHRWK

XP_038896594.1 S-protein homolog 1-like [Benincasa hispida]2.7e-5468.31Show/hide
Query:  MKSMKKHFSVLLIVLSLALLEPARAAELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAFWI
        M+ M K   VL  VL LA+L+  +AAEL KW IHV+NGLSNGQ+L VHCKSKD+DLGEH +SVG EF+W FRVNFWNTTLFWCYL+KPNAQ  SF+AFWI
Subjt:  MKSMKKHFSVLLIVLSLALLEPARAAELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAFWI

Query:  ESRSIWLFDKCFETNCIWTAKDDGIYLKDNPTQVDVLIHRWK
        ES S+WL++ C+++NCIW AKDDG+YLKDN    DVLIH+W+
Subjt:  ESRSIWLFDKCFETNCIWTAKDDGIYLKDNPTQVDVLIHRWK

TrEMBL top hitse value%identityAlignment
A0A1S4E390 S-protein homolog5.9e-5567.13Show/hide
Query:  MKSMKKHFSV--LLIVLSLALLEPARAAELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAF
        M+SMK HF V   L VLSLA+++P +A  L++WHIH++NGLSN Q L VHC+SKDDDLG+  +SVG EF+WTF++NFW+TTLFWCYL+KPNA+ VSFEAF
Subjt:  MKSMKKHFSV--LLIVLSLALLEPARAAELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAF

Query:  WIESRSIWLFDKCFETNCIWTAKDDGIYLKDNPTQVDVLIHRW
        W+E +SIWLF +CF++NCIWTAKDDGIYLKDNP   D L+H W
Subjt:  WIESRSIWLFDKCFETNCIWTAKDDGIYLKDNPTQVDVLIHRW

A0A5N5I9W3 S-protein homolog9.5e-4536.57Show/hide
Query:  LLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAFWIESRSIWLFDKCFETNCIWTAKDDGIYLKDNPTQVDVLIHRWKYK
        L+ HC+S DDDLG  +IS G EF+W+FR N   +TL+WC +   + Q  SF+ FW E    WL  +C    C W AKDDGIYL+  P   +    R K  
Subjt:  LLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAFWIESRSIWLFDKCFETNCIWTAKDDGIYLKDNPTQVDVLIHRWKYK

Query:  IIIRKTYDILIFRTCQFFTMEMKSMKKHSLVFSLVLLLLWAIFNPTTKAAGALLKRWQIHIVNGLSNDQILLVHCKSEDDDLGEHNINVGTEFNWTFRVN
        +     +  +   T +         +  +L  S   L   A F+          +RW +H+VN L   + L  HC+S++DD+G   I  G E  W+F+ N
Subjt:  IIIRKTYDILIFRTCQFFTMEMKSMKKHSLVFSLVLLLLWAIFNPTTKAAGALLKRWQIHIVNGLSNDQILLVHCKSEDDDLGEHNINVGTEFNWTFRVN

Query:  IWNTTLFWCFLKKPNAQYVSFEAFWVEKTSIWLYYRCYNSNCIWTAKDDGVYLKDNPVKRDVLVHKWQ
         + TTL+WC+ +  + ++ +F+ +W E    WL YRC    C W AKDDG Y++  P KRD L+HKW+
Subjt:  IWNTTLFWCFLKKPNAQYVSFEAFWVEKTSIWLYYRCYNSNCIWTAKDDGVYLKDNPVKRDVLVHKWQ

A0A6J1CPR8 S-protein homolog4.4e-5066.18Show/hide
Query:  KHFSVLLIVLSLALLEPARAAELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAFWIESRSI
        KHF V L V SLA++E   A  L KW IHV N LSN QML VHCKSK+DDLGEHN+SVG EF+W FRVN W+TTL+WCYL+KPN Q VSF+AFW+E  SI
Subjt:  KHFSVLLIVLSLALLEPARAAELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAFWIESRSI

Query:  WLFDKCFETNCIWTAKDDGIYLKDNPTQVDVLIHRW
        WL+ KC E+NC W AKDDGIYL++NP   DV +H+W
Subjt:  WLFDKCFETNCIWTAKDDGIYLKDNPTQVDVLIHRW

A0A6J1CQH6 S-protein homolog1.5e-5366.19Show/hide
Query:  MKKHFSVLLIVLSLALLEPARAAELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAFWIESR
        +KKHF V+L+ LSLA++EP  + ELK+W+IHV+NGL NG++L VHCKS+DDDLGE N+  GAEF WTFRVN  +TTLFWC+L+KP+AQ VSF+AFW+E  
Subjt:  MKKHFSVLLIVLSLALLEPARAAELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAFWIESR

Query:  SIWLFDKCFETNCIWTAKDDGIYLKDNPTQVDVLIHRWK
        SIWLF +C++ NCIWTAKDDG+YL+DNP Q DVL+H+W+
Subjt:  SIWLFDKCFETNCIWTAKDDGIYLKDNPTQVDVLIHRWK

A0A6J1CRU0 S-protein homolog2.4e-5668.53Show/hide
Query:  MKSMKKHFSVLLIVLSLALLEPARA-AELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAFW
        ++S KKHF V L+VLSL +LEP  A +ELKKW IHV+NGLSNGQ L VHCKSKD+DLGEHN++ G EF+WTFRVN WNTTLFWCYL KP+ +  SF+ FW
Subjt:  MKSMKKHFSVLLIVLSLALLEPARA-AELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAFW

Query:  IESRSIWLFDKCFETNCIWTAKDDGIYLKDNPTQVDVLIHRWK
        +E +SIWLF +C+ +NCIWTAKDDGIYL+DNP Q D+L+H WK
Subjt:  IESRSIWLFDKCFETNCIWTAKDDGIYLKDNPTQVDVLIHRWK

SwissProt top hitse value%identityAlignment
F2Q9V4 S-protein homolog 62.1e-1237.62Show/hide
Query:  NDQILLVHCKSEDDDLGEHNINVGTEFNWTFRVNIWNTTLFWCFLKKPNAQYVSFEAFWVEKTSIWLYYRCYNSNCIWTAKDDGVYLKDNPVKRDVLVHK
        ND +L VHCKS DDD G H +  G  + W F VN  N+TL++C   +   +   F+ +   + S     RC   NC W AK+DG+Y      K++ L +K
Subjt:  NDQILLVHCKSEDDDLGEHNINVGTEFNWTFRVNIWNTTLFWCFLKKPNAQYVSFEAFWVEKTSIWLYYRCYNSNCIWTAKDDGVYLKDNPVKRDVLVHK

Query:  W
        W
Subjt:  W

F4JLS0 S-protein homolog 11.8e-2442.24Show/hide
Query:  ELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAFWIESRSIWLFDKCFETNCIWTAKDDGIY
        ++ +W + V+NGL+ G+ L +HCKSK+DDLGE N+     F W F  N  ++T FWCY+ K N   ++   FW +   + LF +C   NCIWTAK DG+Y
Subjt:  ELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAFWIESRSIWLFDKCFETNCIWTAKDDGIY

Query:  LKDNPTQVDVLIHRWK
        L ++ +  DVL  +W+
Subjt:  LKDNPTQVDVLIHRWK

P0DN92 S-protein homolog 241.0e-1134.06Show/hide
Query:  FSVLLIVLSLALLEPARAAELK---KWHI-HVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAFWIESR
        F V ++V+SL   E  +  E K   + H+  V     N  +L +HCKS+DDDLG H ++ G  F W F VNF  +TL++C   +   +   FE +    R
Subjt:  FSVLLIVLSLALLEPARAAELK---KWHI-HVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAFWIESR

Query:  SIWLFDKCFETNCIWTAKDDGIYLKDNPTQVDVLIHRW
        +   F +C   NC W A+ DGIY          L + W
Subjt:  SIWLFDKCFETNCIWTAKDDGIYLKDNPTQVDVLIHRW

Q2HQ46 S-protein homolog 742.0e-2340.52Show/hide
Query:  ELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAFWIESRSIWLFDKCFETNCIWTAKDDGIY
        ++ +W + V NGL+ G+ L +HCKSK++DLG+ N+     F W F  N  ++TLFWCY+ K +   ++ + FW +   + LF +C   NC+WTAK+DG+Y
Subjt:  ELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAFWIESRSIWLFDKCFETNCIWTAKDDGIY

Query:  LKDNPTQVDVLIHRWK
        L ++    DVL  +WK
Subjt:  LKDNPTQVDVLIHRWK

Q9LW22 S-protein homolog 216.6e-1136Show/hide
Query:  KHFSVLLIVLSLALLEPARAAELKKWHIHVMNGLS--NGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKK--PNAQFVSFEAFWIE
        K+ S+ L V+ L ++        KK  I V N L+  N  +L VHCKSK++D+G   + +G    ++F+ NFW TT FWC L K     ++    A+   
Subjt:  KHFSVLLIVLSLALLEPARAAELKKWHIHVMNGLS--NGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKK--PNAQFVSFEAFWIE

Query:  SRSIWLFDKCFETNCIWTAKDDGIY
         ++I LF K   ++  W A+DDGIY
Subjt:  SRSIWLFDKCFETNCIWTAKDDGIY

Arabidopsis top hitse value%identityAlignment
AT3G26880.1 Plant self-incompatibility protein S1 family4.7e-1236Show/hide
Query:  KHFSVLLIVLSLALLEPARAAELKKWHIHVMNGLS--NGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKK--PNAQFVSFEAFWIE
        K+ S+ L V+ L ++        KK  I V N L+  N  +L VHCKSK++D+G   + +G    ++F+ NFW TT FWC L K     ++    A+   
Subjt:  KHFSVLLIVLSLALLEPARAAELKKWHIHVMNGLS--NGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKK--PNAQFVSFEAFWIE

Query:  SRSIWLFDKCFETNCIWTAKDDGIY
         ++I LF K   ++  W A+DDGIY
Subjt:  SRSIWLFDKCFETNCIWTAKDDGIY

AT4G16295.1 S-protein homologue 11.3e-2542.24Show/hide
Query:  ELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAFWIESRSIWLFDKCFETNCIWTAKDDGIY
        ++ +W + V+NGL+ G+ L +HCKSK+DDLGE N+     F W F  N  ++T FWCY+ K N   ++   FW +   + LF +C   NCIWTAK DG+Y
Subjt:  ELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAFWIESRSIWLFDKCFETNCIWTAKDDGIY

Query:  LKDNPTQVDVLIHRWK
        L ++ +  DVL  +W+
Subjt:  LKDNPTQVDVLIHRWK

AT4G29035.1 Plant self-incompatibility protein S1 family1.4e-2440.52Show/hide
Query:  ELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAFWIESRSIWLFDKCFETNCIWTAKDDGIY
        ++ +W + V NGL+ G+ L +HCKSK++DLG+ N+     F W F  N  ++TLFWCY+ K +   ++ + FW +   + LF +C   NC+WTAK+DG+Y
Subjt:  ELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAFWIESRSIWLFDKCFETNCIWTAKDDGIY

Query:  LKDNPTQVDVLIHRWK
        L ++    DVL  +WK
Subjt:  LKDNPTQVDVLIHRWK

AT5G04350.1 Plant self-incompatibility protein S1 family1.2e-1532.54Show/hide
Query:  VLLIVLSLALLEPARAAELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLK-----KPNAQFVSFEAFWIESRS
        +  IV+ L +       E+ +  + + N L + ++L VHC+SKDDDLGEH + +G ++++TF  N W TT F C +      K +  FV++E  W     
Subjt:  VLLIVLSLALLEPARAAELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLK-----KPNAQFVSFEAFWIESRS

Query:  IWLFDKCFETNCIWTAKDDGIYLKDN
             K  E +C W  ++DGIY   +
Subjt:  IWLFDKCFETNCIWTAKDDGIYLKDN

AT5G06020.1 Plant self-incompatibility protein S1 family1.0e-1136.27Show/hide
Query:  SNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAFWIESRSIWLFDKCFETNCIWTAKDDGIYLKDNPTQVDVLIH
        +N  +L +HCKSKDDDLG H    G  + W F VNF N+TL++C   +       F+      R+   F +C   NC W AK D +Y   N  Q      
Subjt:  SNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAFWIESRSIWLFDKCFETNCIWTAKDDGIYLKDNPTQVDVLIH

Query:  RW
        +W
Subjt:  RW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAATCAATGAAAAAGCACTTTTCGGTTTTGTTGATTGTCTTGTCGTTGGCATTACTTGAGCCAGCCAGGGCTGCCGAGCTAAAAAAATGGCACATCCACGTTATGAA
TGGGCTAAGCAACGGTCAAATGTTGTTGGTCCACTGCAAGTCTAAAGATGATGATCTAGGCGAACACAATATTAGCGTTGGAGCTGAATTCGATTGGACTTTTAGAGTTA
ACTTTTGGAATACGACGTTGTTTTGGTGTTACTTGAAAAAGCCGAATGCTCAATTTGTTTCATTTGAAGCTTTTTGGATTGAGAGCAGATCTATTTGGCTCTTCGATAAA
TGCTTTGAAACAAACTGCATTTGGACAGCAAAAGATGATGGAATCTATTTGAAAGACAACCCAACTCAAGTAGATGTTTTGATTCATCGATGGAAATATAAAATTATCAT
AAGGAAAACATATGACATTTTAATCTTTAGAACTTGCCAATTTTTCACAATGGAAATGAAATCTATGAAAAAGCACTCTTTGGTTTTCTCCCTTGTTTTACTCTTGTTAT
GGGCAATATTCAATCCAACAACGAAGGCTGCCGGTGCCTTGCTGAAAAGATGGCAAATTCACATCGTGAATGGGCTAAGCAATGACCAGATCTTGTTGGTGCATTGCAAA
TCCGAGGACGATGATCTAGGCGAACACAATATCAATGTTGGAACTGAATTTAATTGGACTTTTAGAGTAAACATTTGGAATACGACATTGTTTTGGTGTTTCTTGAAAAA
GCCAAATGCTCAATATGTGTCATTTGAGGCTTTTTGGGTCGAGAAGACATCGATTTGGCTATATTATAGATGCTATAATTCTAACTGTATTTGGACGGCAAAAGATGATG
GAGTCTATTTAAAAGACAATCCTGTTAAAAGAGATGTTTTGGTTCATAAGTGGCAATAA
mRNA sequenceShow/hide mRNA sequence
ATGAAATCAATGAAAAAGCACTTTTCGGTTTTGTTGATTGTCTTGTCGTTGGCATTACTTGAGCCAGCCAGGGCTGCCGAGCTAAAAAAATGGCACATCCACGTTATGAA
TGGGCTAAGCAACGGTCAAATGTTGTTGGTCCACTGCAAGTCTAAAGATGATGATCTAGGCGAACACAATATTAGCGTTGGAGCTGAATTCGATTGGACTTTTAGAGTTA
ACTTTTGGAATACGACGTTGTTTTGGTGTTACTTGAAAAAGCCGAATGCTCAATTTGTTTCATTTGAAGCTTTTTGGATTGAGAGCAGATCTATTTGGCTCTTCGATAAA
TGCTTTGAAACAAACTGCATTTGGACAGCAAAAGATGATGGAATCTATTTGAAAGACAACCCAACTCAAGTAGATGTTTTGATTCATCGATGGAAATATAAAATTATCAT
AAGGAAAACATATGACATTTTAATCTTTAGAACTTGCCAATTTTTCACAATGGAAATGAAATCTATGAAAAAGCACTCTTTGGTTTTCTCCCTTGTTTTACTCTTGTTAT
GGGCAATATTCAATCCAACAACGAAGGCTGCCGGTGCCTTGCTGAAAAGATGGCAAATTCACATCGTGAATGGGCTAAGCAATGACCAGATCTTGTTGGTGCATTGCAAA
TCCGAGGACGATGATCTAGGCGAACACAATATCAATGTTGGAACTGAATTTAATTGGACTTTTAGAGTAAACATTTGGAATACGACATTGTTTTGGTGTTTCTTGAAAAA
GCCAAATGCTCAATATGTGTCATTTGAGGCTTTTTGGGTCGAGAAGACATCGATTTGGCTATATTATAGATGCTATAATTCTAACTGTATTTGGACGGCAAAAGATGATG
GAGTCTATTTAAAAGACAATCCTGTTAAAAGAGATGTTTTGGTTCATAAGTGGCAATAA
Protein sequenceShow/hide protein sequence
MKSMKKHFSVLLIVLSLALLEPARAAELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAFWIESRSIWLFDK
CFETNCIWTAKDDGIYLKDNPTQVDVLIHRWKYKIIIRKTYDILIFRTCQFFTMEMKSMKKHSLVFSLVLLLLWAIFNPTTKAAGALLKRWQIHIVNGLSNDQILLVHCK
SEDDDLGEHNINVGTEFNWTFRVNIWNTTLFWCFLKKPNAQYVSFEAFWVEKTSIWLYYRCYNSNCIWTAKDDGVYLKDNPVKRDVLVHKWQ