CuGenDBv2

Tan0008838 (gene) of Snake gourd v1 genome

Gene ID: Tan0008838
Organism: Trichosanthes anguina (Snake gourd v1)
Description: S-protein homolog
Genome location: LG06:16794530..16821152
RNA-Seq Expression: Tan0008838
Synteny: Tan0008838
Gene Ontology terms: GO:0060320 - rejection of self pollen (biological process)
GO:0005576 - extracellular region (cellular component)
InterPro domains: IPR010264 - Plant self-incompatibility S1
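The genome location above follows a `sequence:start..end` pattern. The sketch below splits such a string into its parts and computes the span length. It assumes the coordinates are 1-based and inclusive, which this page does not state; `GenomeLocation` and `parse_location` are illustrative names, not part of any CuGenDB tooling.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A parsed 'seqid:start..end' location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a string such as 'LG06:16794530..16821152'."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return GenomeLocation(seqid, start, end)


loc = parse_location("LG06:16794530..16821152")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the span for this gene is 26,623 bp.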


Homology
GenBank top hits (e value, % identity, alignment)
XP_016902699.1 PREDICTED: uncharacterized protein LOC107991826 [Cucumis melo] (e value: 1.4e-55; % identity: 66.23)
Query:  NTISQTMEMKSMKKHFSV--FLIVLSLAILEPARAAELKKWHIHVMNGLSNDQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNA
        N  S+   M+SMK HF V  FL VLSLAI++P +A  L++WHIH++NGLSNDQ L VHC+SKDDDLG+  +SVG EF+WTF++NFW+TTLFWCYL+KPNA
Subjt:  NTISQTMEMKSMKKHFSV--FLIVLSLAILEPARAAELKKWHIHVMNGLSNDQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNA

Query:  QFVSFEAFWIESRSIWLFHKCFDSNCIWTAKDDEIYLKDNLTQVDVLIHQW
        + VSFEAFW+E +SIWLF++CF SNCIWTAKDD IYLKDN    D L+H W
Subjt:  QFVSFEAFWIESRSIWLFHKCFDSNCIWTAKDDEIYLKDNLTQVDVLIHQW

XP_022143772.1 S-protein homolog 74-like [Momordica charantia] (e value: 9.7e-57; % identity: 67.79)
Query:  ISQTMEMKSMKKHFSVFLIVLSLAILEPARA-AELKKWHIHVMNGLSNDQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFV
        +SQ M ++S KKHF VFL+VLSL ILEP  A +ELKKW IHV+NGLSN Q L VHCKSKD+DLGEHN++ G EF+WTFRVN WNTTLFWCYL KP+ +  
Subjt:  ISQTMEMKSMKKHFSVFLIVLSLAILEPARA-AELKKWHIHVMNGLSNDQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFV

Query:  SFEAFWIESRSIWLFHKCFDSNCIWTAKDDEIYLKDNLTQVDVLIHQWK
        SF+ FW+E +SIWLF++C+ SNCIWTAKDD IYL+DN  Q D+L+H+WK
Subjt:  SFEAFWIESRSIWLFHKCFDSNCIWTAKDDEIYLKDNLTQVDVLIHQWK

XP_022143829.1 S-protein homolog 74-like [Momordica charantia] (e value: 1.2e-51; % identity: 63.89)
Query:  MEMKSMKKHFSVFLIVLSLAILEPARAAELKKWHIHVMNGLSNDQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAF
        M  + +KKHF V L+ LSLAI+EP  + ELK+W+IHV+NGL N ++L VHCKS+DDDLGE N+  GAEF WTFRVN  +TTLFWC+L+KP+AQ VSF+AF
Subjt:  MEMKSMKKHFSVFLIVLSLAILEPARAAELKKWHIHVMNGLSNDQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAF

Query:  WIESRSIWLFHKCFDSNCIWTAKDDEIYLKDNLTQVDVLIHQWK
        W+E  SIWLF++C+D+NCIWTAKDD +YL+DN  Q DVL+H+W+
Subjt:  WIESRSIWLFHKCFDSNCIWTAKDDEIYLKDNLTQVDVLIHQWK

XP_031745090.1 S-protein homolog 1-like [Cucumis sativus] (e value: 5.0e-53; % identity: 65.77)
Query:  ISQTMEMKSMKKHFSVFLIVLSLAILEPARAAELKKWHIHVMNGLSNDQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVS
        +++ +E+  MKK   V L VL LAILE  +A EL+KWHIHV+NGLSN Q+LL HCKSKD+DLGE  +  G EF+W FRVNFWNTTLFWCYL+KPN Q  S
Subjt:  ISQTMEMKSMKKHFSVFLIVLSLAILEPARAAELKKWHIHVMNGLSNDQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVS

Query:  FEAFWIESRSIWLFHKCFDSNCIWTAKDDEIYLKDNL-TQVDVLIHQWK
        FE+FWIESRS+WL+  CF+ NCIWTAKDD IYLKDN  T  D+LIH+W+
Subjt:  FEAFWIESRSIWLFHKCFDSNCIWTAKDDEIYLKDNL-TQVDVLIHQWK

XP_038896594.1 S-protein homolog 1-like [Benincasa hispida] (e value: 5.9e-54; % identity: 68.06)
Query:  MEMKSMKKHFSVFLIVLSLAILEPARAAELKKWHIHVMNGLSNDQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAF
        MEM+ M K   V   VL LA+L+  +AAEL KW IHV+NGLSN Q+L VHCKSKD+DLGEH +SVG EF+W FRVNFWNTTLFWCYL+KPNAQ  SF+AF
Subjt:  MEMKSMKKHFSVFLIVLSLAILEPARAAELKKWHIHVMNGLSNDQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAF

Query:  WIESRSIWLFHKCFDSNCIWTAKDDEIYLKDNLTQVDVLIHQWK
        WIES S+WL++ C+DSNCIW AKDD +YLKDN    DVLIH+W+
Subjt:  WIESRSIWLFHKCFDSNCIWTAKDDEIYLKDNLTQVDVLIHQWK

TrEMBL top hits (e value, % identity, alignment)
A0A1S4E390 S-protein homolog (e value: 6.8e-56; % identity: 66.23)
Query:  NTISQTMEMKSMKKHFSV--FLIVLSLAILEPARAAELKKWHIHVMNGLSNDQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNA
        N  S+   M+SMK HF V  FL VLSLAI++P +A  L++WHIH++NGLSNDQ L VHC+SKDDDLG+  +SVG EF+WTF++NFW+TTLFWCYL+KPNA
Subjt:  NTISQTMEMKSMKKHFSV--FLIVLSLAILEPARAAELKKWHIHVMNGLSNDQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNA

Query:  QFVSFEAFWIESRSIWLFHKCFDSNCIWTAKDDEIYLKDNLTQVDVLIHQW
        + VSFEAFW+E +SIWLF++CF SNCIWTAKDD IYLKDN    D L+H W
Subjt:  QFVSFEAFWIESRSIWLFHKCFDSNCIWTAKDDEIYLKDNLTQVDVLIHQW

A0A5N5I9W3 S-protein homolog (e value: 3.2e-45; % identity: 37.88)
Query:  SQMLLVHCKSKDDDLGEHNISVGTEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAFWIESRSVWLFHKCFDSNCIWTAKDDGIYLKDNMTQVDISCEDM
        S+ L+ HC+S DDDLG  +IS G EF+W+FR N   +TL+WC +   + Q  SF+ FW E    WL ++C    C W AKDDGIYL+     +  S ED 
Subjt:  SQMLLVHCKSKDDDLGEHNISVGTEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAFWIESRSVWLFHKCFDSNCIWTAKDDGIYLKDNMTQVDISCEDM

Query:  TVICMRIFHGFLQNTISQTMEMKSMKKHFSVFLIVLSLA-ILEPARAAELKKWHIHVMNGLSNDQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNT
             R        T  +T+   + K+  ++ L    LA   E       ++WH+HV+N L   + L  HC+SK+DD+G   I+ GAE  W+F+ NF+ T
Subjt:  TVICMRIFHGFLQNTISQTMEMKSMKKHFSVFLIVLSLA-ILEPARAAELKKWHIHVMNGLSNDQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNT

Query:  TLFWCYLKKPNAQFVSFEAFWIESRSIWLFHKCFDSNCIWTAKDDEIYLKDNLTQVDVLIHQWK
        TL+WCY +  + +  +F+ +W ES+  WL ++C    C W AKDD  Y++    + D LIH+W+
Subjt:  TLFWCYLKKPNAQFVSFEAFWIESRSIWLFHKCFDSNCIWTAKDDEIYLKDNLTQVDVLIHQWK

A0A6J1CPR8 S-protein homolog (e value: 2.1e-49; % identity: 66.18)
Query:  KHFSVFLIVLSLAILEPARAAELKKWHIHVMNGLSNDQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAFWIESRSI
        KHF VFL V SLAI+E   A  L KW IHV N LSN QML VHCKSK+DDLGEHN+SVG EF+W FRVN W+TTL+WCYL+KPN Q VSF+AFW+E  SI
Subjt:  KHFSVFLIVLSLAILEPARAAELKKWHIHVMNGLSNDQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAFWIESRSI

Query:  WLFHKCFDSNCIWTAKDDEIYLKDNLTQVDVLIHQW
        WL++KC +SNC W AKDD IYL++N    DV +H+W
Subjt:  WLFHKCFDSNCIWTAKDDEIYLKDNLTQVDVLIHQW

A0A6J1CQH6 S-protein homolog (e value: 6.0e-52; % identity: 63.89)
Query:  MEMKSMKKHFSVFLIVLSLAILEPARAAELKKWHIHVMNGLSNDQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAF
        M  + +KKHF V L+ LSLAI+EP  + ELK+W+IHV+NGL N ++L VHCKS+DDDLGE N+  GAEF WTFRVN  +TTLFWC+L+KP+AQ VSF+AF
Subjt:  MEMKSMKKHFSVFLIVLSLAILEPARAAELKKWHIHVMNGLSNDQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAF

Query:  WIESRSIWLFHKCFDSNCIWTAKDDEIYLKDNLTQVDVLIHQWK
        W+E  SIWLF++C+D+NCIWTAKDD +YL+DN  Q DVL+H+W+
Subjt:  WIESRSIWLFHKCFDSNCIWTAKDDEIYLKDNLTQVDVLIHQWK

A0A6J1CRU0 S-protein homolog (e value: 4.7e-57; % identity: 67.79)
Query:  ISQTMEMKSMKKHFSVFLIVLSLAILEPARA-AELKKWHIHVMNGLSNDQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFV
        +SQ M ++S KKHF VFL+VLSL ILEP  A +ELKKW IHV+NGLSN Q L VHCKSKD+DLGEHN++ G EF+WTFRVN WNTTLFWCYL KP+ +  
Subjt:  ISQTMEMKSMKKHFSVFLIVLSLAILEPARA-AELKKWHIHVMNGLSNDQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFV

Query:  SFEAFWIESRSIWLFHKCFDSNCIWTAKDDEIYLKDNLTQVDVLIHQWK
        SF+ FW+E +SIWLF++C+ SNCIWTAKDD IYL+DN  Q D+L+H+WK
Subjt:  SFEAFWIESRSIWLFHKCFDSNCIWTAKDDEIYLKDNLTQVDVLIHQWK

SwissProt top hits (e value, % identity, alignment)
F4JLS0 S-protein homolog 1 (e value: 1.9e-23; % identity: 41.38)
Query:  ELKKWHIHVMNGLSNDQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAFWIESRSIWLFHKCFDSNCIWTAKDDEIY
        ++ +W + V+NGL+  + L +HCKSK+DDLGE N+     F W F  N  ++T FWCY+ K N   ++   FW +   + LFH+C   NCIWTAK D +Y
Subjt:  ELKKWHIHVMNGLSNDQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAFWIESRSIWLFHKCFDSNCIWTAKDDEIY

Query:  LKDNLTQVDVLIHQWK
        L ++ +  DVL  +W+
Subjt:  LKDNLTQVDVLIHQWK

P0DN92 S-protein homolog 24 (e value: 7.6e-12; % identity: 36.59)
Query:  FSVFLIVLSLAILEPARAAELK---KWHI-HVMNGLSNDQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAFWIESR
        F V ++V+SL   E  +  E K   + H+  V     ND +L +HCKS+DDDLG H ++ G  F W F VNF  +TL++C   +   +   FE +    R
Subjt:  FSVFLIVLSLAILEPARAAELK---KWHI-HVMNGLSNDQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAFWIESR

Query:  SIWLFHKCFDSNCIWTAKDDEIY
        +   F++C  +NC W A+ D IY
Subjt:  SIWLFHKCFDSNCIWTAKDDEIY

Q2HQ46 S-protein homolog 74 (e value: 2.1e-22; % identity: 39.66)
Query:  ELKKWHIHVMNGLSNDQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAFWIESRSIWLFHKCFDSNCIWTAKDDEIY
        ++ +W + V NGL+  + L +HCKSK++DLG+ N+     F W F  N  ++TLFWCY+ K +   ++ + FW +   + LFH+C   NC+WTAK+D +Y
Subjt:  ELKKWHIHVMNGLSNDQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAFWIESRSIWLFHKCFDSNCIWTAKDDEIY

Query:  LKDNLTQVDVLIHQWK
        L ++    DVL  +WK
Subjt:  LKDNLTQVDVLIHQWK

Q9FI84 S-protein homolog 27 (e value: 2.0e-12; % identity: 38.24)
Query:  SNDQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAFWIESRSIWLFHKCFDSNCIWTAKDDEIYLKDNLTQVDVLIH
        +ND +L +HCKSKDDDLG H    G  + W F VNF N+TL++C   +       F+      R+   F++C   NC W AK D +Y   NL Q      
Subjt:  SNDQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAFWIESRSIWLFHKCFDSNCIWTAKDDEIYLKDNLTQVDVLIH

Query:  QW
        +W
Subjt:  QW

Q9LW22 S-protein homolog 21 (e value: 2.0e-12; % identity: 37.4)
Query:  KHFSIFMIVLSLAMLESARAAELKKWHIHVMNGLS--NSQMLLVHCKSKDDDLGEHNISVGTEFDWTFRVNFWNTTLFWCYLKK--PNAQFVSFEAFWIE
        K+ SIF+ V+ L M+        KK  I V N L+  N  +L VHCKSK++D+G   + +G    ++F+ NFW TT FWC L K     ++    A+   
Subjt:  KHFSIFMIVLSLAMLESARAAELKKWHIHVMNGLS--NSQMLLVHCKSKDDDLGEHNISVGTEFDWTFRVNFWNTTLFWCYLKK--PNAQFVSFEAFWIE

Query:  SRSVWLFHKCFDSNCIWTAKDDGIYL-KDNM
         +++ LF K   S+  W A+DDGIY  KD++
Subjt:  SRSVWLFHKCFDSNCIWTAKDDGIYL-KDNM

Arabidopsis top hits (e value, % identity, alignment)
AT3G26880.1 Plant self-incompatibility protein S1 family (e value: 1.4e-13; % identity: 37.4)
Query:  KHFSIFMIVLSLAMLESARAAELKKWHIHVMNGLS--NSQMLLVHCKSKDDDLGEHNISVGTEFDWTFRVNFWNTTLFWCYLKK--PNAQFVSFEAFWIE
        K+ SIF+ V+ L M+        KK  I V N L+  N  +L VHCKSK++D+G   + +G    ++F+ NFW TT FWC L K     ++    A+   
Subjt:  KHFSIFMIVLSLAMLESARAAELKKWHIHVMNGLS--NSQMLLVHCKSKDDDLGEHNISVGTEFDWTFRVNFWNTTLFWCYLKK--PNAQFVSFEAFWIE

Query:  SRSVWLFHKCFDSNCIWTAKDDGIYL-KDNM
         +++ LF K   S+  W A+DDGIY  KD++
Subjt:  SRSVWLFHKCFDSNCIWTAKDDGIYL-KDNM

AT4G16295.1 S-protein homologue 1 (e value: 1.4e-24; % identity: 41.38)
Query:  ELKKWHIHVMNGLSNDQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAFWIESRSIWLFHKCFDSNCIWTAKDDEIY
        ++ +W + V+NGL+  + L +HCKSK+DDLGE N+     F W F  N  ++T FWCY+ K N   ++   FW +   + LFH+C   NCIWTAK D +Y
Subjt:  ELKKWHIHVMNGLSNDQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAFWIESRSIWLFHKCFDSNCIWTAKDDEIY

Query:  LKDNLTQVDVLIHQWK
        L ++ +  DVL  +W+
Subjt:  LKDNLTQVDVLIHQWK

AT4G29035.1 Plant self-incompatibility protein S1 family (e value: 1.5e-23; % identity: 39.66)
Query:  ELKKWHIHVMNGLSNDQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAFWIESRSIWLFHKCFDSNCIWTAKDDEIY
        ++ +W + V NGL+  + L +HCKSK++DLG+ N+     F W F  N  ++TLFWCY+ K +   ++ + FW +   + LFH+C   NC+WTAK+D +Y
Subjt:  ELKKWHIHVMNGLSNDQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAFWIESRSIWLFHKCFDSNCIWTAKDDEIY

Query:  LKDNLTQVDVLIHQWK
        L ++    DVL  +WK
Subjt:  LKDNLTQVDVLIHQWK

AT5G04350.1 Plant self-incompatibility protein S1 family (e value: 8.0e-17; % identity: 34.92)
Query:  IFMIVLSLAMLESARAAELKKWHIHVMNGLSNSQMLLVHCKSKDDDLGEHNISVGTEFDWTFRVNFWNTTLFWCYLK-----KPNAQFVSFEAFWIESRS
        IF IV+ L +  S    E+ +  + + N L +S++L VHC+SKDDDLGEH + +G ++++TF  N W TT F C +      K +  FV++E  W     
Subjt:  IFMIVLSLAMLESARAAELKKWHIHVMNGLSNSQMLLVHCKSKDDDLGEHNISVGTEFDWTFRVNFWNTTLFWCYLK-----KPNAQFVSFEAFWIESRS

Query:  VWLFHKCFDSNCIWTAKDDGIYLKDN
             K  +++C W  ++DGIY   +
Subjt:  VWLFHKCFDSNCIWTAKDDGIYLKDN

AT5G06020.1 Plant self-incompatibility protein S1 family (e value: 1.4e-13; % identity: 38.24)
Query:  SNDQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAFWIESRSIWLFHKCFDSNCIWTAKDDEIYLKDNLTQVDVLIH
        +ND +L +HCKSKDDDLG H    G  + W F VNF N+TL++C   +       F+      R+   F++C   NC W AK D +Y   NL Q      
Subjt:  SNDQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAFWIESRSIWLFHKCFDSNCIWTAKDDEIYLKDNLTQVDVLIH

Query:  QW
        +W
Subjt:  QW
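In the alignments above, the unlabeled line between each Query and Subjt row is the match line: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a minimal sketch, percent identity for one alignment block can be estimated by counting the letters on that line. `percent_identity` is a hypothetical helper, and the sketch assumes identity is taken over the full alignment width including gap columns, so its results may not match the reported values exactly.

```python
def percent_identity(query: str, midline: str) -> float:
    """Estimate percent identity from an alignment's query row and match line.

    Assumption: identity = identical columns / alignment width (gaps included).
    Both strings must have their 'Query:' prefix and leading indent removed.
    """
    width = len(query)
    if width == 0:
        raise ValueError("empty alignment")
    # The match line may have lost trailing spaces; pad it back to full width.
    padded = midline.ljust(width)[:width]
    # Letters are identities; '+' and ' ' are not counted.
    identical = sum(ch.isalpha() for ch in padded)
    return 100.0 * identical / width


# Toy example (not taken from the hits above): 4 identical columns out of 6.
print(round(percent_identity("MKSMEK", "MK+ EK"), 2))
```

For an alignment split over several blocks, the query rows and match lines would need to be concatenated block by block before calling the function.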


Sequences
CDS sequence
ATGAAATCTATGGAAAAGCACTTTTCGATTTTCATGATTGTTTTGTCGTTGGCAATGCTTGAGTCAGCAAGGGCTGCCGAGCTGAAAAAATGGCACATCCACGTTATGAA
TGGGCTAAGCAATAGCCAAATGTTGTTGGTGCACTGCAAGTCTAAGGACGATGATCTAGGCGAACACAATATCAGCGTTGGAACTGAATTCGATTGGACTTTTAGAGTAA
ACTTTTGGAACACAACATTGTTTTGGTGTTACTTGAAAAAGCCGAATGCTCAATTTGTTTCATTTGAAGCATTTTGGATCGAGAGCAGATCTGTTTGGCTTTTTCATAAA
TGTTTTGATTCTAATTGCATTTGGACAGCAAAAGACGATGGAATCTATTTGAAAGACAACATGACTCAAGTAGATATTTCATGTGAGGATATGACAGTAATTTGCATGAG
AATATTTCATGGTTTCCTACAAAACACAATATCTCAAACAATGGAAATGAAATCTATGAAAAAGCACTTTTCGGTTTTCTTGATTGTCTTGTCGTTGGCAATCCTTGAGC
CAGCAAGGGCTGCAGAGCTGAAAAAATGGCACATCCACGTTATGAATGGGCTAAGCAACGACCAAATGTTGTTGGTGCACTGCAAGTCTAAGGACGATGACCTAGGGGAA
CACAATATTAGCGTTGGAGCTGAATTCGATTGGACTTTTAGAGTAAACTTTTGGAATACAACGTTGTTTTGGTGTTACTTGAAAAAGCCGAATGCTCAATTTGTTTCATT
TGAAGCTTTTTGGATCGAGAGCAGATCTATTTGGCTCTTTCATAAATGCTTTGATTCTAACTGCATTTGGACAGCAAAAGATGATGAAATCTATTTGAAAGACAACTTGA
CTCAAGTAGATGTTTTGATTCATCAATGGAAATAG
mRNA sequence
ATGAAATCTATGGAAAAGCACTTTTCGATTTTCATGATTGTTTTGTCGTTGGCAATGCTTGAGTCAGCAAGGGCTGCCGAGCTGAAAAAATGGCACATCCACGTTATGAA
TGGGCTAAGCAATAGCCAAATGTTGTTGGTGCACTGCAAGTCTAAGGACGATGATCTAGGCGAACACAATATCAGCGTTGGAACTGAATTCGATTGGACTTTTAGAGTAA
ACTTTTGGAACACAACATTGTTTTGGTGTTACTTGAAAAAGCCGAATGCTCAATTTGTTTCATTTGAAGCATTTTGGATCGAGAGCAGATCTGTTTGGCTTTTTCATAAA
TGTTTTGATTCTAATTGCATTTGGACAGCAAAAGACGATGGAATCTATTTGAAAGACAACATGACTCAAGTAGATATTTCATGTGAGGATATGACAGTAATTTGCATGAG
AATATTTCATGGTTTCCTACAAAACACAATATCTCAAACAATGGAAATGAAATCTATGAAAAAGCACTTTTCGGTTTTCTTGATTGTCTTGTCGTTGGCAATCCTTGAGC
CAGCAAGGGCTGCAGAGCTGAAAAAATGGCACATCCACGTTATGAATGGGCTAAGCAACGACCAAATGTTGTTGGTGCACTGCAAGTCTAAGGACGATGACCTAGGGGAA
CACAATATTAGCGTTGGAGCTGAATTCGATTGGACTTTTAGAGTAAACTTTTGGAATACAACGTTGTTTTGGTGTTACTTGAAAAAGCCGAATGCTCAATTTGTTTCATT
TGAAGCTTTTTGGATCGAGAGCAGATCTATTTGGCTCTTTCATAAATGCTTTGATTCTAACTGCATTTGGACAGCAAAAGATGATGAAATCTATTTGAAAGACAACTTGA
CTCAAGTAGATGTTTTGATTCATCAATGGAAATAG
Protein sequence
MKSMEKHFSIFMIVLSLAMLESARAAELKKWHIHVMNGLSNSQMLLVHCKSKDDDLGEHNISVGTEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAFWIESRSVWLFHK
CFDSNCIWTAKDDGIYLKDNMTQVDISCEDMTVICMRIFHGFLQNTISQTMEMKSMKKHFSVFLIVLSLAILEPARAAELKKWHIHVMNGLSNDQMLLVHCKSKDDDLGE
HNISVGAEFDWTFRVNFWNTTLFWCYLKKPNAQFVSFEAFWIESRSIWLFHKCFDSNCIWTAKDDEIYLKDNLTQVDVLIHQWK
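The protein sequence can be cross-checked against the CDS by translating it codon by codon. The sketch below assumes the standard genetic code, which this page does not state explicitly; `translate` and `CODON_TABLE` are illustrative names, not taken from any database tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (assumption:
# the standard nuclear code applies to this gene).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a CDS in frame 1; whitespace and line breaks are ignored."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons and last four codons of the CDS listed above.
print(translate("ATGAAATCTATGGAAAAGCACTTTTCGATT"))  # MKSMEKHFSI
print(translate("CAATGGAAATAG"))                    # QWK
```

The two fragments reproduce the start (`MKSMEKHFSI`) and the end (`QWK`) of the protein sequence listed above. Passing the full CDS, with its line breaks, translates the whole sequence the same way.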