; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc01G05950 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc01G05950
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionF-box protein PP2-A13-like
Genome locationClcChr01:5626015..5629401
RNA-Seq ExpressionClc01G05950
SyntenyClc01G05950
Gene Ontology termsGO:0009611 - response to wounding (biological process)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR001810 - F-box domain
IPR036047 - F-box-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004143628.1 F-box protein PP2-A14 [Cucumis sativus]1.3e-5582.88Show/hide
Query:  MGASFAKHS-NLPSSPSSSDSSSLEDIPENCISIVFMYLDPPEICNLASLNRAFRAASSADFVWESKLPPNYNFLLHRVLH-PQLCQPKKHIFATLTRPY
        MGASFAK S NLP S SSS SSSLEDIPENCISIV MYLDPPEICNLASL+ AFR+ SSADFVWESKLP NY FLLHRVL  P L  PKK IFA LTRP 
Subjt:  MGASFAKHS-NLPSSPSSSDSSSLEDIPENCISIVFMYLDPPEICNLASLNRAFRAASSADFVWESKLPPNYNFLLHRVLH-PQLCQPKKHIFATLTRPY

Query:  LFDHATKEVWLDKSSGKIFVSISSKALKITGIDDRRYWNYIPTDES
         FDHATKE WLDK SGK F+SISSKALKITGIDDRRYWNYIPTDES
Subjt:  LFDHATKEVWLDKSSGKIFVSISSKALKITGIDDRRYWNYIPTDES

XP_008467221.1 PREDICTED: LOW QUALITY PROTEIN: F-box protein PP2-A14 [Cucumis melo]1.5e-5682.99Show/hide
Query:  MGASFAKHS-NLPSSPSSSDSSSLEDIPENCISIVFMYLDPPEICNLASLNRAFRAASSADFVWESKLPPNYNFLLHRVLHPQ--LCQPKKHIFATLTRP
        MGASFAK S NLP S SSS SS LEDIPENCISIVFMYLDPPEICNLASLN AFR++SSADFVWESKLP NY FLLHRVL  Q     PKK IFATLTRP
Subjt:  MGASFAKHS-NLPSSPSSSDSSSLEDIPENCISIVFMYLDPPEICNLASLNRAFRAASSADFVWESKLPPNYNFLLHRVLHPQ--LCQPKKHIFATLTRP

Query:  YLFDHATKEVWLDKSSGKIFVSISSKALKITGIDDRRYWNYIPTDES
         LFDHATKE WLDK SG+ F+SISSKALKITGIDDRRYWNYIPTDES
Subjt:  YLFDHATKEVWLDKSSGKIFVSISSKALKITGIDDRRYWNYIPTDES

XP_023533028.1 F-box protein PP2-A13-like [Cucurbita pepo subsp. pepo]4.5e-4868Show/hide
Query:  MGASFAKHSNLPSSPSSSDSSSLEDIPENCISIVFMYLDPPEICNLASLNRAFRAASSADFVWESKLPPNYNFLLHRVLHPQLCQ------PKKHIFATL
        MGA+FAK S +   P  S S+S +D+PENCISIV M+L+PPEIC LA+LNRAFRAASSADF+W SKLP NY  LL R+L P          PKK IFA L
Subjt:  MGASFAKHSNLPSSPSSSDSSSLEDIPENCISIVFMYLDPPEICNLASLNRAFRAASSADFVWESKLPPNYNFLLHRVLHPQLCQ------PKKHIFATL

Query:  TRPYLFDHATKEVWLDKSSGKIFVSISSKALKITGIDDRRYWNYIPTDES
        +RP LFD  TKE+WLDKSSGK+FVSISSKALKITGI+DRRYWN++PTDES
Subjt:  TRPYLFDHATKEVWLDKSSGKIFVSISSKALKITGIDDRRYWNYIPTDES

XP_038906479.1 F-box protein PP2-A14 isoform X1 [Benincasa hispida]6.7e-6084.03Show/hide
Query:  MGASFAKHSNLPSSPSSSDSSSLEDIPENCISIVFMYLDPPEICNLASLNRAFRAASSADFVWESKLPPNYNFLLHRVLHPQLCQPKKHIFATLTRPYLF
        MGASFAK S   ++ SSS +SS EDIPENCISI+FMYLDPPEICNLASLNRAFRAAS ADF+WESKLP NYNFLL RVLHP L QPKK IFA LTRPYLF
Subjt:  MGASFAKHSNLPSSPSSSDSSSLEDIPENCISIVFMYLDPPEICNLASLNRAFRAASSADFVWESKLPPNYNFLLHRVLHPQLCQPKKHIFATLTRPYLF

Query:  DHATKEVWLDKSSGKIFVSISSKALKITGIDDRRYWNYIPTDES
        DHATKEVWLDKSSG  F+SISSKALKITGIDDRRYWNYI TDES
Subjt:  DHATKEVWLDKSSGKIFVSISSKALKITGIDDRRYWNYIPTDES

XP_038906480.1 F-box protein PP2-A14 isoform X2 [Benincasa hispida]4.6e-6183.67Show/hide
Query:  MGASFAKHSNLPSSPSSSDSSSLEDIPENCISIVFMYLDPPEICNLASLNRAFRAASSADFVWESKLPPNYNFLLHRVLHPQLCQPKKHIFATLTRPYLF
        MGASFAK S   ++ SSS +SS EDIPENCISI+FMYLDPPEICNLASLNRAFRAAS ADF+WESKLP NYNFLL RVLHP L QPKK IFA LTRPYLF
Subjt:  MGASFAKHSNLPSSPSSSDSSSLEDIPENCISIVFMYLDPPEICNLASLNRAFRAASSADFVWESKLPPNYNFLLHRVLHPQLCQPKKHIFATLTRPYLF

Query:  DHATKEVWLDKSSGKIFVSISSKALKITGIDDRRYWNYIPTDESSQY
        DHATKEVWLDKSSG  F+SISSKALKITGIDDRRYWNYI TDESS Y
Subjt:  DHATKEVWLDKSSGKIFVSISSKALKITGIDDRRYWNYIPTDESSQY

TrEMBL top hitse value%identityAlignment
A0A0A0KL73 F-box domain-containing protein6.3e-5682.88Show/hide
Query:  MGASFAKHS-NLPSSPSSSDSSSLEDIPENCISIVFMYLDPPEICNLASLNRAFRAASSADFVWESKLPPNYNFLLHRVLH-PQLCQPKKHIFATLTRPY
        MGASFAK S NLP S SSS SSSLEDIPENCISIV MYLDPPEICNLASL+ AFR+ SSADFVWESKLP NY FLLHRVL  P L  PKK IFA LTRP 
Subjt:  MGASFAKHS-NLPSSPSSSDSSSLEDIPENCISIVFMYLDPPEICNLASLNRAFRAASSADFVWESKLPPNYNFLLHRVLH-PQLCQPKKHIFATLTRPY

Query:  LFDHATKEVWLDKSSGKIFVSISSKALKITGIDDRRYWNYIPTDES
         FDHATKE WLDK SGK F+SISSKALKITGIDDRRYWNYIPTDES
Subjt:  LFDHATKEVWLDKSSGKIFVSISSKALKITGIDDRRYWNYIPTDES

A0A1S3CUB8 LOW QUALITY PROTEIN: F-box protein PP2-A147.5e-5782.99Show/hide
Query:  MGASFAKHS-NLPSSPSSSDSSSLEDIPENCISIVFMYLDPPEICNLASLNRAFRAASSADFVWESKLPPNYNFLLHRVLHPQ--LCQPKKHIFATLTRP
        MGASFAK S NLP S SSS SS LEDIPENCISIVFMYLDPPEICNLASLN AFR++SSADFVWESKLP NY FLLHRVL  Q     PKK IFATLTRP
Subjt:  MGASFAKHS-NLPSSPSSSDSSSLEDIPENCISIVFMYLDPPEICNLASLNRAFRAASSADFVWESKLPPNYNFLLHRVLHPQ--LCQPKKHIFATLTRP

Query:  YLFDHATKEVWLDKSSGKIFVSISSKALKITGIDDRRYWNYIPTDES
         LFDHATKE WLDK SG+ F+SISSKALKITGIDDRRYWNYIPTDES
Subjt:  YLFDHATKEVWLDKSSGKIFVSISSKALKITGIDDRRYWNYIPTDES

A0A6J1DZW7 F-box protein PP2-A13-like2.5e-4463.86Show/hide
Query:  SLSLSKSKTKKKNSSFM----GASFAK-HSNLPSSPSSSDSSSLEDIPENCISIVFMYLDPPEICNLASLNRAFRAASSADFVWESKLPPNYNFLLHRVL
        S  LS + T+ K S       GA+FAK H+N     SS  SS L+D+PENCI I+   LDPPEIC LA+LNRAFRAASSADF+WESKLP N   LL  +L
Subjt:  SLSLSKSKTKKKNSSFM----GASFAK-HSNLPSSPSSSDSSSLEDIPENCISIVFMYLDPPEICNLASLNRAFRAASSADFVWESKLPPNYNFLLHRVL

Query:  HPQLCQP-KKHIFATLTRPYLFDHATKEVWLDKSSGKIFVSISSKALKITGIDDRRYWNYIPTDES
        HPQ   P  KHIFA LTRP LFD  TKE WLDK S K F+SISSKALKITGI DRRYWNYIPT +S
Subjt:  HPQLCQP-KKHIFATLTRPYLFDHATKEVWLDKSSGKIFVSISSKALKITGIDDRRYWNYIPTDES

A0A6J1H031 F-box protein PP2-A13-like8.3e-4866.67Show/hide
Query:  MGASFAKHSNLPSSPSSSDSSSLEDIPENCISIVFMYLDPPEICNLASLNRAFRAASSADFVWESKLPPNYNFLLHRVLHPQLCQ------PKKHIFATL
        MGA+FAK   +   P  S S+S +D+PENCISIV M+L+PPEIC LA+LNRAFRAASSADF+W SKLP NY  LL R+L P          PKK IFA L
Subjt:  MGASFAKHSNLPSSPSSSDSSSLEDIPENCISIVFMYLDPPEICNLASLNRAFRAASSADFVWESKLPPNYNFLLHRVLHPQLCQ------PKKHIFATL

Query:  TRPYLFDHATKEVWLDKSSGKIFVSISSKALKITGIDDRRYWNYIPTDES
        +RP LFD  TKE+WLDKSSGK+FVSIS+KALKITGI+DRRYWN++PTDES
Subjt:  TRPYLFDHATKEVWLDKSSGKIFVSISSKALKITGIDDRRYWNYIPTDES

A0A6J1K1Y6 F-box protein PP2-A13-like1.2e-4666Show/hide
Query:  MGASFAKHSNLPSSPSSSDSSSLEDIPENCISIVFMYLDPPEICNLASLNRAFRAASSADFVWESKLPPNYNFLLHRVLHPQLCQ------PKKHIFATL
        MGA+F K S +   P  S S+S +D+PENCISIV M+L+P EIC LA+LNRAFR+ASSADF+W SKLP NY  LL R+L P          PKK IFA L
Subjt:  MGASFAKHSNLPSSPSSSDSSSLEDIPENCISIVFMYLDPPEICNLASLNRAFRAASSADFVWESKLPPNYNFLLHRVLHPQLCQ------PKKHIFATL

Query:  TRPYLFDHATKEVWLDKSSGKIFVSISSKALKITGIDDRRYWNYIPTDES
        +RP LFD  TKE+WLDKSSGK+FVSISSKALKITGI+DRRYWN++PTDES
Subjt:  TRPYLFDHATKEVWLDKSSGKIFVSISSKALKITGIDDRRYWNYIPTDES

SwissProt top hitse value%identityAlignment
Q9CAN4 F-box protein PP2-A113.7e-2945.64Show/hide
Query:  MGASFAK-HSNLPSSPSSSD--SSSLEDIPENCISIVFMYLDPPEICNLASLNRAFRAASSADFVWESKLPPNYNFLLHRVL--HPQLCQPKKHIFATLT
        MG+ F+    NL S     D     L D+PE+C++++   LDP EIC  + LN AF  AS ADFVWESKLPP+Y  +L ++L   P   + K+ IF  L+
Subjt:  MGASFAK-HSNLPSSPSSSD--SSSLEDIPENCISIVFMYLDPPEICNLASLNRAFRAASSADFVWESKLPPNYNFLLHRVL--HPQLCQPKKHIFATLT

Query:  RPYLFDHATKEVWLDKSSGKIFVSISSKALKITGIDDRRYWNYIPTDES
        R   FD   K+ W+DK +G + +  S+K L ITGIDDRRYW++IP+D+S
Subjt:  RPYLFDHATKEVWLDKSSGKIFVSISSKALKITGIDDRRYWNYIPTDES

Q9FJ80 F-box protein PP2-A141.0e-3955.26Show/hide
Query:  MGASFAKHSNLPSSPSSSDSSSLEDIPENCISIVFMYLDPPEICNLASLNRAFRAASSADFVWESKLPPNYNFLLHRVLHPQ--------LCQPKKHIFA
        MGA  A  S + S P +     LED+PENCI+ +FMY++PPEIC LA +N++F  AS +D VWE KLP NY FL+ R+L  Q        L   KK I+A
Subjt:  MGASFAKHSNLPSSPSSSDSSSLEDIPENCISIVFMYLDPPEICNLASLNRAFRAASSADFVWESKLPPNYNFLLHRVLHPQ--------LCQPKKHIFA

Query:  TLTRPYLFDHATKEVWLDKSSGKIFVSISSKALKITGIDDRRYWNYIPTDES
         L RP LFD  TKE WLDK SGK+F++IS KA+KITGIDDRRYW +I +DES
Subjt:  TLTRPYLFDHATKEVWLDKSSGKIFVSISSKALKITGIDDRRYWNYIPTDES

Q9LEX0 F-box protein PP2-A133.0e-3959.68Show/hide
Query:  LEDIPENCISIVFMYLDPPEICNLASLNRAFRAASSADFVWESKLPPNYNFLLHRVLHPQLCQP--KKHIFATLTRPYLFDHATKEVWLDKSSGKIFVSI
        L D+PENC++++   LDPPEIC LA LNR FR ASSADF+WESKLP NY  + H+V          KK ++A L++P LFD  TKE+W+DK++G++ +SI
Subjt:  LEDIPENCISIVFMYLDPPEICNLASLNRAFRAASSADFVWESKLPPNYNFLLHRVLHPQLCQP--KKHIFATLTRPYLFDHATKEVWLDKSSGKIFVSI

Query:  SSKALKITGIDDRRYWNYIPTDES
        SSKAL+ITGIDDRRYW++IPTDES
Subjt:  SSKALKITGIDDRRYWNYIPTDES

Q9LF92 F-box protein PP2-A157.0e-3658.06Show/hide
Query:  LEDIPENCISIVFMYLDPPEICNLASLNRAFRAASSADFVWESKLPPNYNFLLHRVLHPQLCQ--PKKHIFATLTRPYLFDHATKEVWLDKSSGKIFVSI
        L DIPE+C++ VFMYL PPEICNLA LNR+FR A+S+D VWE KLP NY  LL  +L P+      KK IFA L+RP  FD   KEVW+D+ +G++ ++I
Subjt:  LEDIPENCISIVFMYLDPPEICNLASLNRAFRAASSADFVWESKLPPNYNFLLHRVLHPQLCQ--PKKHIFATLTRPYLFDHATKEVWLDKSSGKIFVSI

Query:  SSKALKITGIDDRRYWNYIPTDES
        S++ + ITGI+DRRYWN+IPT+ES
Subjt:  SSKALKITGIDDRRYWNYIPTDES

Q9LN77 F-box protein PP2-A122.5e-3352.45Show/hide
Query:  HSNLPSSPSSSDSSSLE----DIPENCISIVFMYLDPPEICNLASLNRAFRAASSADFVWESKLPPNYNFLLHRVL--HPQLCQPKKHIFATLTRPYLFD
        H++L SS    D + L+    D+PE C++I+   LDP EIC  + LNRAFR AS AD VWESKLP NY  +L ++L   P+  Q K+H++A L+R   FD
Subjt:  HSNLPSSPSSSDSSSLE----DIPENCISIVFMYLDPPEICNLASLNRAFRAASSADFVWESKLPPNYNFLLHRVL--HPQLCQPKKHIFATLTRPYLFD

Query:  HATKEVWLDKSSGKIFVSISSKALKITGIDDRRYWNYIPTDES
         ATK+VW+DK +  + +SIS+K L ITGIDDRRYW++IPTDES
Subjt:  HATKEVWLDKSSGKIFVSISSKALKITGIDDRRYWNYIPTDES

Arabidopsis top hitse value%identityAlignment
AT1G12710.1 phloem protein 2-A121.8e-3452.45Show/hide
Query:  HSNLPSSPSSSDSSSLE----DIPENCISIVFMYLDPPEICNLASLNRAFRAASSADFVWESKLPPNYNFLLHRVL--HPQLCQPKKHIFATLTRPYLFD
        H++L SS    D + L+    D+PE C++I+   LDP EIC  + LNRAFR AS AD VWESKLP NY  +L ++L   P+  Q K+H++A L+R   FD
Subjt:  HSNLPSSPSSSDSSSLE----DIPENCISIVFMYLDPPEICNLASLNRAFRAASSADFVWESKLPPNYNFLLHRVL--HPQLCQPKKHIFATLTRPYLFD

Query:  HATKEVWLDKSSGKIFVSISSKALKITGIDDRRYWNYIPTDES
         ATK+VW+DK +  + +SIS+K L ITGIDDRRYW++IPTDES
Subjt:  HATKEVWLDKSSGKIFVSISSKALKITGIDDRRYWNYIPTDES

AT3G53000.1 phloem protein 2-A155.0e-3758.06Show/hide
Query:  LEDIPENCISIVFMYLDPPEICNLASLNRAFRAASSADFVWESKLPPNYNFLLHRVLHPQLCQ--PKKHIFATLTRPYLFDHATKEVWLDKSSGKIFVSI
        L DIPE+C++ VFMYL PPEICNLA LNR+FR A+S+D VWE KLP NY  LL  +L P+      KK IFA L+RP  FD   KEVW+D+ +G++ ++I
Subjt:  LEDIPENCISIVFMYLDPPEICNLASLNRAFRAASSADFVWESKLPPNYNFLLHRVLHPQLCQ--PKKHIFATLTRPYLFDHATKEVWLDKSSGKIFVSI

Query:  SSKALKITGIDDRRYWNYIPTDES
        S++ + ITGI+DRRYWN+IPT+ES
Subjt:  SSKALKITGIDDRRYWNYIPTDES

AT3G61060.1 phloem protein 2-A132.2e-4059.68Show/hide
Query:  LEDIPENCISIVFMYLDPPEICNLASLNRAFRAASSADFVWESKLPPNYNFLLHRVLHPQLCQP--KKHIFATLTRPYLFDHATKEVWLDKSSGKIFVSI
        L D+PENC++++   LDPPEIC LA LNR FR ASSADF+WESKLP NY  + H+V          KK ++A L++P LFD  TKE+W+DK++G++ +SI
Subjt:  LEDIPENCISIVFMYLDPPEICNLASLNRAFRAASSADFVWESKLPPNYNFLLHRVLHPQLCQP--KKHIFATLTRPYLFDHATKEVWLDKSSGKIFVSI

Query:  SSKALKITGIDDRRYWNYIPTDES
        SSKAL+ITGIDDRRYW++IPTDES
Subjt:  SSKALKITGIDDRRYWNYIPTDES

AT3G61060.2 phloem protein 2-A135.3e-3959.2Show/hide
Query:  LEDIPENCISIVFMYLDPPEICNLASLNRAFRAASSADFVWESKLPPNYNFLLHRVLHPQLCQP--KKHIFATLTRPYLFDHATK-EVWLDKSSGKIFVS
        L D+PENC++++   LDPPEIC LA LNR FR ASSADF+WESKLP NY  + H+V          KK ++A L++P LFD  TK E+W+DK++G++ +S
Subjt:  LEDIPENCISIVFMYLDPPEICNLASLNRAFRAASSADFVWESKLPPNYNFLLHRVLHPQLCQP--KKHIFATLTRPYLFDHATK-EVWLDKSSGKIFVS

Query:  ISSKALKITGIDDRRYWNYIPTDES
        ISSKAL+ITGIDDRRYW++IPTDES
Subjt:  ISSKALKITGIDDRRYWNYIPTDES

AT5G52120.1 phloem protein 2-A147.4e-4155.26Show/hide
Query:  MGASFAKHSNLPSSPSSSDSSSLEDIPENCISIVFMYLDPPEICNLASLNRAFRAASSADFVWESKLPPNYNFLLHRVLHPQ--------LCQPKKHIFA
        MGA  A  S + S P +     LED+PENCI+ +FMY++PPEIC LA +N++F  AS +D VWE KLP NY FL+ R+L  Q        L   KK I+A
Subjt:  MGASFAKHSNLPSSPSSSDSSSLEDIPENCISIVFMYLDPPEICNLASLNRAFRAASSADFVWESKLPPNYNFLLHRVLHPQ--------LCQPKKHIFA

Query:  TLTRPYLFDHATKEVWLDKSSGKIFVSISSKALKITGIDDRRYWNYIPTDES
         L RP LFD  TKE WLDK SGK+F++IS KA+KITGIDDRRYW +I +DES
Subjt:  TLTRPYLFDHATKEVWLDKSSGKIFVSISSKALKITGIDDRRYWNYIPTDES


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCCATTATTCTTCATCATCTCCCACTTTCTATCTCCCACCTCCCTCCCTCTCTACCATTACAAACACAACACACCCATCTAATTTCTCCCCTCTCTCTCTCTCTCTC
CAAATCCAAAACAAAAAAAAAAAATTCCTCTTTTATGGGGGCTTCCTTTGCCAAACACTCCAATCTCCCTTCCTCTCCTTCCTCCTCCGATTCCTCTTCTCTGGAAGACA
TACCGGAAAATTGTATTTCCATTGTCTTCATGTACTTGGATCCTCCCGAGATTTGCAATCTTGCCTCTCTTAATCGAGCCTTTCGTGCTGCTTCTTCCGCTGATTTCGTT
TGGGAATCCAAACTCCCTCCCAATTACAACTTCTTACTTCATCGAGTTCTTCATCCTCAATTGTGCCAGCCTAAGAAACACATCTTCGCTACCCTCACTCGCCCTTATCT
CTTTGATCATGCCACCAAGGAGGTTTGGTTGGATAAAAGTTCTGGAAAAATTTTTGTTTCTATTTCATCCAAGGCTCTGAAGATTACAGGGATCGATGATAGAAGATATT
GGAACTATATTCCAACTGATGAATCAAGTCAATACCGTTTGTGGAGAACCGATGGATTATGTGGCCAATTATGGTCCATTGTGAAAGGAGTAATATACCATTCCCATACC
AATATCGATCTTTCAGGGAACCATTTGTATACAATCCATGTTGATGCGACTTGCATATCAATGGAAACACCTTTGGTTGGCTTTGGTGGGATTATTTGGTCCTCATCAAC
ATCCATTTTGGCCTCTTTACATAGTGTCGAAGATAATTATTTTTTAATCCTATGTGTGCAAGGCCCTTGTTATTTGTGA
mRNA sequenceShow/hide mRNA sequence
ACGTAGACGAACCCAATTGAATTTCATTTCTCCAAAACCCCTCCACTTTTTTAACATTCCCTCTTTTTCCCCAACTTTCCCCCACAAAAGAAAAAAACATGCCCATTATT
CTTCATCATCTCCCACTTTCTATCTCCCACCTCCCTCCCTCTCTACCATTACAAACACAACACACCCATCTAATTTCTCCCCTCTCTCTCTCTCTCTCCAAATCCAAAAC
AAAAAAAAAAAATTCCTCTTTTATGGGGGCTTCCTTTGCCAAACACTCCAATCTCCCTTCCTCTCCTTCCTCCTCCGATTCCTCTTCTCTGGAAGACATACCGGAAAATT
GTATTTCCATTGTCTTCATGTACTTGGATCCTCCCGAGATTTGCAATCTTGCCTCTCTTAATCGAGCCTTTCGTGCTGCTTCTTCCGCTGATTTCGTTTGGGAATCCAAA
CTCCCTCCCAATTACAACTTCTTACTTCATCGAGTTCTTCATCCTCAATTGTGCCAGCCTAAGAAACACATCTTCGCTACCCTCACTCGCCCTTATCTCTTTGATCATGC
CACCAAGGAGGTTTGGTTGGATAAAAGTTCTGGAAAAATTTTTGTTTCTATTTCATCCAAGGCTCTGAAGATTACAGGGATCGATGATAGAAGATATTGGAACTATATTC
CAACTGATGAATCAAGTCAATACCGTTTGTGGAGAACCGATGGATTATGTGGCCAATTATGGTCCATTGTGAAAGGAGTAATATACCATTCCCATACCAATATCGATCTT
TCAGGGAACCATTTGTATACAATCCATGTTGATGCGACTTGCATATCAATGGAAACACCTTTGGTTGGCTTTGGTGGGATTATTTGGTCCTCATCAACATCCATTTTGGC
CTCTTTACATAGTGTCGAAGATAATTATTTTTTAATCCTATGTGTGCAAGGCCCTTGTTATTTGTGAGGGCCTTCGCTTGGTGGAAAGATTGAAACAAACTCGAGTTTTG
ATATTGTCAAATTCTTTAGTTGTAATTTAGATGATTAATGGTACCACGAATATTCAGGTGGAAGTGGCAAATTATGTTGTTGATATAAGGAAGATGGTCATTTCCTTTCA
TTCTGTCACATTTCAACATGTGCCCTATTCGCACAATTCAG
Protein sequenceShow/hide protein sequence
MPIILHHLPLSISHLPPSLPLQTQHTHLISPLSLSLSKSKTKKKNSSFMGASFAKHSNLPSSPSSSDSSSLEDIPENCISIVFMYLDPPEICNLASLNRAFRAASSADFV
WESKLPPNYNFLLHRVLHPQLCQPKKHIFATLTRPYLFDHATKEVWLDKSSGKIFVSISSKALKITGIDDRRYWNYIPTDESSQYRLWRTDGLCGQLWSIVKGVIYHSHT
NIDLSGNHLYTIHVDATCISMETPLVGFGGIIWSSSTSILASLHSVEDNYFLILCVQGPCYL