; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0020853 (gene) of Chayote v1 genome

Gene IDSed0020853
OrganismSechium edule (Chayote v1)
Descriptionhistone acetyltransferase KAT6B-like
Genome locationLG13:2739428..2743916
RNA-Seq ExpressionSed0020853
SyntenySed0020853
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7030926.1 hypothetical protein SDJN02_04963 [Cucurbita argyrosperma subsp. argyrosperma]3.9e-20754.68Show/hide
Query:  MVLPSHISSSP--FSGRTSPNSRNSEIGSPIRRSFSGNPFPKPSIVTNPRSLNPTTPANSPSDYPRRNSLSRESLFTRTDNEEKENGKDHSSKPARLRSP
        MV PSH SSSP   +GR SPNSRNSEI +P RRSFS          + PRSLN  TP NSPSDYPRRNS SRE LF   DNEEKENGK+ S KP R+RSP
Subjt:  MVLPSHISSSP--FSGRTSPNSRNSEIGSPIRRSFSGNPFPKPSIVTNPRSLNPTTPANSPSDYPRRNSLSRESLFTRTDNEEKENGKDHSSKPARLRSP

Query:  TVGKSTKNFMSPTISAASKFAASPKKKILGDLNDQVRSSLSFSTSKISSHNSLNQNQNPEASTAFDSDANPQVA--------------------------
          GKSTK+FMSPTISAASK + SPKKKILGD N+ VRSSLSFS  K SS NS+  N NPEAS A +SD N ++A                          
Subjt:  TVGKSTKNFMSPTISAASKFAASPKKKILGDLNDQVRSSLSFSTSKISSHNSLNQNQNPEASTAFDSDANPQVA--------------------------

Query:  --LNPISKS---------SKTVRFGGVEVISCSIDDPEVIMMADGTDTKNETSPISKSAIETTPLEPSRTLESVNVEVISISTNDLESPAKSNLTEDVDC
            PISKS         SKTV+ GG++VIS S  + EV+ MA  TD K E +PIS SAI   P + S  +E  +VEV S   ND ESPAK+        
Subjt:  --LNPISKS---------SKTVRFGGVEVISCSIDDPEVIMMADGTDTKNETSPISKSAIETTPLEPSRTLESVNVEVISISTNDLESPAKSNLTEDVDC

Query:  VNLDPSFKISPVSSPIEVASLDADPLMAVPYDPKTNYLSPRPQFLHYKPRRINQHELDCGLEELFSSTIVSESEFTEEIDSKDPLMDSDEAETESQTENE
        V+L+ SFK S VSS +E+A LDADPLM  PYDPKTNYLSPRPQFLHYKPRRINQ ELD  LEELFS    SESEF+E  DS+DP M+SDEA +      E
Subjt:  VNLDPSFKISPVSSPIEVASLDADPLMAVPYDPKTNYLSPRPQFLHYKPRRINQHELDCGLEELFSSTIVSESEFTEEIDSKDPLMDSDEAETESQTENE

Query:  EEEEEEEEMDEI-----EKIVNEAKNSSKLHFSRMFKISSLLLILFTACFSICVVNVYDPNILERAS-LLTTEDSTEIFEFARTNFDVLLWKLEIWHENT
        EE EEEEE +E+     E+   EAK SSKLHFSR+FKISSLLLIL TACFSICVVNV+D   LERAS LL  EDSTE+FEFA+TNF+VL+ K E+WH N+
Subjt:  EEEEEEEEMDEI-----EKIVNEAKNSSKLHFSRMFKISSLLLILFTACFSICVVNVYDPNILERAS-LLTTEDSTEIFEFARTNFDVLLWKLEIWHENT

Query:  KSFVSDVILNI-RERPLIYLNQTEFLLKDFSPNGQCVLLSHP---KQEHMI-------KETEIDSFE------DHNEEQES--LQDIETMKDSEIEVEPV
        +S++SD++ NI   RPLIYLNQT FL KD +   QC++LSH    ++E+ +       KE EID  E      DHNEE+E   L++IE MK+ EI +E V
Subjt:  KSFVSDVILNI-RERPLIYLNQTEFLLKDFSPNGQCVLLSHP---KQEHMI-------KETEIDSFE------DHNEEQES--LQDIETMKDSEIEVEPV

Query:  ERDAHTEDEEQLFHEIKDQESSLVQEIHGEPVERDPHIEEEEQLIHEIEEKANESKDTEESLQEIIEDTSANSSSDELNEEERIQETQTEESYEVSSSSS
        E +   E+E               QEI  +  +     EE  Q       +    +  E SLQEIIE+TS  S+SD+LNEE+ IQE QTEE+YE SSS  
Subjt:  ERDAHTEDEEQLFHEIKDQESSLVQEIHGEPVERDPHIEEEEQLIHEIEEKANESKDTEESLQEIIEDTSANSSSDELNEEERIQETQTEESYEVSSSSS

Query:  NFIILDQIEQEPATLAES-EEEHIDSIQQSNTEFQYQSPPVSPPSEPQSDVENENGENFDLIKAAT--KISRDLTQNTAVIVSAILLGFTLI--AGLIYS
           I DQIEQE AT  E+ EE+  DSIQQ N E Q QSPPVSPPS PQSD E+ENG N DL+  AT  +ISRD +QNTAVI SAILLG ++I  AGLIY+
Subjt:  NFIILDQIEQEPATLAES-EEEHIDSIQQSNTEFQYQSPPVSPPSEPQSDVENENGENFDLIKAAT--KISRDLTQNTAVIVSAILLGFTLI--AGLIYS

Query:  RKPGSISSMAATATAEEQNEQPLLEEKQANQSSIVREEEVEEEPIDDDDQS-------CSYETSSF----------------------------------
        RK GS +S  A A AE Q E PLL+EK+  QS  V E   EEE I+DDD+        CS ETSSF                                  
Subjt:  RKPGSISSMAATATAEEQNEQPLLEEKQANQSSIVREEEVEEEPIDDDDQS-------CSYETSSF----------------------------------

Query:  ---QYSSMR-EDEPET-KRSNEARSHSHGRKTRKISRRESMASSSLDEDSVLSISASPSYGSFTTYEKIPIKHANGDDDEIVTPVRRSSRIRK
           QYSSM+ EDE ET K   EA++HSHGRKTRK S R  MASSSLDE SV + SASPSYGSFTTYEKIPIKH N +++EIVTPVRRSSRIRK
Subjt:  ---QYSSMR-EDEPET-KRSNEARSHSHGRKTRKISRRESMASSSLDEDSVLSISASPSYGSFTTYEKIPIKHANGDDDEIVTPVRRSSRIRK

XP_022942486.1 uncharacterized protein LOC111447507 [Cucurbita moschata]1.5e-20354.35Show/hide
Query:  MVLPSHISSSP--FSGRTSPNSRNSEIGSPIRRSFSGNPFPKPSIVTNPRSLNPTTPANSPSDYPRRNSLSRESLFTRTDNEEKENGKDHSSKPARLRSP
        M  PSH SSSP   +GR SPNSRNSEI +P RRSFS          + PRSLN  TP NSPSDYPRRNS SRE+LF   DNEEKENGK+ S KP R+RSP
Subjt:  MVLPSHISSSP--FSGRTSPNSRNSEIGSPIRRSFSGNPFPKPSIVTNPRSLNPTTPANSPSDYPRRNSLSRESLFTRTDNEEKENGKDHSSKPARLRSP

Query:  TVGKSTKNFMSPTISAASKFAASPKKKILGDLNDQVRSSLSFSTSKISSHNSLNQNQNPEASTAFDSDANPQVA--------------------------
          GKSTK+FMSPTISAASK + SPKKKILGD N+ VRSSLSFS  K SS NS+  N NPEAS A +SD N ++A                          
Subjt:  TVGKSTKNFMSPTISAASKFAASPKKKILGDLNDQVRSSLSFSTSKISSHNSLNQNQNPEASTAFDSDANPQVA--------------------------

Query:  --LNPISKS---------SKTVRFGGVEVISCSIDDPEVIMMADGTDTKNETSPISKSAIETTPLEPSRTLESVNVEVISISTNDLESPAKSNLTEDVDC
            PISKS         SKTV+ GG++VIS S  + EV+ MA  TD K E + IS SAI   P + S  +E  +VEV S   ND ESPAK+        
Subjt:  --LNPISKS---------SKTVRFGGVEVISCSIDDPEVIMMADGTDTKNETSPISKSAIETTPLEPSRTLESVNVEVISISTNDLESPAKSNLTEDVDC

Query:  VNLDPSFKISPVSSPIEVASLDADPLMAVPYDPKTNYLSPRPQFLHYKPRRINQHELDCGLEELFSSTIVSESEFTEEIDSKDPLMDSDEAETESQTENE
        V+L+ SFK S VSS +E+A LDADPLM  PYDPKTNYLSPRPQFLHYKPRRINQ ELD  LEELFS    SESEF+E  DS+DP M+SDEA +      E
Subjt:  VNLDPSFKISPVSSPIEVASLDADPLMAVPYDPKTNYLSPRPQFLHYKPRRINQHELDCGLEELFSSTIVSESEFTEEIDSKDPLMDSDEAETESQTENE

Query:  EEEEEEEEMDEI-----EKIVNEAKNSSKLHFSRMFKISSLLLILFTACFSICVVNVYDPNILERASL-LTTEDSTEIFEFARTNFDVLLWKLEIWHENT
        EE EE EE +E+     E+   EAK SSKLHFSR+FKISSLLLIL TACFSICVVNV+D   LERASL L  EDSTE+FEFA+TNF+VL+ K E+WH ++
Subjt:  EEEEEEEEMDEI-----EKIVNEAKNSSKLHFSRMFKISSLLLILFTACFSICVVNVYDPNILERASL-LTTEDSTEIFEFARTNFDVLLWKLEIWHENT

Query:  KSFVSDVILNI-RERPLIYLNQTEFLLKDFSPNGQCVLLSHPK----------QEHMIKETEIDSFE------DHNEEQES--LQDIETMKDSEIEVEPV
        + ++SD++ NI   RPLIYLNQT FL KD +   QC++LSH             E   KE EID  E      DHNEE+E   L++IE MK+ EI +E V
Subjt:  KSFVSDVILNI-RERPLIYLNQTEFLLKDFSPNGQCVLLSHPK----------QEHMIKETEIDSFE------DHNEEQES--LQDIETMKDSEIEVEPV

Query:  ERDAHTEDEEQLFHEIKDQESSLVQEIHGEPVERDPHIEEEEQLIHEIEEKANESKDTEESLQEIIEDTSANSSSDELNEEERIQETQTEESYEVSSSSS
        E +   E+E               QEI  +  +     EE  Q       +    +  E SLQEIIE+TS  S+SD+LNEE+ IQE QTEE+YE SSS  
Subjt:  ERDAHTEDEEQLFHEIKDQESSLVQEIHGEPVERDPHIEEEEQLIHEIEEKANESKDTEESLQEIIEDTSANSSSDELNEEERIQETQTEESYEVSSSSS

Query:  NFIILDQIEQEPATLAES-EEEHIDSIQQSNTEFQYQSPPVSPPSEPQSDVENENGENFDLIKAAT--KISRDLTQNTAVIVSAILLGFTLI--AGLIYS
           I DQIEQE AT  E+ EE+  DSIQQ N E Q QSPPVSPPS PQSD E+EN  N DL+  AT  +ISRD +QNTAVI SAILLG ++I  AGLIY+
Subjt:  NFIILDQIEQEPATLAES-EEEHIDSIQQSNTEFQYQSPPVSPPSEPQSDVENENGENFDLIKAAT--KISRDLTQNTAVIVSAILLGFTLI--AGLIYS

Query:  RKPGSISSMAATATAEEQNEQPLLEEKQANQSSIVREEE-------------------------------VEEEPIDDDDQS-------CSYETSSF-QY
        RK GS +S  A A AE Q E PLL+EK+  QS  V EEE                                EE+ I+DDD+        CS E SSF QY
Subjt:  RKPGSISSMAATATAEEQNEQPLLEEKQANQSSIVREEE-------------------------------VEEEPIDDDDQS-------CSYETSSF-QY

Query:  SSMR-EDEPET-KRSNEARSHSHGRKTRKISRRESMASSSLDEDSVLSISASPSYGSFTTYEKIPIKHANGDDDEIVTPVRRSSRIRK
        SSM+ EDE ET K   EA++HSHGRKTRK S R  MASSSLDE SV + SASPSYGSFTTYEKIPIKH N +++EIVTPVRRSSRIRK
Subjt:  SSMR-EDEPET-KRSNEARSHSHGRKTRKISRRESMASSSLDEDSVLSISASPSYGSFTTYEKIPIKHANGDDDEIVTPVRRSSRIRK

XP_022975663.1 uncharacterized protein LOC111475455 [Cucurbita maxima]1.5e-20354.19Show/hide
Query:  MVLPSHISSSP--FSGRTSPNSRNSEIGSPIRRSFSGNPFPKPSIVTNPRSLNPTTPANSPSDYPRRNSLSRESLFTRTDNEEKENGKDHSSKPARLRSP
        M  PSH SSSP   +GRTSPNSRNSEI +P RRSFS          +  RSLN  TP NSPSDYPRRNS SRE+LF   DNEEKENGK+ S KP R+RSP
Subjt:  MVLPSHISSSP--FSGRTSPNSRNSEIGSPIRRSFSGNPFPKPSIVTNPRSLNPTTPANSPSDYPRRNSLSRESLFTRTDNEEKENGKDHSSKPARLRSP

Query:  TVGKSTKNFMSPTISAASKFAASPKKKILGDLNDQVRSSLSFSTSKISSHNSLNQNQNPEASTAFDSDANPQVALNPISKSSKTVRFGGVEVISCSID--
          GKSTKNFMSPTISAASK + SPKKKILGD N+ VRSSLSFS  K SS NS+  N NPEAS A +SD N ++A     K SKTV+ GG EVI+ S    
Subjt:  TVGKSTKNFMSPTISAASKFAASPKKKILGDLNDQVRSSLSFSTSKISSHNSLNQNQNPEASTAFDSDANPQVALNPISKSSKTVRFGGVEVISCSID--

Query:  --DPEVIMMADGTDTKNETSPISKSAIETTPLEPSRTLESVNVEVI--------------------------------------------SISTNDLESP
          DPEV  +A  TDTK+E +PISKS I   PL  S+T++S  ++VI                                            S S ND ESP
Subjt:  --DPEVIMMADGTDTKNETSPISKSAIETTPLEPSRTLESVNVEVI--------------------------------------------SISTNDLESP

Query:  AKSNLTEDVDCVNLDPSFKISPVSSPIEVASLDADPLMAVPYDPKTNYLSPRPQFLHYKPRRINQHELDCGLEELFSSTIVSESEFTEEIDSKDPLMDSD
        AK++  E++D V L+ SFK S VSS +E+A LDADPLM  PYDPKTNYLSPRPQFLHYKPRRINQ ELD  LEELFS    SESEFTE  DS+DP M SD
Subjt:  AKSNLTEDVDCVNLDPSFKISPVSSPIEVASLDADPLMAVPYDPKTNYLSPRPQFLHYKPRRINQHELDCGLEELFSSTIVSESEFTEEIDSKDPLMDSD

Query:  EAETESQTENEEEEEEEEEMDEI-----EKIVNEAKNSSKLHFSRMFKISSLLLILFTACFSICVVNVYDPNILERAS-LLTTEDSTEIFEFARTNFDVL
        E  +      EEE EEEEE +E+     E+   EAK SSKLHFSR+FKISSLLLIL TACFSI VVNV+D   LERAS LL  EDSTE+FEFA+TNF+VL
Subjt:  EAETESQTENEEEEEEEEEMDEI-----EKIVNEAKNSSKLHFSRMFKISSLLLILFTACFSICVVNVYDPNILERAS-LLTTEDSTEIFEFARTNFDVL

Query:  LWKLEIWHENTKSFVSDVILNI-RERPLIYLNQTEFLLKDFSPNGQCVLLSHPK----------QEHMIKETEIDSFE------DHNEEQESL-QDIETM
        + K E+WH N++S++SD++ NI   RPLIYLNQT FL KD +   QC++LSH             E   KE EID  E      D NEE E L ++IE M
Subjt:  LWKLEIWHENTKSFVSDVILNI-RERPLIYLNQTEFLLKDFSPNGQCVLLSHPK----------QEHMIKETEIDSFE------DHNEEQESL-QDIETM

Query:  KDSEIEVEPVERDAHTEDEEQLFHEIKDQESSLVQEIHGEPVERDPHIEEEEQLIHEIEEKANE--SKDTEESLQEIIEDTSANSSSDELNEEERIQETQ
        K+ EI +E V+     ++EE+ F EI             E    DP   EEE      +  + E   ++ E SLQEIIE+TS  S+SD LNEE++IQE Q
Subjt:  KDSEIEVEPVERDAHTEDEEQLFHEIKDQESSLVQEIHGEPVERDPHIEEEEQLIHEIEEKANE--SKDTEESLQEIIEDTSANSSSDELNEEERIQETQ

Query:  TEESYEVSSSSSNFIILDQIEQEPATLAES-EEEHIDSIQQSNTEFQYQSPPVS-PPSEPQSDVENENGENFDLIKAATKISRDLTQNTAVIVSAILLGF
        TEE+YE SS+     I DQIEQE AT  E+ EE+  DSIQQSN E Q+QSPPVS PPS PQS+ E+ENG N    +   KISRD +QNTAVI SAILLG 
Subjt:  TEESYEVSSSSSNFIILDQIEQEPATLAES-EEEHIDSIQQSNTEFQYQSPPVS-PPSEPQSDVENENGENFDLIKAATKISRDLTQNTAVIVSAILLGF

Query:  TLI---AGLIYSRKPGSISSMAATATAEEQNEQPLLEEKQANQSSIVREE--------------------------------EVEEEPIDDDDQS-----
        ++I   AGLIY+RK GS  S  A A AE Q E PLL+EK+  QS  V EE                                EVEE    DDD+      
Subjt:  TLI---AGLIYSRKPGSISSMAATATAEEQNEQPLLEEKQANQSSIVREE--------------------------------EVEEEPIDDDDQS-----

Query:  --CSYETSSF-QYSSMR-EDEPET--KRSNEARSHSHGRKTRKISRRESMASSSLDEDSVLSISASPSYGSFTTYEKIPIKHANGDDDEIVTPVRRSSRI
          CS ETSSF QYSSM+ EDE ET  K   EA++HSHGRKTRK S R  MASSSLDE SV + SASPSYGSFTTYEKIPIKH N +++EIVTPVRRSSRI
Subjt:  --CSYETSSF-QYSSMR-EDEPET--KRSNEARSHSHGRKTRKISRRESMASSSLDEDSVLSISASPSYGSFTTYEKIPIKHANGDDDEIVTPVRRSSRI

Query:  RK
        RK
Subjt:  RK

XP_023524378.1 uncharacterized protein LOC111788283 [Cucurbita pepo subsp. pepo]2.7e-20854.89Show/hide
Query:  MVLPSHISSSP--FSGRTSPNSRNSEIGSPIRRSFSGNPFPKPSIVTNPRSLNPTTPANSPSDYPRRNSLSRESLFTRTDNEEKENGKDHSSKPARLRSP
        M  PSH SSSP   +GRTSPNSRNSEI +P RRSFS          + PRSLN  TP NSPSDYPRRNS SRE+LF   DNEEKENGK+ S KP R RSP
Subjt:  MVLPSHISSSP--FSGRTSPNSRNSEIGSPIRRSFSGNPFPKPSIVTNPRSLNPTTPANSPSDYPRRNSLSRESLFTRTDNEEKENGKDHSSKPARLRSP

Query:  TVGKSTKNFMSPTISAASKFAASPKKKILGDLNDQVRSSLSFSTSKISSHNSLNQNQNPEASTAFDSDANPQVA--------------------------
          GKSTK+FMSPTISAASK + SPKKKILGD N+ VRSSLSFS  K SS NS+  N NPEAS A +SD N ++A                          
Subjt:  TVGKSTKNFMSPTISAASKFAASPKKKILGDLNDQVRSSLSFSTSKISSHNSLNQNQNPEASTAFDSDANPQVA--------------------------

Query:  --LNPISKS---------SKTVRFGGVEVISCSIDDPEVIMMADGTDTKNETSPISKSAIETTPLEPSRTLESVNVEVISISTNDLESPAKSNLTEDVDC
            PISKS         SKTV+ GG +VIS S  + EV+ +A  TD K E +PIS SAI   P + S T+E  +VEV S S ND ESPAK+        
Subjt:  --LNPISKS---------SKTVRFGGVEVISCSIDDPEVIMMADGTDTKNETSPISKSAIETTPLEPSRTLESVNVEVISISTNDLESPAKSNLTEDVDC

Query:  VNLDPSFKISPVSSPIEVASLDADPLMAVPYDPKTNYLSPRPQFLHYKPRRINQHELDCGLEELFSSTIVSESEFTEEIDSKDPLMDSDEAETESQTENE
        V+LD SFK S VSS +E+A LDADPLM  PYDPKTNYLSPRPQFLHYKPRRINQ ELD  LEELFS    SESEFTE  DS+DP M+SDEA +      E
Subjt:  VNLDPSFKISPVSSPIEVASLDADPLMAVPYDPKTNYLSPRPQFLHYKPRRINQHELDCGLEELFSSTIVSESEFTEEIDSKDPLMDSDEAETESQTENE

Query:  EEEEEEEEMDEI-----EKIVNEAKNSSKLHFSRMFKISSLLLILFTACFSICVVNVYDPNILERAS-LLTTEDSTEIFEFARTNFDVLLWKLEIWHENT
        EE EEEEE +E+     E+   EAKNSSKLHFSR FKISSLLLIL TACFSICVVNV+D   LERAS LL  E+STE+FEFA+TNF+VL+ K E+WH N+
Subjt:  EEEEEEEEMDEI-----EKIVNEAKNSSKLHFSRMFKISSLLLILFTACFSICVVNVYDPNILERAS-LLTTEDSTEIFEFARTNFDVLLWKLEIWHENT

Query:  KSFVSDVILNI-RERPLIYLNQTEFLLKDFSPNGQCVLLSHPK----------QEHMIKETEIDSFEDH------NEEQES----LQDIETMKDSEIEVE
        +S++SD++ NI   RPLIY NQT FL KD +   QC++LSH             E   KE EID  E+H      NEE+E     L++IE MK+ EI++E
Subjt:  KSFVSDVILNI-RERPLIYLNQTEFLLKDFSPNGQCVLLSHPK----------QEHMIKETEIDSFEDH------NEEQES----LQDIETMKDSEIEVE

Query:  PVERDAHTEDEEQLFHEIKDQESSLVQEIHGEPVERDPHIEEEEQLIHEIEEKANESKDTEESLQEIIEDTSANSSSDELNEEERIQETQTEESYEVSSS
         VE +   E+E               QEI  +  +     EE  Q   +   +    +  E SLQEIIE+TS  S+SD+LNEE++IQE QTEE+YE SSS
Subjt:  PVERDAHTEDEEQLFHEIKDQESSLVQEIHGEPVERDPHIEEEEQLIHEIEEKANESKDTEESLQEIIEDTSANSSSDELNEEERIQETQTEESYEVSSS

Query:  SSNFIILDQIEQEPATLAES-EEEHIDSIQQSNTEFQYQSPPVSPPS-EPQSDVENENGENFDLIKAATK--ISRDLTQNTAVIVSAILLGFTLI--AGL
             I DQIEQE AT  E+ EE+  DSIQQ N E Q+QSPPVSPP   PQSD E+ENG N DL+  ATK  ISRD +QNTAVI SAILLG ++I  AGL
Subjt:  SSNFIILDQIEQEPATLAES-EEEHIDSIQQSNTEFQYQSPPVSPPS-EPQSDVENENGENFDLIKAATK--ISRDLTQNTAVIVSAILLGFTLI--AGL

Query:  IYSRKPGSISSMAATATAEEQNEQPLLEEKQANQSSIVREEE-------------------------------VEEEPIDDDDQS-------CSYETSSF
        IY+RK GS +S  A A AE Q E PLL+EK+  QS  V EEE                                EE+ I+DDD+        CS E SSF
Subjt:  IYSRKPGSISSMAATATAEEQNEQPLLEEKQANQSSIVREEE-------------------------------VEEEPIDDDDQS-------CSYETSSF

Query:  -QYSSMR-EDEPET-KRSNEARSHSHGRKTRKISRRESMASSSLDEDSVLSISASPSYGSFTTYEKIPIKHANGDDDEIVTPVRRSSRIRK
         QYSSM+ EDE ET K+  EA++HSHGRKTRK S R  MASSSLDE SV + SASPSYGSFTTYEKIPIKH N +++EIVTPVRRSSRIRK
Subjt:  -QYSSMR-EDEPET-KRSNEARSHSHGRKTRKISRRESMASSSLDEDSVLSISASPSYGSFTTYEKIPIKHANGDDDEIVTPVRRSSRIRK

XP_038903440.1 uncharacterized protein LOC120090026 [Benincasa hispida]1.7e-21557.66Show/hide
Query:  MVLPSHISSSP--FSGRTSPNSRNSEIGSPIRRSFSGNPFPKPSIVTNPRSLNPTTPANSPSDYPRRNSLSRESLFTRTDNEEKENGKDHSSKPARLRSP
        M LPS+ SSSP   SGRTSPNSRNSEI +P+RRSFSGNPF KPSIV NPR LNP TPANSPSDYPRRNS+SRE+ FT  + +EKEN KD S KP R+RSP
Subjt:  MVLPSHISSSP--FSGRTSPNSRNSEIGSPIRRSFSGNPFPKPSIVTNPRSLNPTTPANSPSDYPRRNSLSRESLFTRTDNEEKENGKDHSSKPARLRSP

Query:  TVGKSTKNFMSPTISAASKFAASPKKKILGDLNDQVRSSLSFSTSKISSHNSLNQNQNPEASTAFDSDANPQVALNPISKSSKTVRFGGVEVISCSIDD-
         VGKS+K+FMSPTISAASK AASPKKKILGD N+ VRSS SFS  K SS NS+NQ+   ++S   +SD NPQ+     SKS+KTVRFGG EVIS S DD 
Subjt:  TVGKSTKNFMSPTISAASKFAASPKKKILGDLNDQVRSSLSFSTSKISSHNSLNQNQNPEASTAFDSDANPQVALNPISKSSKTVRFGGVEVISCSIDD-

Query:  ---------PEVI-MMADGTDTKNETSPISKSAIETTPLEPSRTLESVNVEVISISTNDLES-PAKSNLTEDVDCVNLDPSFKISPVSSPIEVASLDADP
                 PEV+  MA   D K+E +P+SKSA    PLE S +    + EVISIS  DL+S PA+SNL EDVDCVNLDPSFKISP+SSP+ +A LD DP
Subjt:  ---------PEVI-MMADGTDTKNETSPISKSAIETTPLEPSRTLESVNVEVISISTNDLES-PAKSNLTEDVDCVNLDPSFKISPVSSPIEVASLDADP

Query:  LMAVPYDPKTNYLSPRPQFLHYKP-RRINQHELDCGLEE-LFSSTIVSESEFTEEIDSKDPLMDSDEAETESQTENEEEEEEEEEMDEIEKIVNEAKNSS
         +  PYDPKTNYLSPRPQFLHY+P RRIN++E +  LEE LFS   VS+SE  EE DS+D   +SDEA + +++E EEEE+EEEE++  E+   E K SS
Subjt:  LMAVPYDPKTNYLSPRPQFLHYKP-RRINQHELDCGLEE-LFSSTIVSESEFTEEIDSKDPLMDSDEAETESQTENEEEEEEEEEMDEIEKIVNEAKNSS

Query:  KLHFSRMFKISSLLLILFTACFSICVVNVYDPNILERASLLTTEDSTEIFEFARTNFDVLLWKLEIWHENTKSFVSDVILNIR-ERPLI-YLNQTEFLLK
        KLHFS +FK SSLLLILFTACFSICVVNV+DPNI +R S LT ED +EIF FA+TNF+VL+ KLE+WH  + SF+SDV+ N R   PLI Y NQTEF  +
Subjt:  KLHFSRMFKISSLLLILFTACFSICVVNVYDPNILERASLLTTEDSTEIFEFARTNFDVLLWKLEIWHENTKSFVSDVILNIR-ERPLI-YLNQTEFLLK

Query:  DFSPNGQCVLLSHP---KQEH------MIKETEIDSFEDHNEEQ----ESLQDIETMKDSEIEVEPVERDAHTEDEEQLFHEI----------KDQESSL
         F+ N QC++LSH    ++E+       +K+ EID FE+  E++    E  Q+ E +   EI +E  ER++   +EE+LF EI          ++QE  L
Subjt:  DFSPNGQCVLLSHP---KQEH------MIKETEIDSFEDHNEEQ----ESLQDIETMKDSEIEVEPVERDAHTEDEEQLFHEI----------KDQESSL

Query:  -------VQEIHGEPVERDPHIEE--EEQLIHEIEEKANESKDTE---ESLQEIIEDTSANSSSDELNEEERIQETQTEESYEVSSSSSNFIILDQIEQE
               ++EI  E VER+   EE  E+    E E  ANE ++ E   ESLQE IE+ S NS+SD+L EEE +QE + EE+++  SS S+    DQIEQ 
Subjt:  -------VQEIHGEPVERDPHIEE--EEQLIHEIEEKANESKDTE---ESLQEIIEDTSANSSSDELNEEERIQETQTEESYEVSSSSSNFIILDQIEQE

Query:  PATLAESEEEHIDSIQQSNTEFQYQSPPVSPP-SEPQSDVENEN-GENFDLIKAATKISRDLTQNTAVIVSAILLGFTLIAGLIYSRKPGSISSMAATAT
         A      EE      + NTEFQYQ PPVSPP +E QSD E +N G+  DLI+    IS+D TQNTA+I+SAILLG TLI GLIY+R+ GS  S +  A 
Subjt:  PATLAESEEEHIDSIQQSNTEFQYQSPPVSPP-SEPQSDVENEN-GENFDLIKAATKISRDLTQNTAVIVSAILLGFTLIAGLIYSRKPGSISSMAATAT

Query:  AEEQNE-QPLLEEKQANQSSIVREEEVEEEPIDDDD----QSCSYETSSFQYSSMREDEPET-KRSNEARSHSHGR-KTRKISRRESMASSSLDEDSVLS
        AEE+ E QPL++E++ NQS +  EE VEEE  +++D    + CS ETSSFQYSSMRE++ +  KRS+E +SHSHGR K RK SRRESMASSSLDE SV S
Subjt:  AEEQNE-QPLLEEKQANQSSIVREEEVEEEPIDDDD----QSCSYETSSFQYSSMREDEPET-KRSNEARSHSHGR-KTRKISRRESMASSSLDEDSVLS

Query:  ISASPSYGSFTTYEKIPIKHANGDDDEIVTPVRRSSRIRK
         SASPSYGSFTTYEKIPIKH  G DDEIVTPVRRS+RIRK
Subjt:  ISASPSYGSFTTYEKIPIKHANGDDDEIVTPVRRSSRIRK

TrEMBL top hitse value%identityAlignment
A0A6J1E496 uncharacterized protein LOC111430638 isoform X11.7e-19250.67Show/hide
Query:  MVLPSHISSSP--FSGRTSPNSRNSEIGSPIRRSFSGNPFPKPSIVTNPRSLNPTTPANSPS--DY-PRRNSLSRESLFTRTDNEEKENGKDHSSKPARL
        M LPS+ SSSP   +GRTSP SRNSEIG+P+ RSFS NPF KPSI T+ RSLNP TPANSPS  DY P+RNS+SRE LFT  DNE+KENGKD S K  R+
Subjt:  MVLPSHISSSP--FSGRTSPNSRNSEIGSPIRRSFSGNPFPKPSIVTNPRSLNPTTPANSPS--DY-PRRNSLSRESLFTRTDNEEKENGKDHSSKPARL

Query:  RSPTVGKSTKNFMSPTISAASKFAASPKKKILGDLNDQVRSSLSFSTSKISSHNSLNQNQNPEASTAFDSDANPQVALNPISKSSKTVRFGGVEVISCSI
        RSPTVGKS KNFMS TISAASK A SPKKKILGD N+ VRSSLSFS  K SS NS+  N  PEAS AF+SD NP + L    KS+KTVRFGGVEVIS S 
Subjt:  RSPTVGKSTKNFMSPTISAASKFAASPKKKILGDLNDQVRSSLSFSTSKISSHNSLNQNQNPEASTAFDSDANPQVALNPISKSSKTVRFGGVEVISCSI

Query:  DD----------PEVIMMADGTDTKNETSPISKSAIETTPLEPSRTLESVNVEVIS--------------------------------------------
        +D          PE++ MA  TDTK+   PI KSAI     + S+T+     EVIS                                            
Subjt:  DD----------PEVIMMADGTDTKNETSPISKSAIETTPLEPSRTLESVNVEVIS--------------------------------------------

Query:  ---------------ISTNDLESPAKSNLTEDVDCVNLDPSFKISPVSSPIEVASLDADPLMAVPYDPKTNYLSPRPQFLHYKP-RRINQHELDCGLEEL
                       +S N LES   SNLTE+VD VNLDPSF ISPVSSP+ +A +DADP++  PYDPKTNYLSPRPQFLHY P RRIN+   D   EEL
Subjt:  ---------------ISTNDLESPAKSNLTEDVDCVNLDPSFKISPVSSPIEVASLDADPLMAVPYDPKTNYLSPRPQFLHYKP-RRINQHELDCGLEEL

Query:  FSSTIVSESEFTEEIDSKDPLMDSDEAETESQTENEEEEEEEEEMDEIEKIVNEAKNSSKLHFSRMFKISSLLLILFTACFSICVVNVYDPNILERASLL
        FSS        +EEID +DP  +SDE  + ++++ +EEE+EEEE++  E+   E K SSKL  SR+ KISSLLLILFTAC SICVVNV+DP I +R+++L
Subjt:  FSSTIVSESEFTEEIDSKDPLMDSDEAETESQTENEEEEEEEEEMDEIEKIVNEAKNSSKLHFSRMFKISSLLLILFTACFSICVVNVYDPNILERASLL

Query:  TTEDSTEIFEFARTNFDVLLWKLEIWHENTKSFVSDVILNIR-ERPLIYLNQTEFLLKDFSPNGQCVLLSH----PKQEHMIKETEI-----DSFEDHNE
        TT D +EIFE A+TNF+VL+ KLEIWH N+ SF+SDV+ N R   PLI+LNQTE+   D + + QC++LS+     ++ +++   E         ++  E
Subjt:  TTEDSTEIFEFARTNFDVLLWKLEIWHENTKSFVSDVILNIR-ERPLIYLNQTEFLLKDFSPNGQCVLLSH----PKQEHMIKETEI-----DSFEDHNE

Query:  EQESLQDIETMKDSEIEVEPVERDAHTED-EEQLFHEI---------------------------------------------KDQESSLVQEIHGEPVE
        ++E  Q++E +K  EI ++ VE ++  E+ EE+ F EI                                             +D E+  V+EI  E VE
Subjt:  EQESLQDIETMKDSEIEVEPVERDAHTED-EEQLFHEI---------------------------------------------KDQESSLVQEIHGEPVE

Query:  RDPHIEE-EEQLIHEIEEKANESKDTEESLQEIIEDTSANSSSDELNEEERIQETQTEESYEVSSSSSNFIILDQIEQEPATLAESEEEHIDSIQQSNTE
        R+   EE EE+   + E KAN+ KD+EE       D ++  S  E+ EEE IQE +T E+++ +SSSS+F + DQIEQ  AT  E++EE       +NTE
Subjt:  RDPHIEE-EEQLIHEIEEKANESKDTEESLQEIIEDTSANSSSDELNEEERIQETQTEESYEVSSSSSNFIILDQIEQEPATLAESEEEHIDSIQQSNTE

Query:  FQYQSPPV-SPPSEPQSDVENENGENF-DLIKAATKISRDLTQNTAVIVSAILLGFTLI--AGLIYSRKPGSISSMAATATAEEQNEQPLLEEKQANQSS
        FQYQSPPV SPPSE QSDVE ENG    DLI+ AT ISRD TQNTA I+SAILLG  LI  AGLIY+RK GS  + +  A AEEQ E+PLL++K+ NQ S
Subjt:  FQYQSPPV-SPPSEPQSDVENENGENF-DLIKAATKISRDLTQNTAVIVSAILLGFTLI--AGLIYSRKPGSISSMAATATAEEQNEQPLLEEKQANQSS

Query:  IVREEEVEEEPIDDDD----QSCSYETSS-FQYSSMREDEPET-KRSNEARSHSH---------------------------GRKTRKISRRESMASSSL
        +V EEE EE P DDDD    + CS ETSS FQYSS+RE E ET KRSNE +SHSH                            RK R  + RESMASSSL
Subjt:  IVREEEVEEEPIDDDD----QSCSYETSS-FQYSSMREDEPET-KRSNEARSHSH---------------------------GRKTRKISRRESMASSSL

Query:  DEDSVLSISASPSYGSFTTYEKIPIKHANGDDDEIVTPVRRSSRIR
        D+ SV S SASPSYG FTTYEKIPIKH NG D+EIVTPVRRSSRIR
Subjt:  DEDSVLSISASPSYGSFTTYEKIPIKHANGDDDEIVTPVRRSSRIR

A0A6J1E9M3 uncharacterized protein LOC111430638 isoform X25.3e-19450.77Show/hide
Query:  MVLPSHISSSP--FSGRTSPNSRNSEIGSPIRRSFSGNPFPKPSIVTNPRSLNPTTPANSPSDY-PRRNSLSRESLFTRTDNEEKENGKDHSSKPARLRS
        M LPS+ SSSP   +GRTSP SRNSEIG+P+ RSFS NPF KPSI T+ RSLNP TPANSPSDY P+RNS+SRE LFT  DNE+KENGKD S K  R+RS
Subjt:  MVLPSHISSSP--FSGRTSPNSRNSEIGSPIRRSFSGNPFPKPSIVTNPRSLNPTTPANSPSDY-PRRNSLSRESLFTRTDNEEKENGKDHSSKPARLRS

Query:  PTVGKSTKNFMSPTISAASKFAASPKKKILGDLNDQVRSSLSFSTSKISSHNSLNQNQNPEASTAFDSDANPQVALNPISKSSKTVRFGGVEVISCSIDD
        PTVGKS KNFMS TISAASK A SPKKKILGD N+ VRSSLSFS  K SS NS+  N  PEAS AF+SD NP + L    KS+KTVRFGGVEVIS S +D
Subjt:  PTVGKSTKNFMSPTISAASKFAASPKKKILGDLNDQVRSSLSFSTSKISSHNSLNQNQNPEASTAFDSDANPQVALNPISKSSKTVRFGGVEVISCSIDD

Query:  ----------PEVIMMADGTDTKNETSPISKSAIETTPLEPSRTLESVNVEVIS----------------------------------------------
                  PE++ MA  TDTK+   PI KSAI     + S+T+     EVIS                                              
Subjt:  ----------PEVIMMADGTDTKNETSPISKSAIETTPLEPSRTLESVNVEVIS----------------------------------------------

Query:  -------------ISTNDLESPAKSNLTEDVDCVNLDPSFKISPVSSPIEVASLDADPLMAVPYDPKTNYLSPRPQFLHYKP-RRINQHELDCGLEELFS
                     +S N LES   SNLTE+VD VNLDPSF ISPVSSP+ +A +DADP++  PYDPKTNYLSPRPQFLHY P RRIN+   D   EELFS
Subjt:  -------------ISTNDLESPAKSNLTEDVDCVNLDPSFKISPVSSPIEVASLDADPLMAVPYDPKTNYLSPRPQFLHYKP-RRINQHELDCGLEELFS

Query:  STIVSESEFTEEIDSKDPLMDSDEAETESQTENEEEEEEEEEMDEIEKIVNEAKNSSKLHFSRMFKISSLLLILFTACFSICVVNVYDPNILERASLLTT
        S        +EEID +DP  +SDE  + ++++ +EEE+EEEE++  E+   E K SSKL  SR+ KISSLLLILFTAC SICVVNV+DP I +R+++LTT
Subjt:  STIVSESEFTEEIDSKDPLMDSDEAETESQTENEEEEEEEEEMDEIEKIVNEAKNSSKLHFSRMFKISSLLLILFTACFSICVVNVYDPNILERASLLTT

Query:  EDSTEIFEFARTNFDVLLWKLEIWHENTKSFVSDVILNIR-ERPLIYLNQTEFLLKDFSPNGQCVLLSH----PKQEHMIKETEI-----DSFEDHNEEQ
         D +EIFE A+TNF+VL+ KLEIWH N+ SF+SDV+ N R   PLI+LNQTE+   D + + QC++LS+     ++ +++   E         ++  E++
Subjt:  EDSTEIFEFARTNFDVLLWKLEIWHENTKSFVSDVILNIR-ERPLIYLNQTEFLLKDFSPNGQCVLLSH----PKQEHMIKETEI-----DSFEDHNEEQ

Query:  ESLQDIETMKDSEIEVEPVERDAHTED-EEQLFHEI---------------------------------------------KDQESSLVQEIHGEPVERD
        E  Q++E +K  EI ++ VE ++  E+ EE+ F EI                                             +D E+  V+EI  E VER+
Subjt:  ESLQDIETMKDSEIEVEPVERDAHTED-EEQLFHEI---------------------------------------------KDQESSLVQEIHGEPVERD

Query:  PHIEE-EEQLIHEIEEKANESKDTEESLQEIIEDTSANSSSDELNEEERIQETQTEESYEVSSSSSNFIILDQIEQEPATLAESEEEHIDSIQQSNTEFQ
           EE EE+   + E KAN+ KD+EE       D ++  S  E+ EEE IQE +T E+++ +SSSS+F + DQIEQ  AT  E++EE       +NTEFQ
Subjt:  PHIEE-EEQLIHEIEEKANESKDTEESLQEIIEDTSANSSSDELNEEERIQETQTEESYEVSSSSSNFIILDQIEQEPATLAESEEEHIDSIQQSNTEFQ

Query:  YQSPPV-SPPSEPQSDVENENGENF-DLIKAATKISRDLTQNTAVIVSAILLGFTLI--AGLIYSRKPGSISSMAATATAEEQNEQPLLEEKQANQSSIV
        YQSPPV SPPSE QSDVE ENG    DLI+ AT ISRD TQNTA I+SAILLG  LI  AGLIY+RK GS  + +  A AEEQ E+PLL++K+ NQ S+V
Subjt:  YQSPPV-SPPSEPQSDVENENGENF-DLIKAATKISRDLTQNTAVIVSAILLGFTLI--AGLIYSRKPGSISSMAATATAEEQNEQPLLEEKQANQSSIV

Query:  REEEVEEEPIDDDD----QSCSYETSS-FQYSSMREDEPET-KRSNEARSHSH---------------------------GRKTRKISRRESMASSSLDE
         EEE EE P DDDD    + CS ETSS FQYSS+RE E ET KRSNE +SHSH                            RK R  + RESMASSSLD+
Subjt:  REEEVEEEPIDDDD----QSCSYETSS-FQYSSMREDEPET-KRSNEARSHSH---------------------------GRKTRKISRRESMASSSLDE

Query:  DSVLSISASPSYGSFTTYEKIPIKHANGDDDEIVTPVRRSSRIR
         SV S SASPSYG FTTYEKIPIKH NG D+EIVTPVRRSSRIR
Subjt:  DSVLSISASPSYGSFTTYEKIPIKHANGDDDEIVTPVRRSSRIR

A0A6J1FNZ9 uncharacterized protein LOC1114475077.4e-20454.35Show/hide
Query:  MVLPSHISSSP--FSGRTSPNSRNSEIGSPIRRSFSGNPFPKPSIVTNPRSLNPTTPANSPSDYPRRNSLSRESLFTRTDNEEKENGKDHSSKPARLRSP
        M  PSH SSSP   +GR SPNSRNSEI +P RRSFS          + PRSLN  TP NSPSDYPRRNS SRE+LF   DNEEKENGK+ S KP R+RSP
Subjt:  MVLPSHISSSP--FSGRTSPNSRNSEIGSPIRRSFSGNPFPKPSIVTNPRSLNPTTPANSPSDYPRRNSLSRESLFTRTDNEEKENGKDHSSKPARLRSP

Query:  TVGKSTKNFMSPTISAASKFAASPKKKILGDLNDQVRSSLSFSTSKISSHNSLNQNQNPEASTAFDSDANPQVA--------------------------
          GKSTK+FMSPTISAASK + SPKKKILGD N+ VRSSLSFS  K SS NS+  N NPEAS A +SD N ++A                          
Subjt:  TVGKSTKNFMSPTISAASKFAASPKKKILGDLNDQVRSSLSFSTSKISSHNSLNQNQNPEASTAFDSDANPQVA--------------------------

Query:  --LNPISKS---------SKTVRFGGVEVISCSIDDPEVIMMADGTDTKNETSPISKSAIETTPLEPSRTLESVNVEVISISTNDLESPAKSNLTEDVDC
            PISKS         SKTV+ GG++VIS S  + EV+ MA  TD K E + IS SAI   P + S  +E  +VEV S   ND ESPAK+        
Subjt:  --LNPISKS---------SKTVRFGGVEVISCSIDDPEVIMMADGTDTKNETSPISKSAIETTPLEPSRTLESVNVEVISISTNDLESPAKSNLTEDVDC

Query:  VNLDPSFKISPVSSPIEVASLDADPLMAVPYDPKTNYLSPRPQFLHYKPRRINQHELDCGLEELFSSTIVSESEFTEEIDSKDPLMDSDEAETESQTENE
        V+L+ SFK S VSS +E+A LDADPLM  PYDPKTNYLSPRPQFLHYKPRRINQ ELD  LEELFS    SESEF+E  DS+DP M+SDEA +      E
Subjt:  VNLDPSFKISPVSSPIEVASLDADPLMAVPYDPKTNYLSPRPQFLHYKPRRINQHELDCGLEELFSSTIVSESEFTEEIDSKDPLMDSDEAETESQTENE

Query:  EEEEEEEEMDEI-----EKIVNEAKNSSKLHFSRMFKISSLLLILFTACFSICVVNVYDPNILERASL-LTTEDSTEIFEFARTNFDVLLWKLEIWHENT
        EE EE EE +E+     E+   EAK SSKLHFSR+FKISSLLLIL TACFSICVVNV+D   LERASL L  EDSTE+FEFA+TNF+VL+ K E+WH ++
Subjt:  EEEEEEEEMDEI-----EKIVNEAKNSSKLHFSRMFKISSLLLILFTACFSICVVNVYDPNILERASL-LTTEDSTEIFEFARTNFDVLLWKLEIWHENT

Query:  KSFVSDVILNI-RERPLIYLNQTEFLLKDFSPNGQCVLLSHPK----------QEHMIKETEIDSFE------DHNEEQES--LQDIETMKDSEIEVEPV
        + ++SD++ NI   RPLIYLNQT FL KD +   QC++LSH             E   KE EID  E      DHNEE+E   L++IE MK+ EI +E V
Subjt:  KSFVSDVILNI-RERPLIYLNQTEFLLKDFSPNGQCVLLSHPK----------QEHMIKETEIDSFE------DHNEEQES--LQDIETMKDSEIEVEPV

Query:  ERDAHTEDEEQLFHEIKDQESSLVQEIHGEPVERDPHIEEEEQLIHEIEEKANESKDTEESLQEIIEDTSANSSSDELNEEERIQETQTEESYEVSSSSS
        E +   E+E               QEI  +  +     EE  Q       +    +  E SLQEIIE+TS  S+SD+LNEE+ IQE QTEE+YE SSS  
Subjt:  ERDAHTEDEEQLFHEIKDQESSLVQEIHGEPVERDPHIEEEEQLIHEIEEKANESKDTEESLQEIIEDTSANSSSDELNEEERIQETQTEESYEVSSSSS

Query:  NFIILDQIEQEPATLAES-EEEHIDSIQQSNTEFQYQSPPVSPPSEPQSDVENENGENFDLIKAAT--KISRDLTQNTAVIVSAILLGFTLI--AGLIYS
           I DQIEQE AT  E+ EE+  DSIQQ N E Q QSPPVSPPS PQSD E+EN  N DL+  AT  +ISRD +QNTAVI SAILLG ++I  AGLIY+
Subjt:  NFIILDQIEQEPATLAES-EEEHIDSIQQSNTEFQYQSPPVSPPSEPQSDVENENGENFDLIKAAT--KISRDLTQNTAVIVSAILLGFTLI--AGLIYS

Query:  RKPGSISSMAATATAEEQNEQPLLEEKQANQSSIVREEE-------------------------------VEEEPIDDDDQS-------CSYETSSF-QY
        RK GS +S  A A AE Q E PLL+EK+  QS  V EEE                                EE+ I+DDD+        CS E SSF QY
Subjt:  RKPGSISSMAATATAEEQNEQPLLEEKQANQSSIVREEE-------------------------------VEEEPIDDDDQS-------CSYETSSF-QY

Query:  SSMR-EDEPET-KRSNEARSHSHGRKTRKISRRESMASSSLDEDSVLSISASPSYGSFTTYEKIPIKHANGDDDEIVTPVRRSSRIRK
        SSM+ EDE ET K   EA++HSHGRKTRK S R  MASSSLDE SV + SASPSYGSFTTYEKIPIKH N +++EIVTPVRRSSRIRK
Subjt:  SSMR-EDEPET-KRSNEARSHSHGRKTRKISRRESMASSSLDEDSVLSISASPSYGSFTTYEKIPIKHANGDDDEIVTPVRRSSRIRK

A0A6J1IJW9 uncharacterized protein LOC1114754557.4e-20454.19Show/hide
Query:  MVLPSHISSSP--FSGRTSPNSRNSEIGSPIRRSFSGNPFPKPSIVTNPRSLNPTTPANSPSDYPRRNSLSRESLFTRTDNEEKENGKDHSSKPARLRSP
        M  PSH SSSP   +GRTSPNSRNSEI +P RRSFS          +  RSLN  TP NSPSDYPRRNS SRE+LF   DNEEKENGK+ S KP R+RSP
Subjt:  MVLPSHISSSP--FSGRTSPNSRNSEIGSPIRRSFSGNPFPKPSIVTNPRSLNPTTPANSPSDYPRRNSLSRESLFTRTDNEEKENGKDHSSKPARLRSP

Query:  TVGKSTKNFMSPTISAASKFAASPKKKILGDLNDQVRSSLSFSTSKISSHNSLNQNQNPEASTAFDSDANPQVALNPISKSSKTVRFGGVEVISCSID--
          GKSTKNFMSPTISAASK + SPKKKILGD N+ VRSSLSFS  K SS NS+  N NPEAS A +SD N ++A     K SKTV+ GG EVI+ S    
Subjt:  TVGKSTKNFMSPTISAASKFAASPKKKILGDLNDQVRSSLSFSTSKISSHNSLNQNQNPEASTAFDSDANPQVALNPISKSSKTVRFGGVEVISCSID--

Query:  --DPEVIMMADGTDTKNETSPISKSAIETTPLEPSRTLESVNVEVI--------------------------------------------SISTNDLESP
          DPEV  +A  TDTK+E +PISKS I   PL  S+T++S  ++VI                                            S S ND ESP
Subjt:  --DPEVIMMADGTDTKNETSPISKSAIETTPLEPSRTLESVNVEVI--------------------------------------------SISTNDLESP

Query:  AKSNLTEDVDCVNLDPSFKISPVSSPIEVASLDADPLMAVPYDPKTNYLSPRPQFLHYKPRRINQHELDCGLEELFSSTIVSESEFTEEIDSKDPLMDSD
        AK++  E++D V L+ SFK S VSS +E+A LDADPLM  PYDPKTNYLSPRPQFLHYKPRRINQ ELD  LEELFS    SESEFTE  DS+DP M SD
Subjt:  AKSNLTEDVDCVNLDPSFKISPVSSPIEVASLDADPLMAVPYDPKTNYLSPRPQFLHYKPRRINQHELDCGLEELFSSTIVSESEFTEEIDSKDPLMDSD

Query:  EAETESQTENEEEEEEEEEMDEI-----EKIVNEAKNSSKLHFSRMFKISSLLLILFTACFSICVVNVYDPNILERAS-LLTTEDSTEIFEFARTNFDVL
        E  +      EEE EEEEE +E+     E+   EAK SSKLHFSR+FKISSLLLIL TACFSI VVNV+D   LERAS LL  EDSTE+FEFA+TNF+VL
Subjt:  EAETESQTENEEEEEEEEEMDEI-----EKIVNEAKNSSKLHFSRMFKISSLLLILFTACFSICVVNVYDPNILERAS-LLTTEDSTEIFEFARTNFDVL

Query:  LWKLEIWHENTKSFVSDVILNI-RERPLIYLNQTEFLLKDFSPNGQCVLLSHPK----------QEHMIKETEIDSFE------DHNEEQESL-QDIETM
        + K E+WH N++S++SD++ NI   RPLIYLNQT FL KD +   QC++LSH             E   KE EID  E      D NEE E L ++IE M
Subjt:  LWKLEIWHENTKSFVSDVILNI-RERPLIYLNQTEFLLKDFSPNGQCVLLSHPK----------QEHMIKETEIDSFE------DHNEEQESL-QDIETM

Query:  KDSEIEVEPVERDAHTEDEEQLFHEIKDQESSLVQEIHGEPVERDPHIEEEEQLIHEIEEKANE--SKDTEESLQEIIEDTSANSSSDELNEEERIQETQ
        K+ EI +E V+     ++EE+ F EI             E    DP   EEE      +  + E   ++ E SLQEIIE+TS  S+SD LNEE++IQE Q
Subjt:  KDSEIEVEPVERDAHTEDEEQLFHEIKDQESSLVQEIHGEPVERDPHIEEEEQLIHEIEEKANE--SKDTEESLQEIIEDTSANSSSDELNEEERIQETQ

Query:  TEESYEVSSSSSNFIILDQIEQEPATLAES-EEEHIDSIQQSNTEFQYQSPPVS-PPSEPQSDVENENGENFDLIKAATKISRDLTQNTAVIVSAILLGF
        TEE+YE SS+     I DQIEQE AT  E+ EE+  DSIQQSN E Q+QSPPVS PPS PQS+ E+ENG N    +   KISRD +QNTAVI SAILLG 
Subjt:  TEESYEVSSSSSNFIILDQIEQEPATLAES-EEEHIDSIQQSNTEFQYQSPPVS-PPSEPQSDVENENGENFDLIKAATKISRDLTQNTAVIVSAILLGF

Query:  TLI---AGLIYSRKPGSISSMAATATAEEQNEQPLLEEKQANQSSIVREE--------------------------------EVEEEPIDDDDQS-----
        ++I   AGLIY+RK GS  S  A A AE Q E PLL+EK+  QS  V EE                                EVEE    DDD+      
Subjt:  TLI---AGLIYSRKPGSISSMAATATAEEQNEQPLLEEKQANQSSIVREE--------------------------------EVEEEPIDDDDQS-----

Query:  --CSYETSSF-QYSSMR-EDEPET--KRSNEARSHSHGRKTRKISRRESMASSSLDEDSVLSISASPSYGSFTTYEKIPIKHANGDDDEIVTPVRRSSRI
          CS ETSSF QYSSM+ EDE ET  K   EA++HSHGRKTRK S R  MASSSLDE SV + SASPSYGSFTTYEKIPIKH N +++EIVTPVRRSSRI
Subjt:  --CSYETSSF-QYSSMR-EDEPET--KRSNEARSHSHGRKTRKISRRESMASSSLDEDSVLSISASPSYGSFTTYEKIPIKHANGDDDEIVTPVRRSSRI

Query:  RK
        RK
Subjt:  RK

A0A6J1JB72 uncharacterized protein LOC111482876 isoform X48.0e-19050.05Show/hide
Query:  MVLPSHISSSP--FSGRTSPNSRNSEIGSPIRRSFSGNPFPKPSIVTNPRSLNPTTPANSPS--DY-PRRNSLSRESLFTRTDNEEKENGKDHSSKPARL
        M LPS+ SSSP   +GRTSP SRNSEI +P+ RSFS NPF KPSI T+ +SLNP TPAN+PS  DY P+RNS+SRE LFT  DNE+KENGKD S K  R+
Subjt:  MVLPSHISSSP--FSGRTSPNSRNSEIGSPIRRSFSGNPFPKPSIVTNPRSLNPTTPANSPS--DY-PRRNSLSRESLFTRTDNEEKENGKDHSSKPARL

Query:  RSPTVGKSTKNFMSPTISAASKFAASPKKKILGDLNDQVRSSLSFSTSKISSHNSLNQNQNPEASTAFDSDANPQVALNPISKSSKTVRFGGVEVISCSI
        RSPTVGKS KNFMS TISAASK A SPKKKILGD N+ VRSSLSFS  K SS NS+  N  PEAS AF+SD NP + L    KS+KTVRFGGVEVIS S 
Subjt:  RSPTVGKSTKNFMSPTISAASKFAASPKKKILGDLNDQVRSSLSFSTSKISSHNSLNQNQNPEASTAFDSDANPQVALNPISKSSKTVRFGGVEVISCSI

Query:  DD----------PEVIMMADGTDTKNETSPISKSAIETTPLEPSRTLESVNVEVIS--------------------------------------------
        +D          PE++ +A  TD+K+   PI+KSAI     + S+T+     EVIS                                            
Subjt:  DD----------PEVIMMADGTDTKNETSPISKSAIETTPLEPSRTLESVNVEVIS--------------------------------------------

Query:  ---------------ISTNDLESPAKSNLTEDVDCVNLDPSFKISPVSSPIEVASLDADPLMAVPYDPKTNYLSPRPQFLHYKP-RRINQHELDCGLEEL
                       +S N LES   SN TE+VDCVNLDPSF ISPVSSP+ +A +DADP++  PYDPKTNYLSPRPQFLHY P RRIN+   D   EEL
Subjt:  ---------------ISTNDLESPAKSNLTEDVDCVNLDPSFKISPVSSPIEVASLDADPLMAVPYDPKTNYLSPRPQFLHYKP-RRINQHELDCGLEEL

Query:  FSSTIVSESEFTEEIDSKDPLMDSDEAETESQTENEEEEEEEEEMDEIEKIVNEAKNSSKLHFSRMFKISSLLLILFTACFSICVVNVYDPNILERASLL
        FS+        +EE D +DP  +SD  E  S     +EEE+EEE+D  E+   E K SSK   SR+FKISSLLLILFTAC SICVVNV+DP I ER++LL
Subjt:  FSSTIVSESEFTEEIDSKDPLMDSDEAETESQTENEEEEEEEEEMDEIEKIVNEAKNSSKLHFSRMFKISSLLLILFTACFSICVVNVYDPNILERASLL

Query:  TTEDSTEIFEFARTNFDVLLWKLEIWHENTKSFVSDVILNIR-ERPLIYLNQTEFLLKDFSPNGQCVLLSHPK--QEHMIKETEIDSFED------HNEE
        T  D +EIF  A+TNF+VL+ KLEIWH N+ SF+SDV+ N R   PLI+LNQTEF   D + + QC++LSH    +E       +++ +D        +E
Subjt:  TTEDSTEIFEFARTNFDVLLWKLEIWHENTKSFVSDVILNIR-ERPLIYLNQTEFLLKDFSPNGQCVLLSHPK--QEHMIKETEIDSFED------HNEE

Query:  QESLQDIETMKDSEIEVEPVERDAHTED-EEQLFHEIK----DQESS--------------LVQEIHG---------------------------EPVER
        QE     E +K  EI ++ VER++  E+ EEQ F EI+    D E+S              +++ I G                           E VER
Subjt:  QESLQDIETMKDSEIEVEPVERDAHTED-EEQLFHEIK----DQESS--------------LVQEIHG---------------------------EPVER

Query:  DPHIEE-EEQLIHEIEEKANESKDTEESLQEIIEDTSANSSSDELNEEERIQETQTEESYEVSSSSSNFIILDQIEQEPATLAESEEEHIDSIQQSNTEF
        +   EE EE+   + E KAN+ KD EE       D ++  S  E+ EEE +QE +T E+++ +SSSS+F + D+IEQ  AT  E++EE       +NTEF
Subjt:  DPHIEE-EEQLIHEIEEKANESKDTEESLQEIIEDTSANSSSDELNEEERIQETQTEESYEVSSSSSNFIILDQIEQEPATLAESEEEHIDSIQQSNTEF

Query:  QYQSPPV-SPPSEPQSDVENENGENF-DLIKAATKISRDLTQNTAVIVSAILLGFTLI--AGLIYSRKPGSISSMAATATAEEQNEQPLLEEKQANQSSI
        QYQSPPV SPPSE QSDVE ENG    DLI+ AT ISRD TQNTA I+SAILLG  LI  AGLIY+RK GS  + +  A AEEQ E+PLL++K+ NQS +
Subjt:  QYQSPPV-SPPSEPQSDVENENGENF-DLIKAATKISRDLTQNTAVIVSAILLGFTLI--AGLIYSRKPGSISSMAATATAEEQNEQPLLEEKQANQSSI

Query:  VREEEVEEEPIDDDD-----QSCSYETSS-FQYSSMREDEPE---------------------------TKRSNEARSHSHGRKTRKI--SRRESMASSS
          EEE EE+ +DDDD     + CS ETSS FQYSS+RE E E                             RS+E +SHSHGR+  ++  SRRESMASSS
Subjt:  VREEEVEEEPIDDDD-----QSCSYETSS-FQYSSMREDEPE---------------------------TKRSNEARSHSHGRKTRKI--SRRESMASSS

Query:  LDEDSVLSISASPSYGSFTTYEKIPIKHANGDDDEIVTPVRRSSRIR
        LDE SV S SASPSYG FTTYEKIPIKH NG D+EIVTPVRRSSRIR
Subjt:  LDEDSVLSISASPSYGSFTTYEKIPIKHANGDDDEIVTPVRRSSRIR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G16630.1 unknown protein1.1e-2626.91Show/hide
Query:  VLPSHI--SSSP-FSGRTSPNSRNSEIGSPIRRSFSGNPFPKPSIVTNPRSLNPTTPANSPSDYPRRNSLSRESLFTRTDNEEKENGKDHSSKPARLRSP
        V PS +  SSSP    R +P  RNSE G  +RRSF GNPF                     +D  RRNS+ RE    R +  +KEN  D       ++ P
Subjt:  VLPSHI--SSSP-FSGRTSPNSRNSEIGSPIRRSFSGNPFPKPSIVTNPRSLNPTTPANSPSDYPRRNSLSRESLFTRTDNEEKENGKDHSSKPARLRSP

Query:  TVGKSTKNFMSPTISAASKFAASPKKKILGDLNDQVRSSLSFSTSKISSHNSLNQNQNPEASTAFDSDANPQVALNPISKSSKTVRFGGVEVISCSIDDP
        T  K +K+FMSPTISA SK   SP+KKIL D N                          E S +FD   +     + +S S         +VIS   +D 
Subjt:  TVGKSTKNFMSPTISAASKFAASPKKKILGDLNDQVRSSLSFSTSKISSHNSLNQNQNPEASTAFDSDANPQVALNPISKSSKTVRFGGVEVISCSIDDP

Query:  EVIMMADGTDTKNETSPISKSAIETTPLEPSRTLESVNVEVISISTNDLESPAKSNLTEDVDCVNLDPSFKISPVSSPIEVASLDADPLMAVPYDPKTNY
        +V  +                      ++ ++ L       I++S  D     KSN   D     + P     P + P+   S + DP++A PYDPK NY
Subjt:  EVIMMADGTDTKNETSPISKSAIETTPLEPSRTLESVNVEVISISTNDLESPAKSNLTEDVDCVNLDPSFKISPVSSPIEVASLDADPLMAVPYDPKTNY

Query:  LSPRPQFLHYKPRRINQHELD-C-GLEELFSSTIVS-----------ESEFTEEIDSKDPLMDSDEAET-----------------ESQTENEEEEEEEE
        LSPRPQFLHYKP    +H  D C  LEELF S   S           E +  EE+ S++ ++  +E E                  E + E  E ++EEE
Subjt:  LSPRPQFLHYKPRRINQHELD-C-GLEELFSSTIVS-----------ESEFTEEIDSKDPLMDSDEAET-----------------ESQTENEEEEEEEE

Query:  EMDEIEKIVNEAKN--SSKLHFSRMFKISSLLLILFTACFSICVVNVYDPNILERASLLTTEDSTEIFEFARTNFDVLLWKLEIWHENTKSFVSDVILNI
        E+   E I  E  +  S +  FS+   +   +L L  A   +     +    +  +       S EI   A  NF+ L  KL +W E++  ++  ++ ++
Subjt:  EMDEIEKIVNEAKN--SSKLHFSRMFKISSLLLILFTACFSICVVNVYDPNILERASLLTTEDSTEIFEFARTNFDVLLWKLEIWHENTKSFVSDVILNI

Query:  RER----PLIYLNQTEFLLKDFSPNGQCVLLSHPKQEHMIKETEIDSFE--------DHNEEQESLQDIETMKDSEIEVEPVERDAHTEDEEQLFHEIKD
        RE     P  + N T  LL+D   +    +      E ++    +DS E         H E +E     E+    EI +E V    + ED+ ++  E ++
Subjt:  RER----PLIYLNQTEFLLKDFSPNGQCVLLSHPKQEHMIKETEIDSFE--------DHNEEQESLQDIETMKDSEIEVEPVERDAHTEDEEQLFHEIKD

Query:  QESSLVQEIHGE-------PVERDPHIEEEEQLIHEIEEKANESKDTEESLQEIIEDTSANSSSDELNEEERIQETQTEESYEVSSSSSNFIILDQIEQE
        +E  +  EI  E        +  D  +   E+    + E+ +  ++T     +++E       +D+ N EE   + Q  +  + ++ SSN     Q EQ 
Subjt:  QESSLVQEIHGE-------PVERDPHIEEEEQLIHEIEEKANESKDTEESLQEIIEDTSANSSSDELNEEERIQETQTEESYEVSSSSSNFIILDQIEQE

Query:  PATLAES--EEEHID-------SIQQSNTEFQYQSPPVSPPSEPQSDVENENGENFDLIKAATKISRDLTQNTAVIVSAILLGFTL----IAGLIYSRKP
             E+  EEE +        S+ +  T+ ++    V        +V N+ G   D++ +  K    L     VI++A+  GF L       ++   + 
Subjt:  PATLAES--EEEHID-------SIQQSNTEFQYQSPPVSPPSEPQSDVENENGENFDLIKAATKISRDLTQNTAVIVSAILLGFTL----IAGLIYSRKP

Query:  GSISSMAATATAEEQNEQPLLEEKQANQSSIVREEEVEEEPIDDDDQSCSYETS-SFQYSSMREDEPETKRSNEARSHSHGRKTRKISRRESMASSSLDE
        G  ++++AT   E    + L+ E+    SS+  +EE EE   D   +  S+ +  SF +S  +     + + ++ + H  G   +K +      +SS  E
Subjt:  GSISSMAATATAEEQNEQPLLEEKQANQSSIVREEEVEEEPIDDDDQSCSYETS-SFQYSSMREDEPETKRSNEARSHSHGRKTRKISRRESMASSSLDE

Query:  DSVLSISASPSYGSFTTYEKIPIKHANGDDDEIVTPVRRSSRIR
         S+ S+    SYGSFTTYEKI  K +  ++ E++TPVRRSSRIR
Subjt:  DSVLSISASPSYGSFTTYEKIPIKHANGDDDEIVTPVRRSSRIR

AT2G16270.1 unknown protein2.8e-3027.63Show/hide
Query:  SSPFSGRTSPNSRNSEIGSPIRRSFSGNPFPKPSIVTNPRSLNPTTPANSPSDYPRRNSLSRESLFTRTDNEEKENGKDHSSKPARLRSPTVGKSTKNFM
        S P   R +P  RNSE G P+RRSF GNPFP  S V            N PSD  RRNS   +              K++ +KP +L      K +KNFM
Subjt:  SSPFSGRTSPNSRNSEIGSPIRRSFSGNPFPKPSIVTNPRSLNPTTPANSPSDYPRRNSLSRESLFTRTDNEEKENGKDHSSKPARLRSPTVGKSTKNFM

Query:  SPTISAASKFAASPKKKILGDLNDQVRSSLSFSTSKISSHNSLNQNQNPEASTAFDSDANPQVALNPISKSSKTVRFGGVEVISCSIDDPEVIMMADGTD
        SPTISA SK  ASP+K++L D N+  RS        +   N  N +                       ++   V F  V    C IDD           
Subjt:  SPTISAASKFAASPKKKILGDLNDQVRSSLSFSTSKISSHNSLNQNQNPEASTAFDSDANPQVALNPISKSSKTVRFGGVEVISCSIDDPEVIMMADGTD

Query:  TKNETSPISKSAIETTPLEPSRTLESVNVEVISISTNDLESPAKSNLTEDVDCVNLDPSFKIS-----PVSSPIEVASLDADPLMAVPYDPKTNYLSPRP
                                E   VE   ++  D +   +  + E+      DP F+IS     P +SP E A+ + D L+  PYDPK N+LSPRP
Subjt:  TKNETSPISKSAIETTPLEPSRTLESVNVEVISISTNDLESPAKSNLTEDVDCVNLDPSFKIS-----PVSSPIEVASLDADPLMAVPYDPKTNYLSPRP

Query:  QFLHYKPR-RINQHELDC-GLEELF-SSTIVSESEFTEEIDSKDPLMDSDEAETESQTENEEEEEEEEEMDEIEKIVNEAKNS-SKLHFSRMFKISSLLL
        QFLHYKP  RI +   +C  LEELF S +   ++E + E   +     ++E   E +TE+ E+ E E + + + + V E  +   K   SR FK     L
Subjt:  QFLHYKPR-RINQHELDC-GLEELF-SSTIVSESEFTEEIDSKDPLMDSDEAETESQTENEEEEEEEEEMDEIEKIVNEAKNS-SKLHFSRMFKISSLLL

Query:  ILFTACFSICVVNVYDPNILERASLLTTEDSTEIFEFARTNFDVLLWKLEIWHENTKSFVSDVILNIRERPLIYLNQTEFLLKDFSPNGQCVLLSHPKQE
         L  A   + V   + P  L ++S        EI EFA+               N    +SD +  + E  L+Y+++               L+S   + 
Subjt:  ILFTACFSICVVNVYDPNILERASLLTTEDSTEIFEFARTNFDVLLWKLEIWHENTKSFVSDVILNIRERPLIYLNQTEFLLKDFSPNGQCVLLSHPKQE

Query:  HMIKETEIDSFEDHNEEQESLQDIETMKDSEIEV--EPVERDAHTEDEEQLFHEIKDQESSLVQEIHGEPVERDPHIEEEEQLIHEIEEKAN-------E
        +     E    + HN    +L+D    K + +E+  EP++ ++ +E+  +                 G   E +   EE  +++ + +E A        E
Subjt:  HMIKETEIDSFEDHNEEQESLQDIETMKDSEIEV--EPVERDAHTEDEEQLFHEIKDQESSLVQEIHGEPVERDPHIEEEEQLIHEIEEKAN-------E

Query:  SKDTEESLQEIIEDTSANSSSDELNEEERIQETQTEESYEVSSSSSNFIILDQIEQEPATLAESEEEHIDS---IQQSNTEFQY----------------
        S D E +L+ + ED     + +EL E E   E + E   ++  + S  I ++Q + E A +  +  +HI+S   + +S +E  +                
Subjt:  SKDTEESLQEIIEDTSANSSSDELNEEERIQETQTEESYEVSSSSSNFIILDQIEQEPATLAESEEEHIDS---IQQSNTEFQY----------------

Query:  --QSPPVSPPSEPQSDVENENGENFDL-IKAATKISRDLTQNTAVIVSAILLGFTLIAGLIYSRKPGSISSMAATATAEEQNEQPLLEEKQANQSSIVRE
          +    S   E   ++  E  ++  L ++++ K   D T+   +++S+ +L    +A  ++++K      +AAT  A E N +  L      + ++V+E
Subjt:  --QSPPVSPPSEPQSDVENENGENFDL-IKAATKISRDLTQNTAVIVSAILLGFTLIAGLIYSRKPGSISSMAATATAEEQNEQPLLEEKQANQSSIVRE

Query:  EEVE---EEPIDDDDQSCSYETSSFQYSSMREDEPETKRSNEARSHSHGRKTRKISRRESMASSSLDEDSVLSISASPSYGSFTTYEKIPIKHANGDDDE
        +      EE +DD        ++SFQ  S    EP++K   +  ++S   K     RRESMASS+    S  SI  S SYGSFTTYEKIPIK +  +++E
Subjt:  EEVE---EEPIDDDDQSCSYETSSFQYSSMREDEPETKRSNEARSHSHGRKTRKISRRESMASSSLDEDSVLSISASPSYGSFTTYEKIPIKHANGDDDE

Query:  IVTPVRRSSRIR
        ++TPVRRSSRI+
Subjt:  IVTPVRRSSRIR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGCTGCCGTCTCACATATCGTCTTCCCCGTTCTCCGGAAGAACTAGCCCTAATTCCAGAAACTCCGAAATCGGCAGCCCGATCCGTCGGAGCTTCTCGGGCAACCC
GTTTCCGAAGCCCTCGATCGTCACCAATCCGAGAAGCTTGAACCCTACCACTCCGGCAAATAGTCCCTCTGATTATCCAAGAAGAAATTCGCTAAGCAGAGAGAGTTTAT
TTACTCGTACTGATAATGAAGAGAAAGAAAATGGAAAGGATCATAGTTCGAAACCTGCTCGACTCCGTTCGCCGACGGTCGGAAAATCCACCAAGAACTTCATGTCTCCG
ACGATCTCTGCTGCCTCGAAGTTCGCCGCCTCTCCGAAGAAGAAAATTCTCGGTGATCTGAACGATCAAGTTCGGTCGTCCCTTTCGTTTTCCACCTCGAAAATCTCTTC
ACACAACTCTCTGAATCAAAATCAAAATCCAGAGGCCTCAACCGCATTTGATTCTGATGCAAATCCTCAAGTTGCTCTGAATCCAATCTCTAAATCATCTAAAACTGTGA
GATTCGGTGGAGTTGAAGTCATTTCTTGTTCCATTGATGATCCAGAAGTGATAATGATGGCAGATGGAACCGATACGAAGAATGAAACCTCTCCGATTTCGAAATCCGCC
ATTGAAACAACACCTCTTGAACCATCAAGGACTCTGGAATCTGTTAATGTCGAGGTAATCTCTATCTCAACTAACGATTTAGAGTCCCCAGCAAAGAGTAATTTGACTGA
AGATGTAGATTGTGTTAATCTTGATCCAAGTTTTAAGATCAGTCCTGTTTCTTCTCCAATCGAAGTAGCATCGCTCGATGCCGATCCATTAATGGCGGTTCCTTATGATC
CGAAAACCAATTACCTATCTCCAAGGCCACAATTCCTCCATTACAAACCAAGAAGAATCAATCAACACGAACTCGACTGTGGACTTGAGGAACTCTTTTCCTCTACCATT
GTTTCCGAGTCTGAATTCACAGAGGAAATTGACTCTAAAGATCCATTGATGGATTCTGATGAAGCTGAAACAGAATCGCAGACAGAAAATGAAGAAGAAGAAGAAGAAGA
AGAAGAAATGGATGAGATAGAAAAGATTGTTAATGAAGCTAAGAATTCATCTAAGCTGCACTTTTCCAGGATGTTCAAGATCAGTTCTCTGCTTTTGATTCTGTTTACTG
CTTGCTTTTCGATTTGTGTTGTTAATGTGTATGATCCGAATATCCTCGAGAGAGCAAGCTTGTTAACTACGGAGGATTCAACCGAGATTTTCGAGTTTGCGAGGACGAAT
TTCGATGTGTTGTTGTGGAAACTTGAGATTTGGCATGAAAACACCAAATCTTTTGTTTCTGATGTGATTCTCAACATCAGAGAGCGGCCATTGATTTATCTTAACCAAAC
TGAATTCTTACTCAAAGATTTCAGTCCAAATGGACAGTGTGTCCTATTATCTCATCCAAAACAAGAACATATGATAAAGGAGACAGAAATTGACAGTTTTGAAGATCATA
ATGAGGAACAAGAATCATTGCAAGATATTGAAACCATGAAGGATAGCGAAATCGAAGTCGAACCTGTCGAAAGAGATGCTCATACTGAAGATGAAGAACAATTGTTTCAT
GAAATTAAAGATCAAGAATCATCATTGGTTCAAGAGATTCACGGCGAACCTGTTGAAAGAGATCCACATATTGAAGAAGAAGAACAATTGATCCATGAAATTGAGGAAAA
AGCCAATGAATCAAAAGACACTGAAGAATCATTACAAGAGATCATTGAAGATACCTCTGCAAATTCATCTTCTGATGAACTCAACGAAGAAGAACGCATCCAGGAGACTC
AAACCGAAGAGAGTTACGAAGTTTCTTCGTCGTCGTCTAACTTCATAATCCTCGATCAGATCGAACAAGAACCTGCAACATTAGCCGAATCAGAGGAAGAACATATTGAT
TCCATTCAACAAAGCAACACGGAATTTCAATATCAATCACCTCCAGTTTCTCCTCCTTCTGAACCTCAATCAGATGTCGAAAACGAAAATGGCGAAAACTTCGATCTCAT
AAAAGCTGCAACCAAAATATCAAGAGATTTGACTCAAAACACCGCGGTTATAGTATCTGCAATACTGCTAGGCTTTACTCTAATTGCAGGACTGATTTACTCAAGAAAAC
CAGGCTCGATATCATCCATGGCGGCCACGGCCACTGCCGAAGAGCAAAATGAGCAGCCATTGCTGGAAGAGAAGCAGGCGAATCAGAGTTCGATCGTACGAGAGGAAGAA
GTAGAAGAAGAACCGATTGATGATGATGATCAATCTTGCTCTTATGAAACGAGCAGTTTCCAATACAGCAGCATGAGAGAAGACGAACCGGAAACGAAGAGATCGAACGA
AGCTCGGAGCCATAGCCATGGAAGGAAGACAAGGAAGATTTCGAGAAGAGAATCAATGGCTTCTTCTTCTTTGGATGAAGATTCAGTGTTGTCCATCTCTGCCTCTCCAT
CATATGGGAGTTTCACAACCTATGAGAAGATCCCAATCAAGCATGCGAATGGAGATGATGATGAGATCGTGACTCCAGTTAGGCGTTCTAGTAGAATTAGAAAAGCAATT
CAACCGTAG
mRNA sequenceShow/hide mRNA sequence
CGAACACCAAAAGCCCTTTCTCTCTCTCCCCTTAAATATTTTCGCCCACTGCCCATTCCTAATCCACATTTGAATCTCTCAAATTTCTCCCCATCTCTCCCAAATTTCCA
CTCAAGATCCATTTCCATGGTGCTGCCGTCTCACATATCGTCTTCCCCGTTCTCCGGAAGAACTAGCCCTAATTCCAGAAACTCCGAAATCGGCAGCCCGATCCGTCGGA
GCTTCTCGGGCAACCCGTTTCCGAAGCCCTCGATCGTCACCAATCCGAGAAGCTTGAACCCTACCACTCCGGCAAATAGTCCCTCTGATTATCCAAGAAGAAATTCGCTA
AGCAGAGAGAGTTTATTTACTCGTACTGATAATGAAGAGAAAGAAAATGGAAAGGATCATAGTTCGAAACCTGCTCGACTCCGTTCGCCGACGGTCGGAAAATCCACCAA
GAACTTCATGTCTCCGACGATCTCTGCTGCCTCGAAGTTCGCCGCCTCTCCGAAGAAGAAAATTCTCGGTGATCTGAACGATCAAGTTCGGTCGTCCCTTTCGTTTTCCA
CCTCGAAAATCTCTTCACACAACTCTCTGAATCAAAATCAAAATCCAGAGGCCTCAACCGCATTTGATTCTGATGCAAATCCTCAAGTTGCTCTGAATCCAATCTCTAAA
TCATCTAAAACTGTGAGATTCGGTGGAGTTGAAGTCATTTCTTGTTCCATTGATGATCCAGAAGTGATAATGATGGCAGATGGAACCGATACGAAGAATGAAACCTCTCC
GATTTCGAAATCCGCCATTGAAACAACACCTCTTGAACCATCAAGGACTCTGGAATCTGTTAATGTCGAGGTAATCTCTATCTCAACTAACGATTTAGAGTCCCCAGCAA
AGAGTAATTTGACTGAAGATGTAGATTGTGTTAATCTTGATCCAAGTTTTAAGATCAGTCCTGTTTCTTCTCCAATCGAAGTAGCATCGCTCGATGCCGATCCATTAATG
GCGGTTCCTTATGATCCGAAAACCAATTACCTATCTCCAAGGCCACAATTCCTCCATTACAAACCAAGAAGAATCAATCAACACGAACTCGACTGTGGACTTGAGGAACT
CTTTTCCTCTACCATTGTTTCCGAGTCTGAATTCACAGAGGAAATTGACTCTAAAGATCCATTGATGGATTCTGATGAAGCTGAAACAGAATCGCAGACAGAAAATGAAG
AAGAAGAAGAAGAAGAAGAAGAAATGGATGAGATAGAAAAGATTGTTAATGAAGCTAAGAATTCATCTAAGCTGCACTTTTCCAGGATGTTCAAGATCAGTTCTCTGCTT
TTGATTCTGTTTACTGCTTGCTTTTCGATTTGTGTTGTTAATGTGTATGATCCGAATATCCTCGAGAGAGCAAGCTTGTTAACTACGGAGGATTCAACCGAGATTTTCGA
GTTTGCGAGGACGAATTTCGATGTGTTGTTGTGGAAACTTGAGATTTGGCATGAAAACACCAAATCTTTTGTTTCTGATGTGATTCTCAACATCAGAGAGCGGCCATTGA
TTTATCTTAACCAAACTGAATTCTTACTCAAAGATTTCAGTCCAAATGGACAGTGTGTCCTATTATCTCATCCAAAACAAGAACATATGATAAAGGAGACAGAAATTGAC
AGTTTTGAAGATCATAATGAGGAACAAGAATCATTGCAAGATATTGAAACCATGAAGGATAGCGAAATCGAAGTCGAACCTGTCGAAAGAGATGCTCATACTGAAGATGA
AGAACAATTGTTTCATGAAATTAAAGATCAAGAATCATCATTGGTTCAAGAGATTCACGGCGAACCTGTTGAAAGAGATCCACATATTGAAGAAGAAGAACAATTGATCC
ATGAAATTGAGGAAAAAGCCAATGAATCAAAAGACACTGAAGAATCATTACAAGAGATCATTGAAGATACCTCTGCAAATTCATCTTCTGATGAACTCAACGAAGAAGAA
CGCATCCAGGAGACTCAAACCGAAGAGAGTTACGAAGTTTCTTCGTCGTCGTCTAACTTCATAATCCTCGATCAGATCGAACAAGAACCTGCAACATTAGCCGAATCAGA
GGAAGAACATATTGATTCCATTCAACAAAGCAACACGGAATTTCAATATCAATCACCTCCAGTTTCTCCTCCTTCTGAACCTCAATCAGATGTCGAAAACGAAAATGGCG
AAAACTTCGATCTCATAAAAGCTGCAACCAAAATATCAAGAGATTTGACTCAAAACACCGCGGTTATAGTATCTGCAATACTGCTAGGCTTTACTCTAATTGCAGGACTG
ATTTACTCAAGAAAACCAGGCTCGATATCATCCATGGCGGCCACGGCCACTGCCGAAGAGCAAAATGAGCAGCCATTGCTGGAAGAGAAGCAGGCGAATCAGAGTTCGAT
CGTACGAGAGGAAGAAGTAGAAGAAGAACCGATTGATGATGATGATCAATCTTGCTCTTATGAAACGAGCAGTTTCCAATACAGCAGCATGAGAGAAGACGAACCGGAAA
CGAAGAGATCGAACGAAGCTCGGAGCCATAGCCATGGAAGGAAGACAAGGAAGATTTCGAGAAGAGAATCAATGGCTTCTTCTTCTTTGGATGAAGATTCAGTGTTGTCC
ATCTCTGCCTCTCCATCATATGGGAGTTTCACAACCTATGAGAAGATCCCAATCAAGCATGCGAATGGAGATGATGATGAGATCGTGACTCCAGTTAGGCGTTCTAGTAG
AATTAGAAAAGCAATTCAACCGTAGCCAATTAATGCAGTAATGAAATTGGATTTTAAATTAGAACTTAGAGTTGACAAGATTGTGACTCTTATTAGATGTTATAGTAGTG
ATCGAAATGTGTGCTCCTGTGTTGAAAGTTGGAATATTTTGAGCATATATTTGAATTGTGTCACTAATGCATGCTATAGTAGAATTTGAAAATGCAACGCAACAAATAGT
TGCTGTTTGTTGTATGAGTTGAAGAAATGCAGGCTGAAATTTCTTT
Protein sequenceShow/hide protein sequence
MVLPSHISSSPFSGRTSPNSRNSEIGSPIRRSFSGNPFPKPSIVTNPRSLNPTTPANSPSDYPRRNSLSRESLFTRTDNEEKENGKDHSSKPARLRSPTVGKSTKNFMSP
TISAASKFAASPKKKILGDLNDQVRSSLSFSTSKISSHNSLNQNQNPEASTAFDSDANPQVALNPISKSSKTVRFGGVEVISCSIDDPEVIMMADGTDTKNETSPISKSA
IETTPLEPSRTLESVNVEVISISTNDLESPAKSNLTEDVDCVNLDPSFKISPVSSPIEVASLDADPLMAVPYDPKTNYLSPRPQFLHYKPRRINQHELDCGLEELFSSTI
VSESEFTEEIDSKDPLMDSDEAETESQTENEEEEEEEEEMDEIEKIVNEAKNSSKLHFSRMFKISSLLLILFTACFSICVVNVYDPNILERASLLTTEDSTEIFEFARTN
FDVLLWKLEIWHENTKSFVSDVILNIRERPLIYLNQTEFLLKDFSPNGQCVLLSHPKQEHMIKETEIDSFEDHNEEQESLQDIETMKDSEIEVEPVERDAHTEDEEQLFH
EIKDQESSLVQEIHGEPVERDPHIEEEEQLIHEIEEKANESKDTEESLQEIIEDTSANSSSDELNEEERIQETQTEESYEVSSSSSNFIILDQIEQEPATLAESEEEHID
SIQQSNTEFQYQSPPVSPPSEPQSDVENENGENFDLIKAATKISRDLTQNTAVIVSAILLGFTLIAGLIYSRKPGSISSMAATATAEEQNEQPLLEEKQANQSSIVREEE
VEEEPIDDDDQSCSYETSSFQYSSMREDEPETKRSNEARSHSHGRKTRKISRRESMASSSLDEDSVLSISASPSYGSFTTYEKIPIKHANGDDDEIVTPVRRSSRIRKAI
QP