; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr023564 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr023564
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionheat stress transcription factor A-4c-like
Genome locationtig00000892:4513675..4524432
RNA-Seq ExpressionSgr023564
SyntenySgr023564
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0034605 - cellular response to heat (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0008168 - methyltransferase activity (molecular function)
InterPro domainsIPR000232 - Heat shock factor (HSF)-type, DNA-binding
IPR001810 - F-box domain
IPR006566 - FBD domain
IPR027725 - Heat shock transcription factor family
IPR032675 - Leucine-rich repeat domain superfamily
IPR036047 - F-box-like domain superfamily
IPR036388 - Winged helix-like DNA-binding domain superfamily
IPR036390 - Winged helix DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0054510.1 F-box/FBD/LRR-repeat protein [Cucumis melo var. makuwa]6.0e-21374.41Show/hide
Query:  VMRSSKRLKVEEGQNDLVSQLPESVLVDILSYLPTKDTVKMALVPRFRHLWTFIRSLSFDQCAYHDHDSYDSAIYDGPNYDQRFLNLIRYVLILHERTTI
        +MR SK  K+EEGQND +SQLPESVLVDILSYLPTKD VKMAL+ RFR+LWT+I  LSFD+CAYHDH+ Y    YDGP+YD+RFLNLIR+VLILHERTTI
Subjt:  VMRSSKRLKVEEGQNDLVSQLPESVLVDILSYLPTKDTVKMALVPRFRHLWTFIRSLSFDQCAYHDHDSYDSAIYDGPNYDQRFLNLIRYVLILHERTTI

Query:  DEFHLKFAFNLFNVIHDDPDNSN-YAFKEKRMATELTTWIKFAVRKQVKVLDLDLLGCGLPEPEVSYELPNCVLTSNCLMELRLAGCGIKEKGRIQLKSL
        DEFHLKFAFNLFN IHDD  NS+ YA KE+RMA+ELTTWIKF++RKQVKVLD+DLLGCGL EPEV+YELP  +LT+N L EL L GCGI+EKG IQL  L
Subjt:  DEFHLKFAFNLFNVIHDDPDNSN-YAFKEKRMATELTTWIKFAVRKQVKVLDLDLLGCGLPEPEVSYELPNCVLTSNCLMELRLAGCGIKEKGRIQLKSL

Query:  TKLSLKEIMLSDKIMGEIFEGCPMLEELSLNGCSGLRKLQLTNTNIKRLNIFIGWRDEVANSRLEISCPSVKSLEIAGSIQLAYLKGSSSVFDASLCFSR
        +KLSLK+IMLSDKIMGEI  GCPMLEELSL+GC GL KL+L  TNIKRL I +GWR++++NSRLEISC  +KSLE AG+IQL  LK S+S+FDASL +SR
Subjt:  TKLSLKEIMLSDKIMGEIFEGCPMLEELSLNGCSGLRKLQLTNTNIKRLNIFIGWRDEVANSRLEISCPSVKSLEIAGSIQLAYLKGSSSVFDASLCFSR

Query:  SFKCERRIYEKVQMLLWKLAEANVFIPCTWTTLVFTIWELTYVPTPVTGWKSVEFRLLFRKWHIPGICSILKRSHWLETL-TFYIYPGLNSMFTNEEAKW
        +F CERRIYE+VQMLL KLAE            +FTIWELTYVP  VTGWKSVEFRLLF KWH+PGICSIL+ S+WLET+ TFYIYPG  S F  EEAKW
Subjt:  SFKCERRIYEKVQMLLWKLAEANVFIPCTWTTLVFTIWELTYVPTPVTGWKSVEFRLLFRKWHIPGICSILKRSHWLETL-TFYIYPGLNSMFTNEEAKW

Query:  MEAYDFDGKNYWKSQKGSLLCLTKHLKTVMIYGYVTEPHVFELIEFLLKNALVLEKMVISTKRTLWPIHQYEFFKDAVENQKDHFTYQELLEFSQKLLSL
        MEAYDFDGK+YWKS+ G    L K+LKTVMIYGYVTEP+V ELIEFLLKNALVLEKMVISTKRTL PIHQYE FKDAV +Q+D FT +ELL+FSQKLL+L
Subjt:  MEAYDFDGKNYWKSQKGSLLCLTKHLKTVMIYGYVTEPHVFELIEFLLKNALVLEKMVISTKRTLWPIHQYEFFKDAVENQKDHFTYQELLEFSQKLLSL

Query:  PRASRSAVIQFC
        PRAS+SAVIQFC
Subjt:  PRASRSAVIQFC

KAA8524482.1 hypothetical protein F0562_010905 [Nyssa sinensis]1.2e-14954.69Show/hide
Query:  TVMRSSKRLKVEEGQNDLVSQLPESVLVDILSYLPTKDTVKMALVPRFRHLWTFIRSLSFDQCAYHDHDSYDSAIYD-GPNYDQRFLNLIRYVLILHERT
        +V  S ++   EE Q D +S LPESV+  ILS+LPTKD VK  LVP F +LWT   +LSFD C+YH     D   YD GP Y++RFLNLIR+VL+LH+ +
Subjt:  TVMRSSKRLKVEEGQNDLVSQLPESVLVDILSYLPTKDTVKMALVPRFRHLWTFIRSLSFDQCAYHDHDSYDSAIYD-GPNYDQRFLNLIRYVLILHERT

Query:  TIDEFHLKFAFNLF-NVIHDDPDNSNYAFKEKRMATELTTWIKFAVRKQVKVLDLDLLGCGLPEPEVSYELPNCVLTSNCLMELRLAGCGIKEKGRIQLK
        TI +FHLKF F+LF ++  +  D+  YA KEKRMA E+TTWI F +R++VK LDLDLLGCG+ E   SY LPN VL ++ L++L LAG  I+  G+I LK
Subjt:  TIDEFHLKFAFNLF-NVIHDDPDNSNYAFKEKRMATELTTWIKFAVRKQVKVLDLDLLGCGLPEPEVSYELPNCVLTSNCLMELRLAGCGIKEKGRIQLK

Query:  SLTKLSLKEIMLSDKIMGEIFEGCPMLEELSLNGCSGLRKLQLTNTNIKRLNIFIGWRDEVANSRLEISCPSVKSLEIAGSIQLAYLKGSSSVFDASLCF
        SL  + LK + L+DKIMGEI  GCP+LEELSL  C GL KL +TN N K L +  G+R+    S LEISCP++KSL+IAGSI   YLK +SSV D S+ F
Subjt:  SLTKLSLKEIMLSDKIMGEIFEGCPMLEELSLNGCSGLRKLQLTNTNIKRLNIFIGWRDEVANSRLEISCPSVKSLEIAGSIQLAYLKGSSSVFDASLCF

Query:  SRSFKCERRIYEKVQMLLWKLAEANVFIPCTWTTLVFTIWELTYVPTPVTGWKSVEFRLLFRKWHIPGICSILKRSHWLETLTFYIYPGLNSMFTNEEAK
        S SFKC  +   +V++L  KL+ + +F PCTW  LV TIW+ T  P P   W+S+   +   KWH+PGI S+L+ S +LETLT YI PG   +F  ++  
Subjt:  SRSFKCERRIYEKVQMLLWKLAEANVFIPCTWTTLVFTIWELTYVPTPVTGWKSVEFRLLFRKWHIPGICSILKRSHWLETLTFYIYPGLNSMFTNEEAK

Query:  WMEAYDFDGKNYWKSQKGSLLCLTKHLKTVMIYGYVTEPHVFELIEFLLKNALVLEKMVISTKRTLWPIHQYEFFKDAVENQKDHFTYQELLEFSQKLLS
        W++AYDFD  NYW SQ+ +  CLT HLKT+MIYGY+TEP+V EL+EFLLK A VLEKMV ST +TL P H Y FF  AV+ QK HFT ++LL  SQ+LLS
Subjt:  WMEAYDFDGKNYWKSQKGSLLCLTKHLKTVMIYGYVTEPHVFELIEFLLKNALVLEKMVISTKRTLWPIHQYEFFKDAVENQKDHFTYQELLEFSQKLLS

Query:  LPRASRSAVIQF
        +PRAS  AVI F
Subjt:  LPRASRSAVIQF

OMO61486.1 Zinc finger, CCCH-type [Corchorus capsularis]9.1e-16135.55Show/hide
Query:  LMMERRASNSGVVRSSKRLK-SEEEQVDRISQLPESILVHILSFLPTKDVIKLERVPRFMHVWTYIRNLSFDQCQFHNHRTYRGGYYDGTQHDDRFLRLI
        ++ME    +S      KR + SEEE  DRIS LP++ILVHILSFLP KD +K   VPRF ++W ++ +LSF+QC +H+  +      DG  ++++FL  +
Subjt:  LMMERRASNSGVVRSSKRLK-SEEEQVDRISQLPESILVHILSFLPTKDVIKLERVPRFMHVWTYIRNLSFDQCQFHNHRTYRGGYYDGTQHDDRFLRLI

Query:  RYVLMLHESETIDEFHLKFGFNL-----LCVTYNELDGSDYASIEKNMALELSTWIMFVARKRVKVLDLNLLGCTLEEEVR-YKLPNIVLTSNDLMELML
        R+VL LHES TID FHLKF  NL           + +  DYA+ E  MA E+ +WI FV RK+VKVL+L+  GC L E    Y+LPN+V     L EL L
Subjt:  RYVLMLHESETIDEFHLKFGFNL-----LCVTYNELDGSDYASIEKNMALELSTWIMFVARKRVKVLDLNLLGCTLEEEVR-YKLPNIVLTSNDLMELML

Query:  AGCGIKEYGQIKLTSLRKLSLKEILLSENIMSKILKGCPLLEELSLIRCSGVHKLKLTNPRTKRLSISIGWRYEMMYSQLEIICPHITSLEISGPIQHVD
          C IK   +I+L  L++L L+E++L + I+++ILKGC LLEELSLIRC G+ +L   NP TK L + + +   M    L+++CP++ SLEI G I  VD
Subjt:  AGCGIKEYGQIKLTSLRKLSLKEILLSENIMSKILKGCPLLEELSLIRCSGVHKLKLTNPRTKRLSISIGWRYEMMYSQLEIICPHITSLEISGPIQHVD

Query:  LNASSSVVDASLYYKSRFFRFARKRYNRLRTLLWQFTQSGALVLCSSTILVQLSLLLSLVVKANTHVVIYAVLIVNLMVFTIWAMTNMQSPLFSSRSLVL
        L   SS+  +++ + S  F+   +RY  L+ L  + +     + C+  IL                            VFTIW +      LF  + L  
Subjt:  LNASSSVVDASLYYKSRFFRFARKRYNRLRTLLWQFTQSGALVLCSSTILVQLSLLLSLVVKANTHVVIYAVLIVNLMVFTIWAMTNMQSPLFSSRSLVL

Query:  RLRFTKWHLPGICNILRNSAFLETLTFCIYPGSHSIFSHNEASWIMEHNFDARDYWKSQEPGFPSVMLHLKTVKIYGYITEPLVFELIEFLLKKALVLEK
         +  TKWHLPGI  +LR S +LETL   I PG + IF+                                                              
Subjt:  RLRFTKWHLPGICNILRNSAFLETLTFCIYPGSHSIFSHNEASWIMEHNFDARDYWKSQEPGFPSVMLHLKTVKIYGYITEPLVFELIEFLLKKALVLEK

Query:  MVVSTKKILPFEDQDDRTAANVVSEINRLNSDKLFELSQKLLSLPRASTSAVIQFYPNDSNSKYVNSINNFEEKCNFVLLEPSAKILMEGTNDPTVMRSS
                                                                                             ++MEG +D     S 
Subjt:  MVVSTKKILPFEDQDDRTAANVVSEINRLNSDKLFELSQKLLSLPRASTSAVIQFYPNDSNSKYVNSINNFEEKCNFVLLEPSAKILMEGTNDPTVMRSS

Query:  KRLKVEEGQ-NDLVSQLPESVLVDILSYLPTKDTVKMALVPRFRHLWTFIRSLSFDQCAYHDHDSYDSAIYDGPNYDQRFLNLIRYVLILHERTTIDEFH
        KR +V E Q  D +S LP+ ++V ILS+LPT+D VK  L+ RF  LWTF+  LSFDQC+Y  HD  D    D   YD+RFL+ +  VLILHE  TID+F 
Subjt:  KRLKVEEGQ-NDLVSQLPESVLVDILSYLPTKDTVKMALVPRFRHLWTFIRSLSFDQCAYHDHDSYDSAIYDGPNYDQRFLNLIRYVLILHERTTIDEFH

Query:  LKFAFNLFNVIHDDPDNSNYAFKEKRMATELTTWIKFAVRKQVKVLDLDLLGCGLPEPEVSYELPNCVLTSNCLMELRLAGCGIKEKGRIQLKSLTKLSL
        LK  +N      ++  N N   +E R+  ++ +W+ FA+RK+V++L LD  GCG  EP+ +Y LP+ V+    L +L+L  C I   G+I+L +L +L L
Subjt:  LKFAFNLFNVIHDDPDNSNYAFKEKRMATELTTWIKFAVRKQVKVLDLDLLGCGLPEPEVSYELPNCVLTSNCLMELRLAGCGIKEKGRIQLKSLTKLSL

Query:  KEIMLSDKIMGEIFEGCPMLEELSLNGCSGLRKLQLTNTNIKRLNIFIGWRDEVANSRLEISCPSVKSLEIAGSIQLAYLKGSSSVFDASLCFSRSFKCE
        +EI L  +I+ +I  G  +LEELSL  C  L  L   N +IK  ++ + +    ++ RL+ISCP++ SL++AG I L  L   SS+  +S+ FS   KC 
Subjt:  KEIMLSDKIMGEIFEGCPMLEELSLNGCSGLRKLQLTNTNIKRLNIFIGWRDEVANSRLEISCPSVKSLEIAGSIQLAYLKGSSSVFDASLCFSRSFKCE

Query:  RRIYEKVQMLLWKLAEANVFIPCTWTTLVFTIWELTYVPTPVTGWKSVEFRLLFRKWHIPGICSILKRSHWLETLTFYIYPGLNSMFTNE-EAKWMEAYD
           +++++ +  KL+ + +   CTW+ +VFT W+L  +      WK +EF ++  KWH  GIC +L+ S +LETLT YI  GL        E +W E YD
Subjt:  RRIYEKVQMLLWKLAEANVFIPCTWTTLVFTIWELTYVPTPVTGWKSVEFRLLFRKWHIPGICSILKRSHWLETLTFYIYPGLNSMFTNE-EAKWMEAYD

Query:  FDGKNYWKSQKGSLLCLTKHLKTVMIYGYVTEPHVFELIEFLLKNALVLEKMVISTKRTLWPIHQYEFF
         D   +W SQ GS  CL  HLKT+ IYG +TEP++ + IEFLLK A+VLE +VIST   L P  Q E F
Subjt:  FDGKNYWKSQKGSLLCLTKHLKTVMIYGYVTEPHVFELIEFLLKNALVLEKMVISTKRTLWPIHQYEFF

OMO65640.1 hypothetical protein CCACVL1_21465 [Corchorus capsularis]6.9e-17737.22Show/hide
Query:  VVRSSKRLK-SEEEQVDRISQLPESILVHILSFLPTKDVIKLERVPRFMHVWTYIRNLSFDQCQFHNHRTYRGGYYDGTQHD-DRFLRLIRYVLMLHESE
        V  SSKR +  EEE VDRIS LP+S+LVHILS LPTKD ++   VPRF H+  ++  L+FD             +Y G ++    F   +R VL+ H++ 
Subjt:  VVRSSKRLK-SEEEQVDRISQLPESILVHILSFLPTKDVIKLERVPRFMHVWTYIRNLSFDQCQFHNHRTYRGGYYDGTQHD-DRFLRLIRYVLMLHESE

Query:  TIDEFHLKFGFNLLC----------------------VTYNELDGSD-----------YASIEKNMALELSTWIMFVARKRVKVLDLN-LLGCTLEEEVR
        TI++F LK   N L                       V +N+ D  D            A +EK  A E+ +WI    RK VKVL L+ LL    +    
Subjt:  TIDEFHLKFGFNLLC----------------------VTYNELDGSD-----------YASIEKNMALELSTWIMFVARKRVKVLDLN-LLGCTLEEEVR

Query:  YKLPNIVLTSNDLMELMLAGCGIKEYGQIKLTSLRKLSLKEILLSENIMSKILKGCPLLEELSLIRCSGVHKLKLTNPRTKRLSISIGWRYEMMYSQLEI
        Y+LP++V     L EL L  C IK  G+I+L  L++L LK+++L++  +++IL GC +LEELSLI C G+ +L   NP  K L ++    + +   ++EI
Subjt:  YKLPNIVLTSNDLMELMLAGCGIKEYGQIKLTSLRKLSLKEILLSENIMSKILKGCPLLEELSLIRCSGVHKLKLTNPRTKRLSISIGWRYEMMYSQLEI

Query:  ICPHITSLEISGPIQHVDLNASSSVVDASLYYKSRFFRFARKRYNRLRTLLWQFTQSGALVLCSSTILVQLSLLLSLVVKANTHVVIYAVLIVNLMVFTI
         CP+I SL+I+G    VDL   SSVVD+S+     +F  + + Y  L  L  +  +     LC+S IL                            V++I
Subjt:  ICPHITSLEISGPIQHVDLNASSSVVDASLYYKSRFFRFARKRYNRLRTLLWQFTQSGALVLCSSTILVQLSLLLSLVVKANTHVVIYAVLIVNLMVFTI

Query:  WAMTNMQSPLFSSRSLVLRLRFTKWHLPGICNILRNSAFLETLTFCIYPGSHSIFSHNEASWIMEHNFDARDYWKSQEPGFPSVMLHLKTVKIY-GYITE
        W + N    LF  + L   +  TKWHLPGI  +LR S  LETL   IYP     F  +   W  +++FD   +W+ QE  F  +   LKT+KIY GYITE
Subjt:  WAMTNMQSPLFSSRSLVLRLRFTKWHLPGICNILRNSAFLETLTFCIYPGSHSIFSHNEASWIMEHNFDARDYWKSQEPGFPSVMLHLKTVKIY-GYITE

Query:  PLVFELIEFLLKKALVLEKMVVSTKKILPFEDQDDRTAANVVSEINRLNSDKLFELSQKLLSLPRASTSAVIQFYPNDSNSKYVNSINNFEEKCNFVLLE
        P V + +EFLLK A VLEK+ +ST K L    Q   +   V      L +++  E SQKLLSLPRAS  AVI+                        L++
Subjt:  PLVFELIEFLLKKALVLEKMVVSTKKILPFEDQDDRTAANVVSEINRLNSDKLFELSQKLLSLPRASTSAVIQFYPNDSNSKYVNSINNFEEKCNFVLLE

Query:  PSAKILMEGTNDPTVMRSSKRLKVEEGQNDLVSQLPESVLVDILSYLPTKDTVKMALVPRFRHLWTFIRSLSFDQCAYHDHDSYDSAIYDGPNYDQRFLN
           +I+   +  P V+        EE   D +S LP+SV++ ILS+LPT+D V+   VP FRHLW F+  L+FD   Y D    +     G + ++ FL 
Subjt:  PSAKILMEGTNDPTVMRSSKRLKVEEGQNDLVSQLPESVLVDILSYLPTKDTVKMALVPRFRHLWTFIRSLSFDQCAYHDHDSYDSAIYDGPNYDQRFLN

Query:  LIRYVLILHERTTIDEFHLKFAFNLFNVIHDDPDNSNYAF----KEKRMATELTTWIKFAVRKQVKVLDLDLLGCG-LPEPEVSYELPNCVLTSNCLMEL
         IR+VL+ H+  TID+F LK   NL       P +  +A     +EK M++E+ +W+ FA+RK+VK LDL  +      EP+  Y LP+ V     L+++
Subjt:  LIRYVLILHERTTIDEFHLKFAFNLFNVIHDDPDNSNYAF----KEKRMATELTTWIKFAVRKQVKVLDLDLLGCG-LPEPEVSYELPNCVLTSNCLMEL

Query:  RLAGCGIKEKGRIQLKSLTKLSLKEIMLSDKIMGEIFEGCPMLEELSLNGCSGLRKLQLTNTNIKRLNIFIGWRDEVANSRLEISCPSVKSLEIAGSIQL
        +L GC I    RI+L  L +L +  ++++++ + +I  GC +LEELSL+ C GL KL   N +IKRL +F G   E    RLEISCP+++SL+I+G + L
Subjt:  RLAGCGIKEKGRIQLKSLTKLSLKEIMLSDKIMGEIFEGCPMLEELSLNGCSGLRKLQLTNTNIKRLNIFIGWRDEVANSRLEISCPSVKSLEIAGSIQL

Query:  AYLKGSSSVFDASLCFSRSFKCERRIYEKVQMLLWKLAEANVFIPCTWTTLVFTIWELTYVPTPVTGWKSVEFRLLFRKWHIPGICSILKRSHWLETLTF
          L   SSV D S+ F    + E R ++ ++    KL+   +F  C    LVFT W+L   P  + GWKS+EF+L   KW+ PGI  +L+ S  LETL  
Subjt:  AYLKGSSSVFDASLCFSRSFKCERRIYEKVQMLLWKLAEANVFIPCTWTTLVFTIWELTYVPTPVTGWKSVEFRLLFRKWHIPGICSILKRSHWLETLTF

Query:  YIYPGLNSMF--TNEEAKWMEAYDFDGKNYWKSQKGSLLCLTKHLKTVMIYGYVTEPHVFELIEFLLKNALVLEKMVISTKRTLWPIHQYEFFKDAVENQ
        YI       +   ++E +WM++YDFDG+ +W+SQ+G+  CL K+LK++ IYG+++EP+V ++IEFLLKNA+VLEK+ IS+++T  P     F K     Q
Subjt:  YIYPGLNSMF--TNEEAKWMEAYDFDGKNYWKSQKGSLLCLTKHLKTVMIYGYVTEPHVFELIEFLLKNALVLEKMVISTKRTLWPIHQYEFFKDAVENQ

Query:  KDHFTYQELLEFSQK
        K   T ++  EFSQK
Subjt:  KDHFTYQELLEFSQK

XP_004143230.1 putative F-box/LRR-repeat protein At3g18150 [Cucumis sativus]5.6e-22776.76Show/hide
Query:  VMRSSKRLKVEEGQNDLVSQLPESVLVDILSYLPTKDTVKMALVPRFRHLWTFIRSLSFDQCAYHDHDSYDSAIYDGPNYDQRFLNLIRYVLILHERTTI
        +MR SKR + EE +ND +SQLPESVLVDILSYLPT+D VKMAL+ RFR+LWT+I  LSFD+CAYHDH+ YD   YDGP+YD+RFLNLIR+VLILHERT I
Subjt:  VMRSSKRLKVEEGQNDLVSQLPESVLVDILSYLPTKDTVKMALVPRFRHLWTFIRSLSFDQCAYHDHDSYDSAIYDGPNYDQRFLNLIRYVLILHERTTI

Query:  DEFHLKFAFNLFNVIHDDPDNSN-YAFKEKRMATELTTWIKFAVRKQVKVLDLDLLGCGLPEPEVSYELPNCVLTSNCLMELRLAGCGIKEKGRIQLKSL
        DEFHLKFAFNLFN IHDD  NS+ YA KE+RMA+ELTTWIKF++RKQVKVLD+DLLGCGL EPEV+YELP  +LT+  L EL L GCGI+EKGRIQL SL
Subjt:  DEFHLKFAFNLFNVIHDDPDNSN-YAFKEKRMATELTTWIKFAVRKQVKVLDLDLLGCGLPEPEVSYELPNCVLTSNCLMELRLAGCGIKEKGRIQLKSL

Query:  TKLSLKEIMLSDKIMGEIFEGCPMLEELSLNGCSGLRKLQLTNTNIKRLNIFIGWRDEVANSRLEISCPSVKSLEIAGSIQLAYLKGSSSVFDASLCFSR
        +KLSLKEIMLSDKIMGEI  GCPMLEELSL+GC GL KL+LT +NIKRL I +GWR+E++NSRLEISCP +KSLE+AG+I L  LK S+S+FDASL +SR
Subjt:  TKLSLKEIMLSDKIMGEIFEGCPMLEELSLNGCSGLRKLQLTNTNIKRLNIFIGWRDEVANSRLEISCPSVKSLEIAGSIQLAYLKGSSSVFDASLCFSR

Query:  SFKCERRIYEKVQMLLWKLAEANVFIPCTWTTLVFTIWELTYVPTPVTGWKSVEFRLLFRKWHIPGICSILKRSHWLETLTFYIYPGLNSMFTNEEAKWM
        +F CERRIYEKVQMLLWKLAE +VFIPCTWTTL+FTIW+LTYVP PV GWKSVEFRLLF KWH+PGICSIL+ SHWLETLTFYIYPG  S F  EEAKWM
Subjt:  SFKCERRIYEKVQMLLWKLAEANVFIPCTWTTLVFTIWELTYVPTPVTGWKSVEFRLLFRKWHIPGICSILKRSHWLETLTFYIYPGLNSMFTNEEAKWM

Query:  EAYDFDGKNYWKSQKGSLL-CLTKHLKTVMIYGYVTEPHVFELIEFLLKNALVLEKMVISTKRTLWPIHQYEFFKDAVENQKDHFTYQELLEFSQKLLSL
        EAYDFDGK+YWKSQ G     L K+LKTV IYGYVTEP+V ELIEFLLKNALVLEKMVISTK+TL PIHQYE FKDAV +Q+D FT +ELL+FSQKLL+L
Subjt:  EAYDFDGKNYWKSQKGSLL-CLTKHLKTVMIYGYVTEPHVFELIEFLLKNALVLEKMVISTKRTLWPIHQYEFFKDAVENQKDHFTYQELLEFSQKLLSL

Query:  PRASRSAVIQFC
        PRAS+SAVIQFC
Subjt:  PRASRSAVIQFC

TrEMBL top hitse value%identityAlignment
A0A0A0KDY5 F-box domain-containing protein2.7e-22776.76Show/hide
Query:  VMRSSKRLKVEEGQNDLVSQLPESVLVDILSYLPTKDTVKMALVPRFRHLWTFIRSLSFDQCAYHDHDSYDSAIYDGPNYDQRFLNLIRYVLILHERTTI
        +MR SKR + EE +ND +SQLPESVLVDILSYLPT+D VKMAL+ RFR+LWT+I  LSFD+CAYHDH+ YD   YDGP+YD+RFLNLIR+VLILHERT I
Subjt:  VMRSSKRLKVEEGQNDLVSQLPESVLVDILSYLPTKDTVKMALVPRFRHLWTFIRSLSFDQCAYHDHDSYDSAIYDGPNYDQRFLNLIRYVLILHERTTI

Query:  DEFHLKFAFNLFNVIHDDPDNSN-YAFKEKRMATELTTWIKFAVRKQVKVLDLDLLGCGLPEPEVSYELPNCVLTSNCLMELRLAGCGIKEKGRIQLKSL
        DEFHLKFAFNLFN IHDD  NS+ YA KE+RMA+ELTTWIKF++RKQVKVLD+DLLGCGL EPEV+YELP  +LT+  L EL L GCGI+EKGRIQL SL
Subjt:  DEFHLKFAFNLFNVIHDDPDNSN-YAFKEKRMATELTTWIKFAVRKQVKVLDLDLLGCGLPEPEVSYELPNCVLTSNCLMELRLAGCGIKEKGRIQLKSL

Query:  TKLSLKEIMLSDKIMGEIFEGCPMLEELSLNGCSGLRKLQLTNTNIKRLNIFIGWRDEVANSRLEISCPSVKSLEIAGSIQLAYLKGSSSVFDASLCFSR
        +KLSLKEIMLSDKIMGEI  GCPMLEELSL+GC GL KL+LT +NIKRL I +GWR+E++NSRLEISCP +KSLE+AG+I L  LK S+S+FDASL +SR
Subjt:  TKLSLKEIMLSDKIMGEIFEGCPMLEELSLNGCSGLRKLQLTNTNIKRLNIFIGWRDEVANSRLEISCPSVKSLEIAGSIQLAYLKGSSSVFDASLCFSR

Query:  SFKCERRIYEKVQMLLWKLAEANVFIPCTWTTLVFTIWELTYVPTPVTGWKSVEFRLLFRKWHIPGICSILKRSHWLETLTFYIYPGLNSMFTNEEAKWM
        +F CERRIYEKVQMLLWKLAE +VFIPCTWTTL+FTIW+LTYVP PV GWKSVEFRLLF KWH+PGICSIL+ SHWLETLTFYIYPG  S F  EEAKWM
Subjt:  SFKCERRIYEKVQMLLWKLAEANVFIPCTWTTLVFTIWELTYVPTPVTGWKSVEFRLLFRKWHIPGICSILKRSHWLETLTFYIYPGLNSMFTNEEAKWM

Query:  EAYDFDGKNYWKSQKGSLL-CLTKHLKTVMIYGYVTEPHVFELIEFLLKNALVLEKMVISTKRTLWPIHQYEFFKDAVENQKDHFTYQELLEFSQKLLSL
        EAYDFDGK+YWKSQ G     L K+LKTV IYGYVTEP+V ELIEFLLKNALVLEKMVISTK+TL PIHQYE FKDAV +Q+D FT +ELL+FSQKLL+L
Subjt:  EAYDFDGKNYWKSQKGSLL-CLTKHLKTVMIYGYVTEPHVFELIEFLLKNALVLEKMVISTKRTLWPIHQYEFFKDAVENQKDHFTYQELLEFSQKLLSL

Query:  PRASRSAVIQFC
        PRAS+SAVIQFC
Subjt:  PRASRSAVIQFC

A0A1R3GTU6 Zinc finger, CCCH-type4.4e-16135.55Show/hide
Query:  LMMERRASNSGVVRSSKRLK-SEEEQVDRISQLPESILVHILSFLPTKDVIKLERVPRFMHVWTYIRNLSFDQCQFHNHRTYRGGYYDGTQHDDRFLRLI
        ++ME    +S      KR + SEEE  DRIS LP++ILVHILSFLP KD +K   VPRF ++W ++ +LSF+QC +H+  +      DG  ++++FL  +
Subjt:  LMMERRASNSGVVRSSKRLK-SEEEQVDRISQLPESILVHILSFLPTKDVIKLERVPRFMHVWTYIRNLSFDQCQFHNHRTYRGGYYDGTQHDDRFLRLI

Query:  RYVLMLHESETIDEFHLKFGFNL-----LCVTYNELDGSDYASIEKNMALELSTWIMFVARKRVKVLDLNLLGCTLEEEVR-YKLPNIVLTSNDLMELML
        R+VL LHES TID FHLKF  NL           + +  DYA+ E  MA E+ +WI FV RK+VKVL+L+  GC L E    Y+LPN+V     L EL L
Subjt:  RYVLMLHESETIDEFHLKFGFNL-----LCVTYNELDGSDYASIEKNMALELSTWIMFVARKRVKVLDLNLLGCTLEEEVR-YKLPNIVLTSNDLMELML

Query:  AGCGIKEYGQIKLTSLRKLSLKEILLSENIMSKILKGCPLLEELSLIRCSGVHKLKLTNPRTKRLSISIGWRYEMMYSQLEIICPHITSLEISGPIQHVD
          C IK   +I+L  L++L L+E++L + I+++ILKGC LLEELSLIRC G+ +L   NP TK L + + +   M    L+++CP++ SLEI G I  VD
Subjt:  AGCGIKEYGQIKLTSLRKLSLKEILLSENIMSKILKGCPLLEELSLIRCSGVHKLKLTNPRTKRLSISIGWRYEMMYSQLEIICPHITSLEISGPIQHVD

Query:  LNASSSVVDASLYYKSRFFRFARKRYNRLRTLLWQFTQSGALVLCSSTILVQLSLLLSLVVKANTHVVIYAVLIVNLMVFTIWAMTNMQSPLFSSRSLVL
        L   SS+  +++ + S  F+   +RY  L+ L  + +     + C+  IL                            VFTIW +      LF  + L  
Subjt:  LNASSSVVDASLYYKSRFFRFARKRYNRLRTLLWQFTQSGALVLCSSTILVQLSLLLSLVVKANTHVVIYAVLIVNLMVFTIWAMTNMQSPLFSSRSLVL

Query:  RLRFTKWHLPGICNILRNSAFLETLTFCIYPGSHSIFSHNEASWIMEHNFDARDYWKSQEPGFPSVMLHLKTVKIYGYITEPLVFELIEFLLKKALVLEK
         +  TKWHLPGI  +LR S +LETL   I PG + IF+                                                              
Subjt:  RLRFTKWHLPGICNILRNSAFLETLTFCIYPGSHSIFSHNEASWIMEHNFDARDYWKSQEPGFPSVMLHLKTVKIYGYITEPLVFELIEFLLKKALVLEK

Query:  MVVSTKKILPFEDQDDRTAANVVSEINRLNSDKLFELSQKLLSLPRASTSAVIQFYPNDSNSKYVNSINNFEEKCNFVLLEPSAKILMEGTNDPTVMRSS
                                                                                             ++MEG +D     S 
Subjt:  MVVSTKKILPFEDQDDRTAANVVSEINRLNSDKLFELSQKLLSLPRASTSAVIQFYPNDSNSKYVNSINNFEEKCNFVLLEPSAKILMEGTNDPTVMRSS

Query:  KRLKVEEGQ-NDLVSQLPESVLVDILSYLPTKDTVKMALVPRFRHLWTFIRSLSFDQCAYHDHDSYDSAIYDGPNYDQRFLNLIRYVLILHERTTIDEFH
        KR +V E Q  D +S LP+ ++V ILS+LPT+D VK  L+ RF  LWTF+  LSFDQC+Y  HD  D    D   YD+RFL+ +  VLILHE  TID+F 
Subjt:  KRLKVEEGQ-NDLVSQLPESVLVDILSYLPTKDTVKMALVPRFRHLWTFIRSLSFDQCAYHDHDSYDSAIYDGPNYDQRFLNLIRYVLILHERTTIDEFH

Query:  LKFAFNLFNVIHDDPDNSNYAFKEKRMATELTTWIKFAVRKQVKVLDLDLLGCGLPEPEVSYELPNCVLTSNCLMELRLAGCGIKEKGRIQLKSLTKLSL
        LK  +N      ++  N N   +E R+  ++ +W+ FA+RK+V++L LD  GCG  EP+ +Y LP+ V+    L +L+L  C I   G+I+L +L +L L
Subjt:  LKFAFNLFNVIHDDPDNSNYAFKEKRMATELTTWIKFAVRKQVKVLDLDLLGCGLPEPEVSYELPNCVLTSNCLMELRLAGCGIKEKGRIQLKSLTKLSL

Query:  KEIMLSDKIMGEIFEGCPMLEELSLNGCSGLRKLQLTNTNIKRLNIFIGWRDEVANSRLEISCPSVKSLEIAGSIQLAYLKGSSSVFDASLCFSRSFKCE
        +EI L  +I+ +I  G  +LEELSL  C  L  L   N +IK  ++ + +    ++ RL+ISCP++ SL++AG I L  L   SS+  +S+ FS   KC 
Subjt:  KEIMLSDKIMGEIFEGCPMLEELSLNGCSGLRKLQLTNTNIKRLNIFIGWRDEVANSRLEISCPSVKSLEIAGSIQLAYLKGSSSVFDASLCFSRSFKCE

Query:  RRIYEKVQMLLWKLAEANVFIPCTWTTLVFTIWELTYVPTPVTGWKSVEFRLLFRKWHIPGICSILKRSHWLETLTFYIYPGLNSMFTNE-EAKWMEAYD
           +++++ +  KL+ + +   CTW+ +VFT W+L  +      WK +EF ++  KWH  GIC +L+ S +LETLT YI  GL        E +W E YD
Subjt:  RRIYEKVQMLLWKLAEANVFIPCTWTTLVFTIWELTYVPTPVTGWKSVEFRLLFRKWHIPGICSILKRSHWLETLTFYIYPGLNSMFTNE-EAKWMEAYD

Query:  FDGKNYWKSQKGSLLCLTKHLKTVMIYGYVTEPHVFELIEFLLKNALVLEKMVISTKRTLWPIHQYEFF
         D   +W SQ GS  CL  HLKT+ IYG +TEP++ + IEFLLK A+VLE +VIST   L P  Q E F
Subjt:  FDGKNYWKSQKGSLLCLTKHLKTVMIYGYVTEPHVFELIEFLLKNALVLEKMVISTKRTLWPIHQYEFF

A0A1R3H5V3 Uncharacterized protein3.3e-17737.22Show/hide
Query:  VVRSSKRLK-SEEEQVDRISQLPESILVHILSFLPTKDVIKLERVPRFMHVWTYIRNLSFDQCQFHNHRTYRGGYYDGTQHD-DRFLRLIRYVLMLHESE
        V  SSKR +  EEE VDRIS LP+S+LVHILS LPTKD ++   VPRF H+  ++  L+FD             +Y G ++    F   +R VL+ H++ 
Subjt:  VVRSSKRLK-SEEEQVDRISQLPESILVHILSFLPTKDVIKLERVPRFMHVWTYIRNLSFDQCQFHNHRTYRGGYYDGTQHD-DRFLRLIRYVLMLHESE

Query:  TIDEFHLKFGFNLLC----------------------VTYNELDGSD-----------YASIEKNMALELSTWIMFVARKRVKVLDLN-LLGCTLEEEVR
        TI++F LK   N L                       V +N+ D  D            A +EK  A E+ +WI    RK VKVL L+ LL    +    
Subjt:  TIDEFHLKFGFNLLC----------------------VTYNELDGSD-----------YASIEKNMALELSTWIMFVARKRVKVLDLN-LLGCTLEEEVR

Query:  YKLPNIVLTSNDLMELMLAGCGIKEYGQIKLTSLRKLSLKEILLSENIMSKILKGCPLLEELSLIRCSGVHKLKLTNPRTKRLSISIGWRYEMMYSQLEI
        Y+LP++V     L EL L  C IK  G+I+L  L++L LK+++L++  +++IL GC +LEELSLI C G+ +L   NP  K L ++    + +   ++EI
Subjt:  YKLPNIVLTSNDLMELMLAGCGIKEYGQIKLTSLRKLSLKEILLSENIMSKILKGCPLLEELSLIRCSGVHKLKLTNPRTKRLSISIGWRYEMMYSQLEI

Query:  ICPHITSLEISGPIQHVDLNASSSVVDASLYYKSRFFRFARKRYNRLRTLLWQFTQSGALVLCSSTILVQLSLLLSLVVKANTHVVIYAVLIVNLMVFTI
         CP+I SL+I+G    VDL   SSVVD+S+     +F  + + Y  L  L  +  +     LC+S IL                            V++I
Subjt:  ICPHITSLEISGPIQHVDLNASSSVVDASLYYKSRFFRFARKRYNRLRTLLWQFTQSGALVLCSSTILVQLSLLLSLVVKANTHVVIYAVLIVNLMVFTI

Query:  WAMTNMQSPLFSSRSLVLRLRFTKWHLPGICNILRNSAFLETLTFCIYPGSHSIFSHNEASWIMEHNFDARDYWKSQEPGFPSVMLHLKTVKIY-GYITE
        W + N    LF  + L   +  TKWHLPGI  +LR S  LETL   IYP     F  +   W  +++FD   +W+ QE  F  +   LKT+KIY GYITE
Subjt:  WAMTNMQSPLFSSRSLVLRLRFTKWHLPGICNILRNSAFLETLTFCIYPGSHSIFSHNEASWIMEHNFDARDYWKSQEPGFPSVMLHLKTVKIY-GYITE

Query:  PLVFELIEFLLKKALVLEKMVVSTKKILPFEDQDDRTAANVVSEINRLNSDKLFELSQKLLSLPRASTSAVIQFYPNDSNSKYVNSINNFEEKCNFVLLE
        P V + +EFLLK A VLEK+ +ST K L    Q   +   V      L +++  E SQKLLSLPRAS  AVI+                        L++
Subjt:  PLVFELIEFLLKKALVLEKMVVSTKKILPFEDQDDRTAANVVSEINRLNSDKLFELSQKLLSLPRASTSAVIQFYPNDSNSKYVNSINNFEEKCNFVLLE

Query:  PSAKILMEGTNDPTVMRSSKRLKVEEGQNDLVSQLPESVLVDILSYLPTKDTVKMALVPRFRHLWTFIRSLSFDQCAYHDHDSYDSAIYDGPNYDQRFLN
           +I+   +  P V+        EE   D +S LP+SV++ ILS+LPT+D V+   VP FRHLW F+  L+FD   Y D    +     G + ++ FL 
Subjt:  PSAKILMEGTNDPTVMRSSKRLKVEEGQNDLVSQLPESVLVDILSYLPTKDTVKMALVPRFRHLWTFIRSLSFDQCAYHDHDSYDSAIYDGPNYDQRFLN

Query:  LIRYVLILHERTTIDEFHLKFAFNLFNVIHDDPDNSNYAF----KEKRMATELTTWIKFAVRKQVKVLDLDLLGCG-LPEPEVSYELPNCVLTSNCLMEL
         IR+VL+ H+  TID+F LK   NL       P +  +A     +EK M++E+ +W+ FA+RK+VK LDL  +      EP+  Y LP+ V     L+++
Subjt:  LIRYVLILHERTTIDEFHLKFAFNLFNVIHDDPDNSNYAF----KEKRMATELTTWIKFAVRKQVKVLDLDLLGCG-LPEPEVSYELPNCVLTSNCLMEL

Query:  RLAGCGIKEKGRIQLKSLTKLSLKEIMLSDKIMGEIFEGCPMLEELSLNGCSGLRKLQLTNTNIKRLNIFIGWRDEVANSRLEISCPSVKSLEIAGSIQL
        +L GC I    RI+L  L +L +  ++++++ + +I  GC +LEELSL+ C GL KL   N +IKRL +F G   E    RLEISCP+++SL+I+G + L
Subjt:  RLAGCGIKEKGRIQLKSLTKLSLKEIMLSDKIMGEIFEGCPMLEELSLNGCSGLRKLQLTNTNIKRLNIFIGWRDEVANSRLEISCPSVKSLEIAGSIQL

Query:  AYLKGSSSVFDASLCFSRSFKCERRIYEKVQMLLWKLAEANVFIPCTWTTLVFTIWELTYVPTPVTGWKSVEFRLLFRKWHIPGICSILKRSHWLETLTF
          L   SSV D S+ F    + E R ++ ++    KL+   +F  C    LVFT W+L   P  + GWKS+EF+L   KW+ PGI  +L+ S  LETL  
Subjt:  AYLKGSSSVFDASLCFSRSFKCERRIYEKVQMLLWKLAEANVFIPCTWTTLVFTIWELTYVPTPVTGWKSVEFRLLFRKWHIPGICSILKRSHWLETLTF

Query:  YIYPGLNSMF--TNEEAKWMEAYDFDGKNYWKSQKGSLLCLTKHLKTVMIYGYVTEPHVFELIEFLLKNALVLEKMVISTKRTLWPIHQYEFFKDAVENQ
        YI       +   ++E +WM++YDFDG+ +W+SQ+G+  CL K+LK++ IYG+++EP+V ++IEFLLKNA+VLEK+ IS+++T  P     F K     Q
Subjt:  YIYPGLNSMF--TNEEAKWMEAYDFDGKNYWKSQKGSLLCLTKHLKTVMIYGYVTEPHVFELIEFLLKNALVLEKMVISTKRTLWPIHQYEFFKDAVENQ

Query:  KDHFTYQELLEFSQK
        K   T ++  EFSQK
Subjt:  KDHFTYQELLEFSQK

A0A5A7UGX2 F-box/FBD/LRR-repeat protein2.9e-21374.41Show/hide
Query:  VMRSSKRLKVEEGQNDLVSQLPESVLVDILSYLPTKDTVKMALVPRFRHLWTFIRSLSFDQCAYHDHDSYDSAIYDGPNYDQRFLNLIRYVLILHERTTI
        +MR SK  K+EEGQND +SQLPESVLVDILSYLPTKD VKMAL+ RFR+LWT+I  LSFD+CAYHDH+ Y    YDGP+YD+RFLNLIR+VLILHERTTI
Subjt:  VMRSSKRLKVEEGQNDLVSQLPESVLVDILSYLPTKDTVKMALVPRFRHLWTFIRSLSFDQCAYHDHDSYDSAIYDGPNYDQRFLNLIRYVLILHERTTI

Query:  DEFHLKFAFNLFNVIHDDPDNSN-YAFKEKRMATELTTWIKFAVRKQVKVLDLDLLGCGLPEPEVSYELPNCVLTSNCLMELRLAGCGIKEKGRIQLKSL
        DEFHLKFAFNLFN IHDD  NS+ YA KE+RMA+ELTTWIKF++RKQVKVLD+DLLGCGL EPEV+YELP  +LT+N L EL L GCGI+EKG IQL  L
Subjt:  DEFHLKFAFNLFNVIHDDPDNSN-YAFKEKRMATELTTWIKFAVRKQVKVLDLDLLGCGLPEPEVSYELPNCVLTSNCLMELRLAGCGIKEKGRIQLKSL

Query:  TKLSLKEIMLSDKIMGEIFEGCPMLEELSLNGCSGLRKLQLTNTNIKRLNIFIGWRDEVANSRLEISCPSVKSLEIAGSIQLAYLKGSSSVFDASLCFSR
        +KLSLK+IMLSDKIMGEI  GCPMLEELSL+GC GL KL+L  TNIKRL I +GWR++++NSRLEISC  +KSLE AG+IQL  LK S+S+FDASL +SR
Subjt:  TKLSLKEIMLSDKIMGEIFEGCPMLEELSLNGCSGLRKLQLTNTNIKRLNIFIGWRDEVANSRLEISCPSVKSLEIAGSIQLAYLKGSSSVFDASLCFSR

Query:  SFKCERRIYEKVQMLLWKLAEANVFIPCTWTTLVFTIWELTYVPTPVTGWKSVEFRLLFRKWHIPGICSILKRSHWLETL-TFYIYPGLNSMFTNEEAKW
        +F CERRIYE+VQMLL KLAE            +FTIWELTYVP  VTGWKSVEFRLLF KWH+PGICSIL+ S+WLET+ TFYIYPG  S F  EEAKW
Subjt:  SFKCERRIYEKVQMLLWKLAEANVFIPCTWTTLVFTIWELTYVPTPVTGWKSVEFRLLFRKWHIPGICSILKRSHWLETL-TFYIYPGLNSMFTNEEAKW

Query:  MEAYDFDGKNYWKSQKGSLLCLTKHLKTVMIYGYVTEPHVFELIEFLLKNALVLEKMVISTKRTLWPIHQYEFFKDAVENQKDHFTYQELLEFSQKLLSL
        MEAYDFDGK+YWKS+ G    L K+LKTVMIYGYVTEP+V ELIEFLLKNALVLEKMVISTKRTL PIHQYE FKDAV +Q+D FT +ELL+FSQKLL+L
Subjt:  MEAYDFDGKNYWKSQKGSLLCLTKHLKTVMIYGYVTEPHVFELIEFLLKNALVLEKMVISTKRTLWPIHQYEFFKDAVENQKDHFTYQELLEFSQKLLSL

Query:  PRASRSAVIQFC
        PRAS+SAVIQFC
Subjt:  PRASRSAVIQFC

A0A5J5A3U5 FBD domain-containing protein5.9e-15054.69Show/hide
Query:  TVMRSSKRLKVEEGQNDLVSQLPESVLVDILSYLPTKDTVKMALVPRFRHLWTFIRSLSFDQCAYHDHDSYDSAIYD-GPNYDQRFLNLIRYVLILHERT
        +V  S ++   EE Q D +S LPESV+  ILS+LPTKD VK  LVP F +LWT   +LSFD C+YH     D   YD GP Y++RFLNLIR+VL+LH+ +
Subjt:  TVMRSSKRLKVEEGQNDLVSQLPESVLVDILSYLPTKDTVKMALVPRFRHLWTFIRSLSFDQCAYHDHDSYDSAIYD-GPNYDQRFLNLIRYVLILHERT

Query:  TIDEFHLKFAFNLF-NVIHDDPDNSNYAFKEKRMATELTTWIKFAVRKQVKVLDLDLLGCGLPEPEVSYELPNCVLTSNCLMELRLAGCGIKEKGRIQLK
        TI +FHLKF F+LF ++  +  D+  YA KEKRMA E+TTWI F +R++VK LDLDLLGCG+ E   SY LPN VL ++ L++L LAG  I+  G+I LK
Subjt:  TIDEFHLKFAFNLF-NVIHDDPDNSNYAFKEKRMATELTTWIKFAVRKQVKVLDLDLLGCGLPEPEVSYELPNCVLTSNCLMELRLAGCGIKEKGRIQLK

Query:  SLTKLSLKEIMLSDKIMGEIFEGCPMLEELSLNGCSGLRKLQLTNTNIKRLNIFIGWRDEVANSRLEISCPSVKSLEIAGSIQLAYLKGSSSVFDASLCF
        SL  + LK + L+DKIMGEI  GCP+LEELSL  C GL KL +TN N K L +  G+R+    S LEISCP++KSL+IAGSI   YLK +SSV D S+ F
Subjt:  SLTKLSLKEIMLSDKIMGEIFEGCPMLEELSLNGCSGLRKLQLTNTNIKRLNIFIGWRDEVANSRLEISCPSVKSLEIAGSIQLAYLKGSSSVFDASLCF

Query:  SRSFKCERRIYEKVQMLLWKLAEANVFIPCTWTTLVFTIWELTYVPTPVTGWKSVEFRLLFRKWHIPGICSILKRSHWLETLTFYIYPGLNSMFTNEEAK
        S SFKC  +   +V++L  KL+ + +F PCTW  LV TIW+ T  P P   W+S+   +   KWH+PGI S+L+ S +LETLT YI PG   +F  ++  
Subjt:  SRSFKCERRIYEKVQMLLWKLAEANVFIPCTWTTLVFTIWELTYVPTPVTGWKSVEFRLLFRKWHIPGICSILKRSHWLETLTFYIYPGLNSMFTNEEAK

Query:  WMEAYDFDGKNYWKSQKGSLLCLTKHLKTVMIYGYVTEPHVFELIEFLLKNALVLEKMVISTKRTLWPIHQYEFFKDAVENQKDHFTYQELLEFSQKLLS
        W++AYDFD  NYW SQ+ +  CLT HLKT+MIYGY+TEP+V EL+EFLLK A VLEKMV ST +TL P H Y FF  AV+ QK HFT ++LL  SQ+LLS
Subjt:  WMEAYDFDGKNYWKSQKGSLLCLTKHLKTVMIYGYVTEPHVFELIEFLLKNALVLEKMVISTKRTLWPIHQYEFFKDAVENQKDHFTYQELLEFSQKLLS

Query:  LPRASRSAVIQF
        +PRAS  AVI F
Subjt:  LPRASRSAVIQF

SwissProt top hitse value%identityAlignment
O49403 Heat stress transcription factor A-4a6.5e-5337.98Show/hide
Query:  SSGAPPPFLTKTYEMVDDPLTNSIVSWSQSGFSFVVWNPPEFAKDLLPIYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKSIHRRKPI
        SS + PPFLTKTYEMVDD  ++SIVSWSQS  SF+VWNPPEF++DLLP +FKHNNFSSF+RQLNTYGFRK D +QWEFAN+ F+RG+ HL+K+IHRRKP+
Subjt:  SSGAPPPFLTKTYEMVDDPLTNSIVSWSQSGFSFVVWNPPEFAKDLLPIYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKSIHRRKPI

Query:  YSHSHSHSQPTPSHGVGAPLSEPEREELEQQIKTLNQEKTSLQSQLQKHEREKEEIGRQINSLYQQLWRMGTQQKQLITMLAAQLQKQQPGFAEETT---
        +SHS  + Q   +     PL++ ER  +  QI+ L +EK  L  +L K + E+E    Q+  L ++L  M  +QK +++ ++  L+K  PG A   +   
Subjt:  YSHSHSHSQPTPSHGVGAPLSEPEREELEQQIKTLNQEKTSLQSQLQKHEREKEEIGRQINSLYQQLWRMGTQQKQLITMLAAQLQKQQPGFAEETT---

Query:  --HNNKKRRLSKL-----NDLLEDSSS--ELEKENNGRNIIIPPLELMGKLESSLGFCEDLLCN-----------VTQVLGEESK-------ACAVIT--
           N +KRR  ++       +LE++ +   + +E +           + +LESS+   E+L+ +           +T  + E S        +C  ++  
Subjt:  --HNNKKRRLSKL-----NDLLEDSSS--ELEKENNGRNIIIPPLELMGKLESSLGFCEDLLCN-----------VTQVLGEESK-------ACAVIT--

Query:  --KKETRGVGDVKVVASKEGEVRREHVLR------NGANDMFWEQFLTEIPGSSNAGEVCLNRRNN
           K       + +    +G   +  V         GAND FW+QF +E PGS+   EV L R+++
Subjt:  --KKETRGVGDVKVVASKEGEVRREHVLR------NGANDMFWEQFLTEIPGSSNAGEVCLNRRNN

P41153 Heat shock factor protein HSF81.7e-4848.9Show/hide
Query:  SSGAPPPFLTKTYEMVDDPLTNSIVSWSQSGFSFVVWNPPEFAKDLLPIYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKSIHRRKPI
        S+ APPPFL KTY+MVDDP T+ IVSWS +  SFVVW+PPEFAKDLLP YFKHNNFSSFVRQLNTYGFRK+D D+WEFANEGF+RG+ HLLKSI RRKP 
Subjt:  SSGAPPPFLTKTYEMVDDPLTNSIVSWSQSGFSFVVWNPPEFAKDLLPIYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKSIHRRKPI

Query:  YSHS------HSHSQ-----PTPSHGVGAPLSEPEREELEQQIKTLNQEKTSLQSQLQKHEREKEEIGRQINSLYQQLWRMGTQQKQLITMLA-------
        + H+      H H+Q     P  S  VGA + E  +  LE++++ L ++K  L  +L +  ++++    Q+  + Q+L  M  +Q+Q+++ LA       
Subjt:  YSHS------HSHSQ-----PTPSHGVGAPLSEPEREELEQQIKTLNQEKTSLQSQLQKHEREKEEIGRQINSLYQQLWRMGTQQKQLITMLA-------

Query:  --AQLQKQQPGFAEETTHNNKKRRLSK
          AQ  +QQ    +     +KKRR+ +
Subjt:  --AQLQKQQPGFAEETTHNNKKRRLSK

Q93VB5 Heat stress transcription factor A-4d3.8e-5348.91Show/hide
Query:  GSEGSSGAPPPFLTKTYEMVDDPLTNSIVSWSQSGFSFVVWNPPEFAKDLLPIYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKSIHR
        G  G  G PPPFL KTYEMV+D  TN +VSW   G SFVVWNP +F++DLLP YFKHNNFSSF+RQLNTYGFRKID ++WEFANE FIRG THLLK+IHR
Subjt:  GSEGSSGAPPPFLTKTYEMVDDPLTNSIVSWSQSGFSFVVWNPPEFAKDLLPIYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKSIHR

Query:  RKPIYSHSHSHSQPTPSHGVGAPLSEPEREELEQQIKTLNQEKTSLQSQLQKHEREKEEIGRQINSLYQQLWRMGTQQKQLITMLAAQLQKQQPGFAE--
        RKP++SHS  +        +  PL+E ER ELE++I  L  EK+ L + LQ+  +++  I  Q+ ++  +L  M  +QK ++  L   LQ++    +   
Subjt:  RKPIYSHSHSHSQPTPSHGVGAPLSEPEREELEQQIKTLNQEKTSLQSQLQKHEREKEEIGRQINSLYQQLWRMGTQQKQLITMLAAQLQKQQPGFAE--

Query:  -ETTHNNKKRRLSKLNDLLEDSSSELEKE
         E+ H +KKRR+ K++  ++D ++  E++
Subjt:  -ETTHNNKKRRLSKLNDLLEDSSSELEKE

Q94J16 Heat stress transcription factor A-4b5.0e-5335.68Show/hide
Query:  MDGSEGSSGAPPPFLTKTYEMVDDPLTNSIVSWSQSGFSFVVWNPPEFAKDLLPIYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKSI
        M+G  G  G+ PPFL+KTYEMVDDP T+++V W+ +G SFVV N PEF +DLLP YFKHNNFSSFVRQLNTYGFRK+D +QWEFANE FI+G+ H LK+I
Subjt:  MDGSEGSSGAPPPFLTKTYEMVDDPLTNSIVSWSQSGFSFVVWNPPEFAKDLLPIYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKSI

Query:  HRRKPIYSHSHSHSQPTPSHGVGAPLSEPEREELEQQIKTLNQEKTSLQSQLQKHEREKEEIGRQINSLYQQLWRMGTQQKQLITMLAAQLQKQQPG---
        HRRKPI+SHS SHSQ     G G PL++ ER++ E++I+ L  +  +L S+LQ +  +K  + +++ +L ++L+ +  QQ+ LI+ +   +  + PG   
Subjt:  HRRKPIYSHSHSHSQPTPSHGVGAPLSEPEREELEQQIKTLNQEKTSLQSQLQKHEREKEEIGRQINSLYQQLWRMGTQQKQLITMLAAQLQKQQPG---

Query:  -FAEETTHNNKKRRLSKLNDLLEDSSSELEKENNGRNIIIPPL-----ELMGKLESSLGFCEDLLCNVTQVLGEE---------SKACAVITKKETRGVG
         F ++  H+ KKRRL       ED++++ E +    ++   P      E   K+ESSL   E+ L   ++  G +           +  V+T+  + G  
Subjt:  -FAEETTHNNKKRRLSKLNDLLEDSSSELEKENNGRNIIIPPL-----ELMGKLESSLGFCEDLLCNVTQVLGEE---------SKACAVITKKETRGVG

Query:  DVKVVA--------------------------------------------------------------SKEGEVRREHVLRNGANDMFWEQFLTEIPGSS
        D +V +                                                              S++        +  GAND FW+QFLTE PGSS
Subjt:  DVKVVA--------------------------------------------------------------SKEGEVRREHVLRNGANDMFWEQFLTEIPGSS

Query:  NAGEVCLNRRNN
        +A +   + R +
Subjt:  NAGEVCLNRRNN

Q9FK72 Heat stress transcription factor A-4c3.0e-5840.65Show/hide
Query:  MDGSEGSSGAPPPFLTKTYEMVDDPLTNSIVSWSQSGFSFVVWNPPEFAKDLLPIYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKSI
        MD + G S + PPFLTKTYEMVDD  ++S+V+WS++  SF+V NP EF++DLLP +FKH NFSSF+RQLNTYGFRK+D ++WEF N+ F+RGR +L+K+I
Subjt:  MDGSEGSSGAPPPFLTKTYEMVDDPLTNSIVSWSQSGFSFVVWNPPEFAKDLLPIYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKSI

Query:  HRRKPIYSHSHSHSQPTPSHGVGAPLSEPEREELEQQIKTLNQEKTSLQSQLQKHEREKEEIGRQINSLYQQLWRMGTQQKQLITMLAAQLQKQQPGFAE
        HRRKP++SHS  + Q         PL+E ER  +E QI+ L  EK  L ++LQ  E+E++E   Q+ +L  +L  M   QK ++  ++  L K  PG + 
Subjt:  HRRKPIYSHSHSHSQPTPSHGVGAPLSEPEREELEQQIKTLNQEKTSLQSQLQKHEREKEEIGRQINSLYQQLWRMGTQQKQLITMLAAQLQKQQPGFAE

Query:  ETTHNNKKRRLSKLNDLLEDSSSELEKENNGRNIIIPPLELMGKLESSLGFCEDLLCNVTQVLGEESKACAVITKKETRGVGDVKVVASKEGEVRREHVL
           ++ +++R  + N L   SS                +E + KLESSL F E+L+    +  G +S +      + +  +GD +  +SK        V 
Subjt:  ETTHNNKKRRLSKLNDLLEDSSSELEKENNGRNIIIPPLELMGKLESSLGFCEDLLCNVTQVLGEESKACAVITKKETRGVGDVKVVASKEGEVRREHVL

Query:  ------RNGANDMFWEQFLTEIPGSSNAGEVCLNRRN
              + G ND FWEQ LTE PGS+   EV   RR+
Subjt:  ------RNGANDMFWEQFLTEIPGSSNAGEVCLNRRN

Arabidopsis top hitse value%identityAlignment
AT1G32330.1 heat shock transcription factor A1D1.1e-4748.13Show/hide
Query:  SSGAPPPFLTKTYEMVDDPLTNSIVSWSQSGFSFVVWNPPEFAKDLLPIYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKSIHRRKPI
        SS APPPFL+KTY+MVDD  T+SIVSWS +  SF+VW PPEFA+DLLP  FKHNNFSSFVRQLNTYGFRK+D D+WEFANEGF+RG+ HLL+SI RRKP 
Subjt:  SSGAPPPFLTKTYEMVDDPLTNSIVSWSQSGFSFVVWNPPEFAKDLLPIYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKSIHRRKPI

Query:  YSHSHSHSQPTPSHGVGAPLS---EPEREELEQQIKTLNQEKTSLQSQLQKHEREKEEIGRQINSLYQQLWRMGTQQKQLITMLAAQLQKQQPGF-----
        +     H +   S+G  + +S   E  +  LE++++ L ++K  L  +L +  ++++    Q+ ++ Q+L  M  +Q+QL++ LA  +  Q P F     
Subjt:  YSHSHSHSQPTPSHGVGAPLS---EPEREELEQQIKTLNQEKTSLQSQLQKHEREKEEIGRQINSLYQQLWRMGTQQKQLITMLAAQLQKQQPGF-----

Query:  AEETTHNNKKRRLS
         ++   N   RR+S
Subjt:  AEETTHNNKKRRLS

AT4G17750.1 heat shock factor 13.5e-4643.67Show/hide
Query:  SSGAPPPFLTKTYEMVDDPLTNSIVSWSQSGFSFVVWNPPEFAKDLLPIYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKSIHRRKPI
        ++  PPPFL+KTY+MV+DP T++IVSWS +  SF+VW+PPEF++DLLP YFKHNNFSSFVRQLNTYGFRK+D D+WEFANEGF+RG+ HLLK I RRK +
Subjt:  SSGAPPPFLTKTYEMVDDPLTNSIVSWSQSGFSFVVWNPPEFAKDLLPIYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKSIHRRKPI

Query:  YSHSHSHSQP-----TPSHGVGAPLS---EPEREELEQQIKTLNQEKTSLQSQLQKHEREKEEIGRQINSLYQQLWRMGTQQKQLITMLAAQLQ------
          H  S S P     +   G  A LS   E  +  LE++++ L ++K  L  +L K  ++++    ++  L + L  M  +Q+Q+++ LA  +Q      
Subjt:  YSHSHSHSQP-----TPSHGVGAPLS---EPEREELEQQIKTLNQEKTSLQSQLQKHEREKEEIGRQINSLYQQLWRMGTQQKQLITMLAAQLQ------

Query:  ---KQQPGFAEETTHNNKKRRLSKLNDLLEDSSSELEKENNGRNI
           ++Q       T  NKKRRL       EDS++  E  ++  ++
Subjt:  ---KQQPGFAEETTHNNKKRRLSKLNDLLEDSSSELEKENNGRNI

AT4G18880.1 heat shock transcription factor A4A4.6e-5437.98Show/hide
Query:  SSGAPPPFLTKTYEMVDDPLTNSIVSWSQSGFSFVVWNPPEFAKDLLPIYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKSIHRRKPI
        SS + PPFLTKTYEMVDD  ++SIVSWSQS  SF+VWNPPEF++DLLP +FKHNNFSSF+RQLNTYGFRK D +QWEFAN+ F+RG+ HL+K+IHRRKP+
Subjt:  SSGAPPPFLTKTYEMVDDPLTNSIVSWSQSGFSFVVWNPPEFAKDLLPIYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKSIHRRKPI

Query:  YSHSHSHSQPTPSHGVGAPLSEPEREELEQQIKTLNQEKTSLQSQLQKHEREKEEIGRQINSLYQQLWRMGTQQKQLITMLAAQLQKQQPGFAEETT---
        +SHS  + Q   +     PL++ ER  +  QI+ L +EK  L  +L K + E+E    Q+  L ++L  M  +QK +++ ++  L+K  PG A   +   
Subjt:  YSHSHSHSQPTPSHGVGAPLSEPEREELEQQIKTLNQEKTSLQSQLQKHEREKEEIGRQINSLYQQLWRMGTQQKQLITMLAAQLQKQQPGFAEETT---

Query:  --HNNKKRRLSKL-----NDLLEDSSS--ELEKENNGRNIIIPPLELMGKLESSLGFCEDLLCN-----------VTQVLGEESK-------ACAVIT--
           N +KRR  ++       +LE++ +   + +E +           + +LESS+   E+L+ +           +T  + E S        +C  ++  
Subjt:  --HNNKKRRLSKL-----NDLLEDSSS--ELEKENNGRNIIIPPLELMGKLESSLGFCEDLLCN-----------VTQVLGEESK-------ACAVIT--

Query:  --KKETRGVGDVKVVASKEGEVRREHVLR------NGANDMFWEQFLTEIPGSSNAGEVCLNRRNN
           K       + +    +G   +  V         GAND FW+QF +E PGS+   EV L R+++
Subjt:  --KKETRGVGDVKVVASKEGEVRREHVLR------NGANDMFWEQFLTEIPGSSNAGEVCLNRRNN

AT5G16820.1 heat shock factor 35.1e-4545.11Show/hide
Query:  PPFLTKTYEMVDDPLTNSIVSWSQSGFSFVVWNPPEFAKDLLPIYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKSIHRRKPIYSHSH
        PPFL+KTY+MVDDPLTN +VSWS    SFVVW+ PEF+K LLP YFKHNNFSSFVRQLNTYGFRK+D D+WEFANEGF+RGR  LLKSI RRKP +   +
Subjt:  PPFLTKTYEMVDDPLTNSIVSWSQSGFSFVVWNPPEFAKDLLPIYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKSIHRRKPIYSHSH

Query:  SHSQPTPSHGVGAPLSEPEREELEQQIKTLNQEKTSLQSQLQKHEREKEEIGRQINSLYQQLWRMGTQQKQLITMLAAQLQKQQPGFAEETTH-------
               S  VGA + E  +  +E++++ L ++K  L  +L +  ++++    Q+ ++ Q++  M  +Q+Q+++ LA  +  Q PGF  +          
Subjt:  SHSQPTPSHGVGAPLSEPEREELEQQIKTLNQEKTSLQSQLQKHEREKEEIGRQINSLYQQLWRMGTQQKQLITMLAAQLQKQQPGFAEETTH-------

Query:  -----NNKKRRLSKLNDLLEDSSSELEKENNGRNI
             +NKKRRL              E+EN G N+
Subjt:  -----NNKKRRLSKLNDLLEDSSSELEKENNGRNI

AT5G45710.1 winged-helix DNA-binding transcription factor family protein2.1e-5940.65Show/hide
Query:  MDGSEGSSGAPPPFLTKTYEMVDDPLTNSIVSWSQSGFSFVVWNPPEFAKDLLPIYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKSI
        MD + G S + PPFLTKTYEMVDD  ++S+V+WS++  SF+V NP EF++DLLP +FKH NFSSF+RQLNTYGFRK+D ++WEF N+ F+RGR +L+K+I
Subjt:  MDGSEGSSGAPPPFLTKTYEMVDDPLTNSIVSWSQSGFSFVVWNPPEFAKDLLPIYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKSI

Query:  HRRKPIYSHSHSHSQPTPSHGVGAPLSEPEREELEQQIKTLNQEKTSLQSQLQKHEREKEEIGRQINSLYQQLWRMGTQQKQLITMLAAQLQKQQPGFAE
        HRRKP++SHS  + Q         PL+E ER  +E QI+ L  EK  L ++LQ  E+E++E   Q+ +L  +L  M   QK ++  ++  L K  PG + 
Subjt:  HRRKPIYSHSHSHSQPTPSHGVGAPLSEPEREELEQQIKTLNQEKTSLQSQLQKHEREKEEIGRQINSLYQQLWRMGTQQKQLITMLAAQLQKQQPGFAE

Query:  ETTHNNKKRRLSKLNDLLEDSSSELEKENNGRNIIIPPLELMGKLESSLGFCEDLLCNVTQVLGEESKACAVITKKETRGVGDVKVVASKEGEVRREHVL
           ++ +++R  + N L   SS                +E + KLESSL F E+L+    +  G +S +      + +  +GD +  +SK        V 
Subjt:  ETTHNNKKRRLSKLNDLLEDSSSELEKENNGRNIIIPPLELMGKLESSLGFCEDLLCNVTQVLGEESKACAVITKKETRGVGDVKVVASKEGEVRREHVL

Query:  ------RNGANDMFWEQFLTEIPGSSNAGEVCLNRRN
              + G ND FWEQ LTE PGS+   EV   RR+
Subjt:  ------RNGANDMFWEQFLTEIPGSSNAGEVCLNRRN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGGTTCAGAGGGTAGCTCCGGCGCACCGCCGCCATTTCTGACAAAAACATATGAGATGGTGGATGATCCATTGACCAACTCCATCGTATCGTGGAGTCAAAGCGG
TTTCAGCTTCGTCGTTTGGAACCCACCGGAATTTGCCAAAGACTTGCTCCCAATATATTTCAAACACAACAATTTCTCCAGCTTCGTTCGCCAGCTTAACACTTATGGGT
TCAGGAAGATCGATCGAGACCAATGGGAGTTCGCGAACGAGGGATTTATACGAGGACGAACCCATCTTCTAAAGAGCATCCACCGCCGCAAACCAATCTACAGCCATAGC
CACAGCCATAGCCAGCCGACGCCGAGCCATGGCGTCGGAGCTCCATTGTCGGAACCAGAAAGGGAAGAGCTGGAGCAGCAAATCAAGACCCTGAATCAAGAAAAGACTTC
ACTTCAATCACAGCTGCAGAAGCACGAAAGAGAAAAAGAAGAAATTGGGCGTCAGATTAACTCGCTGTATCAGCAGCTATGGAGAATGGGGACTCAACAGAAGCAGCTCA
TCACCATGTTGGCCGCTCAGTTGCAGAAGCAGCAGCCTGGTTTTGCTGAAGAGACGACTCATAACAACAAAAAGAGAAGATTGTCGAAACTCAACGACTTGCTTGAAGAC
TCATCGTCAGAGTTGGAGAAAGAAAATAATGGTCGGAACATTATTATTCCGCCATTGGAGCTAATGGGGAAGCTGGAATCGTCATTGGGGTTCTGCGAGGACTTGCTCTG
CAACGTAACGCAGGTGCTGGGAGAAGAATCGAAGGCGTGTGCAGTTATTACGAAGAAGGAGACGAGAGGCGTCGGAGATGTTAAGGTTGTGGCGTCGAAAGAAGGGGAAG
TGAGAAGGGAACATGTTTTGAGAAATGGAGCGAATGATATGTTCTGGGAACAGTTTCTGACGGAGATTCCAGGCTCCTCAAATGCAGGAGAAGTCTGTTTGAATAGAAGG
AACAATGTTGTAACTCTGGAGCTCATTCCGAAAAGTTCTATACATTTGAGACTTATGATGGAGAGAAGAGCCTCTAATTCAGGTGTAGTGCGTAGCTCAAAGAGACTGAA
GAGTGAAGAAGAGCAAGTTGATAGGATTAGTCAACTGCCAGAGTCCATTCTTGTTCACATCCTTTCATTTTTACCAACTAAGGATGTCATAAAGCTGGAACGTGTTCCGA
GATTCATGCACGTATGGACTTACATTCGCAATCTCTCCTTTGATCAGTGTCAGTTTCATAACCATAGAACTTATAGAGGTGGTTATTACGATGGTACTCAACATGATGAT
CGGTTCCTGAGATTGATTCGCTATGTGTTGATGCTTCATGAAAGTGAAACCATTGATGAATTCCATCTCAAATTTGGGTTCAATTTGTTATGTGTGACCTATAATGAACT
TGATGGTTCTGATTACGCTTCTATAGAGAAGAATATGGCACTTGAACTATCTACGTGGATTATGTTTGTTGCTAGGAAACGAGTAAAAGTTCTTGACCTTAATTTGTTGG
GATGTACGCTAGAGGAAGAAGTTCGTTATAAACTGCCTAATATTGTTTTAACTAGTAATGATCTCATGGAGCTCATGTTGGCTGGTTGTGGAATCAAGGAGTATGGGCAG
ATCAAACTGACGTCACTCAGAAAGTTATCTCTTAAGGAGATACTATTGAGTGAGAATATTATGAGTAAAATTCTCAAAGGATGCCCACTGCTTGAAGAACTTTCTCTCAT
CAGGTGTTCCGGTGTTCATAAGTTGAAGTTGACTAATCCTCGTACTAAAAGATTGAGTATTTCCATTGGATGGAGATATGAGATGATGTACTCGCAATTGGAGATTATCT
GCCCTCATATCACGTCACTAGAAATCTCCGGGCCGATACAGCATGTGGATTTAAATGCTTCGTCTTCTGTCGTTGATGCTTCCCTTTACTACAAAAGCCGCTTTTTCAGA
TTTGCACGAAAGAGATATAACAGACTGCGAACGCTTCTCTGGCAGTTCACCCAATCTGGTGCTCTTGTGTTGTGCAGTTCTACTATTTTGGTACAGCTTTCTCTCCTTTT
ATCTCTTGTTGTAAAGGCAAACACACATGTCGTAATTTATGCTGTTTTGATAGTAAATCTCATGGTCTTTACCATATGGGCAATGACAAACATGCAATCGCCTCTCTTCA
GTTCGCGGTCTCTCGTGCTCAGGCTTCGATTTACAAAATGGCATCTCCCTGGAATATGTAACATACTCAGAAATTCAGCTTTTTTGGAGACGCTGACCTTTTGTATTTAC
CCTGGATCACACTCTATATTTTCGCACAATGAAGCGTCGTGGATAATGGAGCACAATTTCGATGCTCGAGATTACTGGAAGTCTCAGGAACCCGGTTTCCCTTCGGTTAT
GCTCCACCTCAAGACTGTAAAGATCTATGGTTACATAACAGAGCCATTGGTGTTTGAACTGATAGAATTTCTACTGAAGAAAGCATTGGTTTTGGAAAAAATGGTTGTTT
CTACCAAGAAAATTCTCCCATTCGAGGATCAAGATGATCGTACTGCTGCGAATGTCGTGTCCGAAATCAATCGTCTCAATTCTGATAAACTGTTTGAACTTTCTCAGAAA
CTGTTGAGTCTGCCAAGGGCCTCCACCTCTGCAGTAATCCAATTCTATCCAAATGACTCAAATTCGAAGTATGTCAATTCAATAAACAACTTCGAAGAAAAATGTAATTT
TGTTTTGCTAGAGCCGAGCGCTAAGATTCTTATGGAAGGAACTAACGACCCAACAGTGATGCGTAGCTCAAAGAGGCTGAAAGTTGAAGAGGGTCAGAATGATTTGGTTA
GTCAGCTTCCGGAGTCTGTTCTTGTCGACATTCTTTCATATTTACCAACAAAGGATACTGTAAAGATGGCATTGGTTCCAAGATTCAGGCATCTCTGGACCTTTATACGT
AGCCTCTCTTTTGACCAGTGTGCATATCATGACCACGATAGTTATGACAGTGCCATTTATGATGGTCCAAACTATGATCAGAGGTTCCTAAATCTGATCCGCTATGTCTT
GATACTTCATGAACGTACTACAATTGATGAGTTCCATCTCAAATTTGCTTTCAATTTGTTTAATGTTATTCATGATGATCCTGATAATTCTAACTATGCATTTAAAGAGA
AACGAATGGCAACTGAGCTTACTACATGGATTAAATTTGCTGTGAGGAAACAAGTTAAGGTTCTTGATCTTGATTTATTGGGATGTGGTCTGCCAGAGCCAGAAGTTAGT
TATGAGTTGCCTAATTGTGTTTTAACCAGTAACTGTCTTATGGAACTCAGATTGGCTGGCTGTGGAATCAAGGAAAAGGGGCGCATTCAGCTGAAGTCACTCACTAAGCT
TTCCCTCAAGGAAATAATGTTAAGTGATAAGATTATGGGTGAAATCTTTGAAGGATGCCCAATGCTTGAAGAACTTTCTCTGAATGGATGTAGTGGTCTTCGTAAGCTGC
AGTTGACTAATACTAATATTAAGAGGTTGAATATTTTCATTGGATGGAGAGATGAGGTGGCAAATTCGAGGTTGGAGATAAGCTGCCCTAGTGTTAAATCATTAGAAATT
GCTGGGTCAATACAGCTTGCATATTTAAAGGGTTCATCTTCTGTTTTCGATGCCTCCCTTTGTTTTAGCCGCTCCTTCAAGTGTGAACGCAGGATATATGAAAAGGTCCA
AATGCTGCTGTGGAAACTAGCTGAAGCCAATGTTTTCATACCATGCACATGGACTACTTTGGTTTTCACAATATGGGAGTTGACATATGTGCCAACTCCTGTTACTGGTT
GGAAGTCTGTAGAGTTCAGACTTCTTTTCAGAAAGTGGCACATCCCTGGAATATGCAGCATTCTCAAGAGGTCACATTGGTTGGAGACACTGACTTTTTACATCTACCCT
GGATTGAACTCTATGTTCACGAATGAGGAAGCAAAATGGATGGAGGCATATGATTTTGATGGTAAGAATTACTGGAAGTCGCAGAAAGGCAGTTTACTTTGCCTCACAAA
GCACCTCAAGACTGTTATGATATATGGATACGTGACTGAGCCACATGTGTTTGAGCTTATAGAATTTCTACTGAAGAATGCCTTGGTCTTGGAGAAAATGGTCATATCCA
CAAAGAGGACCCTCTGGCCCATTCATCAATATGAATTTTTCAAAGATGCCGTTGAGAATCAGAAGGATCATTTCACTTACCAGGAACTACTTGAGTTTTCTCAGAAATTG
TTAAGTCTTCCAAGAGCCTCCAGGTCTGCTGTAATTCAATTCTGCTACTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATGGTTCAGAGGGTAGCTCCGGCGCACCGCCGCCATTTCTGACAAAAACATATGAGATGGTGGATGATCCATTGACCAACTCCATCGTATCGTGGAGTCAAAGCGG
TTTCAGCTTCGTCGTTTGGAACCCACCGGAATTTGCCAAAGACTTGCTCCCAATATATTTCAAACACAACAATTTCTCCAGCTTCGTTCGCCAGCTTAACACTTATGGGT
TCAGGAAGATCGATCGAGACCAATGGGAGTTCGCGAACGAGGGATTTATACGAGGACGAACCCATCTTCTAAAGAGCATCCACCGCCGCAAACCAATCTACAGCCATAGC
CACAGCCATAGCCAGCCGACGCCGAGCCATGGCGTCGGAGCTCCATTGTCGGAACCAGAAAGGGAAGAGCTGGAGCAGCAAATCAAGACCCTGAATCAAGAAAAGACTTC
ACTTCAATCACAGCTGCAGAAGCACGAAAGAGAAAAAGAAGAAATTGGGCGTCAGATTAACTCGCTGTATCAGCAGCTATGGAGAATGGGGACTCAACAGAAGCAGCTCA
TCACCATGTTGGCCGCTCAGTTGCAGAAGCAGCAGCCTGGTTTTGCTGAAGAGACGACTCATAACAACAAAAAGAGAAGATTGTCGAAACTCAACGACTTGCTTGAAGAC
TCATCGTCAGAGTTGGAGAAAGAAAATAATGGTCGGAACATTATTATTCCGCCATTGGAGCTAATGGGGAAGCTGGAATCGTCATTGGGGTTCTGCGAGGACTTGCTCTG
CAACGTAACGCAGGTGCTGGGAGAAGAATCGAAGGCGTGTGCAGTTATTACGAAGAAGGAGACGAGAGGCGTCGGAGATGTTAAGGTTGTGGCGTCGAAAGAAGGGGAAG
TGAGAAGGGAACATGTTTTGAGAAATGGAGCGAATGATATGTTCTGGGAACAGTTTCTGACGGAGATTCCAGGCTCCTCAAATGCAGGAGAAGTCTGTTTGAATAGAAGG
AACAATGTTGTAACTCTGGAGCTCATTCCGAAAAGTTCTATACATTTGAGACTTATGATGGAGAGAAGAGCCTCTAATTCAGGTGTAGTGCGTAGCTCAAAGAGACTGAA
GAGTGAAGAAGAGCAAGTTGATAGGATTAGTCAACTGCCAGAGTCCATTCTTGTTCACATCCTTTCATTTTTACCAACTAAGGATGTCATAAAGCTGGAACGTGTTCCGA
GATTCATGCACGTATGGACTTACATTCGCAATCTCTCCTTTGATCAGTGTCAGTTTCATAACCATAGAACTTATAGAGGTGGTTATTACGATGGTACTCAACATGATGAT
CGGTTCCTGAGATTGATTCGCTATGTGTTGATGCTTCATGAAAGTGAAACCATTGATGAATTCCATCTCAAATTTGGGTTCAATTTGTTATGTGTGACCTATAATGAACT
TGATGGTTCTGATTACGCTTCTATAGAGAAGAATATGGCACTTGAACTATCTACGTGGATTATGTTTGTTGCTAGGAAACGAGTAAAAGTTCTTGACCTTAATTTGTTGG
GATGTACGCTAGAGGAAGAAGTTCGTTATAAACTGCCTAATATTGTTTTAACTAGTAATGATCTCATGGAGCTCATGTTGGCTGGTTGTGGAATCAAGGAGTATGGGCAG
ATCAAACTGACGTCACTCAGAAAGTTATCTCTTAAGGAGATACTATTGAGTGAGAATATTATGAGTAAAATTCTCAAAGGATGCCCACTGCTTGAAGAACTTTCTCTCAT
CAGGTGTTCCGGTGTTCATAAGTTGAAGTTGACTAATCCTCGTACTAAAAGATTGAGTATTTCCATTGGATGGAGATATGAGATGATGTACTCGCAATTGGAGATTATCT
GCCCTCATATCACGTCACTAGAAATCTCCGGGCCGATACAGCATGTGGATTTAAATGCTTCGTCTTCTGTCGTTGATGCTTCCCTTTACTACAAAAGCCGCTTTTTCAGA
TTTGCACGAAAGAGATATAACAGACTGCGAACGCTTCTCTGGCAGTTCACCCAATCTGGTGCTCTTGTGTTGTGCAGTTCTACTATTTTGGTACAGCTTTCTCTCCTTTT
ATCTCTTGTTGTAAAGGCAAACACACATGTCGTAATTTATGCTGTTTTGATAGTAAATCTCATGGTCTTTACCATATGGGCAATGACAAACATGCAATCGCCTCTCTTCA
GTTCGCGGTCTCTCGTGCTCAGGCTTCGATTTACAAAATGGCATCTCCCTGGAATATGTAACATACTCAGAAATTCAGCTTTTTTGGAGACGCTGACCTTTTGTATTTAC
CCTGGATCACACTCTATATTTTCGCACAATGAAGCGTCGTGGATAATGGAGCACAATTTCGATGCTCGAGATTACTGGAAGTCTCAGGAACCCGGTTTCCCTTCGGTTAT
GCTCCACCTCAAGACTGTAAAGATCTATGGTTACATAACAGAGCCATTGGTGTTTGAACTGATAGAATTTCTACTGAAGAAAGCATTGGTTTTGGAAAAAATGGTTGTTT
CTACCAAGAAAATTCTCCCATTCGAGGATCAAGATGATCGTACTGCTGCGAATGTCGTGTCCGAAATCAATCGTCTCAATTCTGATAAACTGTTTGAACTTTCTCAGAAA
CTGTTGAGTCTGCCAAGGGCCTCCACCTCTGCAGTAATCCAATTCTATCCAAATGACTCAAATTCGAAGTATGTCAATTCAATAAACAACTTCGAAGAAAAATGTAATTT
TGTTTTGCTAGAGCCGAGCGCTAAGATTCTTATGGAAGGAACTAACGACCCAACAGTGATGCGTAGCTCAAAGAGGCTGAAAGTTGAAGAGGGTCAGAATGATTTGGTTA
GTCAGCTTCCGGAGTCTGTTCTTGTCGACATTCTTTCATATTTACCAACAAAGGATACTGTAAAGATGGCATTGGTTCCAAGATTCAGGCATCTCTGGACCTTTATACGT
AGCCTCTCTTTTGACCAGTGTGCATATCATGACCACGATAGTTATGACAGTGCCATTTATGATGGTCCAAACTATGATCAGAGGTTCCTAAATCTGATCCGCTATGTCTT
GATACTTCATGAACGTACTACAATTGATGAGTTCCATCTCAAATTTGCTTTCAATTTGTTTAATGTTATTCATGATGATCCTGATAATTCTAACTATGCATTTAAAGAGA
AACGAATGGCAACTGAGCTTACTACATGGATTAAATTTGCTGTGAGGAAACAAGTTAAGGTTCTTGATCTTGATTTATTGGGATGTGGTCTGCCAGAGCCAGAAGTTAGT
TATGAGTTGCCTAATTGTGTTTTAACCAGTAACTGTCTTATGGAACTCAGATTGGCTGGCTGTGGAATCAAGGAAAAGGGGCGCATTCAGCTGAAGTCACTCACTAAGCT
TTCCCTCAAGGAAATAATGTTAAGTGATAAGATTATGGGTGAAATCTTTGAAGGATGCCCAATGCTTGAAGAACTTTCTCTGAATGGATGTAGTGGTCTTCGTAAGCTGC
AGTTGACTAATACTAATATTAAGAGGTTGAATATTTTCATTGGATGGAGAGATGAGGTGGCAAATTCGAGGTTGGAGATAAGCTGCCCTAGTGTTAAATCATTAGAAATT
GCTGGGTCAATACAGCTTGCATATTTAAAGGGTTCATCTTCTGTTTTCGATGCCTCCCTTTGTTTTAGCCGCTCCTTCAAGTGTGAACGCAGGATATATGAAAAGGTCCA
AATGCTGCTGTGGAAACTAGCTGAAGCCAATGTTTTCATACCATGCACATGGACTACTTTGGTTTTCACAATATGGGAGTTGACATATGTGCCAACTCCTGTTACTGGTT
GGAAGTCTGTAGAGTTCAGACTTCTTTTCAGAAAGTGGCACATCCCTGGAATATGCAGCATTCTCAAGAGGTCACATTGGTTGGAGACACTGACTTTTTACATCTACCCT
GGATTGAACTCTATGTTCACGAATGAGGAAGCAAAATGGATGGAGGCATATGATTTTGATGGTAAGAATTACTGGAAGTCGCAGAAAGGCAGTTTACTTTGCCTCACAAA
GCACCTCAAGACTGTTATGATATATGGATACGTGACTGAGCCACATGTGTTTGAGCTTATAGAATTTCTACTGAAGAATGCCTTGGTCTTGGAGAAAATGGTCATATCCA
CAAAGAGGACCCTCTGGCCCATTCATCAATATGAATTTTTCAAAGATGCCGTTGAGAATCAGAAGGATCATTTCACTTACCAGGAACTACTTGAGTTTTCTCAGAAATTG
TTAAGTCTTCCAAGAGCCTCCAGGTCTGCTGTAATTCAATTCTGCTACTGA
Protein sequenceShow/hide protein sequence
MDGSEGSSGAPPPFLTKTYEMVDDPLTNSIVSWSQSGFSFVVWNPPEFAKDLLPIYFKHNNFSSFVRQLNTYGFRKIDRDQWEFANEGFIRGRTHLLKSIHRRKPIYSHS
HSHSQPTPSHGVGAPLSEPEREELEQQIKTLNQEKTSLQSQLQKHEREKEEIGRQINSLYQQLWRMGTQQKQLITMLAAQLQKQQPGFAEETTHNNKKRRLSKLNDLLED
SSSELEKENNGRNIIIPPLELMGKLESSLGFCEDLLCNVTQVLGEESKACAVITKKETRGVGDVKVVASKEGEVRREHVLRNGANDMFWEQFLTEIPGSSNAGEVCLNRR
NNVVTLELIPKSSIHLRLMMERRASNSGVVRSSKRLKSEEEQVDRISQLPESILVHILSFLPTKDVIKLERVPRFMHVWTYIRNLSFDQCQFHNHRTYRGGYYDGTQHDD
RFLRLIRYVLMLHESETIDEFHLKFGFNLLCVTYNELDGSDYASIEKNMALELSTWIMFVARKRVKVLDLNLLGCTLEEEVRYKLPNIVLTSNDLMELMLAGCGIKEYGQ
IKLTSLRKLSLKEILLSENIMSKILKGCPLLEELSLIRCSGVHKLKLTNPRTKRLSISIGWRYEMMYSQLEIICPHITSLEISGPIQHVDLNASSSVVDASLYYKSRFFR
FARKRYNRLRTLLWQFTQSGALVLCSSTILVQLSLLLSLVVKANTHVVIYAVLIVNLMVFTIWAMTNMQSPLFSSRSLVLRLRFTKWHLPGICNILRNSAFLETLTFCIY
PGSHSIFSHNEASWIMEHNFDARDYWKSQEPGFPSVMLHLKTVKIYGYITEPLVFELIEFLLKKALVLEKMVVSTKKILPFEDQDDRTAANVVSEINRLNSDKLFELSQK
LLSLPRASTSAVIQFYPNDSNSKYVNSINNFEEKCNFVLLEPSAKILMEGTNDPTVMRSSKRLKVEEGQNDLVSQLPESVLVDILSYLPTKDTVKMALVPRFRHLWTFIR
SLSFDQCAYHDHDSYDSAIYDGPNYDQRFLNLIRYVLILHERTTIDEFHLKFAFNLFNVIHDDPDNSNYAFKEKRMATELTTWIKFAVRKQVKVLDLDLLGCGLPEPEVS
YELPNCVLTSNCLMELRLAGCGIKEKGRIQLKSLTKLSLKEIMLSDKIMGEIFEGCPMLEELSLNGCSGLRKLQLTNTNIKRLNIFIGWRDEVANSRLEISCPSVKSLEI
AGSIQLAYLKGSSSVFDASLCFSRSFKCERRIYEKVQMLLWKLAEANVFIPCTWTTLVFTIWELTYVPTPVTGWKSVEFRLLFRKWHIPGICSILKRSHWLETLTFYIYP
GLNSMFTNEEAKWMEAYDFDGKNYWKSQKGSLLCLTKHLKTVMIYGYVTEPHVFELIEFLLKNALVLEKMVISTKRTLWPIHQYEFFKDAVENQKDHFTYQELLEFSQKL
LSLPRASRSAVIQFCY