; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0014196 (gene) of Chayote v1 genome

Gene IDSed0014196
OrganismSechium edule (Chayote v1)
Descriptionprotein CHUP1, chloroplastic-like
Genome locationLG03:7668925..7673059
RNA-Seq ExpressionSed0014196
SyntenySed0014196
Gene Ontology termsNA
InterPro domainsIPR040265 - Protein CHUP1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008467205.1 PREDICTED: protein CHUP1, chloroplastic-like [Cucumis melo]8.6e-19956.98Show/hide
Query:  IKMMNRLGVLVAVSAAAYAIRQLPIRSWSSLILPTNCSENGEDTENNVRRHGRIVRGL---AEEEEEANSVNESTSQVTGRTSDHNAEMTLNYVAPRIPS
        ++MMNR+ V+VAVS AAYAI+QL IRSW+S  LPTNCSENGED + N         GL    EEEEEA+S+N++TSQV GRTSD                
Subjt:  IKMMNRLGVLVAVSAAAYAIRQLPIRSWSSLILPTNCSENGEDTENNVRRHGRIVRGL---AEEEEEANSVNESTSQVTGRTSDHNAEMTLNYVAPRIPS

Query:  QSTLILPTNCSGGENSSAELRELLTQ-ESENPLLDDNK---------IENSKIELDRLLKLVMELELRKVKLEGELLMCDEIKHDKTNVAELLKQLEAKN
                    G++SS EL+ LL Q  SEN LL   K          E++KIE +RLLKLVMELE RKVKLEGELLMCD IK+ +T+V EL KQL+AKN
Subjt:  QSTLILPTNCSGGENSSAELRELLTQ-ESENPLLDDNK---------IENSKIELDRLLKLVMELELRKVKLEGELLMCDEIKHDKTNVAELLKQLEAKN

Query:  DDINTLNITIKSLQAERKKLQEEILKGASMKTELEKARDKIKELQKQIQLDANKTKEHLLLLKQRVSSLQAKEEEGVKKEAELYKRLKAAKDFEIELETL
        +DI+ LN TI SLQAERK L+EEILKGA MK ELE+ARDKIKELQ+QIQLDAN+TKE LLLLKQRVS+LQAKEEE VKKEAEL+K+ KAAKDFE+EL  L
Subjt:  DDINTLNITIKSLQAERKKLQEEILKGASMKTELEKARDKIKELQKQIQLDANKTKEHLLLLKQRVSSLQAKEEEGVKKEAELYKRLKAAKDFEIELETL

Query:  KRKNRELRHERRELTSKLDVMKAQVTTL---RESEIITKETEEAQNLRSENENLIKQLEELQMNRFSEVEKLVYLRWINACLRYELRDNETSIGESARFL
        K KNREL+HE++ELTSKL+VMKA++ TL    ESEIITKE EEAQ L+SENE+LIKQLE LQMNRFSEVE+LVYLRWINACLRYELR+N+   GESAR+L
Subjt:  KRKNRELRHERRELTSKLDVMKAQVTTL---RESEIITKETEEAQNLRSENENLIKQLEELQMNRFSEVEKLVYLRWINACLRYELRDNETSIGESARFL

Query:  NKSLSPKSKEKAKQLMLEYAELDFGQAETDHESNFSHPISSGIEDFDKTSIDSSMSRTSSFSEKPNSNLSLKKFIRNKGGSSGVSSQRITGSSYRGENPL
        NKS SPKS+EKAKQLMLEYA ++FGQ ETDHESNFSHP S GI++ + TSIDSS SRTSSFSEKPNSNLSLKK IRN+GG S VS   I+GSS+R ++PL
Subjt:  NKSLSPKSKEKAKQLMLEYAELDFGQAETDHESNFSHPISSGIEDFDKTSIDSSMSRTSSFSEKPNSNLSLKKFIRNKGGSSGVSSQRITGSSYRGENPL

Query:  EEVMAVSAAETSTLS-------------------------------------------------------------------------------------
        E VMA+S AET TLS                                                                                     
Subjt:  EEVMAVSAAETSTLS-------------------------------------------------------------------------------------

Query:  -----------------------------------------------------EVKADMETHID-MIMPLAKEVREASFSNMDDVVSFVIRLDEKLSSLV
                                                             EVKA MET  D +++ LA EVREA FSNM+D+VSFVIRLDEKLSSLV
Subjt:  -----------------------------------------------------EVKADMETHID-MIMPLAKEVREASFSNMDDVVSFVIRLDEKLSSLV

Query:  DEKEILEHVDWPDVKAYALKEAAFEYQNLMKLGEEVSSFVDSPKLTCEVALKKMHSLLDKVEQSVHEVVSTRDATISRYEELGIPIDWLLDSGVVAKMKV
        D  EILEH DWP  K  AL+EAAF YQ LMKL EEVSSFVD+PKLTCEVAL KM+SLLDKVEQSV+ ++ TRD  ISRYEELGIPIDWLLD GVV K+KV
Subjt:  DEKEILEHVDWPDVKAYALKEAAFEYQNLMKLGEEVSSFVDSPKLTCEVALKKMHSLLDKVEQSVHEVVSTRDATISRYEELGIPIDWLLDSGVVAKMKV

Query:  MC
        +C
Subjt:  MC

XP_011655490.1 protein CHUP1, chloroplastic isoform X1 [Cucumis sativus]4.1e-20157.38Show/hide
Query:  MNRLGVLVAVSAAAYAIRQLPIRSWSSLILP-TNCSENGEDTENNVRR-HGRIVRGL-AEEEEEANSVNESTSQVTGRTSDHNAEMTLNYVAPRIPSQST
        MNR+ V+VAVS AAYAI+QL IRSW+S  LP TNCSENGED + NV++ H +I+RGL  EEEEEANS++++TSQV GRTSD                   
Subjt:  MNRLGVLVAVSAAAYAIRQLPIRSWSSLILP-TNCSENGEDTENNVRR-HGRIVRGL-AEEEEEANSVNESTSQVTGRTSDHNAEMTLNYVAPRIPSQST

Query:  LILPTNCSGGENSSAELRELL--TQESENPLLDDNK---------IENSKIELDRLLKLVMELELRKVKLEGELLMCDEIKHDKTNVAELLKQLEAKNDD
                 G++SS E + LL   + SEN LLDDN+         IENSKIEL+RLLKL+MELE RKVKLEGEL+MCD IK+ +T+V EL KQL+AKNDD
Subjt:  LILPTNCSGGENSSAELRELL--TQESENPLLDDNK---------IENSKIELDRLLKLVMELELRKVKLEGELLMCDEIKHDKTNVAELLKQLEAKNDD

Query:  INTLNITIKSLQAERKKLQEEILKGASMKTELEKARDKIKELQKQIQLDANKTKEHLLLLKQRVSSLQAKEEEGVKKEAELYKRLKAAKDFEIELETLKR
        I+ LN TI SLQAERK L+EEILKGA MK ELE+ R KIKELQ+QIQLDAN+TKE LLLLKQRVS+LQAKEEE VKKEAELYK+ KAAKDFE+E   LK 
Subjt:  INTLNITIKSLQAERKKLQEEILKGASMKTELEKARDKIKELQKQIQLDANKTKEHLLLLKQRVSSLQAKEEEGVKKEAELYKRLKAAKDFEIELETLKR

Query:  KNRELRHERRELTSKLDVMKAQVTTL---RESEIITKETEEAQNLRSENENLIKQLEELQMNRFSEVEKLVYLRWINACLRYELRDNETSIGESARFLNK
        KNREL+HE +ELTSKL+VMKA++ TL    E+EIITKE EEAQ L+SENE+LIKQLE LQMNRFSEVE+LVYLRWINACLRYELR+N+   GESAR+LNK
Subjt:  KNRELRHERRELTSKLDVMKAQVTTL---RESEIITKETEEAQNLRSENENLIKQLEELQMNRFSEVEKLVYLRWINACLRYELRDNETSIGESARFLNK

Query:  SLSPKSKEKAKQLMLEYAELDFGQAETDHESNFSHPISSGIEDFDKTSIDSSMSRTSSFSEKPNSNLSLKKFIRNKGGSSGVSSQRITGSSYRGENPLEE
        S SPKSKEKAKQLMLEYA  + G+AETDHESNFSHP SS I++ + TSIDSS SRTSSF EKPNSNLSLKK IRN+GGSS VS      SS+R ++PLE 
Subjt:  SLSPKSKEKAKQLMLEYAELDFGQAETDHESNFSHPISSGIEDFDKTSIDSSMSRTSSFSEKPNSNLSLKKFIRNKGGSSGVSSQRITGSSYRGENPLEE

Query:  VMAVSAAETSTLS---------------------------------------------------------------------------------------
        VMA+S AET TLS                                                                                       
Subjt:  VMAVSAAETSTLS---------------------------------------------------------------------------------------

Query:  ---------------------------------------------------EVKADMETHID-MIMPLAKEVREASFSNMDDVVSFVIRLDEKLSSLVDE
                                                           EVK  MET  D ++M LA EVREASFSNM+D+VSFVI LDEKLSSLVD 
Subjt:  ---------------------------------------------------EVKADMETHID-MIMPLAKEVREASFSNMDDVVSFVIRLDEKLSSLVDE

Query:  KEILEHVDWPDVKAYALKEAAFEYQNLMKLGEEVSSFVDSPKLTCEVALKKMHSLLDKVEQSVHEVVSTRDATISRYEELGIPIDWLLDSGVVAKMKVMC
         EILEH DWP  K  AL+EAAF YQ LMKL EEVSSFVD+PKLTCEVAL KM+SLLDKVEQSV+ ++ TRD TISRYEELGIPIDWLLD GVV K+KV+C
Subjt:  KEILEHVDWPDVKAYALKEAAFEYQNLMKLGEEVSSFVDSPKLTCEVALKKMHSLLDKVEQSVHEVVSTRDATISRYEELGIPIDWLLDSGVVAKMKVMC

XP_031741050.1 protein CHUP1, chloroplastic isoform X2 [Cucumis sativus]4.1e-20157.38Show/hide
Query:  MNRLGVLVAVSAAAYAIRQLPIRSWSSLILP-TNCSENGEDTENNVRR-HGRIVRGL-AEEEEEANSVNESTSQVTGRTSDHNAEMTLNYVAPRIPSQST
        MNR+ V+VAVS AAYAI+QL IRSW+S  LP TNCSENGED + NV++ H +I+RGL  EEEEEANS++++TSQV GRTSD                   
Subjt:  MNRLGVLVAVSAAAYAIRQLPIRSWSSLILP-TNCSENGEDTENNVRR-HGRIVRGL-AEEEEEANSVNESTSQVTGRTSDHNAEMTLNYVAPRIPSQST

Query:  LILPTNCSGGENSSAELRELL--TQESENPLLDDNK---------IENSKIELDRLLKLVMELELRKVKLEGELLMCDEIKHDKTNVAELLKQLEAKNDD
                 G++SS E + LL   + SEN LLDDN+         IENSKIEL+RLLKL+MELE RKVKLEGEL+MCD IK+ +T+V EL KQL+AKNDD
Subjt:  LILPTNCSGGENSSAELRELL--TQESENPLLDDNK---------IENSKIELDRLLKLVMELELRKVKLEGELLMCDEIKHDKTNVAELLKQLEAKNDD

Query:  INTLNITIKSLQAERKKLQEEILKGASMKTELEKARDKIKELQKQIQLDANKTKEHLLLLKQRVSSLQAKEEEGVKKEAELYKRLKAAKDFEIELETLKR
        I+ LN TI SLQAERK L+EEILKGA MK ELE+ R KIKELQ+QIQLDAN+TKE LLLLKQRVS+LQAKEEE VKKEAELYK+ KAAKDFE+E   LK 
Subjt:  INTLNITIKSLQAERKKLQEEILKGASMKTELEKARDKIKELQKQIQLDANKTKEHLLLLKQRVSSLQAKEEEGVKKEAELYKRLKAAKDFEIELETLKR

Query:  KNRELRHERRELTSKLDVMKAQVTTL---RESEIITKETEEAQNLRSENENLIKQLEELQMNRFSEVEKLVYLRWINACLRYELRDNETSIGESARFLNK
        KNREL+HE +ELTSKL+VMKA++ TL    E+EIITKE EEAQ L+SENE+LIKQLE LQMNRFSEVE+LVYLRWINACLRYELR+N+   GESAR+LNK
Subjt:  KNRELRHERRELTSKLDVMKAQVTTL---RESEIITKETEEAQNLRSENENLIKQLEELQMNRFSEVEKLVYLRWINACLRYELRDNETSIGESARFLNK

Query:  SLSPKSKEKAKQLMLEYAELDFGQAETDHESNFSHPISSGIEDFDKTSIDSSMSRTSSFSEKPNSNLSLKKFIRNKGGSSGVSSQRITGSSYRGENPLEE
        S SPKSKEKAKQLMLEYA  + G+AETDHESNFSHP SS I++ + TSIDSS SRTSSF EKPNSNLSLKK IRN+GGSS VS      SS+R ++PLE 
Subjt:  SLSPKSKEKAKQLMLEYAELDFGQAETDHESNFSHPISSGIEDFDKTSIDSSMSRTSSFSEKPNSNLSLKKFIRNKGGSSGVSSQRITGSSYRGENPLEE

Query:  VMAVSAAETSTLS---------------------------------------------------------------------------------------
        VMA+S AET TLS                                                                                       
Subjt:  VMAVSAAETSTLS---------------------------------------------------------------------------------------

Query:  ---------------------------------------------------EVKADMETHID-MIMPLAKEVREASFSNMDDVVSFVIRLDEKLSSLVDE
                                                           EVK  MET  D ++M LA EVREASFSNM+D+VSFVI LDEKLSSLVD 
Subjt:  ---------------------------------------------------EVKADMETHID-MIMPLAKEVREASFSNMDDVVSFVIRLDEKLSSLVDE

Query:  KEILEHVDWPDVKAYALKEAAFEYQNLMKLGEEVSSFVDSPKLTCEVALKKMHSLLDKVEQSVHEVVSTRDATISRYEELGIPIDWLLDSGVVAKMKVMC
         EILEH DWP  K  AL+EAAF YQ LMKL EEVSSFVD+PKLTCEVAL KM+SLLDKVEQSV+ ++ TRD TISRYEELGIPIDWLLD GVV K+KV+C
Subjt:  KEILEHVDWPDVKAYALKEAAFEYQNLMKLGEEVSSFVDSPKLTCEVALKKMHSLLDKVEQSVHEVVSTRDATISRYEELGIPIDWLLDSGVVAKMKVMC

XP_031741051.1 protein CHUP1, chloroplastic isoform X3 [Cucumis sativus]2.0e-18454.37Show/hide
Query:  MNRLGVLVAVSAAAYAIRQLPIRSWSSLILP-TNCSENGEDTENNVRR-HGRIVRGL-AEEEEEANSVNESTSQVTGRTSDHNAEMTLNYVAPRIPSQST
        MNR+ V+VAVS AAYAI+QL IRSW+S  LP TNCSENGED + NV++ H +I+RGL  EEEEEANS++++TSQV GRTSD                   
Subjt:  MNRLGVLVAVSAAAYAIRQLPIRSWSSLILP-TNCSENGEDTENNVRR-HGRIVRGL-AEEEEEANSVNESTSQVTGRTSDHNAEMTLNYVAPRIPSQST

Query:  LILPTNCSGGENSSAELRELL--TQESENPLLDDNK---------IENSKIELDRLLKLVMELELRKVKLEGELLMCDEIKHDKTNVAELLKQLEAKNDD
                 G++SS E + LL   + SEN LLDDN+         IENSKIEL+RLLKL+MELE RKVKLEGEL+MCD IK+ +T+V EL KQL+AKNDD
Subjt:  LILPTNCSGGENSSAELRELL--TQESENPLLDDNK---------IENSKIELDRLLKLVMELELRKVKLEGELLMCDEIKHDKTNVAELLKQLEAKNDD

Query:  INTLNITIKSLQAERKKLQEEILKGASMKTELEKARDKIKELQKQIQLDANKTKEHLLLLKQRVSSLQAKEEEGVKKEAELYKRLKAAKDFEIELETLKR
        I+ LN TI SLQAERK L+EEILKGA MK ELE+ R KIKELQ+QIQLDAN+TKE LLLLKQRVS+LQAKEEE VKKEAELYK+ KAAKDFE+E   LK 
Subjt:  INTLNITIKSLQAERKKLQEEILKGASMKTELEKARDKIKELQKQIQLDANKTKEHLLLLKQRVSSLQAKEEEGVKKEAELYKRLKAAKDFEIELETLKR

Query:  KNRELRHERRELTSKLDVMKAQVTTL---RESEIITKETEEAQNLRSENENLIKQLEELQMNRFSEVEKLVYLRWINACLRYELRDNETSIGESARFLNK
        KNREL+HE +ELTSKL+VMKA++ TL    E+EIITKE EEAQ L+SENE+LIKQLE LQMNRFSEVE+LVYLRWINACLRYELR+N+   G+       
Subjt:  KNRELRHERRELTSKLDVMKAQVTTL---RESEIITKETEEAQNLRSENENLIKQLEELQMNRFSEVEKLVYLRWINACLRYELRDNETSIGESARFLNK

Query:  SLSPKSKEKAKQLMLEYAELDFGQAETDHESNFSHPISSGIEDFDKTSIDSSMSRTSSFSEKPNSNLSLKKFIRNKGGSSGVSSQRITGSSYRGENPLEE
                            + G+AETDHESNFSHP SS I++ + TSIDSS SRTSSF EKPNSNLSLKK IRN+GGSS VS      SS+R ++PLE 
Subjt:  SLSPKSKEKAKQLMLEYAELDFGQAETDHESNFSHPISSGIEDFDKTSIDSSMSRTSSFSEKPNSNLSLKKFIRNKGGSSGVSSQRITGSSYRGENPLEE

Query:  VMAVSAAETSTLS---------------------------------------------------------------------------------------
        VMA+S AET TLS                                                                                       
Subjt:  VMAVSAAETSTLS---------------------------------------------------------------------------------------

Query:  ---------------------------------------------------EVKADMETHID-MIMPLAKEVREASFSNMDDVVSFVIRLDEKLSSLVDE
                                                           EVK  MET  D ++M LA EVREASFSNM+D+VSFVI LDEKLSSLVD 
Subjt:  ---------------------------------------------------EVKADMETHID-MIMPLAKEVREASFSNMDDVVSFVIRLDEKLSSLVDE

Query:  KEILEHVDWPDVKAYALKEAAFEYQNLMKLGEEVSSFVDSPKLTCEVALKKMHSLLDKVEQSVHEVVSTRDATISRYEELGIPIDWLLDSGVVAKMKVMC
         EILEH DWP  K  AL+EAAF YQ LMKL EEVSSFVD+PKLTCEVAL KM+SLLDKVEQSV+ ++ TRD TISRYEELGIPIDWLLD GVV K+KV+C
Subjt:  KEILEHVDWPDVKAYALKEAAFEYQNLMKLGEEVSSFVDSPKLTCEVALKKMHSLLDKVEQSVHEVVSTRDATISRYEELGIPIDWLLDSGVVAKMKVMC

XP_038906491.1 protein CHUP1, chloroplastic-like isoform X1 [Benincasa hispida]1.3e-20758.67Show/hide
Query:  MMNRLGVLVAVSAAAYAIRQLPIRSWSSLILPTNCSENGEDTENNVRRHGRIVRGLAEEEEEANSVNESTSQVTGRTSDHNAEMTLNYVAPRIPSQSTLI
        MMNRLG+LVAVS  AYAIRQL IRSWSSL  P NCSENGEDT+ N         GL EEEEEANS+N+ TSQV GRTSD                     
Subjt:  MMNRLGVLVAVSAAAYAIRQLPIRSWSSLILPTNCSENGEDTENNVRRHGRIVRGLAEEEEEANSVNESTSQVTGRTSDHNAEMTLNYVAPRIPSQSTLI

Query:  LPTNCSGGENSSAELRELLTQESENPLLDDNK---------IENSKIELDRLLKLVMELELRKVKLEGELLMCDEIKHDKTNVAELLKQLEAKNDDINTL
               G++ S E R LL +ESEN  LDDNK         IEN+KIEL+RL+KLVMELE RK KLEGELLMCD IK+ +T+V EL KQL+AKNDDI+ L
Subjt:  LPTNCSGGENSSAELRELLTQESENPLLDDNK---------IENSKIELDRLLKLVMELELRKVKLEGELLMCDEIKHDKTNVAELLKQLEAKNDDINTL

Query:  NITIKSLQAERKKLQEEILKGASMKTELEKARDKIKELQKQIQLDANKTKEHLLLLKQRVSSLQAKEEEGVKKEAELYKRLKAAKDFEIELETLKRKNRE
        NITI SLQAERK LQEEI+KGA MK ELE AR KIKELQ+QIQLDAN+TKEHLLLLKQRVS+LQAKEEE +KKEAELYK+ KAAKDFE+EL  LKRKNRE
Subjt:  NITIKSLQAERKKLQEEILKGASMKTELEKARDKIKELQKQIQLDANKTKEHLLLLKQRVSSLQAKEEEGVKKEAELYKRLKAAKDFEIELETLKRKNRE

Query:  LRHERRELTSKLDVMKAQVTTL---RESEIITKETEEAQNLRSENENLIKQLEELQMNRFSEVEKLVYLRWINACLRYELRDNETSIGESARFLNKSLSP
        L+HE+ EL SKL+VMKA++ TL    ESEI+TKE EEAQ L+SENE+LIK LE LQMNRF+EVE+LVYLRWINACLRYELRDNE S GESAR+LNKSLSP
Subjt:  LRHERRELTSKLDVMKAQVTTL---RESEIITKETEEAQNLRSENENLIKQLEELQMNRFSEVEKLVYLRWINACLRYELRDNETSIGESARFLNKSLSP

Query:  KSKEKAKQLMLEYAELDFGQAETDHESNFSHPISSGIEDFDKTSIDSSMSRTSSFSEKPNSNLSLKKFIRNKGGSSGVSSQRITGSSYRGENPLEEVMAV
        KSKEKAKQLMLEYA L+ GQ ETDHESNFSHP SSGIED D TSIDSS SRTSSF EKPNSNLSLKK IRN GGSS VSS  I GSS+R ++PLE VMA+
Subjt:  KSKEKAKQLMLEYAELDFGQAETDHESNFSHPISSGIEDFDKTSIDSSMSRTSSFSEKPNSNLSLKKFIRNKGGSSGVSSQRITGSSYRGENPLEEVMAV

Query:  SAAETSTLSEVK----------------------------------------------------------------------------------------
        S AET TLSEV+                                                                                        
Subjt:  SAAETSTLSEVK----------------------------------------------------------------------------------------

Query:  ---------------------------------------------------ADMETHIDMIMPLAKEVREASFSNMDDVVSFVIRLDEKLSSLVDEKEIL
                                                           AD+ET  D+++ LA EVREASFSNM+DVVSF+IRLDEK  SLV+  EIL
Subjt:  ---------------------------------------------------ADMETHIDMIMPLAKEVREASFSNMDDVVSFVIRLDEKLSSLVDEKEIL

Query:  EHVDWPDVKAYALKEAAFEYQNLMKLGEEVSSFVDSPKLTCEVALKKMHSLLDKVEQSVHEVVSTRDATISRYEELGIPIDWLLDSGVVAKMKVMC
        +H DWP  K  AL EAAF YQ LMKL EEVSSFVD+PKLTCEVAL KM+SL+DKVEQSV+ +  TRD TIS+YEELGIPIDWLLD GVV K+KV C
Subjt:  EHVDWPDVKAYALKEAAFEYQNLMKLGEEVSSFVDSPKLTCEVALKKMHSLLDKVEQSVHEVVSTRDATISRYEELGIPIDWLLDSGVVAKMKVMC

TrEMBL top hitse value%identityAlignment
A0A0A0KT25 Uncharacterized protein1.6e-19857.07Show/hide
Query:  MNRLGVLVAVSAAAYAIRQLPIRSWSSLILP-TNCSENGEDTENNVRRHGRIVRGL-AEEEEEANSVNESTSQVTGRTSDHNAEMTLNYVAPRIPSQSTL
        MNR+ V+VAVS AAYAI+QL IRSW+S  LP TNCSENGED + NV+      +GL  EEEEEANS++++TSQV GRTSD                    
Subjt:  MNRLGVLVAVSAAAYAIRQLPIRSWSSLILP-TNCSENGEDTENNVRRHGRIVRGL-AEEEEEANSVNESTSQVTGRTSDHNAEMTLNYVAPRIPSQSTL

Query:  ILPTNCSGGENSSAELRELL--TQESENPLLDDNK---------IENSKIELDRLLKLVMELELRKVKLEGELLMCDEIKHDKTNVAELLKQLEAKNDDI
                G++SS E + LL   + SEN LLDDN+         IENSKIEL+RLLKL+MELE RKVKLEGEL+MCD IK+ +T+V EL KQL+AKNDDI
Subjt:  ILPTNCSGGENSSAELRELL--TQESENPLLDDNK---------IENSKIELDRLLKLVMELELRKVKLEGELLMCDEIKHDKTNVAELLKQLEAKNDDI

Query:  NTLNITIKSLQAERKKLQEEILKGASMKTELEKARDKIKELQKQIQLDANKTKEHLLLLKQRVSSLQAKEEEGVKKEAELYKRLKAAKDFEIELETLKRK
        + LN TI SLQAERK L+EEILKGA MK ELE+ R KIKELQ+QIQLDAN+TKE LLLLKQRVS+LQAKEEE VKKEAELYK+ KAAKDFE+E   LK K
Subjt:  NTLNITIKSLQAERKKLQEEILKGASMKTELEKARDKIKELQKQIQLDANKTKEHLLLLKQRVSSLQAKEEEGVKKEAELYKRLKAAKDFEIELETLKRK

Query:  NRELRHERRELTSKLDVMKAQVTTL---RESEIITKETEEAQNLRSENENLIKQLEELQMNRFSEVEKLVYLRWINACLRYELRDNETSIGESARFLNKS
        NREL+HE +ELTSKL+VMKA++ TL    E+EIITKE EEAQ L+SENE+LIKQLE LQMNRFSEVE+LVYLRWINACLRYELR+N+   GESAR+LNKS
Subjt:  NRELRHERRELTSKLDVMKAQVTTL---RESEIITKETEEAQNLRSENENLIKQLEELQMNRFSEVEKLVYLRWINACLRYELRDNETSIGESARFLNKS

Query:  LSPKSKEKAKQLMLEYAELDFGQAETDHESNFSHPISSGIEDFDKTSIDSSMSRTSSFSEKPNSNLSLKKFIRNKGGSSGVSSQRITGSSYRGENPLEEV
         SPKSKEKAKQLMLEYA  + G+AETDHESNFSHP SS I++ + TSIDSS SRTSSF EKPNSNLSLKK IRN+GGSS VS      SS+R ++PLE V
Subjt:  LSPKSKEKAKQLMLEYAELDFGQAETDHESNFSHPISSGIEDFDKTSIDSSMSRTSSFSEKPNSNLSLKKFIRNKGGSSGVSSQRITGSSYRGENPLEEV

Query:  MAVSAAETSTLS----------------------------------------------------------------------------------------
        MA+S AET TLS                                                                                        
Subjt:  MAVSAAETSTLS----------------------------------------------------------------------------------------

Query:  --------------------------------------------------EVKADMETHID-MIMPLAKEVREASFSNMDDVVSFVIRLDEKLSSLVDEK
                                                          EVK  MET  D ++M LA EVREASFSNM+D+VSFVI LDEKLSSLVD  
Subjt:  --------------------------------------------------EVKADMETHID-MIMPLAKEVREASFSNMDDVVSFVIRLDEKLSSLVDEK

Query:  EILEHVDWPDVKAYALKEAAFEYQNLMKLGEEVSSFVDSPKLTCEVALKKMHSLLDKVEQSVHEVVSTRDATISRYEELGIPIDWLLDSGVVAKMKVMC
        EILEH DWP  K  AL+EAAF YQ LMKL EEVSSFVD+PKLTCEVAL KM+SLLDKVEQSV+ ++ TRD TISRYEELGIPIDWLLD GVV K+KV+C
Subjt:  EILEHVDWPDVKAYALKEAAFEYQNLMKLGEEVSSFVDSPKLTCEVALKKMHSLLDKVEQSVHEVVSTRDATISRYEELGIPIDWLLDSGVVAKMKVMC

A0A151S1U7 Uncharacterized protein6.1e-13449.25Show/hide
Query:  MMNRLGVLVAVSAAAYAIRQLPIRSWSSLILPTNCSENGEDT-ENNVRRHGRIVRGLAEEEEEANSVNESTSQVTGRTSDHNAEMTLNYVAPRIPSQSTL
        M+ RLG++VA S AA+ ++QL +RS        +  EN E T E++V R    ++    EEEE     +  S +  R +D   +                
Subjt:  MMNRLGVLVAVSAAAYAIRQLPIRSWSSLILPTNCSENGEDT-ENNVRRHGRIVRGLAEEEEEANSVNESTSQVTGRTSDHNAEMTLNYVAPRIPSQSTL

Query:  ILPTNCSGGENSSAELRELLTQESENPLLDDN---------KIENSKIELDRLLKLVMELELRKVKLEGELLMCDEIKHDKTNVAELLKQLEAKNDDINT
        ILP           E  +LL+   E PL  D          ++ N+  EL+RL +LV ELE R+VKLEGELL    +K  ++++ EL +QL+ K  +I+ 
Subjt:  ILPTNCSGGENSSAELRELLTQESENPLLDDN---------KIENSKIELDRLLKLVMELELRKVKLEGELLMCDEIKHDKTNVAELLKQLEAKNDDINT

Query:  LNITIKSLQAERKKLQEEILKGASMKTELEKARDKIKELQKQIQLDANKTKEHLLLLKQRVSSLQAKEEEGVKKEAELYKRLKAAKDFEIELETLKRKNR
        LNITI SLQAERKKL EE+ +G S K ELE AR+KIKELQ+QIQL+A++TK  LLLLKQ+VS LQ KEEE  +K+A+L K+LKA  D E+ +  LKR+N+
Subjt:  LNITIKSLQAERKKLQEEILKGASMKTELEKARDKIKELQKQIQLDANKTKEHLLLLKQRVSSLQAKEEEGVKKEAELYKRLKAAKDFEIELETLKRKNR

Query:  ELRHERRELTSKLDVMK---AQVTTLRESEIITKETEEAQNLRSENENLIKQLEELQMNRFSEVEKLVYLRWINACLRYELRDNETSIGE-SARFLNKSL
        EL+HE+REL  KL+  +   A+++ + ES+++ K  EE  NLR  NE+L+KQ+E LQMNRFSEVE+LVYLRW+NACLRYELR+ +T  G+ SAR L+KSL
Subjt:  ELRHERRELTSKLDVMK---AQVTTLRESEIITKETEEAQNLRSENENLIKQLEELQMNRFSEVEKLVYLRWINACLRYELRDNETSIGE-SARFLNKSL

Query:  SPKSKEKAKQLMLEYAELDFGQAETDHESNFSHPISSGIEDFDKTSIDSSMSRTSSFSEKPNSNLSLKKFIRNKGGSSGVSSQRITGSSYRGENP--LEE
        SPKS+EKAKQLMLEYA  + GQ +TD +SNFSHP S G EDFD  SIDSS S+ SS S+K +     KK+ ++K  SS +SS      S+ GEN    EE
Subjt:  SPKSKEKAKQLMLEYAELDFGQAETDHESNFSHPISSGIEDFDKTSIDSSMSRTSSFSEKPNSNLSLKKFIRNKGGSSGVSSQRITGSSYRGENP--LEE

Query:  VMAVSAAETSTLSEVKADMETHIDMIMPLAKEVREASFSNMDDVVSFVIRLDEKLSSLVDEKEIL-----EHVDWPDVKAYALKEAAFEYQNLMKLGEEV
                     EVKAD+ET  D +  LA EVR ASFS+++D+V+FV  LDE+LS L      L        +WP+ KA AL+EAAFEYQ+LMKL  +V
Subjt:  VMAVSAAETSTLSEVKADMETHIDMIMPLAKEVREASFSNMDDVVSFVIRLDEKLSSLVDEKEIL-----EHVDWPDVKAYALKEAAFEYQNLMKLGEEV

Query:  SSFVDSPKLTCEVALKKMHSLLDKVEQSVHEVVSTRDATISRYEELGIPIDWLLDSGVVAKMKV
        S+F+D P+L CE ALKKM+SLL+KVEQSV+ ++ TRD  ISRY E GIP++WLLDSGVV K+K+
Subjt:  SSFVDSPKLTCEVALKKMHSLLDKVEQSVHEVVSTRDATISRYEELGIPIDWLLDSGVVAKMKV

A0A1S3CSZ9 protein CHUP1, chloroplastic-like4.1e-19956.98Show/hide
Query:  IKMMNRLGVLVAVSAAAYAIRQLPIRSWSSLILPTNCSENGEDTENNVRRHGRIVRGL---AEEEEEANSVNESTSQVTGRTSDHNAEMTLNYVAPRIPS
        ++MMNR+ V+VAVS AAYAI+QL IRSW+S  LPTNCSENGED + N         GL    EEEEEA+S+N++TSQV GRTSD                
Subjt:  IKMMNRLGVLVAVSAAAYAIRQLPIRSWSSLILPTNCSENGEDTENNVRRHGRIVRGL---AEEEEEANSVNESTSQVTGRTSDHNAEMTLNYVAPRIPS

Query:  QSTLILPTNCSGGENSSAELRELLTQ-ESENPLLDDNK---------IENSKIELDRLLKLVMELELRKVKLEGELLMCDEIKHDKTNVAELLKQLEAKN
                    G++SS EL+ LL Q  SEN LL   K          E++KIE +RLLKLVMELE RKVKLEGELLMCD IK+ +T+V EL KQL+AKN
Subjt:  QSTLILPTNCSGGENSSAELRELLTQ-ESENPLLDDNK---------IENSKIELDRLLKLVMELELRKVKLEGELLMCDEIKHDKTNVAELLKQLEAKN

Query:  DDINTLNITIKSLQAERKKLQEEILKGASMKTELEKARDKIKELQKQIQLDANKTKEHLLLLKQRVSSLQAKEEEGVKKEAELYKRLKAAKDFEIELETL
        +DI+ LN TI SLQAERK L+EEILKGA MK ELE+ARDKIKELQ+QIQLDAN+TKE LLLLKQRVS+LQAKEEE VKKEAEL+K+ KAAKDFE+EL  L
Subjt:  DDINTLNITIKSLQAERKKLQEEILKGASMKTELEKARDKIKELQKQIQLDANKTKEHLLLLKQRVSSLQAKEEEGVKKEAELYKRLKAAKDFEIELETL

Query:  KRKNRELRHERRELTSKLDVMKAQVTTL---RESEIITKETEEAQNLRSENENLIKQLEELQMNRFSEVEKLVYLRWINACLRYELRDNETSIGESARFL
        K KNREL+HE++ELTSKL+VMKA++ TL    ESEIITKE EEAQ L+SENE+LIKQLE LQMNRFSEVE+LVYLRWINACLRYELR+N+   GESAR+L
Subjt:  KRKNRELRHERRELTSKLDVMKAQVTTL---RESEIITKETEEAQNLRSENENLIKQLEELQMNRFSEVEKLVYLRWINACLRYELRDNETSIGESARFL

Query:  NKSLSPKSKEKAKQLMLEYAELDFGQAETDHESNFSHPISSGIEDFDKTSIDSSMSRTSSFSEKPNSNLSLKKFIRNKGGSSGVSSQRITGSSYRGENPL
        NKS SPKS+EKAKQLMLEYA ++FGQ ETDHESNFSHP S GI++ + TSIDSS SRTSSFSEKPNSNLSLKK IRN+GG S VS   I+GSS+R ++PL
Subjt:  NKSLSPKSKEKAKQLMLEYAELDFGQAETDHESNFSHPISSGIEDFDKTSIDSSMSRTSSFSEKPNSNLSLKKFIRNKGGSSGVSSQRITGSSYRGENPL

Query:  EEVMAVSAAETSTLS-------------------------------------------------------------------------------------
        E VMA+S AET TLS                                                                                     
Subjt:  EEVMAVSAAETSTLS-------------------------------------------------------------------------------------

Query:  -----------------------------------------------------EVKADMETHID-MIMPLAKEVREASFSNMDDVVSFVIRLDEKLSSLV
                                                             EVKA MET  D +++ LA EVREA FSNM+D+VSFVIRLDEKLSSLV
Subjt:  -----------------------------------------------------EVKADMETHID-MIMPLAKEVREASFSNMDDVVSFVIRLDEKLSSLV

Query:  DEKEILEHVDWPDVKAYALKEAAFEYQNLMKLGEEVSSFVDSPKLTCEVALKKMHSLLDKVEQSVHEVVSTRDATISRYEELGIPIDWLLDSGVVAKMKV
        D  EILEH DWP  K  AL+EAAF YQ LMKL EEVSSFVD+PKLTCEVAL KM+SLLDKVEQSV+ ++ TRD  ISRYEELGIPIDWLLD GVV K+KV
Subjt:  DEKEILEHVDWPDVKAYALKEAAFEYQNLMKLGEEVSSFVDSPKLTCEVALKKMHSLLDKVEQSVHEVVSTRDATISRYEELGIPIDWLLDSGVVAKMKV

Query:  MC
        +C
Subjt:  MC

A0A5D3BMR7 Protein CHUP17.8e-18257.03Show/hide
Query:  EEEEEANSVNESTSQVTGRTSDHNAEMTLNYVAPRIPSQSTLILPTNCSGGENSSAELRELLTQ-ESENPLLDDNK---------IENSKIELDRLLKLV
        EEEEEA+S+N++TSQV GRTSD                            G++SS EL+ LL Q  SEN LL   K          EN+KIE +RLLKLV
Subjt:  EEEEEANSVNESTSQVTGRTSDHNAEMTLNYVAPRIPSQSTLILPTNCSGGENSSAELRELLTQ-ESENPLLDDNK---------IENSKIELDRLLKLV

Query:  MELELRKVKLEGELLMCDEIKHDKTNVAELLKQLEAKNDDINTLNITIKSLQAERKKLQEEILKGASMKTELEKARDKIKELQKQIQLDANKTKEHLLLL
        MELE RKVKLEGELLMCD IK+ +T+V EL KQL+AKN+DI+ LN TI SLQAERK L+EEILKGA MK ELE+AR KIKELQ+QIQLDAN+TKE LLLL
Subjt:  MELELRKVKLEGELLMCDEIKHDKTNVAELLKQLEAKNDDINTLNITIKSLQAERKKLQEEILKGASMKTELEKARDKIKELQKQIQLDANKTKEHLLLL

Query:  KQRVSSLQAKEEEGVKKEAELYKRLKAAKDFEIELETLKRKNRELRHERRELTSKLDVMKAQVTTL---RESEIITKETEEAQNLRSENENLIKQLEELQ
        KQRVS+LQAKEEE VKKEAEL+K+ KAAKDFE+EL  LK KNREL+HE++ELTSKL+VMKA++ TL    ESEIITKE EEAQ L+SENE+LIKQLE LQ
Subjt:  KQRVSSLQAKEEEGVKKEAELYKRLKAAKDFEIELETLKRKNRELRHERRELTSKLDVMKAQVTTL---RESEIITKETEEAQNLRSENENLIKQLEELQ

Query:  MNRFSEVEKLVYLRWINACLRYELRDNETSIGESARFLNKSLSPKSKEKAKQLMLEYAELDFGQAETDHESNFSHPISSGIEDFDKTSIDSSMSRTSSFS
        MNRFSEVE+LVYLRWINACLRYELR+N+   GESAR+LNKS SPKS+EKAKQLMLEYA ++FGQ ETDHESNFSHP S GI++ + TSIDSS SRTSSFS
Subjt:  MNRFSEVEKLVYLRWINACLRYELRDNETSIGESARFLNKSLSPKSKEKAKQLMLEYAELDFGQAETDHESNFSHPISSGIEDFDKTSIDSSMSRTSSFS

Query:  EKPNSNLSLKKFIRNKGGSSGVSSQRITGSSYRGENPLEEVMAVSAAETSTLS-----------------------------------------------
        EKPNSNLSLKK IRN+GG S VS   I+GSS+R ++PLE VMA+S AET TLS                                               
Subjt:  EKPNSNLSLKKFIRNKGGSSGVSSQRITGSSYRGENPLEEVMAVSAAETSTLS-----------------------------------------------

Query:  -------------------------------------------------------------------------------------------EVKADMETH
                                                                                                   EVKA MET 
Subjt:  -------------------------------------------------------------------------------------------EVKADMETH

Query:  ID-MIMPLAKEVREASFSNMDDVVSFVIRLDEKLSSLVDEKEILEHVDWPDVKAYALKEAAFEYQNLMKLGEEVSSFVDSPKLTCEVALKKMHSLLDKVE
         D +++ LA EVREA FSNM+D+VSFVIRLDEKLSSLVD  EILEH DWP  K  AL+EAAF YQ LMKL EEVSSFVD+PKLTCEVAL KM+SLLDKVE
Subjt:  ID-MIMPLAKEVREASFSNMDDVVSFVIRLDEKLSSLVDEKEILEHVDWPDVKAYALKEAAFEYQNLMKLGEEVSSFVDSPKLTCEVALKKMHSLLDKVE

Query:  QSVHEVVSTRDATISRYEELGIPIDWLLDSGVVAKMKVMC
        QSV+ ++ TRD  ISRYEELGIPIDWLLD GVV K+KV+C
Subjt:  QSVHEVVSTRDATISRYEELGIPIDWLLDSGVVAKMKVMC

A0A6J1DWY5 protein CHUP1, chloroplastic-like3.9e-18158.47Show/hide
Query:  MMNRLGVLVAVSAAAYAIRQLPIRSWSSLILPTNCSENGEDTENNVRRHGRIVRGLAEEEEEANSVNESTSQVTGRTSDHNAEMTLNYVAPRIPSQSTLI
        +M +LGVLVAVS AAYAI+QL IRSWSS  LPTNCSENGE TE N         GL  EE++ NS+N + SQV+G +SD                     
Subjt:  MMNRLGVLVAVSAAAYAIRQLPIRSWSSLILPTNCSENGEDTENNVRRHGRIVRGLAEEEEEANSVNESTSQVTGRTSDHNAEMTLNYVAPRIPSQSTLI

Query:  LPTNCSGGENSSAELRELLTQESENPLLDDNK----------IENSKIELDRLLKLVMELELRKVKLEGELLMCDEIKHDKTNVAELLKQLEAKNDDINT
                     ELRELL ++SE+ LLD NK          +EN+KIEL RLLKLVMELE RKVKLE ELLM D +K  K++  EL K+LEAK++D++ 
Subjt:  LPTNCSGGENSSAELRELLTQESENPLLDDNK----------IENSKIELDRLLKLVMELELRKVKLEGELLMCDEIKHDKTNVAELLKQLEAKNDDINT

Query:  LNITIKSLQAERKKLQEEILKGASMKTELEKARDKIKELQKQIQLDANKTKEHLLLLKQRVSSLQAKEEEGVKKEAELYKRLKAAKDFEIELETLKRKNR
        LNITI SLQAERKKLQEEI+KGA MK ELE+A+ KIKELQ+Q+QLDAN+TKEHL  LK+RVS+LQAKEEE VKKEA+LY++LKAAK FE+EL  LK+KNR
Subjt:  LNITIKSLQAERKKLQEEILKGASMKTELEKARDKIKELQKQIQLDANKTKEHLLLLKQRVSSLQAKEEEGVKKEAELYKRLKAAKDFEIELETLKRKNR

Query:  ELRHERRELTSKLDVMKAQV---TTLRESEIITKETEEAQNLRSENENLIKQLEELQMNRFSEVEKLVYLRWINACLRYELRDNETSIGESARFLNKSLS
        +L+ E+ ELTSKL+VM+A++   TTL ESEIIT+E EE + LR  NE L KQLE LQMNRFSEVE+LVYLRW+NACLRYELRDNET  GESA  L+KSLS
Subjt:  ELRHERRELTSKLDVMKAQV---TTLRESEIITKETEEAQNLRSENENLIKQLEELQMNRFSEVEKLVYLRWINACLRYELRDNETSIGESARFLNKSLS

Query:  PKSKEKAKQLMLEYAELDFGQAETDHESNFSHP-ISSGIEDFDKTSIDSSMSRTSSFSEK---------------PNSNLSLKKFIRNKGGSSGVSSQRI
        PKSKEKAKQLMLEYA L FGQ ETDHESNFSHP  SSGIEDFD TS  SS SRTSSF  K                 S +  +   RN   S   S Q +
Subjt:  PKSKEKAKQLMLEYAELDFGQAETDHESNFSHP-ISSGIEDFDKTSIDSSMSRTSSFSEK---------------PNSNLSLKKFIRNKGGSSGVSSQRI

Query:  TG----------SSYRGENPL-----EEVMAVSAAETSTL-------------------SEVKADMETHIDMIMPLAKEVREASFSNMDDVVSFVIRLDE
        +           S+Y+  + L      E      AE   +                   +EVKADMET  D++MPL  +V+  SF+NM+DVVSFVI LD+
Subjt:  TG----------SSYRGENPL-----EEVMAVSAAETSTL-------------------SEVKADMETHIDMIMPLAKEVREASFSNMDDVVSFVIRLDE

Query:  KLSSLVDEK-EILEHVDWPDVKAYALKEAAFEYQNLMKLGEEVSSFVDSPKLTCEVALKKMHSLLDKVEQSVHEVVSTRDATISRYEELGIPIDWLLDSG
        K SSLVDE+  ILEH DWP+ K+ AL+EAA EYQNLMKLGEEVSSFVDSPKLT EVALK MHSLL K+EQSVH V+  R+  IS+YEELGIP+DWLLDSG
Subjt:  KLSSLVDEK-EILEHVDWPDVKAYALKEAAFEYQNLMKLGEEVSSFVDSPKLTCEVALKKMHSLLDKVEQSVHEVVSTRDATISRYEELGIPIDWLLDSG

Query:  VVAKMKVM
        VV KMKV+
Subjt:  VVAKMKVM

SwissProt top hitse value%identityAlignment
Q9LI74 Protein CHUP1, chloroplastic3.1e-11135.27Show/hide
Query:  MMNRLGVLVAVSAAAYAIRQLPIRSWSSLILPTNCSENGE--DTENNV-----RRHGRIVRGLAEEEEEANSVNESTSQVTGRTSDHNAEMTLNYVAPRI
        M  R+G +VA S AA  +++L ++       P+  S+NGE  D E +V          +     EEEEE   +N   +Q  G  SD+  +          
Subjt:  MMNRLGVLVAVSAAAYAIRQLPIRSWSSLILPTNCSENGE--DTENNV-----RRHGRIVRGLAEEEEEANSVNESTSQVTGRTSDHNAEMTLNYVAPRI

Query:  PSQSTLILPTNCSGGENSSAELRELLTQESENPLL-DDNKIENSKI-------------ELDRLLKLVMELELRKVKLEGELLMCDEIKHDKTNVAELLK
              ILP           E  +LL+ E E PL  DDN +E ++              EL+RL +LV ELE R+VKLEGELL    +K  ++++ EL +
Subjt:  PSQSTLILPTNCSGGENSSAELRELLTQESENPLL-DDNKIENSKI-------------ELDRLLKLVMELELRKVKLEGELLMCDEIKHDKTNVAELLK

Query:  QLEAKNDDINTLNITIKSLQAERKKLQEEILKGASMKTELEKARDKIKELQKQIQLDANKTKEHLLLLKQRVSSLQAKEEEGVKKEAELYKRLKAAKDFE
        QL+ K  +I+ LNITI SLQAERKKLQEE+ +   ++ ELE AR+KIKELQ+QIQLDAN+TK  LLLLKQ VSSLQ KEEE + K+ E+ ++LKA +D E
Subjt:  QLEAKNDDINTLNITIKSLQAERKKLQEEILKGASMKTELEKARDKIKELQKQIQLDANKTKEHLLLLKQRVSSLQAKEEEGVKKEAELYKRLKAAKDFE

Query:  IELETLKRKNRELRHERRELTSKLDVMKAQVTTL---RESEIITKETEEAQNLRSENENLIKQLEELQMNRFSEVEKLVYLRWINACLRYELRDNETSIG
        +++  LKRKNREL+HE+REL+ KLD  +A++ TL    ES+ + K  EE  NL+  NE+L+KQ+E LQMNRFSEVE+LVYLRW+NACLRYELR+ +T  G
Subjt:  IELETLKRKNRELRHERRELTSKLDVMKAQVTTL---RESEIITKETEEAQNLRSENENLIKQLEELQMNRFSEVEKLVYLRWINACLRYELRDNETSIG

Query:  E-SARFLNKSLSPKSKEKAKQLMLEYAELDFGQAETDHESNFSHPISSGIEDFDKTSIDSSMSRTSSFSEKPNSNLSLKKFIRNKGGSS----------G
        + SAR L+K+LSPKS+ KAK+LMLEYA  + GQ +TD ESN+S P S G +DFD  S+DSS SR SSFS+KP     LKK+ ++K  SS          G
Subjt:  E-SARFLNKSLSPKSKEKAKQLMLEYAELDFGQAETDHESNFSHPISSGIEDFDKTSIDSSMSRTSSFSEKPNSNLSLKKFIRNKGGSS----------G

Query:  VSSQRITGSSYRGENPLEEVMAVSAAE-------------------------------------------------------------------------
         S  R++ S  +   PLE +M  +A E                                                                         
Subjt:  VSSQRITGSSYRGENPLEEVMAVSAAE-------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------------------------------------------------TSTL
                                                                                                        ++ L
Subjt:  ------------------------------------------------------------------------------------------------TSTL

Query:  SEVKADMETHIDMIMPLAKEVREASFSNMDDVVSFVIRLDEKLSSLVDEKEILEHVDWPDVKAYALKEAAFEYQNLMKLGEEVSSFVDSPKLTCEVALKK
          VKAD+ET  D +  LA EVR +SF++++D+++FV  LDE+LS LVDE+ +L+H DWP+ KA AL+EAAFEYQ+LMKL ++V+SFVD P L+CE ALKK
Subjt:  SEVKADMETHIDMIMPLAKEVREASFSNMDDVVSFVIRLDEKLSSLVDEKEILEHVDWPDVKAYALKEAAFEYQNLMKLGEEVSSFVDSPKLTCEVALKK

Query:  MHSLLDKVEQSVHEVVSTRDATISRYEELGIPIDWLLDSGVVAKMKV
        M+ LL+KVEQSV+ ++ TRD  ISRY+E GIP+DWL D+GVV K+K+
Subjt:  MHSLLDKVEQSVHEVVSTRDATISRYEELGIPIDWLLDSGVVAKMKV

Arabidopsis top hitse value%identityAlignment
AT1G48280.1 hydroxyproline-rich glycoprotein family protein4.9e-2728.43Show/hide
Query:  KQRVSSLQAKEEEGVKKEAELYKRL----KAAKDFEI-------ELETLKRKNRELRHERRELTSKLDVMKAQVTTLRESEIITKETEEAQNLRSENENL
        ++ V +  A E+E  K+  EL ++L       KD ++       ELE  +  N EL    R+L+  L   +A++++L  ++                   
Subjt:  KQRVSSLQAKEEEGVKKEAELYKRL----KAAKDFEI-------ELETLKRKNRELRHERRELTSKLDVMKAQVTTLRESEIITKETEEAQNLRSENENL

Query:  IKQLEELQMNRFSEVEKLVYLRWINACLRYELRDNETSIGESARFLNKSLSPKSKEKAKQL--MLEYAELDFGQAETDHESNFSHPISSGIEDFDKTSID
         K  +E Q +RF ++++L+  +     ++ E+        ES+R    S SP        L   L       G+ + ++ S F+ P              
Subjt:  IKQLEELQMNRFSEVEKLVYLRWINACLRYELRDNETSIGESARFLNKSLSPKSKEKAKQL--MLEYAELDFGQAETDHESNFSHPISSGIEDFDKTSID

Query:  SSMSRTSSFSEKPNSNLSLKKFIRNKGGSSGVSSQRITGSSYRGENPLEEVMAVSAAETSTLSEVKADMETHIDMIMPLAKEVREASFSNMDDVVSFVIR
        +  +R       P S L     + NK  +S   SQ + G+  +  +    ++      ++ L  +KAD+ET  + I  L ++V    FS+M+DV+ FV  
Subjt:  SSMSRTSSFSEKPNSNLSLKKFIRNKGGSSGVSSQRITGSSYRGENPLEEVMAVSAAETSTLSEVKADMETHIDMIMPLAKEVREASFSNMDDVVSFVIR

Query:  LDEKLSSLVDEKEILEHVDWPDVKAYALKEAAFEYQNLMKLGEEVSSFVDSPKLTCEVALKKMHSLLDKVEQSVHEVVSTRDATISRYEELGIPIDWLLD
        LD++L++L DE+ +L+H  WP+ KA  L+EAA EY+ L KL +E+SS+ D P +   VALKKM +LLDK EQ +  +V  R +++  Y++  IP++W+LD
Subjt:  LDEKLSSLVDEKEILEHVDWPDVKAYALKEAAFEYQNLMKLGEEVSSFVDSPKLTCEVALKKMHSLLDKVEQSVHEVVSTRDATISRYEELGIPIDWLLD

Query:  SGVVAKMK
        SG++ K+K
Subjt:  SGVVAKMK

AT1G52080.1 actin binding protein family3.6e-3028.16Show/hide
Query:  RLGVLVAVSAAAYAIRQLPIRSWSSLILPTNCSENGEDTENNVRRHGRIVRGLAEEEEEANSVNESTSQVTGRTSDHNAEMTLNYVAPRIPSQSTLILPT
        +LG  +AVS A +   +   R  +  I PT        ++N  R +    + +   +E     +E T        + + +    ++ P    ++  +   
Subjt:  RLGVLVAVSAAAYAIRQLPIRSWSSLILPTNCSENGEDTENNVRRHGRIVRGLAEEEEEANSVNESTSQVTGRTSDHNAEMTLNYVAPRIPSQSTLILPT

Query:  NCSGGENSSAELRELLTQESENPLLDDNKIENSKIELDRLLKLVMELELRKVKLEGELLMCDEIKHDKTNVAELLKQLEAKNDDINTLNITIKSLQAERK
         C   E   +++   L   SE     +N       E++RL   V  L  R+  LE +LL    +K  +    EL  +L+    +    N  IK LQAE +
Subjt:  NCSGGENSSAELRELLTQESENPLLDDNKIENSKIELDRLLKLVMELELRKVKLEGELLMCDEIKHDKTNVAELLKQLEAKNDDINTLNITIKSLQAERK

Query:  KLQEEILKGASMKTELEKARDKIKELQKQIQLDANKTKEHLLLLKQRVSSLQAKEEEGVKKEAELYKRLKAAKDFEIELETLKRKNRELRHERRELTSKL
        KL+ E  + + +  EL+ A+ +++ L+K++ ++  +    +L LKQRV+ LQ +E + V  + E  K ++  +D E E+  L   N  L+ E  EL+ KL
Subjt:  KLQEEILKGASMKTELEKARDKIKELQKQIQLDANKTKEHLLLLKQRVSSLQAKEEEGVKKEAELYKRLKAAKDFEIELETLKRKNRELRHERRELTSKL

Query:  DVMKAQVTT-LRESEIITKETEEAQNLRSENENLIKQLEELQMNRFSEVEKLVYLRWINACLRYELRDNETSIGES-ARFLNKSLSPKSKEKAKQLMLEY
        + ++    + L E E I    E+   LRSENE L K +E+LQ +R +++E+LVYLRWINACLRYELR  +   G++ AR L+ +LSP S+EKAKQL+LEY
Subjt:  DVMKAQVTT-LRESEIITKETEEAQNLRSENENLIKQLEELQMNRFSEVEKLVYLRWINACLRYELRDNETSIGES-ARFLNKSLSPKSKEKAKQLMLEY

Query:  AELDFGQAETDHES-NFSHPISSGIED---FDKTSIDSSMSRTSSFSEKPNSNLSLKKFIRNKGGSSGVSSQRITGSSYRGENPLEEVMAVSAAETSTLS
        A     +  TD++  + S   SS I D    D +S+D+     ++ ++K      + K ++   G     S++  GSS             S++ T   S
Subjt:  AELDFGQAETDHES-NFSHPISSGIED---FDKTSIDSSMSRTSSFSEKPNSNLSLKKFIRNKGGSSGVSSQRITGSSYRGENPLEEVMAVSAAETSTLS

Query:  EVKADMETH-IDMIMPLAKEVREASFSNMDDVVSFVIRLDEKLSSLVDEKEILEHVDWPDVKAYALKEAAFEYQNLMKLGEEVSSF
          +    TH +D  M +  +  E  F N   +V    + +   SS   E+  LE       +   L +A  + ++  KL ++  SF
Subjt:  EVKADMETH-IDMIMPLAKEVREASFSNMDDVVSFVIRLDEKLSSLVDEKEILEHVDWPDVKAYALKEAAFEYQNLMKLGEEVSSF

AT3G25690.1 Hydroxyproline-rich glycoprotein family protein2.2e-11235.27Show/hide
Query:  MMNRLGVLVAVSAAAYAIRQLPIRSWSSLILPTNCSENGE--DTENNV-----RRHGRIVRGLAEEEEEANSVNESTSQVTGRTSDHNAEMTLNYVAPRI
        M  R+G +VA S AA  +++L ++       P+  S+NGE  D E +V          +     EEEEE   +N   +Q  G  SD+  +          
Subjt:  MMNRLGVLVAVSAAAYAIRQLPIRSWSSLILPTNCSENGE--DTENNV-----RRHGRIVRGLAEEEEEANSVNESTSQVTGRTSDHNAEMTLNYVAPRI

Query:  PSQSTLILPTNCSGGENSSAELRELLTQESENPLL-DDNKIENSKI-------------ELDRLLKLVMELELRKVKLEGELLMCDEIKHDKTNVAELLK
              ILP           E  +LL+ E E PL  DDN +E ++              EL+RL +LV ELE R+VKLEGELL    +K  ++++ EL +
Subjt:  PSQSTLILPTNCSGGENSSAELRELLTQESENPLL-DDNKIENSKI-------------ELDRLLKLVMELELRKVKLEGELLMCDEIKHDKTNVAELLK

Query:  QLEAKNDDINTLNITIKSLQAERKKLQEEILKGASMKTELEKARDKIKELQKQIQLDANKTKEHLLLLKQRVSSLQAKEEEGVKKEAELYKRLKAAKDFE
        QL+ K  +I+ LNITI SLQAERKKLQEE+ +   ++ ELE AR+KIKELQ+QIQLDAN+TK  LLLLKQ VSSLQ KEEE + K+ E+ ++LKA +D E
Subjt:  QLEAKNDDINTLNITIKSLQAERKKLQEEILKGASMKTELEKARDKIKELQKQIQLDANKTKEHLLLLKQRVSSLQAKEEEGVKKEAELYKRLKAAKDFE

Query:  IELETLKRKNRELRHERRELTSKLDVMKAQVTTL---RESEIITKETEEAQNLRSENENLIKQLEELQMNRFSEVEKLVYLRWINACLRYELRDNETSIG
        +++  LKRKNREL+HE+REL+ KLD  +A++ TL    ES+ + K  EE  NL+  NE+L+KQ+E LQMNRFSEVE+LVYLRW+NACLRYELR+ +T  G
Subjt:  IELETLKRKNRELRHERRELTSKLDVMKAQVTTL---RESEIITKETEEAQNLRSENENLIKQLEELQMNRFSEVEKLVYLRWINACLRYELRDNETSIG

Query:  E-SARFLNKSLSPKSKEKAKQLMLEYAELDFGQAETDHESNFSHPISSGIEDFDKTSIDSSMSRTSSFSEKPNSNLSLKKFIRNKGGSS----------G
        + SAR L+K+LSPKS+ KAK+LMLEYA  + GQ +TD ESN+S P S G +DFD  S+DSS SR SSFS+KP     LKK+ ++K  SS          G
Subjt:  E-SARFLNKSLSPKSKEKAKQLMLEYAELDFGQAETDHESNFSHPISSGIEDFDKTSIDSSMSRTSSFSEKPNSNLSLKKFIRNKGGSS----------G

Query:  VSSQRITGSSYRGENPLEEVMAVSAAE-------------------------------------------------------------------------
         S  R++ S  +   PLE +M  +A E                                                                         
Subjt:  VSSQRITGSSYRGENPLEEVMAVSAAE-------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------------------------------------------------TSTL
                                                                                                        ++ L
Subjt:  ------------------------------------------------------------------------------------------------TSTL

Query:  SEVKADMETHIDMIMPLAKEVREASFSNMDDVVSFVIRLDEKLSSLVDEKEILEHVDWPDVKAYALKEAAFEYQNLMKLGEEVSSFVDSPKLTCEVALKK
          VKAD+ET  D +  LA EVR +SF++++D+++FV  LDE+LS LVDE+ +L+H DWP+ KA AL+EAAFEYQ+LMKL ++V+SFVD P L+CE ALKK
Subjt:  SEVKADMETHIDMIMPLAKEVREASFSNMDDVVSFVIRLDEKLSSLVDEKEILEHVDWPDVKAYALKEAAFEYQNLMKLGEEVSSFVDSPKLTCEVALKK

Query:  MHSLLDKVEQSVHEVVSTRDATISRYEELGIPIDWLLDSGVVAKMKV
        M+ LL+KVEQSV+ ++ TRD  ISRY+E GIP+DWL D+GVV K+K+
Subjt:  MHSLLDKVEQSVHEVVSTRDATISRYEELGIPIDWLLDSGVVAKMKV

AT3G25690.2 Hydroxyproline-rich glycoprotein family protein2.2e-11235.27Show/hide
Query:  MMNRLGVLVAVSAAAYAIRQLPIRSWSSLILPTNCSENGE--DTENNV-----RRHGRIVRGLAEEEEEANSVNESTSQVTGRTSDHNAEMTLNYVAPRI
        M  R+G +VA S AA  +++L ++       P+  S+NGE  D E +V          +     EEEEE   +N   +Q  G  SD+  +          
Subjt:  MMNRLGVLVAVSAAAYAIRQLPIRSWSSLILPTNCSENGE--DTENNV-----RRHGRIVRGLAEEEEEANSVNESTSQVTGRTSDHNAEMTLNYVAPRI

Query:  PSQSTLILPTNCSGGENSSAELRELLTQESENPLL-DDNKIENSKI-------------ELDRLLKLVMELELRKVKLEGELLMCDEIKHDKTNVAELLK
              ILP           E  +LL+ E E PL  DDN +E ++              EL+RL +LV ELE R+VKLEGELL    +K  ++++ EL +
Subjt:  PSQSTLILPTNCSGGENSSAELRELLTQESENPLL-DDNKIENSKI-------------ELDRLLKLVMELELRKVKLEGELLMCDEIKHDKTNVAELLK

Query:  QLEAKNDDINTLNITIKSLQAERKKLQEEILKGASMKTELEKARDKIKELQKQIQLDANKTKEHLLLLKQRVSSLQAKEEEGVKKEAELYKRLKAAKDFE
        QL+ K  +I+ LNITI SLQAERKKLQEE+ +   ++ ELE AR+KIKELQ+QIQLDAN+TK  LLLLKQ VSSLQ KEEE + K+ E+ ++LKA +D E
Subjt:  QLEAKNDDINTLNITIKSLQAERKKLQEEILKGASMKTELEKARDKIKELQKQIQLDANKTKEHLLLLKQRVSSLQAKEEEGVKKEAELYKRLKAAKDFE

Query:  IELETLKRKNRELRHERRELTSKLDVMKAQVTTL---RESEIITKETEEAQNLRSENENLIKQLEELQMNRFSEVEKLVYLRWINACLRYELRDNETSIG
        +++  LKRKNREL+HE+REL+ KLD  +A++ TL    ES+ + K  EE  NL+  NE+L+KQ+E LQMNRFSEVE+LVYLRW+NACLRYELR+ +T  G
Subjt:  IELETLKRKNRELRHERRELTSKLDVMKAQVTTL---RESEIITKETEEAQNLRSENENLIKQLEELQMNRFSEVEKLVYLRWINACLRYELRDNETSIG

Query:  E-SARFLNKSLSPKSKEKAKQLMLEYAELDFGQAETDHESNFSHPISSGIEDFDKTSIDSSMSRTSSFSEKPNSNLSLKKFIRNKGGSS----------G
        + SAR L+K+LSPKS+ KAK+LMLEYA  + GQ +TD ESN+S P S G +DFD  S+DSS SR SSFS+KP     LKK+ ++K  SS          G
Subjt:  E-SARFLNKSLSPKSKEKAKQLMLEYAELDFGQAETDHESNFSHPISSGIEDFDKTSIDSSMSRTSSFSEKPNSNLSLKKFIRNKGGSS----------G

Query:  VSSQRITGSSYRGENPLEEVMAVSAAE-------------------------------------------------------------------------
         S  R++ S  +   PLE +M  +A E                                                                         
Subjt:  VSSQRITGSSYRGENPLEEVMAVSAAE-------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------------------------------------------------TSTL
                                                                                                        ++ L
Subjt:  ------------------------------------------------------------------------------------------------TSTL

Query:  SEVKADMETHIDMIMPLAKEVREASFSNMDDVVSFVIRLDEKLSSLVDEKEILEHVDWPDVKAYALKEAAFEYQNLMKLGEEVSSFVDSPKLTCEVALKK
          VKAD+ET  D +  LA EVR +SF++++D+++FV  LDE+LS LVDE+ +L+H DWP+ KA AL+EAAFEYQ+LMKL ++V+SFVD P L+CE ALKK
Subjt:  SEVKADMETHIDMIMPLAKEVREASFSNMDDVVSFVIRLDEKLSSLVDEKEILEHVDWPDVKAYALKEAAFEYQNLMKLGEEVSSFVDSPKLTCEVALKK

Query:  MHSLLDKVEQSVHEVVSTRDATISRYEELGIPIDWLLDSGVVAKMKV
        M+ LL+KVEQSV+ ++ TRD  ISRY+E GIP+DWL D+GVV K+K+
Subjt:  MHSLLDKVEQSVHEVVSTRDATISRYEELGIPIDWLLDSGVVAKMKV

AT3G25690.3 Hydroxyproline-rich glycoprotein family protein3.7e-9135.36Show/hide
Query:  KKLQEEILKGASMKTELEKARDKIKELQKQIQLDANKTKEHLLLLKQRVSSLQAKEEEGVKKEAELYKRLKAAKDFEIELETLKRKNRELRHERRELTSK
        K LQEE+ +   ++ ELE AR+KIKELQ+QIQLDAN+TK  LLLLKQ VSSLQ KEEE + K+ E+ ++LKA +D E+++  LKRKNREL+HE+REL+ K
Subjt:  KKLQEEILKGASMKTELEKARDKIKELQKQIQLDANKTKEHLLLLKQRVSSLQAKEEEGVKKEAELYKRLKAAKDFEIELETLKRKNRELRHERRELTSK

Query:  LDVMKAQVTTL---RESEIITKETEEAQNLRSENENLIKQLEELQMNRFSEVEKLVYLRWINACLRYELRDNETSIGE-SARFLNKSLSPKSKEKAKQLM
        LD  +A++ TL    ES+ + K  EE  NL+  NE+L+KQ+E LQMNRFSEVE+LVYLRW+NACLRYELR+ +T  G+ SAR L+K+LSPKS+ KAK+LM
Subjt:  LDVMKAQVTTL---RESEIITKETEEAQNLRSENENLIKQLEELQMNRFSEVEKLVYLRWINACLRYELRDNETSIGE-SARFLNKSLSPKSKEKAKQLM

Query:  LEYAELDFGQAETDHESNFSHPISSGIEDFDKTSIDSSMSRTSSFSEKPNSNLSLKKFIRNKGGSS----------GVSSQRITGSSYRGENPLEEVMAV
        LEYA  + GQ +TD ESN+S P S G +DFD  S+DSS SR SSFS+KP     LKK+ ++K  SS          G S  R++ S  +   PLE +M  
Subjt:  LEYAELDFGQAETDHESNFSHPISSGIEDFDKTSIDSSMSRTSSFSEKPNSNLSLKKFIRNKGGSS----------GVSSQRITGSSYRGENPLEEVMAV

Query:  SAAE------------------------------------------------------------------------------------------------
        +A E                                                                                                
Subjt:  SAAE------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------------------------------------------TSTLSEVKADMETHIDMIMPLAKEVRE
                                                                                 ++ L  VKAD+ET  D +  LA EVR 
Subjt:  -------------------------------------------------------------------------TSTLSEVKADMETHIDMIMPLAKEVRE

Query:  ASFSNMDDVVSFVIRLDEKLSSLVDEKEILEHVDWPDVKAYALKEAAFEYQNLMKLGEEVSSFVDSPKLTCEVALKKMHSLLDKVEQSVHEVVSTRDATI
        +SF++++D+++FV  LDE+LS LVDE+ +L+H DWP+ KA AL+EAAFEYQ+LMKL ++V+SFVD P L+CE ALKKM+ LL+KVEQSV+ ++ TRD  I
Subjt:  ASFSNMDDVVSFVIRLDEKLSSLVDEKEILEHVDWPDVKAYALKEAAFEYQNLMKLGEEVSSFVDSPKLTCEVALKKMHSLLDKVEQSVHEVVSTRDATI

Query:  SRYEELGIPIDWLLDSGVVAKMKV
        SRY+E GIP+DWL D+GVV K+K+
Subjt:  SRYEELGIPIDWLLDSGVVAKMKV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTCTAATTAAGATGATGAACAGATTGGGTGTTCTTGTTGCTGTTTCAGCTGCAGCTTATGCAATTAGGCAGCTACCAATCAGATCATGGAGCTCATTAATCTTGCC
TACCAACTGTTCAGAAAATGGGGAAGACACCGAGAACAACGTAAGGCGACACGGAAGAATTGTCCGTGGCTTGGCTGAAGAAGAAGAGGAAGCTAATTCTGTAAATGAAT
CAACTAGTCAAGTTACTGGTAGAACTTCTGATCATAATGCTGAGATGACTTTGAACTATGTTGCCCCACGAATACCTAGTCAGAGCACCTTAATCTTGCCTACCAATTGT
TCAGGTGGAGAAAATAGCTCAGCAGAACTTAGAGAACTTCTAACACAGGAGTCCGAAAATCCATTGCTTGATGACAACAAGATTGAAAACAGTAAGATTGAATTGGACAG
GTTGCTGAAACTAGTAATGGAATTAGAGTTGCGGAAAGTGAAACTTGAAGGTGAATTGCTCATGTGCGACGAAATAAAGCATGACAAAACCAATGTCGCGGAGTTACTAA
AGCAGCTCGAGGCCAAGAATGACGATATCAATACGCTTAATATCACAATCAAGTCTTTGCAGGCTGAGAGAAAGAAGCTCCAAGAAGAGATACTGAAGGGAGCATCAATG
AAAACTGAACTAGAGAAGGCTAGGGACAAGATTAAGGAGCTGCAAAAGCAGATTCAGCTTGATGCAAACAAAACAAAAGAACACTTGCTATTACTCAAACAAAGAGTTTC
CTCTTTGCAGGCAAAAGAGGAAGAGGGTGTCAAGAAAGAAGCAGAACTTTATAAGAGGTTGAAAGCAGCAAAGGATTTCGAGATAGAACTCGAGACGCTCAAGCGGAAGA
ATCGAGAGCTTCGACATGAAAGGAGGGAATTAACTTCTAAGCTTGATGTTATGAAGGCCCAAGTCACAACATTGAGAGAGAGTGAAATAATCACTAAGGAAACAGAGGAG
GCTCAAAATTTAAGAAGTGAAAATGAGAATCTGATCAAGCAACTTGAAGAGCTTCAAATGAATAGGTTTAGTGAAGTTGAAAAATTAGTGTACCTACGGTGGATCAATGC
TTGCTTGAGGTACGAGCTTCGAGACAACGAGACATCGATAGGCGAATCCGCTCGTTTTCTCAACAAGAGTTTAAGTCCAAAGTCCAAAGAGAAGGCCAAACAGCTCATGC
TAGAGTATGCAGAATTGGACTTTGGACAAGCAGAAACAGATCATGAAAGCAACTTTTCTCATCCAATTTCTTCTGGGATTGAAGATTTTGACAAAACTTCAATTGATAGT
TCAATGAGCAGAACCAGCAGTTTCAGTGAGAAGCCTAATTCAAATCTGAGTTTGAAGAAATTTATCAGAAACAAAGGTGGTTCGAGTGGTGTTTCGTCTCAGCGCATAAC
TGGTTCGAGCTACAGAGGGGAGAATCCTTTAGAAGAGGTGATGGCTGTGAGTGCTGCTGAAACTTCAACTCTTTCAGAAGTGAAAGCTGATATGGAGACTCACATAGACA
TGATTATGCCACTAGCAAAGGAAGTTAGAGAAGCTTCTTTCTCCAACATGGACGATGTCGTCTCCTTTGTAATACGCCTCGATGAGAAATTATCTTCGTTGGTTGATGAG
AAGGAGATTCTCGAGCACGTCGATTGGCCCGATGTAAAGGCATATGCATTGAAAGAAGCAGCATTTGAGTACCAGAACCTGATGAAACTAGGAGAAGAAGTCTCTTCTTT
TGTGGACAGTCCCAAACTCACATGTGAAGTTGCTTTGAAAAAAATGCACTCCTTGCTAGATAAGGTGGAGCAGAGTGTTCATGAAGTCGTAAGTACAAGGGACGCAACAA
TTTCGCGATACGAGGAACTCGGAATTCCCATTGATTGGCTGTTGGATTCTGGAGTTGTTGCCAAGATGAAGGTTATGTGTACAATTGGCAAGGAAGTTCATGAAACGCAT
AGTAAATGA
mRNA sequenceShow/hide mRNA sequence
GTTCTCTATAAACTAGCAGAAGAAGACAAAGACAACCAGAATCTAGTCTCTTCAGAGGCTTCATTCTAACTTTTGTTCTTTCTGTTCATTCTCTATGTTTCTAATTAAGA
TGATGAACAGATTGGGTGTTCTTGTTGCTGTTTCAGCTGCAGCTTATGCAATTAGGCAGCTACCAATCAGATCATGGAGCTCATTAATCTTGCCTACCAACTGTTCAGAA
AATGGGGAAGACACCGAGAACAACGTAAGGCGACACGGAAGAATTGTCCGTGGCTTGGCTGAAGAAGAAGAGGAAGCTAATTCTGTAAATGAATCAACTAGTCAAGTTAC
TGGTAGAACTTCTGATCATAATGCTGAGATGACTTTGAACTATGTTGCCCCACGAATACCTAGTCAGAGCACCTTAATCTTGCCTACCAATTGTTCAGGTGGAGAAAATA
GCTCAGCAGAACTTAGAGAACTTCTAACACAGGAGTCCGAAAATCCATTGCTTGATGACAACAAGATTGAAAACAGTAAGATTGAATTGGACAGGTTGCTGAAACTAGTA
ATGGAATTAGAGTTGCGGAAAGTGAAACTTGAAGGTGAATTGCTCATGTGCGACGAAATAAAGCATGACAAAACCAATGTCGCGGAGTTACTAAAGCAGCTCGAGGCCAA
GAATGACGATATCAATACGCTTAATATCACAATCAAGTCTTTGCAGGCTGAGAGAAAGAAGCTCCAAGAAGAGATACTGAAGGGAGCATCAATGAAAACTGAACTAGAGA
AGGCTAGGGACAAGATTAAGGAGCTGCAAAAGCAGATTCAGCTTGATGCAAACAAAACAAAAGAACACTTGCTATTACTCAAACAAAGAGTTTCCTCTTTGCAGGCAAAA
GAGGAAGAGGGTGTCAAGAAAGAAGCAGAACTTTATAAGAGGTTGAAAGCAGCAAAGGATTTCGAGATAGAACTCGAGACGCTCAAGCGGAAGAATCGAGAGCTTCGACA
TGAAAGGAGGGAATTAACTTCTAAGCTTGATGTTATGAAGGCCCAAGTCACAACATTGAGAGAGAGTGAAATAATCACTAAGGAAACAGAGGAGGCTCAAAATTTAAGAA
GTGAAAATGAGAATCTGATCAAGCAACTTGAAGAGCTTCAAATGAATAGGTTTAGTGAAGTTGAAAAATTAGTGTACCTACGGTGGATCAATGCTTGCTTGAGGTACGAG
CTTCGAGACAACGAGACATCGATAGGCGAATCCGCTCGTTTTCTCAACAAGAGTTTAAGTCCAAAGTCCAAAGAGAAGGCCAAACAGCTCATGCTAGAGTATGCAGAATT
GGACTTTGGACAAGCAGAAACAGATCATGAAAGCAACTTTTCTCATCCAATTTCTTCTGGGATTGAAGATTTTGACAAAACTTCAATTGATAGTTCAATGAGCAGAACCA
GCAGTTTCAGTGAGAAGCCTAATTCAAATCTGAGTTTGAAGAAATTTATCAGAAACAAAGGTGGTTCGAGTGGTGTTTCGTCTCAGCGCATAACTGGTTCGAGCTACAGA
GGGGAGAATCCTTTAGAAGAGGTGATGGCTGTGAGTGCTGCTGAAACTTCAACTCTTTCAGAAGTGAAAGCTGATATGGAGACTCACATAGACATGATTATGCCACTAGC
AAAGGAAGTTAGAGAAGCTTCTTTCTCCAACATGGACGATGTCGTCTCCTTTGTAATACGCCTCGATGAGAAATTATCTTCGTTGGTTGATGAGAAGGAGATTCTCGAGC
ACGTCGATTGGCCCGATGTAAAGGCATATGCATTGAAAGAAGCAGCATTTGAGTACCAGAACCTGATGAAACTAGGAGAAGAAGTCTCTTCTTTTGTGGACAGTCCCAAA
CTCACATGTGAAGTTGCTTTGAAAAAAATGCACTCCTTGCTAGATAAGGTGGAGCAGAGTGTTCATGAAGTCGTAAGTACAAGGGACGCAACAATTTCGCGATACGAGGA
ACTCGGAATTCCCATTGATTGGCTGTTGGATTCTGGAGTTGTTGCCAAGATGAAGGTTATGTGTACAATTGGCAAGGAAGTTCATGAAACGCATAGTAAATGAACATAAT
GCATTGAGTGGACAAGAAAAACAGCCAAAGAGAGAATTTTTGCTTTTGCAAGGAGTGCGTTTTGCTTCTCGTGTTCATAAGGTATGCGAATTCAAATATGTAATCTCTTG
ATTATTAAATGGTACATAATGCCACTAATATAGAAACAAAGTTTTAAAACCTTGCAACAATACAATACTGGTGACAACATTTCATTGAAGAATGTTGTGCCGCTAATTTT
GGATTCCCACATCCGGAATTTGAAATAATTTCACCCCCACTCACTATATAAACACATATATAGTTTATATTTTAATGCACACAAAGAATTAAGAATTATATAATTCTTAT
TTTAAATTCCTTTTATATTAATTTCTAATTGCATT
Protein sequenceShow/hide protein sequence
MFLIKMMNRLGVLVAVSAAAYAIRQLPIRSWSSLILPTNCSENGEDTENNVRRHGRIVRGLAEEEEEANSVNESTSQVTGRTSDHNAEMTLNYVAPRIPSQSTLILPTNC
SGGENSSAELRELLTQESENPLLDDNKIENSKIELDRLLKLVMELELRKVKLEGELLMCDEIKHDKTNVAELLKQLEAKNDDINTLNITIKSLQAERKKLQEEILKGASM
KTELEKARDKIKELQKQIQLDANKTKEHLLLLKQRVSSLQAKEEEGVKKEAELYKRLKAAKDFEIELETLKRKNRELRHERRELTSKLDVMKAQVTTLRESEIITKETEE
AQNLRSENENLIKQLEELQMNRFSEVEKLVYLRWINACLRYELRDNETSIGESARFLNKSLSPKSKEKAKQLMLEYAELDFGQAETDHESNFSHPISSGIEDFDKTSIDS
SMSRTSSFSEKPNSNLSLKKFIRNKGGSSGVSSQRITGSSYRGENPLEEVMAVSAAETSTLSEVKADMETHIDMIMPLAKEVREASFSNMDDVVSFVIRLDEKLSSLVDE
KEILEHVDWPDVKAYALKEAAFEYQNLMKLGEEVSSFVDSPKLTCEVALKKMHSLLDKVEQSVHEVVSTRDATISRYEELGIPIDWLLDSGVVAKMKVMCTIGKEVHETH
SK