; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr011440 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr011440
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionTranslation initiation factor IF-2 like
Genome locationtig00152909:347700..352583
RNA-Seq ExpressionSgr011440
SyntenySgr011440
Gene Ontology termsNA
InterPro domainsIPR025486 - Domain of unknown function DUF4378
IPR032795 - DUF3741-associated sequence motif


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF3433067.1 hypothetical protein FNV43_RR24169 [Rhamnella rubrinervis]2.7e-5733.47Show/hide
Query:  MGKEWHFGGRSSRRV-TTDHRRNHPSLPGCMSALFHLFHFHSSHFTAQFLFNKHHHPPSFNPSHHPPFPK-ASPRGVEAPRNSLELDETSISWL--RKEE
        MGKEW++GGRSS+R   TDHR    S  GCMSA+F  F FH   F      N HH   SFNP+   P     SP+GVEAPRNSL+ + +S+S +   +EE
Subjt:  MGKEWHFGGRSSRRV-TTDHRRNHPSLPGCMSALFHLFHFHSSHFTAQFLFNKHHHPPSFNPSHHPPFPK-ASPRGVEAPRNSLELDETSISWL--RKEE

Query:  NLHIPMGIQIKTSRKV-SKKPKASEQQLPNIDHIASSESPSAKTPNLLARLMGLDILPQTSSS-SYHHLRPNP---------------------------
        +L+IPMGIQIKT+    S K  A+   + +     SS SP AKTPNL+ARLMGL++LP+ S S S+H   PNP                           
Subjt:  NLHIPMGIQIKTSRKV-SKKPKASEQQLPNIDHIASSESPSAKTPNLLARLMGLDILPQTSSS-SYHHLRPNP---------------------------

Query:  ----GTHSLPESPRVSSARQLDVDRRHHRRLSLQISLDKENPA-------------------------STGEWIEQEKERRRKKV---ALVDITN--NNR
            GT SLPE+PR+SSAR+ DV+  +H RLSLQI  +KEN +                         S G +  Q  ++ ++ V   A  DITN   NR
Subjt:  ----GTHSLPESPRVSSARQLDVDRRHHRRLSLQISLDKENPA-------------------------STGEWIEQEKERRRKKV---ALVDITN--NNR

Query:  KLEFGEQEVGFSRSKAEKSSKRFKKMVAD------------------ESSSKLRNQEVIILKK----------------------------------QKP
             + ++     K++K+SK F K+V D                  ES S  +      L                                     KP
Subjt:  KLEFGEQEVGFSRSKAEKSSKRFKKMVAD------------------ESSSKLRNQEVIILKK----------------------------------QKP

Query:  KQMQMQMQMPMPMPMQKTMPREDGK--------KEAFGCP--------------------------------TNNLPNNLNRPT-VIPVKKDPPPPAAKI
            +Q Q  +    QK     + K        +E FG P                                TNNL N  N PT +IPVKK P PPA KI
Subjt:  KQMQMQMQMPMPMPMQKTMPREDGK--------KEAFGCP--------------------------------TNNLPNNLNRPT-VIPVKKDPPPPAAKI

Query:  PHEQ-PCSYPKGKQ----------------KQPA---------GDGGE-RNAVASTATTD--GGAAEFNYIKTIITSNGISMETPV--------SNPLDP
        PH+Q     P+ +Q                +Q A           GG+  N   ST TT+  GGA  ++YI  I+   GI+ +TPV        S+PLDP
Subjt:  PHEQ-PCSYPKGKQ----------------KQPA---------GDGGE-RNAVASTATTD--GGAAEFNYIKTIITSNGISMETPV--------SNPLDP

Query:  SLFHNLEAAAGAAEE-----PMRHRCNRKLLFDLVDEVVGEILRSERKKK--------------SSKMEGWELAEKVREKVEEFPRANCQVLEDIDALID
        S+F++LE  + +         +  +CNR LLF LVDE++ +IL+     K               S MEG +L + +  +++ FP A+CQVLEDIDALID
Subjt:  SLFHNLEAAAGAAEE-----PMRHRCNRKLLFDLVDEVVGEILRSERKKK--------------SSKMEGWELAEKVREKVEEFPRANCQVLEDIDALID

Query:  KDLPKSKLSLSTM--AEADGIVMAVELHVLDSLLRETA
        KDLP+ K+  +T    E +G+V  VE  ++D+L+ ETA
Subjt:  KDLPKSKLSLSTM--AEADGIVMAVELHVLDSLLRETA

KAG7035594.1 hypothetical protein SDJN02_02391, partial [Cucurbita argyrosperma subsp. argyrosperma]2.0e-6840.98Show/hide
Query:  MGKEWHFGGRSS-RRVTTDHRRNHPSLPGCMSALFHLFHFHSSHFTAQFLFNKHHHPPSFNPSHHPPFPKASPRGVEAPRNSLELDETSISWLRKEENLH
        M ++W FGG SS RR   D  R+ PSLP CM+ LFH F  HS  F +  L +  H P S +             GV APRNSLE        L +E+N  
Subjt:  MGKEWHFGGRSS-RRVTTDHRRNHPSLPGCMSALFHLFHFHSSHFTAQFLFNKHHHPPSFNPSHHPPFPKASPRGVEAPRNSLELDETSISWLRKEENLH

Query:  IPMGIQIKTSRKVSKKPKASEQQLPNIDHIASSESPSAKTPNLLARLMGLDILPQTSSSSYHHLRPNPGTHSLPESPRVSSARQLDVDRRHHRRLSLQIS
        I MG++I T                N DH A  +SPS KTPNLLARLMGLDILPQT++S          T SLP SPRVSS+R  DVDR HHR  SL I+
Subjt:  IPMGIQIKTSRKVSKKPKASEQQLPNIDHIASSESPSAKTPNLLARLMGLDILPQTSSSSYHHLRPNPGTHSLPESPRVSSARQLDVDRRHHRRLSLQIS

Query:  LDKENPASTGEWIEQEKERRRKKVALVDITNNNRKLEFGEQEVGFSRSKAEKSSKRFKKMVADESSSKLRNQEVIILKKQKPKQMQMQMQMPMPMPMQKT
        LD EN     E ++QE+E+ R+KVALVDITNNN KL +G                            KL+NQ+V + +K         +    P P  K 
Subjt:  LDKENPASTGEWIEQEKERRRKKVALVDITNNNRKLEFGEQEVGFSRSKAEKSSKRFKKMVADESSSKLRNQEVIILKKQKPKQMQMQMQMPMPMPMQKT

Query:  MPREDGKKEAFGCPTNNLPNNLNRPTVIPVKKDPPPPAAKIPHEQP-----CSYPKGKQKQPAGDGGERNAVASTATTDGGAAEFNYIKTIITSNGISME
         PR   ++E                     +++ PPPAAK+ HEQ      C +P GKQ+  A + G R      AT DGGA E  YIK I+TS      
Subjt:  MPREDGKKEAFGCPTNNLPNNLNRPTVIPVKKDPPPPAAKIPHEQP-----CSYPKGKQKQPAGDGGERNAVASTATTDGGAAEFNYIKTIITSNGISME

Query:  TPVSNPLDPSLFHNLEAAAGAAEEPMRHRCNRKLLFDLVDEVVGEILRSERKKKSSKMEGWELAEKVREKVEEFPRANCQVLEDIDALIDKDLPKSKLSL
        T   NPL+PS+FH+LE ++ A  EP   R N+    D  DEV+GE++ + R  +   M+GWELA           RA C VLEDID+LIDKDL K K  L
Subjt:  TPVSNPLDPSLFHNLEAAAGAAEEPMRHRCNRKLLFDLVDEVVGEILRSERKKKSSKMEGWELAEKVREKVEEFPRANCQVLEDIDALIDKDLPKSKLSL

Query:  STMAEADGIVMAVELHVLDSLLRETAAAIASI
            E +G+V   + H+LDSLLRET A I S+
Subjt:  STMAEADGIVMAVELHVLDSLLRETAAAIASI

XP_011656164.1 uncharacterized protein LOC105435648 [Cucumis sativus]6.8e-7745.65Show/hide
Query:  MGKE--WHFGGR--SSRRVTT---DH--RRNHPSLPGCMSALFHLFHFHSSHFTAQFLFNKHHHPPSFNPSHHPPF---PKASPRGVEAPRNSLELDE-T
        MGK   W+FGGR  SSRRVTT   DH  RR+H SLP CMS LFHLF F SSHFT   +F+ +H   SF+ SHH P     KAS  GVEAPRNSLELD   
Subjt:  MGKE--WHFGGR--SSRRVTT---DH--RRNHPSLPGCMSALFHLFHFHSSHFTAQFLFNKHHHPPSFNPSHHPPF---PKASPRGVEAPRNSLELDE-T

Query:  SISWLR-KEENLHIPMGIQIKTSRKVSKKPKASEQQLPNIDHIASSESPSAKTPNLLARLMGLDILPQTS-SSSYHHLRPNPGTHSLPESPRVSSARQLD
        SIS LR KEENL + MG+QIKT R  S K KA+EQQLPN D+I + ESPS  TPNLLARLMGLD  PQT+ SSSY+H  PN GT SL ESPR S +R  D
Subjt:  SISWLR-KEENLHIPMGIQIKTSRKVSKKPKASEQQLPNIDHIASSESPSAKTPNLLARLMGLDILPQTS-SSSYHHLRPNPGTHSLPESPRVSSARQLD

Query:  VDRRHHRRLSLQISLDKE--NPASTGEWIEQEKERR--RKKVALVDITNNNRKLEFGEQEVGFSRSK--AEKSSKRFKKMVADESSSK---LRNQEVIIL
        VD  HHRRLSLQI++ ++  N     E I + ++++  R KVAL+DITN+  K+    QE+G S+S+    KS K+ KK   ++SSS     R+ +  ++
Subjt:  VDRRHHRRLSLQISLDKE--NPASTGEWIEQEKERR--RKKVALVDITNNNRKLEFGEQEVGFSRSK--AEKSSKRFKKMVADESSSK---LRNQEVIIL

Query:  KKQKPKQMQMQMQMPMPMPMQKTMPREDGKKEAFGCPTNNLPNNLNRPTVIPVKKDPPPPAAKIPHEQPCSYPKGKQKQPAGDGGERNAVASTATTDGGA
           K K + M MQ+P          R   + EA  CP +N  + L+  T+                 QPCSYPKGK K     GGE NAV +  TTDGG+
Subjt:  KKQKPKQMQMQMQMPMPMPMQKTMPREDGKKEAFGCPTNNLPNNLNRPTVIPVKKDPPPPAAKIPHEQPCSYPKGKQKQPAGDGGERNAVASTATTDGGA

Query:  AEFNYIKTIITS---NGISMETPVSNPLDPSLFHNLEAAAGAAEEPMRHRCNRKLLFDLVDEVVGEILRSERKKKSSKMEGWELAEKVREKVEEFPRANC
        AEF YIKTI  S   N   +  P S       +H++     A EE    R  +++  +L   VVG         +    +GW   ++ R++  EFP    
Subjt:  AEFNYIKTIITS---NGISMETPVSNPLDPSLFHNLEAAAGAAEEPMRHRCNRKLLFDLVDEVVGEILRSERKKKSSKMEGWELAEKVREKVEEFPRANC

Query:  QVLEDIDALIDKDLPKSKLSLSTMAEADGIVMAVELHVLDSLLRETAAAIAS
        +++E   ALI+KDL KSK  +    E +GIV  VELH+LDSLLRE   ++ S
Subjt:  QVLEDIDALIDKDLPKSKLSLSTMAEADGIVMAVELHVLDSLLRETAAAIAS

XP_015878027.1 uncharacterized protein LOC107414431 [Ziziphus jujuba]3.2e-5833.2Show/hide
Query:  MGKEWHFGGRSSRR--VTTDHRRNHPSLPGCMSALFHLFHFHSSHFTAQFLFNKHHHPPSFNPSHHPPFPKASPRGVEAPRNSLELDETSISWLRKE-EN
        MGKEWH+GGRSS+R  V         +  GCMSA+F  F FH      QF F  HH  PSFNP+  P     SP+GVEAPRNSL+ D++S+S + +E EN
Subjt:  MGKEWHFGGRSSRR--VTTDHRRNHPSLPGCMSALFHLFHFHSSHFTAQFLFNKHHHPPSFNPSHHPPFPKASPRGVEAPRNSLELDETSISWLRKE-EN

Query:  LHIPMGIQIKTSRKVSKKPKASEQQLPNIDHIASSESPSAKTPNLLARLMGLDILPQTS-SSSYHHLRPNP-----------------------------
        L+IPMGIQIKT      KP+AS     +     +S SP AKTPNL+ARLMGLD+LP  S S ++H   PNP                             
Subjt:  LHIPMGIQIKTSRKVSKKPKASEQQLPNIDHIASSESPSAKTPNLLARLMGLDILPQTS-SSSYHHLRPNP-----------------------------

Query:  GTHSLPESPRVSSARQLDVDRRHHRRLSLQISLD-----------------------KENPASTGEWIEQEKERRRKKV---ALVDITNNNRKLEFGEQE
        G+ SLPE+PR+SSAR+ DVD   H RLSLQI+ +                       ++   S G +  Q  ++ ++ V   A +DITN  R  +    E
Subjt:  GTHSLPESPRVSSARQLDVDRRHHRRLSLQISLD-----------------------KENPASTGEWIEQEKERRRKKV---ALVDITNNNRKLEFGEQE

Query:  VGFSRSKAEKSSKRFKKMVADESS-----------------SKLRN----------------------------------QEVIILKKQKPKQMQMQMQM
        +  +  +++K+SK F K +ADE+S                 SK  N                                  Q V +  K KP+ +Q +  +
Subjt:  VGFSRSKAEKSSKRFKKMVADESS-----------------SKLRN----------------------------------QEVIILKKQKPKQMQMQMQM

Query:  PMPMPMQ---KTMPRE------DGKKEAFGCP------------------TNNLPNNLNRPT-VIPVKKDPPPPAAKIPHEQPCSY--------------
           + +Q   KT  RE        ++E+F  P                  +NNL N  + PT ++P+KK P PPA KI  +Q                  
Subjt:  PMPMPMQ---KTMPRE------DGKKEAFGCP------------------TNNLPNNLNRPT-VIPVKKDPPPPAAKIPHEQPCSY--------------

Query:  -PKGKQKQPAGDGGERNAV-----------ASTATTDGGAAEFN-----YIKTIITSNGISMETPV--------SNPLDPSLFHNLE--------AAAGA
         P  K +Q A  G E               +ST +T+GG AE +     YI  I++  GI  ETPV        S+PLDPS+F  LE        + +  
Subjt:  -PKGKQKQPAGDGGERNAV-----------ASTATTDGGAAEFN-----YIKTIITSNGISMETPV--------SNPLDPSLFHNLE--------AAAGA

Query:  AEEPMRHRCNRKLLFDLVDEVVGEILR----------------SERKKKSSKMEGWELAEKVREKVEEFPRANCQVLEDIDALIDKDLP--KSKLSLSTM
            +  +CNR+LLF +VDE++ +IL+                + R  + + M+G EL E +  +++ FP A+CQVLEDID LIDKDLP  K++ +++  
Subjt:  AEEPMRHRCNRKLLFDLVDEVVGEILR----------------SERKKKSSKMEGWELAEKVREKVEEFPRANCQVLEDIDALIDKDLP--KSKLSLSTM

Query:  AEADGIVMAVELHVLDSLLRETA
         E +G+V  +E H+LD+L+ ETA
Subjt:  AEADGIVMAVELHVLDSLLRETA

XP_022958521.1 uncharacterized protein LOC111459727 [Cucurbita moschata]3.7e-5941.39Show/hide
Query:  NKHHHPPSFNPSHHPPFPKASPRGVEAPRNSLELDETSISWLRKEENLHIPMGIQIKTSRKVSKKPKASEQQLPNIDHIASSESPSAKTPNLLARLMGLD
        ++H H       HHP  P      V APRNSLE        L +E+N  I MG++I T                N DH A  +SPS KTPNLLARLMGLD
Subjt:  NKHHHPPSFNPSHHPPFPKASPRGVEAPRNSLELDETSISWLRKEENLHIPMGIQIKTSRKVSKKPKASEQQLPNIDHIASSESPSAKTPNLLARLMGLD

Query:  ILPQTSSSSYHHLRPNPGTHSLPESPRVSSARQLDVDRRHHRRLSLQISLDKENPASTGEWIEQEKERRRKKVALVDITNNNRKLEFGEQEVGFSRSKAE
        ILPQT++S          T SLP SPRVSS R  DVDR HHR  SL I+LD EN     E ++QE+E+ R+KVALVDITNNN KL +G            
Subjt:  ILPQTSSSSYHHLRPNPGTHSLPESPRVSSARQLDVDRRHHRRLSLQISLDKENPASTGEWIEQEKERRRKKVALVDITNNNRKLEFGEQEVGFSRSKAE

Query:  KSSKRFKKMVADESSSKLRNQEVIILKKQKPKQMQMQMQMPMPMPMQKTMPREDGKKEAFGCPTNNLPNNLNRPTVIPVKKDPPPPAAKIPHEQPCSYPK
                        KL+NQ+V + +K       +  Q P P P +K  PR   ++E                     +++ PPPAAK+ HEQ C +P 
Subjt:  KSSKRFKKMVADESSSKLRNQEVIILKKQKPKQMQMQMQMPMPMPMQKTMPREDGKKEAFGCPTNNLPNNLNRPTVIPVKKDPPPPAAKIPHEQPCSYPK

Query:  GKQKQPAGDGGERNAVASTATTDGGAAEFNYIKTIITSNGISMETPVSNPLDPSLFHNLEAAAGAAEEPMRHRCNRKLLFDLVDEVVGEILRSERKKKSS
        GKQ+  A + G R      AT DGGA E  YIK I+TS      T   NPL+PS+FH+LE +  A  EP   R N+    D  DEV+GE++ + R  +  
Subjt:  GKQKQPAGDGGERNAVASTATTDGGAAEFNYIKTIITSNGISMETPVSNPLDPSLFHNLEAAAGAAEEPMRHRCNRKLLFDLVDEVVGEILRSERKKKSS

Query:  KMEGWELAEKVREKVEEFPRANCQVLEDIDALIDKDLPKSKLSLSTMAEADGIVMAVELHVLDSLLRETAAAIASI
         M+GWELA           RA C VL+DID+LIDKDL K K  L    E +G+V   E H+LDSLLRET A I S+
Subjt:  KMEGWELAEKVREKVEEFPRANCQVLEDIDALIDKDLPKSKLSLSTMAEADGIVMAVELHVLDSLLRETAAAIASI

TrEMBL top hitse value%identityAlignment
A0A0A0KNC2 VARLMGL domain-containing protein3.3e-7745.65Show/hide
Query:  MGKE--WHFGGR--SSRRVTT---DH--RRNHPSLPGCMSALFHLFHFHSSHFTAQFLFNKHHHPPSFNPSHHPPF---PKASPRGVEAPRNSLELDE-T
        MGK   W+FGGR  SSRRVTT   DH  RR+H SLP CMS LFHLF F SSHFT   +F+ +H   SF+ SHH P     KAS  GVEAPRNSLELD   
Subjt:  MGKE--WHFGGR--SSRRVTT---DH--RRNHPSLPGCMSALFHLFHFHSSHFTAQFLFNKHHHPPSFNPSHHPPF---PKASPRGVEAPRNSLELDE-T

Query:  SISWLR-KEENLHIPMGIQIKTSRKVSKKPKASEQQLPNIDHIASSESPSAKTPNLLARLMGLDILPQTS-SSSYHHLRPNPGTHSLPESPRVSSARQLD
        SIS LR KEENL + MG+QIKT R  S K KA+EQQLPN D+I + ESPS  TPNLLARLMGLD  PQT+ SSSY+H  PN GT SL ESPR S +R  D
Subjt:  SISWLR-KEENLHIPMGIQIKTSRKVSKKPKASEQQLPNIDHIASSESPSAKTPNLLARLMGLDILPQTS-SSSYHHLRPNPGTHSLPESPRVSSARQLD

Query:  VDRRHHRRLSLQISLDKE--NPASTGEWIEQEKERR--RKKVALVDITNNNRKLEFGEQEVGFSRSK--AEKSSKRFKKMVADESSSK---LRNQEVIIL
        VD  HHRRLSLQI++ ++  N     E I + ++++  R KVAL+DITN+  K+    QE+G S+S+    KS K+ KK   ++SSS     R+ +  ++
Subjt:  VDRRHHRRLSLQISLDKE--NPASTGEWIEQEKERR--RKKVALVDITNNNRKLEFGEQEVGFSRSK--AEKSSKRFKKMVADESSSK---LRNQEVIIL

Query:  KKQKPKQMQMQMQMPMPMPMQKTMPREDGKKEAFGCPTNNLPNNLNRPTVIPVKKDPPPPAAKIPHEQPCSYPKGKQKQPAGDGGERNAVASTATTDGGA
           K K + M MQ+P          R   + EA  CP +N  + L+  T+                 QPCSYPKGK K     GGE NAV +  TTDGG+
Subjt:  KKQKPKQMQMQMQMPMPMPMQKTMPREDGKKEAFGCPTNNLPNNLNRPTVIPVKKDPPPPAAKIPHEQPCSYPKGKQKQPAGDGGERNAVASTATTDGGA

Query:  AEFNYIKTIITS---NGISMETPVSNPLDPSLFHNLEAAAGAAEEPMRHRCNRKLLFDLVDEVVGEILRSERKKKSSKMEGWELAEKVREKVEEFPRANC
        AEF YIKTI  S   N   +  P S       +H++     A EE    R  +++  +L   VVG         +    +GW   ++ R++  EFP    
Subjt:  AEFNYIKTIITS---NGISMETPVSNPLDPSLFHNLEAAAGAAEEPMRHRCNRKLLFDLVDEVVGEILRSERKKKSSKMEGWELAEKVREKVEEFPRANC

Query:  QVLEDIDALIDKDLPKSKLSLSTMAEADGIVMAVELHVLDSLLRETAAAIAS
        +++E   ALI+KDL KSK  +    E +GIV  VELH+LDSLLRE   ++ S
Subjt:  QVLEDIDALIDKDLPKSKLSLSTMAEADGIVMAVELHVLDSLLRETAAAIAS

A0A2I4FG38 uncharacterized protein LOC1089985039.0e-5130.56Show/hide
Query:  MGKEWHFGGRSSRRVTTDHRRNHPSLPGCMSALFHLFHFHSSHFTAQFLFNKHHHPPSFNPSHHPPFPKASPRGVEAPRNSLELDETSISWL------RK
        MG+EW++GGRSS++      ++  S  GCM+A+F  F FH      QF F  HH  PSF P+   P    +P+G+EAPRNSL+ +E S+S +       +
Subjt:  MGKEWHFGGRSSRRVTTDHRRNHPSLPGCMSALFHLFHFHSSHFTAQFLFNKHHHPPSFNPSHHPPFPKASPRGVEAPRNSLELDETSISWL------RK

Query:  EENLHIPMGIQIKTSRKVSKKPKASEQQLPNIDHIASSESPSAKTPNLLARLMGLDILPQTSS----------------SSYHHLRPN------------
        EE+ +IP+GIQIKTS     K  A     PN      S SP  KTPNL+ARLMGLD+LP+T S                S  H LRP             
Subjt:  EENLHIPMGIQIKTSRKVSKKPKASEQQLPNIDHIASSESPSAKTPNLLARLMGLDILPQTSS----------------SSYHHLRPN------------

Query:  ----PGTHSLPESPRVSSARQLDVDRRHHRRLSLQIS------------------------LDKENPASTGEW----IEQEKERRRKKVALVDITNNNRK
             GT SLPE+PR+SSAR+ +VD     RLSLQI+                        +++EN  S  E+    ++Q KE+  +KV + DITN    
Subjt:  ----PGTHSLPESPRVSSARQLDVDRRHHRRLSLQIS------------------------LDKENPASTGEW----IEQEKERRRKKVALVDITNNNRK

Query:  LEFGEQEVGFSRSKAEKSSKRFKKMVADESSS------------------KLRNQEVIILKKQK-------------------PKQMQMQMQMPMPMPMQ
         + G      ++ K+ K SK F K+V D+ SS                  K+++    +  K K                   P+ +++ +  P P P+Q
Subjt:  LEFGEQEVGFSRSKAEKSSKRFKKMVADESSS------------------KLRNQEVIILKKQK-------------------PKQMQMQMQMPMPMPMQ

Query:  KTMPREDGKKEAFGC-------------------------------------------------PTNNLPNNLNRPTVIPVKKDPPPPAAKIPHEQ----
        K   ++ G K    C                                                 P +N   N+  P ++ VKK+P P A KIP +Q    
Subjt:  KTMPREDGKKEAFGC-------------------------------------------------PTNNLPNNLNRPTVIPVKKDPPPPAAKIPHEQ----

Query:  -----PCS-----------YPKGKQKQPAGDGGERNAVASTATTDGGAAEFNYIKTIITSNGISMETPV---------SNPLDPSLFHNLE---------
              CS            P+   K  A +  + +     +TT    AE+ Y+  I+   GI  ET +         S+PLDP++ H LE         
Subjt:  -----PCS-----------YPKGKQKQPAGDGGERNAVASTATTDGGAAEFNYIKTIITSNGISMETPV---------SNPLDPSLFHNLE---------

Query:  ---------AAAGAAEEPMRHRCNRK-LLFDLVDEVVGEILRSE----RKKKSSK-------MEGWELAEKVREKVEEFPRANCQVLEDIDALIDKDLPK
                  + G     +   CN+K LLF LVDE++ EIL+      +   SSK       M+G +L EK+  K+  FP A+C+VL+DIDALIDKD+P+
Subjt:  ---------AAAGAAEEPMRHRCNRK-LLFDLVDEVVGEILRSE----RKKKSSK-------MEGWELAEKVREKVEEFPRANCQVLEDIDALIDKDLPK

Query:  SKLSLSTMAEADG--IVMAVELHVLDSLLRETA
         KL  +   E +G  IV  +E  +LD+L+ ETA
Subjt:  SKLSLSTMAEADG--IVMAVELHVLDSLLRETA

A0A2N9FV46 Uncharacterized protein2.5e-5332.57Show/hide
Query:  MGKEWHFG--GRSSRRVTTD-HRRNHPSLPGCMSALFHLFHFHSSHFTAQFLFNKHHHPPSFNPSHHPPFPKASPRGVEAPRNSLELDETSISWL-----
        MG+EW +G  GRSS   +         S  GC+S +FH F FH  HF            PSFN +   P     P+GVEAPRNSLE  E S+S++     
Subjt:  MGKEWHFG--GRSSRRVTTD-HRRNHPSLPGCMSALFHLFHFHSSHFTAQFLFNKHHHPPSFNPSHHPPFPKASPRGVEAPRNSLELDETSISWL-----

Query:  RKEENLHIPMGIQIKTSRKVSKKPKASEQQLPNIDHIASSESPSAKTPNLLARLMGLDILPQT---SSSSYHHLRPNP----------------------
         +EENL I MGIQIKTS         S+    N      S SP+ KTP L+ARLMGLD LP+T   SS S  H  P P                      
Subjt:  RKEENLHIPMGIQIKTSRKVSKKPKASEQQLPNIDHIASSESPSAKTPNLLARLMGLDILPQT---SSSSYHHLRPNP----------------------

Query:  -------GTHSLPESPRVSSARQLDVDRRHHRRLSLQISLDK-------------------------ENPASTGEW----IEQEKERRRKKVALVDITN-
               GT SLPE+PR+SSAR+ DVD  HH RLSLQI+ +                          EN  S  ++    + Q KE+  +KV L DITN 
Subjt:  -------GTHSLPESPRVSSARQLDVDRRHHRRLSLQISLDK-------------------------ENPASTGEW----IEQEKERRRKKVALVDITN-

Query:  -NNRKLEFGEQEVGFSRSKAEKSSKRFKKMVADESSSK-----------------------------------LRNQEVIILK------KQKPKQMQMQM
         NNR  +F          K++KSSK F K+V + S SK                                   L + + + L+      K KP+ +Q Q 
Subjt:  -NNRKLEFGEQEVGFSRSKAEKSSKRFKKMVADESSSK-----------------------------------LRNQEVIILK------KQKPKQMQMQM

Query:  Q---------------MPMPMPMQKTMPREDGKKEAFGCPTN-----NLPN---------------NLNRPTVIPVKKDPPPPAAKIPHEQ---------
        Q                 +  P Q +    + ++E F  P+      N P+               N+N PT +PVKKDP PPA KIP +Q         
Subjt:  Q---------------MPMPMPMQKTMPREDGKKEAFGCPTN-----NLPN---------------NLNRPTVIPVKKDPPPPAAKIPHEQ---------

Query:  -----PCSYPKGKQKQPAGDGGERNAVASTATTDGGAAEFNYIKTIITSNGISMETPVS--------NPLDPSLFHNLEAA--AGAAEEPMRHRCNRKLL
              CS  + KQ+       + +      TT     E+ Y+  +++  GI  +T VS        +PLDPS+FH LE +         + + CNRKLL
Subjt:  -----PCSYPKGKQKQPAGDGGERNAVASTATTDGGAAEFNYIKTIITSNGISMETPVS--------NPLDPSLFHNLEAA--AGAAEEPMRHRCNRKLL

Query:  FDLVDEVVGEILRSERKKKS----------SKMEGWELAEKVREKVEEFPRANCQVLEDIDALIDKDLPKSKLSLSTM--AEADGIVMAVELHVLDSLLR
        F LVDE++ EIL+     K             M+G  L + +  K+  FPRANC+VLED+D L+DKDL   K+  + +   E +GIVM +E  +LD+L+ 
Subjt:  FDLVDEVVGEILRSERKKKS----------SKMEGWELAEKVREKVEEFPRANCQVLEDIDALIDKDLPKSKLSLSTM--AEADGIVMAVELHVLDSLLR

Query:  ETA
        E A
Subjt:  ETA

A0A6J1H2A5 uncharacterized protein LOC1114597271.8e-5941.39Show/hide
Query:  NKHHHPPSFNPSHHPPFPKASPRGVEAPRNSLELDETSISWLRKEENLHIPMGIQIKTSRKVSKKPKASEQQLPNIDHIASSESPSAKTPNLLARLMGLD
        ++H H       HHP  P      V APRNSLE        L +E+N  I MG++I T                N DH A  +SPS KTPNLLARLMGLD
Subjt:  NKHHHPPSFNPSHHPPFPKASPRGVEAPRNSLELDETSISWLRKEENLHIPMGIQIKTSRKVSKKPKASEQQLPNIDHIASSESPSAKTPNLLARLMGLD

Query:  ILPQTSSSSYHHLRPNPGTHSLPESPRVSSARQLDVDRRHHRRLSLQISLDKENPASTGEWIEQEKERRRKKVALVDITNNNRKLEFGEQEVGFSRSKAE
        ILPQT++S          T SLP SPRVSS R  DVDR HHR  SL I+LD EN     E ++QE+E+ R+KVALVDITNNN KL +G            
Subjt:  ILPQTSSSSYHHLRPNPGTHSLPESPRVSSARQLDVDRRHHRRLSLQISLDKENPASTGEWIEQEKERRRKKVALVDITNNNRKLEFGEQEVGFSRSKAE

Query:  KSSKRFKKMVADESSSKLRNQEVIILKKQKPKQMQMQMQMPMPMPMQKTMPREDGKKEAFGCPTNNLPNNLNRPTVIPVKKDPPPPAAKIPHEQPCSYPK
                        KL+NQ+V + +K       +  Q P P P +K  PR   ++E                     +++ PPPAAK+ HEQ C +P 
Subjt:  KSSKRFKKMVADESSSKLRNQEVIILKKQKPKQMQMQMQMPMPMPMQKTMPREDGKKEAFGCPTNNLPNNLNRPTVIPVKKDPPPPAAKIPHEQPCSYPK

Query:  GKQKQPAGDGGERNAVASTATTDGGAAEFNYIKTIITSNGISMETPVSNPLDPSLFHNLEAAAGAAEEPMRHRCNRKLLFDLVDEVVGEILRSERKKKSS
        GKQ+  A + G R      AT DGGA E  YIK I+TS      T   NPL+PS+FH+LE +  A  EP   R N+    D  DEV+GE++ + R  +  
Subjt:  GKQKQPAGDGGERNAVASTATTDGGAAEFNYIKTIITSNGISMETPVSNPLDPSLFHNLEAAAGAAEEPMRHRCNRKLLFDLVDEVVGEILRSERKKKSS

Query:  KMEGWELAEKVREKVEEFPRANCQVLEDIDALIDKDLPKSKLSLSTMAEADGIVMAVELHVLDSLLRETAAAIASI
         M+GWELA           RA C VL+DID+LIDKDL K K  L    E +G+V   E H+LDSLLRET A I S+
Subjt:  KMEGWELAEKVREKVEEFPRANCQVLEDIDALIDKDLPKSKLSLSTMAEADGIVMAVELHVLDSLLRETAAAIASI

A0A6P3ZRI2 uncharacterized protein LOC1074144311.5e-5833.2Show/hide
Query:  MGKEWHFGGRSSRR--VTTDHRRNHPSLPGCMSALFHLFHFHSSHFTAQFLFNKHHHPPSFNPSHHPPFPKASPRGVEAPRNSLELDETSISWLRKE-EN
        MGKEWH+GGRSS+R  V         +  GCMSA+F  F FH      QF F  HH  PSFNP+  P     SP+GVEAPRNSL+ D++S+S + +E EN
Subjt:  MGKEWHFGGRSSRR--VTTDHRRNHPSLPGCMSALFHLFHFHSSHFTAQFLFNKHHHPPSFNPSHHPPFPKASPRGVEAPRNSLELDETSISWLRKE-EN

Query:  LHIPMGIQIKTSRKVSKKPKASEQQLPNIDHIASSESPSAKTPNLLARLMGLDILPQTS-SSSYHHLRPNP-----------------------------
        L+IPMGIQIKT      KP+AS     +     +S SP AKTPNL+ARLMGLD+LP  S S ++H   PNP                             
Subjt:  LHIPMGIQIKTSRKVSKKPKASEQQLPNIDHIASSESPSAKTPNLLARLMGLDILPQTS-SSSYHHLRPNP-----------------------------

Query:  GTHSLPESPRVSSARQLDVDRRHHRRLSLQISLD-----------------------KENPASTGEWIEQEKERRRKKV---ALVDITNNNRKLEFGEQE
        G+ SLPE+PR+SSAR+ DVD   H RLSLQI+ +                       ++   S G +  Q  ++ ++ V   A +DITN  R  +    E
Subjt:  GTHSLPESPRVSSARQLDVDRRHHRRLSLQISLD-----------------------KENPASTGEWIEQEKERRRKKV---ALVDITNNNRKLEFGEQE

Query:  VGFSRSKAEKSSKRFKKMVADESS-----------------SKLRN----------------------------------QEVIILKKQKPKQMQMQMQM
        +  +  +++K+SK F K +ADE+S                 SK  N                                  Q V +  K KP+ +Q +  +
Subjt:  VGFSRSKAEKSSKRFKKMVADESS-----------------SKLRN----------------------------------QEVIILKKQKPKQMQMQMQM

Query:  PMPMPMQ---KTMPRE------DGKKEAFGCP------------------TNNLPNNLNRPT-VIPVKKDPPPPAAKIPHEQPCSY--------------
           + +Q   KT  RE        ++E+F  P                  +NNL N  + PT ++P+KK P PPA KI  +Q                  
Subjt:  PMPMPMQ---KTMPRE------DGKKEAFGCP------------------TNNLPNNLNRPT-VIPVKKDPPPPAAKIPHEQPCSY--------------

Query:  -PKGKQKQPAGDGGERNAV-----------ASTATTDGGAAEFN-----YIKTIITSNGISMETPV--------SNPLDPSLFHNLE--------AAAGA
         P  K +Q A  G E               +ST +T+GG AE +     YI  I++  GI  ETPV        S+PLDPS+F  LE        + +  
Subjt:  -PKGKQKQPAGDGGERNAV-----------ASTATTDGGAAEFN-----YIKTIITSNGISMETPV--------SNPLDPSLFHNLE--------AAAGA

Query:  AEEPMRHRCNRKLLFDLVDEVVGEILR----------------SERKKKSSKMEGWELAEKVREKVEEFPRANCQVLEDIDALIDKDLP--KSKLSLSTM
            +  +CNR+LLF +VDE++ +IL+                + R  + + M+G EL E +  +++ FP A+CQVLEDID LIDKDLP  K++ +++  
Subjt:  AEEPMRHRCNRKLLFDLVDEVVGEILR----------------SERKKKSSKMEGWELAEKVREKVEEFPRANCQVLEDIDALIDKDLP--KSKLSLSTM

Query:  AEADGIVMAVELHVLDSLLRETA
         E +G+V  +E H+LD+L+ ETA
Subjt:  AEADGIVMAVELHVLDSLLRETA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G25430.1 unknown protein2.2e-1728.62Show/hide
Query:  MGKEWHFGGRSSRRVTTDHRRNHPSLPGCMSALFHLFHFHSSHFTAQFLFNKHHHPPSFNPSHHPPFPKASPRGVEAPRNSLELDE---TSISWLRKEEN
        MG+EW+ GGRS+   ++  ++N     GC++AL+H FHFH  +F ++   + HHH PS +       P  + +G+ APRNSL+L E    S ++  + E 
Subjt:  MGKEWHFGGRSSRRVTTDHRRNHPSLPGCMSALFHLFHFHSSHFTAQFLFNKHHHPPSFNPSHHPPFPKASPRGVEAPRNSLELDE---TSISWLRKEEN

Query:  LHIPMGIQIKTSRKVSKKPKASEQQLPNIDHIASSESPSAKTPNLLARLMGLDILPQT--------SSSSYHHLRPN-PGTHSLPESPRVSSARQ-----
        L+I +G +  T R +     +    LP             KTPN++ARLMGLD+LP          +    H L  N  GT SLP SPR+SS  +     
Subjt:  LHIPMGIQIKTSRKVSKKPKASEQQLPNIDHIASSESPSAKTPNLLARLMGLDILPQT--------SSSSYHHLRPN-PGTHSLPESPRVSSARQ-----

Query:  LDVDRRHHRRLSL------QISLDKENPA---STGEWIEQEKERRRKKVALVDITNNNRKLEFGEQEVGFSRSKAEKSSKRFKKMVADESSSKLRNQEVI
        L+++R +++          ++  D+++P+   S  + ++Q K+R   +   +D+TN   K   G    G ++++  +  K      A       +   VI
Subjt:  LDVDRRHHRRLSL------QISLDKENPA---STGEWIEQEKERRRKKVALVDITNNNRKLEFGEQEVGFSRSKAEKSSKRFKKMVADESSSKLRNQEVI

Query:  ILKKQKPKQMQ
         L K+  + ++
Subjt:  ILKKQKPKQMQ

AT5G51850.1 unknown protein7.6e-1827.35Show/hide
Query:  RNHPSLPGCMSALFHLFHFHSSHFTAQFLFNKHHHPPSFNPSHHPPFPKASPRGVEAPRNSLELDETSISWLRKEENLHIPMGIQIKTSRKVSKKPKASE
        R+  +  GCM+A +H             LF+ HHH    +PS          +G++    SL     S ++  KE + +IP+G+++KT        K+S 
Subjt:  RNHPSLPGCMSALFHLFHFHSSHFTAQFLFNKHHHPPSFNPSHHPPFPKASPRGVEAPRNSLELDETSISWLRKEENLHIPMGIQIKTSRKVSKKPKASE

Query:  QQLPNIDHIASS----ESPSAKTPNLLARLMGLDILPQTSS-----------SSYH---HLRPNPGTHSLPESPRVSSARQLDVDRRHHRRLSLQISLDK
         +    D   SS     SP +KTPNL+ARLMGLD+LP  +            SS+H   H     GT SLP SPR+SSAR+ D D     RLSLQ++ +K
Subjt:  QQLPNIDHIASS----ESPSAKTPNLLARLMGLDILPQTSS-----------SSYH---HLRPNPGTHSLPESPRVSSARQLDVDRRHHRRLSLQISLDK

Query:  ENPAS---------------TGEWIEQEKER-RRKKVALVDITNN--NRKL----EFGEQEVGFSRSKAEKSSKRFKKMVADESSSKLRNQEVIILKKQK
        E   S                 + ++Q KER   ++V  +DITN+  NR+     E           +   S K  K+  + + +S   ++   I++K K
Subjt:  ENPAS---------------TGEWIEQEKER-RRKKVALVDITNN--NRKL----EFGEQEVGFSRSKAEKSSKRFKKMVADESSSKLRNQEVIILKKQK

Query:  P-------KQMQMQMQMPMPMPMQKTMPRE-DGKKEAFGCPTNNLPN-----------NLNRPTVIPVKKDPPPP---------AAKIPHEQ------PC
        P       KQ Q +++     P+      E + ++     PT+++ N           ++    +  +KK    P         A + PH+Q        
Subjt:  P-------KQMQMQMQMPMPMPMQKTMPRE-DGKKEAFGCPTNNLPN-----------NLNRPTVIPVKKDPPPP---------AAKIPHEQ------PC

Query:  SYPKGKQKQPAGDGGERNA--VASTATTDGGAAEFN----YIKTIITSNGISMETPVSNPLDPSLFHNLEAAAGAAEEPMRHRCNRKLLFDLVDEVVGEI
        S      +  +    E+N+  VA     D  A E N    YI  I+   GI  ++     LD S+F  LE         +   CNR+LLFDLV+E++ E 
Subjt:  SYPKGKQKQPAGDGGERNA--VASTATTDGGAAEFN----YIKTIITSNGISMETPVSNPLDPSLFHNLEAAAGAAEEPMRHRCNRKLLFDLVDEVVGEI

Query:  LRSERKKKSSKMEGWELAEKVREKVEEFPRANCQVLEDIDALID-KDLPKSKLSLSTMAEADGIVMAVELHVLDSLLRETAAAIA
        +     K+    +G EL  ++   V  +      V E+I AL+D K L + K  L    E + I++ +E  ++D+L+RET + ++
Subjt:  LRSERKKKSSKMEGWELAEKVREKVEEFPRANCQVLEDIDALID-KDLPKSKLSLSTMAEADGIVMAVELHVLDSLLRETAAAIA

AT5G62170.1 unknown protein3.9e-3028.4Show/hide
Query:  MGKEWHF--GGR---SSRRVTTDHRRNHPSLP----------GCMSALFHLFHFHSSHFTAQFLFNKHHHPPSFNPSHHPPFPKASPRGVEAPRNSLEL-
        MG++W +  GG+   SS+    D +   P  P          GCMSA+F++F F       QF  N HH                 P+GV+APRNSLE  
Subjt:  MGKEWHF--GGR---SSRRVTTDHRRNHPSLP----------GCMSALFHLFHFHSSHFTAQFLFNKHHHPPSFNPSHHPPFPKASPRGVEAPRNSLEL-

Query:  -DETSISWLRKEENLHIPMGIQIKTSRKVSKKPKASEQQLPNIDHIASSESPSAKTPNLLARLMGLDIL---------PQTSSSS---------------
         +ETS S  RK+ NL+I MGI+IKT      KP+A             + SPS KTP L+ARLMGLD++         P +SSSS               
Subjt:  -DETSISWLRKEENLHIPMGIQIKTSRKVSKKPKASEQQLPNIDHIASSESPSAKTPNLLARLMGLDIL---------PQTSSSS---------------

Query:  ---YHHLRPNP---GTHSLPESPRVS-SARQLDVDRRHHRRLSL--------------------------QISLDKENPASTGEWIEQ------EKERRR
           ++ L+ N    GT SLPE+PR+S   R +DV+   H+R SL                          +I  DKEN  S  E+  Q      E   RR
Subjt:  ---YHHLRPNP---GTHSLPESPRVS-SARQLDVDRRHHRRLSL--------------------------QISLDKENPASTGEWIEQ------EKERRR

Query:  KKVALVDITN-NNRKLEFGEQEVGFSRSK------------------------------AEKSSKRFKKMVADESSSKLRNQEVIILKKQKPKQMQMQMQ
        +++   DITN   +  E  E +   S++                               A K  +     V D++     ++E    +K+K ++   + +
Subjt:  KKVALVDITN-NNRKLEFGEQEVGFSRSK------------------------------AEKSSKRFKKMVADESSSKLRNQEVIILKKQKPKQMQMQMQ

Query:  MPMPMPMQKTMPREDGKKEAF-GCPTNNLPNNLNRPTVIPV-------KKDP----------PPPAAKIPHEQP---CSYPKGKQKQPAGDGGERNAVAS
         P     +   P +  ++E F   P  N  NN N   ++ +       KK P            P  K    QP    S  K ++ Q   +    + + S
Subjt:  MPMPMPMQKTMPREDGKKEAF-GCPTNNLPNNLNRPTVIPV-------KKDP----------PPPAAKIPHEQP---CSYPKGKQKQPAGDGGERNAVAS

Query:  --------TATTDGGAAEFNYIKTIITSNGISMETPV--------SNPLDPSLFHNLEAAAGAAEEP------MRHRCNRKLLFDLVDEVVGEILRSERK
                 A   GG  E  YI   +   GI  +TP+        S+PLDPS+F+ LE  A  +  P      +  RCNRKLLF LVDE++ +IL+    
Subjt:  --------TATTDGGAAEFNYIKTIITSNGISMETPV--------SNPLDPSLFHNLEAAAGAAEEP------MRHRCNRKLLFDLVDEVVGEILRSERK

Query:  KK----------SSKMEGWELAEKVREKVEEFPRANCQVLEDIDALIDKDLPKSKLSLSTMAEADGIVMAVELHVLDSLLRET
         K             ++G EL +++  ++E FP A C VLEDIDAL+  D P+ + +     + +GIV  +E  + ++L+ ET
Subjt:  KK----------SSKMEGWELAEKVREKVEEFPRANCQVLEDIDALIDKDLPKSKLSLSTMAEADGIVMAVELHVLDSLLRET


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAAAGAATGGCACTTTGGAGGAAGATCTTCCCGCCGAGTTACCACCGACCACCGGCGCAACCACCCTTCTCTTCCGGGCTGTATGAGCGCCCTCTTTCATCTCTT
TCATTTCCATTCCTCTCATTTCACTGCTCAGTTTCTCTTCAACAAGCACCACCACCCACCCTCTTTCAACCCCTCGCATCATCCTCCTTTCCCCAAAGCTTCACCTCGTG
GTGTTGAAGCACCAAGAAACAGCTTGGAATTAGATGAAACTTCAATTTCTTGGTTAAGAAAGGAAGAAAATTTGCATATTCCAATGGGAATTCAAATCAAAACAAGTAGA
AAAGTTAGCAAAAAACCAAAAGCAAGTGAGCAACAACTTCCAAATATTGATCACATTGCATCATCAGAATCTCCAAGTGCCAAGACACCAAATCTCTTAGCAAGACTAAT
GGGTCTTGATATTCTCCCTCAAACCTCCTCTTCTTCCTACCACCACCTCCGACCAAACCCCGGCACACATTCCTTGCCAGAGAGCCCGAGAGTGTCGTCGGCTAGGCAAT
TGGATGTCGACCGTCGTCATCACCGCCGATTGTCGTTGCAAATCAGTTTGGATAAAGAGAATCCCGCCAGCACTGGCGAGTGGATTGAGCAGGAAAAAGAGAGAAGGAGA
AAGAAAGTTGCACTCGTTGATATCACCAATAACAACAGAAAATTGGAGTTTGGTGAACAAGAAGTTGGTTTCAGCCGAAGTAAAGCCGAGAAATCCTCTAAGAGGTTCAA
GAAAATGGTGGCCGATGAATCGAGTTCAAAACTTCGAAATCAAGAGGTAATAATTCTTAAGAAACAAAAGCCGAAGCAGATGCAGATGCAGATGCAGATGCCGATGCCGA
TGCCGATGCAGAAGACAATGCCTCGTGAAGATGGAAAAAAAGAAGCATTTGGTTGTCCAACAAATAATCTCCCCAACAACCTCAATCGCCCAACAGTTATTCCAGTAAAG
AAAGACCCTCCTCCTCCGGCGGCCAAAATCCCTCATGAACAGCCGTGTAGTTATCCAAAGGGCAAGCAAAAGCAGCCGGCCGGAGACGGAGGAGAAAGAAACGCCGTGGC
CAGCACCGCCACGACCGACGGAGGAGCAGCCGAGTTCAACTACATCAAAACAATAATAACCAGTAACGGCATTTCAATGGAGACCCCTGTATCCAACCCTTTAGACCCAT
CACTCTTCCATAACCTGGAAGCCGCCGCTGGGGCCGCGGAGGAGCCGATGCGCCACCGATGCAACAGAAAATTACTGTTCGATCTGGTGGACGAAGTGGTGGGGGAAATT
CTGAGAAGCGAGAGGAAGAAGAAGAGTTCGAAAATGGAGGGGTGGGAGTTGGCGGAGAAAGTGAGGGAGAAAGTTGAAGAGTTCCCACGTGCGAATTGTCAAGTTCTGGA
AGATATCGATGCCTTAATCGACAAAGATTTGCCAAAATCAAAGTTGAGTTTGAGTACAATGGCAGAAGCCGATGGGATTGTCATGGCCGTTGAGCTTCATGTTTTGGACT
CCCTCCTGCGTGAAACTGCTGCCGCCATTGCTTCCATCTGTCCGTACAAAGACGGTGTCGTTTTGACGTTTTCCAGGAACCGATAA
mRNA sequenceShow/hide mRNA sequence
ATGGGGAAAGAATGGCACTTTGGAGGAAGATCTTCCCGCCGAGTTACCACCGACCACCGGCGCAACCACCCTTCTCTTCCGGGCTGTATGAGCGCCCTCTTTCATCTCTT
TCATTTCCATTCCTCTCATTTCACTGCTCAGTTTCTCTTCAACAAGCACCACCACCCACCCTCTTTCAACCCCTCGCATCATCCTCCTTTCCCCAAAGCTTCACCTCGTG
GTGTTGAAGCACCAAGAAACAGCTTGGAATTAGATGAAACTTCAATTTCTTGGTTAAGAAAGGAAGAAAATTTGCATATTCCAATGGGAATTCAAATCAAAACAAGTAGA
AAAGTTAGCAAAAAACCAAAAGCAAGTGAGCAACAACTTCCAAATATTGATCACATTGCATCATCAGAATCTCCAAGTGCCAAGACACCAAATCTCTTAGCAAGACTAAT
GGGTCTTGATATTCTCCCTCAAACCTCCTCTTCTTCCTACCACCACCTCCGACCAAACCCCGGCACACATTCCTTGCCAGAGAGCCCGAGAGTGTCGTCGGCTAGGCAAT
TGGATGTCGACCGTCGTCATCACCGCCGATTGTCGTTGCAAATCAGTTTGGATAAAGAGAATCCCGCCAGCACTGGCGAGTGGATTGAGCAGGAAAAAGAGAGAAGGAGA
AAGAAAGTTGCACTCGTTGATATCACCAATAACAACAGAAAATTGGAGTTTGGTGAACAAGAAGTTGGTTTCAGCCGAAGTAAAGCCGAGAAATCCTCTAAGAGGTTCAA
GAAAATGGTGGCCGATGAATCGAGTTCAAAACTTCGAAATCAAGAGGTAATAATTCTTAAGAAACAAAAGCCGAAGCAGATGCAGATGCAGATGCAGATGCCGATGCCGA
TGCCGATGCAGAAGACAATGCCTCGTGAAGATGGAAAAAAAGAAGCATTTGGTTGTCCAACAAATAATCTCCCCAACAACCTCAATCGCCCAACAGTTATTCCAGTAAAG
AAAGACCCTCCTCCTCCGGCGGCCAAAATCCCTCATGAACAGCCGTGTAGTTATCCAAAGGGCAAGCAAAAGCAGCCGGCCGGAGACGGAGGAGAAAGAAACGCCGTGGC
CAGCACCGCCACGACCGACGGAGGAGCAGCCGAGTTCAACTACATCAAAACAATAATAACCAGTAACGGCATTTCAATGGAGACCCCTGTATCCAACCCTTTAGACCCAT
CACTCTTCCATAACCTGGAAGCCGCCGCTGGGGCCGCGGAGGAGCCGATGCGCCACCGATGCAACAGAAAATTACTGTTCGATCTGGTGGACGAAGTGGTGGGGGAAATT
CTGAGAAGCGAGAGGAAGAAGAAGAGTTCGAAAATGGAGGGGTGGGAGTTGGCGGAGAAAGTGAGGGAGAAAGTTGAAGAGTTCCCACGTGCGAATTGTCAAGTTCTGGA
AGATATCGATGCCTTAATCGACAAAGATTTGCCAAAATCAAAGTTGAGTTTGAGTACAATGGCAGAAGCCGATGGGATTGTCATGGCCGTTGAGCTTCATGTTTTGGACT
CCCTCCTGCGTGAAACTGCTGCCGCCATTGCTTCCATCTGTCCGTACAAAGACGGTGTCGTTTTGACGTTTTCCAGGAACCGATAA
Protein sequenceShow/hide protein sequence
MGKEWHFGGRSSRRVTTDHRRNHPSLPGCMSALFHLFHFHSSHFTAQFLFNKHHHPPSFNPSHHPPFPKASPRGVEAPRNSLELDETSISWLRKEENLHIPMGIQIKTSR
KVSKKPKASEQQLPNIDHIASSESPSAKTPNLLARLMGLDILPQTSSSSYHHLRPNPGTHSLPESPRVSSARQLDVDRRHHRRLSLQISLDKENPASTGEWIEQEKERRR
KKVALVDITNNNRKLEFGEQEVGFSRSKAEKSSKRFKKMVADESSSKLRNQEVIILKKQKPKQMQMQMQMPMPMPMQKTMPREDGKKEAFGCPTNNLPNNLNRPTVIPVK
KDPPPPAAKIPHEQPCSYPKGKQKQPAGDGGERNAVASTATTDGGAAEFNYIKTIITSNGISMETPVSNPLDPSLFHNLEAAAGAAEEPMRHRCNRKLLFDLVDEVVGEI
LRSERKKKSSKMEGWELAEKVREKVEEFPRANCQVLEDIDALIDKDLPKSKLSLSTMAEADGIVMAVELHVLDSLLRETAAAIASICPYKDGVVLTFSRNR