; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10018320 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10018320
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionTryptophan synthase
Genome locationChr04:3027245..3037187
RNA-Seq ExpressionHG10018320
SyntenyHG10018320
Gene Ontology termsGO:0000162 - tryptophan biosynthetic process (biological process)
GO:0004834 - tryptophan synthase activity (molecular function)
GO:0030170 - pyridoxal phosphate binding (molecular function)
GO:0052684 - L-serine hydro-lyase (adding indole, L-tryptophan-forming) activity (molecular function)
InterPro domainsIPR001926 - Pyridoxal-phosphate dependent enzyme
IPR006316 - Tryptophan synthase, beta chain-like
IPR006653 - Tryptophan synthase, beta chain, conserved site
IPR006654 - Tryptophan synthase, beta chain
IPR023026 - Tryptophan synthase beta chain/beta chain-like
IPR036052 - Tryptophan synthase beta subunit-like PLP-dependent enzyme


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF4352321.1 hypothetical protein G4B88_000924 [Cannabis sativa]0.0e+0069.57Show/hide
Query:  SAPPSFPSSLINLNLFNSVGEQSPGCLSL-KRIPTHLRFSNGYRTKAASNPNLKAVEVPRQWYNLVADLPVKPPPPLHPKTFEPIKPEDLSHLFPDELIK
        SA   F SS   L+    V E+  G   L +R P HLRFS+  RT+A  N N K+VE+PRQWYNL+ADLP+KPPPPL+P+T EP+KPEDLS LFPDELIK
Subjt:  SAPPSFPSSLINLNLFNSVGEQSPGCLSL-KRIPTHLRFSNGYRTKAASNPNLKAVEVPRQWYNLVADLPVKPPPPLHPKTFEPIKPEDLSHLFPDELIK

Query:  QEASTDRYINIPDEVVDIYGLWRPTPLIRAKRLEKLLDTPARIYYKYEGVSPAGSHKPNSAVPQVWYNVQEGVKNVVTETGAGQWGSSLAFACSIFGIGC
        QE +++RYI+IPDEV+D+Y LWRPTPLIRAKRLEKLLDTPARIYYKYEGVSPAGSHKPNSAVPQ +YN +EGVKNVVTETGAGQWGSSLAFA S+FGIGC
Subjt:  QEASTDRYINIPDEVVDIYGLWRPTPLIRAKRLEKLLDTPARIYYKYEGVSPAGSHKPNSAVPQVWYNVQEGVKNVVTETGAGQWGSSLAFACSIFGIGC

Query:  EVWQVRASYDQKPYRRMMMQTWGAKVHPSPSDLTDAGRKFLQIDPSSPGSLGIAISEAVEVAALNADTKYCLGSVLNHVLLHQTVIGEECLKQMEAIGES
        EVWQVRASYDQKPYRRMMMQTWGAKVHPSPS+LT+AGR  LQ+DPSSPGSLGIAISEAVEVAA+N DTKYCLGSVLNHVLLHQTVIGEECLKQMEAIGE+
Subjt:  EVWQVRASYDQKPYRRMMMQTWGAKVHPSPSDLTDAGRKFLQIDPSSPGSLGIAISEAVEVAALNADTKYCLGSVLNHVLLHQTVIGEECLKQMEAIGES

Query:  PDVIIGCTGGGSNFAGLSFPFLREKLAGKINPVIRAVEPAACPSLTKGVYAYDYGDTAGMTPLMKMHTLGHDFIPDPIHAGGLRYHGMAPLISHVYNLGL
        PD++IGCTGGGSNFAGLSFPF+REKL GK+NPVIRAVEP+ACPSLTKGVYAYDYGDTAG+TPLMKMHTLGHDF+PDPIHAGGLRYHGMAPLISHVY LG+
Subjt:  PDVIIGCTGGGSNFAGLSFPFLREKLAGKINPVIRAVEPAACPSLTKGVYAYDYGDTAGMTPLMKMHTLGHDFIPDPIHAGGLRYHGMAPLISHVYNLGL

Query:  LEAISLPQTECFQGAIQFARSEGLIPAPEPTHAIAATIREALCCRETGESKVILTAMCGHGHFDLPAYE-----------------------------NK
        +EAIS+PQ ECFQGAIQFAR+EGLIPAPEPTHAIAATIREAL CRETGE+KVILTAMCGHGHFDLPAY+                             +K
Subjt:  LEAISLPQTECFQGAIQFARSEGLIPAPEPTHAIAATIREALCCRETGESKVILTAMCGHGHFDLPAYE-----------------------------NK

Query:  GRIPPPHARRTLPGSGVLHPEAFGHGLRPPPGAFPPFDMLPPPEVMEQKLAGQHVEIQKLATENQRLAATHGTLRQELAAAQHELQILHAQIGAVKSERE
        GR+PP H RR LPG G+ HPE FG G+ PPPG FP FDM+PPPEVME KLA QH+E+Q+L TENQRLA+THGTLRQELA AQHELQ+LH QIGAVK+ERE
Subjt:  GRIPPPHARRTLPGSGVLHPEAFGHGLRPPPGAFPPFDMLPPPEVMEQKLAGQHVEIQKLATENQRLAATHGTLRQELAAAQHELQILHAQIGAVKSERE

Query:  QQARNLSDKIAKMEAELQAAEPIKLELQQAKSDAQNLIVARQELITRVQHLTQDLQRAHGDVQQVPVLMSELESLRQEYQHCRATYDYEKKLYNDHLESL
        Q+ R + +KI+KMEAEL+AAEP+K ELQ+A+++AQ+L+ ARQELI++ Q L+QDLQRAH DVQQ+P LM+EL+SLRQEYQHCRATYD EKKLYNDHLESL
Subjt:  QQARNLSDKIAKMEAELQAAEPIKLELQQAKSDAQNLIVARQELITRVQHLTQDLQRAHGDVQQVPVLMSELESLRQEYQHCRATYDYEKKLYNDHLESL

Query:  QVMEKNYITMARELEKLRAEL-TNTASLERRHGGPYGTTQ-NNEIEASGNSAGQNTYEDGYGVAQGRGPLPA---------TAGAASSAGATAYTGPQTG
        Q+ME NY+TMARE+EKL+ EL  N    +RR G PYG T   N+ EASG+  GQN YEDGYG +QGR P PA          AG ++ AG + Y G Q+ 
Subjt:  QVMEKNYITMARELEKLRAEL-TNTASLERRHGGPYGTTQ-NNEIEASGNSAGQNTYEDGYGVAQGRGPLPA---------TAGAASSAGATAYTGPQTG

Query:  STATRPNFDATRGP----------------------QRGPGYDGGRGSIYDSQRPGYDGQRGPGYNVP-GLPTYDAPR-GAGYDAQARG---VAGHAAPG
          + RP +DA+RGP                      QRG  YDG RGS YD+ RPGYD QRGP YNV  G P YD  +   GYD Q+RG     GH AP 
Subjt:  STATRPNFDATRGP----------------------QRGPGYDGGRGSIYDSQRPGYDGQRGPGYNVP-GLPTYDAPR-GAGYDAQARG---VAGHAAPG

Query:  NTAPYRSSTPPGRGGGYEAPSRGGGNPGRR
        N APY SSTPP         +  GGNP RR
Subjt:  NTAPYRSSTPPGRGGGYEAPSRGGGNPGRR

KAF4353256.1 hypothetical protein G4B88_028447, partial [Cannabis sativa]0.0e+0070.69Show/hide
Query:  VGEQSPGCLSLK-RIPTHLRFSNGYRTKAASNPNLKAVEVPRQWYNLVADLPVKPPPPLHPKTFEPIKPEDLSHLFPDELIKQEASTDRYINIPDEVVDI
        V E+  G   LK R P HLRFS+  RT+A  N N K+VE+PRQWYNL+ADLP+KPPPPL+P+T E +KPEDLS LFPDELIKQE +++RYI+IPDE++D+
Subjt:  VGEQSPGCLSLK-RIPTHLRFSNGYRTKAASNPNLKAVEVPRQWYNLVADLPVKPPPPLHPKTFEPIKPEDLSHLFPDELIKQEASTDRYINIPDEVVDI

Query:  YGLWRPTPLIRAKRLEKLLDTPARIYYKYEGVSPAGSHKPNSAVPQVWYNVQEGVKNVVTETGAGQWGSSLAFACSIFGIGCEVWQVRASYDQKPYRRMM
        Y LWRPTPLIRAKRLEKLLDTPARIYYKYEGVSPAGSHKPNSAVPQ +YN +EGVKNVVTETGAGQWGSSLAFA S+FGIGCEVWQVRASYDQKPYRRMM
Subjt:  YGLWRPTPLIRAKRLEKLLDTPARIYYKYEGVSPAGSHKPNSAVPQVWYNVQEGVKNVVTETGAGQWGSSLAFACSIFGIGCEVWQVRASYDQKPYRRMM

Query:  MQTWGAKVHPSPSDLTDAGRKFLQIDPSSPGSLGIAISEAVEVAALNADTKYCLGSVLNHVLLHQTVIGEECLKQMEAIGESPDVIIGCTGGGSNFAGLS
        MQTWGAKVHPSPS+LT+AGR  LQ+DPSSPGSLGIAISEAVEVAA+N DTKYCLGSVLNHVLLHQTVIGEECLKQMEAIGE+PD++IGCTGGGSNFAGLS
Subjt:  MQTWGAKVHPSPSDLTDAGRKFLQIDPSSPGSLGIAISEAVEVAALNADTKYCLGSVLNHVLLHQTVIGEECLKQMEAIGESPDVIIGCTGGGSNFAGLS

Query:  FPFLREKLAGKINPVIRAVEPAACPSLTKGVYAYDYGDTAGMTPLMKMHTLGHDFIPDPIHAGGLRYHGMAPLISHVYNLGLLEAISLPQTECFQGAIQF
        FPF+REKL GK+NPVIRAVEP+ACPSLTKGVYAYDYGDTAG+TPLMKMHTLGHDF+PDPIHAGGLRYHGMAPLISHVY LG++EAIS+PQ ECFQGAIQF
Subjt:  FPFLREKLAGKINPVIRAVEPAACPSLTKGVYAYDYGDTAGMTPLMKMHTLGHDFIPDPIHAGGLRYHGMAPLISHVYNLGLLEAISLPQTECFQGAIQF

Query:  ARSEGLIPAPEPTHAIAATIREALCCRETGESKVILTAMCGHGHFDLPAYEN------------------------KGRIPPPHARRTLPGSGVLHPEAF
        AR+EGLIPAPEPTHAIAATIREAL CRETGE+KVILTAMCGHGHFDLPAY+                         KGR+PP H RR LPG G+ HPE F
Subjt:  ARSEGLIPAPEPTHAIAATIREALCCRETGESKVILTAMCGHGHFDLPAYEN------------------------KGRIPPPHARRTLPGSGVLHPEAF

Query:  GHGLRPPPGAFPPFDMLPPPEVMEQKLAGQHVEIQKLATENQRLAATHGTLRQELAAAQHELQILHAQIGAVKSEREQQARNLSDKIAKMEAELQAAEPI
        G G+ PPPG FP FDM+PPPEVME KLA QH+E+Q+L TENQRLA+THGTLRQELA AQHELQ+LH QIGAVK+EREQ+ RN+ +KI+KMEAEL+AAEP+
Subjt:  GHGLRPPPGAFPPFDMLPPPEVMEQKLAGQHVEIQKLATENQRLAATHGTLRQELAAAQHELQILHAQIGAVKSEREQQARNLSDKIAKMEAELQAAEPI

Query:  KLELQQAKSDAQNLIVARQELITRVQHLTQDLQRAHGDVQQVPVLMSELESLRQEYQHCRATYDYEKKLYNDHLESLQVMEKNYITMARELEKLRAEL-T
        K ELQ+A+++AQ+L+ ARQELI++ Q L+QDLQRAH DVQQ+P LM+EL+SLRQEYQHCRATYD EKKLYNDHLESLQ+ME NY+TMARE+EKL+ EL  
Subjt:  KLELQQAKSDAQNLIVARQELITRVQHLTQDLQRAHGDVQQVPVLMSELESLRQEYQHCRATYDYEKKLYNDHLESLQVMEKNYITMARELEKLRAEL-T

Query:  NTASLERRHGGPYGTTQ-NNEIEASGNSAGQNTYEDGYGVAQGRGPLPA---------TAGAASSAGATAYTGPQTGSTATRPNFDATRGP---------
        N    +RR G PYG T   N+ EASG+  GQN YEDGYG +QGR P PA          AG ++ AG + Y G Q+   + RP +DA+RGP         
Subjt:  NTASLERRHGGPYGTTQ-NNEIEASGNSAGQNTYEDGYGVAQGRGPLPA---------TAGAASSAGATAYTGPQTGSTATRPNFDATRGP---------

Query:  -------------QRGPGYDGGRGSIYDSQRPGYDGQRGPGYNVP-GLPTYDAPR-GAGYDAQARG---VAGHAAPGNTAPYRSSTPPGRGGGYEAPSRG
                     QRG  YDG RGS YD+ RPGYD QRGP YNV  G P YD  +   GYD Q+RG     GH AP N APY SSTPP         +  
Subjt:  -------------QRGPGYDGGRGSIYDSQRPGYDGQRGPGYNVP-GLPTYDAPR-GAGYDAQARG---VAGHAAPGNTAPYRSSTPPGRGGGYEAPSRG

Query:  GGNP
        GGNP
Subjt:  GGNP

KAG6578517.1 hypothetical protein SDJN03_22965, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0088.45Show/hide
Query:  SVGEQSPGCLSLKRIPTHLRFSNGYRTKAASNPNLKAVEVPRQWYNLVADLPVKPPPPLHPKTFEPIKPEDLSHLFPDELIKQEASTDRYINIPDEVVDI
        S G    GCLSL+RIP HLRFSNGYRTKAASNPNLKAVEVPRQWYNLVADLPVKPPPPLHPKTFEPIKPEDLS LFPDELIKQEAS +RYI+IPDEVVDI
Subjt:  SVGEQSPGCLSLKRIPTHLRFSNGYRTKAASNPNLKAVEVPRQWYNLVADLPVKPPPPLHPKTFEPIKPEDLSHLFPDELIKQEASTDRYINIPDEVVDI

Query:  YGLWRPTPLIRAKRLEKLLDTPARIYYKYEGVSPAGSHKPNSAVPQVWYNVQEGVKNVVTETGAGQWGSSLAFACSIFGIGCEVWQVRASYDQKPYRRMM
        YGLWRPTPLIRAKRLEKLL TPARIYYKYEGVSPAGSHKPNSAVPQVWYNVQEGVKNVVTETGAGQWG SLAFACS+FGIGCEVWQVRASYDQKPYRRMM
Subjt:  YGLWRPTPLIRAKRLEKLLDTPARIYYKYEGVSPAGSHKPNSAVPQVWYNVQEGVKNVVTETGAGQWGSSLAFACSIFGIGCEVWQVRASYDQKPYRRMM

Query:  MQTWGAKVHPSPSDLTDAGRKFLQIDPSSPGSLGIAISEAVEVAALNADTKYCLGSVLNHVLLHQTVIGEECLKQMEAIGESPDVIIGCTGGGSNFAGLS
        M+TWGAKVHPSPS++TD+GRKFLQ+DPSSPGSLGIAISEAVEVAALNADTKYCLGSVLNHVLLHQTVIGEECLKQMEA+GE+PDVIIGCTGGGSNFAGLS
Subjt:  MQTWGAKVHPSPSDLTDAGRKFLQIDPSSPGSLGIAISEAVEVAALNADTKYCLGSVLNHVLLHQTVIGEECLKQMEAIGESPDVIIGCTGGGSNFAGLS

Query:  FPFLREKLAGKINPVIRAVEPAACPSLTKGVYAYDYGDTAGMTPLMKMHTLGHDFIPDPIHAGGLRYHGMAPLISHVYNLGLLEAISLPQTECFQGAIQF
        FPFLREKLAGKI+PVIRAVEPAACPSLTKGVYAYDYGDTAGMTPLMKMHTLGH+FIPDPIHAGGLRYHGMAPLISHVYNLGLLEA+SLPQTECFQGAIQF
Subjt:  FPFLREKLAGKINPVIRAVEPAACPSLTKGVYAYDYGDTAGMTPLMKMHTLGHDFIPDPIHAGGLRYHGMAPLISHVYNLGLLEAISLPQTECFQGAIQF

Query:  ARSEGLIPAPEPTHAIAATIREALCCRETGESKVILTAMCGHGHFDLPAYEN----------------------------------------------KG
        ARSEGLIPAPEPTHAIA TIREAL CRETGESKVILTAMCGHGHFDL AYEN                                              KG
Subjt:  ARSEGLIPAPEPTHAIAATIREALCCRETGESKVILTAMCGHGHFDLPAYEN----------------------------------------------KG

Query:  RIPPPHARRTLPGSGVLHPEAFGHGLRPPPGAFPPFDMLPPPEVMEQKLAGQHVEIQKLATENQRLAATHGTLRQELAAAQHELQILHAQIGAVKSEREQ
        RIPPPHARR LPGSGVLHPEAFGHGLRPP G FPPFDM+PPPEVMEQKLAGQHVE+QKL TENQRLAATHGTLRQELAAAQHELQILHAQIGAVKSEREQ
Subjt:  RIPPPHARRTLPGSGVLHPEAFGHGLRPPPGAFPPFDMLPPPEVMEQKLAGQHVEIQKLATENQRLAATHGTLRQELAAAQHELQILHAQIGAVKSEREQ

Query:  QARNLSDKIAKMEAELQAAEPIKLELQQAKSDAQNLIVARQELITRVQHLTQDLQRAHGDVQQVPVLMSELESLRQEYQHCRATYDYEKKLYNDHLESLQ
        QARNLSDKIAKME+ELQAAEPIKLELQQAKSDAQNLIVARQELITRVQHLTQDLQRAHGDVQQVPVLMSELESLRQEYQHCRATYDYEKKLYNDHLESLQ
Subjt:  QARNLSDKIAKMEAELQAAEPIKLELQQAKSDAQNLIVARQELITRVQHLTQDLQRAHGDVQQVPVLMSELESLRQEYQHCRATYDYEKKLYNDHLESLQ

Query:  VMEKNYITMARELEKLRAELTNTASLERRHGGPYGTTQNNEIEASGNSAGQNTYEDGYGVAQGRGPLPATAGAASSAGATAYTGPQTGSTATRPNFDATR
        VMEKNYITMARELEKLRAELTNTASLERRHGGPYGTTQNNEIEASGN AGQNTYEDGYGVAQGRG LPATAG ASSAGATAYTGPQTGSTATRPN+DA R
Subjt:  VMEKNYITMARELEKLRAELTNTASLERRHGGPYGTTQNNEIEASGNSAGQNTYEDGYGVAQGRGPLPATAGAASSAGATAYTGPQTGSTATRPNFDATR

Query:  GPQRGPGYDGGRGSIYDSQRPGYDGQRGPGYNVPGLPTYDAPR-GAGYDAQARGVAGHAAPGNTAPYRSSTPPGRGGGYEAPSRGGGNPGRR
        G QR  GY+G RGSIYDSQRPGYDGQRGPGYNVPGLPTYDAPR GAGYDAQ+RGV GHAAPGNTAPY SSTPP RG        GGGNPGRR
Subjt:  GPQRGPGYDGGRGSIYDSQRPGYDGQRGPGYNVPGLPTYDAPR-GAGYDAQARGVAGHAAPGNTAPYRSSTPPGRGGGYEAPSRGGGNPGRR

KAG7016078.1 trpB2 [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0085.71Show/hide
Query:  SVGEQSPGCLSLKRIPTHLRFSNGYRTKAASNPNLKAVEVPRQWYNLVADLPVKPPPPLHPKTFEPIKPEDLSHLFPDELIKQEASTDRYINIPDEVVDI
        S G    GCLSL+RIP HLRFSNGYRTKAASNPNLKAVEVPRQWYNLVADLPVKPPPPLHPKTFEPIKPEDLS LFPDELIKQEAS +RYI+IPDEVVDI
Subjt:  SVGEQSPGCLSLKRIPTHLRFSNGYRTKAASNPNLKAVEVPRQWYNLVADLPVKPPPPLHPKTFEPIKPEDLSHLFPDELIKQEASTDRYINIPDEVVDI

Query:  YGLWRPTPLIRAKRLEKLLDTPARIYYKYEGVSPAGSHKPNSAVPQVWYNVQEGVKNVVTETGAGQWGSSLAFACSIFGIGCEVWQVRASYDQKPYRRMM
        YGLWRPTPLIRAKRLEKLL TPARIYYKYEGVSPAGSHKPNSAVPQVWYNVQEGVKNVVTETGAGQWG SLAFACS+FGIGCEVWQVRASYDQKPYRRMM
Subjt:  YGLWRPTPLIRAKRLEKLLDTPARIYYKYEGVSPAGSHKPNSAVPQVWYNVQEGVKNVVTETGAGQWGSSLAFACSIFGIGCEVWQVRASYDQKPYRRMM

Query:  MQTWGAKVHPSPSDLTDAGRKFLQIDPSSPGSLGIAISEAVEVAALNADTKYCLGSVLNHVLLHQTVIGEECLKQMEAIGESPDVIIGCTGGGSNFAGLS
        M+TWGAKVHPSPS++TD+GRKFLQ+DPSSPGSLGIAISEAVEVAALNADTKYCLGSVLNHVLLHQTVIGEECLKQMEA+GE+PDVIIGCTGGGSNFAGLS
Subjt:  MQTWGAKVHPSPSDLTDAGRKFLQIDPSSPGSLGIAISEAVEVAALNADTKYCLGSVLNHVLLHQTVIGEECLKQMEAIGESPDVIIGCTGGGSNFAGLS

Query:  FPFLREKLAGKINPVIRAVEPAACPSLTKGVYAYDYGDTAGMTPLMKMHTLGHDFIPDPIHAGGLRYHGMAPLISHVYNLGLLEAISLPQTECFQGAIQF
        FPFLREKLAGKI+PVIRAVEPAACPSLTKGVYAYDYGDTAGMTPLMKMHTLGH+FIPDPIHAGGLRYHGMAPLISHVYNLGLLEA+SLPQTECFQGAIQF
Subjt:  FPFLREKLAGKINPVIRAVEPAACPSLTKGVYAYDYGDTAGMTPLMKMHTLGHDFIPDPIHAGGLRYHGMAPLISHVYNLGLLEAISLPQTECFQGAIQF

Query:  ARSEGLIPAPEPTHAIAATIREALCCRETGESKVILTAMCGHGHFDLPAYE-------------------------------------------------
        ARSEGLIPAPEPTHAIA TIREAL CRETGESKVILTAMCGHGHFDL AYE                                                 
Subjt:  ARSEGLIPAPEPTHAIAATIREALCCRETGESKVILTAMCGHGHFDLPAYE-------------------------------------------------

Query:  ----------------------NKGRIPPPHARRTLPGSGVLHPEAFGHGLRPPPGAFPPFDMLPPPEVMEQKLAGQHVEIQKLATENQRLAATHGTLRQ
                              +KGRIPPPHARR LPGSGVLHPEAFGHGLRPP G FPPFDM+PPPEVMEQKLAGQHVE+QKL TENQRLAATHGTLRQ
Subjt:  ----------------------NKGRIPPPHARRTLPGSGVLHPEAFGHGLRPPPGAFPPFDMLPPPEVMEQKLAGQHVEIQKLATENQRLAATHGTLRQ

Query:  ELAAAQHELQILHAQIGAVKSEREQQARNLSDKIAKMEAELQAAEPIKLELQQAKSDAQNLIVARQELITRVQHLTQDLQRAHGDVQQVPVLMSELESLR
        ELAAAQHELQILHAQIGAVKSEREQQARNLSDKIAKME+ELQAAEPIKLELQQAKSDAQNLIVARQELITRVQHLTQDLQRAHGDVQQVPVLMSELESLR
Subjt:  ELAAAQHELQILHAQIGAVKSEREQQARNLSDKIAKMEAELQAAEPIKLELQQAKSDAQNLIVARQELITRVQHLTQDLQRAHGDVQQVPVLMSELESLR

Query:  QEYQHCRATYDYEKKLYNDHLESLQVMEKNYITMARELEKLRAELTNTASLERRHGGPYGTTQNNEIEASGNSAGQNTYEDGYGVAQGRGPLPATAGAAS
        QEYQHCR TYDYEKKLYNDHLESLQVMEKNYITMARELEKLRAELTNTASLERRHGGPYGTTQNNEIEASGN AGQNTYEDGYGVAQGRG LPATAG AS
Subjt:  QEYQHCRATYDYEKKLYNDHLESLQVMEKNYITMARELEKLRAELTNTASLERRHGGPYGTTQNNEIEASGNSAGQNTYEDGYGVAQGRGPLPATAGAAS

Query:  SAGATAYTGPQTGSTATRPNFDATRGPQRGPGYDGGRGSIYDSQRPGYDGQRGPGYNVPGLPTYDAPR-GAGYDAQARGVAGHAAPGNTAPYRSSTPPGR
        SAGATAYTGPQTGSTATRPN+DA RG QR  GY+G RGSIYDSQRPGYDGQRGPGYNVPGLPTYDAPR GAGYDAQ+RGV GHA PGNTAPY SSTPP R
Subjt:  SAGATAYTGPQTGSTATRPNFDATRGPQRGPGYDGGRGSIYDSQRPGYDGQRGPGYNVPGLPTYDAPR-GAGYDAQARGVAGHAAPGNTAPYRSSTPPGR

Query:  GGGYEAPSRGGGNPGRR
        G        GGGNPGRR
Subjt:  GGGYEAPSRGGGNPGRR

RXH68248.1 hypothetical protein DVH24_028395 [Malus domestica]0.0e+0070.75Show/hide
Query:  GCLSLKRIPTHLRFSNGYRTKAASNPNLKAVEVPRQWYNLVADLPVKPPPPLHPKTFEPIKPEDLSHLFPDELIKQEASTDRYINIPDEVVDIYGLWRPT
        G ++L R P HLR SNG R +A+ N + K++E+PRQWYNL+ADLPVKPPPPLHPKTFEPIKPEDLS LFPDELIKQEAS +R+I+IPDEV D+Y LWRP+
Subjt:  GCLSLKRIPTHLRFSNGYRTKAASNPNLKAVEVPRQWYNLVADLPVKPPPPLHPKTFEPIKPEDLSHLFPDELIKQEASTDRYINIPDEVVDIYGLWRPT

Query:  PLIRAKRLEKLLDTPARIYYKYEGVSPAGSHKPNSAVPQVWYNVQEGVKNVVTETGAGQWGSSLAFACSIFGIGCEVWQVRASYDQKPYRRMMMQTWGAK
        PLIRAKRLEKLLDTPARIYYKYEGVSPAGSHKPNSAVPQVWYN QEG+K VVTETGAGQWGS+LAFAC++F + CEVWQVRAS+DQKPYRR+MMQTWGAK
Subjt:  PLIRAKRLEKLLDTPARIYYKYEGVSPAGSHKPNSAVPQVWYNVQEGVKNVVTETGAGQWGSSLAFACSIFGIGCEVWQVRASYDQKPYRRMMMQTWGAK

Query:  VHPSPSDLTDAGRKFLQIDPSSPGSLGIAISEAVEVAALNADTKYCLGSVLNHVLLHQTVIGEECLKQMEAIGESPDVIIGCTGGGSNFAGLSFPFLREK
        V+PSPS LT+AGRK LQ+DPSSPGSLGIAISEAVE+AA+N DTKYCLGSVLNHVLLHQTVIGEECLKQMEAIGE PDVIIGCTGGGSNFAGLSFPF+REK
Subjt:  VHPSPSDLTDAGRKFLQIDPSSPGSLGIAISEAVEVAALNADTKYCLGSVLNHVLLHQTVIGEECLKQMEAIGESPDVIIGCTGGGSNFAGLSFPFLREK

Query:  LAGKINPVIRAVEPAACPSLTKGVYAYDYGDTAGMTPLMKMHTLGHDFIPDPIHAGGLRYHGMAPLISHVYNLGLLEAISLPQTECFQGAIQFARSEGLI
        L GK+NPVIRAVEPAACPSLTKGVYAYDYGDTAG+TPLMKMHTLGHDFIPDPIHAGGLRYHGM+PLISHVY LG +EAIS+PQ ECFQGAI+FAR+EGLI
Subjt:  LAGKINPVIRAVEPAACPSLTKGVYAYDYGDTAGMTPLMKMHTLGHDFIPDPIHAGGLRYHGMAPLISHVYNLGLLEAISLPQTECFQGAIQFARSEGLI

Query:  PAPEPTHAIAATIREALCCRETGESKVILTAMCGHGHFDLPAYEN-------------------------KGRIPPPHARRTLPGSGVLHPEAFGHGLRP
        PAPEPTHAIAATIREA  CRETGE+KVILTAMCGHGHFDLPAYEN                          GRIPP H RR L G     P+ FG G+RP
Subjt:  PAPEPTHAIAATIREALCCRETGESKVILTAMCGHGHFDLPAYEN-------------------------KGRIPPPHARRTLPGSGVLHPEAFGHGLRP

Query:  PPGAFPPFDMLPPPEVMEQKLAGQHVEIQKLATENQRLAATHGTLRQELAAAQHELQILHAQIGAVKSEREQQARNLSDKIAKMEAELQAAEPIKLELQQ
        P GA+PPFDMLPPP+VMEQKLA QHVE+Q+L  ENQRLAATHG+LRQELA AQHELQILHAQIGA+KSEREQQ  +L D IAKMEA+L++AEP+K EL +
Subjt:  PPGAFPPFDMLPPPEVMEQKLAGQHVEIQKLATENQRLAATHGTLRQELAAAQHELQILHAQIGAVKSEREQQARNLSDKIAKMEAELQAAEPIKLELQQ

Query:  AKSDAQNLIVARQELITRVQHLTQDLQRAHGDVQQVPVLMSELESLRQEYQHCRATYDYEKKLYNDHLESLQVMEKNYITMARELEKLRAELTNTASLER
        A+++AQ+L+V+RQELIT VQ L QDLQR H DVQQ+P L+++L+ LRQEYQ CR TYDYEK+LY+ HLESLQVMEKNY+TMARE+EKLRAEL N ++++R
Subjt:  AKSDAQNLIVARQELITRVQHLTQDLQRAHGDVQQVPVLMSELESLRQEYQHCRATYDYEKKLYNDHLESLQVMEKNYITMARELEKLRAELTNTASLER

Query:  RHGGP-YGTTQNNEIEASGNSAGQNTYEDGYGVAQGRGPLP--------ATAGAASSAGATAYTGPQTGSTATRPNFDATRGP-----------------
        R G P YGT  NNE EA+G + GQN +ED YGV QGR P P        A  G A++ G     G Q+G    R  +DA RGP                 
Subjt:  RHGGP-YGTTQNNEIEASGNSAGQNTYEDGYGVAQGRGPLP--------ATAGAASSAGATAYTGPQTGSTATRPNFDATRGP-----------------

Query:  --QRGPGYDGGRGSIYDSQRPGYDGQRGPGYNVPGLPTYDAPRGAGYDAQAR----GVAGHAAPGNTAPYRSSTPPGRGG-GYEAP---SRGGGNPGRR
          QRGP YD  RG  YD+QRPGYD QR PGY V  +P YD  RG  YDAQ+R    G  GH   GN     ++TP  RGG GYEAP     GGGNP RR
Subjt:  --QRGPGYDGGRGSIYDSQRPGYDGQRGPGYNVPGLPTYDAPRGAGYDAQAR----GVAGHAAPGNTAPYRSSTPPGRGG-GYEAP---SRGGGNPGRR

TrEMBL top hitse value%identityAlignment
A0A1S3CA62 Tryptophan synthase1.0e-26195.58Show/hide
Query:  NSVGEQSPGCLSLKRIPTHLRFSNGYRTKAASNPNLKAVEVPRQWYNLVADLPVKPPPPLHPKTFEPIKPEDLSHLFPDELIKQEASTDRYINIPDEVVD
        N+VGEQ PGCLSLKR+PTHLRFSNGYRTK+ASNPNLKAVE+PRQWYNLVADLPVKPPPPLHPKT+EPIKPEDLSHLFPDELIKQEAST+RYINIPDEVVD
Subjt:  NSVGEQSPGCLSLKRIPTHLRFSNGYRTKAASNPNLKAVEVPRQWYNLVADLPVKPPPPLHPKTFEPIKPEDLSHLFPDELIKQEASTDRYINIPDEVVD

Query:  IYGLWRPTPLIRAKRLEKLLDTPARIYYKYEGVSPAGSHKPNSAVPQVWYNVQEGVKNVVTETGAGQWGSSLAFACSIFGIGCEVWQVRASYDQKPYRRM
        IYGLWRPTPLIRAKRLEKLLDTPARIYYKYEGVSPAGSHKPNSAVPQVWYN QEGVKNVVTETGAGQWG SLAFACSIFGIGCEVWQVRASYDQKPYRRM
Subjt:  IYGLWRPTPLIRAKRLEKLLDTPARIYYKYEGVSPAGSHKPNSAVPQVWYNVQEGVKNVVTETGAGQWGSSLAFACSIFGIGCEVWQVRASYDQKPYRRM

Query:  MMQTWGAKVHPSPSDLTDAGRKFLQIDPSSPGSLGIAISEAVEVAALNADTKYCLGSVLNHVLLHQTVIGEECLKQMEAIGESPDVIIGCTGGGSNFAGL
        MM+TWGAKVHPSPSD+TDAGRKFLQ+DPSSPGSLGIAISEAVEVAALNADTKYCLGSVLNHVLLHQTVIGEECLKQ+EA+GESPD+IIGCTGGGSNFAGL
Subjt:  MMQTWGAKVHPSPSDLTDAGRKFLQIDPSSPGSLGIAISEAVEVAALNADTKYCLGSVLNHVLLHQTVIGEECLKQMEAIGESPDVIIGCTGGGSNFAGL

Query:  SFPFLREKLAGKINPVIRAVEPAACPSLTKGVYAYDYGDTAGMTPLMKMHTLGHDFIPDPIHAGGLRYHGMAPLISHVYNLGLLEAISLPQTECFQGAIQ
        SFPFLREKLAG INPVIRAVEPAACPSLTKGVYAYDYGDTAGMTPLMKMHTLGH+FIPDPIHAGGLRYHGMAPLISHVYNLGLLEA+SLPQTECFQGAI+
Subjt:  SFPFLREKLAGKINPVIRAVEPAACPSLTKGVYAYDYGDTAGMTPLMKMHTLGHDFIPDPIHAGGLRYHGMAPLISHVYNLGLLEAISLPQTECFQGAIQ

Query:  FARSEGLIPAPEPTHAIAATIREALCCRETGESKVILTAMCGHGHFDLPAYE
        FARSEGLIPAPEPTHAIAATIREAL CRETGESKVILTAMCGHGHFDLPAYE
Subjt:  FARSEGLIPAPEPTHAIAATIREALCCRETGESKVILTAMCGHGHFDLPAYE

A0A498H9W8 Tryptophan synthase0.0e+0070.75Show/hide
Query:  GCLSLKRIPTHLRFSNGYRTKAASNPNLKAVEVPRQWYNLVADLPVKPPPPLHPKTFEPIKPEDLSHLFPDELIKQEASTDRYINIPDEVVDIYGLWRPT
        G ++L R P HLR SNG R +A+ N + K++E+PRQWYNL+ADLPVKPPPPLHPKTFEPIKPEDLS LFPDELIKQEAS +R+I+IPDEV D+Y LWRP+
Subjt:  GCLSLKRIPTHLRFSNGYRTKAASNPNLKAVEVPRQWYNLVADLPVKPPPPLHPKTFEPIKPEDLSHLFPDELIKQEASTDRYINIPDEVVDIYGLWRPT

Query:  PLIRAKRLEKLLDTPARIYYKYEGVSPAGSHKPNSAVPQVWYNVQEGVKNVVTETGAGQWGSSLAFACSIFGIGCEVWQVRASYDQKPYRRMMMQTWGAK
        PLIRAKRLEKLLDTPARIYYKYEGVSPAGSHKPNSAVPQVWYN QEG+K VVTETGAGQWGS+LAFAC++F + CEVWQVRAS+DQKPYRR+MMQTWGAK
Subjt:  PLIRAKRLEKLLDTPARIYYKYEGVSPAGSHKPNSAVPQVWYNVQEGVKNVVTETGAGQWGSSLAFACSIFGIGCEVWQVRASYDQKPYRRMMMQTWGAK

Query:  VHPSPSDLTDAGRKFLQIDPSSPGSLGIAISEAVEVAALNADTKYCLGSVLNHVLLHQTVIGEECLKQMEAIGESPDVIIGCTGGGSNFAGLSFPFLREK
        V+PSPS LT+AGRK LQ+DPSSPGSLGIAISEAVE+AA+N DTKYCLGSVLNHVLLHQTVIGEECLKQMEAIGE PDVIIGCTGGGSNFAGLSFPF+REK
Subjt:  VHPSPSDLTDAGRKFLQIDPSSPGSLGIAISEAVEVAALNADTKYCLGSVLNHVLLHQTVIGEECLKQMEAIGESPDVIIGCTGGGSNFAGLSFPFLREK

Query:  LAGKINPVIRAVEPAACPSLTKGVYAYDYGDTAGMTPLMKMHTLGHDFIPDPIHAGGLRYHGMAPLISHVYNLGLLEAISLPQTECFQGAIQFARSEGLI
        L GK+NPVIRAVEPAACPSLTKGVYAYDYGDTAG+TPLMKMHTLGHDFIPDPIHAGGLRYHGM+PLISHVY LG +EAIS+PQ ECFQGAI+FAR+EGLI
Subjt:  LAGKINPVIRAVEPAACPSLTKGVYAYDYGDTAGMTPLMKMHTLGHDFIPDPIHAGGLRYHGMAPLISHVYNLGLLEAISLPQTECFQGAIQFARSEGLI

Query:  PAPEPTHAIAATIREALCCRETGESKVILTAMCGHGHFDLPAYEN-------------------------KGRIPPPHARRTLPGSGVLHPEAFGHGLRP
        PAPEPTHAIAATIREA  CRETGE+KVILTAMCGHGHFDLPAYEN                          GRIPP H RR L G     P+ FG G+RP
Subjt:  PAPEPTHAIAATIREALCCRETGESKVILTAMCGHGHFDLPAYEN-------------------------KGRIPPPHARRTLPGSGVLHPEAFGHGLRP

Query:  PPGAFPPFDMLPPPEVMEQKLAGQHVEIQKLATENQRLAATHGTLRQELAAAQHELQILHAQIGAVKSEREQQARNLSDKIAKMEAELQAAEPIKLELQQ
        P GA+PPFDMLPPP+VMEQKLA QHVE+Q+L  ENQRLAATHG+LRQELA AQHELQILHAQIGA+KSEREQQ  +L D IAKMEA+L++AEP+K EL +
Subjt:  PPGAFPPFDMLPPPEVMEQKLAGQHVEIQKLATENQRLAATHGTLRQELAAAQHELQILHAQIGAVKSEREQQARNLSDKIAKMEAELQAAEPIKLELQQ

Query:  AKSDAQNLIVARQELITRVQHLTQDLQRAHGDVQQVPVLMSELESLRQEYQHCRATYDYEKKLYNDHLESLQVMEKNYITMARELEKLRAELTNTASLER
        A+++AQ+L+V+RQELIT VQ L QDLQR H DVQQ+P L+++L+ LRQEYQ CR TYDYEK+LY+ HLESLQVMEKNY+TMARE+EKLRAEL N ++++R
Subjt:  AKSDAQNLIVARQELITRVQHLTQDLQRAHGDVQQVPVLMSELESLRQEYQHCRATYDYEKKLYNDHLESLQVMEKNYITMARELEKLRAELTNTASLER

Query:  RHGGP-YGTTQNNEIEASGNSAGQNTYEDGYGVAQGRGPLP--------ATAGAASSAGATAYTGPQTGSTATRPNFDATRGP-----------------
        R G P YGT  NNE EA+G + GQN +ED YGV QGR P P        A  G A++ G     G Q+G    R  +DA RGP                 
Subjt:  RHGGP-YGTTQNNEIEASGNSAGQNTYEDGYGVAQGRGPLP--------ATAGAASSAGATAYTGPQTGSTATRPNFDATRGP-----------------

Query:  --QRGPGYDGGRGSIYDSQRPGYDGQRGPGYNVPGLPTYDAPRGAGYDAQAR----GVAGHAAPGNTAPYRSSTPPGRGG-GYEAP---SRGGGNPGRR
          QRGP YD  RG  YD+QRPGYD QR PGY V  +P YD  RG  YDAQ+R    G  GH   GN     ++TP  RGG GYEAP     GGGNP RR
Subjt:  --QRGPGYDGGRGSIYDSQRPGYDGQRGPGYNVPGLPTYDAPRGAGYDAQAR----GVAGHAAPGNTAPYRSSTPPGRGG-GYEAP---SRGGGNPGRR

A0A7J6E1X3 Tryptophan synthase0.0e+0069.57Show/hide
Query:  SAPPSFPSSLINLNLFNSVGEQSPGCLSL-KRIPTHLRFSNGYRTKAASNPNLKAVEVPRQWYNLVADLPVKPPPPLHPKTFEPIKPEDLSHLFPDELIK
        SA   F SS   L+    V E+  G   L +R P HLRFS+  RT+A  N N K+VE+PRQWYNL+ADLP+KPPPPL+P+T EP+KPEDLS LFPDELIK
Subjt:  SAPPSFPSSLINLNLFNSVGEQSPGCLSL-KRIPTHLRFSNGYRTKAASNPNLKAVEVPRQWYNLVADLPVKPPPPLHPKTFEPIKPEDLSHLFPDELIK

Query:  QEASTDRYINIPDEVVDIYGLWRPTPLIRAKRLEKLLDTPARIYYKYEGVSPAGSHKPNSAVPQVWYNVQEGVKNVVTETGAGQWGSSLAFACSIFGIGC
        QE +++RYI+IPDEV+D+Y LWRPTPLIRAKRLEKLLDTPARIYYKYEGVSPAGSHKPNSAVPQ +YN +EGVKNVVTETGAGQWGSSLAFA S+FGIGC
Subjt:  QEASTDRYINIPDEVVDIYGLWRPTPLIRAKRLEKLLDTPARIYYKYEGVSPAGSHKPNSAVPQVWYNVQEGVKNVVTETGAGQWGSSLAFACSIFGIGC

Query:  EVWQVRASYDQKPYRRMMMQTWGAKVHPSPSDLTDAGRKFLQIDPSSPGSLGIAISEAVEVAALNADTKYCLGSVLNHVLLHQTVIGEECLKQMEAIGES
        EVWQVRASYDQKPYRRMMMQTWGAKVHPSPS+LT+AGR  LQ+DPSSPGSLGIAISEAVEVAA+N DTKYCLGSVLNHVLLHQTVIGEECLKQMEAIGE+
Subjt:  EVWQVRASYDQKPYRRMMMQTWGAKVHPSPSDLTDAGRKFLQIDPSSPGSLGIAISEAVEVAALNADTKYCLGSVLNHVLLHQTVIGEECLKQMEAIGES

Query:  PDVIIGCTGGGSNFAGLSFPFLREKLAGKINPVIRAVEPAACPSLTKGVYAYDYGDTAGMTPLMKMHTLGHDFIPDPIHAGGLRYHGMAPLISHVYNLGL
        PD++IGCTGGGSNFAGLSFPF+REKL GK+NPVIRAVEP+ACPSLTKGVYAYDYGDTAG+TPLMKMHTLGHDF+PDPIHAGGLRYHGMAPLISHVY LG+
Subjt:  PDVIIGCTGGGSNFAGLSFPFLREKLAGKINPVIRAVEPAACPSLTKGVYAYDYGDTAGMTPLMKMHTLGHDFIPDPIHAGGLRYHGMAPLISHVYNLGL

Query:  LEAISLPQTECFQGAIQFARSEGLIPAPEPTHAIAATIREALCCRETGESKVILTAMCGHGHFDLPAYE-----------------------------NK
        +EAIS+PQ ECFQGAIQFAR+EGLIPAPEPTHAIAATIREAL CRETGE+KVILTAMCGHGHFDLPAY+                             +K
Subjt:  LEAISLPQTECFQGAIQFARSEGLIPAPEPTHAIAATIREALCCRETGESKVILTAMCGHGHFDLPAYE-----------------------------NK

Query:  GRIPPPHARRTLPGSGVLHPEAFGHGLRPPPGAFPPFDMLPPPEVMEQKLAGQHVEIQKLATENQRLAATHGTLRQELAAAQHELQILHAQIGAVKSERE
        GR+PP H RR LPG G+ HPE FG G+ PPPG FP FDM+PPPEVME KLA QH+E+Q+L TENQRLA+THGTLRQELA AQHELQ+LH QIGAVK+ERE
Subjt:  GRIPPPHARRTLPGSGVLHPEAFGHGLRPPPGAFPPFDMLPPPEVMEQKLAGQHVEIQKLATENQRLAATHGTLRQELAAAQHELQILHAQIGAVKSERE

Query:  QQARNLSDKIAKMEAELQAAEPIKLELQQAKSDAQNLIVARQELITRVQHLTQDLQRAHGDVQQVPVLMSELESLRQEYQHCRATYDYEKKLYNDHLESL
        Q+ R + +KI+KMEAEL+AAEP+K ELQ+A+++AQ+L+ ARQELI++ Q L+QDLQRAH DVQQ+P LM+EL+SLRQEYQHCRATYD EKKLYNDHLESL
Subjt:  QQARNLSDKIAKMEAELQAAEPIKLELQQAKSDAQNLIVARQELITRVQHLTQDLQRAHGDVQQVPVLMSELESLRQEYQHCRATYDYEKKLYNDHLESL

Query:  QVMEKNYITMARELEKLRAEL-TNTASLERRHGGPYGTTQ-NNEIEASGNSAGQNTYEDGYGVAQGRGPLPA---------TAGAASSAGATAYTGPQTG
        Q+ME NY+TMARE+EKL+ EL  N    +RR G PYG T   N+ EASG+  GQN YEDGYG +QGR P PA          AG ++ AG + Y G Q+ 
Subjt:  QVMEKNYITMARELEKLRAEL-TNTASLERRHGGPYGTTQ-NNEIEASGNSAGQNTYEDGYGVAQGRGPLPA---------TAGAASSAGATAYTGPQTG

Query:  STATRPNFDATRGP----------------------QRGPGYDGGRGSIYDSQRPGYDGQRGPGYNVP-GLPTYDAPR-GAGYDAQARG---VAGHAAPG
          + RP +DA+RGP                      QRG  YDG RGS YD+ RPGYD QRGP YNV  G P YD  +   GYD Q+RG     GH AP 
Subjt:  STATRPNFDATRGP----------------------QRGPGYDGGRGSIYDSQRPGYDGQRGPGYNVP-GLPTYDAPR-GAGYDAQARG---VAGHAAPG

Query:  NTAPYRSSTPPGRGGGYEAPSRGGGNPGRR
        N APY SSTPP         +  GGNP RR
Subjt:  NTAPYRSSTPPGRGGGYEAPSRGGGNPGRR

A0A7J6E4N3 Tryptophan synthase (Fragment)0.0e+0070.69Show/hide
Query:  VGEQSPGCLSLK-RIPTHLRFSNGYRTKAASNPNLKAVEVPRQWYNLVADLPVKPPPPLHPKTFEPIKPEDLSHLFPDELIKQEASTDRYINIPDEVVDI
        V E+  G   LK R P HLRFS+  RT+A  N N K+VE+PRQWYNL+ADLP+KPPPPL+P+T E +KPEDLS LFPDELIKQE +++RYI+IPDE++D+
Subjt:  VGEQSPGCLSLK-RIPTHLRFSNGYRTKAASNPNLKAVEVPRQWYNLVADLPVKPPPPLHPKTFEPIKPEDLSHLFPDELIKQEASTDRYINIPDEVVDI

Query:  YGLWRPTPLIRAKRLEKLLDTPARIYYKYEGVSPAGSHKPNSAVPQVWYNVQEGVKNVVTETGAGQWGSSLAFACSIFGIGCEVWQVRASYDQKPYRRMM
        Y LWRPTPLIRAKRLEKLLDTPARIYYKYEGVSPAGSHKPNSAVPQ +YN +EGVKNVVTETGAGQWGSSLAFA S+FGIGCEVWQVRASYDQKPYRRMM
Subjt:  YGLWRPTPLIRAKRLEKLLDTPARIYYKYEGVSPAGSHKPNSAVPQVWYNVQEGVKNVVTETGAGQWGSSLAFACSIFGIGCEVWQVRASYDQKPYRRMM

Query:  MQTWGAKVHPSPSDLTDAGRKFLQIDPSSPGSLGIAISEAVEVAALNADTKYCLGSVLNHVLLHQTVIGEECLKQMEAIGESPDVIIGCTGGGSNFAGLS
        MQTWGAKVHPSPS+LT+AGR  LQ+DPSSPGSLGIAISEAVEVAA+N DTKYCLGSVLNHVLLHQTVIGEECLKQMEAIGE+PD++IGCTGGGSNFAGLS
Subjt:  MQTWGAKVHPSPSDLTDAGRKFLQIDPSSPGSLGIAISEAVEVAALNADTKYCLGSVLNHVLLHQTVIGEECLKQMEAIGESPDVIIGCTGGGSNFAGLS

Query:  FPFLREKLAGKINPVIRAVEPAACPSLTKGVYAYDYGDTAGMTPLMKMHTLGHDFIPDPIHAGGLRYHGMAPLISHVYNLGLLEAISLPQTECFQGAIQF
        FPF+REKL GK+NPVIRAVEP+ACPSLTKGVYAYDYGDTAG+TPLMKMHTLGHDF+PDPIHAGGLRYHGMAPLISHVY LG++EAIS+PQ ECFQGAIQF
Subjt:  FPFLREKLAGKINPVIRAVEPAACPSLTKGVYAYDYGDTAGMTPLMKMHTLGHDFIPDPIHAGGLRYHGMAPLISHVYNLGLLEAISLPQTECFQGAIQF

Query:  ARSEGLIPAPEPTHAIAATIREALCCRETGESKVILTAMCGHGHFDLPAYEN------------------------KGRIPPPHARRTLPGSGVLHPEAF
        AR+EGLIPAPEPTHAIAATIREAL CRETGE+KVILTAMCGHGHFDLPAY+                         KGR+PP H RR LPG G+ HPE F
Subjt:  ARSEGLIPAPEPTHAIAATIREALCCRETGESKVILTAMCGHGHFDLPAYEN------------------------KGRIPPPHARRTLPGSGVLHPEAF

Query:  GHGLRPPPGAFPPFDMLPPPEVMEQKLAGQHVEIQKLATENQRLAATHGTLRQELAAAQHELQILHAQIGAVKSEREQQARNLSDKIAKMEAELQAAEPI
        G G+ PPPG FP FDM+PPPEVME KLA QH+E+Q+L TENQRLA+THGTLRQELA AQHELQ+LH QIGAVK+EREQ+ RN+ +KI+KMEAEL+AAEP+
Subjt:  GHGLRPPPGAFPPFDMLPPPEVMEQKLAGQHVEIQKLATENQRLAATHGTLRQELAAAQHELQILHAQIGAVKSEREQQARNLSDKIAKMEAELQAAEPI

Query:  KLELQQAKSDAQNLIVARQELITRVQHLTQDLQRAHGDVQQVPVLMSELESLRQEYQHCRATYDYEKKLYNDHLESLQVMEKNYITMARELEKLRAEL-T
        K ELQ+A+++AQ+L+ ARQELI++ Q L+QDLQRAH DVQQ+P LM+EL+SLRQEYQHCRATYD EKKLYNDHLESLQ+ME NY+TMARE+EKL+ EL  
Subjt:  KLELQQAKSDAQNLIVARQELITRVQHLTQDLQRAHGDVQQVPVLMSELESLRQEYQHCRATYDYEKKLYNDHLESLQVMEKNYITMARELEKLRAEL-T

Query:  NTASLERRHGGPYGTTQ-NNEIEASGNSAGQNTYEDGYGVAQGRGPLPA---------TAGAASSAGATAYTGPQTGSTATRPNFDATRGP---------
        N    +RR G PYG T   N+ EASG+  GQN YEDGYG +QGR P PA          AG ++ AG + Y G Q+   + RP +DA+RGP         
Subjt:  NTASLERRHGGPYGTTQ-NNEIEASGNSAGQNTYEDGYGVAQGRGPLPA---------TAGAASSAGATAYTGPQTGSTATRPNFDATRGP---------

Query:  -------------QRGPGYDGGRGSIYDSQRPGYDGQRGPGYNVP-GLPTYDAPR-GAGYDAQARG---VAGHAAPGNTAPYRSSTPPGRGGGYEAPSRG
                     QRG  YDG RGS YD+ RPGYD QRGP YNV  G P YD  +   GYD Q+RG     GH AP N APY SSTPP         +  
Subjt:  -------------QRGPGYDGGRGSIYDSQRPGYDGQRGPGYNVP-GLPTYDAPR-GAGYDAQARG---VAGHAAPGNTAPYRSSTPPGRGGGYEAPSRG

Query:  GGNP
        GGNP
Subjt:  GGNP

A0A7J6GSX6 Uncharacterized protein4.2e-30861.76Show/hide
Query:  VGEQSPGCLSLK-RIPTHLRFSNGYRTKAASNPNLKAVEVPRQWYNLVADLPVKPPPPLHPKTFEPIKPEDLSHLFPDELIKQEASTDRYINIPDEVVDI
        V E+  G   LK R P HLRFS+  RT+A  N N K+VE+PRQWYNL+ADLP+KPPPPL+P+T E +KPEDLS LFPDELIKQE +++RYI+IPDE+   
Subjt:  VGEQSPGCLSLK-RIPTHLRFSNGYRTKAASNPNLKAVEVPRQWYNLVADLPVKPPPPLHPKTFEPIKPEDLSHLFPDELIKQEASTDRYINIPDEVVDI

Query:  YGLWRPTPLIRAKRLEKLLDTPARIYYKYEGVSPAGSHKPNSAVPQVWYNVQEGVKNVVTETGAGQWGSSLAFACSIFGIGCEVWQVRASYDQKPYRRMM
                 + AKRLEKLLDTPARIYYKYEGVSPAGSHKPNSAVPQ +YN +EGVKNV                         VWQVRASYDQKPYRRMM
Subjt:  YGLWRPTPLIRAKRLEKLLDTPARIYYKYEGVSPAGSHKPNSAVPQVWYNVQEGVKNVVTETGAGQWGSSLAFACSIFGIGCEVWQVRASYDQKPYRRMM

Query:  MQTWGAKVHPSPSDLTDAGRKFLQIDPSSPGSLGIAISEAVEVAALNADTKYCLGSVLNHVLLHQTVIGEECLKQMEAIGESPDVIIGCTGGGSNFAGLS
        MQTWGAKVHPSPS+LT+AGR  LQ+DPSSPGSLGIAISEAVEVAA+N DTKYCLGSVLNHVLLHQTVIGEECLKQMEAIGE+PD++IGCT          
Subjt:  MQTWGAKVHPSPSDLTDAGRKFLQIDPSSPGSLGIAISEAVEVAALNADTKYCLGSVLNHVLLHQTVIGEECLKQMEAIGESPDVIIGCTGGGSNFAGLS

Query:  FPFLREKLAGKINPVIRAVEPAACPSLTKGVYAYDYGDTAGMTPLMKMHTLGHDFIPDPIHAGGLRYHGMAPLISHVYNLGLLEAISLPQTECFQGAIQF
                         AVEP+ACPSLTKGVYAYDYGDTAG+TPLMKMHTLGHDF+PDPIHAGGLRYHGMAPLISHVY LG++EAIS+PQ ECFQGAIQF
Subjt:  FPFLREKLAGKINPVIRAVEPAACPSLTKGVYAYDYGDTAGMTPLMKMHTLGHDFIPDPIHAGGLRYHGMAPLISHVYNLGLLEAISLPQTECFQGAIQF

Query:  ARSEGLIPAPEPTHAIAATIREALCCRETGESKVILTAMCGHGHFDLPAYE-------------------------------------------------
        AR+EGLIPAPEPTHAIAATIREAL CRETGE+KVILTAMCGHGHFDLPAY+                                                 
Subjt:  ARSEGLIPAPEPTHAIAATIREALCCRETGESKVILTAMCGHGHFDLPAYE-------------------------------------------------

Query:  ------------NKGRIPPPHARRTLPGSGVLHPEAFGHGLRPPPGAFPPFDMLPPPEVMEQKLAGQHVEIQKLATENQRLAATHGTLRQELAAAQHELQ
                    +KGR+PP H RR LPG G+ HPE FG G+ PPPG FP FDM+PPPEVME KLA QH+E+Q+L TENQRLA+THGTLRQELA AQHELQ
Subjt:  ------------NKGRIPPPHARRTLPGSGVLHPEAFGHGLRPPPGAFPPFDMLPPPEVMEQKLAGQHVEIQKLATENQRLAATHGTLRQELAAAQHELQ

Query:  ILHAQIGAVKSEREQQARNLSDKIAKMEAELQAAEPIKLELQQAKSDAQNLIVARQELITRVQHLTQDLQRAHGDVQQVPVLMSELESLRQEYQHCRATY
        +LH QIGAVK+EREQ+ RN+ +KI+KMEAEL+AAEP+K ELQ+A+++AQ+L+ ARQELI++ Q L+QDLQRAH DVQQ+P LM+EL+SLRQEYQHCRATY
Subjt:  ILHAQIGAVKSEREQQARNLSDKIAKMEAELQAAEPIKLELQQAKSDAQNLIVARQELITRVQHLTQDLQRAHGDVQQVPVLMSELESLRQEYQHCRATY

Query:  DYEKKLYNDHLESLQVMEKNYITMARELEKLRAEL-TNTASLERRHGGPYGTTQ-NNEIEASGNSAGQNTYEDGYGVAQGRGPLPA---------TAGAA
        D EKKLYNDHLESLQ+ME NY+TMARE+EKL+ EL  N    +RR G PYG T   N+ EASG+  GQN YEDGYG +QGR P PA          AG +
Subjt:  DYEKKLYNDHLESLQVMEKNYITMARELEKLRAEL-TNTASLERRHGGPYGTTQ-NNEIEASGNSAGQNTYEDGYGVAQGRGPLPA---------TAGAA

Query:  SSAGATAYTGPQTGSTATRPNFDATRGP----------------------QRGPGYDGGRGSIYDSQRPGYDGQRGPGYNVP-GLPTYDAPR-GAGYDAQ
        + AG + Y G Q+   + RP +DA+RGP                      QRG  YDG RGS YD+ RPGYD QRGP YNV  G P YD  +   GYD Q
Subjt:  SSAGATAYTGPQTGSTATRPNFDATRGP----------------------QRGPGYDGGRGSIYDSQRPGYDGQRGPGYNVP-GLPTYDAPR-GAGYDAQ

Query:  ARG---VAGHAAPGNTAPYRSSTPPGRGGGYEAPSRGGGNPGRR
        +RG     GH AP N APY SSTPP         +  GGNP RR
Subjt:  ARG---VAGHAAPGNTAPYRSSTPPGRGGGYEAPSRGGGNPGRR

SwissProt top hitse value%identityAlignment
O29028 Tryptophan synthase beta chain 21.3e-14960.39Show/hide
Query:  EVPRQWYNLVADLPVKPPPPLHPKTFEPIKPEDLSHLFPDELIKQEASTDRYINIPDEVVDIYGLWRPTPLIRAKRLEKLLDTPARIYYKYEGVSPAGSH
        E+P++WYN++ DLP   PPPLHP T EP+KPEDL  +FP  LI+QE S +R+I IP++V +IY +WRPTPL+RA+RLEK L TPARIY+KYEG SP GSH
Subjt:  EVPRQWYNLVADLPVKPPPPLHPKTFEPIKPEDLSHLFPDELIKQEASTDRYINIPDEVVDIYGLWRPTPLIRAKRLEKLLDTPARIYYKYEGVSPAGSH

Query:  KPNSAVPQVWYNVQEGVKNVVTETGAGQWGSSLAFACSIFGIGCEVWQVRASYDQKPYRRMMMQTWGAKVHPSPSDLTDAGRKFLQIDPSSPGSLGIAIS
        KPN+AV Q +YN +EGV+ + TETGAGQWGS+L FA  +F + C V+ V+ S+ QKPYRR+MM+TWG +V PSPSD T+ GRK L  +P +PGSLGIAIS
Subjt:  KPNSAVPQVWYNVQEGVKNVVTETGAGQWGSSLAFACSIFGIGCEVWQVRASYDQKPYRRMMMQTWGAKVHPSPSDLTDAGRKFLQIDPSSPGSLGIAIS

Query:  EAVEVAALNADTKYCLGSVLNHVLLHQTVIGEECLKQMEAIGESPDVIIGCTGGGSNFAGLSFPFLREKLAGKINPVIRAVEPAACPSLTKGVYAYDYGD
        EA+E AA N +TKY LGSVLNHVLLHQTVIG E   Q+E + E PDV+IGC GGGSNFAGL++PF+ +   G +   I AVEPAACP+LT G Y YD+GD
Subjt:  EAVEVAALNADTKYCLGSVLNHVLLHQTVIGEECLKQMEAIGESPDVIIGCTGGGSNFAGLSFPFLREKLAGKINPVIRAVEPAACPSLTKGVYAYDYGD

Query:  TAGMTPLMKMHTLGHDFIPDPIHAGGLRYHGMAPLISHVYNLGLLEAISLPQTECFQGAIQFARSEGLIPAPEPTHAIAATIREALCCRETGESKVILTA
         AG+TPL+KM+TLGHDFIP PIHAGGLRYHG AP +  +   G+++A ++ Q   F+  + FAR+EG+IPAPE  HA+ A I EA+  RE  E KVI+  
Subjt:  TAGMTPLMKMHTLGHDFIPDPIHAGGLRYHGMAPLISHVYNLGLLEAISLPQTECFQGAIQFARSEGLIPAPEPTHAIAATIREALCCRETGESKVILTA

Query:  MCGHGHFDLPAYEN
          GHG  DL AY++
Subjt:  MCGHGHFDLPAYEN

O67409 Tryptophan synthase beta chain 21.7e-15262.02Show/hide
Query:  EVPRQWYNLVADLPVKPPPPLHPKTFEPIKPEDLSHLFPDELIKQEASTDRYINIPDEVVDIYGLWRPTPLIRAKRLEKLLDTPARIYYKYEGVSPAGSH
        E+P++W N++  LP    PPL P+T EP+KPE L  +FP+ L++QE S   +I+IP+EV+DIY LWRPTPL RAK LE+ L TPA+I+YK E VSP GSH
Subjt:  EVPRQWYNLVADLPVKPPPPLHPKTFEPIKPEDLSHLFPDELIKQEASTDRYINIPDEVVDIYGLWRPTPLIRAKRLEKLLDTPARIYYKYEGVSPAGSH

Query:  KPNSAVPQVWYNVQEGVKNVVTETGAGQWGSSLAFACSIFGIGCEVWQVRASYDQKPYRRMMMQTWGAKVHPSPSDLTDAGRKFLQIDPSSPGSLGIAIS
        KPN+AV Q +YN   GVK + TETGAGQWGS+L+FA   F + C V+ VR SY+QKPYRR++M+TW  +V PSPS  T+AGRK+ + +P  PGSLGIAIS
Subjt:  KPNSAVPQVWYNVQEGVKNVVTETGAGQWGSSLAFACSIFGIGCEVWQVRASYDQKPYRRMMMQTWGAKVHPSPSDLTDAGRKFLQIDPSSPGSLGIAIS

Query:  EAVEVAALNADTKYCLGSVLNHVLLHQTVIGEECLKQMEAIGESPDVIIGCTGGGSNFAGLSFPFLREKLAG---KINPVIRAVEPAACPSLTKGVYAYD
        EA+E AA   DTKY LGSVLNHVLLHQTVIG E  KQME  G  PDVIIG  GGGSNFAGLSFPFL + L G   K +  + AVEP ACP+LTKG Y YD
Subjt:  EAVEVAALNADTKYCLGSVLNHVLLHQTVIGEECLKQMEAIGESPDVIIGCTGGGSNFAGLSFPFLREKLAG---KINPVIRAVEPAACPSLTKGVYAYD

Query:  YGDTAGMTPLMKMHTLGHDFIPDPIHAGGLRYHGMAPLISHVYNLGLLEAISLPQTECFQGAIQFARSEGLIPAPEPTHAIAATIREALCCRETGESKVI
        +GD+ G+TPL+KM+TLGHDF+P PIHAGGLRYHG APL+  +YNLG ++A++  QTE F+ A+ FAR+EG++PAPE  HAI A I EAL C+ETGE KVI
Subjt:  YGDTAGMTPLMKMHTLGHDFIPDPIHAGGLRYHGMAPLISHVYNLGLLEAISLPQTECFQGAIQFARSEGLIPAPEPTHAIAATIREALCCRETGESKVI

Query:  LTAMCGHGHFDLPAYE
        L  + GHG+FDL AY+
Subjt:  LTAMCGHGHFDLPAYE

Q8Q001 Tryptophan synthase beta chain 25.0e-14959.81Show/hide
Query:  EVPRQWYNLVADLPVKPPPPLHPKTFEPIKPEDLSHLFPDELIKQEASTDRYINIPDEVVDIYGLWRPTPLIRAKRLEKLLDTPARIYYKYEGVSPAGSH
        E+P++WYN+++DLP    PPL P+T++PI PE L  +F  ELI+QE S+DRYI+IP E++D+Y LWRP+PL RA +LEK L TPA+IYYKYEGVSPAGSH
Subjt:  EVPRQWYNLVADLPVKPPPPLHPKTFEPIKPEDLSHLFPDELIKQEASTDRYINIPDEVVDIYGLWRPTPLIRAKRLEKLLDTPARIYYKYEGVSPAGSH

Query:  KPNSAVPQVWYNVQEGVKNVVTETGAGQWGSSLAFACSIFGIGCEVWQVRASYDQKPYRRMMMQTWGAKVHPSPSDLTDAGRKFLQIDPSSPGSLGIAIS
        K N+++ Q +YN++EG + + TETGAGQWGS+L+ AC+ F + C+V+ VR+SY QKPYR+ +M  WG  V PSPS  T+ GRK L+  P +PGSLGIAIS
Subjt:  KPNSAVPQVWYNVQEGVKNVVTETGAGQWGSSLAFACSIFGIGCEVWQVRASYDQKPYRRMMMQTWGAKVHPSPSDLTDAGRKFLQIDPSSPGSLGIAIS

Query:  EAVEVAALNADTKYCLGSVLNHVLLHQTVIGEECLKQMEAIGESPDVIIGCTGGGSNFAGLSFPFLREKLAGKINPVIRAVEPAACPSLTKGVYAYDYGD
        EAVE A  + +TKY LGSVLNHV+LHQTVIG EC KQ+E + E PDV+IGC GGGSN  G+   F++++L GK N  + AVEP+ACPSLTKG Y YD+GD
Subjt:  EAVEVAALNADTKYCLGSVLNHVLLHQTVIGEECLKQMEAIGESPDVIIGCTGGGSNFAGLSFPFLREKLAGKINPVIRAVEPAACPSLTKGVYAYDYGD

Query:  TAGMTPLMKMHTLGHDFIPDPIHAGGLRYHGMAPLISHVYNLGLLEAISLPQTECFQGAIQFARSEGLIPAPEPTHAIAATIREALCCRETGESKVILTA
        TA MTPL+KM+TLGH  IP  IHAGGLRYHG +P+IS +   GL+EA+S  Q E F  A+QFAR+EG++PAPE +HAI   I EAL  ++ GE KVIL  
Subjt:  TAGMTPLMKMHTLGHDFIPDPIHAGGLRYHGMAPLISHVYNLGLLEAISLPQTECFQGAIQFARSEGLIPAPEPTHAIAATIREALCCRETGESKVILTA

Query:  MCGHGHFDLPAYE
        + GHGHFD+ +Y+
Subjt:  MCGHGHFDLPAYE

Q8TL44 Tryptophan synthase beta chain 21.5e-14858.6Show/hide
Query:  EVPRQWYNLVADLPVKPPPPLHPKTFEPIKPEDLSHLFPDELIKQEASTDRYINIPDEVVDIYGLWRPTPLIRAKRLEKLLDTPARIYYKYEGVSPAGSH
        E+P++WYN++ADLP    PPL P+T++PI P+ L  +FP  LI QE S+DRYI+IP+EV+D+Y LWRP+PL RA +LEK+L +PA+IYYKYEGVSPAGSH
Subjt:  EVPRQWYNLVADLPVKPPPPLHPKTFEPIKPEDLSHLFPDELIKQEASTDRYINIPDEVVDIYGLWRPTPLIRAKRLEKLLDTPARIYYKYEGVSPAGSH

Query:  KPNSAVPQVWYNVQEGVKNVVTETGAGQWGSSLAFACSIFGIGCEVWQVRASYDQKPYRRMMMQTWGAKVHPSPSDLTDAGRKFLQIDPSSPGSLGIAIS
        K N+++ Q +YN++EG + + TETGAGQWGS+L+ AC+ F + C+V+ VR+S+ QKPYR+ ++  WG  V PSPS  T+ GRK LQ  P +PGSLGIAIS
Subjt:  KPNSAVPQVWYNVQEGVKNVVTETGAGQWGSSLAFACSIFGIGCEVWQVRASYDQKPYRRMMMQTWGAKVHPSPSDLTDAGRKFLQIDPSSPGSLGIAIS

Query:  EAVEVAALNADTKYCLGSVLNHVLLHQTVIGEECLKQMEAIGESPDVIIGCTGGGSNFAGLSFPFLREKLAGKINPVIRAVEPAACPSLTKGVYAYDYGD
        EAVE A  + +TKY LGSVLNHV+LHQTVIG EC +Q+  + E PDV+IGC GGGSN  G+   F++++L GK +  + AVEP+ACPSLTKG Y YD+GD
Subjt:  EAVEVAALNADTKYCLGSVLNHVLLHQTVIGEECLKQMEAIGESPDVIIGCTGGGSNFAGLSFPFLREKLAGKINPVIRAVEPAACPSLTKGVYAYDYGD

Query:  TAGMTPLMKMHTLGHDFIPDPIHAGGLRYHGMAPLISHVYNLGLLEAISLPQTECFQGAIQFARSEGLIPAPEPTHAIAATIREALCCRETGESKVILTA
        TA MTPL+KM+TLGH  +P  IHAGGLRYHG +P+IS + + GL+EA+S  Q E F  A+QFAR+EG++PAPE +HAI   I EAL  ++TGE K IL  
Subjt:  TAGMTPLMKMHTLGHDFIPDPIHAGGLRYHGMAPLISHVYNLGLLEAISLPQTECFQGAIQFARSEGLIPAPEPTHAIAATIREALCCRETGESKVILTA

Query:  MCGHGHFDLPAYE
        + GHGHFD+ +Y+
Subjt:  MCGHGHFDLPAYE

Q9WZ09 Tryptophan synthase beta chain 26.3e-14460.43Show/hide
Query:  NLKAVEVPRQWYNLVADLPVKPPPPLHPKTFEPIKPEDLSHLFPDELIKQEASTDRYINIPDEVVDIYGLWRPTPLIRAKRLEKLLDTPARIYYKYEGVS
        NLK  E+P+ WYN++ADLP K  PPL P+T +PI PE LS +FP  LI+QE S +R+I IP+ V+  Y ++RPTPLIRA  LE+ L TPARIYYKYEGVS
Subjt:  NLKAVEVPRQWYNLVADLPVKPPPPLHPKTFEPIKPEDLSHLFPDELIKQEASTDRYINIPDEVVDIYGLWRPTPLIRAKRLEKLLDTPARIYYKYEGVS

Query:  PAGSHKPNSAVPQVWYNVQEGVKNVVTETGAGQWGSSLAFACSIFGIGCEVWQVRASYDQKPYRRMMMQTWGAKVHPSPSDLTDAGRKFLQIDPSSPGSL
        P GSHKPN+A+ Q +YN  EGVK +VTETGAGQWGS+L++A + FG+  +V+ V+ SY QKP R+ MM  +G KV PSPS+ T+ GRK L  DP +PGSL
Subjt:  PAGSHKPNSAVPQVWYNVQEGVKNVVTETGAGQWGSSLAFACSIFGIGCEVWQVRASYDQKPYRRMMMQTWGAKVHPSPSDLTDAGRKFLQIDPSSPGSL

Query:  GIAISEAVEVAALNADTKYCLGSVLNHVLLHQTVIGEECLKQMEAIGESPDVIIGCTGGGSNFAGLSFPFLREKLAGKINPVIRAVEPAACPSLTKGVYA
        GIAISEA+EVA  + +TKY LGSVLNHVLLHQTVIG E  KQ+E IGE PD+++GC GGGSNF G   PF+ +KL+G+    + A EPAACPSLTKG Y 
Subjt:  GIAISEAVEVAALNADTKYCLGSVLNHVLLHQTVIGEECLKQMEAIGESPDVIIGCTGGGSNFAGLSFPFLREKLAGKINPVIRAVEPAACPSLTKGVYA

Query:  YDYGDTAGMTPLMKMHTLGHDFIPDPIHAGGLRYHGMAPLISHVYNLGLLEAISLPQTECFQGAIQFARSEGLIPAPEPTHAIAATIREALCCRETGESK
        YD+GDTAG+TPL+KM+TLG DFIP  IHAGGLRYHG AP+I+ +   GL+EA +  Q E F+ A  FA+ EG+IPAPE  HAIA  IREA   +E G+ +
Subjt:  YDYGDTAGMTPLMKMHTLGHDFIPDPIHAGGLRYHGMAPLISHVYNLGLLEAISLPQTECFQGAIQFARSEGLIPAPEPTHAIAATIREALCCRETGESK

Query:  VILTAMCGHGHFDLPAY
        VI+  + GHG  DL AY
Subjt:  VILTAMCGHGHFDLPAY

Arabidopsis top hitse value%identityAlignment
AT1G55170.1 unknown protein6.5e-2736Show/hide
Query:  PPPGAFPPFDMLPPPEVMEQKLAGQHVEIQKLATENQRLAATHGTLRQELAAAQHELQILHAQIGAVKSEREQQARNLSDKIAKMEAELQAAEPIKLELQ
        PP    PP  +L   ++ E ++  Q  EI++L ++N  LA     L +EL AA+ EL  ++  I  +++E++ Q R  S+K  K+E +++A E  K E  
Subjt:  PPPGAFPPFDMLPPPEVMEQKLAGQHVEIQKLATENQRLAATHGTLRQELAAAQHELQILHAQIGAVKSEREQQARNLSDKIAKMEAELQAAEPIKLELQ

Query:  QAKSDAQNLIVARQELITRVQHLTQDLQRAHGDVQQVPVLMSELESLRQEYQHCRATYDYEKKLYNDHLESLQVMEKNYITMARELEKLRAELTNTASLE
        Q + + Q L   ++EL   VQ L +DL +   D +Q+P + +E++ L++E  H R   +YEKK   + +E  Q MEKN ++MARE+EKLRAEL    S  
Subjt:  QAKSDAQNLIVARQELITRVQHLTQDLQRAHGDVQQVPVLMSELESLRQEYQHCRATYDYEKKLYNDHLESLQVMEKNYITMARELEKLRAELTNTASLE

Query:  RRHGGPYGTTQNNEIEASGNSAGQN
           GG YG   NN       S G+N
Subjt:  RRHGGPYGTTQNNEIEASGNSAGQN

AT1G67170.1 unknown protein4.1e-9052.97Show/hide
Query:  ENKGRIPPP--HARRTLPGSG--VLHPEAFG-HGLRPP---PGAFPPFDMLPPPEVMEQKLAGQHVEIQKLATENQRLAATHGTLRQELAAAQHELQILH
        E+KGRI P   H RR LPG G  + HPE FG HG  PP    G +P F+MLPPPEVMEQK   QH E+Q+LA ENQRL  THG+LRQELAAAQHE+Q+LH
Subjt:  ENKGRIPPP--HARRTLPGSG--VLHPEAFG-HGLRPP---PGAFPPFDMLPPPEVMEQKLAGQHVEIQKLATENQRLAATHGTLRQELAAAQHELQILH

Query:  AQIGAVKSEREQQARNLSDKIAKMEAELQAAEPIKLELQQAKSDAQNLIVARQELITRVQHLTQDLQRAHGDVQQVPVLMSELESLRQEYQHCRATYDYE
        AQIG++KSEREQ+   L++K+AKME ELQ +E +KLE+QQA+++A++L+VAR+EL+++V  LTQ+LQ++  DVQQ+P LMSELE+LRQEYQ CRATYDYE
Subjt:  AQIGAVKSEREQQARNLSDKIAKMEAELQAAEPIKLELQQAKSDAQNLIVARQELITRVQHLTQDLQRAHGDVQQVPVLMSELESLRQEYQHCRATYDYE

Query:  KKLYNDHLESLQVMEKNYITMARELEKLRAELTNTASLERRHGGPYGTTQNNEIEASGNSAGQNTYEDGYGVAQGRGPLPATAGAASSAGATAYTGPQTG
        KK YNDHLESLQ MEKNY+TMARE+EKL+A+L N A+ +RR GGPYG   N EI+ASG+ +G   YED +G  QG  P P    A         TGP + 
Subjt:  KKLYNDHLESLQVMEKNYITMARELEKLRAELTNTASLERRHGGPYGTTQNNEIEASGNSAGQNTYEDGYGVAQGRGPLPATAGAASSAGATAYTGPQTG

Query:  STATRPNFDATRGPQRGPGYDGGRGSIYDSQRPGYDGQRGPGYNVPGLPTYDAPRGAGYDAQARGVAGHAAPGNTAPYRSSTPPGRGGGYEAPSRG-GGN
                 A + P +G    G     Y  QRPGY+  RGP    PG           YD   R       P  T PY +  PPG       P  G  GN
Subjt:  STATRPNFDATRGPQRGPGYDGGRGSIYDSQRPGYDGQRGPGYNVPGLPTYDAPRGAGYDAQARGVAGHAAPGNTAPYRSSTPPGRGGGYEAPSRG-GGN

Query:  PGRR
        P RR
Subjt:  PGRR

AT3G14750.1 unknown protein5.1e-3234.12Show/hide
Query:  KGRIPPPHARRTLPGSGVLHP-------EAFGHGLRPPP--------GAFPPFDM----LPPP-EVMEQKLAGQHVEIQKLATENQRLAATHGTLRQELA
        + R PPP + +    SG+  P          G G  PPP           P F +    LPP   ++E +LA Q+ ++Q L  +NQRLAATH  L+QEL 
Subjt:  KGRIPPPHARRTLPGSGVLHP-------EAFGHGLRPPP--------GAFPPFDM----LPPP-EVMEQKLAGQHVEIQKLATENQRLAATHGTLRQELA

Query:  AAQHELQILHAQIGAVKSEREQQARNLSDKIAKMEAELQAAEPIKLELQQAKSDAQNLIVARQELITRVQHLTQDLQRAHGDVQQVPVLMSELESLRQEY
         AQHELQ +   I ++++E E   R + DK  + E EL+  + ++ E+Q+ ++D +     RQEL ++V  +TQDL R   D+QQ+P L +E+E+ +QE 
Subjt:  AAQHELQILHAQIGAVKSEREQQARNLSDKIAKMEAELQAAEPIKLELQQAKSDAQNLIVARQELITRVQHLTQDLQRAHGDVQQVPVLMSELESLRQEY

Query:  QHCRATYDYEKKLYNDHLESLQVMEKNYITMARELEKLRAELTNTASLERRHGGPYGTTQNNEIEASGNSAGQNTYEDGYGVAQGRGPL----PATAGAA
        Q  RA  DYEKK Y ++ E  ++ME   + MARELEKLRAE+ N+ +      GP G            + G   Y  GYG  +   P+    P      
Subjt:  QHCRATYDYEKKLYNDHLESLQVMEKNYITMARELEKLRAELTNTASLERRHGGPYGTTQNNEIEASGNSAGQNTYEDGYGVAQGRGPL----PATAGAA

Query:  SSAGATAYTGPQTG-STATRPNFDATRGPQRGPGYDG
        +  G   Y  P  G   A    +D  +  Q+ P   G
Subjt:  SSAGATAYTGPQTG-STATRPNFDATRGPQRGPGYDG

AT5G38530.1 tryptophan synthase beta type 24.7e-21977.92Show/hide
Query:  PPSFPSSLINLNLFNSVGEQSPGCLSLKRIPTH-LRFSNG--YRTKAA--SNPNLKAVEVPRQWYNLVADLPVKPPPPLHPKTFEPIKPEDLSHLFPDEL
        PP+  +  ++  +F +   Q    L+LKR      R SNG   R KAA  S  N   VE+P+QWYNLVADL VKPPPPLHPKTFEPIKPEDL+HLFP+EL
Subjt:  PPSFPSSLINLNLFNSVGEQSPGCLSLKRIPTH-LRFSNG--YRTKAA--SNPNLKAVEVPRQWYNLVADLPVKPPPPLHPKTFEPIKPEDLSHLFPDEL

Query:  IKQEASTDRYINIPDEVVDIYGLWRPTPLIRAKRLEKLLDTPARIYYKYEGVSPAGSHKPNSAVPQVWYNVQEGVKNVVTETGAGQWGSSLAFACSIFGI
        IKQEA+ +R+I+IP+EV++IY LWRPTPLIRAKRLEKLL TPARIY+KYEG SPAGSHKPN+AVPQ +YN +EGVKNVVTETGAGQWGSSLAFA S+FG+
Subjt:  IKQEASTDRYINIPDEVVDIYGLWRPTPLIRAKRLEKLLDTPARIYYKYEGVSPAGSHKPNSAVPQVWYNVQEGVKNVVTETGAGQWGSSLAFACSIFGI

Query:  GCEVWQVRASYDQKPYRRMMMQTWGAKVHPSPSDLTDAGRKFLQIDPSSPGSLGIAISEAVEVAALNADTKYCLGSVLNHVLLHQTVIGEECLKQMEAIG
         CEVWQV  SY  KPYRR+MMQTWGAKVHPSPSDLT+AGR+ L+ DPSSPGSLGIAISEAVEVAA N DTKYCLGSVLNHVLLHQT+IGEEC++QME  G
Subjt:  GCEVWQVRASYDQKPYRRMMMQTWGAKVHPSPSDLTDAGRKFLQIDPSSPGSLGIAISEAVEVAALNADTKYCLGSVLNHVLLHQTVIGEECLKQMEAIG

Query:  ESPDVIIGCTGGGSNFAGLSFPFLREKLAGKINPVIRAVEPAACPSLTKGVYAYDYGDTAGMTPLMKMHTLGHDFIPDPIHAGGLRYHGMAPLISHVYNL
        E+PD+IIGCTGGGSNFAGLSFPF+REKL GKINPVIRAVEP+ACPSLTKGVYAYD+GDTAG+TPLMKMHTLGHDFIPDPIHAGGLRYHGMAPLISHVY  
Subjt:  ESPDVIIGCTGGGSNFAGLSFPFLREKLAGKINPVIRAVEPAACPSLTKGVYAYDYGDTAGMTPLMKMHTLGHDFIPDPIHAGGLRYHGMAPLISHVYNL

Query:  GLLEAISLPQTECFQGAIQFARSEGLIPAPEPTHAIAATIREALCCRETGESKVILTAMCGHGHFDLPAYE
        G +EAIS+PQ ECFQGAIQFAR+EG+IPAPEPTHAIAATIREAL C+ETGE+KVIL AMCGHGHFDL +Y+
Subjt:  GLLEAISLPQTECFQGAIQFARSEGLIPAPEPTHAIAATIREALCCRETGESKVILTAMCGHGHFDLPAYE

AT5G61920.1 unknown protein3.8e-1932.96Show/hide
Query:  EVMEQKLAGQHVEIQKLATENQRLAATHGTLRQELAAAQHELQILHAQIGAVKSEREQQARNLSDKIAKMEAELQAAEPIKLELQQAKSDAQNLIVARQE
        +++E K+A Q  EI +L+ +N++LA+++  L+++L  A  E+Q L A I   +++ E Q R+  +KIAKME  ++  E I+ E+Q A  +A  L   R+E
Subjt:  EVMEQKLAGQHVEIQKLATENQRLAATHGTLRQELAAAQHELQILHAQIGAVKSEREQQARNLSDKIAKMEAELQAAEPIKLELQQAKSDAQNLIVARQE

Query:  LITRVQHLTQDLQRAHGDVQQVPVLMSELESLRQEYQHCRATYDYEKKLYNDHLESLQVMEKNYITMARELEKLRAELT
        L ++V+   +DL++   + + +     ELE L++E+Q  R  ++ EK    + L  L+ ME+  I   + +EKLR+E++
Subjt:  LITRVQHLTQDLQRAHGDVQQVPVLMSELESLRQEYQHCRATYDYEKKLYNDHLESLQVMEKNYITMARELEKLRAELT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAACACTTTCAATCTGATATTAAGCTTCTCAGCCCCTCCCTCTTTCCCCTCATCTTTGATAAATTTAAACCTTTTCAACTCAGTTGGGGAGCAATCGCCTGGATG
TCTTTCACTTAAGAGAATTCCAACACATTTGAGGTTTTCAAATGGTTATAGAACAAAAGCAGCATCAAACCCTAATTTGAAGGCAGTCGAGGTCCCGCGCCAATGGTACA
ATCTGGTTGCAGATCTTCCTGTAAAACCTCCTCCTCCATTGCATCCCAAGACTTTTGAACCAATAAAACCCGAAGATTTGTCGCATCTCTTTCCTGATGAATTGATTAAG
CAAGAAGCCAGTACTGATCGGTACATAAACATCCCTGACGAAGTTGTCGATATCTACGGGCTTTGGCGCCCAACTCCTCTGATCAGAGCTAAGAGGTTAGAGAAGCTTCT
TGACACACCAGCTAGAATTTACTACAAGTATGAAGGAGTTAGCCCTGCTGGGTCACATAAACCCAACTCTGCTGTTCCACAAGTTTGGTATAATGTACAAGAAGGTGTCA
AAAATGTTGTTACAGAAACTGGGGCTGGGCAATGGGGATCCAGTTTGGCCTTTGCTTGCAGCATATTTGGTATTGGCTGTGAGGTGTGGCAAGTGCGTGCGTCTTATGAT
CAAAAACCGTATCGACGAATGATGATGCAAACTTGGGGGGCAAAAGTACACCCATCTCCATCCGATCTAACAGATGCAGGTAGAAAATTTCTTCAAATTGACCCGTCAAG
CCCGGGAAGTTTGGGAATTGCCATTTCAGAAGCTGTAGAGGTTGCAGCTCTCAATGCTGATACCAAATACTGTTTGGGAAGTGTCCTCAACCATGTTTTGCTTCACCAAA
CTGTCATCGGTGAAGAGTGTCTAAAACAAATGGAAGCCATAGGTGAGAGCCCGGATGTTATCATCGGTTGCACGGGCGGAGGATCAAATTTCGCAGGTCTTAGCTTCCCA
TTCCTTCGTGAGAAACTTGCTGGAAAAATAAACCCTGTTATTAGAGCAGTTGAACCTGCAGCATGTCCTTCATTAACCAAAGGCGTATACGCTTATGATTATGGCGATAC
AGCGGGGATGACGCCGTTGATGAAAATGCACACACTTGGTCATGACTTCATTCCCGACCCCATTCACGCTGGTGGATTGCGGTATCATGGTATGGCTCCATTGATCTCAC
ATGTCTATAATCTTGGTCTCTTGGAAGCTATATCACTTCCCCAGACAGAATGCTTTCAAGGTGCTATACAATTTGCACGATCAGAAGGACTAATACCAGCACCCGAGCCA
ACTCACGCCATAGCTGCAACTATCCGAGAGGCTCTTTGCTGTCGAGAAACCGGAGAATCAAAAGTTATTCTCACAGCCATGTGTGGGCATGGTCACTTTGATTTGCCAGC
TTATGAGAATAAAGGTCGTATTCCGCCTCCACATGCAAGGCGTACTCTTCCAGGTTCTGGTGTACTGCATCCAGAAGCATTTGGTCATGGGCTACGTCCTCCACCAGGTG
CCTTTCCTCCCTTTGACATGTTGCCTCCTCCAGAAGTGATGGAACAGAAGCTAGCTGGGCAACATGTTGAGATACAGAAACTTGCCACCGAGAATCAGAGGCTTGCTGCT
ACCCATGGAACTTTGAGGCAAGAGCTTGCTGCTGCCCAACATGAGCTGCAAATTCTTCATGCTCAAATAGGAGCTGTGAAATCTGAAAGAGAACAGCAAGCAAGGAATCT
TTCTGATAAAATTGCAAAGATGGAAGCAGAGTTACAAGCAGCTGAGCCAATTAAGTTAGAGTTGCAGCAAGCAAAATCAGATGCTCAAAACTTGATTGTAGCAAGGCAGG
AACTTATTACTAGAGTGCAACATTTGACCCAAGACCTCCAGAGGGCCCATGGAGATGTGCAGCAAGTTCCTGTTTTGATGTCAGAACTTGAGAGTTTGAGGCAGGAATAT
CAGCATTGTAGGGCGACCTATGACTATGAGAAAAAATTATACAATGATCATCTTGAGTCACTTCAGGTGATGGAAAAGAATTATATCACTATGGCTAGGGAACTGGAGAA
GCTTAGGGCAGAGTTGACAAATACTGCTAGTTTAGAACGAAGACATGGTGGACCTTATGGTACTACCCAAAACAACGAGATAGAGGCTTCTGGGAATTCAGCTGGACAAA
ATACTTATGAAGACGGCTATGGTGTTGCCCAGGGGCGTGGCCCGCTGCCCGCTACTGCAGGTGCTGCATCCAGTGCTGGTGCAACTGCATACACGGGACCTCAAACTGGT
TCTACTGCAACCCGACCCAACTTCGATGCCACAAGGGGTCCCCAACGAGGACCCGGTTATGATGGAGGAAGAGGATCCATTTATGATTCACAGAGACCTGGTTATGATGG
ACAAAGAGGACCGGGTTACAATGTGCCTGGGTTACCTACTTATGATGCACCTAGAGGAGCTGGCTATGATGCTCAAGCTAGAGGTGTTGCCGGACATGCTGCACCTGGAA
ACACCGCTCCTTATCGGTCTTCAACACCGCCTGGTCGTGGTGGTGGCTACGAAGCACCATCTCGAGGTGGAGGAAACCCCGGTAGGAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAAACACTTTCAATCTGATATTAAGCTTCTCAGCCCCTCCCTCTTTCCCCTCATCTTTGATAAATTTAAACCTTTTCAACTCAGTTGGGGAGCAATCGCCTGGATG
TCTTTCACTTAAGAGAATTCCAACACATTTGAGGTTTTCAAATGGTTATAGAACAAAAGCAGCATCAAACCCTAATTTGAAGGCAGTCGAGGTCCCGCGCCAATGGTACA
ATCTGGTTGCAGATCTTCCTGTAAAACCTCCTCCTCCATTGCATCCCAAGACTTTTGAACCAATAAAACCCGAAGATTTGTCGCATCTCTTTCCTGATGAATTGATTAAG
CAAGAAGCCAGTACTGATCGGTACATAAACATCCCTGACGAAGTTGTCGATATCTACGGGCTTTGGCGCCCAACTCCTCTGATCAGAGCTAAGAGGTTAGAGAAGCTTCT
TGACACACCAGCTAGAATTTACTACAAGTATGAAGGAGTTAGCCCTGCTGGGTCACATAAACCCAACTCTGCTGTTCCACAAGTTTGGTATAATGTACAAGAAGGTGTCA
AAAATGTTGTTACAGAAACTGGGGCTGGGCAATGGGGATCCAGTTTGGCCTTTGCTTGCAGCATATTTGGTATTGGCTGTGAGGTGTGGCAAGTGCGTGCGTCTTATGAT
CAAAAACCGTATCGACGAATGATGATGCAAACTTGGGGGGCAAAAGTACACCCATCTCCATCCGATCTAACAGATGCAGGTAGAAAATTTCTTCAAATTGACCCGTCAAG
CCCGGGAAGTTTGGGAATTGCCATTTCAGAAGCTGTAGAGGTTGCAGCTCTCAATGCTGATACCAAATACTGTTTGGGAAGTGTCCTCAACCATGTTTTGCTTCACCAAA
CTGTCATCGGTGAAGAGTGTCTAAAACAAATGGAAGCCATAGGTGAGAGCCCGGATGTTATCATCGGTTGCACGGGCGGAGGATCAAATTTCGCAGGTCTTAGCTTCCCA
TTCCTTCGTGAGAAACTTGCTGGAAAAATAAACCCTGTTATTAGAGCAGTTGAACCTGCAGCATGTCCTTCATTAACCAAAGGCGTATACGCTTATGATTATGGCGATAC
AGCGGGGATGACGCCGTTGATGAAAATGCACACACTTGGTCATGACTTCATTCCCGACCCCATTCACGCTGGTGGATTGCGGTATCATGGTATGGCTCCATTGATCTCAC
ATGTCTATAATCTTGGTCTCTTGGAAGCTATATCACTTCCCCAGACAGAATGCTTTCAAGGTGCTATACAATTTGCACGATCAGAAGGACTAATACCAGCACCCGAGCCA
ACTCACGCCATAGCTGCAACTATCCGAGAGGCTCTTTGCTGTCGAGAAACCGGAGAATCAAAAGTTATTCTCACAGCCATGTGTGGGCATGGTCACTTTGATTTGCCAGC
TTATGAGAATAAAGGTCGTATTCCGCCTCCACATGCAAGGCGTACTCTTCCAGGTTCTGGTGTACTGCATCCAGAAGCATTTGGTCATGGGCTACGTCCTCCACCAGGTG
CCTTTCCTCCCTTTGACATGTTGCCTCCTCCAGAAGTGATGGAACAGAAGCTAGCTGGGCAACATGTTGAGATACAGAAACTTGCCACCGAGAATCAGAGGCTTGCTGCT
ACCCATGGAACTTTGAGGCAAGAGCTTGCTGCTGCCCAACATGAGCTGCAAATTCTTCATGCTCAAATAGGAGCTGTGAAATCTGAAAGAGAACAGCAAGCAAGGAATCT
TTCTGATAAAATTGCAAAGATGGAAGCAGAGTTACAAGCAGCTGAGCCAATTAAGTTAGAGTTGCAGCAAGCAAAATCAGATGCTCAAAACTTGATTGTAGCAAGGCAGG
AACTTATTACTAGAGTGCAACATTTGACCCAAGACCTCCAGAGGGCCCATGGAGATGTGCAGCAAGTTCCTGTTTTGATGTCAGAACTTGAGAGTTTGAGGCAGGAATAT
CAGCATTGTAGGGCGACCTATGACTATGAGAAAAAATTATACAATGATCATCTTGAGTCACTTCAGGTGATGGAAAAGAATTATATCACTATGGCTAGGGAACTGGAGAA
GCTTAGGGCAGAGTTGACAAATACTGCTAGTTTAGAACGAAGACATGGTGGACCTTATGGTACTACCCAAAACAACGAGATAGAGGCTTCTGGGAATTCAGCTGGACAAA
ATACTTATGAAGACGGCTATGGTGTTGCCCAGGGGCGTGGCCCGCTGCCCGCTACTGCAGGTGCTGCATCCAGTGCTGGTGCAACTGCATACACGGGACCTCAAACTGGT
TCTACTGCAACCCGACCCAACTTCGATGCCACAAGGGGTCCCCAACGAGGACCCGGTTATGATGGAGGAAGAGGATCCATTTATGATTCACAGAGACCTGGTTATGATGG
ACAAAGAGGACCGGGTTACAATGTGCCTGGGTTACCTACTTATGATGCACCTAGAGGAGCTGGCTATGATGCTCAAGCTAGAGGTGTTGCCGGACATGCTGCACCTGGAA
ACACCGCTCCTTATCGGTCTTCAACACCGCCTGGTCGTGGTGGTGGCTACGAAGCACCATCTCGAGGTGGAGGAAACCCCGGTAGGAGATGA
Protein sequenceShow/hide protein sequence
MENTFNLILSFSAPPSFPSSLINLNLFNSVGEQSPGCLSLKRIPTHLRFSNGYRTKAASNPNLKAVEVPRQWYNLVADLPVKPPPPLHPKTFEPIKPEDLSHLFPDELIK
QEASTDRYINIPDEVVDIYGLWRPTPLIRAKRLEKLLDTPARIYYKYEGVSPAGSHKPNSAVPQVWYNVQEGVKNVVTETGAGQWGSSLAFACSIFGIGCEVWQVRASYD
QKPYRRMMMQTWGAKVHPSPSDLTDAGRKFLQIDPSSPGSLGIAISEAVEVAALNADTKYCLGSVLNHVLLHQTVIGEECLKQMEAIGESPDVIIGCTGGGSNFAGLSFP
FLREKLAGKINPVIRAVEPAACPSLTKGVYAYDYGDTAGMTPLMKMHTLGHDFIPDPIHAGGLRYHGMAPLISHVYNLGLLEAISLPQTECFQGAIQFARSEGLIPAPEP
THAIAATIREALCCRETGESKVILTAMCGHGHFDLPAYENKGRIPPPHARRTLPGSGVLHPEAFGHGLRPPPGAFPPFDMLPPPEVMEQKLAGQHVEIQKLATENQRLAA
THGTLRQELAAAQHELQILHAQIGAVKSEREQQARNLSDKIAKMEAELQAAEPIKLELQQAKSDAQNLIVARQELITRVQHLTQDLQRAHGDVQQVPVLMSELESLRQEY
QHCRATYDYEKKLYNDHLESLQVMEKNYITMARELEKLRAELTNTASLERRHGGPYGTTQNNEIEASGNSAGQNTYEDGYGVAQGRGPLPATAGAASSAGATAYTGPQTG
STATRPNFDATRGPQRGPGYDGGRGSIYDSQRPGYDGQRGPGYNVPGLPTYDAPRGAGYDAQARGVAGHAAPGNTAPYRSSTPPGRGGGYEAPSRGGGNPGRR