CuGenDBv2

Spg021292 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg021292
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: protein HLB1-like isoform X2
Genome location: scaffold6:47667814..47681609
RNA-Seq Expression: Spg021292
Synteny: Spg021292
Gene Ontology terms:
GO:0016192 - vesicle-mediated transport (biological process)
GO:0048767 - root hair elongation (biological process)
GO:0005737 - cytoplasm (cellular component)
GO:0012505 - endomembrane system (cellular component)
GO:0043231 - intracellular membrane-bounded organelle (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domains: IPR011990 - Tetratricopeptide-like helical domain superfamily
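
The genome location field above uses a scaffold:start..end notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption; the page does not state the convention), the string can be split and the locus span computed as follows. The function name parse_location is hypothetical and not part of any database tool.

```python
import re


def parse_location(location):
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    scaffold, start, end = match.groups()
    return scaffold, int(start), int(end)


scaffold, start, end = parse_location("scaffold6:47667814..47681609")
# Assumption: coordinates are 1-based and inclusive, hence the +1.
span = end - start + 1
print(scaffold, start, end, span)
```

Under that assumption the locus spans 13796 bp. If the coordinates were 0-based and half-open instead, the span would be one base shorter.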


Homology
GenBank top hits | e value | % identity
KAG7015807.1 Protein HLB1 [Cucurbita argyrosperma subsp. argyrosperma] | 5.4e-194 | 66.72
Query:  MSPTPEEPNNLQNGIETQPHISSESHPTDEEPRSDSQLLADAIPKPELQQERESESLNEEPDSEPESR--------------------------------
        MSPTPEEPNNLQNGIE +PHIS ES+   E  +S+ +  AD +P  ELQQERESES+N   D EP+S                                 
Subjt:  MSPTPEEPNNLQNGIETQPHISSESHPTDEEPRSDSQLLADAIPKPELQQERESESLNEEPDSEPESR--------------------------------

Query:  RKQFAESIQLQVVTDVSDPGFEEPKEPSIPSNGTENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSVNESEGERPEGNSGYSLNQDSPHQPYSEQSRAA
        RKQ +ESIQLQV TDV+DP FEEPK  SI SNGTENSQPALRKDEGSRTFTMRELLNGLK EDGNDS+NESEGE+PE NSGYSLNQDSPHQPYSEQSRAA
Subjt:  RKQFAESIQLQVVTDVSDPGFEEPKEPSIPSNGTENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSVNESEGERPEGNSGYSLNQDSPHQPYSEQSRAA

Query:  MELINSVTGVDEEGRSRQRILTFAARSAKHHGIGEMQGNHLGYASAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPT
        MELINSVTGVDEEGRSRQRILTFAAR                YASAIERN QDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPT
Subjt:  MELINSVTGVDEEGRSRQRILTFAARSAKHHGIGEMQGNHLGYASAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPT

Query:  LHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTV
        LHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTV
Subjt:  LHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTV

Query:  LYGLAEDTLRTG--GNIKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSM--------------------------------------------
        LYGLAEDTLRTG  G +KDVSPNELYSQSAIYIAAAHALKP+YSVYSSALRLVRSM                                            
Subjt:  LYGLAEDTLRTG--GNIKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSM--------------------------------------------

Query:  -----------------------------------------------------------VADSWDALDGWLDAIRLVYTIYARSKNDVLAGIITG
                                                                   VADSWDALDGWLDAIRLVYTIYAR KN+VLAGII G
Subjt:  -----------------------------------------------------------VADSWDALDGWLDAIRLVYTIYARSKNDVLAGIITG

XP_004146133.1 protein HLB1 isoform X1 [Cucumis sativus] | 1.4e-197 | 70.62
Query:  MSPTPEEPNNLQNGIETQPHISSESHPTDEEPRSDSQLLADAIPKPELQQERESESL-NEEPDSEPESRRKQFAESIQLQVVTDVSDPGFEEPKEPSIPS
        MSPTPEEPNNLQNGIE QPHISSES    E      +   D+IP  ELQ+ERESES+ N  PDSEPES RKQ +ESI L VVT V+DP  EE KE S PS
Subjt:  MSPTPEEPNNLQNGIETQPHISSESHPTDEEPRSDSQLLADAIPKPELQQERESESL-NEEPDSEPESRRKQFAESIQLQVVTDVSDPGFEEPKEPSIPS

Query:  NG-TENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSVNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARSAKH
        NG TEN QPALRKDEGSRTFTMRELLNGLKGEDG+DS+NESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAAR    
Subjt:  NG-TENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSVNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARSAKH

Query:  HGIGEMQGNHLGYASAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWK
                    YASAIERN QDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEAT LCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWK
Subjt:  HGIGEMQGNHLGYASAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWK

Query:  QATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG--GNIKDVSPNELYSQSA
        QATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG  GN+KDVSPNELYSQSA
Subjt:  QATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG--GNIKDVSPNELYSQSA

Query:  IYIAAAHALKPNYSVYSSALRLVRSM--------------------------------------------------------------------------
        IYIAAAHALKPNYSVYSSALRLVRSM                                                                          
Subjt:  IYIAAAHALKPNYSVYSSALRLVRSM--------------------------------------------------------------------------

Query:  -----------------------------VADSWDALDGWLDAIRLVYTIYARSKNDVLAGIITG
                                     VADSWD LDGWLDAIRLVYTIYAR KN+VLAGIITG
Subjt:  -----------------------------VADSWDALDGWLDAIRLVYTIYARSKNDVLAGIITG

XP_008448563.1 PREDICTED: uncharacterized protein LOC103490705 isoform X1 [Cucumis melo] | 2.0e-196 | 70.85
Query:  MSPTPEEPNNLQNGIETQPHISSESHPTDEEPRSD-SQLLADAIPKPELQQERESESL-NEEPDSEPESRRKQFAESIQLQVVTDVSDPGFEEPKEPSIP
        MSPTPEEPNNLQNGIE QPHISSES     EPRS+  +  AD+IP  ELQQERESES+ N   DSEPES RKQ +ESI L VVT V+DP  EE KE S P
Subjt:  MSPTPEEPNNLQNGIETQPHISSESHPTDEEPRSD-SQLLADAIPKPELQQERESESL-NEEPDSEPESRRKQFAESIQLQVVTDVSDPGFEEPKEPSIP

Query:  SNG-TENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSVNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARSAK
         NG TEN QPALRKDEGSRTFTMRELLNGLKGEDG+D +NESEGERPEGNSG+SLNQDSPHQPYSEQSRAAMELINS+TGVDEEGRSRQRILTFAAR   
Subjt:  SNG-TENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSVNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARSAK

Query:  HHGIGEMQGNHLGYASAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELW
                     YASAIERN QDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEAT LCPTLHDAFYNWAIAISDRAKMRGRTKEAEELW
Subjt:  HHGIGEMQGNHLGYASAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELW

Query:  KQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG--GNIKDVSPNELYSQS
        KQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG  GNIKDVSPNELYSQS
Subjt:  KQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG--GNIKDVSPNELYSQS

Query:  AIYIAAAHALKPNYSVYSSALRLVRSM-------------------------------------------------------------------------
        AIYIAAAHALKPNYSVYSSALRLVRSM                                                                         
Subjt:  AIYIAAAHALKPNYSVYSSALRLVRSM-------------------------------------------------------------------------

Query:  ------------------------------VADSWDALDGWLDAIRLVYTIYARSKNDVLAGIITG
                                      VADSWDALDGWLDAIRLVYTIYAR KN+VLAGIITG
Subjt:  ------------------------------VADSWDALDGWLDAIRLVYTIYARSKNDVLAGIITG

XP_022965252.1 protein HLB1-like isoform X2 [Cucurbita maxima] | 7.5e-196 | 69.66
Query:  MSPTPEEPNNLQNGIETQPHISSESHPTDEEPRSDSQLLADAIPKPELQQERESESLNEEPDSEPESR----RKQFAESIQLQVVTDVSDPGFEEPKEPS
        MSP PEEPNNLQNGIE +PHIS ES+   E  +S+ +  AD IP  ELQQERESES+N   DSEP+S     RKQ +ESI+LQVVTDV+DP FEEPK  S
Subjt:  MSPTPEEPNNLQNGIETQPHISSESHPTDEEPRSDSQLLADAIPKPELQQERESESLNEEPDSEPESR----RKQFAESIQLQVVTDVSDPGFEEPKEPS

Query:  IPSNGTENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSVNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARSA
        I SNG ENSQPALRKDEGSRTFTMRELLNGLK EDGNDS+NESEGE+PE NSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAAR  
Subjt:  IPSNGTENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSVNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARSA

Query:  KHHGIGEMQGNHLGYASAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEEL
                      YASAIERN QDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEEL
Subjt:  KHHGIGEMQGNHLGYASAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEEL

Query:  WKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG--GNIKDVSPNELYSQ
        WKQAT+NYEKAVQLNWNSPQALNNWGLALQELSAIVPAREK TIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG  G +KDVSPNELYSQ
Subjt:  WKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG--GNIKDVSPNELYSQ

Query:  SAIYIAAAHALKPNYSVYSSALRLVRSM------------------------------------------------------------------------
        SAIYIAAAHALKP+YSVYSSALRLVRSM                                                                        
Subjt:  SAIYIAAAHALKPNYSVYSSALRLVRSM------------------------------------------------------------------------

Query:  -------------------------------VADSWDALDGWLDAIRLVYTIYARSKNDVLAGIITG
                                       VADSWDALDGWLDAIRLVYTIYAR KN+VLAGII G
Subjt:  -------------------------------VADSWDALDGWLDAIRLVYTIYARSKNDVLAGIITG

XP_038876586.1 protein HLB1 [Benincasa hispida] | 2.9e-200 | 71.86
Query:  MSPTPEEPNNLQNGIETQPHISSESHPTDEEPRSDSQLLADAIPKPELQQERESESLNE-EPDSEPESRRKQFAESIQLQVVTDVSDPGFEEPKEPSIPS
        MSPTPEEPNNLQNGIE QPHIS ES  T  EPRS+ +  ADAI   EL QERESES+N    DSEP SRRKQ  ESI LQV TDV+DP FEE KE SIPS
Subjt:  MSPTPEEPNNLQNGIETQPHISSESHPTDEEPRSDSQLLADAIPKPELQQERESESLNE-EPDSEPESRRKQFAESIQLQVVTDVSDPGFEEPKEPSIPS

Query:  NG-TENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSVNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARSAKH
        NG TENS+PALRKDEGSRTFTMRELLNGLKGEDGNDS+NESEGERPEGN GYSLNQDSPHQPYSEQSRAAMELI+SVTGVDEEGRSRQRILTFAAR    
Subjt:  NG-TENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSVNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARSAKH

Query:  HGIGEMQGNHLGYASAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWK
                    YASAIERN QDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWK
Subjt:  HGIGEMQGNHLGYASAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWK

Query:  QATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG--GNIKDVSPNELYSQSA
        QATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG  GN+KDVSPNELYSQSA
Subjt:  QATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG--GNIKDVSPNELYSQSA

Query:  IYIAAAHALKPNYSVYSSALRLVRSM--------------------------------------------------------------------------
        IYIAAAHALKPNYSVYSSALRLVRSM                                                                          
Subjt:  IYIAAAHALKPNYSVYSSALRLVRSM--------------------------------------------------------------------------

Query:  -----------------------------VADSWDALDGWLDAIRLVYTIYARSKNDVLAGIITG
                                     VADSWDALDGWLDAIRLVYTIYAR KN+VLAGIITG
Subjt:  -----------------------------VADSWDALDGWLDAIRLVYTIYARSKNDVLAGIITG

TrEMBL top hits | e value | % identity
A0A0A0L688 Uncharacterized protein | 6.6e-198 | 70.62
Query:  MSPTPEEPNNLQNGIETQPHISSESHPTDEEPRSDSQLLADAIPKPELQQERESESL-NEEPDSEPESRRKQFAESIQLQVVTDVSDPGFEEPKEPSIPS
        MSPTPEEPNNLQNGIE QPHISSES    E      +   D+IP  ELQ+ERESES+ N  PDSEPES RKQ +ESI L VVT V+DP  EE KE S PS
Subjt:  MSPTPEEPNNLQNGIETQPHISSESHPTDEEPRSDSQLLADAIPKPELQQERESESL-NEEPDSEPESRRKQFAESIQLQVVTDVSDPGFEEPKEPSIPS

Query:  NG-TENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSVNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARSAKH
        NG TEN QPALRKDEGSRTFTMRELLNGLKGEDG+DS+NESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAAR    
Subjt:  NG-TENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSVNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARSAKH

Query:  HGIGEMQGNHLGYASAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWK
                    YASAIERN QDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEAT LCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWK
Subjt:  HGIGEMQGNHLGYASAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWK

Query:  QATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG--GNIKDVSPNELYSQSA
        QATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG  GN+KDVSPNELYSQSA
Subjt:  QATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG--GNIKDVSPNELYSQSA

Query:  IYIAAAHALKPNYSVYSSALRLVRSM--------------------------------------------------------------------------
        IYIAAAHALKPNYSVYSSALRLVRSM                                                                          
Subjt:  IYIAAAHALKPNYSVYSSALRLVRSM--------------------------------------------------------------------------

Query:  -----------------------------VADSWDALDGWLDAIRLVYTIYARSKNDVLAGIITG
                                     VADSWD LDGWLDAIRLVYTIYAR KN+VLAGIITG
Subjt:  -----------------------------VADSWDALDGWLDAIRLVYTIYARSKNDVLAGIITG

A0A1S3BJC9 uncharacterized protein LOC103490705 isoform X1 | 9.6e-197 | 70.85
Query:  MSPTPEEPNNLQNGIETQPHISSESHPTDEEPRSD-SQLLADAIPKPELQQERESESL-NEEPDSEPESRRKQFAESIQLQVVTDVSDPGFEEPKEPSIP
        MSPTPEEPNNLQNGIE QPHISSES     EPRS+  +  AD+IP  ELQQERESES+ N   DSEPES RKQ +ESI L VVT V+DP  EE KE S P
Subjt:  MSPTPEEPNNLQNGIETQPHISSESHPTDEEPRSD-SQLLADAIPKPELQQERESESL-NEEPDSEPESRRKQFAESIQLQVVTDVSDPGFEEPKEPSIP

Query:  SNG-TENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSVNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARSAK
         NG TEN QPALRKDEGSRTFTMRELLNGLKGEDG+D +NESEGERPEGNSG+SLNQDSPHQPYSEQSRAAMELINS+TGVDEEGRSRQRILTFAAR   
Subjt:  SNG-TENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSVNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARSAK

Query:  HHGIGEMQGNHLGYASAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELW
                     YASAIERN QDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEAT LCPTLHDAFYNWAIAISDRAKMRGRTKEAEELW
Subjt:  HHGIGEMQGNHLGYASAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELW

Query:  KQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG--GNIKDVSPNELYSQS
        KQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG  GNIKDVSPNELYSQS
Subjt:  KQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG--GNIKDVSPNELYSQS

Query:  AIYIAAAHALKPNYSVYSSALRLVRSM-------------------------------------------------------------------------
        AIYIAAAHALKPNYSVYSSALRLVRSM                                                                         
Subjt:  AIYIAAAHALKPNYSVYSSALRLVRSM-------------------------------------------------------------------------

Query:  ------------------------------VADSWDALDGWLDAIRLVYTIYARSKNDVLAGIITG
                                      VADSWDALDGWLDAIRLVYTIYAR KN+VLAGIITG
Subjt:  ------------------------------VADSWDALDGWLDAIRLVYTIYARSKNDVLAGIITG

A0A6J1EA05 protein HLB1-like | 1.3e-193 | 66.55
Query:  MSPTPEEPNNLQNGIETQPHISSESHPTDEEPRSDSQLLADAIPKPELQQERESESLNEEPDSEPESR--------------------------------
        MSPTPEEPNNLQNGIE +PHIS ES+   E  +S+ +  AD +P  ELQQERE ES+N   D EP+S                                 
Subjt:  MSPTPEEPNNLQNGIETQPHISSESHPTDEEPRSDSQLLADAIPKPELQQERESESLNEEPDSEPESR--------------------------------

Query:  RKQFAESIQLQVVTDVSDPGFEEPKEPSIPSNGTENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSVNESEGERPEGNSGYSLNQDSPHQPYSEQSRAA
        RKQ +ESIQLQV TDV+DP FEEPK  SI SNGTENSQPALRKDEGSRTFTMRELLNGLK EDGNDS+NESEGE+PE NSGYSLNQDSPHQPYSEQSRAA
Subjt:  RKQFAESIQLQVVTDVSDPGFEEPKEPSIPSNGTENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSVNESEGERPEGNSGYSLNQDSPHQPYSEQSRAA

Query:  MELINSVTGVDEEGRSRQRILTFAARSAKHHGIGEMQGNHLGYASAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPT
        MELINSVTGVDEEGRSRQRILTFAAR                YASAIERN QDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPT
Subjt:  MELINSVTGVDEEGRSRQRILTFAARSAKHHGIGEMQGNHLGYASAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPT

Query:  LHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTV
        LHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTV
Subjt:  LHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTV

Query:  LYGLAEDTLRTG--GNIKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSM--------------------------------------------
        LYGLAEDTLRTG  G +KDVSPNELYSQSAIYIAAAHALKP+YSVYSSALRLVRSM                                            
Subjt:  LYGLAEDTLRTG--GNIKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSM--------------------------------------------

Query:  -----------------------------------------------------------VADSWDALDGWLDAIRLVYTIYARSKNDVLAGIITG
                                                                   VADSWDALDGWLDAIRLVYTIYAR KN+VLAGII G
Subjt:  -----------------------------------------------------------VADSWDALDGWLDAIRLVYTIYARSKNDVLAGIITG

A0A6J1HJU5 protein HLB1-like isoform X2 | 3.6e-196 | 69.66
Query:  MSPTPEEPNNLQNGIETQPHISSESHPTDEEPRSDSQLLADAIPKPELQQERESESLNEEPDSEPESR----RKQFAESIQLQVVTDVSDPGFEEPKEPS
        MSP PEEPNNLQNGIE +PHIS ES+   E  +S+ +  AD IP  ELQQERESES+N   DSEP+S     RKQ +ESI+LQVVTDV+DP FEEPK  S
Subjt:  MSPTPEEPNNLQNGIETQPHISSESHPTDEEPRSDSQLLADAIPKPELQQERESESLNEEPDSEPESR----RKQFAESIQLQVVTDVSDPGFEEPKEPS

Query:  IPSNGTENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSVNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARSA
        I SNG ENSQPALRKDEGSRTFTMRELLNGLK EDGNDS+NESEGE+PE NSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAAR  
Subjt:  IPSNGTENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSVNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARSA

Query:  KHHGIGEMQGNHLGYASAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEEL
                      YASAIERN QDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEEL
Subjt:  KHHGIGEMQGNHLGYASAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEEL

Query:  WKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG--GNIKDVSPNELYSQ
        WKQAT+NYEKAVQLNWNSPQALNNWGLALQELSAIVPAREK TIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG  G +KDVSPNELYSQ
Subjt:  WKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG--GNIKDVSPNELYSQ

Query:  SAIYIAAAHALKPNYSVYSSALRLVRSM------------------------------------------------------------------------
        SAIYIAAAHALKP+YSVYSSALRLVRSM                                                                        
Subjt:  SAIYIAAAHALKPNYSVYSSALRLVRSM------------------------------------------------------------------------

Query:  -------------------------------VADSWDALDGWLDAIRLVYTIYARSKNDVLAGIITG
                                       VADSWDALDGWLDAIRLVYTIYAR KN+VLAGII G
Subjt:  -------------------------------VADSWDALDGWLDAIRLVYTIYARSKNDVLAGIITG

A0A6J1HL68 protein HLB1-like isoform X1 | 1.1e-192 | 66.22
Query:  MSPTPEEPNNLQNGIETQPHISSESHPTDEEPRSDSQLLADAIPKPELQQERESESLN--------------------------------EEPDSEPESR
        MSP PEEPNNLQNGIE +PHIS ES+   E  +S+ +  AD +P  ELQQERESES+N                                 EP SE +S 
Subjt:  MSPTPEEPNNLQNGIETQPHISSESHPTDEEPRSDSQLLADAIPKPELQQERESESLN--------------------------------EEPDSEPESR

Query:  RKQFAESIQLQVVTDVSDPGFEEPKEPSIPSNGTENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSVNESEGERPEGNSGYSLNQDSPHQPYSEQSRAA
        RKQ +ESI+LQVVTDV+DP FEEPK  SI SNG ENSQPALRKDEGSRTFTMRELLNGLK EDGNDS+NESEGE+PE NSGYSLNQDSPHQPYSEQSRAA
Subjt:  RKQFAESIQLQVVTDVSDPGFEEPKEPSIPSNGTENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSVNESEGERPEGNSGYSLNQDSPHQPYSEQSRAA

Query:  MELINSVTGVDEEGRSRQRILTFAARSAKHHGIGEMQGNHLGYASAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPT
        MELINSVTGVDEEGRSRQRILTFAAR                YASAIERN QDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPT
Subjt:  MELINSVTGVDEEGRSRQRILTFAARSAKHHGIGEMQGNHLGYASAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPT

Query:  LHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTV
        LHDAFYNWAIAISDRAKMRGRTKEAEELWKQAT+NYEKAVQLNWNSPQALNNWGLALQELSAIVPAREK TIVKTAISKFRAAIQLQFDFHRAIYNLGTV
Subjt:  LHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTV

Query:  LYGLAEDTLRTG--GNIKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSM--------------------------------------------
        LYGLAEDTLRTG  G +KDVSPNELYSQSAIYIAAAHALKP+YSVYSSALRLVRSM                                            
Subjt:  LYGLAEDTLRTG--GNIKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSM--------------------------------------------

Query:  -----------------------------------------------------------VADSWDALDGWLDAIRLVYTIYARSKNDVLAGIITG
                                                                   VADSWDALDGWLDAIRLVYTIYAR KN+VLAGII G
Subjt:  -----------------------------------------------------------VADSWDALDGWLDAIRLVYTIYARSKNDVLAGIITG

SwissProt top hits | e value | % identity
Q9FHY8 Protein HLB1 | 1.4e-123 | 48.82
Query:  MSPTPEEPNNLQNGI-----ET------QPHISSESHPTDEEPRSDSQL--------LADAIPKPELQQERESESLNEEPDSEPESRRKQFAESIQLQVV
        M+ T EEP  LQNG      ET      +P + +E   T E P  ++ L        + DA P+    + +  E      D++PE  + +        VV
Subjt:  MSPTPEEPNNLQNGI-----ET------QPHISSESHPTDEEPRSDSQL--------LADAIPKPELQQERESESLNEEPDSEPESRRKQFAESIQLQVV

Query:  T----DVSDPGFEEPKEPSIPSNGTENSQPAL-----RKDEGSRTFTMRELLNGLKGEDGNDSVNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELI
        T    D++D          IP   TE  Q +      + D+G++TFTMRELL+ LK E+G+ + +         +S    +++S  QP   ++  AM+LI
Subjt:  T----DVSDPGFEEPKEPSIPSNGTENSQPAL-----RKDEGSRTFTMRELLNGLKGEDGNDSVNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELI

Query:  NSVTGVDEEGRSRQRILTFAARSAKHHGIGEMQGNHLGYASAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDA
        N +   DEEGRSRQR+L FAAR                YASAIERN  D+DALYNWAL+LQESADNVSPDS SPSKD LLEEACKKYDEATRLCPTL+DA
Subjt:  NSVTGVDEEGRSRQRILTFAARSAKHHGIGEMQGNHLGYASAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDA

Query:  FYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGL
        +YNWAIAISDRAK+RGRTKEAEELW+QA  NYEKAVQLNWNS QALNNWGL LQELS IVPAREK+ +V+TAISKFRAAI+LQFDFHRAIYNLGTVLYGL
Subjt:  FYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGL

Query:  AEDTLRTG--GNIKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSM------------------------------------------------
        AEDTLRTG  GN KD+ P ELYSQSAIYIAAAH+LKP+YSVYSSALRLVRSM                                                
Subjt:  AEDTLRTG--GNIKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSM------------------------------------------------

Query:  --------------------------------------------------------VADSWDALDGWLDAIRLVYTIYARSKNDVLAGIITG
                                                                VADSW++LDGWLDAIRLVYTIYAR K+DVLAGIITG
Subjt:  --------------------------------------------------------VADSWDALDGWLDAIRLVYTIYARSKNDVLAGIITG

Arabidopsis top hits | e value | % identity
AT5G41950.1 Tetratricopeptide repeat (TPR)-like superfamily protein | 9.7e-125 | 48.82
Query:  MSPTPEEPNNLQNGI-----ET------QPHISSESHPTDEEPRSDSQL--------LADAIPKPELQQERESESLNEEPDSEPESRRKQFAESIQLQVV
        M+ T EEP  LQNG      ET      +P + +E   T E P  ++ L        + DA P+    + +  E      D++PE  + +        VV
Subjt:  MSPTPEEPNNLQNGI-----ET------QPHISSESHPTDEEPRSDSQL--------LADAIPKPELQQERESESLNEEPDSEPESRRKQFAESIQLQVV

Query:  T----DVSDPGFEEPKEPSIPSNGTENSQPAL-----RKDEGSRTFTMRELLNGLKGEDGNDSVNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELI
        T    D++D          IP   TE  Q +      + D+G++TFTMRELL+ LK E+G+ + +         +S    +++S  QP   ++  AM+LI
Subjt:  T----DVSDPGFEEPKEPSIPSNGTENSQPAL-----RKDEGSRTFTMRELLNGLKGEDGNDSVNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELI

Query:  NSVTGVDEEGRSRQRILTFAARSAKHHGIGEMQGNHLGYASAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDA
        N +   DEEGRSRQR+L FAAR                YASAIERN  D+DALYNWAL+LQESADNVSPDS SPSKD LLEEACKKYDEATRLCPTL+DA
Subjt:  NSVTGVDEEGRSRQRILTFAARSAKHHGIGEMQGNHLGYASAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDA

Query:  FYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGL
        +YNWAIAISDRAK+RGRTKEAEELW+QA  NYEKAVQLNWNS QALNNWGL LQELS IVPAREK+ +V+TAISKFRAAI+LQFDFHRAIYNLGTVLYGL
Subjt:  FYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGL

Query:  AEDTLRTG--GNIKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSM------------------------------------------------
        AEDTLRTG  GN KD+ P ELYSQSAIYIAAAH+LKP+YSVYSSALRLVRSM                                                
Subjt:  AEDTLRTG--GNIKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSM------------------------------------------------

Query:  --------------------------------------------------------VADSWDALDGWLDAIRLVYTIYARSKNDVLAGIITG
                                                                VADSW++LDGWLDAIRLVYTIYAR K+DVLAGIITG
Subjt:  --------------------------------------------------------VADSWDALDGWLDAIRLVYTIYARSKNDVLAGIITG


Sequences
CDS sequence
ATGGGCCGAGGCCGACCTCATCCTCCTAGGGTTTTTAAGAATCCGGAGGTGTTTCGGGACGAACCAGGCCAAATCGGGGCAGTCATAGACATTAGGGACCGAAGGGAGGT
GGCCAAGCCCAGCCCGCGCCCGCGGGCCGAATGGTCGGCCTCGGCCATCGGGCCAAATTGGCCAGACCCTTTGGTCCGGTCTTCCTCTGGGTCAACTTTTTGGTCCCACC
TCTGCCCAACTGTCATCGTCAGCTCCTTGTGCGTCAGTGTGTTTTCCGGAACCTTGCTTCGATCCCGTTCTCTTCCTGCTCGCCACTTCACCATGTCGCCTACTCCCGAG
GAACCTAATAATCTGCAGAACGGAATCGAAACCCAACCACACATTTCTTCAGAATCACACCCAACTGATGAAGAACCCAGATCAGACTCACAACTCCTAGCAGATGCAAT
TCCCAAACCTGAATTGCAACAGGAACGCGAATCAGAATCACTCAATGAAGAGCCAGATTCGGAGCCGGAGTCTCGAAGGAAACAGTTCGCCGAGTCAATCCAATTACAGG
TAGTGACGGATGTTTCAGATCCGGGATTTGAAGAGCCGAAAGAACCCTCGATCCCATCCAACGGCACTGAGAACTCGCAACCTGCGCTGCGTAAGGACGAAGGAAGCCGG
ACGTTTACAATGAGGGAGTTGTTGAATGGATTGAAAGGTGAAGATGGTAACGACAGCGTTAACGAATCTGAAGGCGAGAGGCCCGAGGGGAACTCCGGTTACAGTCTTAA
TCAAGATAGCCCACATCAGCCTTATTCTGAACAGAGCAGAGCTGCCATGGAGTTGATCAACAGTGTTACAGGTGTTGATGAAGAGGGCCGTTCTCGCCAACGGATTCTCA
CATTTGCTGCTAGGAGTGCGAAGCACCATGGCATAGGTGAGATGCAAGGCAATCACCTCGGGTATGCTAGTGCAATTGAGAGAAATGCTCAAGACTATGATGCTCTATAC
AATTGGGCTTTGGTCCTCCAGGAGAGTGCAGACAATGTTAGTCCAGATTCCACTTCACCTTCTAAAGACGCATTGCTTGAGGAGGCTTGTAAAAAGTATGATGAGGCTAC
CCGTCTTTGTCCAACACTTCACGATGCCTTCTACAATTGGGCTATTGCAATCTCTGATCGGGCCAAAATGCGTGGTCGTACAAAGGAGGCCGAAGAACTGTGGAAGCAGG
CTACCAAAAATTACGAAAAAGCTGTCCAGCTCAACTGGAATAGTCCCCAGGCGCTAAATAATTGGGGACTTGCTCTACAGGAACTCAGTGCGATTGTGCCAGCACGAGAA
AAGCAAACAATTGTAAAAACAGCTATCAGTAAGTTTCGTGCAGCTATACAGTTGCAATTTGATTTTCATCGAGCAATCTACAACCTTGGCACTGTTCTGTATGGATTAGC
TGAGGACACATTACGGACTGGTGGAAATATTAAGGATGTTTCCCCTAATGAGTTGTACAGCCAATCTGCAATTTATATTGCAGCTGCTCATGCTCTAAAACCAAATTACT
CTGTTTACAGCAGTGCCTTGCGGTTGGTTCGTTCAATGGTTGCTGACTCATGGGACGCGCTCGATGGATGGCTCGACGCAATAAGATTAGTTTACACAATCTATGCTCGA
AGCAAGAACGACGTTTTAGCTGGCATCATAACGGGCTGA
mRNA sequence
ATGGGCCGAGGCCGACCTCATCCTCCTAGGGTTTTTAAGAATCCGGAGGTGTTTCGGGACGAACCAGGCCAAATCGGGGCAGTCATAGACATTAGGGACCGAAGGGAGGT
GGCCAAGCCCAGCCCGCGCCCGCGGGCCGAATGGTCGGCCTCGGCCATCGGGCCAAATTGGCCAGACCCTTTGGTCCGGTCTTCCTCTGGGTCAACTTTTTGGTCCCACC
TCTGCCCAACTGTCATCGTCAGCTCCTTGTGCGTCAGTGTGTTTTCCGGAACCTTGCTTCGATCCCGTTCTCTTCCTGCTCGCCACTTCACCATGTCGCCTACTCCCGAG
GAACCTAATAATCTGCAGAACGGAATCGAAACCCAACCACACATTTCTTCAGAATCACACCCAACTGATGAAGAACCCAGATCAGACTCACAACTCCTAGCAGATGCAAT
TCCCAAACCTGAATTGCAACAGGAACGCGAATCAGAATCACTCAATGAAGAGCCAGATTCGGAGCCGGAGTCTCGAAGGAAACAGTTCGCCGAGTCAATCCAATTACAGG
TAGTGACGGATGTTTCAGATCCGGGATTTGAAGAGCCGAAAGAACCCTCGATCCCATCCAACGGCACTGAGAACTCGCAACCTGCGCTGCGTAAGGACGAAGGAAGCCGG
ACGTTTACAATGAGGGAGTTGTTGAATGGATTGAAAGGTGAAGATGGTAACGACAGCGTTAACGAATCTGAAGGCGAGAGGCCCGAGGGGAACTCCGGTTACAGTCTTAA
TCAAGATAGCCCACATCAGCCTTATTCTGAACAGAGCAGAGCTGCCATGGAGTTGATCAACAGTGTTACAGGTGTTGATGAAGAGGGCCGTTCTCGCCAACGGATTCTCA
CATTTGCTGCTAGGAGTGCGAAGCACCATGGCATAGGTGAGATGCAAGGCAATCACCTCGGGTATGCTAGTGCAATTGAGAGAAATGCTCAAGACTATGATGCTCTATAC
AATTGGGCTTTGGTCCTCCAGGAGAGTGCAGACAATGTTAGTCCAGATTCCACTTCACCTTCTAAAGACGCATTGCTTGAGGAGGCTTGTAAAAAGTATGATGAGGCTAC
CCGTCTTTGTCCAACACTTCACGATGCCTTCTACAATTGGGCTATTGCAATCTCTGATCGGGCCAAAATGCGTGGTCGTACAAAGGAGGCCGAAGAACTGTGGAAGCAGG
CTACCAAAAATTACGAAAAAGCTGTCCAGCTCAACTGGAATAGTCCCCAGGCGCTAAATAATTGGGGACTTGCTCTACAGGAACTCAGTGCGATTGTGCCAGCACGAGAA
AAGCAAACAATTGTAAAAACAGCTATCAGTAAGTTTCGTGCAGCTATACAGTTGCAATTTGATTTTCATCGAGCAATCTACAACCTTGGCACTGTTCTGTATGGATTAGC
TGAGGACACATTACGGACTGGTGGAAATATTAAGGATGTTTCCCCTAATGAGTTGTACAGCCAATCTGCAATTTATATTGCAGCTGCTCATGCTCTAAAACCAAATTACT
CTGTTTACAGCAGTGCCTTGCGGTTGGTTCGTTCAATGGTTGCTGACTCATGGGACGCGCTCGATGGATGGCTCGACGCAATAAGATTAGTTTACACAATCTATGCTCGA
AGCAAGAACGACGTTTTAGCTGGCATCATAACGGGCTGA
Protein sequence
MGRGRPHPPRVFKNPEVFRDEPGQIGAVIDIRDRREVAKPSPRPRAEWSASAIGPNWPDPLVRSSSGSTFWSHLCPTVIVSSLCVSVFSGTLLRSRSLPARHFTMSPTPE
EPNNLQNGIETQPHISSESHPTDEEPRSDSQLLADAIPKPELQQERESESLNEEPDSEPESRRKQFAESIQLQVVTDVSDPGFEEPKEPSIPSNGTENSQPALRKDEGSR
TFTMRELLNGLKGEDGNDSVNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARSAKHHGIGEMQGNHLGYASAIERNAQDYDALY
NWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPARE
KQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGNIKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMVADSWDALDGWLDAIRLVYTIYAR
SKNDVLAGIITG
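
The CDS and protein sequences above can be cross-checked against each other. The following is a minimal sketch, assuming the standard nuclear genetic code (an assumption; the page does not say which translation table was used). The names CODON_TABLE and translate are hypothetical helpers written for this illustration, and only the first ten codons and the last six codons of the CDS are used as input.

```python
from itertools import product

# Standard genetic code in the conventional TCAG codon order
# (assumption: this record uses the standard nuclear code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nucleotides of the CDS listed above.
cds_start = "ATGGGCCGAGGCCGACCTCATCCTCCTAGG"
print(translate(cds_start))  # prints MGRGRPHPPR

# Last 18 nucleotides of the CDS, including the TGA stop codon.
cds_end = "GGCATCATAACGGGCTGA"
print(translate(cds_end))  # prints GIITG
```

Both fragments agree with the start (MGRGRPHPPR) and end (GIITG) of the protein sequence listed above. Passing the full CDS to the same function would allow a complete comparison.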