; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG09G022720 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG09G022720
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionATP-dependent zinc metalloprotease
Genome locationCG_Chr09:39759384..39761911
RNA-Seq ExpressionClCG09G022720
SyntenyClCG09G022720
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0016020 - membrane (cellular component)
GO:0004176 - ATP-dependent peptidase activity (molecular function)
GO:0004222 - metalloendopeptidase activity (molecular function)
GO:0005524 - ATP binding (molecular function)
InterPro domainsIPR037219 - Peptidase M41-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004139896.1 uncharacterized protein LOC101213430 [Cucumis sativus]1.0e-19974.71Show/hide
Query:  MAILSPPKLLISSSLLQFHHFHYPIPFNFQQKNPNGINKHFHLE--SHQRLLPLSRALREWQDYEEAVKRKDLAEALRFLESFDRESAIEPINDSAPAGS
        MAILSPPKLLISSSL Q   FHYPIPF+FQQKNPNGINK+FHLE   HQRLLPLSRALREWQDYEEAVKRKDLAEALRFLESFDR+SAIEPI DSAPAGS
Subjt:  MAILSPPKLLISSSLLQFHHFHYPIPFNFQQKNPNGINKHFHLE--SHQRLLPLSRALREWQDYEEAVKRKDLAEALRFLESFDRESAIEPINDSAPAGS

Query:  APSALGNPRLSGWERDWEVLDTCLNADDMKLVADAYGFLRDRGFLPNFGKCRNIGTPLSLSPPCEFISLVLKNASIEKRLSLSASINLFSMYLHAVLQGL
        APSA+ N RLSGWERDWEVLDTCLNADDMKLVA+AY FL+DRGFLPNFGKCRNI                                              
Subjt:  APSALGNPRLSGWERDWEVLDTCLNADDMKLVADAYGFLRDRGFLPNFGKCRNIGTPLSLSPPCEFISLVLKNASIEKRLSLSASINLFSMYLHAVLQGL

Query:  YFIVDFQYPFYLMICFYTTYLSTVTDELAIPAVLEGRRDVTPSVLESTTGLEVSKLSPKKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAF
                                        VLEGRRDVTPSVLE TTGLEVSKLSPKKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAF
Subjt:  YFIVDFQYPFYLMICFYTTYLSTVTDELAIPAVLEGRRDVTPSVLESTTGLEVSKLSPKKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAF

Query:  LDSILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASSLAEGRLDGTSFDSDHMSGLSFMPDTL
        LDSILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMAS+LAEGRLDGTSFD              
Subjt:  LDSILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASSLAEGRLDGTSFDSDHMSGLSFMPDTL

Query:  YIITNLEMPWAMIGVRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVI
                       RYCMVLFAGIAAEALVYGEAEGGENDENLFRSIC+LLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKA+ESGSSLSVVI
Subjt:  YIITNLEMPWAMIGVRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVI

Query:  RRIEDALSTN
        R+IEDALSTN
Subjt:  RRIEDALSTN

XP_008447096.1 PREDICTED: uncharacterized protein LOC103489633 isoform X1 [Cucumis melo]3.4e-20375.69Show/hide
Query:  MAILSPPKLLISSSLLQFHHFHYPIPFNFQQKNPNGINKHFHLESH--QRLLPLSRALREWQDYEEAVKRKDLAEALRFLESFDRESAIEPINDSAPAGS
        MAILSPPKLLISSSLLQ   FHYPIPF+FQQKNPNGINKHFHL+ H  QRLLPLSRALREWQDYEEAVKRKDLAEALRFLESFDR+SAIEPINDSAPAGS
Subjt:  MAILSPPKLLISSSLLQFHHFHYPIPFNFQQKNPNGINKHFHLESH--QRLLPLSRALREWQDYEEAVKRKDLAEALRFLESFDRESAIEPINDSAPAGS

Query:  APSALGNPRLSGWERDWEVLDTCLNADDMKLVADAYGFLRDRGFLPNFGKCRNIGTPLSLSPPCEFISLVLKNASIEKRLSLSASINLFSMYLHAVLQGL
        APSA+GN RLSGWERDWEVLDTCLNADDMKLVA+AY FL+DRGFLPNFGKCRNI                                              
Subjt:  APSALGNPRLSGWERDWEVLDTCLNADDMKLVADAYGFLRDRGFLPNFGKCRNIGTPLSLSPPCEFISLVLKNASIEKRLSLSASINLFSMYLHAVLQGL

Query:  YFIVDFQYPFYLMICFYTTYLSTVTDELAIPAVLEGRRDVTPSVLESTTGLEVSKLSPKKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAF
                                        VLEG+RDVTPSVLESTTGLEVSKLSPKKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAF
Subjt:  YFIVDFQYPFYLMICFYTTYLSTVTDELAIPAVLEGRRDVTPSVLESTTGLEVSKLSPKKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAF

Query:  LDSILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASSLAEGRLDGTSFDSDHMSGLSFMPDTL
        LDSILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMAS+LAEGRLDGTSFD              
Subjt:  LDSILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASSLAEGRLDGTSFDSDHMSGLSFMPDTL

Query:  YIITNLEMPWAMIGVRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVI
                       RYCMVLFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKA+ESGSSLSVVI
Subjt:  YIITNLEMPWAMIGVRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVI

Query:  RRIEDALSTN
        RRIEDALSTN
Subjt:  RRIEDALSTN

XP_022969425.1 uncharacterized protein LOC111468437 isoform X1 [Cucurbita maxima]2.4e-19373.08Show/hide
Query:  MAILSPPKLLISSSLLQFHHFHYPIPFNFQQKNPNGINKHFHLESHQRLLPLSRALREWQDYEEAVKRKDLAEALRFLESFDRESAIEPINDSAPAGSAP
        M+I SPPKLLIS SLLQF  FH P+PF+FQQK  NGIN+HFHL+ HQRLL L RA+REWQ+YEEAVKRKDLAEALRFLESF RESAIEP NDSA A SAP
Subjt:  MAILSPPKLLISSSLLQFHHFHYPIPFNFQQKNPNGINKHFHLESHQRLLPLSRALREWQDYEEAVKRKDLAEALRFLESFDRESAIEPINDSAPAGSAP

Query:  SALGNPRLSGWERDWEVLDTCLNADDMKLVADAYGFLRDRGFLPNFGKCRNIGTPLSLSPPCEFISLVLKNASIEKRLSLSASINLFSMYLHAVLQGLYF
        SALGNPRLSGWERDWEVLDTCLNADDMKLVA+AYGFLRDRGFLPNFGKCRNI                                                
Subjt:  SALGNPRLSGWERDWEVLDTCLNADDMKLVADAYGFLRDRGFLPNFGKCRNIGTPLSLSPPCEFISLVLKNASIEKRLSLSASINLFSMYLHAVLQGLYF

Query:  IVDFQYPFYLMICFYTTYLSTVTDELAIPAVLEGRRDVTPSVLESTTGLEVSKLSPKKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLD
                                      VLEG RDVTPSVLESTTGLEVSKLSPKKWGLSGSSRYALIA LGGTSFLLSQDIDIRPNL ALLGLAFLD
Subjt:  IVDFQYPFYLMICFYTTYLSTVTDELAIPAVLEGRRDVTPSVLESTTGLEVSKLSPKKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLD

Query:  SILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASSLAEGRLDGTSFDSDHMSGLSFMPDTLYI
        SILLGGTCLAQISS WPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMAS+LAEGRLDGTSFD                
Subjt:  SILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASSLAEGRLDGTSFDSDHMSGLSFMPDTLYI

Query:  ITNLEMPWAMIGVRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRR
                     RYCMVLFAGIAAEALVYGEAEGGENDENLFRSIC+LLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRR
Subjt:  ITNLEMPWAMIGVRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRR

Query:  IEDALSTNR
        +E+ALSTNR
Subjt:  IEDALSTNR

XP_023511731.1 uncharacterized protein LOC111776502 [Cucurbita pepo subsp. pepo]6.0e-19272.89Show/hide
Query:  MAILSPPKLLISSSLLQFHHFHYPIPFNFQQKNPNGINKHFHLESHQRLLPLSRALREWQDYEEAVKRKDLAEALRFLESFDRESAIEPINDSAPAGSAP
        M+I SPPKLLIS SLLQF  FH P PF+FQQK  NGINKHFHL  HQRLL L RA+REWQ+YEEAVKRKDLAEALRFLES  RESAIEP NDSA + SAP
Subjt:  MAILSPPKLLISSSLLQFHHFHYPIPFNFQQKNPNGINKHFHLESHQRLLPLSRALREWQDYEEAVKRKDLAEALRFLESFDRESAIEPINDSAPAGSAP

Query:  SALGNPRLSGWERDWEVLDTCLNADDMKLVADAYGFLRDRGFLPNFGKCRNIGTPLSLSPPCEFISLVLKNASIEKRLSLSASINLFSMYLHAVLQGLYF
        SALGNPRLSGWERDWEVLDTCLNADDMKLVA+AYGFLRDRGFLPNFGKCRNI                                                
Subjt:  SALGNPRLSGWERDWEVLDTCLNADDMKLVADAYGFLRDRGFLPNFGKCRNIGTPLSLSPPCEFISLVLKNASIEKRLSLSASINLFSMYLHAVLQGLYF

Query:  IVDFQYPFYLMICFYTTYLSTVTDELAIPAVLEGRRDVTPSVLESTTGLEVSKLSPKKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLD
                                      VLEG RDVTPSVLESTTGLEV KLSPKKWGLSGSSRYALIA LGGTSFLLSQDIDIRPNL ALLGLAFLD
Subjt:  IVDFQYPFYLMICFYTTYLSTVTDELAIPAVLEGRRDVTPSVLESTTGLEVSKLSPKKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLD

Query:  SILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASSLAEGRLDGTSFDSDHMSGLSFMPDTLYI
        SILLGGTCLAQISS WPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMAS+LAEGRLDGTSFD                
Subjt:  SILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASSLAEGRLDGTSFDSDHMSGLSFMPDTLYI

Query:  ITNLEMPWAMIGVRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRR
                     RYCMVLFAGIAAEALVYGEAEGGENDENLFRSIC+LLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRR
Subjt:  ITNLEMPWAMIGVRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRR

Query:  IEDALSTNR
        IE+ALSTNR
Subjt:  IEDALSTNR

XP_038888049.1 uncharacterized protein LOC120077976 isoform X1 [Benincasa hispida]1.1e-20676.62Show/hide
Query:  MAILSPPKLLISSSLLQFHHFHYPIPFNFQQKNPNGINKHFHLESHQRLLPLSRALREWQDYEEAVKRKDLAEALRFLESFDRESAIEPINDSAPAGSAP
        MA+LSPPKLLISSSLLQF   HYPIPFNFQQKNPNGINKHF+LE HQRLLPLSRAL EWQDYEEAVKRKDLAEALRFLESFDR+SAIEPINDSAPAGSAP
Subjt:  MAILSPPKLLISSSLLQFHHFHYPIPFNFQQKNPNGINKHFHLESHQRLLPLSRALREWQDYEEAVKRKDLAEALRFLESFDRESAIEPINDSAPAGSAP

Query:  SALGNPRLSGWERDWEVLDTCLNADDMKLVADAYGFLRDRGFLPNFGKCRNIGTPLSLSPPCEFISLVLKNASIEKRLSLSASINLFSMYLHAVLQGLYF
        SAL NPRLSGWERDWEVLDTCLNADDMKLVADAYGFLRDRGFLPNFGK RNI                                                
Subjt:  SALGNPRLSGWERDWEVLDTCLNADDMKLVADAYGFLRDRGFLPNFGKCRNIGTPLSLSPPCEFISLVLKNASIEKRLSLSASINLFSMYLHAVLQGLYF

Query:  IVDFQYPFYLMICFYTTYLSTVTDELAIPAVLEGRRDVTPSVLESTTGLEVSKLSPKKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLD
                                      VLEGRRDVTPSVLESTTGLEVSKLSPKKWG+SGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLD
Subjt:  IVDFQYPFYLMICFYTTYLSTVTDELAIPAVLEGRRDVTPSVLESTTGLEVSKLSPKKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLD

Query:  SILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASSLAEGRLDGTSFDSDHMSGLSFMPDTLYI
        SILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASSLAEGRLDGTSFD                
Subjt:  SILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASSLAEGRLDGTSFDSDHMSGLSFMPDTLYI

Query:  ITNLEMPWAMIGVRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRR
                     RYCMVLFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQ AVKALESGSSLSVVIRR
Subjt:  ITNLEMPWAMIGVRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRR

Query:  IEDALSTNR
        IEDALSTNR
Subjt:  IEDALSTNR

TrEMBL top hitse value%identityAlignment
A0A0A0K7I5 Uncharacterized protein4.9e-20074.71Show/hide
Query:  MAILSPPKLLISSSLLQFHHFHYPIPFNFQQKNPNGINKHFHLE--SHQRLLPLSRALREWQDYEEAVKRKDLAEALRFLESFDRESAIEPINDSAPAGS
        MAILSPPKLLISSSL Q   FHYPIPF+FQQKNPNGINK+FHLE   HQRLLPLSRALREWQDYEEAVKRKDLAEALRFLESFDR+SAIEPI DSAPAGS
Subjt:  MAILSPPKLLISSSLLQFHHFHYPIPFNFQQKNPNGINKHFHLE--SHQRLLPLSRALREWQDYEEAVKRKDLAEALRFLESFDRESAIEPINDSAPAGS

Query:  APSALGNPRLSGWERDWEVLDTCLNADDMKLVADAYGFLRDRGFLPNFGKCRNIGTPLSLSPPCEFISLVLKNASIEKRLSLSASINLFSMYLHAVLQGL
        APSA+ N RLSGWERDWEVLDTCLNADDMKLVA+AY FL+DRGFLPNFGKCRNI                                              
Subjt:  APSALGNPRLSGWERDWEVLDTCLNADDMKLVADAYGFLRDRGFLPNFGKCRNIGTPLSLSPPCEFISLVLKNASIEKRLSLSASINLFSMYLHAVLQGL

Query:  YFIVDFQYPFYLMICFYTTYLSTVTDELAIPAVLEGRRDVTPSVLESTTGLEVSKLSPKKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAF
                                        VLEGRRDVTPSVLE TTGLEVSKLSPKKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAF
Subjt:  YFIVDFQYPFYLMICFYTTYLSTVTDELAIPAVLEGRRDVTPSVLESTTGLEVSKLSPKKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAF

Query:  LDSILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASSLAEGRLDGTSFDSDHMSGLSFMPDTL
        LDSILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMAS+LAEGRLDGTSFD              
Subjt:  LDSILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASSLAEGRLDGTSFDSDHMSGLSFMPDTL

Query:  YIITNLEMPWAMIGVRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVI
                       RYCMVLFAGIAAEALVYGEAEGGENDENLFRSIC+LLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKA+ESGSSLSVVI
Subjt:  YIITNLEMPWAMIGVRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVI

Query:  RRIEDALSTN
        R+IEDALSTN
Subjt:  RRIEDALSTN

A0A1S3BH83 uncharacterized protein LOC103489633 isoform X11.6e-20375.69Show/hide
Query:  MAILSPPKLLISSSLLQFHHFHYPIPFNFQQKNPNGINKHFHLESH--QRLLPLSRALREWQDYEEAVKRKDLAEALRFLESFDRESAIEPINDSAPAGS
        MAILSPPKLLISSSLLQ   FHYPIPF+FQQKNPNGINKHFHL+ H  QRLLPLSRALREWQDYEEAVKRKDLAEALRFLESFDR+SAIEPINDSAPAGS
Subjt:  MAILSPPKLLISSSLLQFHHFHYPIPFNFQQKNPNGINKHFHLESH--QRLLPLSRALREWQDYEEAVKRKDLAEALRFLESFDRESAIEPINDSAPAGS

Query:  APSALGNPRLSGWERDWEVLDTCLNADDMKLVADAYGFLRDRGFLPNFGKCRNIGTPLSLSPPCEFISLVLKNASIEKRLSLSASINLFSMYLHAVLQGL
        APSA+GN RLSGWERDWEVLDTCLNADDMKLVA+AY FL+DRGFLPNFGKCRNI                                              
Subjt:  APSALGNPRLSGWERDWEVLDTCLNADDMKLVADAYGFLRDRGFLPNFGKCRNIGTPLSLSPPCEFISLVLKNASIEKRLSLSASINLFSMYLHAVLQGL

Query:  YFIVDFQYPFYLMICFYTTYLSTVTDELAIPAVLEGRRDVTPSVLESTTGLEVSKLSPKKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAF
                                        VLEG+RDVTPSVLESTTGLEVSKLSPKKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAF
Subjt:  YFIVDFQYPFYLMICFYTTYLSTVTDELAIPAVLEGRRDVTPSVLESTTGLEVSKLSPKKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAF

Query:  LDSILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASSLAEGRLDGTSFDSDHMSGLSFMPDTL
        LDSILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMAS+LAEGRLDGTSFD              
Subjt:  LDSILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASSLAEGRLDGTSFDSDHMSGLSFMPDTL

Query:  YIITNLEMPWAMIGVRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVI
                       RYCMVLFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKA+ESGSSLSVVI
Subjt:  YIITNLEMPWAMIGVRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVI

Query:  RRIEDALSTN
        RRIEDALSTN
Subjt:  RRIEDALSTN

A0A5A7U732 Uncharacterized protein1.6e-20375.69Show/hide
Query:  MAILSPPKLLISSSLLQFHHFHYPIPFNFQQKNPNGINKHFHLESH--QRLLPLSRALREWQDYEEAVKRKDLAEALRFLESFDRESAIEPINDSAPAGS
        MAILSPPKLLISSSLLQ   FHYPIPF+FQQKNPNGINKHFHL+ H  QRLLPLSRALREWQDYEEAVKRKDLAEALRFLESFDR+SAIEPINDSAPAGS
Subjt:  MAILSPPKLLISSSLLQFHHFHYPIPFNFQQKNPNGINKHFHLESH--QRLLPLSRALREWQDYEEAVKRKDLAEALRFLESFDRESAIEPINDSAPAGS

Query:  APSALGNPRLSGWERDWEVLDTCLNADDMKLVADAYGFLRDRGFLPNFGKCRNIGTPLSLSPPCEFISLVLKNASIEKRLSLSASINLFSMYLHAVLQGL
        APSA+GN RLSGWERDWEVLDTCLNADDMKLVA+AY FL+DRGFLPNFGKCRNI                                              
Subjt:  APSALGNPRLSGWERDWEVLDTCLNADDMKLVADAYGFLRDRGFLPNFGKCRNIGTPLSLSPPCEFISLVLKNASIEKRLSLSASINLFSMYLHAVLQGL

Query:  YFIVDFQYPFYLMICFYTTYLSTVTDELAIPAVLEGRRDVTPSVLESTTGLEVSKLSPKKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAF
                                        VLEG+RDVTPSVLESTTGLEVSKLSPKKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAF
Subjt:  YFIVDFQYPFYLMICFYTTYLSTVTDELAIPAVLEGRRDVTPSVLESTTGLEVSKLSPKKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAF

Query:  LDSILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASSLAEGRLDGTSFDSDHMSGLSFMPDTL
        LDSILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMAS+LAEGRLDGTSFD              
Subjt:  LDSILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASSLAEGRLDGTSFDSDHMSGLSFMPDTL

Query:  YIITNLEMPWAMIGVRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVI
                       RYCMVLFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKA+ESGSSLSVVI
Subjt:  YIITNLEMPWAMIGVRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVI

Query:  RRIEDALSTN
        RRIEDALSTN
Subjt:  RRIEDALSTN

A0A6J1D1P2 uncharacterized protein LOC1110167838.4e-19272.24Show/hide
Query:  MAILSPPKLLISSSLLQFHHFHYPIPFNFQQKNPNGINKHFHLESHQRLLPLSRALREWQDYEEAVKRKDLAEALRFLESFDRESAIEPINDSAPAGSAP
        MAI SPPKL ISSS L F  F + I F+F QK P GI +HFHLE  QRLL L RALREWQDYEEAVKRKDLAEALRFLESFDR+SAIEP+NDSA A SAP
Subjt:  MAILSPPKLLISSSLLQFHHFHYPIPFNFQQKNPNGINKHFHLESHQRLLPLSRALREWQDYEEAVKRKDLAEALRFLESFDRESAIEPINDSAPAGSAP

Query:  SALGNPRLSGWERDWEVLDTCLNADDMKLVADAYGFLRDRGFLPNFGKCRNIGTPLSLSPPCEFISLVLKNASIEKRLSLSASINLFSMYLHAVLQGLYF
        SAL NPRLSGWERDWEVLDTCLNADDMKLVA+AYGFLRDRGFLPNFGKCRNI                                                
Subjt:  SALGNPRLSGWERDWEVLDTCLNADDMKLVADAYGFLRDRGFLPNFGKCRNIGTPLSLSPPCEFISLVLKNASIEKRLSLSASINLFSMYLHAVLQGLYF

Query:  IVDFQYPFYLMICFYTTYLSTVTDELAIPAVLEGRRDVTPSVLESTTGLEVSKLSPKKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLD
                                      VLEGRRDVTPSVLES+TGL+V+KLSPKKWGLSGSS YALIAFLGGTSFLLS+DIDIRPNLLALLGLAFLD
Subjt:  IVDFQYPFYLMICFYTTYLSTVTDELAIPAVLEGRRDVTPSVLESTTGLEVSKLSPKKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLD

Query:  SILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASSLAEGRLDGTSFDSDHMSGLSFMPDTLYI
        SILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMAS+LAEGRLDGTSFD                
Subjt:  SILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASSLAEGRLDGTSFDSDHMSGLSFMPDTLYI

Query:  ITNLEMPWAMIGVRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRR
                     RYCM+LFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIR+
Subjt:  ITNLEMPWAMIGVRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRR

Query:  IEDALSTN
        IEDALSTN
Subjt:  IEDALSTN

A0A6J1HZW5 uncharacterized protein LOC111468437 isoform X11.2e-19373.08Show/hide
Query:  MAILSPPKLLISSSLLQFHHFHYPIPFNFQQKNPNGINKHFHLESHQRLLPLSRALREWQDYEEAVKRKDLAEALRFLESFDRESAIEPINDSAPAGSAP
        M+I SPPKLLIS SLLQF  FH P+PF+FQQK  NGIN+HFHL+ HQRLL L RA+REWQ+YEEAVKRKDLAEALRFLESF RESAIEP NDSA A SAP
Subjt:  MAILSPPKLLISSSLLQFHHFHYPIPFNFQQKNPNGINKHFHLESHQRLLPLSRALREWQDYEEAVKRKDLAEALRFLESFDRESAIEPINDSAPAGSAP

Query:  SALGNPRLSGWERDWEVLDTCLNADDMKLVADAYGFLRDRGFLPNFGKCRNIGTPLSLSPPCEFISLVLKNASIEKRLSLSASINLFSMYLHAVLQGLYF
        SALGNPRLSGWERDWEVLDTCLNADDMKLVA+AYGFLRDRGFLPNFGKCRNI                                                
Subjt:  SALGNPRLSGWERDWEVLDTCLNADDMKLVADAYGFLRDRGFLPNFGKCRNIGTPLSLSPPCEFISLVLKNASIEKRLSLSASINLFSMYLHAVLQGLYF

Query:  IVDFQYPFYLMICFYTTYLSTVTDELAIPAVLEGRRDVTPSVLESTTGLEVSKLSPKKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLD
                                      VLEG RDVTPSVLESTTGLEVSKLSPKKWGLSGSSRYALIA LGGTSFLLSQDIDIRPNL ALLGLAFLD
Subjt:  IVDFQYPFYLMICFYTTYLSTVTDELAIPAVLEGRRDVTPSVLESTTGLEVSKLSPKKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLD

Query:  SILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASSLAEGRLDGTSFDSDHMSGLSFMPDTLYI
        SILLGGTCLAQISS WPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMAS+LAEGRLDGTSFD                
Subjt:  SILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASSLAEGRLDGTSFDSDHMSGLSFMPDTLYI

Query:  ITNLEMPWAMIGVRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRR
                     RYCMVLFAGIAAEALVYGEAEGGENDENLFRSIC+LLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRR
Subjt:  ITNLEMPWAMIGVRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRR

Query:  IEDALSTNR
        +E+ALSTNR
Subjt:  IEDALSTNR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G56180.1 unknown protein1.3e-12854.95Show/hide
Query:  ALREWQDYEEAVKRKDLAEALRFLESFDRESAIEPINDSAPAGSAPSALGNPRLSGWERDWEVLDTCLNADDMKLVADAYGFLRDRGFLPNFGKCRNIGT
        ALREW++YE+AVKRKDLA ALRFL+S + +   + +     A    S LG   L   ERDW+VLD CLNADDM+LV  A+ FL++RG L NFGK  +I  
Subjt:  ALREWQDYEEAVKRKDLAEALRFLESFDRESAIEPINDSAPAGSAPSALGNPRLSGWERDWEVLDTCLNADDMKLVADAYGFLRDRGFLPNFGKCRNIGT

Query:  PLSLSPPCEFISLVLKNASIEKRLSLSASINLFSMYLHAVLQGLYFIVDFQYPFYLMICFYTTYLSTVTDELAIPAVLEGRRDVTPSVLESTTGLEVSKL
                                                                                    VLEG R+VTP+VL+S TGLEV+KL
Subjt:  PLSLSPPCEFISLVLKNASIEKRLSLSASINLFSMYLHAVLQGLYFIVDFQYPFYLMICFYTTYLSTVTDELAIPAVLEGRRDVTPSVLESTTGLEVSKL

Query:  SPKKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQM
        SPKKWGLSG S  AL A LGG S+LLSQ+ID+RPNL  +LGLA+LDS+ LGGTCLAQ+S YWPP++RRI+VHEAGHLL AYLMGCPIRGVILDP+VAMQM
Subjt:  SPKKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQM

Query:  GIQGQAGTQFWDEKMASSLAEGRLDGTSFDSDHMSGLSFMPDTLYIITNLEMPWAMIGVRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICILLQPPL
        G+QGQAGTQFWD+KM S +AEGRL G+SFD                             RY MVLFAGIAAEALVYGEAEGGENDENLFRSI +LL+PPL
Subjt:  GIQGQAGTQFWDEKMASSLAEGRLDGTSFDSDHMSGLSFMPDTLYIITNLEMPWAMIGVRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICILLQPPL

Query:  SVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRRIEDALSTNR
        SVAQMSNQARW+VLQSYNLLKWHK AH+ AV+AL+ GS LS+VIRRIE+A+S+++
Subjt:  SVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRRIEDALSTNR

AT2G21960.1 unknown protein2.3e-1630.61Show/hide
Query:  LAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASSLAEGRLDGTSFDSDHMSGLSFMPDTLYIITNLEMPW
        ++  S+++P Y+ RI  HEA H L AYL+G PI G  LD          G+      DE++A  +  G+LD    D                        
Subjt:  LAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASSLAEGRLDGTSFDSDHMSGLSFMPDTLYIITNLEMPW

Query:  AMIGVRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRRIEDA
             R   V  AG+AAE L Y +  G   D    +      QP +S  Q  N  RWAVL S +LLK +K  H+  + A+   +S+   I+ IE A
Subjt:  AMIGVRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRRIEDA

AT5G27290.1 unknown protein2.8e-1426.69Show/hide
Query:  SKLSPKKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAF-----LDSILLGGTCLAQI-----SSYWPPYRRRILVHEAGHLLTAYLMGCPI
        S LSP    L    R   IA + G   +  +  D+    L  L L F     LD +   G   + +      ++   Y  R++ HEAGH L AYL+G   
Subjt:  SKLSPKKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAF-----LDSILLGGTCLAQI-----SSYWPPYRRRILVHEAGHLLTAYLMGCPI

Query:  RGVILDPIVAMQM--GIQGQAGTQFWDEKMASSLAEGRLDGTSFDSDHMSGLSFMPDTLYIITNLEMPWAMIGVRYCMVLFAGIAAEALVYGEAEGGEND
        RG  L  + A+Q    +  QAG+ F D +    +  G++  T  +                             R+  +  AG+A E L+YG AEGG +D
Subjt:  RGVILDPIVAMQM--GIQGQAGTQFWDEKMASSLAEGRLDGTSFDSDHMSGLSFMPDTLYIITNLEMPWAMIGVRYCMVLFAGIAAEALVYGEAEGGEND

Query:  ENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRRIEDAL
         +    +   L    +  +  +Q RW+VL +  LL+ H+ A     +A+  G S+   I+ IED++
Subjt:  ENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRRIEDAL

AT5G27290.2 unknown protein9.1e-0528.87Show/hide
Query:  SKLSPKKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAF-----LDSILLGGTCLAQI-----SSYWPPYRRRILVHEAGHLLTAYLMGCPI
        S LSP    L    R   IA + G   +  +  D+    L  L L F     LD +   G   + +      ++   Y  R++ HEAGH L AYL+G   
Subjt:  SKLSPKKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAF-----LDSILLGGTCLAQI-----SSYWPPYRRRILVHEAGHLLTAYLMGCPI

Query:  RGVILDPIVAMQM--GIQGQAGTQFWDEKMASSLAEGRLDGT
        RG  L  + A+Q    +  QAG+ F D +    +  G++  T
Subjt:  RGVILDPIVAMQM--GIQGQAGTQFWDEKMASSLAEGRLDGT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTATCCTTAGTCCTCCCAAACTCCTAATTTCATCTTCTCTTCTCCAATTCCACCATTTTCATTACCCAATTCCCTTCAATTTTCAACAGAAAAACCCTAATGGAAT
CAATAAACATTTCCATTTAGAAAGCCATCAGCGTCTCCTCCCTCTGTCTAGAGCTCTTCGCGAATGGCAAGATTACGAAGAGGCAGTGAAGCGCAAGGATCTCGCTGAAG
CTCTTAGGTTTCTCGAATCCTTTGACAGAGAAAGCGCAATCGAACCCATTAATGATTCGGCACCTGCTGGTTCAGCTCCGTCTGCTCTTGGGAATCCGCGGTTATCTGGC
TGGGAGAGGGACTGGGAGGTACTAGACACTTGTTTAAATGCGGATGATATGAAGCTTGTTGCCGATGCTTATGGGTTTCTCAGGGACAGAGGATTTTTGCCCAATTTTGG
AAAGTGCAGGAACATTGGTACACCCCTTTCTCTGTCTCCACCATGTGAATTTATTTCATTAGTGTTGAAGAATGCAAGCATCGAGAAACGACTTTCCCTTTCGGCTTCCA
TCAATCTCTTTAGCATGTATCTACACGCCGTATTGCAGGGGCTTTACTTTATTGTTGATTTTCAGTACCCATTTTACTTGATGATTTGTTTTTATACTACTTATTTGAGC
ACAGTGACTGATGAACTAGCCATACCTGCAGTTTTGGAGGGTCGAAGAGATGTCACGCCGTCTGTGTTGGAGTCTACAACTGGATTAGAAGTCTCCAAGTTATCTCCAAA
GAAATGGGGTCTTTCAGGCAGCTCTCGTTACGCTTTGATTGCTTTTCTTGGTGGAACATCCTTTCTGCTCTCGCAGGACATAGATATTAGGCCAAACCTTTTGGCACTGC
TGGGGCTGGCATTTTTGGATTCTATCCTCCTTGGTGGTACTTGTCTAGCACAAATCTCCAGCTATTGGCCACCATATAGGCGTCGAATCCTTGTACACGAAGCTGGACAT
CTACTGACTGCTTACCTCATGGGCTGCCCGATTCGTGGAGTGATTTTGGATCCGATCGTTGCCATGCAAATGGGGATACAAGGACAGGCAGGTACCCAGTTTTGGGATGA
AAAAATGGCAAGCAGTCTTGCTGAAGGACGTTTGGATGGTACTTCCTTTGACAGTGATCATATGAGTGGACTCAGTTTTATGCCTGATACTCTGTATATAATAACTAACT
TGGAAATGCCCTGGGCCATGATTGGTGTCAGGTACTGCATGGTCCTTTTTGCGGGCATTGCAGCTGAAGCTCTTGTTTACGGTGAAGCAGAGGGAGGAGAGAATGATGAA
AATTTGTTTAGAAGTATTTGCATTCTTTTGCAACCCCCATTGTCTGTTGCGCAGATGTCAAATCAAGCAAGATGGGCTGTTCTACAATCTTACAATCTGCTGAAGTGGCA
CAAACATGCACACCAAGTAGCTGTCAAAGCTTTGGAAAGTGGAAGCAGTCTCAGTGTTGTAATTAGGAGAATTGAGGATGCATTGTCGACAAATAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTATCCTTAGTCCTCCCAAACTCCTAATTTCATCTTCTCTTCTCCAATTCCACCATTTTCATTACCCAATTCCCTTCAATTTTCAACAGAAAAACCCTAATGGAAT
CAATAAACATTTCCATTTAGAAAGCCATCAGCGTCTCCTCCCTCTGTCTAGAGCTCTTCGCGAATGGCAAGATTACGAAGAGGCAGTGAAGCGCAAGGATCTCGCTGAAG
CTCTTAGGTTTCTCGAATCCTTTGACAGAGAAAGCGCAATCGAACCCATTAATGATTCGGCACCTGCTGGTTCAGCTCCGTCTGCTCTTGGGAATCCGCGGTTATCTGGC
TGGGAGAGGGACTGGGAGGTACTAGACACTTGTTTAAATGCGGATGATATGAAGCTTGTTGCCGATGCTTATGGGTTTCTCAGGGACAGAGGATTTTTGCCCAATTTTGG
AAAGTGCAGGAACATTGGTACACCCCTTTCTCTGTCTCCACCATGTGAATTTATTTCATTAGTGTTGAAGAATGCAAGCATCGAGAAACGACTTTCCCTTTCGGCTTCCA
TCAATCTCTTTAGCATGTATCTACACGCCGTATTGCAGGGGCTTTACTTTATTGTTGATTTTCAGTACCCATTTTACTTGATGATTTGTTTTTATACTACTTATTTGAGC
ACAGTGACTGATGAACTAGCCATACCTGCAGTTTTGGAGGGTCGAAGAGATGTCACGCCGTCTGTGTTGGAGTCTACAACTGGATTAGAAGTCTCCAAGTTATCTCCAAA
GAAATGGGGTCTTTCAGGCAGCTCTCGTTACGCTTTGATTGCTTTTCTTGGTGGAACATCCTTTCTGCTCTCGCAGGACATAGATATTAGGCCAAACCTTTTGGCACTGC
TGGGGCTGGCATTTTTGGATTCTATCCTCCTTGGTGGTACTTGTCTAGCACAAATCTCCAGCTATTGGCCACCATATAGGCGTCGAATCCTTGTACACGAAGCTGGACAT
CTACTGACTGCTTACCTCATGGGCTGCCCGATTCGTGGAGTGATTTTGGATCCGATCGTTGCCATGCAAATGGGGATACAAGGACAGGCAGGTACCCAGTTTTGGGATGA
AAAAATGGCAAGCAGTCTTGCTGAAGGACGTTTGGATGGTACTTCCTTTGACAGTGATCATATGAGTGGACTCAGTTTTATGCCTGATACTCTGTATATAATAACTAACT
TGGAAATGCCCTGGGCCATGATTGGTGTCAGGTACTGCATGGTCCTTTTTGCGGGCATTGCAGCTGAAGCTCTTGTTTACGGTGAAGCAGAGGGAGGAGAGAATGATGAA
AATTTGTTTAGAAGTATTTGCATTCTTTTGCAACCCCCATTGTCTGTTGCGCAGATGTCAAATCAAGCAAGATGGGCTGTTCTACAATCTTACAATCTGCTGAAGTGGCA
CAAACATGCACACCAAGTAGCTGTCAAAGCTTTGGAAAGTGGAAGCAGTCTCAGTGTTGTAATTAGGAGAATTGAGGATGCATTGTCGACAAATAGATGA
Protein sequenceShow/hide protein sequence
MAILSPPKLLISSSLLQFHHFHYPIPFNFQQKNPNGINKHFHLESHQRLLPLSRALREWQDYEEAVKRKDLAEALRFLESFDRESAIEPINDSAPAGSAPSALGNPRLSG
WERDWEVLDTCLNADDMKLVADAYGFLRDRGFLPNFGKCRNIGTPLSLSPPCEFISLVLKNASIEKRLSLSASINLFSMYLHAVLQGLYFIVDFQYPFYLMICFYTTYLS
TVTDELAIPAVLEGRRDVTPSVLESTTGLEVSKLSPKKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPPYRRRILVHEAGH
LLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASSLAEGRLDGTSFDSDHMSGLSFMPDTLYIITNLEMPWAMIGVRYCMVLFAGIAAEALVYGEAEGGENDE
NLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRRIEDALSTNR