; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc04G06460 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc04G06460
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionActin cytoskeleton-regulatory complex protein pan1
Genome locationClcChr04:19767491..19770989
RNA-Seq ExpressionClc04G06460
SyntenyClc04G06460
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7033696.1 hypothetical protein SDJN02_03421, partial [Cucurbita argyrosperma subsp. argyrosperma]9.8e-17075.37Show/hide
Query:  MAYRRRQAITRASTFKEEIHHPSDEFDYNSISSSSSSSSSLAAQAIRASAAHRDSSYSSAFGGASN-FSPGHIRSKINMICFASNSSEQVIDQFTNYIFL
        MAYRRRQAITRASTFKEEIHHP+D+ D+NSISSSSSSSSSLAAQAIRASAAHRDSS SSAF G+S+ FSPGH+RSK                        
Subjt:  MAYRRRQAITRASTFKEEIHHPSDEFDYNSISSSSSSSSSLAAQAIRASAAHRDSSYSSAFGGASN-FSPGHIRSKINMICFASNSSEQVIDQFTNYIFL

Query:  ELNFAMISSSFAGENVTFKSESKSGFWGVLARKAKAILEEDDIAIEEETSRFQP--------EPCQSTDYDSKKSDNPAIRKGLDAISTTLNQLGDTFEK
                 SFAGEN+TFKSESKSGFWGVLARKAKAILEEDDIAIEEETS FQP        EPCQSTD+DSKK DNPA  KGLDAI+T+ NQLGDT +K
Subjt:  ELNFAMISSSFAGENVTFKSESKSGFWGVLARKAKAILEEDDIAIEEETSRFQP--------EPCQSTDYDSKKSDNPAIRKGLDAISTTLNQLGDTFEK

Query:  AYEEGRSIVETKTADIIQETRKLQIRKKGNNSEGLYPAVNNPWQQPNIQSPEPHMQTHHETQLKASRDVAMATAAKAKLLLRELKTIKADLAFAKERCAQ
        A +EGR IV  KT DII ETRKLQIR+KGN SEGL+P VNNPWQQPN+ SPEP MQ HHETQLKASRDVAMATAAKAKLLLRELKTIKA+LAFAKERCAQ
Subjt:  AYEEGRSIVETKTADIIQETRKLQIRKKGNNSEGLYPAVNNPWQQPNIQSPEPHMQTHHETQLKASRDVAMATAAKAKLLLRELKTIKADLAFAKERCAQ

Query:  LEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPLSTSPEITKMLSNSVSPRS
        LEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTE YP STSPEIT+ML N  SP S
Subjt:  LEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPLSTSPEITKMLSNSVSPRS

Query:  PTSPNSPSAAEAAMEHLPVVPSIHPKQDKDQHD-----------KDSDGHSSPATVSEEEEEGTAKKPLPSSTSS
         TSP SPSA  AAME LP VP +HPK+ KD  D           +DS+G  SP TVS EEEEGT   PLPSSTSS
Subjt:  PTSPNSPSAAEAAMEHLPVVPSIHPKQDKDQHD-----------KDSDGHSSPATVSEEEEEGTAKKPLPSSTSS

XP_008463299.1 PREDICTED: uncharacterized protein LOC103501490 isoform X1 [Cucumis melo]1.3e-18280.3Show/hide
Query:  MAYRRRQAITRASTFKEEIHHPSDEFDYNSISSSSSSSSSLAAQAIRASAAHRDSSYSSAFGGA-SNFSPGHIRSKINMICFASNSSEQVIDQFTNYIFL
        MAYRRRQAITRASTFKEEIH+PSDEFD+NSISSSSSSSSSLAAQAIRASA H DSS+SSAF GA SNFSPGH+RSK                        
Subjt:  MAYRRRQAITRASTFKEEIHHPSDEFDYNSISSSSSSSSSLAAQAIRASAAHRDSSYSSAFGGA-SNFSPGHIRSKINMICFASNSSEQVIDQFTNYIFL

Query:  ELNFAMISSSFAGENVTFKSESKSGFWGVLARKAKAILEEDDIAIEEETSRFQP--------EPCQSTDYDSKKSDNPAIRKGLDAISTTLNQLGDTFEK
                 SFAGENV FKSESKSGFWGVLARKAKAILEEDDIAIEEE SRFQP        EPCQST+ DSKK+DNPAIRKGLDAIST+LNQLGDTFEK
Subjt:  ELNFAMISSSFAGENVTFKSESKSGFWGVLARKAKAILEEDDIAIEEETSRFQP--------EPCQSTDYDSKKSDNPAIRKGLDAISTTLNQLGDTFEK

Query:  AYEEGRSIVETKTADIIQETRKLQIRKKGNNSEGLYPAVNNPWQQPNIQSPEPHMQTHHETQLKASRDVAMATAAKAKLLLRELKTIKADLAFAKERCAQ
        AY EGR+IVE KTADIIQETRKLQIRKKGNN+EGLYPAVNN WQQPNIQSPEP+MQTHHETQLKASRDVAMATAAKAKLLLRELKTIKADLAFAKERCAQ
Subjt:  AYEEGRSIVETKTADIIQETRKLQIRKKGNNSEGLYPAVNNPWQQPNIQSPEPHMQTHHETQLKASRDVAMATAAKAKLLLRELKTIKADLAFAKERCAQ

Query:  LEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPLSTSPEITKMLSNSVSPRS
        LEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYP+STSPEITK LSNSVSPRS
Subjt:  LEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPLSTSPEITKMLSNSVSPRS

Query:  PTSPNSPSAAEAAMEHLPVV--PSIHPKQDKDQHDKDSDGHSSPATV-----SEEEEEGTAKKPLPSSTSSS
        PTSP+SP      ME LP V  P I  KQDKD HDKDSD H + +T       EEEEEGT  KPLPS+TSSS
Subjt:  PTSPNSPSAAEAAMEHLPVV--PSIHPKQDKDQHDKDSDGHSSPATV-----SEEEEEGTAKKPLPSSTSSS

XP_011653735.2 uncharacterized protein LOC101205031 isoform X1 [Cucumis sativus]3.5e-18380.13Show/hide
Query:  MAYRRRQAITRASTFKEEIHHPSDEFDYNSISSSSSSSSSLAAQAIRASAAHRDSSYSSAFGGA-SNFSPGHIRSKINMICFASNSSEQVIDQFTNYIFL
        MAYRRRQAITRASTFKEEIH+PSDE D+NSISSSSSSSSSLAAQAIRASA  RDSS+SSAF GA SNFSPGH+RSK                        
Subjt:  MAYRRRQAITRASTFKEEIHHPSDEFDYNSISSSSSSSSSLAAQAIRASAAHRDSSYSSAFGGA-SNFSPGHIRSKINMICFASNSSEQVIDQFTNYIFL

Query:  ELNFAMISSSFAGENVTFKSESKSGFWGVLARKAKAILEEDDIAIEEETSRFQP--------EPCQSTDYDSKKSDNPAIRKGLDAISTTLNQLGDTFEK
                 SFAGEN+ FKS+SKSGFWGVLARKAKAILEEDDIAIE+E SRFQP        EPCQST+ DSKK+DNPAIRKGLDAIST+LNQLGDTFEK
Subjt:  ELNFAMISSSFAGENVTFKSESKSGFWGVLARKAKAILEEDDIAIEEETSRFQP--------EPCQSTDYDSKKSDNPAIRKGLDAISTTLNQLGDTFEK

Query:  AYEEGRSIVETKTADIIQETRKLQIRKKGNNSEGLYPAVNNPWQQPNIQSPEPHMQTHHETQLKASRDVAMATAAKAKLLLRELKTIKADLAFAKERCAQ
        AYEEGR+IVE KTADIIQETRKLQIRKKGNN+EGLYPAVNN WQQPNIQSPEP+MQTHHETQLKASRDVAMATAAKAKLLLRELKTIKADLAFAKERCAQ
Subjt:  AYEEGRSIVETKTADIIQETRKLQIRKKGNNSEGLYPAVNNPWQQPNIQSPEPHMQTHHETQLKASRDVAMATAAKAKLLLRELKTIKADLAFAKERCAQ

Query:  LEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPLSTSPEITKMLSNSVSPRS
        LEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYP+STSPEITKMLSNSVSPRS
Subjt:  LEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPLSTSPEITKMLSNSVSPRS

Query:  PTSPNSPSAAEAAMEHLPVV---PSIHPKQDKDQHDKDSDGHSSPATV-----SEEEEEGTAKKPLPSSTSSS
        PTSP+SP      ME LPVV   P I  KQDKD HDKDSD H + +T       EEEEEGT K PLPSSTSSS
Subjt:  PTSPNSPSAAEAAMEHLPVV---PSIHPKQDKDQHDKDSDGHSSPATV-----SEEEEEGTAKKPLPSSTSSS

XP_022978186.1 uncharacterized protein LOC111478244 [Cucurbita maxima]9.8e-17076.12Show/hide
Query:  MAYRRRQAITRASTFKEEIHHPSDEFDYNSISSSSSSSSSLAAQAIRASAAHRDSSYSSAFGGASN-FSPGHIRSKINMICFASNSSEQVIDQFTNYIFL
        MAYRRRQAITRASTFKEEIH P+D+ D+NSISSSSSSSSSLA QAIRASAAHRDSS SSAF G+S+ FSPGH+RSK                        
Subjt:  MAYRRRQAITRASTFKEEIHHPSDEFDYNSISSSSSSSSSLAAQAIRASAAHRDSSYSSAFGGASN-FSPGHIRSKINMICFASNSSEQVIDQFTNYIFL

Query:  ELNFAMISSSFAGENVTFKSESKSGFWGVLARKAKAILEEDDIAIEEETSRFQP--------EPCQSTDYDSKKSDNPAIRKGLDAISTTLNQLGDTFEK
                 SFAGEN+TFKSESKSGFWGVLARKAKAILEEDDIAIEEETS FQP        E CQ TD+DSKKSDNPA  KGLDAI+T+LNQLGDT +K
Subjt:  ELNFAMISSSFAGENVTFKSESKSGFWGVLARKAKAILEEDDIAIEEETSRFQP--------EPCQSTDYDSKKSDNPAIRKGLDAISTTLNQLGDTFEK

Query:  AYEEGRSIVETKTADIIQETRKLQIRKKGNNSEGLYPAVNNPWQQPNIQSPEPHMQTHHETQLKASRDVAMATAAKAKLLLRELKTIKADLAFAKERCAQ
        A +EGR IVE KT DII ETRKLQIR+KGN SEGL+P VNNPWQQPN+ SPEP MQ HHETQLKASRDVAMATAAKAKLLLRELKTIKA+LAFAKERCAQ
Subjt:  AYEEGRSIVETKTADIIQETRKLQIRKKGNNSEGLYPAVNNPWQQPNIQSPEPHMQTHHETQLKASRDVAMATAAKAKLLLRELKTIKADLAFAKERCAQ

Query:  LEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPLSTSPEITKMLSNSVSPRS
        LEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTE YP STSPEIT+ML N  SP S
Subjt:  LEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPLSTSPEITKMLSNSVSPRS

Query:  PTSPNSPSAAEAAMEHLPVVPSIHPKQDKDQHD-----KDSDGHSSPATVSEEEEEGTAKKPLPSSTSS
         TSP SPSA  AAME LP VP +HPK++KD  D     +DS+G  SP TVS EEEEGT   PLPSSTSS
Subjt:  PTSPNSPSAAEAAMEHLPVVPSIHPKQDKDQHD-----KDSDGHSSPATVSEEEEEGTAKKPLPSSTSS

XP_038882986.1 uncharacterized protein LOC120074073 [Benincasa hispida]1.2e-18682.23Show/hide
Query:  MAYRRRQAITRASTFKEEIHHPSDEFDYNSISSSSSSSSSLAAQAIRASAAHRDSSYSSAFGG-ASNFSPGHIRSKINMICFASNSSEQVIDQFTNYIFL
        MAYRRRQ ITRASTFKEEIHH SDEFD+NSISSSSSSSSSLAAQAIRAS AHRDSS SSA GG +SNFS GH+RSK                        
Subjt:  MAYRRRQAITRASTFKEEIHHPSDEFDYNSISSSSSSSSSLAAQAIRASAAHRDSSYSSAFGG-ASNFSPGHIRSKINMICFASNSSEQVIDQFTNYIFL

Query:  ELNFAMISSSFAGENVTFKSESKSGFWGVLARKAKAILEEDDIAIEEETSRFQ--------PEPCQSTDYDSKKSDNPAIRKGLDAISTTLNQLGDTFEK
                 SFAGE+VTFKSESKSGFWGVLARKAKAILEEDDI IEEETSRFQ         EPCQSTDYDSKKSDNPA+RKGLDAIST+LNQLGDTFEK
Subjt:  ELNFAMISSSFAGENVTFKSESKSGFWGVLARKAKAILEEDDIAIEEETSRFQ--------PEPCQSTDYDSKKSDNPAIRKGLDAISTTLNQLGDTFEK

Query:  AYEEGRSIVETKTADIIQETRKLQIRKKGNNSEGLYPAVNNPWQQPNIQSPEPHMQTHHETQLKASRDVAMATAAKAKLLLRELKTIKADLAFAKERCAQ
        A+EEGRSIVETKTADIIQETRKLQIRKKGNNSEGL+PA+NNPWQQP+IQSPEP+MQTHHETQLKASRDVAMATAAKAKLLLRELKT+KADLAFAKERCAQ
Subjt:  AYEEGRSIVETKTADIIQETRKLQIRKKGNNSEGLYPAVNNPWQQPNIQSPEPHMQTHHETQLKASRDVAMATAAKAKLLLRELKTIKADLAFAKERCAQ

Query:  LEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPLSTSPEITKMLSNSVSPRS
        LEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLT QDVVY DEGMEEVTEVYP+STSPEITK+LS+SVSPRS
Subjt:  LEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPLSTSPEITKMLSNSVSPRS

Query:  PTSPNSPSAAEAAMEHLPVVPSIHPKQDKDQHDKDSDGHSSPATV--SEEEEEGTAKKPLPSSTSSS
        PTS NSPS+A AAME LPVVP   PKQDKD H K SDGH SP TV   EEEEEGT KKPLPSSTSSS
Subjt:  PTSPNSPSAAEAAMEHLPVVPSIHPKQDKDQHDKDSDGHSSPATV--SEEEEEGTAKKPLPSSTSSS

TrEMBL top hitse value%identityAlignment
A0A0A0KXX4 Uncharacterized protein1.7e-18380.13Show/hide
Query:  MAYRRRQAITRASTFKEEIHHPSDEFDYNSISSSSSSSSSLAAQAIRASAAHRDSSYSSAFGGA-SNFSPGHIRSKINMICFASNSSEQVIDQFTNYIFL
        MAYRRRQAITRASTFKEEIH+PSDE D+NSISSSSSSSSSLAAQAIRASA  RDSS+SSAF GA SNFSPGH+RSK                        
Subjt:  MAYRRRQAITRASTFKEEIHHPSDEFDYNSISSSSSSSSSLAAQAIRASAAHRDSSYSSAFGGA-SNFSPGHIRSKINMICFASNSSEQVIDQFTNYIFL

Query:  ELNFAMISSSFAGENVTFKSESKSGFWGVLARKAKAILEEDDIAIEEETSRFQP--------EPCQSTDYDSKKSDNPAIRKGLDAISTTLNQLGDTFEK
                 SFAGEN+ FKS+SKSGFWGVLARKAKAILEEDDIAIE+E SRFQP        EPCQST+ DSKK+DNPAIRKGLDAIST+LNQLGDTFEK
Subjt:  ELNFAMISSSFAGENVTFKSESKSGFWGVLARKAKAILEEDDIAIEEETSRFQP--------EPCQSTDYDSKKSDNPAIRKGLDAISTTLNQLGDTFEK

Query:  AYEEGRSIVETKTADIIQETRKLQIRKKGNNSEGLYPAVNNPWQQPNIQSPEPHMQTHHETQLKASRDVAMATAAKAKLLLRELKTIKADLAFAKERCAQ
        AYEEGR+IVE KTADIIQETRKLQIRKKGNN+EGLYPAVNN WQQPNIQSPEP+MQTHHETQLKASRDVAMATAAKAKLLLRELKTIKADLAFAKERCAQ
Subjt:  AYEEGRSIVETKTADIIQETRKLQIRKKGNNSEGLYPAVNNPWQQPNIQSPEPHMQTHHETQLKASRDVAMATAAKAKLLLRELKTIKADLAFAKERCAQ

Query:  LEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPLSTSPEITKMLSNSVSPRS
        LEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYP+STSPEITKMLSNSVSPRS
Subjt:  LEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPLSTSPEITKMLSNSVSPRS

Query:  PTSPNSPSAAEAAMEHLPVV---PSIHPKQDKDQHDKDSDGHSSPATV-----SEEEEEGTAKKPLPSSTSSS
        PTSP+SP      ME LPVV   P I  KQDKD HDKDSD H + +T       EEEEEGT K PLPSSTSSS
Subjt:  PTSPNSPSAAEAAMEHLPVV---PSIHPKQDKDQHDKDSDGHSSPATV-----SEEEEEGTAKKPLPSSTSSS

A0A1S3CJA7 uncharacterized protein LOC103501490 isoform X16.4e-18380.3Show/hide
Query:  MAYRRRQAITRASTFKEEIHHPSDEFDYNSISSSSSSSSSLAAQAIRASAAHRDSSYSSAFGGA-SNFSPGHIRSKINMICFASNSSEQVIDQFTNYIFL
        MAYRRRQAITRASTFKEEIH+PSDEFD+NSISSSSSSSSSLAAQAIRASA H DSS+SSAF GA SNFSPGH+RSK                        
Subjt:  MAYRRRQAITRASTFKEEIHHPSDEFDYNSISSSSSSSSSLAAQAIRASAAHRDSSYSSAFGGA-SNFSPGHIRSKINMICFASNSSEQVIDQFTNYIFL

Query:  ELNFAMISSSFAGENVTFKSESKSGFWGVLARKAKAILEEDDIAIEEETSRFQP--------EPCQSTDYDSKKSDNPAIRKGLDAISTTLNQLGDTFEK
                 SFAGENV FKSESKSGFWGVLARKAKAILEEDDIAIEEE SRFQP        EPCQST+ DSKK+DNPAIRKGLDAIST+LNQLGDTFEK
Subjt:  ELNFAMISSSFAGENVTFKSESKSGFWGVLARKAKAILEEDDIAIEEETSRFQP--------EPCQSTDYDSKKSDNPAIRKGLDAISTTLNQLGDTFEK

Query:  AYEEGRSIVETKTADIIQETRKLQIRKKGNNSEGLYPAVNNPWQQPNIQSPEPHMQTHHETQLKASRDVAMATAAKAKLLLRELKTIKADLAFAKERCAQ
        AY EGR+IVE KTADIIQETRKLQIRKKGNN+EGLYPAVNN WQQPNIQSPEP+MQTHHETQLKASRDVAMATAAKAKLLLRELKTIKADLAFAKERCAQ
Subjt:  AYEEGRSIVETKTADIIQETRKLQIRKKGNNSEGLYPAVNNPWQQPNIQSPEPHMQTHHETQLKASRDVAMATAAKAKLLLRELKTIKADLAFAKERCAQ

Query:  LEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPLSTSPEITKMLSNSVSPRS
        LEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYP+STSPEITK LSNSVSPRS
Subjt:  LEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPLSTSPEITKMLSNSVSPRS

Query:  PTSPNSPSAAEAAMEHLPVV--PSIHPKQDKDQHDKDSDGHSSPATV-----SEEEEEGTAKKPLPSSTSSS
        PTSP+SP      ME LP V  P I  KQDKD HDKDSD H + +T       EEEEEGT  KPLPS+TSSS
Subjt:  PTSPNSPSAAEAAMEHLPVV--PSIHPKQDKDQHDKDSDGHSSPATV-----SEEEEEGTAKKPLPSSTSSS

A0A5D3C4F3 Actin cytoskeleton-regulatory complex protein pan16.4e-18380.3Show/hide
Query:  MAYRRRQAITRASTFKEEIHHPSDEFDYNSISSSSSSSSSLAAQAIRASAAHRDSSYSSAFGGA-SNFSPGHIRSKINMICFASNSSEQVIDQFTNYIFL
        MAYRRRQAITRASTFKEEIH+PSDEFD+NSISSSSSSSSSLAAQAIRASA H DSS+SSAF GA SNFSPGH+RSK                        
Subjt:  MAYRRRQAITRASTFKEEIHHPSDEFDYNSISSSSSSSSSLAAQAIRASAAHRDSSYSSAFGGA-SNFSPGHIRSKINMICFASNSSEQVIDQFTNYIFL

Query:  ELNFAMISSSFAGENVTFKSESKSGFWGVLARKAKAILEEDDIAIEEETSRFQP--------EPCQSTDYDSKKSDNPAIRKGLDAISTTLNQLGDTFEK
                 SFAGENV FKSESKSGFWGVLARKAKAILEEDDIAIEEE SRFQP        EPCQST+ DSKK+DNPAIRKGLDAIST+LNQLGDTFEK
Subjt:  ELNFAMISSSFAGENVTFKSESKSGFWGVLARKAKAILEEDDIAIEEETSRFQP--------EPCQSTDYDSKKSDNPAIRKGLDAISTTLNQLGDTFEK

Query:  AYEEGRSIVETKTADIIQETRKLQIRKKGNNSEGLYPAVNNPWQQPNIQSPEPHMQTHHETQLKASRDVAMATAAKAKLLLRELKTIKADLAFAKERCAQ
        AY EGR+IVE KTADIIQETRKLQIRKKGNN+EGLYPAVNN WQQPNIQSPEP+MQTHHETQLKASRDVAMATAAKAKLLLRELKTIKADLAFAKERCAQ
Subjt:  AYEEGRSIVETKTADIIQETRKLQIRKKGNNSEGLYPAVNNPWQQPNIQSPEPHMQTHHETQLKASRDVAMATAAKAKLLLRELKTIKADLAFAKERCAQ

Query:  LEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPLSTSPEITKMLSNSVSPRS
        LEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYP+STSPEITK LSNSVSPRS
Subjt:  LEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPLSTSPEITKMLSNSVSPRS

Query:  PTSPNSPSAAEAAMEHLPVV--PSIHPKQDKDQHDKDSDGHSSPATV-----SEEEEEGTAKKPLPSSTSSS
        PTSP+SP      ME LP V  P I  KQDKD HDKDSD H + +T       EEEEEGT  KPLPS+TSSS
Subjt:  PTSPNSPSAAEAAMEHLPVV--PSIHPKQDKDQHDKDSDGHSSPATV-----SEEEEEGTAKKPLPSSTSSS

A0A6J1GFX8 uncharacterized protein LOC1114535968.9e-16975.91Show/hide
Query:  MAYRRRQAITRASTFKEEIHHPSDEFDYNSISSSSSSSSSLAAQAIRASAAHRDSSYSSAFGGASN-FSPGHIRSKINMICFASNSSEQVIDQFTNYIFL
        MAYRRRQAITRASTFKEEIHHP+D+ D+NSI   SSSSSSLAAQAIRASAAHRDSS SSAF G+S+ FSPGH+RSK                        
Subjt:  MAYRRRQAITRASTFKEEIHHPSDEFDYNSISSSSSSSSSLAAQAIRASAAHRDSSYSSAFGGASN-FSPGHIRSKINMICFASNSSEQVIDQFTNYIFL

Query:  ELNFAMISSSFAGENVTFKSESKSGFWGVLARKAKAILEEDDIAIEEETSRFQP--------EPCQSTDYDSKKSDNPAIRKGLDAISTTLNQLGDTFEK
                 SFAGEN+TFKSESKSGFWGVLARKAKAILEEDDIAIEEETS FQP        EPCQSTD+DSKKSDNPA  KGLDAI+T+ NQLGDT +K
Subjt:  ELNFAMISSSFAGENVTFKSESKSGFWGVLARKAKAILEEDDIAIEEETSRFQP--------EPCQSTDYDSKKSDNPAIRKGLDAISTTLNQLGDTFEK

Query:  AYEEGRSIVETKTADIIQETRKLQIRKKGNNSEGLYPAVNNPWQQPNIQSPEPHMQTHHETQLKASRDVAMATAAKAKLLLRELKTIKADLAFAKERCAQ
        A +EGR IV  KT DII ETRKLQIR+KGN SEGL+P VNNPWQQPN+ SPEP MQ HHETQLKASRDVAMATAAKAKLLLRELKTIKA+LAFAKERCAQ
Subjt:  AYEEGRSIVETKTADIIQETRKLQIRKKGNNSEGLYPAVNNPWQQPNIQSPEPHMQTHHETQLKASRDVAMATAAKAKLLLRELKTIKADLAFAKERCAQ

Query:  LEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPLSTSPEITKMLSNSVSPRS
        LEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTE YP STSPEIT+ML N  SP S
Subjt:  LEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPLSTSPEITKMLSNSVSPRS

Query:  PTSPNSPSAAEAAMEHLPVVPSIHPKQDKDQHD-----KDSDGHSSPATVSEEEEEGTAKKPLPSSTSS
         TSP SPSA  AAME LP VP +HPK+ KD  D     +DS+G  SP TVS EEEEGT   PLPSSTSS
Subjt:  PTSPNSPSAAEAAMEHLPVVPSIHPKQDKDQHD-----KDSDGHSSPATVSEEEEEGTAKKPLPSSTSS

A0A6J1IM16 uncharacterized protein LOC1114782444.7e-17076.12Show/hide
Query:  MAYRRRQAITRASTFKEEIHHPSDEFDYNSISSSSSSSSSLAAQAIRASAAHRDSSYSSAFGGASN-FSPGHIRSKINMICFASNSSEQVIDQFTNYIFL
        MAYRRRQAITRASTFKEEIH P+D+ D+NSISSSSSSSSSLA QAIRASAAHRDSS SSAF G+S+ FSPGH+RSK                        
Subjt:  MAYRRRQAITRASTFKEEIHHPSDEFDYNSISSSSSSSSSLAAQAIRASAAHRDSSYSSAFGGASN-FSPGHIRSKINMICFASNSSEQVIDQFTNYIFL

Query:  ELNFAMISSSFAGENVTFKSESKSGFWGVLARKAKAILEEDDIAIEEETSRFQP--------EPCQSTDYDSKKSDNPAIRKGLDAISTTLNQLGDTFEK
                 SFAGEN+TFKSESKSGFWGVLARKAKAILEEDDIAIEEETS FQP        E CQ TD+DSKKSDNPA  KGLDAI+T+LNQLGDT +K
Subjt:  ELNFAMISSSFAGENVTFKSESKSGFWGVLARKAKAILEEDDIAIEEETSRFQP--------EPCQSTDYDSKKSDNPAIRKGLDAISTTLNQLGDTFEK

Query:  AYEEGRSIVETKTADIIQETRKLQIRKKGNNSEGLYPAVNNPWQQPNIQSPEPHMQTHHETQLKASRDVAMATAAKAKLLLRELKTIKADLAFAKERCAQ
        A +EGR IVE KT DII ETRKLQIR+KGN SEGL+P VNNPWQQPN+ SPEP MQ HHETQLKASRDVAMATAAKAKLLLRELKTIKA+LAFAKERCAQ
Subjt:  AYEEGRSIVETKTADIIQETRKLQIRKKGNNSEGLYPAVNNPWQQPNIQSPEPHMQTHHETQLKASRDVAMATAAKAKLLLRELKTIKADLAFAKERCAQ

Query:  LEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPLSTSPEITKMLSNSVSPRS
        LEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTE YP STSPEIT+ML N  SP S
Subjt:  LEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPLSTSPEITKMLSNSVSPRS

Query:  PTSPNSPSAAEAAMEHLPVVPSIHPKQDKDQHD-----KDSDGHSSPATVSEEEEEGTAKKPLPSSTSS
         TSP SPSA  AAME LP VP +HPK++KD  D     +DS+G  SP TVS EEEEGT   PLPSSTSS
Subjt:  PTSPNSPSAAEAAMEHLPVVPSIHPKQDKDQHD-----KDSDGHSSPATVSEEEEEGTAKKPLPSSTSS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G30050.1 unknown protein1.7e-7950.57Show/hide
Query:  MAYRRRQAITRASTFKEEIHHPSDEFDYNSI-----------SSSSSSSSSLAAQAIRASAAHRDSSYSSAFGGASNFSPGHIRSKINMICFASNSSEQV
        MAYRRRQ ITRASTFKE+I+H   + D+  +           S S SS SSLAAQAIRAS        S A G                           
Subjt:  MAYRRRQAITRASTFKEEIHHPSDEFDYNSI-----------SSSSSSSSSLAAQAIRASAAHRDSSYSSAFGGASNFSPGHIRSKINMICFASNSSEQV

Query:  IDQFTNYIFLELNFAMISSSFAGENVTFKSESKSGFWGVLARKAKAILEEDDIAIEEETSRFQPEPCQSTDYDSKKSDNPAIRKGLDAISTTLNQLGDTF
           FT Y                     KSES+ GFWG+LA+KAK+ILE+     EEE    Q +  Q  D   + S NP IRK +D I+T+LN +GD+F
Subjt:  IDQFTNYIFLELNFAMISSSFAGENVTFKSESKSGFWGVLARKAKAILEEDDIAIEEETSRFQPEPCQSTDYDSKKSDNPAIRKGLDAISTTLNQLGDTF

Query:  EKAYEEGRSIVETKTADIIQETRKLQIRKKG-------NNSEGLYPAVNNPWQQPNIQSPEPHMQTHHETQLKASRDVAMATAAKAKLLLRELKTIKADL
        EKA+EEGR+IV +            QIR+KG       NN+       ++PWQ   +  P P      E+QLKASRDVAMATAAKAKLLLRELKT+KADL
Subjt:  EKAYEEGRSIVETKTADIIQETRKLQIRKKG-------NNSEGLYPAVNNPWQQPNIQSPEPHMQTHHETQLKASRDVAMATAAKAKLLLRELKTIKADL

Query:  AFAKERCAQLEEENKILRENREKGDNR-ADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPLSTSPEITK
        AFAKERC+QLEEENK LR+NR+KG+N  ADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVY+DEG+EEV EV     +P IT+
Subjt:  AFAKERCAQLEEENKILRENREKGDNR-ADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPLSTSPEITK

Query:  MLS---------NSVSPRSPTSPNSPSAAEAAMEHLPVV
         LS          S+SP SP+SP SPS    + +  P++
Subjt:  MLS---------NSVSPRSPTSPNSPSAAEAAMEHLPVV

AT2G30530.1 unknown protein5.9e-7249.74Show/hide
Query:  KEEIHHPSDEFDYNSISS------SSSSSSSLAAQAIRASAAHRDSSYSSAFGGASNFSPGHIRSKINMICFASNSSEQVIDQFTNYIFLELNFAMISSS
        KE +  P+    Y+S SS      SSS++SSLAA+AIRAS+AHRDSS SSA+   S+ +P     K                        E+N A   +S
Subjt:  KEEIHHPSDEFDYNSISS------SSSSSSSLAAQAIRASAAHRDSSYSSAFGGASNFSPGHIRSKINMICFASNSSEQVIDQFTNYIFLELNFAMISSS

Query:  FAGENVTFKSESKSGFWGVLARKAKAILEEDD-IAIEEETSRFQPEPCQSTDYDSK-------KSDNPAIRKGLDAISTTLNQLGDTFEKAYEEGRSIVE
            N     E K GFWG LA KAKA L+EDD   + +   R +     +T   +K       KS+NP++++ LDAI+++LN +G T     EEG + VE
Subjt:  FAGENVTFKSESKSGFWGVLARKAKAILEEDD-IAIEEETSRFQPEPCQSTDYDSK-------KSDNPAIRKGLDAISTTLNQLGDTFEKAYEEGRSIVE

Query:  TKTADIIQETRKLQIRKKGNNSEGLYPAVNNPWQQPNIQSPEPHMQTHHETQLKASRDVAMATAAKAKLLLRELKTIKADLAFAKERCAQLEEENKILRE
         +TA IIQETRK +I+KK        P++    Q P IQ+         E QLKASRDVAMA AAKAKLLLRELK +K+DLAFAK+RCAQLEEENK+LRE
Subjt:  TKTADIIQETRKLQIRKKGNNSEGLYPAVNNPWQQPNIQSPEPHMQTHHETQLKASRDVAMATAAKAKLLLRELKTIKADLAFAKERCAQLEEENKILRE

Query:  NREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPLSTSPEITKMLSNSVSP
        NR       DDDL+RLQLETLLAEKARLAHENSIY REN +LR +VEYHQLTMQDVVY DE  EEVTEVYP++ S  ++    NS +P
Subjt:  NREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPLSTSPEITKMLSNSVSP

AT4G02800.1 unknown protein7.3e-3034.64Show/hide
Query:  SSSFAGENVTFKSESKSGFWGVLARKAKAILEEDDIAIEEETSRFQPEPCQSTDYDSKKSDNPAIRKGLDAISTTLNQLGDTFEKAYEEGRSIVETKT--
        ++SF   + +    S    W  +  +   +LEE+    +  T+    E  +S  +   K+D+  + KG D++S TL+ L    + A +  R + +  +  
Subjt:  SSSFAGENVTFKSESKSGFWGVLARKAKAILEEDDIAIEEETSRFQPEPCQSTDYDSKKSDNPAIRKGLDAISTTLNQLGDTFEKAYEEGRSIVETKT--

Query:  --------ADIIQETRKLQIRKKGNNSEGLYPAVNNPWQQPNIQSPEPHMQTHHETQLKASRDVAMATAAKAKLLLRELKTIKADLAFAKERCAQLEEEN
                AD IQ  +K +  ++   S+G      +  +Q    S E   +      +K ++++A++ AAKA  L RELKTIK+DL+F +ERC  LEEEN
Subjt:  --------ADIIQETRKLQIRKKGNNSEGLYPAVNNPWQQPNIQSPEPHMQTHHETQLKASRDVAMATAAKAKLLLRELKTIKADLAFAKERCAQLEEEN

Query:  KILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPLSTS
        K LR+   KG    +DDL+RLQLE LLAEKARLA+EN+   REN+ L ++VEYHQ+T QD   L    E+V + + L  S
Subjt:  KILRENREKGDNRADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPLSTS

AT5G01970.1 unknown protein4.4e-8361.56Show/hide
Query:  FKSESKSGFWGVLARKAKAILEEDDIAIEEETSRFQPEPCQSTDYDSKKSDNPAIRKGLDAISTTLNQLGDTFEKAYEEGRSIVETKTADIIQETRKLQI
        F +E K G WGV+A+KAK+++ EDD + +  T+  Q      +D   KK DNP +R+GLD ++++LNQ+GDTFEKA+E+GR++VE KTADIIQETRKLQ 
Subjt:  FKSESKSGFWGVLARKAKAILEEDDIAIEEETSRFQPEPCQSTDYDSKKSDNPAIRKGLDAISTTLNQLGDTFEKAYEEGRSIVETKTADIIQETRKLQI

Query:  RKKGNNSE----GLYPAVNNPWQQPNIQSPEPHMQTH---HETQLKASRDVAMATAAKAKLLLRELKTIKADLAFAKERCAQLEEENKILRE-NREKGDN
        R++G   E         V++ W+    +SPE  MQ +   HETQLKASRDVAMATAAKAKLLLRELKT+KADLAFAKERCAQLEEENK LRE +REKG N
Subjt:  RKKGNNSE----GLYPAVNNPWQQPNIQSPEPHMQTH---HETQLKASRDVAMATAAKAKLLLRELKTIKADLAFAKERCAQLEEENKILRE-NREKGDN

Query:  RADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPLSTSPEITKMLSNSVSPRSPTSPNSPSAAE
         AD+DLIRLQLE+LLAEKARLAHENS+YARENRFLREIVEYHQLTMQDVVY+DEG EEVT+V     SP ++ ++++S + RS + P+     E
Subjt:  RADDDLIRLQLETLLAEKARLAHENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPLSTSPEITKMLSNSVSPRSPTSPNSPSAAE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTATAGGAGGAGACAAGCAATCACCAGGGCTTCCACTTTCAAAGAAGAGATCCATCATCCTTCCGACGAATTTGATTATAATTCCATCTCTTCTTCCTCCTCTTC
CTCTTCTTCTCTTGCCGCTCAGGCGATCCGAGCTTCTGCTGCTCATCGAGATTCTTCATATTCCTCTGCCTTTGGTGGTGCCTCCAATTTTTCCCCTGGCCATATACGAT
CCAAGATTAATATGATTTGTTTTGCATCGAATTCTTCCGAGCAAGTGATTGATCAATTTACTAATTACATTTTCTTAGAGCTTAATTTTGCTATGATCTCCTCGAGTTTT
GCTGGTGAGAATGTGACATTCAAAAGTGAGTCAAAATCCGGGTTTTGGGGTGTACTGGCTCGGAAAGCTAAGGCCATTCTGGAAGAAGATGACATAGCCATAGAAGAGGA
AACAAGTAGGTTTCAGCCCGAGCCATGTCAGTCTACAGATTATGACTCGAAGAAATCAGACAATCCTGCAATACGGAAGGGTTTGGATGCAATCTCAACTACGCTTAATC
AACTTGGGGACACGTTTGAAAAGGCATATGAGGAAGGTCGCTCAATTGTGGAAACTAAGACTGCAGACATCATTCAAGAAACTCGCAAACTGCAGATTAGGAAAAAGGGC
AATAATTCTGAAGGATTATATCCTGCGGTGAATAATCCATGGCAGCAACCGAATATACAGTCTCCGGAACCACATATGCAAACACATCATGAAACTCAACTGAAGGCATC
TCGAGACGTGGCGATGGCGACAGCTGCCAAAGCCAAACTACTACTTCGGGAGCTGAAAACCATTAAAGCAGATCTGGCTTTTGCTAAAGAAAGATGTGCACAATTAGAGG
AAGAAAACAAAATTCTCAGGGAAAATCGGGAGAAGGGAGACAATCGTGCAGATGATGACTTGATTCGGCTTCAACTGGAGACTCTTTTGGCTGAGAAGGCTCGTTTGGCG
CATGAGAATTCAATATATGCCCGGGAGAACCGTTTCTTAAGGGAAATTGTGGAGTACCACCAACTGACAATGCAGGATGTGGTTTATTTAGACGAAGGAATGGAAGAAGT
TACAGAAGTTTATCCCCTATCCACCTCTCCAGAGATCACCAAGATGCTTTCTAATTCTGTTTCCCCACGCTCTCCAACCTCTCCAAACTCACCTTCAGCAGCAGAAGCAG
CCATGGAACATCTTCCTGTTGTTCCTTCTATTCACCCTAAACAAGATAAGGATCAACATGACAAAGACAGTGATGGACACTCATCTCCCGCTACAGTTTCTGAGGAGGAA
GAAGAAGGTACAGCAAAAAAACCCTTGCCTTCTAGTACTTCTTCTTCTTAA
mRNA sequenceShow/hide mRNA sequence
GAAAACACAGCATCAACAAAACACAGTTTCAATAATGGCGTATAGGAGGAGACAAGCAATCACCAGGGCTTCCACTTTCAAAGAAGAGATCCATCATCCTTCCGACGAAT
TTGATTATAATTCCATCTCTTCTTCCTCCTCTTCCTCTTCTTCTCTTGCCGCTCAGGCGATCCGAGCTTCTGCTGCTCATCGAGATTCTTCATATTCCTCTGCCTTTGGT
GGTGCCTCCAATTTTTCCCCTGGCCATATACGATCCAAGATTAATATGATTTGTTTTGCATCGAATTCTTCCGAGCAAGTGATTGATCAATTTACTAATTACATTTTCTT
AGAGCTTAATTTTGCTATGATCTCCTCGAGTTTTGCTGGTGAGAATGTGACATTCAAAAGTGAGTCAAAATCCGGGTTTTGGGGTGTACTGGCTCGGAAAGCTAAGGCCA
TTCTGGAAGAAGATGACATAGCCATAGAAGAGGAAACAAGTAGGTTTCAGCCCGAGCCATGTCAGTCTACAGATTATGACTCGAAGAAATCAGACAATCCTGCAATACGG
AAGGGTTTGGATGCAATCTCAACTACGCTTAATCAACTTGGGGACACGTTTGAAAAGGCATATGAGGAAGGTCGCTCAATTGTGGAAACTAAGACTGCAGACATCATTCA
AGAAACTCGCAAACTGCAGATTAGGAAAAAGGGCAATAATTCTGAAGGATTATATCCTGCGGTGAATAATCCATGGCAGCAACCGAATATACAGTCTCCGGAACCACATA
TGCAAACACATCATGAAACTCAACTGAAGGCATCTCGAGACGTGGCGATGGCGACAGCTGCCAAAGCCAAACTACTACTTCGGGAGCTGAAAACCATTAAAGCAGATCTG
GCTTTTGCTAAAGAAAGATGTGCACAATTAGAGGAAGAAAACAAAATTCTCAGGGAAAATCGGGAGAAGGGAGACAATCGTGCAGATGATGACTTGATTCGGCTTCAACT
GGAGACTCTTTTGGCTGAGAAGGCTCGTTTGGCGCATGAGAATTCAATATATGCCCGGGAGAACCGTTTCTTAAGGGAAATTGTGGAGTACCACCAACTGACAATGCAGG
ATGTGGTTTATTTAGACGAAGGAATGGAAGAAGTTACAGAAGTTTATCCCCTATCCACCTCTCCAGAGATCACCAAGATGCTTTCTAATTCTGTTTCCCCACGCTCTCCA
ACCTCTCCAAACTCACCTTCAGCAGCAGAAGCAGCCATGGAACATCTTCCTGTTGTTCCTTCTATTCACCCTAAACAAGATAAGGATCAACATGACAAAGACAGTGATGG
ACACTCATCTCCCGCTACAGTTTCTGAGGAGGAAGAAGAAGGTACAGCAAAAAAACCCTTGCCTTCTAGTACTTCTTCTTCTTAAGGACTTTGATTTTACCGTTCTTTAA
ACTAAGAAAAAAAACTGGCAAAGAAGGGGGTGGGG
Protein sequenceShow/hide protein sequence
MAYRRRQAITRASTFKEEIHHPSDEFDYNSISSSSSSSSSLAAQAIRASAAHRDSSYSSAFGGASNFSPGHIRSKINMICFASNSSEQVIDQFTNYIFLELNFAMISSSF
AGENVTFKSESKSGFWGVLARKAKAILEEDDIAIEEETSRFQPEPCQSTDYDSKKSDNPAIRKGLDAISTTLNQLGDTFEKAYEEGRSIVETKTADIIQETRKLQIRKKG
NNSEGLYPAVNNPWQQPNIQSPEPHMQTHHETQLKASRDVAMATAAKAKLLLRELKTIKADLAFAKERCAQLEEENKILRENREKGDNRADDDLIRLQLETLLAEKARLA
HENSIYARENRFLREIVEYHQLTMQDVVYLDEGMEEVTEVYPLSTSPEITKMLSNSVSPRSPTSPNSPSAAEAAMEHLPVVPSIHPKQDKDQHDKDSDGHSSPATVSEEE
EEGTAKKPLPSSTSSS