CuGenDBv2

Sgr028957 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr028957
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: CTL-like protein DDB_G0274487
Genome location: tig00153210:1998059..2028235
RNA-Seq Expression: Sgr028957
Synteny: Sgr028957
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0006865 - amino acid transport (biological process)
GO:0030163 - protein catabolic process (biological process)
GO:0055085 - transmembrane transport (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0022857 - transmembrane transporter activity (molecular function)
InterPro domains:
IPR001969 - Aspartic peptidase, active site
IPR003769 - Adaptor protein ClpS, core
IPR007603 - Choline transporter-like
IPR013057 - Amino acid transporter, transmembrane domain
IPR014719 - Ribosomal protein L7/L12, C-terminal/adaptor protein ClpS-like
IPR021109 - Aspartic peptidase domain superfamily
IPR032799 - Xylanase inhibitor, C-terminal
IPR032861 - Xylanase inhibitor, N-terminal
IPR033121 - Peptidase family A1 domain
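
The genome location in this record follows a `contig:start..end` pattern. As a minimal sketch, the string can be split into its parts and the locus span computed. This assumes 1-based, inclusive coordinates, which the page does not state, and `parse_location` and `Location` are hypothetical names introduced for illustration, not part of CuGenDB.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed genome location (hypothetical container for illustration)."""
    contig: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'contig:start..end' string into its three fields."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    contig, start, end = match.groups()
    return Location(contig, int(start), int(end))


loc = parse_location("tig00153210:1998059..2028235")
print(loc.contig, loc.start, loc.end, loc.span)
```

If the coordinates turn out to be 0-based or half-open, only the `span` property would need to change.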


Homology
GenBank top hits (e-value, % identity, alignment)
KAF2614520.1 hypothetical protein F2Q70_00012533 [Brassica cretica]; e-value: 1.5e-301; identity: 56.98%
Query:  LLIVVFAFFSGNVRSRGLPRRA--KTSVLDVAASIQRTREIFAFDAKSPLQEERCVSDSSSMALQLNSRISIQRTSHSDYKSLTLARLGRDSARVRSLTA
        LL++ F     +V SR LP+ +   T+ LDV  SI++T++  +F      +E      SSS +LQL+SR S++ T H DY SLTLARL RDSARV+SLTA
Subjt:  LLIVVFAFFSGNVRSRGLPRRA--KTSVLDVAASIQRTREIFAFDAKSPLQEERCVSDSSSMALQLNSRISIQRTSHSDYKSLTLARLGRDSARVRSLTA

Query:  RIDLAVRG---ADLKPFGNGDGSQFGAEDFESPIVSGASQGSGEYFSRVGIGKPPSPVYMVLDTGSDVSWVQCAPCADCYEQTDPIFEPASSTSF-----
        R++LA+     ADLK            E+ E+P++SG +QGSGEYF+RVGIG P   VYMVLDTGSDV+W+QCAPCA CY QT+PIFEP SS+S+     
Subjt:  RIDLAVRG---ADLKPFGNGDGSQFGAEDFESPIVSGASQGSGEYFSRVGIGKPPSPVYMVLDTGSDVSWVQCAPCADCYEQTDPIFEPASSTSF-----

Query:  ------RLSPAKQSNAS-----SYGDGSYTVGDFVTETVTLGSTSLRNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQLNVSSFSYCLVDRDSDSAST
               L  ++  NA+     SYGDGSYT GDF TET T+GS S+ N+A+GCGH+N+GLFVGAAGLLGLGGG L+ PSQLN +SFSYCLVDRDSDS+ST
Subjt:  ------RLSPAKQSNAS-----SYGDGSYTVGDFVTETVTLGSTSLRNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQLNVSSFSYCLVDRDSDSAST

Query:  LDFNSPLPPDAVTAPLHRNRNLNTFFYLGVTGLSVGGELLPIPESSFQMSEDGNGGIIIDSGTAVTRLQTTTYNLLRDAFVKSTHDLQSTRGVALFDTCY
        ++F S +P DAV APL RN  L+TF+YLG+ G+SVGGE+L IP SSF+M E G GG+IIDSGTAVTRLQT  Y+ LRDAFVK T DL+            
Subjt:  LDFNSPLPPDAVTAPLHRNRNLNTFFYLGVTGLSVGGELLPIPESSFQMSEDGNGGIIIDSGTAVTRLQTTTYNLLRDAFVKSTHDLQSTRGVALFDTCY

Query:  DLSSKSRVEVPTVSFHFPDGKELPLPAKNYLIPVDSDGTFCFAFAPTDSALSIIGNAQQQGTLTVRGKYRKFLDVVGILQARSRDERSPKSNRFDPSPYR
                                                                                                           R
Subjt:  DLSSKSRVEVPTVSFHFPDGKELPLPAKNYLIPVDSDGTFCFAFAPTDSALSIIGNAQQQGTLTVRGKYRKFLDVVGILQARSRDERSPKSNRFDPSPYR

Query:  FLNHFSRSRQPHYDSIRKLPIFQPISEKCSLVSPFTMGDSGSVSSSSFDSASVATADQVQNAVRGRMSGDGGQSLPAESSRRWRDIFWSIVFMIHLISVG
         +NH         +SI  L   Q I+E                        +V    + ++  R R SGDGG       SR W DIFWS +F+IHLI +G
Subjt:  FLNHFSRSRQPHYDSIRKLPIFQPISEKCSLVSPFTMGDSGSVSSSSFDSASVATADQVQNAVRGRMSGDGGQSLPAESSRRWRDIFWSIVFMIHLISVG

Query:  FVLVVLGLNRFKKSNRLQIDKYTNTIMENRVGLTEDYWPLYALAGGVGSLLGWTWLFLLGSFANHVMKISVHILTTYLAVISVLCFWGQLFFWGVAFAIG
        FVL VLGLNRF+ S+RL ID+YT   +EN  GLTEDYWPLYA+AGG+G  + W W  LLGS+AN +MK+SVHILTTYLAV+SVLCFW + FFWG AF+IG
Subjt:  FVLVVLGLNRFKKSNRLQIDKYTNTIMENRVGLTEDYWPLYALAGGVGSLLGWTWLFLLGSFANHVMKISVHILTTYLAVISVLCFWGQLFFWGVAFAIG

Query:  AGLQFLYVISVIDRLPFTLLVLQKAVKMVSGLPEVIRVAYAFMIVMLLCMGIWSFGVAGIVASSMGDGGRWWLLVVFSISLFWTGAVLCNTLHVIVSGMV
        A LQFLYV+SVIDRLPFT+LVL+KA+K+V GLP+VI VA+AF + MLL M +WSFG +G+VASSMGD GRWWLLVV S+SLFWTGAVLCNT+HVIVSGMV
Subjt:  AGLQFLYVISVIDRLPFTLLVLQKAVKMVSGLPEVIRVAYAFMIVMLLCMGIWSFGVAGIVASSMGDGGRWWLLVVFSISLFWTGAVLCNTLHVIVSGMV

Query:  FLVLIHGGQ-EASSVPSTSLVKALRYAVTTSFGSICYGSLFTAAIRTLRWEIRGIRSKIGKNECLLCCVDFLFHLVETLVRFFNKYAYIQIAVYGKSFNR
        F VL   GQ E+SS+P ++LV++LRYAVTTSFGSICYGSLFTAAIRTLRWEIRG+RSKI  NECLLCCVDFLFHLVETLVRFFNKYAY+QIAVYGK FNR
Subjt:  FLVLIHGGQ-EASSVPSTSLVKALRYAVTTSFGSICYGSLFTAAIRTLRWEIRGIRSKIGKNECLLCCVDFLFHLVETLVRFFNKYAYIQIAVYGKSFNR

Query:  SARDAWELFQSTGVEALVAYDCSGAVLLMSTIMGGLTAGTCAGVWTWIKWKDKVSMVACTSTLMGMVLVGLAIVIVESAVTSIYICYAEDPLLIHRWDAE
        SARDAWELFQSTGVEALVAYDCSGAVLLM TI GGL  G+C G+W WIK+ D+V MV  T+ LMGMVLVGL +V+VESAVTSIYIC+AEDPLLI RWDA+
Subjt:  SARDAWELFQSTGVEALVAYDCSGAVLLMSTIMGGLTAGTCAGVWTWIKWKDKVSMVACTSTLMGMVLVGLAIVIVESAVTSIYICYAEDPLLIHRWDAE

Query:  FFNQMSEMLHQRLQHRSARAREVLT
        FF +MSEMLH+RLQHRSARAREVLT
Subjt:  FFNQMSEMLHQRLQHRSARAREVLT

KAF9674125.1 hypothetical protein SADUNF_Sadunf10G0095100 [Salix dunnii]; e-value: 2.3e-289; identity: 56.21%
Query:  PGEVPADDQKWKLEDPSRRAKWWYSTFHTVTAMIGAGVLSLPYAMAYLGWGPGTMVLFLSWCMTLNTMWQMIQLHECAPGTRFDRYIDLGRYAFGQKLGP
        P +      KW   DP+RRAKWWYSTFH VTAMIGAGVLSLPYAMAYLGWGPG MVL LSWCMTLNTMWQMIQLHEC PGTRFDRYIDLGR+AFG KLGP
Subjt:  PGEVPADDQKWKLEDPSRRAKWWYSTFHTVTAMIGAGVLSLPYAMAYLGWGPGTMVLFLSWCMTLNTMWQMIQLHECAPGTRFDRYIDLGRYAFGQKLGP

Query:  WIVLPQQLIVQVGCDIVYIVTGGKCMKKFMEMACVNCVQIRQSYWIVIFGSIHFFLSQLPNFNSVAGVSLAAAIMSLSYSTIAWAGSLSRGRMENVSYAY
        WIVLPQQLIVQVGCDIVY+VTGGKC+KKFMEM C +C  IRQSYWI+IFG IHFFLSQLPNFNSVAGVSLAAA+MSLSYSTIAW GSL+ G+++NVSYAY
Subjt:  WIVLPQQLIVQVGCDIVYIVTGGKCMKKFMEMACVNCVQIRQSYWIVIFGSIHFFLSQLPNFNSVAGVSLAAAIMSLSYSTIAWAGSLSRGRMENVSYAY

Query:  KKTSVQDSMFRVFNALGQISFAYAGHAVALEIQATIPSTPAKPSKVPMWKGAVGAYIINAICYFPVAFIGYWAFGQDVEDNILLNLKRPAWLIASANLMV
        K TS  D MFRVFNALG+ISFA+AGHAV LEIQATIPSTP KPSK+PMWKGA+GAY INAICYFPVA IGYWAFGQDVEDN+L +LKRPAWLIASANLMV
Subjt:  KKTSVQDSMFRVFNALGQISFAYAGHAVALEIQATIPSTPAKPSKVPMWKGAVGAYIINAICYFPVAFIGYWAFGQDVEDNILLNLKRPAWLIASANLMV

Query:  VIHVIGSYQVYAMPVFDLLERMMMKKFNFPEGFCLRIITRSAYVGDKNSLCKGLYSNFGASVVKSATTAGRGGGLLERPVIEKATPGRESEFDLRSSRKV
        V+HVIGSYQVYAMPVFD++ER++MK+FNFP G  LR++TRSAYV               A  + +  T    G LL                        
Subjt:  VIHVIGSYQVYAMPVFDLLERMMMKKFNFPEGFCLRIITRSAYVGDKNSLCKGLYSNFGASVVKSATTAGRGGGLLERPVIEKATPGRESEFDLRSSRKV

Query:  APPYRVILHNDDFNKREYVVQVLMKVIPGMMLDNAVNIMQEAHCNGLSLVIICAQADAEEHCMQLRGNGLLSSIEPASVNDTRGFVFFSGLRSLSLNTAS
                                                                                              FF G          
Subjt:  APPYRVILHNDDFNKREYVVQVLMKVIPGMMLDNAVNIMQEAHCNGLSLVIICAQADAEEHCMQLRGNGLLSSIEPASVNDTRGFVFFSGLRSLSLNTAS

Query:  PSVFASQPVLSFLYLPTSMAKISLLLLIVVFAFFSGNVRSRGLPRRAKTSVLDVAASIQRTREIFAFDAKSPLQEERCVSDSSSMALQLNSRISIQRTSH
                   F + PTS     ++ LI+              P+R  T                                                   
Subjt:  PSVFASQPVLSFLYLPTSMAKISLLLLIVVFAFFSGNVRSRGLPRRAKTSVLDVAASIQRTREIFAFDAKSPLQEERCVSDSSSMALQLNSRISIQRTSH

Query:  SDYKSLTLARLGRDSARVRSLTARIDLAVRGADLKPFGNGDGSQFGAEDFESPIVSGASQGSGEYFSRVGIGKPPSPVYMVLDTGSDVSWVQCAPCADCY
                                          K F N            SPI+SGASQGSGEYFSRVGIGKPP   Y++LDTGSDV+WVQCAPCADCY
Subjt:  SDYKSLTLARLGRDSARVRSLTARIDLAVRGADLKPFGNGDGSQFGAEDFESPIVSGASQGSGEYFSRVGIGKPPSPVYMVLDTGSDVSWVQCAPCADCY

Query:  EQTDPIFEPASSTSF-----------RLSPAKQSNAS-----SYGDGSYTVGDFVTETVTLGSTSLRNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ
        +Q+DPIFEPASS SF            L  ++  N +     SYGDGSYTVGDFVTET+TLGS S+ N+AIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ
Subjt:  EQTDPIFEPASSTSF-----------RLSPAKQSNAS-----SYGDGSYTVGDFVTETVTLGSTSLRNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ

Query:  LNVSSFSYCLVDRDSDSASTLDFNSPLPPDAVTAPLHRNRNLNTFFYLGVTGLSVGGELLPIPESSFQMSEDGNGGIIIDSGTAVTRLQTTTYNLLRDAF
        ++ +SFSYCLVDRDSDSASTL+FNS LPP+AV APL RN +L+TF+Y+G+TGLSVGGEL+ +PES+FQ+ E GNGG+I+DSGTA+TRLQT  YN LRDAF
Subjt:  LNVSSFSYCLVDRDSDSASTLDFNSPLPPDAVTAPLHRNRNLNTFFYLGVTGLSVGGELLPIPESSFQMSEDGNGGIIIDSGTAVTRLQTTTYNLLRDAF

Query:  VKSTHDLQSTRGVALFDTCYDLSSKSRVEVPTVSFHFPDGKELPLPAKNYLIPVDSDGTFCFAFAPTDSALSIIGNAQQQGT
        VK T DL ST G+ALFDTCYDLSS+  VEVPTVSFHFPDGK LPLPAKNYL+P+DS+GTFCFAFAPT S+LSIIGN QQQGT
Subjt:  VKSTHDLQSTRGVALFDTCYDLSSKSRVEVPTVSFHFPDGKELPLPAKNYLIPVDSDGTFCFAFAPTDSALSIIGNAQQQGT

XP_008446568.1 PREDICTED: CTL-like protein DDB_G0274487 [Cucumis melo]; e-value: 8.9e-257; identity: 93.08%
Query:  MGDSGSVSSSSFDSASVATADQVQNAVRGRMSGDGGQSLPAESSRRWRDIFWSIVFMIHLISVGFVLVVLGLNRFKKSNRLQIDKYTNTIMENRVGLTED
        MGDSGSVSSSSFDS SVATADQ QN V+GRMSGDGGQ+LP ESSR WRDIFW  VFMIHLISVGFVLVVLGLNRFKKSNRLQIDKYTNTIMENR GLTED
Subjt:  MGDSGSVSSSSFDSASVATADQVQNAVRGRMSGDGGQSLPAESSRRWRDIFWSIVFMIHLISVGFVLVVLGLNRFKKSNRLQIDKYTNTIMENRVGLTED

Query:  YWPLYALAGGVGSLLGWTWLFLLGSFANHVMKISVHILTTYLAVISVLCFWGQLFFWGVAFAIGAGLQFLYVISVIDRLPFTLLVLQKAVKMVSGLPEVI
        YWPLYALAGGVGSLLGWTWLFLLGSFANHVMKISVHILTTYLAVISVLCFWGQLFFWGVAF+I AGLQFLYVISVIDRLPFTLLVLQKAVKMVSGLPEVI
Subjt:  YWPLYALAGGVGSLLGWTWLFLLGSFANHVMKISVHILTTYLAVISVLCFWGQLFFWGVAFAIGAGLQFLYVISVIDRLPFTLLVLQKAVKMVSGLPEVI

Query:  RVAYAFMIVMLLCMGIWSFGVAGIVASSMGDGGRWWLLVVFSISLFWTGAVLCNTLHVIVSGMVFLVLIHGGQEASSVPSTSLVKALRYAVTTSFGSICY
        RVAYAFMIVMLLCMGIWSFGV+GIVASSMGDGGRWWLLVVFSISLFW GAV CNTLHVIVSGMVFLVLIHGG+E+SS+PS SL+KA RYAVTTSFGSICY
Subjt:  RVAYAFMIVMLLCMGIWSFGVAGIVASSMGDGGRWWLLVVFSISLFWTGAVLCNTLHVIVSGMVFLVLIHGGQEASSVPSTSLVKALRYAVTTSFGSICY

Query:  GSLFTAAIRTLRWEIRGIRSKIGKNECLLCCVDFLFHLVETLVRFFNKYAYIQIAVYGKSFNRSARDAWELFQSTGVEALVAYDCSGAVLLMSTIMGGLT
        GSLFTAAIRTLRWEIRGIRSKIGKNECLLCCVDFLFHLVETLVRFFNKYAY+QIAVYGKSFNRSARDAWELFQSTGVE LVAYDCSGAVLLMST+MGGLT
Subjt:  GSLFTAAIRTLRWEIRGIRSKIGKNECLLCCVDFLFHLVETLVRFFNKYAYIQIAVYGKSFNRSARDAWELFQSTGVEALVAYDCSGAVLLMSTIMGGLT

Query:  AGTCAGVWTWIKWKDKVSMVACTSTLMGMVLVGLAIVIVESAVTSIYICYAEDPLLIHRWDAEFFNQMSEMLHQRLQHRSARAREVLTQYR
        AGTC+G+WTWIKWKDKVSMVACT+TLMGMVLVGLAIV+VESAVTSIYICYAE+PLLI +WDAEFFNQ+SEMLHQRLQHRSARAREVL+ YR
Subjt:  AGTCAGVWTWIKWKDKVSMVACTSTLMGMVLVGLAIVIVESAVTSIYICYAEDPLLIHRWDAEFFNQMSEMLHQRLQHRSARAREVLTQYR

XP_022150534.1 CTL-like protein DDB_G0274487 [Momordica charantia]; e-value: 7.8e-261; identity: 94.7%
Query:  MGDSGSVSSSSFDSASVATADQVQNAVRGRMSGDGGQSLPAESSRRWRDIFWSIVFMIHLISVGFVLVVLGLNRFKKSNRLQIDKYTNTIMENRVGLTED
        MGDSGSVSSSSFDSASVATADQVQNAV GRMSGDGGQ+L AESSRRW+DIFWSIVFMIHLISVGFVLVVLGLNRFKKSNRLQIDKYTNTIMENR GLTED
Subjt:  MGDSGSVSSSSFDSASVATADQVQNAVRGRMSGDGGQSLPAESSRRWRDIFWSIVFMIHLISVGFVLVVLGLNRFKKSNRLQIDKYTNTIMENRVGLTED

Query:  YWPLYALAGGVGSLLGWTWLFLLGSFANHVMKISVHILTTYLAVISVLCFWGQLFFWGVAFAIGAGLQFLYVISVIDRLPFTLLVLQKAVKMVSGLPEVI
        YWPLYALAGGVG+LLGWTWLFLLGSFA HVMKISVHILTT+LAVISVLCFWGQLFFWGVAF IGAGLQFLYVISVIDRLPFTLLVLQKAVKMVSGLPEVI
Subjt:  YWPLYALAGGVGSLLGWTWLFLLGSFANHVMKISVHILTTYLAVISVLCFWGQLFFWGVAFAIGAGLQFLYVISVIDRLPFTLLVLQKAVKMVSGLPEVI

Query:  RVAYAFMIVMLLCMGIWSFGVAGIVASSMGDGGRWWLLVVFSISLFWTGAVLCNTLHVIVSGMVFLVLIHGGQEASSVPSTSLVKALRYAVTTSFGSICY
        RVA+AFMIVMLLCMGIWSFGVAGIVASSMGDGGRWWLLVVFSISLFW GAVLCNTLHV+VSGMVFLVLIHGG+EASS+PS SLVK LRY+VTTSFGSICY
Subjt:  RVAYAFMIVMLLCMGIWSFGVAGIVASSMGDGGRWWLLVVFSISLFWTGAVLCNTLHVIVSGMVFLVLIHGGQEASSVPSTSLVKALRYAVTTSFGSICY

Query:  GSLFTAAIRTLRWEIRGIRSKIGKNECLLCCVDFLFHLVETLVRFFNKYAYIQIAVYGKSFNRSARDAWELFQSTGVEALVAYDCSGAVLLMSTIMGGLT
        GSLFTAAIRTLRWEIRGIRSKIGKNECLLCCVDFLFHLVETLV FFNKYAY+QIAVYGKSFNRSARDAWELFQSTGVEALVAYDCSG+VLLMSTIMGGLT
Subjt:  GSLFTAAIRTLRWEIRGIRSKIGKNECLLCCVDFLFHLVETLVRFFNKYAYIQIAVYGKSFNRSARDAWELFQSTGVEALVAYDCSGAVLLMSTIMGGLT

Query:  AGTCAGVWTWIKWKDKVSMVACTSTLMGMVLVGLAIVIVESAVTSIYICYAEDPLLIHRWDAEFFNQMSEMLHQRLQHRSARAREVLTQYR
        AGTC+G+WTWIKWKDKVSMVACT+TLMGMVLVGLAIV+VESAVTSIYICYAEDPLLIHRWDAEFFNQMSEMLHQRLQHRS+RAREVLT+YR
Subjt:  AGTCAGVWTWIKWKDKVSMVACTSTLMGMVLVGLAIVIVESAVTSIYICYAEDPLLIHRWDAEFFNQMSEMLHQRLQHRSARAREVLTQYR

XP_038891301.1 CTL-like protein DDB_G0274487 [Benincasa hispida]; e-value: 5.2e-257; identity: 92.67%
Query:  MGDSGSVSSSSFDSASVATADQVQNAVRGRMSGDGGQSLPAESSRRWRDIFWSIVFMIHLISVGFVLVVLGLNRFKKSNRLQIDKYTNTIMENRVGLTED
        MGDSGSVSSSSFDSASV TADQ QN  +GRMSGDGGQ+L  ESSR WRD+FWS VFMIHLISVGFVLVVLGLNRFKKSNRLQIDKYTNTIMENRVGLTED
Subjt:  MGDSGSVSSSSFDSASVATADQVQNAVRGRMSGDGGQSLPAESSRRWRDIFWSIVFMIHLISVGFVLVVLGLNRFKKSNRLQIDKYTNTIMENRVGLTED

Query:  YWPLYALAGGVGSLLGWTWLFLLGSFANHVMKISVHILTTYLAVISVLCFWGQLFFWGVAFAIGAGLQFLYVISVIDRLPFTLLVLQKAVKMVSGLPEVI
        YWPLYALAGGVGSLLGW WLFLLGSFANHVMKISVHILTTYLAVISVLCFWGQLFFWGVAF+IGAGLQFLYV+SVIDRLPFTLLVLQKAVKMVSGLPEVI
Subjt:  YWPLYALAGGVGSLLGWTWLFLLGSFANHVMKISVHILTTYLAVISVLCFWGQLFFWGVAFAIGAGLQFLYVISVIDRLPFTLLVLQKAVKMVSGLPEVI

Query:  RVAYAFMIVMLLCMGIWSFGVAGIVASSMGDGGRWWLLVVFSISLFWTGAVLCNTLHVIVSGMVFLVLIHGGQEASSVPSTSLVKALRYAVTTSFGSICY
        RVAYAFMIVMLLCMGIWSFGVAGIVASSMGDGGRWWLLVVFSISLFW GAV CNTLHVIVSGMVFLVLIHGG+E+SS+PS SLVKA RYAVTTSFGSICY
Subjt:  RVAYAFMIVMLLCMGIWSFGVAGIVASSMGDGGRWWLLVVFSISLFWTGAVLCNTLHVIVSGMVFLVLIHGGQEASSVPSTSLVKALRYAVTTSFGSICY

Query:  GSLFTAAIRTLRWEIRGIRSKIGKNECLLCCVDFLFHLVETLVRFFNKYAYIQIAVYGKSFNRSARDAWELFQSTGVEALVAYDCSGAVLLMSTIMGGLT
        GSLFTAAIRTLRWEIRGIRSKIGKNECLLCCVDFLFHLVETLVRFFNKYAY+QIAVYGKSFNRSARDAWELFQSTGVEAL+AYDCSGAVLLMSTIMGGLT
Subjt:  GSLFTAAIRTLRWEIRGIRSKIGKNECLLCCVDFLFHLVETLVRFFNKYAYIQIAVYGKSFNRSARDAWELFQSTGVEALVAYDCSGAVLLMSTIMGGLT

Query:  AGTCAGVWTWIKWKDKVSMVACTSTLMGMVLVGLAIVIVESAVTSIYICYAEDPLLIHRWDAEFFNQMSEMLHQRLQHRSARAREVLTQYR
        AGTC+G+WTWIKWKDKVSMVACT+TLMGMVLVGLAIV+VESAVTSIYICYAE+PLL+ +WDAEFFNQ+S+MLHQRLQHRSARAREVL ++R
Subjt:  AGTCAGVWTWIKWKDKVSMVACTSTLMGMVLVGLAIVIVESAVTSIYICYAEDPLLIHRWDAEFFNQMSEMLHQRLQHRSARAREVLTQYR

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0KR48 Uncharacterized protein; e-value: 6.2e-256; identity: 92.26%
Query:  MGDSGSVSSSSFDSASVATADQVQNAVRGRMSGDGGQSLPAESSRRWRDIFWSIVFMIHLISVGFVLVVLGLNRFKKSNRLQIDKYTNTIMENRVGLTED
        MGDSGSVSSSSFDS SVATAD+  N V+GRMSGDGGQ+L  ESSR WRDIFW  VFM+HLISVGFVLVVLGLNRFKKSNRLQIDKYTNTIMENRVGLTED
Subjt:  MGDSGSVSSSSFDSASVATADQVQNAVRGRMSGDGGQSLPAESSRRWRDIFWSIVFMIHLISVGFVLVVLGLNRFKKSNRLQIDKYTNTIMENRVGLTED

Query:  YWPLYALAGGVGSLLGWTWLFLLGSFANHVMKISVHILTTYLAVISVLCFWGQLFFWGVAFAIGAGLQFLYVISVIDRLPFTLLVLQKAVKMVSGLPEVI
        YWPLYALAGGVGSLLGWTWLFLLGSFANH MKISVHILTTYLAVISVLCFWGQLFFWGV F+IGAGLQFLYVISVIDRLPFTLLVLQKAVKMVSGLPEVI
Subjt:  YWPLYALAGGVGSLLGWTWLFLLGSFANHVMKISVHILTTYLAVISVLCFWGQLFFWGVAFAIGAGLQFLYVISVIDRLPFTLLVLQKAVKMVSGLPEVI

Query:  RVAYAFMIVMLLCMGIWSFGVAGIVASSMGDGGRWWLLVVFSISLFWTGAVLCNTLHVIVSGMVFLVLIHGGQEASSVPSTSLVKALRYAVTTSFGSICY
        RVAY FMIVMLLCMGIWSFGV+GIVASSMGDGGRWWLLVVFSISLFW GAVLCNTLHVIVSGMVFLVLIHGG+E+SS+PS SL+KA RYAVTTSFGSICY
Subjt:  RVAYAFMIVMLLCMGIWSFGVAGIVASSMGDGGRWWLLVVFSISLFWTGAVLCNTLHVIVSGMVFLVLIHGGQEASSVPSTSLVKALRYAVTTSFGSICY

Query:  GSLFTAAIRTLRWEIRGIRSKIGKNECLLCCVDFLFHLVETLVRFFNKYAYIQIAVYGKSFNRSARDAWELFQSTGVEALVAYDCSGAVLLMSTIMGGLT
        GSLFTAAIRTLRWEIRGIRSKIGKNECLLCCVDFLFHLVETLVRFFNKYAY+QIAVYGKSFNRSARDAWELFQSTGVE LVAYDCSGAVLLMST+MGGLT
Subjt:  GSLFTAAIRTLRWEIRGIRSKIGKNECLLCCVDFLFHLVETLVRFFNKYAYIQIAVYGKSFNRSARDAWELFQSTGVEALVAYDCSGAVLLMSTIMGGLT

Query:  AGTCAGVWTWIKWKDKVSMVACTSTLMGMVLVGLAIVIVESAVTSIYICYAEDPLLIHRWDAEFFNQMSEMLHQRLQHRSARAREVLTQYR
        AGTC+G+WTWIKWKDKVSMVACT+TLMGMVLVGLAIV+VESAVTSIYICYAE+PLLI +WDAEFFNQ+SEMLHQRLQHRSARAREVL+ YR
Subjt:  AGTCAGVWTWIKWKDKVSMVACTSTLMGMVLVGLAIVIVESAVTSIYICYAEDPLLIHRWDAEFFNQMSEMLHQRLQHRSARAREVLTQYR

A0A1S3BFC7 CTL-like protein DDB_G0274487; e-value: 4.3e-257; identity: 93.08%
Query:  MGDSGSVSSSSFDSASVATADQVQNAVRGRMSGDGGQSLPAESSRRWRDIFWSIVFMIHLISVGFVLVVLGLNRFKKSNRLQIDKYTNTIMENRVGLTED
        MGDSGSVSSSSFDS SVATADQ QN V+GRMSGDGGQ+LP ESSR WRDIFW  VFMIHLISVGFVLVVLGLNRFKKSNRLQIDKYTNTIMENR GLTED
Subjt:  MGDSGSVSSSSFDSASVATADQVQNAVRGRMSGDGGQSLPAESSRRWRDIFWSIVFMIHLISVGFVLVVLGLNRFKKSNRLQIDKYTNTIMENRVGLTED

Query:  YWPLYALAGGVGSLLGWTWLFLLGSFANHVMKISVHILTTYLAVISVLCFWGQLFFWGVAFAIGAGLQFLYVISVIDRLPFTLLVLQKAVKMVSGLPEVI
        YWPLYALAGGVGSLLGWTWLFLLGSFANHVMKISVHILTTYLAVISVLCFWGQLFFWGVAF+I AGLQFLYVISVIDRLPFTLLVLQKAVKMVSGLPEVI
Subjt:  YWPLYALAGGVGSLLGWTWLFLLGSFANHVMKISVHILTTYLAVISVLCFWGQLFFWGVAFAIGAGLQFLYVISVIDRLPFTLLVLQKAVKMVSGLPEVI

Query:  RVAYAFMIVMLLCMGIWSFGVAGIVASSMGDGGRWWLLVVFSISLFWTGAVLCNTLHVIVSGMVFLVLIHGGQEASSVPSTSLVKALRYAVTTSFGSICY
        RVAYAFMIVMLLCMGIWSFGV+GIVASSMGDGGRWWLLVVFSISLFW GAV CNTLHVIVSGMVFLVLIHGG+E+SS+PS SL+KA RYAVTTSFGSICY
Subjt:  RVAYAFMIVMLLCMGIWSFGVAGIVASSMGDGGRWWLLVVFSISLFWTGAVLCNTLHVIVSGMVFLVLIHGGQEASSVPSTSLVKALRYAVTTSFGSICY

Query:  GSLFTAAIRTLRWEIRGIRSKIGKNECLLCCVDFLFHLVETLVRFFNKYAYIQIAVYGKSFNRSARDAWELFQSTGVEALVAYDCSGAVLLMSTIMGGLT
        GSLFTAAIRTLRWEIRGIRSKIGKNECLLCCVDFLFHLVETLVRFFNKYAY+QIAVYGKSFNRSARDAWELFQSTGVE LVAYDCSGAVLLMST+MGGLT
Subjt:  GSLFTAAIRTLRWEIRGIRSKIGKNECLLCCVDFLFHLVETLVRFFNKYAYIQIAVYGKSFNRSARDAWELFQSTGVEALVAYDCSGAVLLMSTIMGGLT

Query:  AGTCAGVWTWIKWKDKVSMVACTSTLMGMVLVGLAIVIVESAVTSIYICYAEDPLLIHRWDAEFFNQMSEMLHQRLQHRSARAREVLTQYR
        AGTC+G+WTWIKWKDKVSMVACT+TLMGMVLVGLAIV+VESAVTSIYICYAE+PLLI +WDAEFFNQ+SEMLHQRLQHRSARAREVL+ YR
Subjt:  AGTCAGVWTWIKWKDKVSMVACTSTLMGMVLVGLAIVIVESAVTSIYICYAEDPLLIHRWDAEFFNQMSEMLHQRLQHRSARAREVLTQYR

A0A5A7SXE5 CTL-like protein; e-value: 4.3e-257; identity: 93.08%
Query:  MGDSGSVSSSSFDSASVATADQVQNAVRGRMSGDGGQSLPAESSRRWRDIFWSIVFMIHLISVGFVLVVLGLNRFKKSNRLQIDKYTNTIMENRVGLTED
        MGDSGSVSSSSFDS SVATADQ QN V+GRMSGDGGQ+LP ESSR WRDIFW  VFMIHLISVGFVLVVLGLNRFKKSNRLQIDKYTNTIMENR GLTED
Subjt:  MGDSGSVSSSSFDSASVATADQVQNAVRGRMSGDGGQSLPAESSRRWRDIFWSIVFMIHLISVGFVLVVLGLNRFKKSNRLQIDKYTNTIMENRVGLTED

Query:  YWPLYALAGGVGSLLGWTWLFLLGSFANHVMKISVHILTTYLAVISVLCFWGQLFFWGVAFAIGAGLQFLYVISVIDRLPFTLLVLQKAVKMVSGLPEVI
        YWPLYALAGGVGSLLGWTWLFLLGSFANHVMKISVHILTTYLAVISVLCFWGQLFFWGVAF+I AGLQFLYVISVIDRLPFTLLVLQKAVKMVSGLPEVI
Subjt:  YWPLYALAGGVGSLLGWTWLFLLGSFANHVMKISVHILTTYLAVISVLCFWGQLFFWGVAFAIGAGLQFLYVISVIDRLPFTLLVLQKAVKMVSGLPEVI

Query:  RVAYAFMIVMLLCMGIWSFGVAGIVASSMGDGGRWWLLVVFSISLFWTGAVLCNTLHVIVSGMVFLVLIHGGQEASSVPSTSLVKALRYAVTTSFGSICY
        RVAYAFMIVMLLCMGIWSFGV+GIVASSMGDGGRWWLLVVFSISLFW GAV CNTLHVIVSGMVFLVLIHGG+E+SS+PS SL+KA RYAVTTSFGSICY
Subjt:  RVAYAFMIVMLLCMGIWSFGVAGIVASSMGDGGRWWLLVVFSISLFWTGAVLCNTLHVIVSGMVFLVLIHGGQEASSVPSTSLVKALRYAVTTSFGSICY

Query:  GSLFTAAIRTLRWEIRGIRSKIGKNECLLCCVDFLFHLVETLVRFFNKYAYIQIAVYGKSFNRSARDAWELFQSTGVEALVAYDCSGAVLLMSTIMGGLT
        GSLFTAAIRTLRWEIRGIRSKIGKNECLLCCVDFLFHLVETLVRFFNKYAY+QIAVYGKSFNRSARDAWELFQSTGVE LVAYDCSGAVLLMST+MGGLT
Subjt:  GSLFTAAIRTLRWEIRGIRSKIGKNECLLCCVDFLFHLVETLVRFFNKYAYIQIAVYGKSFNRSARDAWELFQSTGVEALVAYDCSGAVLLMSTIMGGLT

Query:  AGTCAGVWTWIKWKDKVSMVACTSTLMGMVLVGLAIVIVESAVTSIYICYAEDPLLIHRWDAEFFNQMSEMLHQRLQHRSARAREVLTQYR
        AGTC+G+WTWIKWKDKVSMVACT+TLMGMVLVGLAIV+VESAVTSIYICYAE+PLLI +WDAEFFNQ+SEMLHQRLQHRSARAREVL+ YR
Subjt:  AGTCAGVWTWIKWKDKVSMVACTSTLMGMVLVGLAIVIVESAVTSIYICYAEDPLLIHRWDAEFFNQMSEMLHQRLQHRSARAREVLTQYR

A0A6J1DAC3 CTL-like protein DDB_G0274487; e-value: 3.8e-261; identity: 94.7%
Query:  MGDSGSVSSSSFDSASVATADQVQNAVRGRMSGDGGQSLPAESSRRWRDIFWSIVFMIHLISVGFVLVVLGLNRFKKSNRLQIDKYTNTIMENRVGLTED
        MGDSGSVSSSSFDSASVATADQVQNAV GRMSGDGGQ+L AESSRRW+DIFWSIVFMIHLISVGFVLVVLGLNRFKKSNRLQIDKYTNTIMENR GLTED
Subjt:  MGDSGSVSSSSFDSASVATADQVQNAVRGRMSGDGGQSLPAESSRRWRDIFWSIVFMIHLISVGFVLVVLGLNRFKKSNRLQIDKYTNTIMENRVGLTED

Query:  YWPLYALAGGVGSLLGWTWLFLLGSFANHVMKISVHILTTYLAVISVLCFWGQLFFWGVAFAIGAGLQFLYVISVIDRLPFTLLVLQKAVKMVSGLPEVI
        YWPLYALAGGVG+LLGWTWLFLLGSFA HVMKISVHILTT+LAVISVLCFWGQLFFWGVAF IGAGLQFLYVISVIDRLPFTLLVLQKAVKMVSGLPEVI
Subjt:  YWPLYALAGGVGSLLGWTWLFLLGSFANHVMKISVHILTTYLAVISVLCFWGQLFFWGVAFAIGAGLQFLYVISVIDRLPFTLLVLQKAVKMVSGLPEVI

Query:  RVAYAFMIVMLLCMGIWSFGVAGIVASSMGDGGRWWLLVVFSISLFWTGAVLCNTLHVIVSGMVFLVLIHGGQEASSVPSTSLVKALRYAVTTSFGSICY
        RVA+AFMIVMLLCMGIWSFGVAGIVASSMGDGGRWWLLVVFSISLFW GAVLCNTLHV+VSGMVFLVLIHGG+EASS+PS SLVK LRY+VTTSFGSICY
Subjt:  RVAYAFMIVMLLCMGIWSFGVAGIVASSMGDGGRWWLLVVFSISLFWTGAVLCNTLHVIVSGMVFLVLIHGGQEASSVPSTSLVKALRYAVTTSFGSICY

Query:  GSLFTAAIRTLRWEIRGIRSKIGKNECLLCCVDFLFHLVETLVRFFNKYAYIQIAVYGKSFNRSARDAWELFQSTGVEALVAYDCSGAVLLMSTIMGGLT
        GSLFTAAIRTLRWEIRGIRSKIGKNECLLCCVDFLFHLVETLV FFNKYAY+QIAVYGKSFNRSARDAWELFQSTGVEALVAYDCSG+VLLMSTIMGGLT
Subjt:  GSLFTAAIRTLRWEIRGIRSKIGKNECLLCCVDFLFHLVETLVRFFNKYAYIQIAVYGKSFNRSARDAWELFQSTGVEALVAYDCSGAVLLMSTIMGGLT

Query:  AGTCAGVWTWIKWKDKVSMVACTSTLMGMVLVGLAIVIVESAVTSIYICYAEDPLLIHRWDAEFFNQMSEMLHQRLQHRSARAREVLTQYR
        AGTC+G+WTWIKWKDKVSMVACT+TLMGMVLVGLAIV+VESAVTSIYICYAEDPLLIHRWDAEFFNQMSEMLHQRLQHRS+RAREVLT+YR
Subjt:  AGTCAGVWTWIKWKDKVSMVACTSTLMGMVLVGLAIVIVESAVTSIYICYAEDPLLIHRWDAEFFNQMSEMLHQRLQHRSARAREVLTQYR

A0A6J1GZB3 CTL-like protein DDB_G0274487 isoform X1; e-value: 3.1e-255; identity: 92.46%
Query:  MGDSGSVSSSSFDSASVATADQVQNAVRGRMSGDGGQSLPAESSRRWRDIFWSIVFMIHLISVGFVLVVLGLNRFKKSNRLQIDKYTNTIMENRVGLTED
        M DSGSVSSSSFDSAS+AT DQVQNAVRGRMSG GGQ+L  ESSR WRD+FWS VF+IHLISVGFVLVVLGLNRFKKSNRLQIDKYTNTIMENRVGLTED
Subjt:  MGDSGSVSSSSFDSASVATADQVQNAVRGRMSGDGGQSLPAESSRRWRDIFWSIVFMIHLISVGFVLVVLGLNRFKKSNRLQIDKYTNTIMENRVGLTED

Query:  YWPLYALAGGVGSLLGWTWLFLLGSFANHVMKISVHILTTYLAVISVLCFWGQLFFWGVAFAIGAGLQFLYVISVIDRLPFTLLVLQKAVKMVSGLPEVI
        YWPLYALAGGVGSLLGWTWLFLLGSFANHVMKISVHILTTYLAVISVLCFWGQLFFWGVAF+IGAGLQFLYVISVIDRLPFTLLVLQKAVKMVSGLPEVI
Subjt:  YWPLYALAGGVGSLLGWTWLFLLGSFANHVMKISVHILTTYLAVISVLCFWGQLFFWGVAFAIGAGLQFLYVISVIDRLPFTLLVLQKAVKMVSGLPEVI

Query:  RVAYAFMIVMLLCMGIWSFGVAGIVASSMGDGGRWWLLVVFSISLFWTGAVLCNTLHVIVSGMVFLVLIHGGQEASSVPSTSLVKALRYAVTTSFGSICY
        +VAYAFMIVMLLCMG+WSFGVAGIVASSMGDGGRWWLLVVFSISLFW G V CNTLHVIVSGMVFLVL HGG+E+SS+PS SL+KA RYAVTTSFGSICY
Subjt:  RVAYAFMIVMLLCMGIWSFGVAGIVASSMGDGGRWWLLVVFSISLFWTGAVLCNTLHVIVSGMVFLVLIHGGQEASSVPSTSLVKALRYAVTTSFGSICY

Query:  GSLFTAAIRTLRWEIRGIRSKIGKNECLLCCVDFLFHLVETLVRFFNKYAYIQIAVYGKSFNRSARDAWELFQSTGVEALVAYDCSGAVLLMSTIMGGLT
        GSLFTA IRTLRWEIRGIRSKIG+NECLLCCVDFLFHLVETLVRFFNKYAY+QIAVYGKSFNRSARDAWEL QSTGVEALVAYDCSGAVLLMSTIMGGLT
Subjt:  GSLFTAAIRTLRWEIRGIRSKIGKNECLLCCVDFLFHLVETLVRFFNKYAYIQIAVYGKSFNRSARDAWELFQSTGVEALVAYDCSGAVLLMSTIMGGLT

Query:  AGTCAGVWTWIKWKDKVSMVACTSTLMGMVLVGLAIVIVESAVTSIYICYAEDPLLIHRWDAEFFNQMSEMLHQRLQHRSARAREVLTQYR
        AGTC+G+WTWIKWKDKVSMVACT+TLMGMVLVGLAIV+VESAVTSIYICYAEDPLLI +WDAEFFNQ+SEMLHQRLQHRSARAREVLT  R
Subjt:  AGTCAGVWTWIKWKDKVSMVACTSTLMGMVLVGLAIVIVESAVTSIYICYAEDPLLIHRWDAEFFNQMSEMLHQRLQHRSARAREVLTQYR

SwissProt top hits (e-value, % identity, alignment)
Q9C6M2 Lysine histidine transporter-like 6; e-value: 2.6e-158; identity: 77.38%
Query:  QKWKLEDPSRRAKWWYSTFHTVTAMIGAGVLSLPYAMAYLGWGPGTMVLFLSWCMTLNTMWQMIQLHECAPGTRFDRYIDLGRYAFGQKLGPWIVLPQQL
        +KW  EDPSR AKWWYSTFHTVTAMIGAGVLSLPYAMAYLGWGPGT VL ++W +TLNTMWQM+QLHEC PGTRFDRYIDLGRYAFG KLGPWIVLPQQL
Subjt:  QKWKLEDPSRRAKWWYSTFHTVTAMIGAGVLSLPYAMAYLGWGPGTMVLFLSWCMTLNTMWQMIQLHECAPGTRFDRYIDLGRYAFGQKLGPWIVLPQQL

Query:  IVQVGCDIVYIVTGGKCMKKFMEMACVNCVQIRQSYWIVIFGSIHFFLSQLPNFNSVAGVSLAAAIMSLSYSTIAWAGSLSRGRMENVSYAYKKTSVQDS
        IVQVGC+IVY+VTGGKC+K+F+E+ C  C  +RQSYWI+ FG +HF LSQLPNFNSVAGVSLAAA+MSL YSTIAW GS++ GR+ +VSY YK T+  D 
Subjt:  IVQVGCDIVYIVTGGKCMKKFMEMACVNCVQIRQSYWIVIFGSIHFFLSQLPNFNSVAGVSLAAAIMSLSYSTIAWAGSLSRGRMENVSYAYKKTSVQDS

Query:  MFRVFNALGQISFAYAGHAVALEIQATIPSTPAKPSKVPMWKGAVGAYIINAICYFPVAFIGYWAFGQDVEDNILLNLKRPAWLIASANLMVVIHVIGSY
         FRVFNALGQISFA+AGHAVALEIQAT+PSTP +PSKVPMW+G +GAY++NA+CYFPVA I YWAFGQDV+DN+L+NL+RPAWLIA+ANLMVV+HVIGSY
Subjt:  MFRVFNALGQISFAYAGHAVALEIQATIPSTPAKPSKVPMWKGAVGAYIINAICYFPVAFIGYWAFGQDVEDNILLNLKRPAWLIASANLMVVIHVIGSY

Query:  QVYAMPVFDLLERMMMKKFNFPEGFCLRIITRSAYV
        QV+AMPVFDLLERMM+ KF F  G  LR  TR+ YV
Subjt:  QVYAMPVFDLLERMMMKKFNFPEGFCLRIITRSAYV

Q9FKS8 Lysine histidine transporter 1; e-value: 4.9e-125; identity: 62.8%
Query:  QKWKLEDPSRRAKWWYSTFHTVTAMIGAGVLSLPYAMAYLGWGPGTMVLFLSWCMTLNTMWQMIQLHECAPGTRFDRYIDLGRYAFGQKLGPWIVLPQQL
        + W     SR AKWWYS FH VTAM+GAGVL LPYAM+ LGWGPG  VL LSW +TL T+WQM+++HE  PG RFDRY +LG++AFG+KLG +IV+PQQL
Subjt:  QKWKLEDPSRRAKWWYSTFHTVTAMIGAGVLSLPYAMAYLGWGPGTMVLFLSWCMTLNTMWQMIQLHECAPGTRFDRYIDLGRYAFGQKLGPWIVLPQQL

Query:  IVQVGCDIVYIVTGGKCMKKFMEMACVNCVQIRQSYWIVIFGSIHFFLSQLPNFNSVAGVSLAAAIMSLSYSTIAWAGSLSRGRMENVSYAYKKTSVQDS
        IV++G  IVY+VTGGK +KKF E+ C +C  I+ +Y+I+IF S+HF LS LPNFNS++GVSLAAA+MSLSYSTIAWA S S+G  E+V Y YK  +   +
Subjt:  IVQVGCDIVYIVTGGKCMKKFMEMACVNCVQIRQSYWIVIFGSIHFFLSQLPNFNSVAGVSLAAAIMSLSYSTIAWAGSLSRGRMENVSYAYKKTSVQDS

Query:  MFRVFNALGQISFAYAGHAVALEIQATIPSTPAKPSKVPMWKGAVGAYIINAICYFPVAFIGYWAFGQDVEDNILLNLKRPAWLIASANLMVVIHVIGSY
        +F  F+ LG ++FAYAGH V LEIQATIPSTP KPSK PMW+G + AYI+ A+CYFPVA +GY+ FG  VEDNIL++LK+PAWLIA+AN+ VVIHVIGSY
Subjt:  MFRVFNALGQISFAYAGHAVALEIQATIPSTPAKPSKVPMWKGAVGAYIINAICYFPVAFIGYWAFGQDVEDNILLNLKRPAWLIASANLMVVIHVIGSY

Query:  QVYAMPVFDLLERMMMKKFNFPEGFCLRIITRSAYV
        Q+YAMPVFD++E +++KK NF     LR   R+ YV
Subjt:  QVYAMPVFDLLERMMMKKFNFPEGFCLRIITRSAYV

Q9LRB5 Lysine histidine transporter 2; e-value: 1.6e-123; identity: 60.69%
Query:  EVPADDQK----WKLEDPSRRAKWWYSTFHTVTAMIGAGVLSLPYAMAYLGWGPGTMVLFLSWCMTLNTMWQMIQLHECAPGTRFDRYIDLGRYAFGQKL
        EV A  QK    W     SR AKWWYS FH VTAM+GAGVLSLPYAM+ LGWGPG  ++ +SW +TL T+WQM+++HE  PG R DRY +LG++AFG+KL
Subjt:  EVPADDQK----WKLEDPSRRAKWWYSTFHTVTAMIGAGVLSLPYAMAYLGWGPGTMVLFLSWCMTLNTMWQMIQLHECAPGTRFDRYIDLGRYAFGQKL

Query:  GPWIVLPQQLIVQVGCDIVYIVTGGKCMKKFMEMACVNCVQIRQSYWIVIFGSIHFFLSQLPNFNSVAGVSLAAAIMSLSYSTIAWAGSLSRGRMENVSY
        G WIV+PQQLIV+VG DIVY+VTGG  +KK  ++ C +C +IR ++WI+IF S+HF +S LPNFNS++ +SLAAA+MSL+YSTIAWA S+ +G   +V Y
Subjt:  GPWIVLPQQLIVQVGCDIVYIVTGGKCMKKFMEMACVNCVQIRQSYWIVIFGSIHFFLSQLPNFNSVAGVSLAAAIMSLSYSTIAWAGSLSRGRMENVSY

Query:  AYKKTSVQDSMFRVFNALGQISFAYAGHAVALEIQATIPSTPAKPSKVPMWKGAVGAYIINAICYFPVAFIGYWAFGQDVEDNILLNLKRPAWLIASANL
        + + ++    +F   NALG ++FAYAGH V LEIQATIPSTP  PSKVPMW+G + AYI+ AICYFPVAF+GY+ FG  V+DNIL+ L++P WLIA AN+
Subjt:  AYKKTSVQDSMFRVFNALGQISFAYAGHAVALEIQATIPSTPAKPSKVPMWKGAVGAYIINAICYFPVAFIGYWAFGQDVEDNILLNLKRPAWLIASANL

Query:  MVVIHVIGSYQVYAMPVFDLLERMMMKKFNFPEGFCLRIITRSAYV
         VVIHVIGSYQ++AMPVFD+LE +++KK NF   F LR ITRS YV
Subjt:  MVVIHVIGSYQVYAMPVFDLLERMMMKKFNFPEGFCLRIITRSAYV

Q9LS40 Protein ASPARTIC PROTEASE IN GUARD CELL 1; e-value: 2.7e-131; identity: 52.13%
Query:  ISLLLLIVVFAFF-SGNVRSRGLPRRAKTSVLDVAASIQRTREIFAFD---------AKSPLQEERCVSDSSSMALQLNSRISIQRTSHSDYKSLTLARL
        +SLL ++ +  F  + +  SR L    KT+VLDV +S+Q+T+ I + D             L +    + SS ++L+L+SR +   + H DYKSLTL+RL
Subjt:  ISLLLLIVVFAFF-SGNVRSRGLPRRAKTSVLDVAASIQRTREIFAFD---------AKSPLQEERCVSDSSSMALQLNSRISIQRTSHSDYKSLTLARL

Query:  GRDSARVRSLTARIDLAVRG---ADLKPFGNGDGSQFGAEDFESPIVSGASQGSGEYFSRVGIGKPPSPVYMVLDTGSDVSWVQCAPCADCYEQTDPIFE
         RDS+RV  + A+I  AV G   +DLKP  N D +++  ED  +P+VSGASQGSGEYFSR+G+G P   +Y+VLDTGSDV+W+QC PCADCY+Q+DP+F 
Subjt:  GRDSARVRSLTARIDLAVRG---ADLKPFGNGDGSQFGAEDFESPIVSGASQGSGEYFSRVGIGKPPSPVYMVLDTGSDVSWVQCAPCADCYEQTDPIFE

Query:  PASSTSFRL------------SPAKQSNAS----SYGDGSYTVGDFVTETVTLG-STSLRNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQLNVSSFS
        P SS++++             + A +SN      SYGDGS+TVG+  T+TVT G S  + N+A+GCGH+NEGLF GAAGLLGLGGG LS  +Q+  +SFS
Subjt:  PASSTSFRL------------SPAKQSNAS----SYGDGSYTVGDFVTETVTLG-STSLRNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQLNVSSFS

Query:  YCLVDRDSDSASTLDFNS-PLPPDAVTAPLHRNRNLNTFFYLGVTGLSVGGELLPIPESSFQMSEDGNGGIIIDSGTAVTRLQTTTYNLLRDAFVKSTHD
        YCLVDRDS  +S+LDFNS  L     TAPL RN+ ++TF+Y+G++G SVGGE + +P++ F +   G+GG+I+D GTAVTRLQT  YN LRDAF+K T +
Subjt:  YCLVDRDSDSASTLDFNS-PLPPDAVTAPLHRNRNLNTFFYLGVTGLSVGGELLPIPESSFQMSEDGNGGIIIDSGTAVTRLQTTTYNLLRDAFVKSTHD

Query:  L-QSTRGVALFDTCYDLSSKSRVEVPTVSFHFPDGKELPLPAKNYLIPVDSDGTFCFAFAPTDSALSIIGNAQQQGTLTVRGKYRKFLDVVGI
        L + +  ++LFDTCYD SS S V+VPTV+FHF  GK L LPAKNYLIPVD  GTFCFAFAPT S+LSIIGN QQQGT   R  Y    +V+G+
Subjt:  L-QSTRGVALFDTCYDLSSKSRVEVPTVSFHFPDGKELPLPAKNYLIPVDSDGTFCFAFAPTDSALSIIGNAQQQGTLTVRGKYRKFLDVVGI

Q9SR44 Lysine histidine transporter-like 2; e-value: 2.4e-124; identity: 62.87%
Query:  WKLEDPSRRAKWWYSTFHTVTAMIGAGVLSLPYAMAYLGWGPGTMVLFLSWCMTLNTMWQMIQLHECAPGTRFDRYIDLGRYAFGQKLGPWIVLPQQLIV
        W     SR AKWWYS FH VTAM+GAGVLSLPYAM+ LGWGPG  ++ +SW +T  T+WQM+Q+HE  PG RFDRY +LG++AFG+KLG WIV+PQQLIV
Subjt:  WKLEDPSRRAKWWYSTFHTVTAMIGAGVLSLPYAMAYLGWGPGTMVLFLSWCMTLNTMWQMIQLHECAPGTRFDRYIDLGRYAFGQKLGPWIVLPQQLIV

Query:  QVGCDIVYIVTGGKCMKKFMEMACVNCVQIRQSYWIVIFGSIHFFLSQLPNFNSVAGVSLAAAIMSLSYSTIAWAGSLSRGRMENVSYAYKKTSVQDSMF
        +VG DIVY+VTGGK +KK  ++ C +C  IR +YWI+IF SIHF L+ LPNFNS++ VSLAAA+MSLSYSTIAWA S+ +G   NV Y+ + ++   ++F
Subjt:  QVGCDIVYIVTGGKCMKKFMEMACVNCVQIRQSYWIVIFGSIHFFLSQLPNFNSVAGVSLAAAIMSLSYSTIAWAGSLSRGRMENVSYAYKKTSVQDSMF

Query:  RVFNALGQISFAYAGHAVALEIQATIPSTPAKPSKVPMWKGAVGAYIINAICYFPVAFIGYWAFGQDVEDNILLNLKRPAWLIASANLMVVIHVIGSYQV
           NALG ++FAYAGH V LEIQATIPSTP KPSK+ MWKG V AYI+ AICYFPVAF+ Y+ FG  V+DNIL+ L++P WLIA AN  VV+HVIGSYQ+
Subjt:  RVFNALGQISFAYAGHAVALEIQATIPSTPAKPSKVPMWKGAVGAYIINAICYFPVAFIGYWAFGQDVEDNILLNLKRPAWLIASANLMVVIHVIGSYQV

Query:  YAMPVFDLLERMMMKKFNFPEGFCLRIITRSAYV
        YAMPVFD+LE  ++KK  F   F LR ITR+ YV
Subjt:  YAMPVFDLLERMMMKKFNFPEGFCLRIITRSAYV

Arabidopsis top hits (e-value, % identity, alignment)
AT1G25500.1 Plasma-membrane choline transporter family protein; e-value: 6.3e-176; identity: 73.15%
Query:  MGDSGSVSSSSFDSASVATADQVQNAVRGRMSGDGGQSLPAESSRRWRDIFWSIVFMIHLISVGFVLVVLGLNRFKKSNRLQIDKYTNTIMENRVGLTED
        M DS +  S S D AS+++ D  Q+  R R SGDG +++  E SR W D+FWS +F+IHLI +GFVL VLGLNRF+ S+RL ID+YT   +EN  GLTED
Subjt:  MGDSGSVSSSSFDSASVATADQVQNAVRGRMSGDGGQSLPAESSRRWRDIFWSIVFMIHLISVGFVLVVLGLNRFKKSNRLQIDKYTNTIMENRVGLTED

Query:  YWPLYALAGGVGSLLGWTWLFLLGSFANHVMKISVHILTTYLAVISVLCFWGQLFFWGVAFAIGAGLQFLYVISVIDRLPFTLLVLQKAVKMVSGLPEVI
        YWPLYA+AGG+G  + W W  LLGS+AN +MK+SVHILTTYLAV+SVLCFW +LFFWG AFA+G+ LQFLYVISVIDRLPFT+LVL+KA+K+V GLP+VI
Subjt:  YWPLYALAGGVGSLLGWTWLFLLGSFANHVMKISVHILTTYLAVISVLCFWGQLFFWGVAFAIGAGLQFLYVISVIDRLPFTLLVLQKAVKMVSGLPEVI

Query:  RVAYAFMIVMLLCMGIWSFGVAGIVASSMGDGGRWWLLVVFSISLFWTGAVLCNTLHVIVSGMVFLVLIHGGQ-EASSVPSTSLVKALRYAVTTSFGSIC
         VA+AF +VMLL M +WSFG AG+VASSMGD GRWWLLVV S+SLFWTGAVLCNT+HVIVSGMVF VL H GQ E+SS+P +SLV +LRYAVTTSFGSIC
Subjt:  RVAYAFMIVMLLCMGIWSFGVAGIVASSMGDGGRWWLLVVFSISLFWTGAVLCNTLHVIVSGMVFLVLIHGGQ-EASSVPSTSLVKALRYAVTTSFGSIC

Query:  YGSLFTAAIRTLRWEIRGIRSKIGKNECLLCCVDFLFHLVETLVRFFNKYAYIQIAVYGKSFNRSARDAWELFQSTGVEALVAYDCSGAVLLMSTIMGGL
        YGSLFTAAIRTLRWEIRG RSKI  NECLLCCVDFLFHLVETLVRFFNKYAY+QIAVYGK FN+SARDAWELFQSTGVEALVAYDCSGAVLLM TI GGL
Subjt:  YGSLFTAAIRTLRWEIRGIRSKIGKNECLLCCVDFLFHLVETLVRFFNKYAYIQIAVYGKSFNRSARDAWELFQSTGVEALVAYDCSGAVLLMSTIMGGL

Query:  TAGTCAGVWTWIKWKDKVSMVACTSTLMGMVL
          G+C G+W WIK+ D+V MVA T+ LMGMVL
Subjt:  TAGTCAGVWTWIKWKDKVSMVACTSTLMGMVL

AT1G25500.2 Plasma-membrane choline transporter family protein; e-value: 6.0e-203; identity: 73.82%
Query:  MGDSGSVSSSSFDSASVATADQVQNAVRGRMSGDGGQSLPAESSRRWRDIFWSIVFMIHLISVGFVLVVLGLNRFKKSNRLQIDKYTNTIMENRVGLTED
        M DS +  S S D AS+++ D  Q+  R R SGDG +++  E SR W D+FWS +F+IHLI +GFVL VLGLNRF+ S+RL ID+YT   +EN  GLTED
Subjt:  MGDSGSVSSSSFDSASVATADQVQNAVRGRMSGDGGQSLPAESSRRWRDIFWSIVFMIHLISVGFVLVVLGLNRFKKSNRLQIDKYTNTIMENRVGLTED

Query:  YWPLYALAGGVGSLLGWTWLFLLGSFANHVMKISVHILTTYLAVISVLCFWGQLFFWGVAFAIGAGLQFLYVISVIDRLPFTLLVLQKAVKMVSGLPEVI
        YWPLYA+AGG+G  + W W  LLGS+AN +MK+SVHILTTYLAV+SVLCFW +LFFWG AFA+G+ LQFLYVISVIDRLPFT+LVL+KA+K+V GLP+VI
Subjt:  YWPLYALAGGVGSLLGWTWLFLLGSFANHVMKISVHILTTYLAVISVLCFWGQLFFWGVAFAIGAGLQFLYVISVIDRLPFTLLVLQKAVKMVSGLPEVI

Query:  RVAYAFMIVMLLCMGIWSFGVAGIVASSMGDGGRWWLLVVFSISLFWTGAVLCNTLHVIVSGMVFLVLIHGGQ-EASSVPSTSLVKALRYAVTTSFGSIC
         VA+AF +VMLL M +WSFG AG+VASSMGD GRWWLLVV S+SLFWTGAVLCNT+HVIVSGMVF VL H GQ E+SS+P +SLV +LRYAVTTSFGSIC
Subjt:  RVAYAFMIVMLLCMGIWSFGVAGIVASSMGDGGRWWLLVVFSISLFWTGAVLCNTLHVIVSGMVFLVLIHGGQ-EASSVPSTSLVKALRYAVTTSFGSIC

Query:  YGSLFTAAIRTLRWEIRGIRSKIGKNECLLCCVDFLFHLVETLVRFFNKYAYIQIAVYGKSFNRSARDAWELFQSTGVEALVAYDCSGAVLLMSTIMGGL
        YGSLFTAAIRTLRWEIRG RSKI  NECLLCCVDFLFHLVETLVRFFNKYAY+QIAVYGK FN+SARDAWELFQSTGVEALVAYDCSGAVLLM TI GGL
Subjt:  YGSLFTAAIRTLRWEIRGIRSKIGKNECLLCCVDFLFHLVETLVRFFNKYAYIQIAVYGKSFNRSARDAWELFQSTGVEALVAYDCSGAVLLMSTIMGGL

Query:  TAGTCAGVWTWIKWKDKVSMVACTSTLMGMVLVGLAIVIVESAVTSIYICYAEDPLLIHRWDAEFFNQMSEMLHQRLQHRSARAREVLT
          G+C G+W WIK+ D+V MVA T+ LMGMVLVGL +V+VESAVTSIYIC+AEDPLLI RWDA+F+ +MSE+LH+RLQHRS+RAREVLT
Subjt:  TAGTCAGVWTWIKWKDKVSMVACTSTLMGMVLVGLAIVIVESAVTSIYICYAEDPLLIHRWDAEFFNQMSEMLHQRLQHRSARAREVLT

AT1G25500.3 Plasma-membrane choline transporter family protein; e-value: 3.1e-183; identity: 77.91%
Query:  SNRLQIDKYTNTIMENRVGLTEDYWPLYALAGGVGSLLGWTWLFLLGSFANHVMKISVHILTTYLAVISVLCFWGQLFFWGVAFAIGAGLQFLYVISVID
        S+RL ID+YT   +EN  GLTEDYWPLYA+AGG+G  + W W  LLGS+AN +MK+SVHILTTYLAV+SVLCFW +LFFWG AFA+G+ LQFLYVISVID
Subjt:  SNRLQIDKYTNTIMENRVGLTEDYWPLYALAGGVGSLLGWTWLFLLGSFANHVMKISVHILTTYLAVISVLCFWGQLFFWGVAFAIGAGLQFLYVISVID

Query:  RLPFTLLVLQKAVKMVSGLPEVIRVAYAFMIVMLLCMGIWSFGVAGIVASSMGDGGRWWLLVVFSISLFWTGAVLCNTLHVIVSGMVFLVLIHGGQ-EAS
        RLPFT+LVL+KA+K+V GLP+VI VA+AF +VMLL M +WSFG AG+VASSMGD GRWWLLVV S+SLFWTGAVLCNT+HVIVSGMVF VL H GQ E+S
Subjt:  RLPFTLLVLQKAVKMVSGLPEVIRVAYAFMIVMLLCMGIWSFGVAGIVASSMGDGGRWWLLVVFSISLFWTGAVLCNTLHVIVSGMVFLVLIHGGQ-EAS

Query:  SVPSTSLVKALRYAVTTSFGSICYGSLFTAAIRTLRWEIRGIRSKIGKNECLLCCVDFLFHLVETLVRFFNKYAYIQIAVYGKSFNRSARDAWELFQSTG
        S+P +SLV +LRYAVTTSFGSICYGSLFTAAIRTLRWEIRG RSKI  NECLLCCVDFLFHLVETLVRFFNKYAY+QIAVYGK FN+SARDAWELFQSTG
Subjt:  SVPSTSLVKALRYAVTTSFGSICYGSLFTAAIRTLRWEIRGIRSKIGKNECLLCCVDFLFHLVETLVRFFNKYAYIQIAVYGKSFNRSARDAWELFQSTG

Query:  VEALVAYDCSGAVLLMSTIMGGLTAGTCAGVWTWIKWKDKVSMVACTSTLMGMVLVGLAIVIVESAVTSIYICYAEDPLLIHRWDAEFFNQMSEMLHQRL
        VEALVAYDCSGAVLLM TI GGL  G+C G+W WIK+ D+V MVA T+ LMGMVLVGL +V+VESAVTSIYIC+AEDPLLI RWDA+F+ +MSE+LH+RL
Subjt:  VEALVAYDCSGAVLLMSTIMGGLTAGTCAGVWTWIKWKDKVSMVACTSTLMGMVLVGLAIVIVESAVTSIYICYAEDPLLIHRWDAEFFNQMSEMLHQRL

Query:  QHRSARAREVLT
        QHRS+RAREVLT
Subjt:  QHRSARAREVLT

AT1G25510.1 Eukaryotic aspartyl protease family protein    e value: 1.1e-156    %identity: 63.4
Query:  VVFAFFSGNVRSRGLPRRA--KTSVLDVAASIQRTREIFAFDAKSPLQEERCVSDSSSMALQLNSRISIQRTSHSDYKSLTLARLGRDSARVRSLTARID
        + F     +V SR LP  +   TS+L+VA SI RT+   +F      QEE+  S SSS +LQL+SR+S++ T HSDYKSLTLARL RD+ARV+SL  R+D
Subjt:  VVFAFFSGNVRSRGLPRRA--KTSVLDVAASIQRTREIFAFDAKSPLQEERCVSDSSSMALQLNSRISIQRTSHSDYKSLTLARLGRDSARVRSLTARID

Query:  LAVRG---ADLKPFGNGDGSQFGAEDFESPIVSGASQGSGEYFSRVGIGKPPSPVYMVLDTGSDVSWVQCAPCADCYEQTDPIFEPASSTSFR-------
        LA+     ADLKP      ++   +D E+P++SG +QGSGEYF+RVGIGKP   VYMVLDTGSDV+W+QC PCADCY QT+PIFEP+SS+S+        
Subjt:  LAVRG---ADLKPFGNGDGSQFGAEDFESPIVSGASQGSGEYFSRVGIGKPPSPVYMVLDTGSDVSWVQCAPCADCYEQTDPIFEPASSTSFR-------

Query:  ----LSPAKQSNAS-----SYGDGSYTVGDFVTETVTLGSTSLRNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQLNVSSFSYCLVDRDSDSASTLDF
            L  ++  NA+     SYGDGSYTVGDF TET+T+GST ++N+A+GCGH+NEGLFVGAAGLLGLGGG L+ PSQLN +SFSYCLVDRDSDSAST+DF
Subjt:  ----LSPAKQSNAS-----SYGDGSYTVGDFVTETVTLGSTSLRNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQLNVSSFSYCLVDRDSDSASTLDF

Query:  NSPLPPDAVTAPLHRNRNLNTFFYLGVTGLSVGGELLPIPESSFQMSEDGNGGIIIDSGTAVTRLQTTTYNLLRDAFVKSTHDLQSTRGVALFDTCYDLS
         + L PDAV APL RN  L+TF+YLG+TG+SVGGELL IP+SSF+M E G+GGIIIDSGTAVTRLQT  YN LRD+FVK T DL+   GVA+FDTCY+LS
Subjt:  NSPLPPDAVTAPLHRNRNLNTFFYLGVTGLSVGGELLPIPESSFQMSEDGNGGIIIDSGTAVTRLQTTTYNLLRDAFVKSTHDLQSTRGVALFDTCYDLS

Query:  SKSRVEVPTVSFHFPDGKELPLPAKNYLIPVDSDGTFCFAFAPTDSALSIIGNAQQQGT
        +K+ VEVPTV+FHFP GK L LPAKNY+IPVDS GTFC AFAPT S+L+IIGN QQQGT
Subjt:  SKSRVEVPTVSFHFPDGKELPLPAKNYLIPVDSDGTFCFAFAPTDSALSIIGNAQQQGT

AT1G25530.1 Transmembrane amino acid transporter family protein    e value: 1.8e-159    %identity: 77.38
Query:  QKWKLEDPSRRAKWWYSTFHTVTAMIGAGVLSLPYAMAYLGWGPGTMVLFLSWCMTLNTMWQMIQLHECAPGTRFDRYIDLGRYAFGQKLGPWIVLPQQL
        +KW  EDPSR AKWWYSTFHTVTAMIGAGVLSLPYAMAYLGWGPGT VL ++W +TLNTMWQM+QLHEC PGTRFDRYIDLGRYAFG KLGPWIVLPQQL
Subjt:  QKWKLEDPSRRAKWWYSTFHTVTAMIGAGVLSLPYAMAYLGWGPGTMVLFLSWCMTLNTMWQMIQLHECAPGTRFDRYIDLGRYAFGQKLGPWIVLPQQL

Query:  IVQVGCDIVYIVTGGKCMKKFMEMACVNCVQIRQSYWIVIFGSIHFFLSQLPNFNSVAGVSLAAAIMSLSYSTIAWAGSLSRGRMENVSYAYKKTSVQDS
        IVQVGC+IVY+VTGGKC+K+F+E+ C  C  +RQSYWI+ FG +HF LSQLPNFNSVAGVSLAAA+MSL YSTIAW GS++ GR+ +VSY YK T+  D 
Subjt:  IVQVGCDIVYIVTGGKCMKKFMEMACVNCVQIRQSYWIVIFGSIHFFLSQLPNFNSVAGVSLAAAIMSLSYSTIAWAGSLSRGRMENVSYAYKKTSVQDS

Query:  MFRVFNALGQISFAYAGHAVALEIQATIPSTPAKPSKVPMWKGAVGAYIINAICYFPVAFIGYWAFGQDVEDNILLNLKRPAWLIASANLMVVIHVIGSY
         FRVFNALGQISFA+AGHAVALEIQAT+PSTP +PSKVPMW+G +GAY++NA+CYFPVA I YWAFGQDV+DN+L+NL+RPAWLIA+ANLMVV+HVIGSY
Subjt:  MFRVFNALGQISFAYAGHAVALEIQATIPSTPAKPSKVPMWKGAVGAYIINAICYFPVAFIGYWAFGQDVEDNILLNLKRPAWLIASANLMVVIHVIGSY

Query:  QVYAMPVFDLLERMMMKKFNFPEGFCLRIITRSAYV
        QV+AMPVFDLLERMM+ KF F  G  LR  TR+ YV
Subjt:  QVYAMPVFDLLERMMMKKFNFPEGFCLRIITRSAYV


Sequences
CDS sequence
GGTTACAGGAGTGCTTCAGCTTCTGATAAGCTTGCAGCAAACTGGCCACAGTCAATGCAAATAGTTCGGCTTATCTCACAAGATCACATGAATAACAAGCAATATGTTGG
AAAAGCTGATTTTCTTGTTTTCCGAGCTATGAATCAGCATGGGTTTCTTGGCCAGCTTCAAGAAAAGAAACTTTGTGCAGTCATCCAGTTACCATCCCAGACTTTACTGC
TATCTGTTTCTGATAAAGCTTGCCGCTTGATAGGGATGCTTTTTCCTGGGGAAGTTCCAGCCGATGACCAGAAATGGAAGTTGGAGGACCCTTCCCGCCGAGCCAAATGG
TGGTACTCCACCTTCCACACAGTCACCGCCATGATCGGCGCTGGCGTTCTCAGCTTGCCCTACGCCATGGCCTACTTGGGATGGGGTCCAGGAACGATGGTTTTGTTCCT
ATCATGGTGCATGACTCTAAACACGATGTGGCAGATGATTCAACTGCATGAATGTGCGCCCGGGACTCGTTTCGATCGATATATCGACCTTGGCCGATACGCTTTCGGCC
AAAAACTCGGCCCTTGGATCGTCCTGCCGCAGCAGCTAATCGTCCAAGTTGGCTGCGACATTGTCTACATAGTTACAGGAGGCAAGTGCATGAAGAAGTTCATGGAGATG
GCTTGTGTCAACTGCGTTCAAATCAGGCAATCCTACTGGATTGTGATCTTCGGTTCAATTCACTTTTTTCTCTCTCAACTTCCCAACTTCAATTCTGTTGCTGGGGTTTC
TCTAGCAGCTGCCATCATGTCGCTCAGTTACTCGACGATCGCGTGGGCAGGAAGCTTGAGCCGAGGGCGAATGGAAAACGTAAGCTACGCGTACAAGAAAACCAGCGTCC
AAGATTCCATGTTCAGAGTCTTCAACGCTCTCGGCCAAATCTCCTTCGCATACGCCGGCCACGCCGTCGCTCTAGAGATTCAGGCCACCATTCCCTCCACTCCGGCCAAG
CCCTCCAAAGTTCCCATGTGGAAAGGCGCCGTCGGCGCTTACATCATCAACGCCATCTGCTATTTCCCCGTCGCCTTCATCGGCTACTGGGCTTTCGGCCAAGACGTTGA
AGACAACATCCTTCTCAATCTCAAGAGACCCGCATGGCTCATCGCCTCTGCTAATCTCATGGTCGTCATCCATGTCATTGGCAGCTACCAGGTCTACGCCATGCCTGTGT
TTGACTTGTTGGAGAGGATGATGATGAAGAAATTCAACTTCCCAGAAGGATTCTGCCTCAGAATAATCACTCGATCGGCTTATGTTGGGGACAAAAACTCTCTCTGTAAA
GGGCTATACTCAAACTTTGGCGCATCAGTTGTTAAATCAGCAACAACGGCAGGAAGAGGAGGAGGGTTGTTGGAGAGGCCTGTTATTGAGAAAGCTACTCCAGGAAGAGA
ATCCGAGTTTGATTTGAGGAGTTCAAGGAAAGTGGCGCCACCATATCGGGTCATTCTGCATAACGATGACTTCAATAAGCGGGAATACGTCGTTCAAGTCCTGATGAAAG
TTATCCCCGGAATGATGCTTGACAACGCAGTTAACATCATGCAAGAAGCTCACTGCAATGGCTTGTCACTGGTGATCATTTGTGCTCAGGCTGATGCTGAAGAACACTGT
ATGCAGCTCAGAGGCAATGGGCTGTTAAGTTCAATTGAACCTGCAAGTGTAAATGACACTCGCGGCTTCGTCTTCTTCTCCGGCCTTCGCTCTCTAAGCTTAAATACAGC
GTCTCCCTCTGTTTTCGCTTCTCAACCAGTCCTTTCTTTTCTCTATCTTCCCACTTCTATGGCGAAGATTTCTCTGCTTCTTCTTATCGTTGTTTTCGCCTTTTTTTCCG
GAAATGTTCGCTCCCGGGGCTTGCCTCGGAGGGCGAAGACCTCGGTGCTCGATGTGGCGGCTTCGATTCAGAGGACTCGAGAGATTTTCGCATTTGATGCTAAATCTCCT
CTGCAAGAAGAAAGATGCGTCTCCGATTCCTCTTCGATGGCTCTGCAGTTGAATTCGAGGATTTCCATTCAGAGAACCTCGCATAGTGACTACAAATCGCTTACGTTAGC
CAGACTCGGGCGTGACTCTGCTCGAGTCAGATCTTTGACTGCCAGGATAGATCTAGCCGTTCGAGGAGCAGATCTTAAACCCTTCGGTAATGGCGATGGTTCGCAGTTTG
GCGCTGAGGATTTCGAGAGTCCGATTGTATCGGGGGCGAGTCAGGGAAGCGGTGAGTACTTCTCCCGCGTCGGAATCGGTAAACCGCCGAGTCCAGTTTACATGGTACTC
GATACCGGCAGTGATGTAAGCTGGGTACAATGCGCGCCCTGCGCCGATTGCTACGAGCAAACCGATCCAATCTTCGAACCTGCTTCGTCTACTTCTTTTCGTCTCTCTCC
TGCCAAACAGAGCAATGCAAGTTCCTACGGCGACGGCTCCTATACCGTCGGCGACTTTGTTACCGAGACCGTCACTCTTGGCTCGACTTCCCTCCGAAACATCGCCATCG
GCTGTGGCCACAACAATGAGGGTCTGTTCGTCGGCGCAGCCGGCTTGCTCGGACTCGGCGGCGGCTCGCTTTCGTTCCCTTCGCAGCTCAATGTCTCGTCCTTCTCATAC
TGTCTCGTGGACCGTGACTCTGACTCAGCTTCCACTCTTGATTTCAACTCCCCGTTGCCTCCCGACGCCGTCACTGCCCCGTTACACCGCAACCGTAATCTGAACACGTT
CTTCTACCTCGGCGTGACGGGGCTGAGCGTAGGAGGTGAGCTACTCCCGATTCCCGAGTCGTCGTTCCAAATGAGCGAGGACGGAAACGGTGGCATCATCATCGACTCCG
GCACCGCCGTGACTCGGTTGCAGACGACCACTTACAACCTCCTGCGCGACGCGTTCGTCAAGAGCACACACGACCTGCAGTCCACCCGCGGCGTCGCGTTGTTCGACACC
TGTTACGACCTGTCGTCGAAGTCGAGAGTCGAAGTGCCGACGGTGTCGTTTCACTTCCCCGACGGGAAAGAGTTGCCGTTGCCGGCCAAAAACTACCTGATACCGGTTGA
CTCCGATGGGACGTTTTGTTTCGCCTTCGCTCCTACAGATTCGGCATTGTCAATCATCGGGAACGCACAACAGCAAGGGACACTAACGGTAAGAGGCAAATATCGTAAAT
TCTTAGATGTTGTGGGGATATTACAGGCAAGAAGCAGAGACGAACGAAGCCCTAAAAGCAACCGTTTTGATCCCTCCCCCTATCGTTTTCTTAACCACTTTTCACGTTCT
CGTCAACCTCATTATGATTCAATCCGAAAGCTTCCAATTTTCCAACCCATTTCAGAAAAATGCTCGTTAGTTAGTCCTTTCACAATGGGCGACTCCGGCAGCGTCTCCTC
TTCTTCCTTTGATTCCGCCTCCGTCGCCACCGCTGATCAGGTTCAGAATGCAGTTCGGGGTCGTATGAGTGGCGACGGAGGTCAATCACTGCCGGCAGAATCATCACGTC
GTTGGCGAGATATTTTCTGGTCGATTGTGTTTATGATTCATTTGATCAGTGTGGGATTTGTGCTCGTGGTTCTCGGGCTTAATAGGTTCAAGAAAAGCAATAGGCTTCAA
ATCGATAAGTACACCAATACAATCATGGAGAACCGGGTTGGATTGACAGAGGATTACTGGCCGCTGTATGCTTTAGCAGGCGGAGTTGGAAGCTTACTTGGATGGACTTG
GTTGTTCCTACTTGGTTCTTTTGCAAATCATGTTATGAAGATTTCTGTACACATTTTGACTACGTATCTTGCTGTGATAAGTGTTTTGTGCTTCTGGGGCCAACTCTTTT
TCTGGGGTGTAGCATTTGCAATTGGTGCAGGGTTGCAGTTCCTCTATGTTATATCAGTTATAGACAGACTACCATTTACACTGCTAGTGTTGCAAAAGGCTGTAAAGATG
GTATCCGGGCTTCCTGAAGTTATTAGAGTTGCATATGCTTTCATGATTGTCATGCTTTTGTGTATGGGAATTTGGTCTTTTGGAGTAGCTGGAATTGTGGCATCAAGTAT
GGGTGACGGTGGACGCTGGTGGCTACTTGTGGTTTTTTCGATAAGCTTATTTTGGACTGGTGCTGTTTTATGTAATACTCTACATGTTATAGTTTCCGGAATGGTGTTTC
TAGTTCTAATCCATGGGGGCCAAGAGGCATCGTCTGTGCCATCAACGTCATTAGTGAAGGCTTTACGATATGCTGTGACAACATCTTTTGGTAGCATTTGCTATGGGTCA
CTTTTTACAGCTGCTATTCGGACACTGCGGTGGGAGATCAGGGGAATCCGATCAAAGATTGGCAAGAACGAGTGTCTGCTTTGCTGTGTTGATTTTTTGTTTCATCTTGT
CGAGACTCTAGTTCGTTTCTTCAACAAGTATGCCTATATCCAGATAGCAGTTTATGGTAAAAGCTTCAACCGTTCAGCCAGGGACGCCTGGGAGTTATTCCAGTCAACTG
GAGTTGAAGCTCTTGTTGCCTATGATTGTTCAGGTGCTGTTCTGCTGATGAGCACCATCATGGGTGGGCTCACGGCTGGAACTTGCGCGGGCGTTTGGACATGGATTAAG
TGGAAGGATAAGGTGTCAATGGTAGCATGTACTTCAACATTGATGGGAATGGTCCTGGTAGGCCTTGCAATAGTCATCGTGGAAAGTGCCGTTACATCTATATACATATG
TTATGCAGAAGATCCCTTGTTGATTCACAGATGGGATGCTGAATTCTTCAACCAGATGTCAGAGATGCTTCACCAGCGCCTACAGCATCGGAGTGCACGAGCGAGGGAAG
TGTTAACGCAGTATCGGTCCTTCAGAATCATTCTCACTTGGACACCAACTCGAATTTCTCTCTGGCCGTGGGTGATGATCGCTTTGTTGACCCTCTACGCCCCGTTTGGT
CGCAGTTATGGAGTTGAGACTTCTGGTTTCGGGAAATTGTTCTACCTCCTCGTCGTCGTATGCTGTCAAAACTCTGTTTTGAAATCCATGGTTATGGGGATTCTCCTCTT
GGAGTTAGATTGGGATTTTATACTGAAATGGGAGCAGATAAAAAAGACGGTAATTTCTCCTCTGTGCTGTCGGCGCCACCAGAACCTCCGCCGCCACCGGCCTCATTTTC
TGCCAGTTTCAACCAACCCCGGATGGCGGCGCCGGCCAACCACCCTCCAAACCCTTGTGAACCGAAACCCCACTCCCCAGTTGCTTGGTCCACTGGTCTCTTCGACTGTT
GCAACGACATTAGCATCTGTTAAGTTCATTACCATTTCTCTTAATTTGTTCATCGCCGTAATACTCTCGCTAAAAATTTTATCTTTTTTTGCTCTTTACAGGCTGCTTGA
CTTGCTGGTGTCCCTGCATCACGTTCGGCCGCATAGCAGAGATGTGGACCGAGGAACAACATGGGACAATTCTTCTTGGAGGAGAGCCCTTGTACTGATTGCTGCGTCCA
CTGCTTCTGCGAGGAATGTGCTTTGTGCCAAGAGTACAGGGAGCTTCAGCACCAAGGCTTCGACATGTCCTTCGGTTTCTCTCCTCTCTCTTTATCTATCATTACAGAAC
AAACAAAGGTGGCATGGGAACGTGGAGAGGCAGAGACGGATTGCTGCTGCTCATGCGGTGTCGCCGCCGAGTGTGCAAGGAATGATTCGTTAA
mRNA sequence
GGTTACAGGAGTGCTTCAGCTTCTGATAAGCTTGCAGCAAACTGGCCACAGTCAATGCAAATAGTTCGGCTTATCTCACAAGATCACATGAATAACAAGCAATATGTTGG
AAAAGCTGATTTTCTTGTTTTCCGAGCTATGAATCAGCATGGGTTTCTTGGCCAGCTTCAAGAAAAGAAACTTTGTGCAGTCATCCAGTTACCATCCCAGACTTTACTGC
TATCTGTTTCTGATAAAGCTTGCCGCTTGATAGGGATGCTTTTTCCTGGGGAAGTTCCAGCCGATGACCAGAAATGGAAGTTGGAGGACCCTTCCCGCCGAGCCAAATGG
TGGTACTCCACCTTCCACACAGTCACCGCCATGATCGGCGCTGGCGTTCTCAGCTTGCCCTACGCCATGGCCTACTTGGGATGGGGTCCAGGAACGATGGTTTTGTTCCT
ATCATGGTGCATGACTCTAAACACGATGTGGCAGATGATTCAACTGCATGAATGTGCGCCCGGGACTCGTTTCGATCGATATATCGACCTTGGCCGATACGCTTTCGGCC
AAAAACTCGGCCCTTGGATCGTCCTGCCGCAGCAGCTAATCGTCCAAGTTGGCTGCGACATTGTCTACATAGTTACAGGAGGCAAGTGCATGAAGAAGTTCATGGAGATG
GCTTGTGTCAACTGCGTTCAAATCAGGCAATCCTACTGGATTGTGATCTTCGGTTCAATTCACTTTTTTCTCTCTCAACTTCCCAACTTCAATTCTGTTGCTGGGGTTTC
TCTAGCAGCTGCCATCATGTCGCTCAGTTACTCGACGATCGCGTGGGCAGGAAGCTTGAGCCGAGGGCGAATGGAAAACGTAAGCTACGCGTACAAGAAAACCAGCGTCC
AAGATTCCATGTTCAGAGTCTTCAACGCTCTCGGCCAAATCTCCTTCGCATACGCCGGCCACGCCGTCGCTCTAGAGATTCAGGCCACCATTCCCTCCACTCCGGCCAAG
CCCTCCAAAGTTCCCATGTGGAAAGGCGCCGTCGGCGCTTACATCATCAACGCCATCTGCTATTTCCCCGTCGCCTTCATCGGCTACTGGGCTTTCGGCCAAGACGTTGA
AGACAACATCCTTCTCAATCTCAAGAGACCCGCATGGCTCATCGCCTCTGCTAATCTCATGGTCGTCATCCATGTCATTGGCAGCTACCAGGTCTACGCCATGCCTGTGT
TTGACTTGTTGGAGAGGATGATGATGAAGAAATTCAACTTCCCAGAAGGATTCTGCCTCAGAATAATCACTCGATCGGCTTATGTTGGGGACAAAAACTCTCTCTGTAAA
GGGCTATACTCAAACTTTGGCGCATCAGTTGTTAAATCAGCAACAACGGCAGGAAGAGGAGGAGGGTTGTTGGAGAGGCCTGTTATTGAGAAAGCTACTCCAGGAAGAGA
ATCCGAGTTTGATTTGAGGAGTTCAAGGAAAGTGGCGCCACCATATCGGGTCATTCTGCATAACGATGACTTCAATAAGCGGGAATACGTCGTTCAAGTCCTGATGAAAG
TTATCCCCGGAATGATGCTTGACAACGCAGTTAACATCATGCAAGAAGCTCACTGCAATGGCTTGTCACTGGTGATCATTTGTGCTCAGGCTGATGCTGAAGAACACTGT
ATGCAGCTCAGAGGCAATGGGCTGTTAAGTTCAATTGAACCTGCAAGTGTAAATGACACTCGCGGCTTCGTCTTCTTCTCCGGCCTTCGCTCTCTAAGCTTAAATACAGC
GTCTCCCTCTGTTTTCGCTTCTCAACCAGTCCTTTCTTTTCTCTATCTTCCCACTTCTATGGCGAAGATTTCTCTGCTTCTTCTTATCGTTGTTTTCGCCTTTTTTTCCG
GAAATGTTCGCTCCCGGGGCTTGCCTCGGAGGGCGAAGACCTCGGTGCTCGATGTGGCGGCTTCGATTCAGAGGACTCGAGAGATTTTCGCATTTGATGCTAAATCTCCT
CTGCAAGAAGAAAGATGCGTCTCCGATTCCTCTTCGATGGCTCTGCAGTTGAATTCGAGGATTTCCATTCAGAGAACCTCGCATAGTGACTACAAATCGCTTACGTTAGC
CAGACTCGGGCGTGACTCTGCTCGAGTCAGATCTTTGACTGCCAGGATAGATCTAGCCGTTCGAGGAGCAGATCTTAAACCCTTCGGTAATGGCGATGGTTCGCAGTTTG
GCGCTGAGGATTTCGAGAGTCCGATTGTATCGGGGGCGAGTCAGGGAAGCGGTGAGTACTTCTCCCGCGTCGGAATCGGTAAACCGCCGAGTCCAGTTTACATGGTACTC
GATACCGGCAGTGATGTAAGCTGGGTACAATGCGCGCCCTGCGCCGATTGCTACGAGCAAACCGATCCAATCTTCGAACCTGCTTCGTCTACTTCTTTTCGTCTCTCTCC
TGCCAAACAGAGCAATGCAAGTTCCTACGGCGACGGCTCCTATACCGTCGGCGACTTTGTTACCGAGACCGTCACTCTTGGCTCGACTTCCCTCCGAAACATCGCCATCG
GCTGTGGCCACAACAATGAGGGTCTGTTCGTCGGCGCAGCCGGCTTGCTCGGACTCGGCGGCGGCTCGCTTTCGTTCCCTTCGCAGCTCAATGTCTCGTCCTTCTCATAC
TGTCTCGTGGACCGTGACTCTGACTCAGCTTCCACTCTTGATTTCAACTCCCCGTTGCCTCCCGACGCCGTCACTGCCCCGTTACACCGCAACCGTAATCTGAACACGTT
CTTCTACCTCGGCGTGACGGGGCTGAGCGTAGGAGGTGAGCTACTCCCGATTCCCGAGTCGTCGTTCCAAATGAGCGAGGACGGAAACGGTGGCATCATCATCGACTCCG
GCACCGCCGTGACTCGGTTGCAGACGACCACTTACAACCTCCTGCGCGACGCGTTCGTCAAGAGCACACACGACCTGCAGTCCACCCGCGGCGTCGCGTTGTTCGACACC
TGTTACGACCTGTCGTCGAAGTCGAGAGTCGAAGTGCCGACGGTGTCGTTTCACTTCCCCGACGGGAAAGAGTTGCCGTTGCCGGCCAAAAACTACCTGATACCGGTTGA
CTCCGATGGGACGTTTTGTTTCGCCTTCGCTCCTACAGATTCGGCATTGTCAATCATCGGGAACGCACAACAGCAAGGGACACTAACGGTAAGAGGCAAATATCGTAAAT
TCTTAGATGTTGTGGGGATATTACAGGCAAGAAGCAGAGACGAACGAAGCCCTAAAAGCAACCGTTTTGATCCCTCCCCCTATCGTTTTCTTAACCACTTTTCACGTTCT
CGTCAACCTCATTATGATTCAATCCGAAAGCTTCCAATTTTCCAACCCATTTCAGAAAAATGCTCGTTAGTTAGTCCTTTCACAATGGGCGACTCCGGCAGCGTCTCCTC
TTCTTCCTTTGATTCCGCCTCCGTCGCCACCGCTGATCAGGTTCAGAATGCAGTTCGGGGTCGTATGAGTGGCGACGGAGGTCAATCACTGCCGGCAGAATCATCACGTC
GTTGGCGAGATATTTTCTGGTCGATTGTGTTTATGATTCATTTGATCAGTGTGGGATTTGTGCTCGTGGTTCTCGGGCTTAATAGGTTCAAGAAAAGCAATAGGCTTCAA
ATCGATAAGTACACCAATACAATCATGGAGAACCGGGTTGGATTGACAGAGGATTACTGGCCGCTGTATGCTTTAGCAGGCGGAGTTGGAAGCTTACTTGGATGGACTTG
GTTGTTCCTACTTGGTTCTTTTGCAAATCATGTTATGAAGATTTCTGTACACATTTTGACTACGTATCTTGCTGTGATAAGTGTTTTGTGCTTCTGGGGCCAACTCTTTT
TCTGGGGTGTAGCATTTGCAATTGGTGCAGGGTTGCAGTTCCTCTATGTTATATCAGTTATAGACAGACTACCATTTACACTGCTAGTGTTGCAAAAGGCTGTAAAGATG
GTATCCGGGCTTCCTGAAGTTATTAGAGTTGCATATGCTTTCATGATTGTCATGCTTTTGTGTATGGGAATTTGGTCTTTTGGAGTAGCTGGAATTGTGGCATCAAGTAT
GGGTGACGGTGGACGCTGGTGGCTACTTGTGGTTTTTTCGATAAGCTTATTTTGGACTGGTGCTGTTTTATGTAATACTCTACATGTTATAGTTTCCGGAATGGTGTTTC
TAGTTCTAATCCATGGGGGCCAAGAGGCATCGTCTGTGCCATCAACGTCATTAGTGAAGGCTTTACGATATGCTGTGACAACATCTTTTGGTAGCATTTGCTATGGGTCA
CTTTTTACAGCTGCTATTCGGACACTGCGGTGGGAGATCAGGGGAATCCGATCAAAGATTGGCAAGAACGAGTGTCTGCTTTGCTGTGTTGATTTTTTGTTTCATCTTGT
CGAGACTCTAGTTCGTTTCTTCAACAAGTATGCCTATATCCAGATAGCAGTTTATGGTAAAAGCTTCAACCGTTCAGCCAGGGACGCCTGGGAGTTATTCCAGTCAACTG
GAGTTGAAGCTCTTGTTGCCTATGATTGTTCAGGTGCTGTTCTGCTGATGAGCACCATCATGGGTGGGCTCACGGCTGGAACTTGCGCGGGCGTTTGGACATGGATTAAG
TGGAAGGATAAGGTGTCAATGGTAGCATGTACTTCAACATTGATGGGAATGGTCCTGGTAGGCCTTGCAATAGTCATCGTGGAAAGTGCCGTTACATCTATATACATATG
TTATGCAGAAGATCCCTTGTTGATTCACAGATGGGATGCTGAATTCTTCAACCAGATGTCAGAGATGCTTCACCAGCGCCTACAGCATCGGAGTGCACGAGCGAGGGAAG
TGTTAACGCAGTATCGGTCCTTCAGAATCATTCTCACTTGGACACCAACTCGAATTTCTCTCTGGCCGTGGGTGATGATCGCTTTGTTGACCCTCTACGCCCCGTTTGGT
CGCAGTTATGGAGTTGAGACTTCTGGTTTCGGGAAATTGTTCTACCTCCTCGTCGTCGTATGCTGTCAAAACTCTGTTTTGAAATCCATGGTTATGGGGATTCTCCTCTT
GGAGTTAGATTGGGATTTTATACTGAAATGGGAGCAGATAAAAAAGACGGTAATTTCTCCTCTGTGCTGTCGGCGCCACCAGAACCTCCGCCGCCACCGGCCTCATTTTC
TGCCAGTTTCAACCAACCCCGGATGGCGGCGCCGGCCAACCACCCTCCAAACCCTTGTGAACCGAAACCCCACTCCCCAGTTGCTTGGTCCACTGGTCTCTTCGACTGTT
GCAACGACATTAGCATCTGTTAAGTTCATTACCATTTCTCTTAATTTGTTCATCGCCGTAATACTCTCGCTAAAAATTTTATCTTTTTTTGCTCTTTACAGGCTGCTTGA
CTTGCTGGTGTCCCTGCATCACGTTCGGCCGCATAGCAGAGATGTGGACCGAGGAACAACATGGGACAATTCTTCTTGGAGGAGAGCCCTTGTACTGATTGCTGCGTCCA
CTGCTTCTGCGAGGAATGTGCTTTGTGCCAAGAGTACAGGGAGCTTCAGCACCAAGGCTTCGACATGTCCTTCGGTTTCTCTCCTCTCTCTTTATCTATCATTACAGAAC
AAACAAAGGTGGCATGGGAACGTGGAGAGGCAGAGACGGATTGCTGCTGCTCATGCGGTGTCGCCGCCGAGTGTGCAAGGAATGATTCGTTAA
Protein sequence
GYRSASASDKLAANWPQSMQIVRLISQDHMNNKQYVGKADFLVFRAMNQHGFLGQLQEKKLCAVIQLPSQTLLLSVSDKACRLIGMLFPGEVPADDQKWKLEDPSRRAKW
WYSTFHTVTAMIGAGVLSLPYAMAYLGWGPGTMVLFLSWCMTLNTMWQMIQLHECAPGTRFDRYIDLGRYAFGQKLGPWIVLPQQLIVQVGCDIVYIVTGGKCMKKFMEM
ACVNCVQIRQSYWIVIFGSIHFFLSQLPNFNSVAGVSLAAAIMSLSYSTIAWAGSLSRGRMENVSYAYKKTSVQDSMFRVFNALGQISFAYAGHAVALEIQATIPSTPAK
PSKVPMWKGAVGAYIINAICYFPVAFIGYWAFGQDVEDNILLNLKRPAWLIASANLMVVIHVIGSYQVYAMPVFDLLERMMMKKFNFPEGFCLRIITRSAYVGDKNSLCK
GLYSNFGASVVKSATTAGRGGGLLERPVIEKATPGRESEFDLRSSRKVAPPYRVILHNDDFNKREYVVQVLMKVIPGMMLDNAVNIMQEAHCNGLSLVIICAQADAEEHC
MQLRGNGLLSSIEPASVNDTRGFVFFSGLRSLSLNTASPSVFASQPVLSFLYLPTSMAKISLLLLIVVFAFFSGNVRSRGLPRRAKTSVLDVAASIQRTREIFAFDAKSP
LQEERCVSDSSSMALQLNSRISIQRTSHSDYKSLTLARLGRDSARVRSLTARIDLAVRGADLKPFGNGDGSQFGAEDFESPIVSGASQGSGEYFSRVGIGKPPSPVYMVL
DTGSDVSWVQCAPCADCYEQTDPIFEPASSTSFRLSPAKQSNASSYGDGSYTVGDFVTETVTLGSTSLRNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQLNVSSFSY
CLVDRDSDSASTLDFNSPLPPDAVTAPLHRNRNLNTFFYLGVTGLSVGGELLPIPESSFQMSEDGNGGIIIDSGTAVTRLQTTTYNLLRDAFVKSTHDLQSTRGVALFDT
CYDLSSKSRVEVPTVSFHFPDGKELPLPAKNYLIPVDSDGTFCFAFAPTDSALSIIGNAQQQGTLTVRGKYRKFLDVVGILQARSRDERSPKSNRFDPSPYRFLNHFSRS
RQPHYDSIRKLPIFQPISEKCSLVSPFTMGDSGSVSSSSFDSASVATADQVQNAVRGRMSGDGGQSLPAESSRRWRDIFWSIVFMIHLISVGFVLVVLGLNRFKKSNRLQ
IDKYTNTIMENRVGLTEDYWPLYALAGGVGSLLGWTWLFLLGSFANHVMKISVHILTTYLAVISVLCFWGQLFFWGVAFAIGAGLQFLYVISVIDRLPFTLLVLQKAVKM
VSGLPEVIRVAYAFMIVMLLCMGIWSFGVAGIVASSMGDGGRWWLLVVFSISLFWTGAVLCNTLHVIVSGMVFLVLIHGGQEASSVPSTSLVKALRYAVTTSFGSICYGS
LFTAAIRTLRWEIRGIRSKIGKNECLLCCVDFLFHLVETLVRFFNKYAYIQIAVYGKSFNRSARDAWELFQSTGVEALVAYDCSGAVLLMSTIMGGLTAGTCAGVWTWIK
WKDKVSMVACTSTLMGMVLVGLAIVIVESAVTSIYICYAEDPLLIHRWDAEFFNQMSEMLHQRLQHRSARAREVLTQYRSFRIILTWTPTRISLWPWVMIALLTLYAPFG
RSYGVETSGFGKLFYLLVVVCCQNSVLKSMVMGILLLELDWDFILKWEQIKKTVISPLCCRRHQNLRRHRPHFLPVSTNPGWRRRPTTLQTLVNRNPTPQLLGPLVSSTV
ATTLASVKFITISLNLFIAVILSLKILSFFALYRLLDLLVSLHHVRPHSRDVDRGTTWDNSSWRRALVLIAASTASARNVLCAKSTGSFSTKASTCPSVSLLSLYLSLQN
KQRWHGNVERQRRIAAAHAVSPPSVQGMIR