; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg001566 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg001566
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionDUF3754 domain-containing protein
Genome locationscaffold10:585114..593201
RNA-Seq ExpressionSpg001566
SyntenySpg001566
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7017875.1 hypothetical protein SDJN02_19741, partial [Cucurbita argyrosperma subsp. argyrosperma]2.7e-16567.76Show/hide
Query:  SDRGEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDPIHGARKLEQQNLSPEETDVLEQKFLGKLFQVMEKSNFKLTTDDEIAVALSAQYRLNLPISVDE
        SDR EFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDPIHGARKLEQQNLSPEETD LEQKFLGKLFQVMEKSNFKLTTD+EIAVALS QYRLNLPISVDE
Subjt:  SDRGEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDPIHGARKLEQQNLSPEETDVLEQKFLGKLFQVMEKSNFKLTTDDEIAVALSAQYRLNLPISVDE

Query:  SKLDKKLLTKYFMDNPHDNLPYFADKYIIFRRGIGIDQMTDHFYQTKLNAIIMRVWMFFLKVSGLKRLLFDASRSRQSQVFSKQIDISTDSEDDGLYVER
        SKLD KLLT YFM+NPHDNLPYFADKYIIFRRGIGIDQM DHFY+TK+NAII R+WMFFL + GLKRLLF+ASRS QSQVFSKQIDISTDS+DDGLYVER
Subjt:  SKLDKKLLTKYFMDNPHDNLPYFADKYIIFRRGIGIDQMTDHFYQTKLNAIIMRVWMFFLKVSGLKRLLFDASRSRQSQVFSKQIDISTDSEDDGLYVER

Query:  IRVENMKLGFELYHSIIWISTLWNEITIQEPTFDRIIVVYRPANTNNEV-ERGIFVKHFKNIPMADLEIVLVARKEKSRFNSNGLGEVPCVCCNWAGWLL
        IRVENM LGF         S LWN+ITIQEPTFDRIIVVYRPA+ N EV ERGIF+KHFKNIPMADLEIVLV   EK    S GL  +         W+ 
Subjt:  IRVENMKLGFELYHSIIWISTLWNEITIQEPTFDRIIVVYRPANTNNEV-ERGIFVKHFKNIPMADLEIVLVARKEKSRFNSNGLGEVPCVCCNWAGWLL

Query:  FLPKILYGLVNTNAPLSSPLPLLGQGTVIGSLSVAKADIKVIFAILSAVGKGGDCFLLYID---------------------ETGKG-------------
        FL     GLV                TVIGSLSV KAD+KVIFAILSAV  GG C   Y+                      ++G+G             
Subjt:  FLPKILYGLVNTNAPLSSPLPLLGQGTVIGSLSVAKADIKVIFAILSAVGKGGDCFLLYID---------------------ETGKG-------------

Query:  -----------YKAGDYTKPCSALALWCEELIQAQFDQRCNFDVDDAVHKLEKLGIVVQGADGAYSCVDLRSANTIIGTTTEEIVFKAKE
                    K G  TK    L   CEELIQ QFDQ CNF+VDDAVHKLEKLGI+++ ADGAYSCVDLRSAN IIG TTEEIV KAK+
Subjt:  -----------YKAGDYTKPCSALALWCEELIQAQFDQRCNFDVDDAVHKLEKLGIVVQGADGAYSCVDLRSANTIIGTTTEEIVFKAKE

XP_022934332.1 uncharacterized protein LOC111441529 [Cucurbita moschata]1.7e-16467.55Show/hide
Query:  SDRGEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDPIHGARKLEQQNLSPEETDVLEQKFLGKLFQVMEKSNFKLTTDDEIAVALSAQYRLNLPISVDE
        SDR EFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDPIHGARKLEQQNLSPEETD LEQKFLGKLFQVMEKSNFKLTTD+EIAVALS QYRLNLPISVDE
Subjt:  SDRGEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDPIHGARKLEQQNLSPEETDVLEQKFLGKLFQVMEKSNFKLTTDDEIAVALSAQYRLNLPISVDE

Query:  SKLDKKLLTKYFMDNPHDNLPYFADKYIIFRRGIGIDQMTDHFYQTKLNAIIMRVWMFFLKVSGLKRLLFDASRSRQSQVFSKQIDISTDSEDDGLYVER
        SKLD KLLT YFM+NPHDNLPYFADKYIIFRRGIGIDQM DHFY+TK+NAII R+WMFFL + GLKRLLF+ASRS QSQVFSKQIDISTDS+DDGLYVER
Subjt:  SKLDKKLLTKYFMDNPHDNLPYFADKYIIFRRGIGIDQMTDHFYQTKLNAIIMRVWMFFLKVSGLKRLLFDASRSRQSQVFSKQIDISTDSEDDGLYVER

Query:  IRVENMKLGFELYHSIIWISTLWNEITIQEPTFDRIIVVYRPANTNNEV-ERGIFVKHFKNIPMADLEIVLVARKEKSRFNSNGLGEVPCVCCNWAGWLL
        IRVENM LGF         S LWN+ITIQEPTFDRIIVVYRPA+ N EV ERGIF+KHFKNIPMADLEIVL  +K      S GL  +         W+ 
Subjt:  IRVENMKLGFELYHSIIWISTLWNEITIQEPTFDRIIVVYRPANTNNEV-ERGIFVKHFKNIPMADLEIVLVARKEKSRFNSNGLGEVPCVCCNWAGWLL

Query:  FLPKILYGLVNTNAPLSSPLPLLGQGTVIGSLSVAKADIKVIFAILSAVGKGGDCFLLYID---------------------ETGKG-------------
        FL     GLV                TVIGSLSV KAD+KVIFAILSAV  GG C   Y+                      ++G+G             
Subjt:  FLPKILYGLVNTNAPLSSPLPLLGQGTVIGSLSVAKADIKVIFAILSAVGKGGDCFLLYID---------------------ETGKG-------------

Query:  -----------YKAGDYTKPCSALALWCEELIQAQFDQRCNFDVDDAVHKLEKLGIVVQGADGAYSCVDLRSANTIIGTTTEEIVFKAKE
                    K G  TK    L   CEELIQ QFDQ CNF+VDDAVHKLEKLGI+++ ADGAYSCVDLRSAN IIG TTEEIV KAKE
Subjt:  -----------YKAGDYTKPCSALALWCEELIQAQFDQRCNFDVDDAVHKLEKLGIVVQGADGAYSCVDLRSANTIIGTTTEEIVFKAKE

XP_022983456.1 uncharacterized protein LOC111482053 [Cucurbita maxima]7.8e-16567.76Show/hide
Query:  SDRGEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDPIHGARKLEQQNLSPEETDVLEQKFLGKLFQVMEKSNFKLTTDDEIAVALSAQYRLNLPISVDE
        SDR EFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDPIHGARKLEQQNLSPEETD LEQKFLGKLFQVMEKSNFKLTTD+EIAVALS QYRLNLPISVDE
Subjt:  SDRGEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDPIHGARKLEQQNLSPEETDVLEQKFLGKLFQVMEKSNFKLTTDDEIAVALSAQYRLNLPISVDE

Query:  SKLDKKLLTKYFMDNPHDNLPYFADKYIIFRRGIGIDQMTDHFYQTKLNAIIMRVWMFFLKVSGLKRLLFDASRSRQSQVFSKQIDISTDSEDDGLYVER
        SKLD KLLT YFM+NPHDNLPYFADKYIIFRRGIGIDQM DHFY+TK+NAII R+WMFFL + GLKRLLF+ASRS QSQVFSKQIDISTDS DDGLYVER
Subjt:  SKLDKKLLTKYFMDNPHDNLPYFADKYIIFRRGIGIDQMTDHFYQTKLNAIIMRVWMFFLKVSGLKRLLFDASRSRQSQVFSKQIDISTDSEDDGLYVER

Query:  IRVENMKLGFELYHSIIWISTLWNEITIQEPTFDRIIVVYRPANTNNEV-ERGIFVKHFKNIPMADLEIVLVARKEKSRFNSNGLGEVPCVCCNWAGWLL
        IRVENM LGF         S LWN+ITIQEPTFDRIIVVYRPA+ N EV ERGIF+KHFKNIPMADLEIVL  +K      S GL  +         WL 
Subjt:  IRVENMKLGFELYHSIIWISTLWNEITIQEPTFDRIIVVYRPANTNNEV-ERGIFVKHFKNIPMADLEIVLVARKEKSRFNSNGLGEVPCVCCNWAGWLL

Query:  FLPKILYGLVNTNAPLSSPLPLLGQGTVIGSLSVAKADIKVIFAILSAVGKGGDCFLLYID---------------------ETGKG-------------
        FL     GLV                TVIGSLSV KAD+KVIFAILSAV  GG C   Y+                      ++G+G             
Subjt:  FLPKILYGLVNTNAPLSSPLPLLGQGTVIGSLSVAKADIKVIFAILSAVGKGGDCFLLYID---------------------ETGKG-------------

Query:  -----------YKAGDYTKPCSALALWCEELIQAQFDQRCNFDVDDAVHKLEKLGIVVQGADGAYSCVDLRSANTIIGTTTEEIVFKAKE
                    K G  TK    L + CEELIQ QFDQ CNF+VDDAVHKLEKLGI+++ ADGAYSCVDLRSAN IIG TTEEIV KAKE
Subjt:  -----------YKAGDYTKPCSALALWCEELIQAQFDQRCNFDVDDAVHKLEKLGIVVQGADGAYSCVDLRSANTIIGTTTEEIVFKAKE

XP_023528251.1 uncharacterized protein LOC111791222 isoform X1 [Cucurbita pepo subsp. pepo]1.7e-16467.55Show/hide
Query:  SDRGEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDPIHGARKLEQQNLSPEETDVLEQKFLGKLFQVMEKSNFKLTTDDEIAVALSAQYRLNLPISVDE
        SDR EFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDPIHGARKLEQQNLSPEETD LEQKFLGKLFQVMEKSNFKLTTD+EIAVALS QYRLNLPISVDE
Subjt:  SDRGEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDPIHGARKLEQQNLSPEETDVLEQKFLGKLFQVMEKSNFKLTTDDEIAVALSAQYRLNLPISVDE

Query:  SKLDKKLLTKYFMDNPHDNLPYFADKYIIFRRGIGIDQMTDHFYQTKLNAIIMRVWMFFLKVSGLKRLLFDASRSRQSQVFSKQIDISTDSEDDGLYVER
        SKLD KLLT YFM+NPHDNLPYFADKYIIFRRGIGIDQM DHFY+TK+NAII R+WMFFL + GLKRLLF+ASRS QSQVFSKQIDISTDS+DDGLYVER
Subjt:  SKLDKKLLTKYFMDNPHDNLPYFADKYIIFRRGIGIDQMTDHFYQTKLNAIIMRVWMFFLKVSGLKRLLFDASRSRQSQVFSKQIDISTDSEDDGLYVER

Query:  IRVENMKLGFELYHSIIWISTLWNEITIQEPTFDRIIVVYRPANTNNEV-ERGIFVKHFKNIPMADLEIVLVARKEKSRFNSNGLGEVPCVCCNWAGWLL
        IRVENM LGF         S LWN+ITIQEPTFDRIIVVYRPA+ N EV ERGIF+KHFKNIPMADLEIVL  +K      S GL  +         W+ 
Subjt:  IRVENMKLGFELYHSIIWISTLWNEITIQEPTFDRIIVVYRPANTNNEV-ERGIFVKHFKNIPMADLEIVLVARKEKSRFNSNGLGEVPCVCCNWAGWLL

Query:  FLPKILYGLVNTNAPLSSPLPLLGQGTVIGSLSVAKADIKVIFAILSAVGKGGDCFLLYID---------------------ETGKG-------------
        FL     GLV                TVIGSLSV KAD+KVIFAILSAV  GG C   Y+                      ++G+G             
Subjt:  FLPKILYGLVNTNAPLSSPLPLLGQGTVIGSLSVAKADIKVIFAILSAVGKGGDCFLLYID---------------------ETGKG-------------

Query:  -----------YKAGDYTKPCSALALWCEELIQAQFDQRCNFDVDDAVHKLEKLGIVVQGADGAYSCVDLRSANTIIGTTTEEIVFKAKE
                    K G  TK    L   CEELIQ QFDQ CNF+VDDAVHKLEKLGI+++ ADGAYSCVDLRSAN IIG TTEEIV KAKE
Subjt:  -----------YKAGDYTKPCSALALWCEELIQAQFDQRCNFDVDDAVHKLEKLGIVVQGADGAYSCVDLRSANTIIGTTTEEIVFKAKE

XP_038892952.1 uncharacterized protein LOC120081846 [Benincasa hispida]1.4e-16665.94Show/hide
Query:  PTLLH------HTSDRGEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDPIHGARKLEQQNLSPEETDVLEQKFLGKLFQVMEKSNFKLTTDDEIAVALS
        PTL++       T DR EFL FCQRVEYSIRAWYLL FDDLLHLYSLF+PIHGARKLE++NLSPEE DV+EQKFLGKLFQVMEKSNFKLTTD+EIAVALS
Subjt:  PTLLH------HTSDRGEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDPIHGARKLEQQNLSPEETDVLEQKFLGKLFQVMEKSNFKLTTDDEIAVALS

Query:  AQYRLNLPISVDESKLDKKLLTKYFMDNPHDNLPYFADKYIIFRRGIGIDQMTDHFYQTKLNAIIMRVWMFFLKVSGLKRLLFDASRSRQSQVFSKQIDI
        AQYRLNLPISVDESKLDKKLLTKYF +NPHDNLPYFADKYIIFRRGIGIDQM D+FY+TK+NAIIMR+WMFFLKV+GLK LLF ASRSRQSQVFSKQIDI
Subjt:  AQYRLNLPISVDESKLDKKLLTKYFMDNPHDNLPYFADKYIIFRRGIGIDQMTDHFYQTKLNAIIMRVWMFFLKVSGLKRLLFDASRSRQSQVFSKQIDI

Query:  STDSEDDGLYVERIRVENMKLGFELYHSIIWISTLWNEITIQEPTFDRIIVVYRPANTNNEVERGIFVKHFKNIPMADLEIVLVARKEKSRFNSNGLGEV
        ST+SEDDGLYVERIRVENM  G         IS L N+ITIQEPTFDRIIV+YRPANT  E+ERGIFVKHFKNIPMADLEIVL  +      N+ GL  +
Subjt:  STDSEDDGLYVERIRVENMKLGFELYHSIIWISTLWNEITIQEPTFDRIIVVYRPANTNNEVERGIFVKHFKNIPMADLEIVLVARKEKSRFNSNGLGEV

Query:  PCVCCNWAGWLLFLPKILYGLVNTNAPLSSPLPLLGQGTVIGSLSVAKADIKVIFAILSAVGKGGDCFLLYID---------------------ETGKG-
                 W+ FL     GLV                TVIGSLSV KAD+KVIFAILSAV  GG C   Y+                      ++G+G 
Subjt:  PCVCCNWAGWLLFLPKILYGLVNTNAPLSSPLPLLGQGTVIGSLSVAKADIKVIFAILSAVGKGGDCFLLYID---------------------ETGKG-

Query:  -----------------------YKAGDYTKPCSALALWCEELIQAQFDQRCNFDVDDAVHKLEKLGIVVQGADGAYSCVDLRSANTIIGTTTEEIVFKA
                                K G+ TK    L L CEELI+ +FDQ CNFDVDDAVHKL+KLGI+V+GADGAYSCVDLRSAN IIG TTEEIV KA
Subjt:  -----------------------YKAGDYTKPCSALALWCEELIQAQFDQRCNFDVDDAVHKLEKLGIVVQGADGAYSCVDLRSANTIIGTTTEEIVFKA

Query:  KEGDASAT
        KEGDAS T
Subjt:  KEGDASAT

TrEMBL top hitse value%identityAlignment
A0A6J1D1Z1 uncharacterized protein LOC111016855 isoform X28.2e-16064.89Show/hide
Query:  RYLPTLLHH---TSDRGEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDPIHGARKLEQQNLSPEETDVLEQKFLGKLFQVMEKSNFKLTTDDEIAVALS
        R + TL  H    SDR EF+K CQRVEYSIRAWYLLHFDDLLHLY+LFDPIHGA KLEQQNLS EETDVLEQKFLG LFQVM+KSNF++TTDDEIAVALS
Subjt:  RYLPTLLHH---TSDRGEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDPIHGARKLEQQNLSPEETDVLEQKFLGKLFQVMEKSNFKLTTDDEIAVALS

Query:  AQYRLNLPISVDESKLDKKLLTKYFMDNPHDNLPYFADKYIIFRRGIGIDQMTDHFYQTKLNAIIMRVWMFFLKVSGLKRLL-FDASRSRQSQVFSKQID
        AQYRLNLPISVDESKLDKKLLTKYF +NPHDNLPYFADKYIIFRRGIGIDQMTDHFY TK+N IIMR+W FFLK+SGL RL+   ASRS +SQVF+KQID
Subjt:  AQYRLNLPISVDESKLDKKLLTKYFMDNPHDNLPYFADKYIIFRRGIGIDQMTDHFYQTKLNAIIMRVWMFFLKVSGLKRLL-FDASRSRQSQVFSKQID

Query:  ISTDSEDDGLYVERIRVENMKLGFELYHSIIWISTLWNEITIQEPTFDRIIVVYRPANTNNEVERGIFVKHFKNIPMADLEIVLVARKEKSRFNSNGLGE
        ISTDSEDDGLYVERIRVENMKLG         IS L +EITIQEPTFDRIIVVYRPAN N+E+ERGIFVKHFKNIPMADLEIVL  +K  S    +    
Subjt:  ISTDSEDDGLYVERIRVENMKLGFELYHSIIWISTLWNEITIQEPTFDRIIVVYRPANTNNEVERGIFVKHFKNIPMADLEIVLVARKEKSRFNSNGLGE

Query:  VPCVCCNWAGWLLFLPKILYGLVNTNAPLSSPLPLLGQGTVIGSLSVAKADIKVIFAILSAVG-------------------------------KGGDCF
                  W+ FL     GLV                TVIGSLSV  ADI+VIFAI+SAV                                 G    
Subjt:  VPCVCCNWAGWLLFLPKILYGLVNTNAPLSSPLPLLGQGTVIGSLSVAKADIKVIFAILSAVG-------------------------------KGGDCF

Query:  LLYIDETGKG------------YKAGDYTKPCSALALWCEELIQAQFDQRCNFDVDDAVHKLEKLGIVVQGADGAYSCVDLRSANTIIGTTTEEIVFKAK
        L   DE  +              K G  T     L   CEELIQ QF Q CNFDVDDAVHKLEKLGIVV+ ADGAYSCVDLRSAN IIGTTTEEI+ KAK
Subjt:  LLYIDETGKG------------YKAGDYTKPCSALALWCEELIQAQFDQRCNFDVDDAVHKLEKLGIVVQGADGAYSCVDLRSANTIIGTTTEEIVFKAK

Query:  EGDASAT
        E DASAT
Subjt:  EGDASAT

A0A6J1D2Z1 uncharacterized protein LOC111016855 isoform X11.1e-15964.76Show/hide
Query:  RYLPTLLHH----TSDRGEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDPIHGARKLEQQNLSPEETDVLEQKFLGKLFQVMEKSNFKLTTDDEIAVAL
        R + TL  H     SDR EF+K CQRVEYSIRAWYLLHFDDLLHLY+LFDPIHGA KLEQQNLS EETDVLEQKFLG LFQVM+KSNF++TTDDEIAVAL
Subjt:  RYLPTLLHH----TSDRGEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDPIHGARKLEQQNLSPEETDVLEQKFLGKLFQVMEKSNFKLTTDDEIAVAL

Query:  SAQYRLNLPISVDESKLDKKLLTKYFMDNPHDNLPYFADKYIIFRRGIGIDQMTDHFYQTKLNAIIMRVWMFFLKVSGLKRLL-FDASRSRQSQVFSKQI
        SAQYRLNLPISVDESKLDKKLLTKYF +NPHDNLPYFADKYIIFRRGIGIDQMTDHFY TK+N IIMR+W FFLK+SGL RL+   ASRS +SQVF+KQI
Subjt:  SAQYRLNLPISVDESKLDKKLLTKYFMDNPHDNLPYFADKYIIFRRGIGIDQMTDHFYQTKLNAIIMRVWMFFLKVSGLKRLL-FDASRSRQSQVFSKQI

Query:  DISTDSEDDGLYVERIRVENMKLGFELYHSIIWISTLWNEITIQEPTFDRIIVVYRPANTNNEVERGIFVKHFKNIPMADLEIVLVARKEKSRFNSNGLG
        DISTDSEDDGLYVERIRVENMKLG         IS L +EITIQEPTFDRIIVVYRPAN N+E+ERGIFVKHFKNIPMADLEIVL  +K  S    +   
Subjt:  DISTDSEDDGLYVERIRVENMKLGFELYHSIIWISTLWNEITIQEPTFDRIIVVYRPANTNNEVERGIFVKHFKNIPMADLEIVLVARKEKSRFNSNGLG

Query:  EVPCVCCNWAGWLLFLPKILYGLVNTNAPLSSPLPLLGQGTVIGSLSVAKADIKVIFAILSAVG-------------------------------KGGDC
                   W+ FL     GLV                TVIGSLSV  ADI+VIFAI+SAV                                 G   
Subjt:  EVPCVCCNWAGWLLFLPKILYGLVNTNAPLSSPLPLLGQGTVIGSLSVAKADIKVIFAILSAVG-------------------------------KGGDC

Query:  FLLYIDETGKG------------YKAGDYTKPCSALALWCEELIQAQFDQRCNFDVDDAVHKLEKLGIVVQGADGAYSCVDLRSANTIIGTTTEEIVFKA
         L   DE  +              K G  T     L   CEELIQ QF Q CNFDVDDAVHKLEKLGIVV+ ADGAYSCVDLRSAN IIGTTTEEI+ KA
Subjt:  FLLYIDETGKG------------YKAGDYTKPCSALALWCEELIQAQFDQRCNFDVDDAVHKLEKLGIVVQGADGAYSCVDLRSANTIIGTTTEEIVFKA

Query:  KEGDASAT
        KE DASAT
Subjt:  KEGDASAT

A0A6J1D4B7 uncharacterized protein LOC111016855 isoform X37.2e-15663.69Show/hide
Query:  RYLPTLLHH----TSDRGEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDPIHGARKLEQQNLSPEETDVLEQKFLGKLFQVMEKSNFKLTTDDEIAVAL
        R + TL  H     SDR EF+K CQRVEYSIRAWYLLHFDDLLHLY+LFDPIHGA KLEQQNLS EETDVLEQKFLG LFQVM+KSNF++TTDDEIAVAL
Subjt:  RYLPTLLHH----TSDRGEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDPIHGARKLEQQNLSPEETDVLEQKFLGKLFQVMEKSNFKLTTDDEIAVAL

Query:  SAQYRLNLPISVDESKLDKKLLTKYFMDNPHDNLPYFADKYIIFRRGIGIDQMTDHFYQTKLNAIIMRVWMFFLKVSGLKRLL-FDASRSRQSQVFSKQI
        SAQYRLNLPISVDESKLDKKLLTKYF +NPHDNLPYFADKYIIFRRGIGIDQMTDHFY TK+N IIMR+W FFLK+SGL RL+   ASRS +SQVF+KQI
Subjt:  SAQYRLNLPISVDESKLDKKLLTKYFMDNPHDNLPYFADKYIIFRRGIGIDQMTDHFYQTKLNAIIMRVWMFFLKVSGLKRLL-FDASRSRQSQVFSKQI

Query:  DISTDSEDDGLYVERIRVENMKLGFELYHSIIWISTLWNEITIQEPTFDRIIVVYRPANTNNEVERGIFVKHFKNIPMADLEIVLVARKEKSR-------
        DISTDSEDDGLYVERIRVENMKLG         IS L +EITIQEPTFDRIIVVYRPAN N+E+ERGIFVKHFKNIPMADLEIVL  +K  S        
Subjt:  DISTDSEDDGLYVERIRVENMKLGFELYHSIIWISTLWNEITIQEPTFDRIIVVYRPANTNNEVERGIFVKHFKNIPMADLEIVLVARKEKSR-------

Query:  -FNSNGLGEVPCVCCNWAGWLLFLPKILYGLVNTNAPLSSPLPL-------------------LGQGTVIGSL-SVAKADIKVIFAILSAVGKGGDCFLL
           S  +G V C     + W   + +  Y  +NT++PL                          G+GT++     V + ++K +      + K G   + 
Subjt:  -FNSNGLGEVPCVCCNWAGWLLFLPKILYGLVNTNAPLSSPLPL-------------------LGQGTVIGSL-SVAKADIKVIFAILSAVGKGGDCFLL

Query:  YIDETGKGYKAGDYTKPCSALALWCEELIQAQFDQRCNFDVDDAVHKLEKLGIVVQGADGAYSCVDLRSANTIIGTTTEEIVFKAKEGDASAT
         +D+                    CEELIQ QF Q CNFDVDDAVHKLEKLGIVV+ ADGAYSCVDLRSAN IIGTTTEEI+ KAKE DASAT
Subjt:  YIDETGKGYKAGDYTKPCSALALWCEELIQAQFDQRCNFDVDDAVHKLEKLGIVVQGADGAYSCVDLRSANTIIGTTTEEIVFKAKEGDASAT

A0A6J1F2F4 uncharacterized protein LOC1114415298.4e-16567.55Show/hide
Query:  SDRGEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDPIHGARKLEQQNLSPEETDVLEQKFLGKLFQVMEKSNFKLTTDDEIAVALSAQYRLNLPISVDE
        SDR EFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDPIHGARKLEQQNLSPEETD LEQKFLGKLFQVMEKSNFKLTTD+EIAVALS QYRLNLPISVDE
Subjt:  SDRGEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDPIHGARKLEQQNLSPEETDVLEQKFLGKLFQVMEKSNFKLTTDDEIAVALSAQYRLNLPISVDE

Query:  SKLDKKLLTKYFMDNPHDNLPYFADKYIIFRRGIGIDQMTDHFYQTKLNAIIMRVWMFFLKVSGLKRLLFDASRSRQSQVFSKQIDISTDSEDDGLYVER
        SKLD KLLT YFM+NPHDNLPYFADKYIIFRRGIGIDQM DHFY+TK+NAII R+WMFFL + GLKRLLF+ASRS QSQVFSKQIDISTDS+DDGLYVER
Subjt:  SKLDKKLLTKYFMDNPHDNLPYFADKYIIFRRGIGIDQMTDHFYQTKLNAIIMRVWMFFLKVSGLKRLLFDASRSRQSQVFSKQIDISTDSEDDGLYVER

Query:  IRVENMKLGFELYHSIIWISTLWNEITIQEPTFDRIIVVYRPANTNNEV-ERGIFVKHFKNIPMADLEIVLVARKEKSRFNSNGLGEVPCVCCNWAGWLL
        IRVENM LGF         S LWN+ITIQEPTFDRIIVVYRPA+ N EV ERGIF+KHFKNIPMADLEIVL  +K      S GL  +         W+ 
Subjt:  IRVENMKLGFELYHSIIWISTLWNEITIQEPTFDRIIVVYRPANTNNEV-ERGIFVKHFKNIPMADLEIVLVARKEKSRFNSNGLGEVPCVCCNWAGWLL

Query:  FLPKILYGLVNTNAPLSSPLPLLGQGTVIGSLSVAKADIKVIFAILSAVGKGGDCFLLYID---------------------ETGKG-------------
        FL     GLV                TVIGSLSV KAD+KVIFAILSAV  GG C   Y+                      ++G+G             
Subjt:  FLPKILYGLVNTNAPLSSPLPLLGQGTVIGSLSVAKADIKVIFAILSAVGKGGDCFLLYID---------------------ETGKG-------------

Query:  -----------YKAGDYTKPCSALALWCEELIQAQFDQRCNFDVDDAVHKLEKLGIVVQGADGAYSCVDLRSANTIIGTTTEEIVFKAKE
                    K G  TK    L   CEELIQ QFDQ CNF+VDDAVHKLEKLGI+++ ADGAYSCVDLRSAN IIG TTEEIV KAKE
Subjt:  -----------YKAGDYTKPCSALALWCEELIQAQFDQRCNFDVDDAVHKLEKLGIVVQGADGAYSCVDLRSANTIIGTTTEEIVFKAKE

A0A6J1J295 uncharacterized protein LOC1114820533.8e-16567.76Show/hide
Query:  SDRGEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDPIHGARKLEQQNLSPEETDVLEQKFLGKLFQVMEKSNFKLTTDDEIAVALSAQYRLNLPISVDE
        SDR EFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDPIHGARKLEQQNLSPEETD LEQKFLGKLFQVMEKSNFKLTTD+EIAVALS QYRLNLPISVDE
Subjt:  SDRGEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDPIHGARKLEQQNLSPEETDVLEQKFLGKLFQVMEKSNFKLTTDDEIAVALSAQYRLNLPISVDE

Query:  SKLDKKLLTKYFMDNPHDNLPYFADKYIIFRRGIGIDQMTDHFYQTKLNAIIMRVWMFFLKVSGLKRLLFDASRSRQSQVFSKQIDISTDSEDDGLYVER
        SKLD KLLT YFM+NPHDNLPYFADKYIIFRRGIGIDQM DHFY+TK+NAII R+WMFFL + GLKRLLF+ASRS QSQVFSKQIDISTDS DDGLYVER
Subjt:  SKLDKKLLTKYFMDNPHDNLPYFADKYIIFRRGIGIDQMTDHFYQTKLNAIIMRVWMFFLKVSGLKRLLFDASRSRQSQVFSKQIDISTDSEDDGLYVER

Query:  IRVENMKLGFELYHSIIWISTLWNEITIQEPTFDRIIVVYRPANTNNEV-ERGIFVKHFKNIPMADLEIVLVARKEKSRFNSNGLGEVPCVCCNWAGWLL
        IRVENM LGF         S LWN+ITIQEPTFDRIIVVYRPA+ N EV ERGIF+KHFKNIPMADLEIVL  +K      S GL  +         WL 
Subjt:  IRVENMKLGFELYHSIIWISTLWNEITIQEPTFDRIIVVYRPANTNNEV-ERGIFVKHFKNIPMADLEIVLVARKEKSRFNSNGLGEVPCVCCNWAGWLL

Query:  FLPKILYGLVNTNAPLSSPLPLLGQGTVIGSLSVAKADIKVIFAILSAVGKGGDCFLLYID---------------------ETGKG-------------
        FL     GLV                TVIGSLSV KAD+KVIFAILSAV  GG C   Y+                      ++G+G             
Subjt:  FLPKILYGLVNTNAPLSSPLPLLGQGTVIGSLSVAKADIKVIFAILSAVGKGGDCFLLYID---------------------ETGKG-------------

Query:  -----------YKAGDYTKPCSALALWCEELIQAQFDQRCNFDVDDAVHKLEKLGIVVQGADGAYSCVDLRSANTIIGTTTEEIVFKAKE
                    K G  TK    L + CEELIQ QFDQ CNF+VDDAVHKLEKLGI+++ ADGAYSCVDLRSAN IIG TTEEIV KAKE
Subjt:  -----------YKAGDYTKPCSALALWCEELIQAQFDQRCNFDVDDAVHKLEKLGIVVQGADGAYSCVDLRSANTIIGTTTEEIVFKAKE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G19340.1 Protein of unknown function (DUF3754)4.3e-11346.57Show/hide
Query:  LPTLLHHTSDRGEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDPIHGARKLEQQNLSPEETDVLEQKFLGKLFQVMEKSNFKLTTDDEIAVALSAQYRL
        L  L+ H++DR EFLK C+R+EY++RAWYLL F+DL+ LYSLFDP+HGA+K++QQNL+ +E DVLEQ FL  LFQVMEKSNFK+T+++E+ VA S QY L
Subjt:  LPTLLHHTSDRGEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDPIHGARKLEQQNLSPEETDVLEQKFLGKLFQVMEKSNFKLTTDDEIAVALSAQYRL

Query:  NLPISVDESKLDKKLLTKYFMDNPHDNLPYFADKYIIFRRGIGIDQMTDHFYQTKLNAIIMRVWMFFLKVSGLKRLLFDASRSRQSQVFSKQIDISTDSE
        NLPI VDESKLDKKLL +YF ++PH+N+P F+DKY+IFRRGIG+D+ TD+F+  KL+ II R W F ++++ L++L    S S   +   K  + + D++
Subjt:  NLPISVDESKLDKKLLTKYFMDNPHDNLPYFADKYIIFRRGIGIDQMTDHFYQTKLNAIIMRVWMFFLKVSGLKRLLFDASRSRQSQVFSKQIDISTDSE

Query:  DDGLYVERIRVENMKLGFELYHSIIWISTLWNEITIQEPTFDRIIVVYRPANTNNEVERGIFVKHFKNIPMADLEIVLVARKEKSRFNSNGLGEVPCVCC
        +D LYVERIR+EN KL F+ + S         ++TIQEPTFDR+IVVYR A++   +ERGI+VKHFKNIPMAD+EIVL  ++          G  P    
Subjt:  DDGLYVERIRVENMKLGFELYHSIIWISTLWNEITIQEPTFDRIIVVYRPANTNNEVERGIFVKHFKNIPMADLEIVLVARKEKSRFNSNGLGEVPCVCC

Query:  NWAGWLLFLPKILYGLVNTNAPLSSPLPLLGQGTVIGSLSVAKADIKVIFAILSAVGKGGDCFLLYID---------------------ETGKG------
            W+ FL   + GLV                 V+ S+ + K+D  VI AILS V   G C   Y                       ++G+G      
Subjt:  NWAGWLLFLPKILYGLVNTNAPLSSPLPLLGQGTVIGSLSVAKADIKVIFAILSAVGKGGDCFLLYID---------------------ETGKG------

Query:  ---------------YKAGDYTK-PCSALALWCEELIQAQFDQRCNFDVDDAVHKLEKLGIVVQGADGAYSCVDLRSANTIIGTTTEEIVFKAKEG
                       Y   +  K     L L CEELI+ +F  RCNFDV+DAV KLEKLGIV +   G Y C+ L+ AN IIGTTTEE+V KAK+G
Subjt:  ---------------YKAGDYTK-PCSALALWCEELIQAQFDQRCNFDVDDAVHKLEKLGIVVQGADGAYSCVDLRSANTIIGTTTEEIVFKAKEG

AT5G13940.1 aminopeptidases6.1e-10746.22Show/hide
Query:  DRGEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDPIHGARKLEQQNLSPEETDVLEQKFLGKLFQVMEKSNFKLTTDDEIAVALSAQYRLNLPISVDES
        +R EFL+FCQRVE +IRAWY LHF+DL+ LYSLF+P+ GA +L QQNLS  E D LE +FL  LFQVMEKSNFK+ T++EI VALSAQYRLNLPI V+E+
Subjt:  DRGEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDPIHGARKLEQQNLSPEETDVLEQKFLGKLFQVMEKSNFKLTTDDEIAVALSAQYRLNLPISVDES

Query:  KLDKKLLTKYFMDNPHDNLPYFADKYIIFRRGIGIDQMTDHFYQTKLNAIIMRVWMFFLKVSGLKRLLFDASRSRQSQVFSKQIDISTDSEDDGLYVERI
        KLD KLLT+YF   P D+LP+FADKYIIFRRG GID M  +F+  K++ I++R+W F L ++ LKRL++     +     S+QIDIS ++E D LY+ERI
Subjt:  KLDKKLLTKYFMDNPHDNLPYFADKYIIFRRGIGIDQMTDHFYQTKLNAIIMRVWMFFLKVSGLKRLLFDASRSRQSQVFSKQIDISTDSEDDGLYVERI

Query:  RVENMKLGFELYHSIIWISTLWNEITIQEPTFDRIIVVYRPANTNNEVERGIFVKHFKNIPMADLEIVLVARKEKSRFNSNGLGEVPCVCCNWAGWLLFL
        R+E +KL          +S L  +ITIQEPTF+RIIVVYR  +   E ER I+VKHFK IPMAD+EIVL  +K          G  P        W+ FL
Subjt:  RVENMKLGFELYHSIIWISTLWNEITIQEPTFDRIIVVYRPANTNNEVERGIFVKHFKNIPMADLEIVLVARKEKSRFNSNGLGEVPCVCCNWAGWLLFL

Query:  PKILYGLVNTNAPLSSPLPLLGQGTVIGSLSVAKADIKVIFAILSAVGKGGDCFLLYID---------------------ETGKG---------------
             GLV                TV+ S+S+ KADI+VI AILS V     C   Y                       ++G+G               
Subjt:  PKILYGLVNTNAPLSSPLPLLGQGTVIGSLSVAKADIKVIFAILSAVGKGGDCFLLYID---------------------ETGKG---------------

Query:  ---------YKAGDYTKPCSALALWCEELIQAQFDQRCNFDVDDAVHKLEKLGIVVQGADGAYSCVDLRSANTIIGTTTEEIVFKAKEG
                  K G  T     L +  E  I+ +F++ CNFDVDDA+ KLEKLG+V + ++  Y CV+++ AN I+GTTTEE+V KA++G
Subjt:  ---------YKAGDYTKPCSALALWCEELIQAQFDQRCNFDVDDAVHKLEKLGIVVQGADGAYSCVDLRSANTIIGTTTEEIVFKAKEG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTCTTCATCGGAACTCTTCGTCGCCACGAACCCAACGTCTCTTTGCAAGAGCTCTTCACCGCCACGAACCCAACAACGTCTCTTTGTCGAAGCTCTTCATCGCCAA
GAACCCAACGTCTCTTTGTCGGAGCACTTCGCCGCCACAAACTCTACTCATTCTCTTTTCTTTTCACAACTCTGGCGAACTTGCAAAACCGACGAAACTGACCGAAATTT
ACTATGGGTTTCTTCTACGTTCGGGGACGATTCAGGATCAGATTCGGTTGAATCTGACTCAGCATGAACTCCAAGTGGGATTGGATCAACCCCGAACTTGTGCCGGGGTC
GGCCTCGACCTAGGCCAAGGTCGAGGACGAGGCTTAGGGACAATCCCCTTGACCCTCTACTTCCTCGATGGTTTTGCTCTTGGTTGTGTTTCGGTCGTTTTGCTTTTGGA
GGCTTACAATGACCAAGAAGAAGAGGGAAGTTATACGCTTGGAGAGGGAGTCGGTTATTCCCATCCTCAAGCCCGCGCTTATCACCGCCTTGTCCAGCCATCTCGGTATC
TTCCTACTCTTCTTCATCATACTTCGGACAGGGGTGAGTTTCTTAAGTTTTGCCAGAGAGTTGAATACTCAATTCGAGCTTGGTATCTTCTGCATTTTGATGACCTTTTG
CATTTATATTCATTATTCGATCCTATACACGGGGCTCGAAAATTGGAGCAGCAAAATCTCTCGCCTGAAGAAACCGATGTTTTGGAACAAAAATTTCTGGGGAAACTGTT
TCAGGTGATGGAGAAGAGCAATTTTAAATTAACAACAGACGACGAAATCGCGGTTGCACTTTCTGCACAATATCGTCTAAACCTTCCAATCTCTGTGGATGAGTCCAAGC
TTGACAAGAAGCTTTTGACGAAATACTTCATGGACAATCCTCACGACAATCTACCATATTTTGCTGATAAGTATATAATTTTCCGCCGTGGTATTGGGATTGATCAAATG
ACCGATCACTTTTACCAAACAAAACTAAATGCCATCATTATGCGAGTATGGATGTTCTTTCTCAAAGTCTCAGGGTTAAAGAGACTTCTATTTGACGCATCAAGAAGCCG
CCAAAGTCAGGTCTTTTCAAAACAAATTGATATCAGTACAGATTCAGAGGATGATGGCTTGTATGTCGAGCGGATCCGTGTTGAGAACATGAAACTTGGGTTTGAACTCT
ACCACTCTATAATTTGGATCTCTACACTATGGAACGAGATTACGATCCAAGAACCCACGTTTGATAGAATTATCGTTGTTTACAGGCCAGCAAATACGAATAATGAAGTG
GAACGGGGTATCTTCGTGAAGCATTTCAAGAATATACCAATGGCAGATCTCGAGATCGTGCTTGTAGCCCGAAAAGAAAAATCCAGGTTTAACTCCAATGGACTGGGTGA
AGTTCCTTGTGTCTGCTGCAATTGGGCTGGTTGGTTGCTGTTTTTACCCAAAATTTTGTACGGACTTGTGAACACAAATGCACCTCTCTCAAGCCCCCTTCCCCTCCTTG
GACAGGGTACTGTTATTGGCTCGCTTAGCGTCGCGAAAGCAGATATCAAAGTCATTTTTGCTATCCTCTCTGCAGTCGGTAAAGGAGGTGATTGTTTCCTTCTATATATT
GATGAGACAGGGAAAGGCTACAAAGCAGGTGATTACACAAAACCTTGTTCCGCTTTAGCACTATGGTGCGAGGAGCTGATTCAAGCACAGTTTGATCAGAGGTGTAATTT
TGATGTGGATGACGCGGTTCACAAGTTAGAAAAGTTAGGAATCGTTGTCCAGGGTGCGGATGGGGCATATTCCTGTGTAGATTTGAGGAGTGCCAATACGATCATAGGCA
CCACCACGGAGGAGATAGTTTTCAAAGCTAAAGAGGGTGATGCCTCTGCTACTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCTCTTCATCGGAACTCTTCGTCGCCACGAACCCAACGTCTCTTTGCAAGAGCTCTTCACCGCCACGAACCCAACAACGTCTCTTTGTCGAAGCTCTTCATCGCCAA
GAACCCAACGTCTCTTTGTCGGAGCACTTCGCCGCCACAAACTCTACTCATTCTCTTTTCTTTTCACAACTCTGGCGAACTTGCAAAACCGACGAAACTGACCGAAATTT
ACTATGGGTTTCTTCTACGTTCGGGGACGATTCAGGATCAGATTCGGTTGAATCTGACTCAGCATGAACTCCAAGTGGGATTGGATCAACCCCGAACTTGTGCCGGGGTC
GGCCTCGACCTAGGCCAAGGTCGAGGACGAGGCTTAGGGACAATCCCCTTGACCCTCTACTTCCTCGATGGTTTTGCTCTTGGTTGTGTTTCGGTCGTTTTGCTTTTGGA
GGCTTACAATGACCAAGAAGAAGAGGGAAGTTATACGCTTGGAGAGGGAGTCGGTTATTCCCATCCTCAAGCCCGCGCTTATCACCGCCTTGTCCAGCCATCTCGGTATC
TTCCTACTCTTCTTCATCATACTTCGGACAGGGGTGAGTTTCTTAAGTTTTGCCAGAGAGTTGAATACTCAATTCGAGCTTGGTATCTTCTGCATTTTGATGACCTTTTG
CATTTATATTCATTATTCGATCCTATACACGGGGCTCGAAAATTGGAGCAGCAAAATCTCTCGCCTGAAGAAACCGATGTTTTGGAACAAAAATTTCTGGGGAAACTGTT
TCAGGTGATGGAGAAGAGCAATTTTAAATTAACAACAGACGACGAAATCGCGGTTGCACTTTCTGCACAATATCGTCTAAACCTTCCAATCTCTGTGGATGAGTCCAAGC
TTGACAAGAAGCTTTTGACGAAATACTTCATGGACAATCCTCACGACAATCTACCATATTTTGCTGATAAGTATATAATTTTCCGCCGTGGTATTGGGATTGATCAAATG
ACCGATCACTTTTACCAAACAAAACTAAATGCCATCATTATGCGAGTATGGATGTTCTTTCTCAAAGTCTCAGGGTTAAAGAGACTTCTATTTGACGCATCAAGAAGCCG
CCAAAGTCAGGTCTTTTCAAAACAAATTGATATCAGTACAGATTCAGAGGATGATGGCTTGTATGTCGAGCGGATCCGTGTTGAGAACATGAAACTTGGGTTTGAACTCT
ACCACTCTATAATTTGGATCTCTACACTATGGAACGAGATTACGATCCAAGAACCCACGTTTGATAGAATTATCGTTGTTTACAGGCCAGCAAATACGAATAATGAAGTG
GAACGGGGTATCTTCGTGAAGCATTTCAAGAATATACCAATGGCAGATCTCGAGATCGTGCTTGTAGCCCGAAAAGAAAAATCCAGGTTTAACTCCAATGGACTGGGTGA
AGTTCCTTGTGTCTGCTGCAATTGGGCTGGTTGGTTGCTGTTTTTACCCAAAATTTTGTACGGACTTGTGAACACAAATGCACCTCTCTCAAGCCCCCTTCCCCTCCTTG
GACAGGGTACTGTTATTGGCTCGCTTAGCGTCGCGAAAGCAGATATCAAAGTCATTTTTGCTATCCTCTCTGCAGTCGGTAAAGGAGGTGATTGTTTCCTTCTATATATT
GATGAGACAGGGAAAGGCTACAAAGCAGGTGATTACACAAAACCTTGTTCCGCTTTAGCACTATGGTGCGAGGAGCTGATTCAAGCACAGTTTGATCAGAGGTGTAATTT
TGATGTGGATGACGCGGTTCACAAGTTAGAAAAGTTAGGAATCGTTGTCCAGGGTGCGGATGGGGCATATTCCTGTGTAGATTTGAGGAGTGCCAATACGATCATAGGCA
CCACCACGGAGGAGATAGTTTTCAAAGCTAAAGAGGGTGATGCCTCTGCTACTTGA
Protein sequenceShow/hide protein sequence
MSLHRNSSSPRTQRLFARALHRHEPNNVSLSKLFIAKNPTSLCRSTSPPQTLLILFSFHNSGELAKPTKLTEIYYGFLLRSGTIQDQIRLNLTQHELQVGLDQPRTCAGV
GLDLGQGRGRGLGTIPLTLYFLDGFALGCVSVVLLLEAYNDQEEEGSYTLGEGVGYSHPQARAYHRLVQPSRYLPTLLHHTSDRGEFLKFCQRVEYSIRAWYLLHFDDLL
HLYSLFDPIHGARKLEQQNLSPEETDVLEQKFLGKLFQVMEKSNFKLTTDDEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMDNPHDNLPYFADKYIIFRRGIGIDQM
TDHFYQTKLNAIIMRVWMFFLKVSGLKRLLFDASRSRQSQVFSKQIDISTDSEDDGLYVERIRVENMKLGFELYHSIIWISTLWNEITIQEPTFDRIIVVYRPANTNNEV
ERGIFVKHFKNIPMADLEIVLVARKEKSRFNSNGLGEVPCVCCNWAGWLLFLPKILYGLVNTNAPLSSPLPLLGQGTVIGSLSVAKADIKVIFAILSAVGKGGDCFLLYI
DETGKGYKAGDYTKPCSALALWCEELIQAQFDQRCNFDVDDAVHKLEKLGIVVQGADGAYSCVDLRSANTIIGTTTEEIVFKAKEGDASAT