; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10017845 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10017845
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionChlorophyll a-b binding protein, chloroplastic
Genome locationChr03:24147347..24174009
RNA-Seq ExpressionHG10017845
SyntenyHG10017845
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR011989 - Armadillo-like helical
IPR016024 - Armadillo-type fold
IPR022796 - Chlorophyll A-B binding protein
IPR023329 - Chlorophyll a/b binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022928608.1 uncharacterized protein LOC111435467 [Cucurbita moschata]5.5e-29470.11Show/hide
Query:  MEEDEGELLLWKSDSPPQSMVSVTIGRVMVTLLAARPKKLHDAVSSLSPDHRHGASSLDSLDQSLWFLHQYVRDAVQNHDSLDEILVPMLEHTLRFKDKN
        ME+DEGELLLWKSDS PQSMVSVT+GRVM TLLAARPKKLHDAVS LSPDHRHGA SLDSLDQSLWFLH+YVRDAVQNH SLDEILVPM+EHTLRFKDKN
Subjt:  MEEDEGELLLWKSDSPPQSMVSVTIGRVMVTLLAARPKKLHDAVSSLSPDHRHGASSLDSLDQSLWFLHQYVRDAVQNHDSLDEILVPMLEHTLRFKDKN

Query:  WKRGGQVMVLLNWLFLDELIFQTLIKNLADIIVRKDDRYVALGWCILVRSLVEFESVPCELSLNGLRERFNDMLKVLCSCIPRLTCILTKGSILQEGFEL
        WKRGGQVMVLLNWLFLDEL FQ+LIKNLADIIVRKDDRYVALGWCILVRSLVEFESVP ELSLNGLRERF DMLKV  SCIPRLTCIL+KGSILQEGFEL
Subjt:  WKRGGQVMVLLNWLFLDELIFQTLIKNLADIIVRKDDRYVALGWCILVRSLVEFESVPCELSLNGLRERFNDMLKVLCSCIPRLTCILTKGSILQEGFEL

Query:  PSRLSVYAADCVVSLTNALTTKAEVQTRQKRLNASSSYQQLTFFSKAVGDQREKPISNSSKNSDSDMEYLLWHQLKDLVILVQRLLAVCFCLHVILFFCF
        PSRLSV AADCVVSLTNALT KAE QTRQKRLNASSSYQQ+T FS AV DQREKPIS+SSK+S+ DMEYLLW QLKDL+ILVQRLLA             
Subjt:  PSRLSVYAADCVVSLTNALTTKAEVQTRQKRLNASSSYQQLTFFSKAVGDQREKPISNSSKNSDSDMEYLLWHQLKDLVILVQRLLAVCFCLHVILFFCF

Query:  AWYSFDLLNVTWNLGWHAVLSRRQEFKKARIANIENTHEIREPNRSCETVQAMNETKNLEASRLECEGRWSRKSRPLHAKGLEQVLKWLHEINTHYGNFQ
                                                                             WS KSRPLHAKGLEQVLKWL EIN HYG+F+
Subjt:  AWYSFDLLNVTWNLGWHAVLSRRQEFKKARIANIENTHEIREPNRSCETVQAMNETKNLEASRLECEGRWSRKSRPLHAKGLEQVLKWLHEINTHYGNFQ

Query:  NEAGKAKSKISQTGALLLSSCWRHYSILLFLEDCRFSQHYEEWLNQYLSGIQYYSGHHTGESVGNKDGRETIIFFLNCLCLLLGRLDSKRFESTISEYGT
        NEAGK K+KISQTGALLLSSCWRHYSILLFL+DCRFSQHYEEWLNQYLSGIQYYSGHHTGESVGNKDGRETIIFFLNCLCLLLGRLDSK+FEST+SEYG+
Subjt:  NEAGKAKSKISQTGALLLSSCWRHYSILLFLEDCRFSQHYEEWLNQYLSGIQYYSGHHTGESVGNKDGRETIIFFLNCLCLLLGRLDSKRFESTISEYGT

Query:  QISQVLLLQGKGLGVGGDKEIALVPQEDTGGEVFLGKDKIISVEDYGKGIWEDLSWECDFAFNCEAIIKKENVSAEWEFHSTDEDVIDEVVSIFKAVFLN
        QISQVLLLQ                                                                     FHSTDEDVI EVVSIFKAVFLN
Subjt:  QISQVLLLQGKGLGVGGDKEIALVPQEDTGGEVFLGKDKIISVEDYGKGIWEDLSWECDFAFNCEAIIKKENVSAEWEFHSTDEDVIDEVVSIFKAVFLN

Query:  AKFSSGGNITDIRQLDVVMPLLLNLLDERDVTARAVIILIAESCLMSRDN-QFLLEVFKRFDSDSIMQRRNAIDVISEIVQMSSNTRNLLTQSAWYICTL
         K SSG +ITDIRQLDVVMP LLNLLDERDVTARAVIILIAESCLMSRDN QFLLEVFKRFDSDSI+QRRNAIDVISEIVQMSSN RNLLTQSAW     
Subjt:  AKFSSGGNITDIRQLDVVMPLLLNLLDERDVTARAVIILIAESCLMSRDN-QFLLEVFKRFDSDSIMQRRNAIDVISEIVQMSSNTRNLLTQSAWYICTL

Query:  SSFFIFPRLDIFSTIIALTDINYCRQDTANQLIKCLEDEEILIRKQAADLLPCVDPTLFLPSLVRLVYSSNDEVLASAREALIGVLKCHNQNIGAIFMLL
                                 QD A QLIKCLEDEEILIRKQAADLLPC+DP+L LPSLVRLVYSSND+ LASA EAL+GVLKCHNQNIGAIF+LL
Subjt:  SSFFIFPRLDIFSTIIALTDINYCRQDTANQLIKCLEDEEILIRKQAADLLPCVDPTLFLPSLVRLVYSSNDEVLASAREALIGVLKCHNQNIGAIFMLL

Query:  DCV
        DCV
Subjt:  DCV

XP_038882125.1 uncharacterized protein LOC120073376 isoform X1 [Benincasa hispida]2.1e-30171.2Show/hide
Query:  MEEDEGELLLWKSDSPPQSMVSVTIGRVMVTLLAARPKKLHDAVSSLSPDHRHGASSLDSLDQSLWFLHQYVRDAVQNHDSLDEILVPMLEHTLRFKDKN
        MEEDEGELLLWKSDS PQSM SVTIGRVMVTLLAARPKKLHDA+SSLSPDHRHGASSLDSLDQSLWFLHQYV DAVQNH SLDEILVP++EHTLRFKDKN
Subjt:  MEEDEGELLLWKSDSPPQSMVSVTIGRVMVTLLAARPKKLHDAVSSLSPDHRHGASSLDSLDQSLWFLHQYVRDAVQNHDSLDEILVPMLEHTLRFKDKN

Query:  WKRGGQVMVLLNWLFLDELIFQTLIKNLADIIVRKDDRYVALGWCILVRSLVEFESVPCELSLNGLRERFNDMLKVLCSCIPRLTCILTKGSILQEGFEL
        WKRGGQVMVLLNWLFLDELIFQTLIKNLADIIVRKDDRYVALGWCILVRSLVE+ESVPCEL LNGLRERFNDMLKVLCSCIPRLTCIL+KGSI+QEGFEL
Subjt:  WKRGGQVMVLLNWLFLDELIFQTLIKNLADIIVRKDDRYVALGWCILVRSLVEFESVPCELSLNGLRERFNDMLKVLCSCIPRLTCILTKGSILQEGFEL

Query:  PSRLSVYAADCVVSLTNALTTKAEVQTRQKRLNASSSYQQLTFFSKAVGDQREKPISNSSKNSDSDMEYLLWHQLKDLVILVQRLLAVCFCLHVILFFCF
        PSRLSV AADCVVSLTNALTTKAEVQTRQKRLNASSSYQQ TFFS  V DQREKPISNSSK+SDSDM+YLLWHQLKDL+ILVQ+LLA             
Subjt:  PSRLSVYAADCVVSLTNALTTKAEVQTRQKRLNASSSYQQLTFFSKAVGDQREKPISNSSKNSDSDMEYLLWHQLKDLVILVQRLLAVCFCLHVILFFCF

Query:  AWYSFDLLNVTWNLGWHAVLSRRQEFKKARIANIENTHEIREPNRSCETVQAMNETKNLEASRLECEGRWSRKSRPLHAKGLEQVLKWLHEINTHYGNFQ
                                                                             WSRKSRPLHAKGLEQVLKWLHEIN HYGNFQ
Subjt:  AWYSFDLLNVTWNLGWHAVLSRRQEFKKARIANIENTHEIREPNRSCETVQAMNETKNLEASRLECEGRWSRKSRPLHAKGLEQVLKWLHEINTHYGNFQ

Query:  NEAGKAKSKISQTGALLLSSCWRHYSILLFLEDCRFSQHYEEWLNQYLSGIQYYSGHHTGESVGNKDGRETIIFFLNCLCLLLGRLDSKRFESTISEYGT
        NEAGKAKSKI +TGALLLSSCWRHYSILLFLEDCRFSQHYEE L QYLSGIQY SGHHTGE + N+DGRETIIFFLNCLCLLLGR DSKRFESTISEYGT
Subjt:  NEAGKAKSKISQTGALLLSSCWRHYSILLFLEDCRFSQHYEEWLNQYLSGIQYYSGHHTGESVGNKDGRETIIFFLNCLCLLLGRLDSKRFESTISEYGT

Query:  QISQVLLLQGKGLGVGGDKEIALVPQEDTGGEVFLGKDKIISVEDYGKGIWEDLSWECDFAFNCEAIIKKENVSAEWEFHSTDEDVIDEVVSIFKAVFLN
        QI QVLLLQ                                                                     FHSTD DV+DEVVSIFKAVFLN
Subjt:  QISQVLLLQGKGLGVGGDKEIALVPQEDTGGEVFLGKDKIISVEDYGKGIWEDLSWECDFAFNCEAIIKKENVSAEWEFHSTDEDVIDEVVSIFKAVFLN

Query:  AKFSSGGNITDIRQLDVVMPLLLNLLDERDVTARAVIILIAESCLMSRDNQFLLEVFKRFDSDSIMQRRNAIDVISEIVQMSSNTRNLLTQSAWYICTLS
        +K SSGG+ITD RQLD+VMPLLLNLLDE DVTARAVIILIAESCLMSRDNQFLLEVFKRFDSD IM RRNAIDVISEIVQMSSNTRNLL+QSAW      
Subjt:  AKFSSGGNITDIRQLDVVMPLLLNLLDERDVTARAVIILIAESCLMSRDNQFLLEVFKRFDSDSIMQRRNAIDVISEIVQMSSNTRNLLTQSAWYICTLS

Query:  SFFIFPRLDIFSTIIALTDINYCRQDTANQLIKCLEDEEILIRKQAADLLPCVDPTLFLPSLVRLVYSSNDEVLASAREALIGVLKCHNQNIGAIFMLLD
                                QDTANQLI+CLEDEEILIRKQAADLLPCV+PTLFLPSLVRLVYSS+D+VLASAREALIGVLKCHN+N+G IFMLLD
Subjt:  SFFIFPRLDIFSTIIALTDINYCRQDTANQLIKCLEDEEILIRKQAADLLPCVDPTLFLPSLVRLVYSSNDEVLASAREALIGVLKCHNQNIGAIFMLLD

Query:  CV
        CV
Subjt:  CV

XP_038882126.1 uncharacterized protein LOC120073376 isoform X2 [Benincasa hispida]4.8e-29870.82Show/hide
Query:  MEEDEGELLLWKSDSPPQSMVSVTIGRVMVTLLAARPKKLHDAVSSLSPDHRHGASSLDSLDQSLWFLHQYVRDAVQNHDSLDEILVPMLEHTLRFKDKN
        MEEDEGELLLWKSDS PQSM SVTIGRVMVTLLAARPKKLHDA+SSLSPDHRHGASSLDSLDQSLWFLHQYV DAVQNH SLDEILVP++EHTLRFKDKN
Subjt:  MEEDEGELLLWKSDSPPQSMVSVTIGRVMVTLLAARPKKLHDAVSSLSPDHRHGASSLDSLDQSLWFLHQYVRDAVQNHDSLDEILVPMLEHTLRFKDKN

Query:  WKRGGQVMVLLNWLFLDELIFQTLIKNLADIIVRKDDRYVALGWCILVRSLVEFESVPCELSLNGLRERFNDMLKVLCSCIPRLTCILTKGSILQEGFEL
        WKRGGQVMVLLNWLFLDELIFQTLIKNLADIIVRKDDRYVALGWCILVRSLVE+ESVPCEL LNGLRERFNDMLKVLCSCIPRLTCIL+KGSI+QEGFEL
Subjt:  WKRGGQVMVLLNWLFLDELIFQTLIKNLADIIVRKDDRYVALGWCILVRSLVEFESVPCELSLNGLRERFNDMLKVLCSCIPRLTCILTKGSILQEGFEL

Query:  PSRLSVYAADCVVSLTNALTTKAEVQTRQKRLNASSSYQQLTFFSKAVGDQREKPISNSSKNSDSDMEYLLWHQLKDLVILVQRLLAVCFCLHVILFFCF
        PSRLSV AADCVVSLTNALTTKAEVQTRQKRLNASSSYQQ TFFS  V DQREKPISNSSK+SDSDM+YLLWHQLKDL+ILVQ+LLA             
Subjt:  PSRLSVYAADCVVSLTNALTTKAEVQTRQKRLNASSSYQQLTFFSKAVGDQREKPISNSSKNSDSDMEYLLWHQLKDLVILVQRLLAVCFCLHVILFFCF

Query:  AWYSFDLLNVTWNLGWHAVLSRRQEFKKARIANIENTHEIREPNRSCETVQAMNETKNLEASRLECEGRWSRKSRPLHAKGLEQVLKWLHEINTHYGNFQ
                                                                             WSRKSRPLHAKGLEQVLKWLHEIN HYGNFQ
Subjt:  AWYSFDLLNVTWNLGWHAVLSRRQEFKKARIANIENTHEIREPNRSCETVQAMNETKNLEASRLECEGRWSRKSRPLHAKGLEQVLKWLHEINTHYGNFQ

Query:  NEAGKAKSKISQTGALLLSSCWRHYSILLFLEDCRFSQHYEEWLNQYLSGIQYYSGHHTGESVGNKDGRETIIFFLNCLCLLLGRLDSKRFESTISEYGT
        NE   AKSKI +TGALLLSSCWRHYSILLFLEDCRFSQHYEE L QYLSGIQY SGHHTGE + N+DGRETIIFFLNCLCLLLGR DSKRFESTISEYGT
Subjt:  NEAGKAKSKISQTGALLLSSCWRHYSILLFLEDCRFSQHYEEWLNQYLSGIQYYSGHHTGESVGNKDGRETIIFFLNCLCLLLGRLDSKRFESTISEYGT

Query:  QISQVLLLQGKGLGVGGDKEIALVPQEDTGGEVFLGKDKIISVEDYGKGIWEDLSWECDFAFNCEAIIKKENVSAEWEFHSTDEDVIDEVVSIFKAVFLN
        QI QVLLLQ                                                                     FHSTD DV+DEVVSIFKAVFLN
Subjt:  QISQVLLLQGKGLGVGGDKEIALVPQEDTGGEVFLGKDKIISVEDYGKGIWEDLSWECDFAFNCEAIIKKENVSAEWEFHSTDEDVIDEVVSIFKAVFLN

Query:  AKFSSGGNITDIRQLDVVMPLLLNLLDERDVTARAVIILIAESCLMSRDNQFLLEVFKRFDSDSIMQRRNAIDVISEIVQMSSNTRNLLTQSAWYICTLS
        +K SSGG+ITD RQLD+VMPLLLNLLDE DVTARAVIILIAESCLMSRDNQFLLEVFKRFDSD IM RRNAIDVISEIVQMSSNTRNLL+QSAW      
Subjt:  AKFSSGGNITDIRQLDVVMPLLLNLLDERDVTARAVIILIAESCLMSRDNQFLLEVFKRFDSDSIMQRRNAIDVISEIVQMSSNTRNLLTQSAWYICTLS

Query:  SFFIFPRLDIFSTIIALTDINYCRQDTANQLIKCLEDEEILIRKQAADLLPCVDPTLFLPSLVRLVYSSNDEVLASAREALIGVLKCHNQNIGAIFMLLD
                                QDTANQLI+CLEDEEILIRKQAADLLPCV+PTLFLPSLVRLVYSS+D+VLASAREALIGVLKCHN+N+G IFMLLD
Subjt:  SFFIFPRLDIFSTIIALTDINYCRQDTANQLIKCLEDEEILIRKQAADLLPCVDPTLFLPSLVRLVYSSNDEVLASAREALIGVLKCHNQNIGAIFMLLD

Query:  CV
        CV
Subjt:  CV

XP_038882127.1 uncharacterized protein LOC120073376 isoform X3 [Benincasa hispida]2.1e-30171.2Show/hide
Query:  MEEDEGELLLWKSDSPPQSMVSVTIGRVMVTLLAARPKKLHDAVSSLSPDHRHGASSLDSLDQSLWFLHQYVRDAVQNHDSLDEILVPMLEHTLRFKDKN
        MEEDEGELLLWKSDS PQSM SVTIGRVMVTLLAARPKKLHDA+SSLSPDHRHGASSLDSLDQSLWFLHQYV DAVQNH SLDEILVP++EHTLRFKDKN
Subjt:  MEEDEGELLLWKSDSPPQSMVSVTIGRVMVTLLAARPKKLHDAVSSLSPDHRHGASSLDSLDQSLWFLHQYVRDAVQNHDSLDEILVPMLEHTLRFKDKN

Query:  WKRGGQVMVLLNWLFLDELIFQTLIKNLADIIVRKDDRYVALGWCILVRSLVEFESVPCELSLNGLRERFNDMLKVLCSCIPRLTCILTKGSILQEGFEL
        WKRGGQVMVLLNWLFLDELIFQTLIKNLADIIVRKDDRYVALGWCILVRSLVE+ESVPCEL LNGLRERFNDMLKVLCSCIPRLTCIL+KGSI+QEGFEL
Subjt:  WKRGGQVMVLLNWLFLDELIFQTLIKNLADIIVRKDDRYVALGWCILVRSLVEFESVPCELSLNGLRERFNDMLKVLCSCIPRLTCILTKGSILQEGFEL

Query:  PSRLSVYAADCVVSLTNALTTKAEVQTRQKRLNASSSYQQLTFFSKAVGDQREKPISNSSKNSDSDMEYLLWHQLKDLVILVQRLLAVCFCLHVILFFCF
        PSRLSV AADCVVSLTNALTTKAEVQTRQKRLNASSSYQQ TFFS  V DQREKPISNSSK+SDSDM+YLLWHQLKDL+ILVQ+LLA             
Subjt:  PSRLSVYAADCVVSLTNALTTKAEVQTRQKRLNASSSYQQLTFFSKAVGDQREKPISNSSKNSDSDMEYLLWHQLKDLVILVQRLLAVCFCLHVILFFCF

Query:  AWYSFDLLNVTWNLGWHAVLSRRQEFKKARIANIENTHEIREPNRSCETVQAMNETKNLEASRLECEGRWSRKSRPLHAKGLEQVLKWLHEINTHYGNFQ
                                                                             WSRKSRPLHAKGLEQVLKWLHEIN HYGNFQ
Subjt:  AWYSFDLLNVTWNLGWHAVLSRRQEFKKARIANIENTHEIREPNRSCETVQAMNETKNLEASRLECEGRWSRKSRPLHAKGLEQVLKWLHEINTHYGNFQ

Query:  NEAGKAKSKISQTGALLLSSCWRHYSILLFLEDCRFSQHYEEWLNQYLSGIQYYSGHHTGESVGNKDGRETIIFFLNCLCLLLGRLDSKRFESTISEYGT
        NEAGKAKSKI +TGALLLSSCWRHYSILLFLEDCRFSQHYEE L QYLSGIQY SGHHTGE + N+DGRETIIFFLNCLCLLLGR DSKRFESTISEYGT
Subjt:  NEAGKAKSKISQTGALLLSSCWRHYSILLFLEDCRFSQHYEEWLNQYLSGIQYYSGHHTGESVGNKDGRETIIFFLNCLCLLLGRLDSKRFESTISEYGT

Query:  QISQVLLLQGKGLGVGGDKEIALVPQEDTGGEVFLGKDKIISVEDYGKGIWEDLSWECDFAFNCEAIIKKENVSAEWEFHSTDEDVIDEVVSIFKAVFLN
        QI QVLLLQ                                                                     FHSTD DV+DEVVSIFKAVFLN
Subjt:  QISQVLLLQGKGLGVGGDKEIALVPQEDTGGEVFLGKDKIISVEDYGKGIWEDLSWECDFAFNCEAIIKKENVSAEWEFHSTDEDVIDEVVSIFKAVFLN

Query:  AKFSSGGNITDIRQLDVVMPLLLNLLDERDVTARAVIILIAESCLMSRDNQFLLEVFKRFDSDSIMQRRNAIDVISEIVQMSSNTRNLLTQSAWYICTLS
        +K SSGG+ITD RQLD+VMPLLLNLLDE DVTARAVIILIAESCLMSRDNQFLLEVFKRFDSD IM RRNAIDVISEIVQMSSNTRNLL+QSAW      
Subjt:  AKFSSGGNITDIRQLDVVMPLLLNLLDERDVTARAVIILIAESCLMSRDNQFLLEVFKRFDSDSIMQRRNAIDVISEIVQMSSNTRNLLTQSAWYICTLS

Query:  SFFIFPRLDIFSTIIALTDINYCRQDTANQLIKCLEDEEILIRKQAADLLPCVDPTLFLPSLVRLVYSSNDEVLASAREALIGVLKCHNQNIGAIFMLLD
                                QDTANQLI+CLEDEEILIRKQAADLLPCV+PTLFLPSLVRLVYSS+D+VLASAREALIGVLKCHN+N+G IFMLLD
Subjt:  SFFIFPRLDIFSTIIALTDINYCRQDTANQLIKCLEDEEILIRKQAADLLPCVDPTLFLPSLVRLVYSSNDEVLASAREALIGVLKCHNQNIGAIFMLLD

Query:  CV
        CV
Subjt:  CV

XP_038882128.1 uncharacterized protein LOC120073376 isoform X4 [Benincasa hispida]6.6e-29570.32Show/hide
Query:  MEEDEGELLLWKSDSPPQSMVSVTIGRVMVTLLAARPKKLHDAVSSLSPDHRHGASSLDSLDQSLWFLHQYVRDAVQNHDSLDEILVPMLEHTLRFKDKN
        MEEDEGELLLWKSDS PQSM SVTIGRVMVTLLAARPKKLHDA+SSLSPDHRHGASSLDSLDQSLWFLHQYV DAVQNH SLDEILVP++EHTLRFKDKN
Subjt:  MEEDEGELLLWKSDSPPQSMVSVTIGRVMVTLLAARPKKLHDAVSSLSPDHRHGASSLDSLDQSLWFLHQYVRDAVQNHDSLDEILVPMLEHTLRFKDKN

Query:  WKRGGQVMVLLNWLFLDELIFQTLIKNLADIIVRKDDRYVALGWCILVRSLVEFESVPCELSLNGLRERFNDMLKVLCSCIPRLTCILTKGSILQEGFEL
        WKRGGQVMVLLNWLFLDELIFQTLIKNLADIIVRKDDRYVALGWCILVRSLVE+ESVPCEL LNGLRERFNDMLKVLCSCIPRLTCIL+KGSI+QEGFEL
Subjt:  WKRGGQVMVLLNWLFLDELIFQTLIKNLADIIVRKDDRYVALGWCILVRSLVEFESVPCELSLNGLRERFNDMLKVLCSCIPRLTCILTKGSILQEGFEL

Query:  PSRLSVYAADCVVSLTNALTTKAEVQTRQKRLNASSSYQQLTFFSKAVGDQREKPISNSSKNSDSDMEYLLWHQLKDLVILVQRLLAVCFCLHVILFFCF
        PSRLSV AADCVVSLTNALTTKAEVQTRQKRLNASSSYQQ TFFS  V DQREKPISNSSK+SDSDM+YLLWHQLKDL+ILVQ+LLA             
Subjt:  PSRLSVYAADCVVSLTNALTTKAEVQTRQKRLNASSSYQQLTFFSKAVGDQREKPISNSSKNSDSDMEYLLWHQLKDLVILVQRLLAVCFCLHVILFFCF

Query:  AWYSFDLLNVTWNLGWHAVLSRRQEFKKARIANIENTHEIREPNRSCETVQAMNETKNLEASRLECEGRWSRKSRPLHAKGLEQVLKWLHEINTHYGNFQ
                                                                             WSRKSRPLHAKGLEQVLKWLHEIN HY    
Subjt:  AWYSFDLLNVTWNLGWHAVLSRRQEFKKARIANIENTHEIREPNRSCETVQAMNETKNLEASRLECEGRWSRKSRPLHAKGLEQVLKWLHEINTHYGNFQ

Query:  NEAGKAKSKISQTGALLLSSCWRHYSILLFLEDCRFSQHYEEWLNQYLSGIQYYSGHHTGESVGNKDGRETIIFFLNCLCLLLGRLDSKRFESTISEYGT
           GKAKSKI +TGALLLSSCWRHYSILLFLEDCRFSQHYEE L QYLSGIQY SGHHTGE + N+DGRETIIFFLNCLCLLLGR DSKRFESTISEYGT
Subjt:  NEAGKAKSKISQTGALLLSSCWRHYSILLFLEDCRFSQHYEEWLNQYLSGIQYYSGHHTGESVGNKDGRETIIFFLNCLCLLLGRLDSKRFESTISEYGT

Query:  QISQVLLLQGKGLGVGGDKEIALVPQEDTGGEVFLGKDKIISVEDYGKGIWEDLSWECDFAFNCEAIIKKENVSAEWEFHSTDEDVIDEVVSIFKAVFLN
        QI QVLLLQ                                                                     FHSTD DV+DEVVSIFKAVFLN
Subjt:  QISQVLLLQGKGLGVGGDKEIALVPQEDTGGEVFLGKDKIISVEDYGKGIWEDLSWECDFAFNCEAIIKKENVSAEWEFHSTDEDVIDEVVSIFKAVFLN

Query:  AKFSSGGNITDIRQLDVVMPLLLNLLDERDVTARAVIILIAESCLMSRDNQFLLEVFKRFDSDSIMQRRNAIDVISEIVQMSSNTRNLLTQSAWYICTLS
        +K SSGG+ITD RQLD+VMPLLLNLLDE DVTARAVIILIAESCLMSRDNQFLLEVFKRFDSD IM RRNAIDVISEIVQMSSNTRNLL+QSAW      
Subjt:  AKFSSGGNITDIRQLDVVMPLLLNLLDERDVTARAVIILIAESCLMSRDNQFLLEVFKRFDSDSIMQRRNAIDVISEIVQMSSNTRNLLTQSAWYICTLS

Query:  SFFIFPRLDIFSTIIALTDINYCRQDTANQLIKCLEDEEILIRKQAADLLPCVDPTLFLPSLVRLVYSSNDEVLASAREALIGVLKCHNQNIGAIFMLLD
                                QDTANQLI+CLEDEEILIRKQAADLLPCV+PTLFLPSLVRLVYSS+D+VLASAREALIGVLKCHN+N+G IFMLLD
Subjt:  SFFIFPRLDIFSTIIALTDINYCRQDTANQLIKCLEDEEILIRKQAADLLPCVDPTLFLPSLVRLVYSSNDEVLASAREALIGVLKCHNQNIGAIFMLLD

Query:  CV
        CV
Subjt:  CV

TrEMBL top hitse value%identityAlignment
A0A1S4DUA1 uncharacterized protein LOC103486160 isoform X45.2e-29068.45Show/hide
Query:  MEEDEGELLLWKSDSPPQSMVSVTIGRVMVTLLAARPKKLHDAVSSLSPDHRHGASSLDSLDQSLWFLHQYVRDAVQNHDSLDEILVPMLEHTLRFKDKN
        MEEDEGELLLWKSD  P+SMVSVT+GRVM TLL ARPKKLH+AVS LSPDHR GASSLDS+ QSLWFLHQYV+DAVQNH SLDEIL+PM+EH LR KDKN
Subjt:  MEEDEGELLLWKSDSPPQSMVSVTIGRVMVTLLAARPKKLHDAVSSLSPDHRHGASSLDSLDQSLWFLHQYVRDAVQNHDSLDEILVPMLEHTLRFKDKN

Query:  WKRGGQVMVLLNWLFLDELIFQTLIKNLADIIVRKDDRYVALGWCILVRSLVEFESVPCELSLNGLRERFNDMLKVLCSCIPRLTCILTKGSILQEGFEL
        WKRGGQV+VLLNWLFLDEL FQTLIKN+ADIIVRKDDRYVALGWCILVRSLVEFESVPCEL LNGLRERFNDMLKVLCSCIPRLTCIL+KGS+LQEGFEL
Subjt:  WKRGGQVMVLLNWLFLDELIFQTLIKNLADIIVRKDDRYVALGWCILVRSLVEFESVPCELSLNGLRERFNDMLKVLCSCIPRLTCILTKGSILQEGFEL

Query:  PSRLSVYAADCVVSLTNALTTKAEVQTRQKRLNASSSYQQLTFFSKAVGDQREKPISNSSKNSDSDMEYLLWHQLKDLVILVQRLLAVCFCLHVILFFCF
        PSRL+V AADC+VSLTNALT KAEVQTRQKR NA+SSYQQ+T FS  V DQREKPISN+SK+S  DMEYLLW QLKDL  LVQRLLA             
Subjt:  PSRLSVYAADCVVSLTNALTTKAEVQTRQKRLNASSSYQQLTFFSKAVGDQREKPISNSSKNSDSDMEYLLWHQLKDLVILVQRLLAVCFCLHVILFFCF

Query:  AWYSFDLLNVTWNLGWHAVLSRRQEFKKARIANIENTHEIREPNRSCETVQAMNETKNLEASRLECEGRWSRKSRPLHAKGLEQVLKWLHEINTHYGNFQ
                                                                             WS+ SRPLHAKGLEQVLKWL EIN HYGNFQ
Subjt:  AWYSFDLLNVTWNLGWHAVLSRRQEFKKARIANIENTHEIREPNRSCETVQAMNETKNLEASRLECEGRWSRKSRPLHAKGLEQVLKWLHEINTHYGNFQ

Query:  NEAGKAKSKISQTGALLLSSCWRHYSILLFLEDCRFSQHYEEWLNQYLSGIQYYSGHHTGESVGNKDGRETIIFFLNCLCLLLGRLDSKRFESTISEYGT
        +EAGK KSKI +TG+LLLSSCWRHYSILLFLED  FSQHY+EWLNQYLSGIQYYSGHHT E++GNK  RET+IFFLNCLCLLLGRLDSK+ EST+SEYGT
Subjt:  NEAGKAKSKISQTGALLLSSCWRHYSILLFLEDCRFSQHYEEWLNQYLSGIQYYSGHHTGESVGNKDGRETIIFFLNCLCLLLGRLDSKRFESTISEYGT

Query:  QISQVLLLQGKGLGVGGDKEIALVPQEDTGGEVFLGKDKIISVEDYGKGIWEDLSWECDFAFNCEAIIKKENVSAEWEFHSTDEDVIDEVVSIFKAVFLN
        QISQVLLLQ                                                                     FHSTDEDVIDEVVSIFKAVFLN
Subjt:  QISQVLLLQGKGLGVGGDKEIALVPQEDTGGEVFLGKDKIISVEDYGKGIWEDLSWECDFAFNCEAIIKKENVSAEWEFHSTDEDVIDEVVSIFKAVFLN

Query:  AKFSSGGNITDIRQLDVVMPLLLNLLDERDVTARAVIILIAESCLMSRDNQFLLEVFKRFDSDSIMQRRNAIDVISEIVQMSSNTRNLLTQSAWYICTLS
        +K SSGG+ITD RQLD VMPLLLNLLDERDVTARAVIILIAESCLMSRDNQFLLEVFKRFDSDSIMQRRNAIDVISEIVQMSSNTRNLLTQSAW      
Subjt:  AKFSSGGNITDIRQLDVVMPLLLNLLDERDVTARAVIILIAESCLMSRDNQFLLEVFKRFDSDSIMQRRNAIDVISEIVQMSSNTRNLLTQSAWYICTLS

Query:  SFFIFPRLDIFSTIIALTDINYCRQDTANQLIKCLEDEEILIRKQAADLLPCVDPTLFLPSLVRLVYSSNDEVLASAREALIGVLKCHNQNIGAIFMLLD
                                QD ANQLIKCLEDEEILIRKQAADLLPCV+P LFLPSLVRLVY SNDEVLASAREALIGVLKCHNQNIGAI MLLD
Subjt:  SFFIFPRLDIFSTIIALTDINYCRQDTANQLIKCLEDEEILIRKQAADLLPCVDPTLFLPSLVRLVYSSNDEVLASAREALIGVLKCHNQNIGAIFMLLD

Query:  CV
        CV
Subjt:  CV

A0A1S4DUB5 uncharacterized protein LOC103486160 isoform X15.2e-29068.45Show/hide
Query:  MEEDEGELLLWKSDSPPQSMVSVTIGRVMVTLLAARPKKLHDAVSSLSPDHRHGASSLDSLDQSLWFLHQYVRDAVQNHDSLDEILVPMLEHTLRFKDKN
        MEEDEGELLLWKSD  P+SMVSVT+GRVM TLL ARPKKLH+AVS LSPDHR GASSLDS+ QSLWFLHQYV+DAVQNH SLDEIL+PM+EH LR KDKN
Subjt:  MEEDEGELLLWKSDSPPQSMVSVTIGRVMVTLLAARPKKLHDAVSSLSPDHRHGASSLDSLDQSLWFLHQYVRDAVQNHDSLDEILVPMLEHTLRFKDKN

Query:  WKRGGQVMVLLNWLFLDELIFQTLIKNLADIIVRKDDRYVALGWCILVRSLVEFESVPCELSLNGLRERFNDMLKVLCSCIPRLTCILTKGSILQEGFEL
        WKRGGQV+VLLNWLFLDEL FQTLIKN+ADIIVRKDDRYVALGWCILVRSLVEFESVPCEL LNGLRERFNDMLKVLCSCIPRLTCIL+KGS+LQEGFEL
Subjt:  WKRGGQVMVLLNWLFLDELIFQTLIKNLADIIVRKDDRYVALGWCILVRSLVEFESVPCELSLNGLRERFNDMLKVLCSCIPRLTCILTKGSILQEGFEL

Query:  PSRLSVYAADCVVSLTNALTTKAEVQTRQKRLNASSSYQQLTFFSKAVGDQREKPISNSSKNSDSDMEYLLWHQLKDLVILVQRLLAVCFCLHVILFFCF
        PSRL+V AADC+VSLTNALT KAEVQTRQKR NA+SSYQQ+T FS  V DQREKPISN+SK+S  DMEYLLW QLKDL  LVQRLLA             
Subjt:  PSRLSVYAADCVVSLTNALTTKAEVQTRQKRLNASSSYQQLTFFSKAVGDQREKPISNSSKNSDSDMEYLLWHQLKDLVILVQRLLAVCFCLHVILFFCF

Query:  AWYSFDLLNVTWNLGWHAVLSRRQEFKKARIANIENTHEIREPNRSCETVQAMNETKNLEASRLECEGRWSRKSRPLHAKGLEQVLKWLHEINTHYGNFQ
                                                                             WS+ SRPLHAKGLEQVLKWL EIN HYGNFQ
Subjt:  AWYSFDLLNVTWNLGWHAVLSRRQEFKKARIANIENTHEIREPNRSCETVQAMNETKNLEASRLECEGRWSRKSRPLHAKGLEQVLKWLHEINTHYGNFQ

Query:  NEAGKAKSKISQTGALLLSSCWRHYSILLFLEDCRFSQHYEEWLNQYLSGIQYYSGHHTGESVGNKDGRETIIFFLNCLCLLLGRLDSKRFESTISEYGT
        +EAGK KSKI +TG+LLLSSCWRHYSILLFLED  FSQHY+EWLNQYLSGIQYYSGHHT E++GNK  RET+IFFLNCLCLLLGRLDSK+ EST+SEYGT
Subjt:  NEAGKAKSKISQTGALLLSSCWRHYSILLFLEDCRFSQHYEEWLNQYLSGIQYYSGHHTGESVGNKDGRETIIFFLNCLCLLLGRLDSKRFESTISEYGT

Query:  QISQVLLLQGKGLGVGGDKEIALVPQEDTGGEVFLGKDKIISVEDYGKGIWEDLSWECDFAFNCEAIIKKENVSAEWEFHSTDEDVIDEVVSIFKAVFLN
        QISQVLLLQ                                                                     FHSTDEDVIDEVVSIFKAVFLN
Subjt:  QISQVLLLQGKGLGVGGDKEIALVPQEDTGGEVFLGKDKIISVEDYGKGIWEDLSWECDFAFNCEAIIKKENVSAEWEFHSTDEDVIDEVVSIFKAVFLN

Query:  AKFSSGGNITDIRQLDVVMPLLLNLLDERDVTARAVIILIAESCLMSRDNQFLLEVFKRFDSDSIMQRRNAIDVISEIVQMSSNTRNLLTQSAWYICTLS
        +K SSGG+ITD RQLD VMPLLLNLLDERDVTARAVIILIAESCLMSRDNQFLLEVFKRFDSDSIMQRRNAIDVISEIVQMSSNTRNLLTQSAW      
Subjt:  AKFSSGGNITDIRQLDVVMPLLLNLLDERDVTARAVIILIAESCLMSRDNQFLLEVFKRFDSDSIMQRRNAIDVISEIVQMSSNTRNLLTQSAWYICTLS

Query:  SFFIFPRLDIFSTIIALTDINYCRQDTANQLIKCLEDEEILIRKQAADLLPCVDPTLFLPSLVRLVYSSNDEVLASAREALIGVLKCHNQNIGAIFMLLD
                                QD ANQLIKCLEDEEILIRKQAADLLPCV+P LFLPSLVRLVY SNDEVLASAREALIGVLKCHNQNIGAI MLLD
Subjt:  SFFIFPRLDIFSTIIALTDINYCRQDTANQLIKCLEDEEILIRKQAADLLPCVDPTLFLPSLVRLVYSSNDEVLASAREALIGVLKCHNQNIGAIFMLLD

Query:  CV
        CV
Subjt:  CV

A0A1S4DUC1 uncharacterized protein LOC103486160 isoform X75.2e-29068.45Show/hide
Query:  MEEDEGELLLWKSDSPPQSMVSVTIGRVMVTLLAARPKKLHDAVSSLSPDHRHGASSLDSLDQSLWFLHQYVRDAVQNHDSLDEILVPMLEHTLRFKDKN
        MEEDEGELLLWKSD  P+SMVSVT+GRVM TLL ARPKKLH+AVS LSPDHR GASSLDS+ QSLWFLHQYV+DAVQNH SLDEIL+PM+EH LR KDKN
Subjt:  MEEDEGELLLWKSDSPPQSMVSVTIGRVMVTLLAARPKKLHDAVSSLSPDHRHGASSLDSLDQSLWFLHQYVRDAVQNHDSLDEILVPMLEHTLRFKDKN

Query:  WKRGGQVMVLLNWLFLDELIFQTLIKNLADIIVRKDDRYVALGWCILVRSLVEFESVPCELSLNGLRERFNDMLKVLCSCIPRLTCILTKGSILQEGFEL
        WKRGGQV+VLLNWLFLDEL FQTLIKN+ADIIVRKDDRYVALGWCILVRSLVEFESVPCEL LNGLRERFNDMLKVLCSCIPRLTCIL+KGS+LQEGFEL
Subjt:  WKRGGQVMVLLNWLFLDELIFQTLIKNLADIIVRKDDRYVALGWCILVRSLVEFESVPCELSLNGLRERFNDMLKVLCSCIPRLTCILTKGSILQEGFEL

Query:  PSRLSVYAADCVVSLTNALTTKAEVQTRQKRLNASSSYQQLTFFSKAVGDQREKPISNSSKNSDSDMEYLLWHQLKDLVILVQRLLAVCFCLHVILFFCF
        PSRL+V AADC+VSLTNALT KAEVQTRQKR NA+SSYQQ+T FS  V DQREKPISN+SK+S  DMEYLLW QLKDL  LVQRLLA             
Subjt:  PSRLSVYAADCVVSLTNALTTKAEVQTRQKRLNASSSYQQLTFFSKAVGDQREKPISNSSKNSDSDMEYLLWHQLKDLVILVQRLLAVCFCLHVILFFCF

Query:  AWYSFDLLNVTWNLGWHAVLSRRQEFKKARIANIENTHEIREPNRSCETVQAMNETKNLEASRLECEGRWSRKSRPLHAKGLEQVLKWLHEINTHYGNFQ
                                                                             WS+ SRPLHAKGLEQVLKWL EIN HYGNFQ
Subjt:  AWYSFDLLNVTWNLGWHAVLSRRQEFKKARIANIENTHEIREPNRSCETVQAMNETKNLEASRLECEGRWSRKSRPLHAKGLEQVLKWLHEINTHYGNFQ

Query:  NEAGKAKSKISQTGALLLSSCWRHYSILLFLEDCRFSQHYEEWLNQYLSGIQYYSGHHTGESVGNKDGRETIIFFLNCLCLLLGRLDSKRFESTISEYGT
        +EAGK KSKI +TG+LLLSSCWRHYSILLFLED  FSQHY+EWLNQYLSGIQYYSGHHT E++GNK  RET+IFFLNCLCLLLGRLDSK+ EST+SEYGT
Subjt:  NEAGKAKSKISQTGALLLSSCWRHYSILLFLEDCRFSQHYEEWLNQYLSGIQYYSGHHTGESVGNKDGRETIIFFLNCLCLLLGRLDSKRFESTISEYGT

Query:  QISQVLLLQGKGLGVGGDKEIALVPQEDTGGEVFLGKDKIISVEDYGKGIWEDLSWECDFAFNCEAIIKKENVSAEWEFHSTDEDVIDEVVSIFKAVFLN
        QISQVLLLQ                                                                     FHSTDEDVIDEVVSIFKAVFLN
Subjt:  QISQVLLLQGKGLGVGGDKEIALVPQEDTGGEVFLGKDKIISVEDYGKGIWEDLSWECDFAFNCEAIIKKENVSAEWEFHSTDEDVIDEVVSIFKAVFLN

Query:  AKFSSGGNITDIRQLDVVMPLLLNLLDERDVTARAVIILIAESCLMSRDNQFLLEVFKRFDSDSIMQRRNAIDVISEIVQMSSNTRNLLTQSAWYICTLS
        +K SSGG+ITD RQLD VMPLLLNLLDERDVTARAVIILIAESCLMSRDNQFLLEVFKRFDSDSIMQRRNAIDVISEIVQMSSNTRNLLTQSAW      
Subjt:  AKFSSGGNITDIRQLDVVMPLLLNLLDERDVTARAVIILIAESCLMSRDNQFLLEVFKRFDSDSIMQRRNAIDVISEIVQMSSNTRNLLTQSAWYICTLS

Query:  SFFIFPRLDIFSTIIALTDINYCRQDTANQLIKCLEDEEILIRKQAADLLPCVDPTLFLPSLVRLVYSSNDEVLASAREALIGVLKCHNQNIGAIFMLLD
                                QD ANQLIKCLEDEEILIRKQAADLLPCV+P LFLPSLVRLVY SNDEVLASAREALIGVLKCHNQNIGAI MLLD
Subjt:  SFFIFPRLDIFSTIIALTDINYCRQDTANQLIKCLEDEEILIRKQAADLLPCVDPTLFLPSLVRLVYSSNDEVLASAREALIGVLKCHNQNIGAIFMLLD

Query:  CV
        CV
Subjt:  CV

A0A1S4DV28 uncharacterized protein LOC103486160 isoform X95.2e-29068.45Show/hide
Query:  MEEDEGELLLWKSDSPPQSMVSVTIGRVMVTLLAARPKKLHDAVSSLSPDHRHGASSLDSLDQSLWFLHQYVRDAVQNHDSLDEILVPMLEHTLRFKDKN
        MEEDEGELLLWKSD  P+SMVSVT+GRVM TLL ARPKKLH+AVS LSPDHR GASSLDS+ QSLWFLHQYV+DAVQNH SLDEIL+PM+EH LR KDKN
Subjt:  MEEDEGELLLWKSDSPPQSMVSVTIGRVMVTLLAARPKKLHDAVSSLSPDHRHGASSLDSLDQSLWFLHQYVRDAVQNHDSLDEILVPMLEHTLRFKDKN

Query:  WKRGGQVMVLLNWLFLDELIFQTLIKNLADIIVRKDDRYVALGWCILVRSLVEFESVPCELSLNGLRERFNDMLKVLCSCIPRLTCILTKGSILQEGFEL
        WKRGGQV+VLLNWLFLDEL FQTLIKN+ADIIVRKDDRYVALGWCILVRSLVEFESVPCEL LNGLRERFNDMLKVLCSCIPRLTCIL+KGS+LQEGFEL
Subjt:  WKRGGQVMVLLNWLFLDELIFQTLIKNLADIIVRKDDRYVALGWCILVRSLVEFESVPCELSLNGLRERFNDMLKVLCSCIPRLTCILTKGSILQEGFEL

Query:  PSRLSVYAADCVVSLTNALTTKAEVQTRQKRLNASSSYQQLTFFSKAVGDQREKPISNSSKNSDSDMEYLLWHQLKDLVILVQRLLAVCFCLHVILFFCF
        PSRL+V AADC+VSLTNALT KAEVQTRQKR NA+SSYQQ+T FS  V DQREKPISN+SK+S  DMEYLLW QLKDL  LVQRLLA             
Subjt:  PSRLSVYAADCVVSLTNALTTKAEVQTRQKRLNASSSYQQLTFFSKAVGDQREKPISNSSKNSDSDMEYLLWHQLKDLVILVQRLLAVCFCLHVILFFCF

Query:  AWYSFDLLNVTWNLGWHAVLSRRQEFKKARIANIENTHEIREPNRSCETVQAMNETKNLEASRLECEGRWSRKSRPLHAKGLEQVLKWLHEINTHYGNFQ
                                                                             WS+ SRPLHAKGLEQVLKWL EIN HYGNFQ
Subjt:  AWYSFDLLNVTWNLGWHAVLSRRQEFKKARIANIENTHEIREPNRSCETVQAMNETKNLEASRLECEGRWSRKSRPLHAKGLEQVLKWLHEINTHYGNFQ

Query:  NEAGKAKSKISQTGALLLSSCWRHYSILLFLEDCRFSQHYEEWLNQYLSGIQYYSGHHTGESVGNKDGRETIIFFLNCLCLLLGRLDSKRFESTISEYGT
        +EAGK KSKI +TG+LLLSSCWRHYSILLFLED  FSQHY+EWLNQYLSGIQYYSGHHT E++GNK  RET+IFFLNCLCLLLGRLDSK+ EST+SEYGT
Subjt:  NEAGKAKSKISQTGALLLSSCWRHYSILLFLEDCRFSQHYEEWLNQYLSGIQYYSGHHTGESVGNKDGRETIIFFLNCLCLLLGRLDSKRFESTISEYGT

Query:  QISQVLLLQGKGLGVGGDKEIALVPQEDTGGEVFLGKDKIISVEDYGKGIWEDLSWECDFAFNCEAIIKKENVSAEWEFHSTDEDVIDEVVSIFKAVFLN
        QISQVLLLQ                                                                     FHSTDEDVIDEVVSIFKAVFLN
Subjt:  QISQVLLLQGKGLGVGGDKEIALVPQEDTGGEVFLGKDKIISVEDYGKGIWEDLSWECDFAFNCEAIIKKENVSAEWEFHSTDEDVIDEVVSIFKAVFLN

Query:  AKFSSGGNITDIRQLDVVMPLLLNLLDERDVTARAVIILIAESCLMSRDNQFLLEVFKRFDSDSIMQRRNAIDVISEIVQMSSNTRNLLTQSAWYICTLS
        +K SSGG+ITD RQLD VMPLLLNLLDERDVTARAVIILIAESCLMSRDNQFLLEVFKRFDSDSIMQRRNAIDVISEIVQMSSNTRNLLTQSAW      
Subjt:  AKFSSGGNITDIRQLDVVMPLLLNLLDERDVTARAVIILIAESCLMSRDNQFLLEVFKRFDSDSIMQRRNAIDVISEIVQMSSNTRNLLTQSAWYICTLS

Query:  SFFIFPRLDIFSTIIALTDINYCRQDTANQLIKCLEDEEILIRKQAADLLPCVDPTLFLPSLVRLVYSSNDEVLASAREALIGVLKCHNQNIGAIFMLLD
                                QD ANQLIKCLEDEEILIRKQAADLLPCV+P LFLPSLVRLVY SNDEVLASAREALIGVLKCHNQNIGAI MLLD
Subjt:  SFFIFPRLDIFSTIIALTDINYCRQDTANQLIKCLEDEEILIRKQAADLLPCVDPTLFLPSLVRLVYSSNDEVLASAREALIGVLKCHNQNIGAIFMLLD

Query:  CV
        CV
Subjt:  CV

A0A6J1EPJ3 uncharacterized protein LOC1114354672.7e-29470.11Show/hide
Query:  MEEDEGELLLWKSDSPPQSMVSVTIGRVMVTLLAARPKKLHDAVSSLSPDHRHGASSLDSLDQSLWFLHQYVRDAVQNHDSLDEILVPMLEHTLRFKDKN
        ME+DEGELLLWKSDS PQSMVSVT+GRVM TLLAARPKKLHDAVS LSPDHRHGA SLDSLDQSLWFLH+YVRDAVQNH SLDEILVPM+EHTLRFKDKN
Subjt:  MEEDEGELLLWKSDSPPQSMVSVTIGRVMVTLLAARPKKLHDAVSSLSPDHRHGASSLDSLDQSLWFLHQYVRDAVQNHDSLDEILVPMLEHTLRFKDKN

Query:  WKRGGQVMVLLNWLFLDELIFQTLIKNLADIIVRKDDRYVALGWCILVRSLVEFESVPCELSLNGLRERFNDMLKVLCSCIPRLTCILTKGSILQEGFEL
        WKRGGQVMVLLNWLFLDEL FQ+LIKNLADIIVRKDDRYVALGWCILVRSLVEFESVP ELSLNGLRERF DMLKV  SCIPRLTCIL+KGSILQEGFEL
Subjt:  WKRGGQVMVLLNWLFLDELIFQTLIKNLADIIVRKDDRYVALGWCILVRSLVEFESVPCELSLNGLRERFNDMLKVLCSCIPRLTCILTKGSILQEGFEL

Query:  PSRLSVYAADCVVSLTNALTTKAEVQTRQKRLNASSSYQQLTFFSKAVGDQREKPISNSSKNSDSDMEYLLWHQLKDLVILVQRLLAVCFCLHVILFFCF
        PSRLSV AADCVVSLTNALT KAE QTRQKRLNASSSYQQ+T FS AV DQREKPIS+SSK+S+ DMEYLLW QLKDL+ILVQRLLA             
Subjt:  PSRLSVYAADCVVSLTNALTTKAEVQTRQKRLNASSSYQQLTFFSKAVGDQREKPISNSSKNSDSDMEYLLWHQLKDLVILVQRLLAVCFCLHVILFFCF

Query:  AWYSFDLLNVTWNLGWHAVLSRRQEFKKARIANIENTHEIREPNRSCETVQAMNETKNLEASRLECEGRWSRKSRPLHAKGLEQVLKWLHEINTHYGNFQ
                                                                             WS KSRPLHAKGLEQVLKWL EIN HYG+F+
Subjt:  AWYSFDLLNVTWNLGWHAVLSRRQEFKKARIANIENTHEIREPNRSCETVQAMNETKNLEASRLECEGRWSRKSRPLHAKGLEQVLKWLHEINTHYGNFQ

Query:  NEAGKAKSKISQTGALLLSSCWRHYSILLFLEDCRFSQHYEEWLNQYLSGIQYYSGHHTGESVGNKDGRETIIFFLNCLCLLLGRLDSKRFESTISEYGT
        NEAGK K+KISQTGALLLSSCWRHYSILLFL+DCRFSQHYEEWLNQYLSGIQYYSGHHTGESVGNKDGRETIIFFLNCLCLLLGRLDSK+FEST+SEYG+
Subjt:  NEAGKAKSKISQTGALLLSSCWRHYSILLFLEDCRFSQHYEEWLNQYLSGIQYYSGHHTGESVGNKDGRETIIFFLNCLCLLLGRLDSKRFESTISEYGT

Query:  QISQVLLLQGKGLGVGGDKEIALVPQEDTGGEVFLGKDKIISVEDYGKGIWEDLSWECDFAFNCEAIIKKENVSAEWEFHSTDEDVIDEVVSIFKAVFLN
        QISQVLLLQ                                                                     FHSTDEDVI EVVSIFKAVFLN
Subjt:  QISQVLLLQGKGLGVGGDKEIALVPQEDTGGEVFLGKDKIISVEDYGKGIWEDLSWECDFAFNCEAIIKKENVSAEWEFHSTDEDVIDEVVSIFKAVFLN

Query:  AKFSSGGNITDIRQLDVVMPLLLNLLDERDVTARAVIILIAESCLMSRDN-QFLLEVFKRFDSDSIMQRRNAIDVISEIVQMSSNTRNLLTQSAWYICTL
         K SSG +ITDIRQLDVVMP LLNLLDERDVTARAVIILIAESCLMSRDN QFLLEVFKRFDSDSI+QRRNAIDVISEIVQMSSN RNLLTQSAW     
Subjt:  AKFSSGGNITDIRQLDVVMPLLLNLLDERDVTARAVIILIAESCLMSRDN-QFLLEVFKRFDSDSIMQRRNAIDVISEIVQMSSNTRNLLTQSAWYICTL

Query:  SSFFIFPRLDIFSTIIALTDINYCRQDTANQLIKCLEDEEILIRKQAADLLPCVDPTLFLPSLVRLVYSSNDEVLASAREALIGVLKCHNQNIGAIFMLL
                                 QD A QLIKCLEDEEILIRKQAADLLPC+DP+L LPSLVRLVYSSND+ LASA EAL+GVLKCHNQNIGAIF+LL
Subjt:  SSFFIFPRLDIFSTIIALTDINYCRQDTANQLIKCLEDEEILIRKQAADLLPCVDPTLFLPSLVRLVYSSNDEVLASAREALIGVLKCHNQNIGAIFMLL

Query:  DCV
        DCV
Subjt:  DCV

SwissProt top hitse value%identityAlignment
P07371 Chlorophyll a-b binding protein AB80, chloroplastic3.7e-9996.07Show/hide
Query:  DPETFAKNRELEVIHSRWAMLGALGCVFPELLSRNGVKFGEAVWFKAGSQIFSEGGLDYLGNPSLIHAQSILAIWATQVVLMGAVEGYRIAGGPLGEITD
        DPETF+KNRELEVIHSRWAMLGALGCVFPELLSRNGVKFGEAVWFKAGSQIFSEGGLDYLGNPSL+HAQSILAIWATQV+LMGAVEGYRIAGGPLGE+ D
Subjt:  DPETFAKNRELEVIHSRWAMLGALGCVFPELLSRNGVKFGEAVWFKAGSQIFSEGGLDYLGNPSLIHAQSILAIWATQVVLMGAVEGYRIAGGPLGEITD

Query:  PIYPGGSFDPLGLADDPEAFAELKVKELKNGRLAMFSMFGFFVQAIVTGKGPLENLADHLADPVNNNAWAYATNFVPG
        P+YPGGSFDPLGLADDPEAFAELKVKELKNGRLAMFSMFGFFVQAIVTGKGPLENLADHLADPVNNNAW+YATNFVPG
Subjt:  PIYPGGSFDPLGLADDPEAFAELKVKELKNGRLAMFSMFGFFVQAIVTGKGPLENLADHLADPVNNNAWAYATNFVPG

P08221 Chlorophyll a-b binding protein of LHCII type I, chloroplastic (Fragment)4.4e-10098.31Show/hide
Query:  DPETFAKNRELEVIHSRWAMLGALGCVFPELLSRNGVKFGEAVWFKAGSQIFSEGGLDYLGNPSLIHAQSILAIWATQVVLMGAVEGYRIAGGPLGEITD
        DPETFAKNRELEVIHSRWAMLGALGCVFPELLSRNGVKFGEAVWFKAGSQIFSEGGLDYLGNPSL+HAQSILAIWA QVVLMGAVEGYRIAGGPLGE+TD
Subjt:  DPETFAKNRELEVIHSRWAMLGALGCVFPELLSRNGVKFGEAVWFKAGSQIFSEGGLDYLGNPSLIHAQSILAIWATQVVLMGAVEGYRIAGGPLGEITD

Query:  PIYPGGSFDPLGLADDPEAFAELKVKELKNGRLAMFSMFGFFVQAIVTGKGPLENLADHLADPVNNNAWAYATNFVPG
        PIYPGGSFDPLGLADDPEAFAELKVKELKNGRLAMFSMFGFFVQAIVTGKGPLENLADHLADPVNNNAWAYATNFVPG
Subjt:  PIYPGGSFDPLGLADDPEAFAELKVKELKNGRLAMFSMFGFFVQAIVTGKGPLENLADHLADPVNNNAWAYATNFVPG

P08222 Chlorophyll a-b binding protein of LHCII type 1 (Fragment)2.2e-9997.75Show/hide
Query:  DPETFAKNRELEVIHSRWAMLGALGCVFPELLSRNGVKFGEAVWFKAGSQIFSEGGLDYLGNPSLIHAQSILAIWATQVVLMGAVEGYRIAGGPLGEITD
        DPETFAKNRELEVIHS WAMLGALGCVFPELLSRNGVKFGEAVWFKAGSQIFSEGGLDYLGNPSL+HAQSILAIWA QVVLMGAVEGYRIAGGPLGE+TD
Subjt:  DPETFAKNRELEVIHSRWAMLGALGCVFPELLSRNGVKFGEAVWFKAGSQIFSEGGLDYLGNPSLIHAQSILAIWATQVVLMGAVEGYRIAGGPLGEITD

Query:  PIYPGGSFDPLGLADDPEAFAELKVKELKNGRLAMFSMFGFFVQAIVTGKGPLENLADHLADPVNNNAWAYATNFVPG
        PIYPGGSFDPLGLADDPEAFAELKVKELKNGRLAMFSMFGFFVQAIVTGKGPLENLADHLADPVNNNAWAYATNFVPG
Subjt:  PIYPGGSFDPLGLADDPEAFAELKVKELKNGRLAMFSMFGFFVQAIVTGKGPLENLADHLADPVNNNAWAYATNFVPG

P09756 Chlorophyll a-b binding protein 3, chloroplastic5.7e-10098.31Show/hide
Query:  DPETFAKNRELEVIHSRWAMLGALGCVFPELLSRNGVKFGEAVWFKAGSQIFSEGGLDYLGNPSLIHAQSILAIWATQVVLMGAVEGYRIAGGPLGEITD
        DPETFAKNRELEVIHSRWAMLGALGCVFPELLSRNGVKFGEAVWFKAGSQIFSEGGLDYLGNPSLIHAQSILAIWATQV+LMGAVEGYRIAGGPLGE+TD
Subjt:  DPETFAKNRELEVIHSRWAMLGALGCVFPELLSRNGVKFGEAVWFKAGSQIFSEGGLDYLGNPSLIHAQSILAIWATQVVLMGAVEGYRIAGGPLGEITD

Query:  PIYPGGSFDPLGLADDPEAFAELKVKELKNGRLAMFSMFGFFVQAIVTGKGPLENLADHLADPVNNNAWAYATNFVPG
        PIYPGGSFDPLGLADDPEAFAELKVKELKNGRLAMFSMFGFFVQAIVTGKGPLENLADHLA PVNNNAWAYATNFVPG
Subjt:  PIYPGGSFDPLGLADDPEAFAELKVKELKNGRLAMFSMFGFFVQAIVTGKGPLENLADHLADPVNNNAWAYATNFVPG

P12329 Chlorophyll a-b binding protein 1, chloroplastic9.7e-10097.21Show/hide
Query:  DPETFAKNRELEVIHSRWAMLGALGCVFPELLSRNGVKFGEAVWFKAGSQIFSEGGLDYLGNPSLIHAQSILAIWATQVVLMGAVEGYRIAGGPLGEITD
        DPETFAKNRELEVIHSRWAMLGALGCVFPELLSRNGVKFGEAVWFKAGSQIFSEGGLDYLGNPSLIHAQSILAIWA QVVLMGAVEGYRIAGGPLGE+ D
Subjt:  DPETFAKNRELEVIHSRWAMLGALGCVFPELLSRNGVKFGEAVWFKAGSQIFSEGGLDYLGNPSLIHAQSILAIWATQVVLMGAVEGYRIAGGPLGEITD

Query:  PIYPGGSFDPLGLADDPEAFAELKVKELKNGRLAMFSMFGFFVQAIVTGKGPLENLADHLADPVNNNAWAYATNFVPGN
        P+YPGGSFDPLGLADDPEAFAELKVKELKNGRLAMFSMFGFFVQAIVTGKGPLENLADH+ADPVNNNAWAYATNFVPGN
Subjt:  PIYPGGSFDPLGLADDPEAFAELKVKELKNGRLAMFSMFGFFVQAIVTGKGPLENLADHLADPVNNNAWAYATNFVPGN

Arabidopsis top hitse value%identityAlignment
AT1G29910.1 chlorophyll A/B binding protein 33.3e-9592.18Show/hide
Query:  DPETFAKNRELEVIHSRWAMLGALGCVFPELLSRNGVKFGEAVWFKAGSQIFSEGGLDYLGNPSLIHAQSILAIWATQVVLMGAVEGYRIAG-GPLGEIT
        DPETFA+NRELEVIHSRWAMLGALGCVFPELL+RNGVKFGEAVWFKAGSQIFS+GGLDYLGNPSL+HAQSILAIWATQV+LMGAVEGYR+AG GPLGE  
Subjt:  DPETFAKNRELEVIHSRWAMLGALGCVFPELLSRNGVKFGEAVWFKAGSQIFSEGGLDYLGNPSLIHAQSILAIWATQVVLMGAVEGYRIAG-GPLGEIT

Query:  DPIYPGGSFDPLGLADDPEAFAELKVKELKNGRLAMFSMFGFFVQAIVTGKGPLENLADHLADPVNNNAWAYATNFVPG
        D +YPGGSFDPLGLA DPEAFAELKVKELKNGRLAMFSMFGFFVQAIVTGKGP+ENLADHLADPVNNNAWA+ATNFVPG
Subjt:  DPIYPGGSFDPLGLADDPEAFAELKVKELKNGRLAMFSMFGFFVQAIVTGKGPLENLADHLADPVNNNAWAYATNFVPG

AT2G34420.1 photosystem II light harvesting complex gene B1B21.9e-9592.74Show/hide
Query:  DPETFAKNRELEVIHSRWAMLGALGCVFPELLSRNGVKFGEAVWFKAGSQIFSEGGLDYLGNPSLIHAQSILAIWATQVVLMGAVEGYRIAG-GPLGEIT
        DPETFA+NRELEVIHSRWAMLGALGCVFPELL+RNGVKFGEAVWFKAGSQIFS+GGLDYLGNPSL+HAQSILAIWATQV+LMGAVEGYR+AG GPLGE  
Subjt:  DPETFAKNRELEVIHSRWAMLGALGCVFPELLSRNGVKFGEAVWFKAGSQIFSEGGLDYLGNPSLIHAQSILAIWATQVVLMGAVEGYRIAG-GPLGEIT

Query:  DPIYPGGSFDPLGLADDPEAFAELKVKELKNGRLAMFSMFGFFVQAIVTGKGPLENLADHLADPVNNNAWAYATNFVPG
        D +YPGGSFDPLGLA DPEAFAELKVKELKNGRLAMFSMFGFFVQAIVTGKGPLENLADHLADPVNNNAWA+ATNFVPG
Subjt:  DPIYPGGSFDPLGLADDPEAFAELKVKELKNGRLAMFSMFGFFVQAIVTGKGPLENLADHLADPVNNNAWAYATNFVPG

AT2G34430.1 light-harvesting chlorophyll-protein complex II subunit B11.9e-9592.74Show/hide
Query:  DPETFAKNRELEVIHSRWAMLGALGCVFPELLSRNGVKFGEAVWFKAGSQIFSEGGLDYLGNPSLIHAQSILAIWATQVVLMGAVEGYRIAG-GPLGEIT
        DPETFA+NRELEVIHSRWAMLGALGCVFPELL+RNGVKFGEAVWFKAGSQIFS+GGLDYLGNPSL+HAQSILAIWATQV+LMGAVEGYR+AG GPLGE  
Subjt:  DPETFAKNRELEVIHSRWAMLGALGCVFPELLSRNGVKFGEAVWFKAGSQIFSEGGLDYLGNPSLIHAQSILAIWATQVVLMGAVEGYRIAG-GPLGEIT

Query:  DPIYPGGSFDPLGLADDPEAFAELKVKELKNGRLAMFSMFGFFVQAIVTGKGPLENLADHLADPVNNNAWAYATNFVPG
        D +YPGGSFDPLGLA DPEAFAELKVKELKNGRLAMFSMFGFFVQAIVTGKGPLENLADHLADPVNNNAWA+ATNFVPG
Subjt:  DPIYPGGSFDPLGLADDPEAFAELKVKELKNGRLAMFSMFGFFVQAIVTGKGPLENLADHLADPVNNNAWAYATNFVPG

AT3G57570.1 ARM repeat superfamily protein5.1e-12836.13Show/hide
Query:  SPPQSMVSVTIGRVMVTLLAARPKKLHDAVSSLSPDHRHGASSLDSLDQSLWFLHQYVRDAVQNHDSLDEILVPMLEHTLRFKDKNWKRGGQVMVLLNWL
        S P+S+VSVT+ R M TLL+ARPKKL +++S L+PD + G S   S+D++LWFL + V DA +  +++ EILVP++EHTLRFKD   K G   M+LLNWL
Subjt:  SPPQSMVSVTIGRVMVTLLAARPKKLHDAVSSLSPDHRHGASSLDSLDQSLWFLHQYVRDAVQNHDSLDEILVPMLEHTLRFKDKNWKRGGQVMVLLNWL

Query:  FLDELIFQTLIKNLADIIVRKDDRYVALGWCILVRSLVEFESVPCELSLNGLRERFNDMLKVLCSCIPRLTCILTKGSILQEGFELPSRLSVYAADCVVS
        F DE++FQ + +NL++II+R +DR++ALGWC+L+R LVE E    +   +G+RE+ +  ++++ SC+P L  I+  GSILQ+G+E+PSRLS+ AADC++S
Subjt:  FLDELIFQTLIKNLADIIVRKDDRYVALGWCILVRSLVEFESVPCELSLNGLRERFNDMLKVLCSCIPRLTCILTKGSILQEGFELPSRLSVYAADCVVS

Query:  LTNALTTKAEVQTRQKRLNASSSYQQLTFFSKAVGDQREKPISNSSKNSDSDMEYLLWHQLKDLVILVQRLLAVCFCLHVILFFCFAWYSFDLLNVTWNL
        +T AL  +      + +    +   Q    +  + +++++P S   ++S+ +   +LW+ ++DL  LVQ L A                           
Subjt:  LTNALTTKAEVQTRQKRLNASSSYQQLTFFSKAVGDQREKPISNSSKNSDSDMEYLLWHQLKDLVILVQRLLAVCFCLHVILFFCFAWYSFDLLNVTWNL

Query:  GWHAVLSRRQEFKKARIANIENTHEIREPNRSCETVQAMNETKNLEASRLECEGRWSRKSRPLHAKGLEQVLKWLHEINTHYGNFQNEAGKAKSKISQTG
                                                               W+RK+R LHAKGL QVLKWL E+  H+G  Q EAG   +++S  G
Subjt:  GWHAVLSRRQEFKKARIANIENTHEIREPNRSCETVQAMNETKNLEASRLECEGRWSRKSRPLHAKGLEQVLKWLHEINTHYGNFQNEAGKAKSKISQTG

Query:  ALLLSSCWRHYSILLFLEDCRFSQHYEEWLNQYLSGIQYYS-GHHTGESVGNKDGRETIIFFLNCLCLLLGRLDSKRFESTISEYGTQISQVLLLQGKGL
        ALLLSSCW+HYS+LL +ED +FS+  +E L QYLSGI+YYS  +  G S     G ET  FFLNCLCLLLGR + K+FES +SEYG ++  +LL Q    
Subjt:  ALLLSSCWRHYSILLFLEDCRFSQHYEEWLNQYLSGIQYYS-GHHTGESVGNKDGRETIIFFLNCLCLLLGRLDSKRFESTISEYGTQISQVLLLQGKGL

Query:  GVGGDKEIALVPQEDTGGEVFLGKDKIISVEDYGKGIWEDLSWECDFAFNCEAIIKKENVSAEWEFHSTDEDVIDEVVSIFKAVFLNAKFSSGGNITDIR
                                                                           S +E++ + VV+IFKAVF   +  SG + +D  
Subjt:  GVGGDKEIALVPQEDTGGEVFLGKDKIISVEDYGKGIWEDLSWECDFAFNCEAIIKKENVSAEWEFHSTDEDVIDEVVSIFKAVFLNAKFSSGGNITDIR

Query:  QLDVVMPLLLNLLDERDVTARAVIILIAESCLMSRDNQFLLEVFKRFDSDSIMQRRNAIDVISEIVQMSSNTRNLLTQSAWYICTLSSFFIFPRLDIFST
         +DVV+P LL+LLDERD  A+AV +L+A+ C  +  N  L E+ +R  S + +QR N++DVISE++ MS                          D F +
Subjt:  QLDVVMPLLLNLLDERDVTARAVIILIAESCLMSRDNQFLLEVFKRFDSDSIMQRRNAIDVISEIVQMSSNTRNLLTQSAWYICTLSSFFIFPRLDIFST

Query:  IIALTDINYCRQDTANQLIKCLEDEEILIRKQAADLLPCVDPTLFLPSLVRLVYSSNDEVLASAREALIGVLKCHNQNIGAIFMLL
         I   +I  C       L+KCL+DEE  I KQ ++LL  ++P+  LP+LV L+Y+ N +V +SA E L+GVLK H ++   I MLL
Subjt:  IIALTDINYCRQDTANQLIKCLEDEEILIRKQAADLLPCVDPTLFLPSLVRLVYSSNDEVLASAREALIGVLKCHNQNIGAIFMLL

AT3G57570.2 ARM repeat superfamily protein9.0e-12535.57Show/hide
Query:  SPPQSMVSVTIGRVMVTLLAARPKKLHDAVSSLSPDHRHGASSLDSLDQSLWFLHQYVRDAVQNHDSLDEILVPMLEHTLRFKDKNWKRGGQVMVLLNWL
        S P+S+VSVT+ R M TLL+ARPKKL +++S L+PD + G S   S+D++LWFL + V DA +  +++ EILVP++EHTLRFKD   K G   M+LLNWL
Subjt:  SPPQSMVSVTIGRVMVTLLAARPKKLHDAVSSLSPDHRHGASSLDSLDQSLWFLHQYVRDAVQNHDSLDEILVPMLEHTLRFKDKNWKRGGQVMVLLNWL

Query:  FLDELIFQTLIKNLADIIVRKDDRYVALGWCILVRSLVEFESVPCELSLNGLRERFNDMLKVLCSCIPRLTCILTKG----SILQEGFELPSRLSVYAAD
        F DE++FQ + +NL++II+R +DR++ALGWC+L+R LVE E    +   +G+RE+ +  ++++ SC+P L  I+  G    S+  +G+E+PSRLS+ AAD
Subjt:  FLDELIFQTLIKNLADIIVRKDDRYVALGWCILVRSLVEFESVPCELSLNGLRERFNDMLKVLCSCIPRLTCILTKG----SILQEGFELPSRLSVYAAD

Query:  CVVSLTNALTTKAEVQTRQKRLNASSSYQQLTFFSKAVGDQREKPISNSSKNSDSDMEYLLWHQLKDLVILVQRLLAVCFCLHVILFFCFAWYSFDLLNV
        C++S+T AL  +      + +    +   Q    +  + +++++P S   ++S+ +   +LW+ ++DL  LVQ L A                       
Subjt:  CVVSLTNALTTKAEVQTRQKRLNASSSYQQLTFFSKAVGDQREKPISNSSKNSDSDMEYLLWHQLKDLVILVQRLLAVCFCLHVILFFCFAWYSFDLLNV

Query:  TWNLGWHAVLSRRQEFKKARIANIENTHEIREPNRSCETVQAMNETKNLEASRLECEGRWSRKSRPLHAKGLEQVLKWLHEINTHYGNFQNEAGKAKSKI
                                                                   W+RK+R LHAKGL QVLKWL E+  H+G  Q EAG   +++
Subjt:  TWNLGWHAVLSRRQEFKKARIANIENTHEIREPNRSCETVQAMNETKNLEASRLECEGRWSRKSRPLHAKGLEQVLKWLHEINTHYGNFQNEAGKAKSKI

Query:  SQTGALLLSSCWRHYSILLFLEDCRFSQHYEEWLNQYLSGIQYYS-GHHTGESVGNKDGRETIIFFLNCLCLLLGRLDSKRFESTISEYGTQISQVLLLQ
        S  GALLLSSCW+HYS+LL +ED +FS+  +E L QYLSGI+YYS  +  G S     G ET  FFLNCLCLLLGR + K+FES +SEYG ++  +LL Q
Subjt:  SQTGALLLSSCWRHYSILLFLEDCRFSQHYEEWLNQYLSGIQYYS-GHHTGESVGNKDGRETIIFFLNCLCLLLGRLDSKRFESTISEYGTQISQVLLLQ

Query:  GKGLGVGGDKEIALVPQEDTGGEVFLGKDKIISVEDYGKGIWEDLSWECDFAFNCEAIIKKENVSAEWEFHSTDEDVIDEVVSIFKAVFLNAKFSSGGNI
                                                                               S +E++ + VV+IFKAVF   +  SG + 
Subjt:  GKGLGVGGDKEIALVPQEDTGGEVFLGKDKIISVEDYGKGIWEDLSWECDFAFNCEAIIKKENVSAEWEFHSTDEDVIDEVVSIFKAVFLNAKFSSGGNI

Query:  TDIRQLDVVMPLLLNLLDERDVTARAVIILIAESCLMSRDNQFLLEVFKRFDSDSIMQRRNAIDVISEIVQMSSNTRNLLTQSAWYICTLSSFFIFPRLD
        +D   +DVV+P LL+LLDERD  A+AV +L+A+ C  +  N  L E+ +R  S + +QR N++DVISE++ MS                          D
Subjt:  TDIRQLDVVMPLLLNLLDERDVTARAVIILIAESCLMSRDNQFLLEVFKRFDSDSIMQRRNAIDVISEIVQMSSNTRNLLTQSAWYICTLSSFFIFPRLD

Query:  IFSTIIALTDINYCRQDTANQLIKCLEDEEILIRKQAADLLPCVDPTLFLPSLVRLVYSSNDEVLASAREALIGVLKCHNQNIGAIFMLL
         F + I   +I  C       L+KCL+DEE  I KQ ++LL  ++P+  LP+LV L+Y+ N +V +SA E L+GVLK H ++   I MLL
Subjt:  IFSTIIALTDINYCRQDTANQLIKCLEDEEILIRKQAADLLPCVDPTLFLPSLVRLVYSSNDEVLASAREALIGVLKCHNQNIGAIFMLL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGAAGACGAAGGGGAACTGCTCCTATGGAAGTCGGACTCACCACCTCAATCTATGGTCTCCGTCACCATTGGCAGAGTAATGGTCACCTTACTCGCTGCTCGCCC
AAAGAAGCTCCACGACGCCGTTTCCAGTCTCTCACCCGACCACCGCCACGGAGCTTCTTCACTAGATTCTCTTGACCAATCTCTCTGGTTTCTGCACCAGTATGTCAGGG
ACGCTGTTCAAAACCATGACTCTTTAGATGAAATCCTCGTCCCCATGCTTGAACATACGTTGAGATTCAAGGATAAGAACTGGAAGCGAGGGGGCCAAGTTATGGTGCTT
CTTAACTGGCTGTTCCTTGATGAACTTATTTTCCAAACGCTGATAAAAAATCTTGCGGATATTATCGTGAGAAAGGATGATCGCTATGTCGCCCTTGGCTGGTGTATCCT
TGTCCGCAGTCTTGTGGAGTTCGAGTCTGTCCCTTGTGAACTTTCATTGAACGGGTTAAGGGAGAGATTCAATGATATGTTGAAGGTACTCTGTTCGTGCATTCCACGTC
TGACATGTATTTTAACTAAAGGAAGTATTCTACAGGAAGGGTTTGAGTTACCATCTCGCCTTTCAGTTTATGCTGCTGATTGTGTTGTATCTCTTACTAATGCACTGACC
ACAAAGGCTGAGGTTCAAACTAGGCAGAAAAGATTAAATGCCAGTTCATCATATCAGCAACTTACTTTCTTTTCAAAGGCTGTTGGTGACCAGCGAGAGAAACCAATTAG
TAATTCTTCAAAGAACTCAGACTCAGACATGGAATATTTACTCTGGCATCAATTGAAGGATCTCGTGATTTTAGTGCAGAGGCTTCTTGCAGTATGTTTCTGTTTGCATG
TTATACTCTTCTTCTGTTTTGCATGGTACAGTTTTGATTTGTTAAATGTTACTTGGAACCTAGGTTGGCATGCTGTTTTATCCCGACGACAAGAATTCAAAAAGGCTAGA
ATTGCAAATATTGAGAACACTCATGAAATTCGAGAACCTAACCGAAGCTGTGAAACCGTGCAAGCAATGAATGAAACAAAAAATCTAGAAGCTTCGAGACTTGAGTGTGA
AGGACGGTGGAGCAGGAAAAGCCGACCGTTGCATGCAAAAGGTCTAGAGCAAGTGCTTAAGTGGTTGCATGAGATAAATACGCATTATGGTAACTTCCAGAATGAGGCAG
GGAAGGCAAAATCTAAGATTTCCCAAACTGGAGCACTGCTACTTTCTTCTTGTTGGAGACACTACAGCATCTTATTATTTTTGGAAGATTGCCGATTTTCTCAGCATTAC
GAAGAATGGTTGAACCAGTATTTGTCAGGCATTCAGTATTATTCAGGGCACCACACTGGGGAAAGTGTTGGAAATAAGGATGGTAGAGAGACCATAATTTTTTTCCTGAA
TTGTTTATGCCTTCTATTGGGACGGCTTGACAGCAAAAGATTTGAAAGCACAATTTCAGAATATGGAACTCAGATTTCTCAGGTTCTGCTATTACAGGGGAAGGGGTTGG
GAGTGGGAGGTGATAAAGAGATTGCTCTGGTGCCTCAGGAGGATACGGGGGGAGAGGTGTTCTTAGGGAAGGATAAGATAATTTCGGTTGAGGATTATGGGAAAGGGATT
TGGGAAGACCTTTCGTGGGAGTGTGACTTTGCATTCAACTGTGAAGCGATCATAAAGAAAGAGAATGTGTCAGCAGAGTGGGAGTTCCATAGTACAGATGAAGATGTTAT
TGACGAGGTTGTTAGCATATTTAAGGCGGTTTTTCTCAATGCAAAATTTTCTTCTGGAGGCAACATCACTGATATTAGGCAACTGGATGTTGTGATGCCATTGTTGCTTA
ACCTTCTGGATGAGCGGGATGTCACAGCTAGAGCTGTCATCATTCTCATTGCTGAATCTTGTTTAATGAGCAGAGATAATCAGTTCCTTTTGGAAGTCTTTAAGCGATTT
GATTCTGATAGTATCATGCAAAGGAGGAATGCTATTGATGTGATATCTGAAATTGTTCAGATGTCATCAAATACAAGAAATTTACTTACTCAGTCAGCATGGTACATATG
CACCTTGTCTTCATTTTTCATTTTTCCTAGATTAGATATTTTTAGCACTATAATAGCTTTGACTGACATTAATTACTGCAGGCAAGATACAGCTAATCAATTAATCAAGT
GCCTGGAAGATGAAGAAATTCTGATTCGTAAACAGGCAGCTGATTTGCTTCCTTGTGTTGACCCTACATTGTTTTTACCGTCCTTAGTACGTCTTGTCTACTCGTCAAAT
GATGAAGTTTTAGCTTCAGCCAGGGAGGCTCTCATTGGGGTGCTCAAATGTCATAATCAGAACATTGGAGCAATTTTTATGCTACTTGACTGTGTTAGAAAAGCAAGATT
GTTTTCAGACCATTTTCCATTACTCCTTGAAGCTGGATCAGTTACTTGGGGTTCACCATTTCGCTTTTGCAATAGCTGGATGAGAATTAAAGATTGCTCGAAATTAATTC
AAGCAACCTTGACTCTTGACCGATCAGATGGGTGGGCTGGTTTTGCAATCAGCCTCAAGCTTTGCAAGGTGAAAGATAAAGTCAAAGCTTGGTTTGGGGATTATGAGCTG
GAAAGAAAAAGAAAAGAGAAGGACTTATTATTAGAATTGACATTTTTTGATTCAAAAGCCGAGAGTGAAGAATTGTCATCTGCAGAATTGGATATCAAGCTAGCCGTCAA
AGGAGATCTAATGAATTTATATCTTCTGGAGGAGAGGAATCTAATTCAAAAAACCGATCCCGAGACCTTCGCCAAGAACCGTGAGTTGGAAGTGATCCACTCCAGATGGG
CCATGCTTGGAGCTTTGGGCTGTGTCTTCCCTGAGCTTCTGTCCCGCAATGGCGTGAAGTTTGGTGAGGCTGTATGGTTCAAGGCTGGATCTCAAATCTTCAGCGAGGGA
GGACTCGACTACTTGGGGAACCCAAGCTTGATCCACGCACAGAGCATTTTGGCCATCTGGGCCACTCAGGTCGTGTTGATGGGCGCCGTCGAGGGCTACCGTATTGCCGG
CGGCCCACTCGGGGAGATCACTGACCCAATCTACCCAGGTGGAAGCTTCGACCCATTGGGTTTGGCTGATGACCCAGAGGCCTTCGCTGAATTGAAGGTTAAGGAGCTCA
AGAATGGAAGGTTGGCCATGTTTTCCATGTTTGGATTCTTCGTTCAGGCCATTGTTACTGGAAAGGGTCCATTGGAGAATTTGGCCGACCACCTTGCTGACCCAGTCAAC
AACAATGCTTGGGCCTATGCCACAAACTTTGTTCCCGGAAATGATTTTGCCCTAAATGCAGCTCTTCCAAATAGGGGGGGTAATGGTCAAGGTATATTTTTCAATTTGCA
GCAGATACATGACATTTCAATCGTTCTTGGTATTGTGTTATGTTCTCCATATACGCACGACTTATGTGGAAGGTTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGAAGACGAAGGGGAACTGCTCCTATGGAAGTCGGACTCACCACCTCAATCTATGGTCTCCGTCACCATTGGCAGAGTAATGGTCACCTTACTCGCTGCTCGCCC
AAAGAAGCTCCACGACGCCGTTTCCAGTCTCTCACCCGACCACCGCCACGGAGCTTCTTCACTAGATTCTCTTGACCAATCTCTCTGGTTTCTGCACCAGTATGTCAGGG
ACGCTGTTCAAAACCATGACTCTTTAGATGAAATCCTCGTCCCCATGCTTGAACATACGTTGAGATTCAAGGATAAGAACTGGAAGCGAGGGGGCCAAGTTATGGTGCTT
CTTAACTGGCTGTTCCTTGATGAACTTATTTTCCAAACGCTGATAAAAAATCTTGCGGATATTATCGTGAGAAAGGATGATCGCTATGTCGCCCTTGGCTGGTGTATCCT
TGTCCGCAGTCTTGTGGAGTTCGAGTCTGTCCCTTGTGAACTTTCATTGAACGGGTTAAGGGAGAGATTCAATGATATGTTGAAGGTACTCTGTTCGTGCATTCCACGTC
TGACATGTATTTTAACTAAAGGAAGTATTCTACAGGAAGGGTTTGAGTTACCATCTCGCCTTTCAGTTTATGCTGCTGATTGTGTTGTATCTCTTACTAATGCACTGACC
ACAAAGGCTGAGGTTCAAACTAGGCAGAAAAGATTAAATGCCAGTTCATCATATCAGCAACTTACTTTCTTTTCAAAGGCTGTTGGTGACCAGCGAGAGAAACCAATTAG
TAATTCTTCAAAGAACTCAGACTCAGACATGGAATATTTACTCTGGCATCAATTGAAGGATCTCGTGATTTTAGTGCAGAGGCTTCTTGCAGTATGTTTCTGTTTGCATG
TTATACTCTTCTTCTGTTTTGCATGGTACAGTTTTGATTTGTTAAATGTTACTTGGAACCTAGGTTGGCATGCTGTTTTATCCCGACGACAAGAATTCAAAAAGGCTAGA
ATTGCAAATATTGAGAACACTCATGAAATTCGAGAACCTAACCGAAGCTGTGAAACCGTGCAAGCAATGAATGAAACAAAAAATCTAGAAGCTTCGAGACTTGAGTGTGA
AGGACGGTGGAGCAGGAAAAGCCGACCGTTGCATGCAAAAGGTCTAGAGCAAGTGCTTAAGTGGTTGCATGAGATAAATACGCATTATGGTAACTTCCAGAATGAGGCAG
GGAAGGCAAAATCTAAGATTTCCCAAACTGGAGCACTGCTACTTTCTTCTTGTTGGAGACACTACAGCATCTTATTATTTTTGGAAGATTGCCGATTTTCTCAGCATTAC
GAAGAATGGTTGAACCAGTATTTGTCAGGCATTCAGTATTATTCAGGGCACCACACTGGGGAAAGTGTTGGAAATAAGGATGGTAGAGAGACCATAATTTTTTTCCTGAA
TTGTTTATGCCTTCTATTGGGACGGCTTGACAGCAAAAGATTTGAAAGCACAATTTCAGAATATGGAACTCAGATTTCTCAGGTTCTGCTATTACAGGGGAAGGGGTTGG
GAGTGGGAGGTGATAAAGAGATTGCTCTGGTGCCTCAGGAGGATACGGGGGGAGAGGTGTTCTTAGGGAAGGATAAGATAATTTCGGTTGAGGATTATGGGAAAGGGATT
TGGGAAGACCTTTCGTGGGAGTGTGACTTTGCATTCAACTGTGAAGCGATCATAAAGAAAGAGAATGTGTCAGCAGAGTGGGAGTTCCATAGTACAGATGAAGATGTTAT
TGACGAGGTTGTTAGCATATTTAAGGCGGTTTTTCTCAATGCAAAATTTTCTTCTGGAGGCAACATCACTGATATTAGGCAACTGGATGTTGTGATGCCATTGTTGCTTA
ACCTTCTGGATGAGCGGGATGTCACAGCTAGAGCTGTCATCATTCTCATTGCTGAATCTTGTTTAATGAGCAGAGATAATCAGTTCCTTTTGGAAGTCTTTAAGCGATTT
GATTCTGATAGTATCATGCAAAGGAGGAATGCTATTGATGTGATATCTGAAATTGTTCAGATGTCATCAAATACAAGAAATTTACTTACTCAGTCAGCATGGTACATATG
CACCTTGTCTTCATTTTTCATTTTTCCTAGATTAGATATTTTTAGCACTATAATAGCTTTGACTGACATTAATTACTGCAGGCAAGATACAGCTAATCAATTAATCAAGT
GCCTGGAAGATGAAGAAATTCTGATTCGTAAACAGGCAGCTGATTTGCTTCCTTGTGTTGACCCTACATTGTTTTTACCGTCCTTAGTACGTCTTGTCTACTCGTCAAAT
GATGAAGTTTTAGCTTCAGCCAGGGAGGCTCTCATTGGGGTGCTCAAATGTCATAATCAGAACATTGGAGCAATTTTTATGCTACTTGACTGTGTTAGAAAAGCAAGATT
GTTTTCAGACCATTTTCCATTACTCCTTGAAGCTGGATCAGTTACTTGGGGTTCACCATTTCGCTTTTGCAATAGCTGGATGAGAATTAAAGATTGCTCGAAATTAATTC
AAGCAACCTTGACTCTTGACCGATCAGATGGGTGGGCTGGTTTTGCAATCAGCCTCAAGCTTTGCAAGGTGAAAGATAAAGTCAAAGCTTGGTTTGGGGATTATGAGCTG
GAAAGAAAAAGAAAAGAGAAGGACTTATTATTAGAATTGACATTTTTTGATTCAAAAGCCGAGAGTGAAGAATTGTCATCTGCAGAATTGGATATCAAGCTAGCCGTCAA
AGGAGATCTAATGAATTTATATCTTCTGGAGGAGAGGAATCTAATTCAAAAAACCGATCCCGAGACCTTCGCCAAGAACCGTGAGTTGGAAGTGATCCACTCCAGATGGG
CCATGCTTGGAGCTTTGGGCTGTGTCTTCCCTGAGCTTCTGTCCCGCAATGGCGTGAAGTTTGGTGAGGCTGTATGGTTCAAGGCTGGATCTCAAATCTTCAGCGAGGGA
GGACTCGACTACTTGGGGAACCCAAGCTTGATCCACGCACAGAGCATTTTGGCCATCTGGGCCACTCAGGTCGTGTTGATGGGCGCCGTCGAGGGCTACCGTATTGCCGG
CGGCCCACTCGGGGAGATCACTGACCCAATCTACCCAGGTGGAAGCTTCGACCCATTGGGTTTGGCTGATGACCCAGAGGCCTTCGCTGAATTGAAGGTTAAGGAGCTCA
AGAATGGAAGGTTGGCCATGTTTTCCATGTTTGGATTCTTCGTTCAGGCCATTGTTACTGGAAAGGGTCCATTGGAGAATTTGGCCGACCACCTTGCTGACCCAGTCAAC
AACAATGCTTGGGCCTATGCCACAAACTTTGTTCCCGGAAATGATTTTGCCCTAAATGCAGCTCTTCCAAATAGGGGGGGTAATGGTCAAGGTATATTTTTCAATTTGCA
GCAGATACATGACATTTCAATCGTTCTTGGTATTGTGTTATGTTCTCCATATACGCACGACTTATGTGGAAGGTTTTGA
Protein sequenceShow/hide protein sequence
MEEDEGELLLWKSDSPPQSMVSVTIGRVMVTLLAARPKKLHDAVSSLSPDHRHGASSLDSLDQSLWFLHQYVRDAVQNHDSLDEILVPMLEHTLRFKDKNWKRGGQVMVL
LNWLFLDELIFQTLIKNLADIIVRKDDRYVALGWCILVRSLVEFESVPCELSLNGLRERFNDMLKVLCSCIPRLTCILTKGSILQEGFELPSRLSVYAADCVVSLTNALT
TKAEVQTRQKRLNASSSYQQLTFFSKAVGDQREKPISNSSKNSDSDMEYLLWHQLKDLVILVQRLLAVCFCLHVILFFCFAWYSFDLLNVTWNLGWHAVLSRRQEFKKAR
IANIENTHEIREPNRSCETVQAMNETKNLEASRLECEGRWSRKSRPLHAKGLEQVLKWLHEINTHYGNFQNEAGKAKSKISQTGALLLSSCWRHYSILLFLEDCRFSQHY
EEWLNQYLSGIQYYSGHHTGESVGNKDGRETIIFFLNCLCLLLGRLDSKRFESTISEYGTQISQVLLLQGKGLGVGGDKEIALVPQEDTGGEVFLGKDKIISVEDYGKGI
WEDLSWECDFAFNCEAIIKKENVSAEWEFHSTDEDVIDEVVSIFKAVFLNAKFSSGGNITDIRQLDVVMPLLLNLLDERDVTARAVIILIAESCLMSRDNQFLLEVFKRF
DSDSIMQRRNAIDVISEIVQMSSNTRNLLTQSAWYICTLSSFFIFPRLDIFSTIIALTDINYCRQDTANQLIKCLEDEEILIRKQAADLLPCVDPTLFLPSLVRLVYSSN
DEVLASAREALIGVLKCHNQNIGAIFMLLDCVRKARLFSDHFPLLLEAGSVTWGSPFRFCNSWMRIKDCSKLIQATLTLDRSDGWAGFAISLKLCKVKDKVKAWFGDYEL
ERKRKEKDLLLELTFFDSKAESEELSSAELDIKLAVKGDLMNLYLLEERNLIQKTDPETFAKNRELEVIHSRWAMLGALGCVFPELLSRNGVKFGEAVWFKAGSQIFSEG
GLDYLGNPSLIHAQSILAIWATQVVLMGAVEGYRIAGGPLGEITDPIYPGGSFDPLGLADDPEAFAELKVKELKNGRLAMFSMFGFFVQAIVTGKGPLENLADHLADPVN
NNAWAYATNFVPGNDFALNAALPNRGGNGQGIFFNLQQIHDISIVLGIVLCSPYTHDLCGRF