; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0017607 (gene) of Chayote v1 genome

Gene IDSed0017607
OrganismSechium edule (Chayote v1)
DescriptionUnknown protein
Genome locationLG01:65919046..65923262
RNA-Seq ExpressionSed0017607
SyntenySed0017607
Gene Ontology termsNA
InterPro domainsIPR012337 - Ribonuclease H-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KGN50377.1 hypothetical protein Csa_000462 [Cucumis sativus]0.0e+0087.77Show/hide
Query:  MASSNSPPATIDASALTEDSAAKALNKRYECLVSVRKKAIKGKGAWYWAHLEPVLVRNPSNSAPKAVKLKCSLCESVFSASNPSRTASEHLKRGTCPNLR
        MAS+NSPP  IDAS LTED A KALNKRYECLV+VR KAIKGKGAWYWAHLEPVL+RNP+NS PKAVKLKCSLC+SVFSASNPSRTASEHLKRGTCPNL 
Subjt:  MASSNSPPATIDASALTEDSAAKALNKRYECLVSVRKKAIKGKGAWYWAHLEPVLVRNPSNSAPKAVKLKCSLCESVFSASNPSRTASEHLKRGTCPNLR

Query:  SVSRS-DDAASPLPISSIPSPTLHNHKKRSSQTNVNAPVLTASYQVHSLAMIEPTRSYAPLISSSPPTPVARNP-GISSRIGLNHHQLVLSGGKDDLSAL
        S+SRS   AASPLPISSIPSPTLHNHKKRSSQ  +NAP+LTASYQVHSLAMIEPTRSYAPLI SSPPTP A+N  G++S++G N HQLVLSGGKDDL AL
Subjt:  SVSRS-DDAASPLPISSIPSPTLHNHKKRSSQTNVNAPVLTASYQVHSLAMIEPTRSYAPLISSSPPTPVARNP-GISSRIGLNHHQLVLSGGKDDLSAL

Query:  EMLENSVKKLKTPHASPGPRLSKEQTDSAIELLTDWFIESCGSVSISCIEHPKFKAFLSQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDASFFQIA
        EMLENSVKKLK+PHASPGPRLSKEQ DSAIELLTDWFIESCGSVS+SC +HPKFKA LSQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDA+FFQIA
Subjt:  EMLENSVKKLKTPHASPGPRLSKEQTDSAIELLTDWFIESCGSVSISCIEHPKFKAFLSQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDASFFQIA

Query:  SDGWKNKKNCIGYYCGEESVVKFMVNFPNGTTVFQKALFTGGLVSSKYAEEVISDRVDEICGGGLQRCVGIIADRYKGKALKNLEMKNHWMVNLSCQLQG
        SDGWKN KNC    C EESVVKFMVN PNGTTVFQKALFTGGLVSSKYAEEVI D V+EICG GLQ+CVGIIADRYK KAL+NLE+KNHWMVNLSCQLQG
Subjt:  SDGWKNKKNCIGYYCGEESVVKFMVNFPNGTTVFQKALFTGGLVSSKYAEEVISDRVDEICGGGLQRCVGIIADRYKGKALKNLEMKNHWMVNLSCQLQG

Query:  FIGLIKDFNKELPLFRVVTENCFKVANFVNTKSQVRNCLNKYKVQELEGRWLLHVPSPNCDTSKNFSPVYALLDDMLNFAHVLQMVVLDESYKVACMADS
        FI LIKDFNKELPLFR VTENC KVANFVNTKSQVRNC+NKYKVQELEG WLLHVPSPNCDTSKNFSPVY++LDDMLN AHVLQMVVLDESYKVACM DS
Subjt:  FIGLIKDFNKELPLFRVVTENCFKVANFVNTKSQVRNCLNKYKVQELEGRWLLHVPSPNCDTSKNFSPVYALLDDMLNFAHVLQMVVLDESYKVACMADS

Query:  LASEVTSLIQNERFWDEVEGVHSLVKMIRGMAREIEAERPLIGQCLPLWEELRTKVKEWCAKYSIAEGPVEKIIDKRFRKNYHPAWSAAFILDPLYLRRD
        LA+EV+SLIQNERFWDE+E VHS VKMIR MA+EIEAERPLIGQCLPLWEELRTKVKEWC K+SIAE PVEKI++KRFRKNYHPAWS AFILDPLYLRRD
Subjt:  LASEVTSLIQNERFWDEVEGVHSLVKMIRGMAREIEAERPLIGQCLPLWEELRTKVKEWCAKYSIAEGPVEKIIDKRFRKNYHPAWSAAFILDPLYLRRD

Query:  INGKYLPPFKCLSQEQVKDVDSLIIRLVSREEAHVALMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRQLVWETCLSEFKILGKVALRLIFL
        +NGKYLPPFKCLSQEQ KDVDSLI RLVSREEAH+A MELMKWRSEGLDPLYAQAVQVKQ DPLTGKMKIANPQSR+LVWETCLS FK LGKVALRLIFL
Subjt:  INGKYLPPFKCLSQEQVKDVDSLIIRLVSREEAHVALMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRQLVWETCLSEFKILGKVALRLIFL

Query:  HSTACSYKCKCSIMNLVCSHRHSRVGLERAQKMVFVAAHAKLERKDFSNEEDKDAELFAMADGENDMLNEVFSDAPSITVLDVFDQTEPELSEVNLGLDT
        HST+C +KCKCSIMNLVCS+RHSRVGLERAQKMVFVAAHAKLER DFSNEEDKDAELFAMADGENDMLNEVFSDAPSI V+DVFDQTEPELS VNLGL T
Subjt:  HSTACSYKCKCSIMNLVCSHRHSRVGLERAQKMVFVAAHAKLERKDFSNEEDKDAELFAMADGENDMLNEVFSDAPSITVLDVFDQTEPELSEVNLGLDT

Query:  ISLQLKLHFSDRIPDVSPVVPTYVAF
        ISLQLK+H +DRIPD+SPV+PTYV F
Subjt:  ISLQLKLHFSDRIPDVSPVVPTYVAF

XP_022928873.1 uncharacterized protein LOC111435659 isoform X1 [Cucurbita moschata]0.0e+0087.14Show/hide
Query:  MASSNSPPATIDASALTEDSAAKALNKRYECLVSVRKKAIKGKGAWYWAHLEPVLVRNPSNSAPKAVKLKCSLCESVFSASNPSRTASEHLKRGTCPNLR
        MAS+NSPP  ID+S LTED A KALNKRYECLV+VR KAIKGKGAWYWAHLEPVL+RNPSNS PKAVKLKCSLC+SVFSASNPSRTASEHLKRGTCPNL 
Subjt:  MASSNSPPATIDASALTEDSAAKALNKRYECLVSVRKKAIKGKGAWYWAHLEPVLVRNPSNSAPKAVKLKCSLCESVFSASNPSRTASEHLKRGTCPNLR

Query:  SVSRSDDAASPLPISSIPSPTLHNHKKRSSQTNVNAPVLTASYQVHSLAMIEPTRSYAPLISSSPPTPVARNPGISSRIGLNHHQLVLSGGKDDLSALEM
        S+S S+ +ASPLPISSIPSPT HNHKKRSS   +NAP+LTASYQVHSLAMIEPTRSYAPLI SSP TPVA+NP +S       HQLVLSGGKDDL ALEM
Subjt:  SVSRSDDAASPLPISSIPSPTLHNHKKRSSQTNVNAPVLTASYQVHSLAMIEPTRSYAPLISSSPPTPVARNPGISSRIGLNHHQLVLSGGKDDLSALEM

Query:  LENSVKKLKTPHASPGPRLSKEQTDSAIELLTDWFIESCGSVSISCIEHPKFKAFLSQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDASFFQIASD
        LENSVKKLK+PHASPGPRLSKEQ DSAIELLTDW IESCGSVS+SC+EHPKFKA LSQLGLPS+PRTDILGARLDSKFEEAKADSEARIRDA+ FQIASD
Subjt:  LENSVKKLKTPHASPGPRLSKEQTDSAIELLTDWFIESCGSVSISCIEHPKFKAFLSQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDASFFQIASD

Query:  GWKNKKNCIGYYCGEESVVKFMVNFPNGTTVFQKALFTGGLVSSKYAEEVISDRVDEICGGGLQRCVGIIADRYKGKALKNLEMKNHWMVNLSCQLQGFI
        GWKN KNC    CGEESVVKFMVN PNGTTVFQKALFTGGLVSSKYAEEVI D V+EICG  LQ+CVGIIAD+YK KAL+NLE+K HWMVNLSCQLQGFI
Subjt:  GWKNKKNCIGYYCGEESVVKFMVNFPNGTTVFQKALFTGGLVSSKYAEEVISDRVDEICGGGLQRCVGIIADRYKGKALKNLEMKNHWMVNLSCQLQGFI

Query:  GLIKDFNKELPLFRVVTENCFKVANFVNTKSQVRNCLNKYKVQELEGRWLLHVPSPNCDTSKNFSPVYALLDDMLNFAHVLQMVVLDESYKVACMADSLA
         LIKDFNKELPLFRVVTENC KVANFV+TKSQVRNCLNKYKVQELEG  L HVPSPNCDTSKNFSPVYA+LDD+L+ AHVLQMVVLDESYK+ACM DSLA
Subjt:  GLIKDFNKELPLFRVVTENCFKVANFVNTKSQVRNCLNKYKVQELEGRWLLHVPSPNCDTSKNFSPVYALLDDMLNFAHVLQMVVLDESYKVACMADSLA

Query:  SEVTSLIQNERFWDEVEGVHSLVKMIRGMAREIEAERPLIGQCLPLWEELRTKVKEWCAKYSIAEGPVEKIIDKRFRKNYHPAWSAAFILDPLYLRRDIN
        +EV+SLIQNERFWDEVE VHS VKMIRGMAREIEAERPLIGQCLPLWEELR+KVKEWCAKYSIAE PVEKII+KRFRKNYHPAWSAAFILDPLYLRRDIN
Subjt:  SEVTSLIQNERFWDEVEGVHSLVKMIRGMAREIEAERPLIGQCLPLWEELRTKVKEWCAKYSIAEGPVEKIIDKRFRKNYHPAWSAAFILDPLYLRRDIN

Query:  GKYLPPFKCLSQEQVKDVDSLIIRLVSREEAHVALMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRQLVWETCLSEFKILGKVALRLIFLHS
        GKYLPPFKCLSQEQ KDVDSL+ RLVSREEAHVA MELMKWRSEGLDPLYAQAVQVKQ DPLTGKMKIANPQSR+LVWETCLSEFK L KVALRLIFLHS
Subjt:  GKYLPPFKCLSQEQVKDVDSLIIRLVSREEAHVALMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRQLVWETCLSEFKILGKVALRLIFLHS

Query:  TACSYKCKCSIMNLVCSHRHSRVGLERAQKMVFVAAHAKLERKDFSNEEDKDAELFAMADGENDMLNEVFSDAPSITVLDVFDQTEPELSEVNLGLDTIS
        T+C YKCKCSIMNLVCSHRHSRVGLE+AQKMVFVAAHAKLER DFSNE DKDAELF+MADGENDMLNEVFSDAPSI V+D+FDQTEPELSEVNLGL TIS
Subjt:  TACSYKCKCSIMNLVCSHRHSRVGLERAQKMVFVAAHAKLERKDFSNEEDKDAELFAMADGENDMLNEVFSDAPSITVLDVFDQTEPELSEVNLGLDTIS

Query:  LQLKLHFSDRIPDVSPVVPTYVAF
        LQLK+HF+D IPD+SPVVP YV F
Subjt:  LQLKLHFSDRIPDVSPVVPTYVAF

XP_022958247.1 uncharacterized protein LOC111459531 isoform X1 [Cucurbita moschata]0.0e+0089.65Show/hide
Query:  MASSNSPPATIDASALTEDSAAKALNKRYECLVSVRKKAIKGKGAWYWAHLEPVLVRNPSNSAPKAVKLKCSLCESVFSASNPSRTASEHLKRGTCPNLR
        MAS+NSPP  IDASALT+D   KALNKRYECLV+VR KAIKGKGAWYWAHLEPVL+RNPSNS PKAVKLKCSLC+SVFSASNPSRTASEHLKRGTCPNL 
Subjt:  MASSNSPPATIDASALTEDSAAKALNKRYECLVSVRKKAIKGKGAWYWAHLEPVLVRNPSNSAPKAVKLKCSLCESVFSASNPSRTASEHLKRGTCPNLR

Query:  SVSRSDDAASPLPISSIPSPTLHNHKKRSSQTNVNAPVLTASYQVHSLAMIEPTRSYAPLISSSPPTPVARNP-GISSRIGLNHHQLVLSGGKDDLSALE
        S+SRS+  ASPLPISSIPSPTLHNHKKRSS  ++NAP+LTASYQVHSLAMIEPTRSYAPLI SSPPTPVA+NP G++S++ +N HQLVLSGGKDDL ALE
Subjt:  SVSRSDDAASPLPISSIPSPTLHNHKKRSSQTNVNAPVLTASYQVHSLAMIEPTRSYAPLISSSPPTPVARNP-GISSRIGLNHHQLVLSGGKDDLSALE

Query:  MLENSVKKLKTPHASPGPRLSKEQTDSAIELLTDWFIESCGSVSISCIEHPKFKAFLSQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDASFFQIAS
        MLENSVKKL++PHASPGPRLSKEQ DSAIELLTDWFIESCGSVS+SC+EHPKFKA LSQ+GLPSL RTDILGARLDSKFEEAKADSEARIRDASFFQIAS
Subjt:  MLENSVKKLKTPHASPGPRLSKEQTDSAIELLTDWFIESCGSVSISCIEHPKFKAFLSQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDASFFQIAS

Query:  DGWKNKKNCIGYYCGEESVVKFMVNFPNGTTVFQKALFTGGLVSSKYAEEVISDRVDEICGGGLQRCVGIIADRYKGKALKNLEMKNHWMVNLSCQLQGF
        DGWKN KNC GYYCGEESVVKFMVN PNG+T+FQKALFTGGLVS KYAEEVI D V+EICG  LQRCVGIIADRYKGKAL+NLE+KNHWMVNLSCQLQGF
Subjt:  DGWKNKKNCIGYYCGEESVVKFMVNFPNGTTVFQKALFTGGLVSSKYAEEVISDRVDEICGGGLQRCVGIIADRYKGKALKNLEMKNHWMVNLSCQLQGF

Query:  IGLIKDFNKELPLFRVVTENCFKVANFVNTKSQVRNCLNKYKVQELEGRWLLHVPSPNCDTSKNFSPVYALLDDMLNFAHVLQMVVLDESYKVACMADSL
        I LIKDFNKELPLFRVVTENC KVANFVNTKSQVRNCLNKYKVQELEGRWLLHVPSPNCDTSKNFSPVYA+LDDML+ AHVLQMVVLDESY++ACM D L
Subjt:  IGLIKDFNKELPLFRVVTENCFKVANFVNTKSQVRNCLNKYKVQELEGRWLLHVPSPNCDTSKNFSPVYALLDDMLNFAHVLQMVVLDESYKVACMADSL

Query:  ASEVTSLIQNERFWDEVEGVHSLVKMIRGMAREIEAERPLIGQCLPLWEELRTKVKEWCAKYSIAEGPVEKIIDKRFRKNYHPAWSAAFILDPLYLRRDI
        ASEV+SLIQNERFWDE+EGVHSLVKMIRGMA+EIEAERPLIGQCLPLWEELRTKVKEWCAK+SIAEGPVEKI++KRFRKNYHPAWSAAFILDPLYLRRDI
Subjt:  ASEVTSLIQNERFWDEVEGVHSLVKMIRGMAREIEAERPLIGQCLPLWEELRTKVKEWCAKYSIAEGPVEKIIDKRFRKNYHPAWSAAFILDPLYLRRDI

Query:  NGKYLPPFKCLSQEQVKDVDSLIIRLVSREEAHVALMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRQLVWETCLSEFKILGKVALRLIFLH
        NGKYLPPFKCLSQEQ KDVDSLI RLVSREEAHVA MELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSR+LVWETCLSEFK LGKVALRLIFLH
Subjt:  NGKYLPPFKCLSQEQVKDVDSLIIRLVSREEAHVALMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRQLVWETCLSEFKILGKVALRLIFLH

Query:  STACSYKCKCSIMNLVCSHRHSRVGLERAQKMVFVAAHAKLERKDFSNEEDKDAELFAMADGENDMLNEVFSDAPSITVLDVFDQTEPELSE
        ST+C YKCKCSIMNLVCSHRHSR+GLERAQKMVFVAAHAKLERKDFSNE+DKDAELF MADGENDMLNEVFSDAPSIT LDVFDQ EPELS+
Subjt:  STACSYKCKCSIMNLVCSHRHSRVGLERAQKMVFVAAHAKLERKDFSNEEDKDAELFAMADGENDMLNEVFSDAPSITVLDVFDQTEPELSE

XP_022995714.1 uncharacterized protein LOC111491170 isoform X1 [Cucurbita maxima]0.0e+0089.39Show/hide
Query:  MASSNSPPATIDASALTEDSAAKALNKRYECLVSVRKKAIKGKGAWYWAHLEPVLVRNPSNSAPKAVKLKCSLCESVFSASNPSRTASEHLKRGTCPNLR
        MAS+NSPP  IDASALT+D   KALNKRYECLV+VR KAIKGKGAWYWAHLEPVL+RNPSNS PKAVKLKCSLC+SVFSASNPSRTASEHLKRGTCPNL 
Subjt:  MASSNSPPATIDASALTEDSAAKALNKRYECLVSVRKKAIKGKGAWYWAHLEPVLVRNPSNSAPKAVKLKCSLCESVFSASNPSRTASEHLKRGTCPNLR

Query:  SVSRSDDAASPLPISSIPSPTLHNHKKRSSQTNVNAPVLTASYQVHSLAMIEPTRSYAPLISSSPPTPVARNP-GISSRIGLNHHQLVLSGGKDDLSALE
        S+SRS+  ASPLPISSIPSPTLHNHKKRSSQ  +NAP+LTASYQVHSLAMIEPTRSYAPLI SSPPTPVA+NP G++S++ +N HQLVLSGGKDDL ALE
Subjt:  SVSRSDDAASPLPISSIPSPTLHNHKKRSSQTNVNAPVLTASYQVHSLAMIEPTRSYAPLISSSPPTPVARNP-GISSRIGLNHHQLVLSGGKDDLSALE

Query:  MLENSVKKLKTPHASPGPRLSKEQTDSAIELLTDWFIESCGSVSISCIEHPKFKAFLSQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDASFFQIAS
        MLENSVKKL++PHASPGPRLSKEQ DSAIELLTDWFIESCGSVS+SC+EHPKFKA LSQ+GLPSL RTDILGARLDSKFEEAK DSEARIRDASFFQIAS
Subjt:  MLENSVKKLKTPHASPGPRLSKEQTDSAIELLTDWFIESCGSVSISCIEHPKFKAFLSQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDASFFQIAS

Query:  DGWKNKKNCIGYYCGEESVVKFMVNFPNGTTVFQKALFTGGLVSSKYAEEVISDRVDEICGGGLQRCVGIIADRYKGKALKNLEMKNHWMVNLSCQLQGF
        DGWKN KNC GYYC EESVVKFMVN PNG+T+FQKALFTGGLVS KYAEEVI D V+EICG  LQRCVGIIADRYKGKAL+NLEMKNHWMVNLSCQLQGF
Subjt:  DGWKNKKNCIGYYCGEESVVKFMVNFPNGTTVFQKALFTGGLVSSKYAEEVISDRVDEICGGGLQRCVGIIADRYKGKALKNLEMKNHWMVNLSCQLQGF

Query:  IGLIKDFNKELPLFRVVTENCFKVANFVNTKSQVRNCLNKYKVQELEGRWLLHVPSPNCDTSKNFSPVYALLDDMLNFAHVLQMVVLDESYKVACMADSL
        I LIKDFNKELPLFRVVTENC KVANFVNTKSQVRNCLNKYKVQELEGRWLLHVPSPNCDTSKNFSPVYA+LDDML+ AHVLQMVVLDESY++ACM D L
Subjt:  IGLIKDFNKELPLFRVVTENCFKVANFVNTKSQVRNCLNKYKVQELEGRWLLHVPSPNCDTSKNFSPVYALLDDMLNFAHVLQMVVLDESYKVACMADSL

Query:  ASEVTSLIQNERFWDEVEGVHSLVKMIRGMAREIEAERPLIGQCLPLWEELRTKVKEWCAKYSIAEGPVEKIIDKRFRKNYHPAWSAAFILDPLYLRRDI
        ASEV+SLIQNERFWDEVEGVHSLVKMIRGMA+EIEAERPLIGQCLPLWEELRTKVKEWCAK+SIAEGPVEKI++KRFRKNYHPAWSAAFILDPLYLRRDI
Subjt:  ASEVTSLIQNERFWDEVEGVHSLVKMIRGMAREIEAERPLIGQCLPLWEELRTKVKEWCAKYSIAEGPVEKIIDKRFRKNYHPAWSAAFILDPLYLRRDI

Query:  NGKYLPPFKCLSQEQVKDVDSLIIRLVSREEAHVALMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRQLVWETCLSEFKILGKVALRLIFLH
        NGKYLPPFKCLSQEQ KDVDSLI RLVSREEAHVA MELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSR+LVWETCLSEFK LGKVALRLIFLH
Subjt:  NGKYLPPFKCLSQEQVKDVDSLIIRLVSREEAHVALMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRQLVWETCLSEFKILGKVALRLIFLH

Query:  STACSYKCKCSIMNLVCSHRHSRVGLERAQKMVFVAAHAKLERKDFSNEEDKDAELFAMADGENDMLNEVFSDAPSITVLDVFDQTEPELSE
        ST+C Y CKCSI+NLVCSHRHSR+GLERAQKMVFVAAH KLERKDFSNE+DKDAELFAMADGENDMLNEVFSDAPSIT LDVFD+ EPELS+
Subjt:  STACSYKCKCSIMNLVCSHRHSRVGLERAQKMVFVAAHAKLERKDFSNEEDKDAELFAMADGENDMLNEVFSDAPSITVLDVFDQTEPELSE

XP_023533563.1 uncharacterized protein LOC111795394 isoform X1 [Cucurbita pepo subsp. pepo]0.0e+0089.77Show/hide
Query:  MASSNSPPATIDASALTEDSAAKALNKRYECLVSVRKKAIKGKGAWYWAHLEPVLVRNPSNSAPKAVKLKCSLCESVFSASNPSRTASEHLKRGTCPNLR
        MAS+NSPP  IDASALT+D   KALNKRYECLV+VR KAIKGKGAWYWAHLEPVL+RNPSNS PKAVKLKCSLC+SVFSASNPSRTASEHLKRGTCPNL 
Subjt:  MASSNSPPATIDASALTEDSAAKALNKRYECLVSVRKKAIKGKGAWYWAHLEPVLVRNPSNSAPKAVKLKCSLCESVFSASNPSRTASEHLKRGTCPNLR

Query:  SVSRSDDAASPLPISSIPSPTLHNHKKRSSQTNVNAPVLTASYQVHSLAMIEPTRSYAPLISSSPPTPVARNP-GISSRIGLNHHQLVLSGGKDDLSALE
        S+SRS+  ASPLPISSIPSPTLHNHKKRSSQ  +NAP+LTASYQVHSLAMIEPTRSYAPLI SSPPTPVA+NP G++S++ +N HQLVLSGGKDDL ALE
Subjt:  SVSRSDDAASPLPISSIPSPTLHNHKKRSSQTNVNAPVLTASYQVHSLAMIEPTRSYAPLISSSPPTPVARNP-GISSRIGLNHHQLVLSGGKDDLSALE

Query:  MLENSVKKLKTPHASPGPRLSKEQTDSAIELLTDWFIESCGSVSISCIEHPKFKAFLSQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDASFFQIAS
        MLENSVKKL++PHASPGPRLSKEQ DSAIELLTDWFIESCGSVS+SC+EHPKFKA LSQ+GLPSL RTDILGARLDSKFEEAKADSEARIRDASFFQIAS
Subjt:  MLENSVKKLKTPHASPGPRLSKEQTDSAIELLTDWFIESCGSVSISCIEHPKFKAFLSQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDASFFQIAS

Query:  DGWKNKKNCIGYYCGEESVVKFMVNFPNGTTVFQKALFTGGLVSSKYAEEVISDRVDEICGGGLQRCVGIIADRYKGKALKNLEMKNHWMVNLSCQLQGF
        DGWKN KNC GYYCGEESVVKFMVN PNG+T+FQKALFTGGLVS KYAEEVI D V+EICG  LQRCVGIIADRYKGKAL+NLE+KNHWMVNLSCQLQGF
Subjt:  DGWKNKKNCIGYYCGEESVVKFMVNFPNGTTVFQKALFTGGLVSSKYAEEVISDRVDEICGGGLQRCVGIIADRYKGKALKNLEMKNHWMVNLSCQLQGF

Query:  IGLIKDFNKELPLFRVVTENCFKVANFVNTKSQVRNCLNKYKVQELEGRWLLHVPSPNCDTSKNFSPVYALLDDMLNFAHVLQMVVLDESYKVACMADSL
        I LIKDFNKELPLFRVVTENC KVANFVNTKSQVRNCLNKYKVQELEGRWLLHVPSPNCDTSKNFSPVYA+LDDML+ AHVLQMVVLDESY++ CM D L
Subjt:  IGLIKDFNKELPLFRVVTENCFKVANFVNTKSQVRNCLNKYKVQELEGRWLLHVPSPNCDTSKNFSPVYALLDDMLNFAHVLQMVVLDESYKVACMADSL

Query:  ASEVTSLIQNERFWDEVEGVHSLVKMIRGMAREIEAERPLIGQCLPLWEELRTKVKEWCAKYSIAEGPVEKIIDKRFRKNYHPAWSAAFILDPLYLRRDI
        ASEV+SLIQNERFWDE+EGVHSLVKMIRGMA+EIEAERPLIGQCLPLWEELRTKVKEWCAK+SIAEGPVEKI++KRFRKNYHPAWSAAFILDPLYLRRDI
Subjt:  ASEVTSLIQNERFWDEVEGVHSLVKMIRGMAREIEAERPLIGQCLPLWEELRTKVKEWCAKYSIAEGPVEKIIDKRFRKNYHPAWSAAFILDPLYLRRDI

Query:  NGKYLPPFKCLSQEQVKDVDSLIIRLVSREEAHVALMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRQLVWETCLSEFKILGKVALRLIFLH
        NGKYLPPFKCLSQEQ KDVDSLI RLVSREEAHVA MELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSR+LVWETCLSEFK LGKVALRLIFLH
Subjt:  NGKYLPPFKCLSQEQVKDVDSLIIRLVSREEAHVALMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRQLVWETCLSEFKILGKVALRLIFLH

Query:  STACSYKCKCSIMNLVCSHRHSRVGLERAQKMVFVAAHAKLERKDFSNEEDKDAELFAMADGENDMLNEVFSDAPSITVLDVFDQTEPELSE
        ST+C YKCKCSIMNLVCSHRHSR+GLERAQKMVFVAAHAKLERKDFSNE+DKDAELFAMADGENDMLNEVFSDAPSIT LDVFDQ EPELS+
Subjt:  STACSYKCKCSIMNLVCSHRHSRVGLERAQKMVFVAAHAKLERKDFSNEEDKDAELFAMADGENDMLNEVFSDAPSITVLDVFDQTEPELSE

TrEMBL top hitse value%identityAlignment
A0A0A0KPN2 Uncharacterized protein0.0e+0087.77Show/hide
Query:  MASSNSPPATIDASALTEDSAAKALNKRYECLVSVRKKAIKGKGAWYWAHLEPVLVRNPSNSAPKAVKLKCSLCESVFSASNPSRTASEHLKRGTCPNLR
        MAS+NSPP  IDAS LTED A KALNKRYECLV+VR KAIKGKGAWYWAHLEPVL+RNP+NS PKAVKLKCSLC+SVFSASNPSRTASEHLKRGTCPNL 
Subjt:  MASSNSPPATIDASALTEDSAAKALNKRYECLVSVRKKAIKGKGAWYWAHLEPVLVRNPSNSAPKAVKLKCSLCESVFSASNPSRTASEHLKRGTCPNLR

Query:  SVSRS-DDAASPLPISSIPSPTLHNHKKRSSQTNVNAPVLTASYQVHSLAMIEPTRSYAPLISSSPPTPVARNP-GISSRIGLNHHQLVLSGGKDDLSAL
        S+SRS   AASPLPISSIPSPTLHNHKKRSSQ  +NAP+LTASYQVHSLAMIEPTRSYAPLI SSPPTP A+N  G++S++G N HQLVLSGGKDDL AL
Subjt:  SVSRS-DDAASPLPISSIPSPTLHNHKKRSSQTNVNAPVLTASYQVHSLAMIEPTRSYAPLISSSPPTPVARNP-GISSRIGLNHHQLVLSGGKDDLSAL

Query:  EMLENSVKKLKTPHASPGPRLSKEQTDSAIELLTDWFIESCGSVSISCIEHPKFKAFLSQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDASFFQIA
        EMLENSVKKLK+PHASPGPRLSKEQ DSAIELLTDWFIESCGSVS+SC +HPKFKA LSQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDA+FFQIA
Subjt:  EMLENSVKKLKTPHASPGPRLSKEQTDSAIELLTDWFIESCGSVSISCIEHPKFKAFLSQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDASFFQIA

Query:  SDGWKNKKNCIGYYCGEESVVKFMVNFPNGTTVFQKALFTGGLVSSKYAEEVISDRVDEICGGGLQRCVGIIADRYKGKALKNLEMKNHWMVNLSCQLQG
        SDGWKN KNC    C EESVVKFMVN PNGTTVFQKALFTGGLVSSKYAEEVI D V+EICG GLQ+CVGIIADRYK KAL+NLE+KNHWMVNLSCQLQG
Subjt:  SDGWKNKKNCIGYYCGEESVVKFMVNFPNGTTVFQKALFTGGLVSSKYAEEVISDRVDEICGGGLQRCVGIIADRYKGKALKNLEMKNHWMVNLSCQLQG

Query:  FIGLIKDFNKELPLFRVVTENCFKVANFVNTKSQVRNCLNKYKVQELEGRWLLHVPSPNCDTSKNFSPVYALLDDMLNFAHVLQMVVLDESYKVACMADS
        FI LIKDFNKELPLFR VTENC KVANFVNTKSQVRNC+NKYKVQELEG WLLHVPSPNCDTSKNFSPVY++LDDMLN AHVLQMVVLDESYKVACM DS
Subjt:  FIGLIKDFNKELPLFRVVTENCFKVANFVNTKSQVRNCLNKYKVQELEGRWLLHVPSPNCDTSKNFSPVYALLDDMLNFAHVLQMVVLDESYKVACMADS

Query:  LASEVTSLIQNERFWDEVEGVHSLVKMIRGMAREIEAERPLIGQCLPLWEELRTKVKEWCAKYSIAEGPVEKIIDKRFRKNYHPAWSAAFILDPLYLRRD
        LA+EV+SLIQNERFWDE+E VHS VKMIR MA+EIEAERPLIGQCLPLWEELRTKVKEWC K+SIAE PVEKI++KRFRKNYHPAWS AFILDPLYLRRD
Subjt:  LASEVTSLIQNERFWDEVEGVHSLVKMIRGMAREIEAERPLIGQCLPLWEELRTKVKEWCAKYSIAEGPVEKIIDKRFRKNYHPAWSAAFILDPLYLRRD

Query:  INGKYLPPFKCLSQEQVKDVDSLIIRLVSREEAHVALMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRQLVWETCLSEFKILGKVALRLIFL
        +NGKYLPPFKCLSQEQ KDVDSLI RLVSREEAH+A MELMKWRSEGLDPLYAQAVQVKQ DPLTGKMKIANPQSR+LVWETCLS FK LGKVALRLIFL
Subjt:  INGKYLPPFKCLSQEQVKDVDSLIIRLVSREEAHVALMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRQLVWETCLSEFKILGKVALRLIFL

Query:  HSTACSYKCKCSIMNLVCSHRHSRVGLERAQKMVFVAAHAKLERKDFSNEEDKDAELFAMADGENDMLNEVFSDAPSITVLDVFDQTEPELSEVNLGLDT
        HST+C +KCKCSIMNLVCS+RHSRVGLERAQKMVFVAAHAKLER DFSNEEDKDAELFAMADGENDMLNEVFSDAPSI V+DVFDQTEPELS VNLGL T
Subjt:  HSTACSYKCKCSIMNLVCSHRHSRVGLERAQKMVFVAAHAKLERKDFSNEEDKDAELFAMADGENDMLNEVFSDAPSITVLDVFDQTEPELSEVNLGLDT

Query:  ISLQLKLHFSDRIPDVSPVVPTYVAF
        ISLQLK+H +DRIPD+SPV+PTYV F
Subjt:  ISLQLKLHFSDRIPDVSPVVPTYVAF

A0A6J1EL65 uncharacterized protein LOC111435659 isoform X10.0e+0087.14Show/hide
Query:  MASSNSPPATIDASALTEDSAAKALNKRYECLVSVRKKAIKGKGAWYWAHLEPVLVRNPSNSAPKAVKLKCSLCESVFSASNPSRTASEHLKRGTCPNLR
        MAS+NSPP  ID+S LTED A KALNKRYECLV+VR KAIKGKGAWYWAHLEPVL+RNPSNS PKAVKLKCSLC+SVFSASNPSRTASEHLKRGTCPNL 
Subjt:  MASSNSPPATIDASALTEDSAAKALNKRYECLVSVRKKAIKGKGAWYWAHLEPVLVRNPSNSAPKAVKLKCSLCESVFSASNPSRTASEHLKRGTCPNLR

Query:  SVSRSDDAASPLPISSIPSPTLHNHKKRSSQTNVNAPVLTASYQVHSLAMIEPTRSYAPLISSSPPTPVARNPGISSRIGLNHHQLVLSGGKDDLSALEM
        S+S S+ +ASPLPISSIPSPT HNHKKRSS   +NAP+LTASYQVHSLAMIEPTRSYAPLI SSP TPVA+NP +S       HQLVLSGGKDDL ALEM
Subjt:  SVSRSDDAASPLPISSIPSPTLHNHKKRSSQTNVNAPVLTASYQVHSLAMIEPTRSYAPLISSSPPTPVARNPGISSRIGLNHHQLVLSGGKDDLSALEM

Query:  LENSVKKLKTPHASPGPRLSKEQTDSAIELLTDWFIESCGSVSISCIEHPKFKAFLSQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDASFFQIASD
        LENSVKKLK+PHASPGPRLSKEQ DSAIELLTDW IESCGSVS+SC+EHPKFKA LSQLGLPS+PRTDILGARLDSKFEEAKADSEARIRDA+ FQIASD
Subjt:  LENSVKKLKTPHASPGPRLSKEQTDSAIELLTDWFIESCGSVSISCIEHPKFKAFLSQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDASFFQIASD

Query:  GWKNKKNCIGYYCGEESVVKFMVNFPNGTTVFQKALFTGGLVSSKYAEEVISDRVDEICGGGLQRCVGIIADRYKGKALKNLEMKNHWMVNLSCQLQGFI
        GWKN KNC    CGEESVVKFMVN PNGTTVFQKALFTGGLVSSKYAEEVI D V+EICG  LQ+CVGIIAD+YK KAL+NLE+K HWMVNLSCQLQGFI
Subjt:  GWKNKKNCIGYYCGEESVVKFMVNFPNGTTVFQKALFTGGLVSSKYAEEVISDRVDEICGGGLQRCVGIIADRYKGKALKNLEMKNHWMVNLSCQLQGFI

Query:  GLIKDFNKELPLFRVVTENCFKVANFVNTKSQVRNCLNKYKVQELEGRWLLHVPSPNCDTSKNFSPVYALLDDMLNFAHVLQMVVLDESYKVACMADSLA
         LIKDFNKELPLFRVVTENC KVANFV+TKSQVRNCLNKYKVQELEG  L HVPSPNCDTSKNFSPVYA+LDD+L+ AHVLQMVVLDESYK+ACM DSLA
Subjt:  GLIKDFNKELPLFRVVTENCFKVANFVNTKSQVRNCLNKYKVQELEGRWLLHVPSPNCDTSKNFSPVYALLDDMLNFAHVLQMVVLDESYKVACMADSLA

Query:  SEVTSLIQNERFWDEVEGVHSLVKMIRGMAREIEAERPLIGQCLPLWEELRTKVKEWCAKYSIAEGPVEKIIDKRFRKNYHPAWSAAFILDPLYLRRDIN
        +EV+SLIQNERFWDEVE VHS VKMIRGMAREIEAERPLIGQCLPLWEELR+KVKEWCAKYSIAE PVEKII+KRFRKNYHPAWSAAFILDPLYLRRDIN
Subjt:  SEVTSLIQNERFWDEVEGVHSLVKMIRGMAREIEAERPLIGQCLPLWEELRTKVKEWCAKYSIAEGPVEKIIDKRFRKNYHPAWSAAFILDPLYLRRDIN

Query:  GKYLPPFKCLSQEQVKDVDSLIIRLVSREEAHVALMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRQLVWETCLSEFKILGKVALRLIFLHS
        GKYLPPFKCLSQEQ KDVDSL+ RLVSREEAHVA MELMKWRSEGLDPLYAQAVQVKQ DPLTGKMKIANPQSR+LVWETCLSEFK L KVALRLIFLHS
Subjt:  GKYLPPFKCLSQEQVKDVDSLIIRLVSREEAHVALMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRQLVWETCLSEFKILGKVALRLIFLHS

Query:  TACSYKCKCSIMNLVCSHRHSRVGLERAQKMVFVAAHAKLERKDFSNEEDKDAELFAMADGENDMLNEVFSDAPSITVLDVFDQTEPELSEVNLGLDTIS
        T+C YKCKCSIMNLVCSHRHSRVGLE+AQKMVFVAAHAKLER DFSNE DKDAELF+MADGENDMLNEVFSDAPSI V+D+FDQTEPELSEVNLGL TIS
Subjt:  TACSYKCKCSIMNLVCSHRHSRVGLERAQKMVFVAAHAKLERKDFSNEEDKDAELFAMADGENDMLNEVFSDAPSITVLDVFDQTEPELSEVNLGLDTIS

Query:  LQLKLHFSDRIPDVSPVVPTYVAF
        LQLK+HF+D IPD+SPVVP YV F
Subjt:  LQLKLHFSDRIPDVSPVVPTYVAF

A0A6J1H1B8 uncharacterized protein LOC111459531 isoform X10.0e+0089.65Show/hide
Query:  MASSNSPPATIDASALTEDSAAKALNKRYECLVSVRKKAIKGKGAWYWAHLEPVLVRNPSNSAPKAVKLKCSLCESVFSASNPSRTASEHLKRGTCPNLR
        MAS+NSPP  IDASALT+D   KALNKRYECLV+VR KAIKGKGAWYWAHLEPVL+RNPSNS PKAVKLKCSLC+SVFSASNPSRTASEHLKRGTCPNL 
Subjt:  MASSNSPPATIDASALTEDSAAKALNKRYECLVSVRKKAIKGKGAWYWAHLEPVLVRNPSNSAPKAVKLKCSLCESVFSASNPSRTASEHLKRGTCPNLR

Query:  SVSRSDDAASPLPISSIPSPTLHNHKKRSSQTNVNAPVLTASYQVHSLAMIEPTRSYAPLISSSPPTPVARNP-GISSRIGLNHHQLVLSGGKDDLSALE
        S+SRS+  ASPLPISSIPSPTLHNHKKRSS  ++NAP+LTASYQVHSLAMIEPTRSYAPLI SSPPTPVA+NP G++S++ +N HQLVLSGGKDDL ALE
Subjt:  SVSRSDDAASPLPISSIPSPTLHNHKKRSSQTNVNAPVLTASYQVHSLAMIEPTRSYAPLISSSPPTPVARNP-GISSRIGLNHHQLVLSGGKDDLSALE

Query:  MLENSVKKLKTPHASPGPRLSKEQTDSAIELLTDWFIESCGSVSISCIEHPKFKAFLSQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDASFFQIAS
        MLENSVKKL++PHASPGPRLSKEQ DSAIELLTDWFIESCGSVS+SC+EHPKFKA LSQ+GLPSL RTDILGARLDSKFEEAKADSEARIRDASFFQIAS
Subjt:  MLENSVKKLKTPHASPGPRLSKEQTDSAIELLTDWFIESCGSVSISCIEHPKFKAFLSQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDASFFQIAS

Query:  DGWKNKKNCIGYYCGEESVVKFMVNFPNGTTVFQKALFTGGLVSSKYAEEVISDRVDEICGGGLQRCVGIIADRYKGKALKNLEMKNHWMVNLSCQLQGF
        DGWKN KNC GYYCGEESVVKFMVN PNG+T+FQKALFTGGLVS KYAEEVI D V+EICG  LQRCVGIIADRYKGKAL+NLE+KNHWMVNLSCQLQGF
Subjt:  DGWKNKKNCIGYYCGEESVVKFMVNFPNGTTVFQKALFTGGLVSSKYAEEVISDRVDEICGGGLQRCVGIIADRYKGKALKNLEMKNHWMVNLSCQLQGF

Query:  IGLIKDFNKELPLFRVVTENCFKVANFVNTKSQVRNCLNKYKVQELEGRWLLHVPSPNCDTSKNFSPVYALLDDMLNFAHVLQMVVLDESYKVACMADSL
        I LIKDFNKELPLFRVVTENC KVANFVNTKSQVRNCLNKYKVQELEGRWLLHVPSPNCDTSKNFSPVYA+LDDML+ AHVLQMVVLDESY++ACM D L
Subjt:  IGLIKDFNKELPLFRVVTENCFKVANFVNTKSQVRNCLNKYKVQELEGRWLLHVPSPNCDTSKNFSPVYALLDDMLNFAHVLQMVVLDESYKVACMADSL

Query:  ASEVTSLIQNERFWDEVEGVHSLVKMIRGMAREIEAERPLIGQCLPLWEELRTKVKEWCAKYSIAEGPVEKIIDKRFRKNYHPAWSAAFILDPLYLRRDI
        ASEV+SLIQNERFWDE+EGVHSLVKMIRGMA+EIEAERPLIGQCLPLWEELRTKVKEWCAK+SIAEGPVEKI++KRFRKNYHPAWSAAFILDPLYLRRDI
Subjt:  ASEVTSLIQNERFWDEVEGVHSLVKMIRGMAREIEAERPLIGQCLPLWEELRTKVKEWCAKYSIAEGPVEKIIDKRFRKNYHPAWSAAFILDPLYLRRDI

Query:  NGKYLPPFKCLSQEQVKDVDSLIIRLVSREEAHVALMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRQLVWETCLSEFKILGKVALRLIFLH
        NGKYLPPFKCLSQEQ KDVDSLI RLVSREEAHVA MELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSR+LVWETCLSEFK LGKVALRLIFLH
Subjt:  NGKYLPPFKCLSQEQVKDVDSLIIRLVSREEAHVALMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRQLVWETCLSEFKILGKVALRLIFLH

Query:  STACSYKCKCSIMNLVCSHRHSRVGLERAQKMVFVAAHAKLERKDFSNEEDKDAELFAMADGENDMLNEVFSDAPSITVLDVFDQTEPELSE
        ST+C YKCKCSIMNLVCSHRHSR+GLERAQKMVFVAAHAKLERKDFSNE+DKDAELF MADGENDMLNEVFSDAPSIT LDVFDQ EPELS+
Subjt:  STACSYKCKCSIMNLVCSHRHSRVGLERAQKMVFVAAHAKLERKDFSNEEDKDAELFAMADGENDMLNEVFSDAPSITVLDVFDQTEPELSE

A0A6J1H4J7 uncharacterized protein LOC111459531 isoform X30.0e+0089.83Show/hide
Query:  MASSNSPPATIDASALTEDSAAKALNKRYECLVSVRKKAIKGKGAWYWAHLEPVLVRNPSNSAPKAVKLKCSLCESVFSASNPSRTASEHLKRGTCPNLR
        MAS+NSPP  IDASALT+D   KALNKRYECLV+VR KAIKGKGAWYWAHLEPVL+RNPSNS PKAVKLKCSLC+SVFSASNPSRTASEHLKRGTCPNL 
Subjt:  MASSNSPPATIDASALTEDSAAKALNKRYECLVSVRKKAIKGKGAWYWAHLEPVLVRNPSNSAPKAVKLKCSLCESVFSASNPSRTASEHLKRGTCPNLR

Query:  SVSRSDDAASPLPISSIPSPTLHNHKKRSSQTNVNAPVLTASYQVHSLAMIEPTRSYAPLISSSPPTPVARNP-GISSRIGLNHHQLVLSGGKDDLSALE
        S+SRS+  ASPLPISSIPSPTLHNHKKRSS  ++NAP+LTASYQVHSLAMIEPTRSYAPLI SSPPTPVA+NP G++S++ +N HQLVLSGGKDDL ALE
Subjt:  SVSRSDDAASPLPISSIPSPTLHNHKKRSSQTNVNAPVLTASYQVHSLAMIEPTRSYAPLISSSPPTPVARNP-GISSRIGLNHHQLVLSGGKDDLSALE

Query:  MLENSVKKLKTPHASPGPRLSKEQTDSAIELLTDWFIESCGSVSISCIEHPKFKAFLSQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDASFFQIAS
        MLENSVKKL++PHASPGPRLSKEQ DSAIELLTDWFIESCGSVS+SC+EHPKFKA LSQ+GLPSL RTDILGARLDSKFEEAKADSEARIRDASFFQIAS
Subjt:  MLENSVKKLKTPHASPGPRLSKEQTDSAIELLTDWFIESCGSVSISCIEHPKFKAFLSQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDASFFQIAS

Query:  DGWKNKKNCIGYYCGEESVVKFMVNFPNGTTVFQKALFTGGLVSSKYAEEVISDRVDEICGGGLQRCVGIIADRYKGKALKNLEMKNHWMVNLSCQLQGF
        DGWKN KNC GYYCGEESVVKFMVN PNG+T+FQKALFTGGLVS KYAEEVI D V+EICG  LQRCVGIIADRYKGKAL+NLE+KNHWMVNLSCQLQGF
Subjt:  DGWKNKKNCIGYYCGEESVVKFMVNFPNGTTVFQKALFTGGLVSSKYAEEVISDRVDEICGGGLQRCVGIIADRYKGKALKNLEMKNHWMVNLSCQLQGF

Query:  IGLIKDFNKELPLFRVVTENCFKVANFVNTKSQVRNCLNKYKVQELEGRWLLHVPSPNCDTSKNFSPVYALLDDMLNFAHVLQMVVLDESYKVACMADSL
        I LIKDFNKELPLFRVVTENC KVANFVNTKSQVRNCLNKYKVQELEGRWLLHVPSPNCDTSKNFSPVYA+LDDML+ AHVLQMVVLDESY++ACM D L
Subjt:  IGLIKDFNKELPLFRVVTENCFKVANFVNTKSQVRNCLNKYKVQELEGRWLLHVPSPNCDTSKNFSPVYALLDDMLNFAHVLQMVVLDESYKVACMADSL

Query:  ASEVTSLIQNERFWDEVEGVHSLVKMIRGMAREIEAERPLIGQCLPLWEELRTKVKEWCAKYSIAEGPVEKIIDKRFRKNYHPAWSAAFILDPLYLRRDI
        ASEV+SLIQNERFWDE+EGVHSLVKMIRGMA+EIEAERPLIGQCLPLWEELRTKVKEWCAK+SIAEGPVEKI++KRFRKNYHPAWSAAFILDPLYLRRDI
Subjt:  ASEVTSLIQNERFWDEVEGVHSLVKMIRGMAREIEAERPLIGQCLPLWEELRTKVKEWCAKYSIAEGPVEKIIDKRFRKNYHPAWSAAFILDPLYLRRDI

Query:  NGKYLPPFKCLSQEQVKDVDSLIIRLVSREEAHVALMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRQLVWETCLSEFKILGKVALRLIFLH
        NGKYLPPFKCLSQEQ KDVDSLI RLVSREEAHVA MELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSR+LVWETCLSEFK LGKVALRLIFLH
Subjt:  NGKYLPPFKCLSQEQVKDVDSLIIRLVSREEAHVALMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRQLVWETCLSEFKILGKVALRLIFLH

Query:  STACSYKCKCSIMNLVCSHRHSRVGLERAQKMVFVAAHAKLERKDFSNEEDKDAELFAMADGENDMLNEVFSDAPSI
        ST+C YKCKCSIMNLVCSHRHSR+GLERAQKMVFVAAHAKLERKDFSNE+DKDAELF MADGENDMLNEVFSDAPSI
Subjt:  STACSYKCKCSIMNLVCSHRHSRVGLERAQKMVFVAAHAKLERKDFSNEEDKDAELFAMADGENDMLNEVFSDAPSI

A0A6J1K2P3 uncharacterized protein LOC111491170 isoform X10.0e+0089.39Show/hide
Query:  MASSNSPPATIDASALTEDSAAKALNKRYECLVSVRKKAIKGKGAWYWAHLEPVLVRNPSNSAPKAVKLKCSLCESVFSASNPSRTASEHLKRGTCPNLR
        MAS+NSPP  IDASALT+D   KALNKRYECLV+VR KAIKGKGAWYWAHLEPVL+RNPSNS PKAVKLKCSLC+SVFSASNPSRTASEHLKRGTCPNL 
Subjt:  MASSNSPPATIDASALTEDSAAKALNKRYECLVSVRKKAIKGKGAWYWAHLEPVLVRNPSNSAPKAVKLKCSLCESVFSASNPSRTASEHLKRGTCPNLR

Query:  SVSRSDDAASPLPISSIPSPTLHNHKKRSSQTNVNAPVLTASYQVHSLAMIEPTRSYAPLISSSPPTPVARNP-GISSRIGLNHHQLVLSGGKDDLSALE
        S+SRS+  ASPLPISSIPSPTLHNHKKRSSQ  +NAP+LTASYQVHSLAMIEPTRSYAPLI SSPPTPVA+NP G++S++ +N HQLVLSGGKDDL ALE
Subjt:  SVSRSDDAASPLPISSIPSPTLHNHKKRSSQTNVNAPVLTASYQVHSLAMIEPTRSYAPLISSSPPTPVARNP-GISSRIGLNHHQLVLSGGKDDLSALE

Query:  MLENSVKKLKTPHASPGPRLSKEQTDSAIELLTDWFIESCGSVSISCIEHPKFKAFLSQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDASFFQIAS
        MLENSVKKL++PHASPGPRLSKEQ DSAIELLTDWFIESCGSVS+SC+EHPKFKA LSQ+GLPSL RTDILGARLDSKFEEAK DSEARIRDASFFQIAS
Subjt:  MLENSVKKLKTPHASPGPRLSKEQTDSAIELLTDWFIESCGSVSISCIEHPKFKAFLSQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDASFFQIAS

Query:  DGWKNKKNCIGYYCGEESVVKFMVNFPNGTTVFQKALFTGGLVSSKYAEEVISDRVDEICGGGLQRCVGIIADRYKGKALKNLEMKNHWMVNLSCQLQGF
        DGWKN KNC GYYC EESVVKFMVN PNG+T+FQKALFTGGLVS KYAEEVI D V+EICG  LQRCVGIIADRYKGKAL+NLEMKNHWMVNLSCQLQGF
Subjt:  DGWKNKKNCIGYYCGEESVVKFMVNFPNGTTVFQKALFTGGLVSSKYAEEVISDRVDEICGGGLQRCVGIIADRYKGKALKNLEMKNHWMVNLSCQLQGF

Query:  IGLIKDFNKELPLFRVVTENCFKVANFVNTKSQVRNCLNKYKVQELEGRWLLHVPSPNCDTSKNFSPVYALLDDMLNFAHVLQMVVLDESYKVACMADSL
        I LIKDFNKELPLFRVVTENC KVANFVNTKSQVRNCLNKYKVQELEGRWLLHVPSPNCDTSKNFSPVYA+LDDML+ AHVLQMVVLDESY++ACM D L
Subjt:  IGLIKDFNKELPLFRVVTENCFKVANFVNTKSQVRNCLNKYKVQELEGRWLLHVPSPNCDTSKNFSPVYALLDDMLNFAHVLQMVVLDESYKVACMADSL

Query:  ASEVTSLIQNERFWDEVEGVHSLVKMIRGMAREIEAERPLIGQCLPLWEELRTKVKEWCAKYSIAEGPVEKIIDKRFRKNYHPAWSAAFILDPLYLRRDI
        ASEV+SLIQNERFWDEVEGVHSLVKMIRGMA+EIEAERPLIGQCLPLWEELRTKVKEWCAK+SIAEGPVEKI++KRFRKNYHPAWSAAFILDPLYLRRDI
Subjt:  ASEVTSLIQNERFWDEVEGVHSLVKMIRGMAREIEAERPLIGQCLPLWEELRTKVKEWCAKYSIAEGPVEKIIDKRFRKNYHPAWSAAFILDPLYLRRDI

Query:  NGKYLPPFKCLSQEQVKDVDSLIIRLVSREEAHVALMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRQLVWETCLSEFKILGKVALRLIFLH
        NGKYLPPFKCLSQEQ KDVDSLI RLVSREEAHVA MELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSR+LVWETCLSEFK LGKVALRLIFLH
Subjt:  NGKYLPPFKCLSQEQVKDVDSLIIRLVSREEAHVALMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRQLVWETCLSEFKILGKVALRLIFLH

Query:  STACSYKCKCSIMNLVCSHRHSRVGLERAQKMVFVAAHAKLERKDFSNEEDKDAELFAMADGENDMLNEVFSDAPSITVLDVFDQTEPELSE
        ST+C Y CKCSI+NLVCSHRHSR+GLERAQKMVFVAAH KLERKDFSNE+DKDAELFAMADGENDMLNEVFSDAPSIT LDVFD+ EPELS+
Subjt:  STACSYKCKCSIMNLVCSHRHSRVGLERAQKMVFVAAHAKLERKDFSNEEDKDAELFAMADGENDMLNEVFSDAPSITVLDVFDQTEPELSE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G12380.1 unknown protein1.5e-24353.53Show/hide
Query:  SPPATIDASALTEDSAAKALNKRYECLVSVRKKAIKGKGAWYWAHLEPVLVRNPSNSAPKAVKLKCSLCESVFSASNPSRTASEHLKRGTCPNLRSVSRS
        +PPA       T++  AKALNKRYE L++VR KA+KGKGAWYW HLEP+LVRN     PKAVKL+CSLC++VFSASNPSRTASEHLKRGTCPN  SV+  
Subjt:  SPPATIDASALTEDSAAKALNKRYECLVSVRKKAIKGKGAWYWAHLEPVLVRNPSNSAPKAVKLKCSLCESVFSASNPSRTASEHLKRGTCPNLRSVSRS

Query:  DDAASPLPISSIPSPTLHNHKKRSS-------QTNVNAPVLTASYQVHSLAMIEPTR-SYAPLISSSPPTPVARNPGISSRIGLNHHQLVLSGGKDDLSA
            +P P SS  SP  H+ K+ SS        + +N P +  SY V  + +++P+R     L  S+PP P                 L+LSGGKDDL  
Subjt:  DDAASPLPISSIPSPTLHNHKKRSS-------QTNVNAPVLTASYQVHSLAMIEPTR-SYAPLISSSPPTPVARNPGISSRIGLNHHQLVLSGGKDDLSA

Query:  LEMLENSVKKLKTPHASPGPRLSKEQTDSAIELLTDWFIESCGSVSISCIEHPKFKAFLSQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDASFFQI
        L MLE+SVKKLK+P  S    L++ Q +SA++ L+DW  ESCGSVS+S +EHPKF+AFL+Q+GLP + + D    RLD K EEA+A++E+RIRDA FFQI
Subjt:  LEMLENSVKKLKTPHASPGPRLSKEQTDSAIELLTDWFIESCGSVSISCIEHPKFKAFLSQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDASFFQI

Query:  ASDGWKNKKNCIGYYCGEESVVKFMVNFPNGTTVFQKALFTGGLVSSKYAEEVISDRVDEICGGGLQRCVGIIADRYKGKALKNLEMKNHWMVNLSCQLQ
        +SDGWK  ++        ES+V  +VN PNGT+++++A+   G V S YAEEV+ + V  ICG   QRCVGI++D++K KAL+NLE ++ WMVNLSCQ Q
Subjt:  ASDGWKNKKNCIGYYCGEESVVKFMVNFPNGTTVFQKALFTGGLVSSKYAEEVISDRVDEICGGGLQRCVGIIADRYKGKALKNLEMKNHWMVNLSCQLQ

Query:  GFIGLIKDFNKELPLFRVVTENCFKVANFVNTKSQVRNCLNKYKVQELEGRWLLHVP--------SPNCDTSKN-------FSPVYALLDDMLNFAHVLQ
        G   LIKDF KELPLF+ V++NC ++A F+N  +Q+RN   KY++QE     +L +P          +C +S +       + P++ LL+D+L+ A  +Q
Subjt:  GFIGLIKDFNKELPLFRVVTENCFKVANFVNTKSQVRNCLNKYKVQELEGRWLLHVP--------SPNCDTSKN-------FSPVYALLDDMLNFAHVLQ

Query:  MVVLDESYKVACMADSLASEVTSLIQNERFWDEVEGVHSLVKMIRGMAREIEAERPLIGQCLPLWEELRTKVKEWCAKYSIAEGPVEKIIDKRFRKNYHP
        +VV D++ KV  M D +A EV  ++ +E FW+EVE VH+L+K+++ MAR IE E+ L+GQCLPLW+ELR KVK+W +K+++ EG VEK++++RF+K+YHP
Subjt:  MVVLDESYKVACMADSLASEVTSLIQNERFWDEVEGVHSLVKMIRGMAREIEAERPLIGQCLPLWEELRTKVKEWCAKYSIAEGPVEKIIDKRFRKNYHP

Query:  AWSAAFILDPLYLRRDINGKYLPPFKCLSQEQVKDVDSLIIRLVSREEAHVALMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRQLVWETCL
        AW+AAFILDPLYL RD +GKYLPPFKCLS EQ KDVD LI RLVSR+EAH+ALMELMKWR+EGLDP+YA+AVQ+K+ DP++GKM+IANPQS +LVWET L
Subjt:  AWSAAFILDPLYLRRDINGKYLPPFKCLSQEQVKDVDSLIIRLVSREEAHVALMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRQLVWETCL

Query:  SEFKILGKVALRLIFLHSTACSYKCKCSIMNLVCSHRHSRVGLERAQKMVFVAAHAKLERKDFSNEEDKDAELFAMADGENDMLNEVFSDAPSI
        SEF+ LGKVA+RLIFLH+T   +KC  S++  V S+  S   ++RAQK++F++A++K ER+DFSNEED+DAEL AMA+G++ MLN+V  D  S+
Subjt:  SEFKILGKVALRLIFLHSTACSYKCKCSIMNLVCSHRHSRVGLERAQKMVFVAAHAKLERKDFSNEEDKDAELFAMADGENDMLNEVFSDAPSI

AT1G62870.1 unknown protein2.0e-24054.1Show/hide
Query:  EDSAAKALNKRYECLVSVRKKAIKGKGAWYWAHLEPVLVRNPSNSAPKAVKLKCSLCESVFSASNPSRTASEHLKRGTCPNLRSVSRSDDAASPLPISSI
        E+ A KAL KRYE L+ VR KA+KGKGAWYW+HLEP+L+ N     PKAVKL+CSLC++VFSASNPSRTASEHLKRGTCPN  S+ +     SP P    
Subjt:  EDSAAKALNKRYECLVSVRKKAIKGKGAWYWAHLEPVLVRNPSNSAPKAVKLKCSLCESVFSASNPSRTASEHLKRGTCPNLRSVSRSDDAASPLPISSI

Query:  PSPTLHNHKKRSSQTNVNA---------PVLTASYQVHSLAMIEPTRSYAPLISSSPPTPVARNPGISSRIGLNHHQLVLSGGKDDLSALEMLENSVKKL
        P P   +H+KR+S   V A              SY V  L++++P+R            PV + P            L+LSGGKDDL  L MLE+SVKKL
Subjt:  PSPTLHNHKKRSSQTNVNA---------PVLTASYQVHSLAMIEPTRSYAPLISSSPPTPVARNPGISSRIGLNHHQLVLSGGKDDLSALEMLENSVKKL

Query:  KTPHASPGPRLSKEQTDSAIELLTDWFIESCGSVSISCIEHPKFKAFLSQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDASFFQIASDGWKNKKNC
        K+P  S    L+K Q DSA++ L+DW  ESCGSVS+S +EHPK +AFL+Q+GLP + R D +  RLD K+E+++A++E+RI DA FFQIASDGWK     
Subjt:  KTPHASPGPRLSKEQTDSAIELLTDWFIESCGSVSISCIEHPKFKAFLSQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDASFFQIASDGWKNKKNC

Query:  IGYYCGEESVVKFMVNFPNGTTVFQKALFTGGLVSSKYAEEVISDRVDEICGGGLQRCVGIIADRYKGKALKNLEMKNHWMVNLSCQLQGFIGLIKDFNK
          +    E++V  +VN PNGT+++++A+F  G V S YAEEV+ + V  ICG   QRCVGI++DR+  KAL+NLE ++ WMVNLSCQ QGF  LI+DF K
Subjt:  IGYYCGEESVVKFMVNFPNGTTVFQKALFTGGLVSSKYAEEVISDRVDEICGGGLQRCVGIIADRYKGKALKNLEMKNHWMVNLSCQLQGFIGLIKDFNK

Query:  ELPLFRVVTENCFKVANFVNTKSQVRNCLNKYKVQELEGRWLLHVPSPNCDTSKNFSPVYALLDDMLNFAHVLQMVVLDESYKVACMADSLASEVTSLIQ
        ELPLF+ V+++C ++ NFVN+ +Q+RN + KY++QE     +LH+P      S  F P+Y LL+D+L+FA  +Q+V+ D+  K   M D +A EV  ++ 
Subjt:  ELPLFRVVTENCFKVANFVNTKSQVRNCLNKYKVQELEGRWLLHVPSPNCDTSKNFSPVYALLDDMLNFAHVLQMVVLDESYKVACMADSLASEVTSLIQ

Query:  NERFWDEVEGVHSLVKMIRGMAREIEAERPLIGQCLPLWEELRTKVKEWCAKYSIA-EGPVEKIIDKRFRKNYHPAWSAAFILDPLYLRRDINGKYLPPF
        +  FW+EVE V+ L+K+++ MAR IE ERPL+GQCLPLW+ELR+K+K+W AK+++  E  VEKI+++RF+K+YHPAW+AAFILDPLYL +D +GKYLPPF
Subjt:  NERFWDEVEGVHSLVKMIRGMAREIEAERPLIGQCLPLWEELRTKVKEWCAKYSIA-EGPVEKIIDKRFRKNYHPAWSAAFILDPLYLRRDINGKYLPPF

Query:  KCLSQEQVKDVDSLIIRLVSREEAHVALMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRQLVWETCLSEFKILGKVALRLIFLHSTACSYKC
        KCLS EQ KDVD LI RLVSR+EAH+A+MELMKWR+EGLDP+YA+AVQ+K+ DP++GKM+IANPQS +LVWET LSEF+ LG+VA+RLIFLH+T+C +KC
Subjt:  KCLSQEQVKDVDSLIIRLVSREEAHVALMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRQLVWETCLSEFKILGKVALRLIFLHSTACSYKC

Query:  KCSIMNLVCSHRHSRVGLERAQKMVFVAAHAKLERKDFSNEEDKDAELFAMADGENDMLNEVFSDAPSI
          S++  V S+  SR  ++RAQK++F++A++K ER+DFSNEE++DAEL AMA+GE+D+LN+V  D  S+
Subjt:  KCSIMNLVCSHRHSRVGLERAQKMVFVAAHAKLERKDFSNEEDKDAELFAMADGENDMLNEVFSDAPSI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCCTCCAACTCACCGCCGGCCACCATTGACGCATCGGCGTTGACGGAGGATTCAGCGGCGAAGGCTTTGAATAAACGCTATGAATGCCTTGTATCTGTTCGAAA
AAAGGCGATTAAGGGGAAAGGCGCTTGGTATTGGGCTCATTTGGAGCCTGTTCTTGTGCGAAATCCTAGTAACAGTGCTCCCAAAGCGGTGAAGCTCAAGTGTTCGTTGT
GCGAGTCGGTTTTCTCCGCTTCGAATCCTTCCAGGACTGCATCTGAGCATTTGAAACGAGGCACTTGCCCTAATTTGCGCTCAGTCTCTCGGTCCGATGACGCGGCGTCG
CCGTTGCCGATTTCGTCGATTCCTTCTCCGACATTGCATAACCACAAGAAGCGAAGCTCTCAAACGAATGTGAATGCTCCGGTTCTTACTGCTTCTTATCAAGTTCATTC
TCTTGCCATGATTGAGCCCACCCGGTCCTATGCTCCGCTGATTTCCTCTTCGCCGCCCACGCCGGTTGCTCGAAATCCGGGGATTTCGAGTAGGATAGGGCTGAATCATC
ATCAGTTGGTGTTATCAGGTGGGAAAGATGATTTGAGCGCATTGGAAATGCTGGAGAACAGTGTCAAGAAGCTGAAAACTCCTCATGCCTCACCTGGACCAAGGTTAAGT
AAGGAACAGACTGATTCTGCTATAGAATTACTGACTGATTGGTTCATTGAGTCGTGTGGGTCAGTGTCGATTTCATGCATTGAGCATCCGAAGTTTAAAGCCTTTCTTAG
TCAGTTGGGATTGCCTTCATTGCCTCGAACAGACATTTTAGGAGCTCGGCTCGATTCCAAGTTCGAGGAGGCCAAAGCTGATTCAGAAGCAAGGATTAGAGATGCTTCGT
TTTTTCAAATTGCTTCAGATGGGTGGAAGAATAAGAAGAATTGCATTGGATATTACTGTGGAGAAGAGAGTGTAGTTAAATTTATGGTTAACTTTCCAAATGGTACTACT
GTGTTTCAAAAAGCACTGTTTACTGGTGGATTGGTGTCATCCAAGTATGCCGAAGAGGTTATTTCGGACAGGGTCGACGAGATTTGTGGGGGTGGTTTGCAGAGATGTGT
AGGGATAATTGCAGATAGGTATAAGGGCAAGGCGTTGAAGAATTTGGAGATGAAGAATCATTGGATGGTAAATCTCTCTTGCCAGCTTCAGGGTTTTATTGGTTTGATAA
AGGATTTTAACAAAGAGCTTCCACTTTTCAGGGTAGTCACTGAAAATTGCTTCAAGGTTGCAAACTTTGTAAACACCAAATCTCAAGTTAGGAACTGTTTAAACAAGTAC
AAGGTGCAGGAGCTAGAGGGTCGTTGGTTGCTTCATGTTCCTTCTCCAAATTGTGACACTTCCAAAAATTTCTCACCTGTTTATGCTCTACTTGATGATATGCTTAACTT
TGCTCATGTTCTTCAAATGGTTGTGTTAGACGAGTCTTATAAAGTTGCATGTATGGCTGATTCACTTGCTTCTGAGGTTACTAGTCTGATACAAAATGAAAGGTTTTGGG
ATGAAGTGGAGGGAGTTCATTCACTTGTGAAGATGATTCGAGGAATGGCTCGAGAGATTGAAGCTGAAAGGCCACTGATTGGACAATGCTTGCCTCTTTGGGAGGAGTTG
AGAACAAAAGTGAAGGAATGGTGTGCTAAGTACAGCATAGCTGAAGGGCCAGTGGAGAAAATTATCGATAAGCGGTTTCGGAAAAATTATCATCCAGCATGGTCAGCTGC
ATTTATACTGGATCCGCTTTACTTGAGGAGGGATATAAATGGGAAATATCTTCCACCCTTCAAGTGCCTCTCGCAAGAGCAGGTAAAGGATGTCGATTCACTTATTATCC
GGTTGGTGTCAAGGGAAGAAGCCCATGTCGCACTCATGGAGCTCATGAAATGGAGATCCGAAGGGCTGGATCCACTTTATGCTCAGGCAGTTCAGGTTAAACAACTTGAC
CCTTTAACTGGAAAGATGAAAATTGCCAACCCACAGAGTAGGCAACTTGTTTGGGAAACTTGCCTAAGTGAGTTCAAGATCCTTGGTAAAGTTGCACTGAGGCTTATTTT
CCTTCATTCAACAGCATGTAGCTACAAGTGTAAGTGTTCTATCATGAATTTGGTTTGCTCGCATCGGCACTCAAGGGTTGGCTTGGAGAGAGCACAGAAGATGGTATTTG
TTGCAGCGCATGCCAAGCTCGAAAGGAAAGACTTTTCTAATGAGGAAGATAAAGATGCAGAACTATTTGCAATGGCGGATGGTGAAAATGACATGCTCAATGAGGTCTTT
TCTGATGCACCCTCAATTACAGTGCTGGATGTGTTTGATCAAACGGAACCAGAGTTGTCAGAAGTCAACCTCGGACTAGATACCATATCTTTGCAGCTGAAGTTGCACTT
CTCTGATAGAATCCCAGATGTATCTCCTGTTGTGCCTACTTATGTTGCCTTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGTCCTCCAACTCACCGCCGGCCACCATTGACGCATCGGCGTTGACGGAGGATTCAGCGGCGAAGGCTTTGAATAAACGCTATGAATGCCTTGTATCTGTTCGAAA
AAAGGCGATTAAGGGGAAAGGCGCTTGGTATTGGGCTCATTTGGAGCCTGTTCTTGTGCGAAATCCTAGTAACAGTGCTCCCAAAGCGGTGAAGCTCAAGTGTTCGTTGT
GCGAGTCGGTTTTCTCCGCTTCGAATCCTTCCAGGACTGCATCTGAGCATTTGAAACGAGGCACTTGCCCTAATTTGCGCTCAGTCTCTCGGTCCGATGACGCGGCGTCG
CCGTTGCCGATTTCGTCGATTCCTTCTCCGACATTGCATAACCACAAGAAGCGAAGCTCTCAAACGAATGTGAATGCTCCGGTTCTTACTGCTTCTTATCAAGTTCATTC
TCTTGCCATGATTGAGCCCACCCGGTCCTATGCTCCGCTGATTTCCTCTTCGCCGCCCACGCCGGTTGCTCGAAATCCGGGGATTTCGAGTAGGATAGGGCTGAATCATC
ATCAGTTGGTGTTATCAGGTGGGAAAGATGATTTGAGCGCATTGGAAATGCTGGAGAACAGTGTCAAGAAGCTGAAAACTCCTCATGCCTCACCTGGACCAAGGTTAAGT
AAGGAACAGACTGATTCTGCTATAGAATTACTGACTGATTGGTTCATTGAGTCGTGTGGGTCAGTGTCGATTTCATGCATTGAGCATCCGAAGTTTAAAGCCTTTCTTAG
TCAGTTGGGATTGCCTTCATTGCCTCGAACAGACATTTTAGGAGCTCGGCTCGATTCCAAGTTCGAGGAGGCCAAAGCTGATTCAGAAGCAAGGATTAGAGATGCTTCGT
TTTTTCAAATTGCTTCAGATGGGTGGAAGAATAAGAAGAATTGCATTGGATATTACTGTGGAGAAGAGAGTGTAGTTAAATTTATGGTTAACTTTCCAAATGGTACTACT
GTGTTTCAAAAAGCACTGTTTACTGGTGGATTGGTGTCATCCAAGTATGCCGAAGAGGTTATTTCGGACAGGGTCGACGAGATTTGTGGGGGTGGTTTGCAGAGATGTGT
AGGGATAATTGCAGATAGGTATAAGGGCAAGGCGTTGAAGAATTTGGAGATGAAGAATCATTGGATGGTAAATCTCTCTTGCCAGCTTCAGGGTTTTATTGGTTTGATAA
AGGATTTTAACAAAGAGCTTCCACTTTTCAGGGTAGTCACTGAAAATTGCTTCAAGGTTGCAAACTTTGTAAACACCAAATCTCAAGTTAGGAACTGTTTAAACAAGTAC
AAGGTGCAGGAGCTAGAGGGTCGTTGGTTGCTTCATGTTCCTTCTCCAAATTGTGACACTTCCAAAAATTTCTCACCTGTTTATGCTCTACTTGATGATATGCTTAACTT
TGCTCATGTTCTTCAAATGGTTGTGTTAGACGAGTCTTATAAAGTTGCATGTATGGCTGATTCACTTGCTTCTGAGGTTACTAGTCTGATACAAAATGAAAGGTTTTGGG
ATGAAGTGGAGGGAGTTCATTCACTTGTGAAGATGATTCGAGGAATGGCTCGAGAGATTGAAGCTGAAAGGCCACTGATTGGACAATGCTTGCCTCTTTGGGAGGAGTTG
AGAACAAAAGTGAAGGAATGGTGTGCTAAGTACAGCATAGCTGAAGGGCCAGTGGAGAAAATTATCGATAAGCGGTTTCGGAAAAATTATCATCCAGCATGGTCAGCTGC
ATTTATACTGGATCCGCTTTACTTGAGGAGGGATATAAATGGGAAATATCTTCCACCCTTCAAGTGCCTCTCGCAAGAGCAGGTAAAGGATGTCGATTCACTTATTATCC
GGTTGGTGTCAAGGGAAGAAGCCCATGTCGCACTCATGGAGCTCATGAAATGGAGATCCGAAGGGCTGGATCCACTTTATGCTCAGGCAGTTCAGGTTAAACAACTTGAC
CCTTTAACTGGAAAGATGAAAATTGCCAACCCACAGAGTAGGCAACTTGTTTGGGAAACTTGCCTAAGTGAGTTCAAGATCCTTGGTAAAGTTGCACTGAGGCTTATTTT
CCTTCATTCAACAGCATGTAGCTACAAGTGTAAGTGTTCTATCATGAATTTGGTTTGCTCGCATCGGCACTCAAGGGTTGGCTTGGAGAGAGCACAGAAGATGGTATTTG
TTGCAGCGCATGCCAAGCTCGAAAGGAAAGACTTTTCTAATGAGGAAGATAAAGATGCAGAACTATTTGCAATGGCGGATGGTGAAAATGACATGCTCAATGAGGTCTTT
TCTGATGCACCCTCAATTACAGTGCTGGATGTGTTTGATCAAACGGAACCAGAGTTGTCAGAAGTCAACCTCGGACTAGATACCATATCTTTGCAGCTGAAGTTGCACTT
CTCTGATAGAATCCCAGATGTATCTCCTGTTGTGCCTACTTATGTTGCCTTTTGA
Protein sequenceShow/hide protein sequence
MASSNSPPATIDASALTEDSAAKALNKRYECLVSVRKKAIKGKGAWYWAHLEPVLVRNPSNSAPKAVKLKCSLCESVFSASNPSRTASEHLKRGTCPNLRSVSRSDDAAS
PLPISSIPSPTLHNHKKRSSQTNVNAPVLTASYQVHSLAMIEPTRSYAPLISSSPPTPVARNPGISSRIGLNHHQLVLSGGKDDLSALEMLENSVKKLKTPHASPGPRLS
KEQTDSAIELLTDWFIESCGSVSISCIEHPKFKAFLSQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDASFFQIASDGWKNKKNCIGYYCGEESVVKFMVNFPNGTT
VFQKALFTGGLVSSKYAEEVISDRVDEICGGGLQRCVGIIADRYKGKALKNLEMKNHWMVNLSCQLQGFIGLIKDFNKELPLFRVVTENCFKVANFVNTKSQVRNCLNKY
KVQELEGRWLLHVPSPNCDTSKNFSPVYALLDDMLNFAHVLQMVVLDESYKVACMADSLASEVTSLIQNERFWDEVEGVHSLVKMIRGMAREIEAERPLIGQCLPLWEEL
RTKVKEWCAKYSIAEGPVEKIIDKRFRKNYHPAWSAAFILDPLYLRRDINGKYLPPFKCLSQEQVKDVDSLIIRLVSREEAHVALMELMKWRSEGLDPLYAQAVQVKQLD
PLTGKMKIANPQSRQLVWETCLSEFKILGKVALRLIFLHSTACSYKCKCSIMNLVCSHRHSRVGLERAQKMVFVAAHAKLERKDFSNEEDKDAELFAMADGENDMLNEVF
SDAPSITVLDVFDQTEPELSEVNLGLDTISLQLKLHFSDRIPDVSPVVPTYVAF