CuGenDBv2

Lag0012109 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0012109
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Unknown protein
Genome location: chr1:37417109..37420160
RNA-Seq Expression: Lag0012109
Synteny: Lag0012109
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
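
The genome location above uses a `chromosome:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state this), the gene span could be derived as follows; `parse_location` is a hypothetical helper, not part of any CuGenDB tooling.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into its three parts."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("chr1:37417109..37420160")
# Assumption: 1-based inclusive coordinates, so both end points count.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the locus covers 3,052 bp on chr1; with 0-based half-open coordinates the span would be one base shorter.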


Homology
GenBank top hits | e value | % identity
XP_004144685.2 uncharacterized protein LOC101208481 [Cucumis sativus] | 0.0e+00 | 84.71
Query:  RSVRERGEGYGIGCG----NGSCPSLFSLLVSASPVLICTAVLLGTLLSFGQPNIPEIETEEKVSREVASLRSGILDNATVVAKEEDSFTVERFDGNKVE
        RSVR     +G+ C       SCP LFSLLVSASPVLICTAVLLGTLLS+GQPNIPEIETEEKVSR+VASLRSGILDNATVVAKE+DSFTVERF+GN+VE
Subjt:  RSVRERGEGYGIGCG----NGSCPSLFSLLVSASPVLICTAVLLGTLLSFGQPNIPEIETEEKVSREVASLRSGILDNATVVAKEEDSFTVERFDGNKVE

Query:  NSYVERGSEDDRKTSGLDEHAGFVDFVPVIHERDREIQFEKGGVGKVEEFEKGEVEQAAAERELHSSELEERTEIYERDLDVKSLAMDGENAVENQLLAA
        NSYV RG E++RKT  LDEHAGFVDFV VIHER+REIQFEKGG+ + EEFEKGEVE+AA E+E H+SELEER EIY++DLD+++LA D ENAVENQLLAA
Subjt:  NSYVERGSEDDRKTSGLDEHAGFVDFVPVIHERDREIQFEKGGVGKVEEFEKGEVEQAAAERELHSSELEERTEIYERDLDVKSLAMDGENAVENQLLAA

Query:  QSTINEILEVEDRNILIEPVHKGDHLNLSVKVKDDRDENDYDSSGSESDRAESSSPDASMADIIPLLDELHPLLDSETPLPIHRSNEESDASSEQSHKSD
        QS  NEILEVEDRNI IEPVHKGDHL+LS+  KDD DEN YDSSGSESDRAESSSPDASMADIIPLLDELHPLLDSETPLP HRSNEESDASSEQSHKSD
Subjt:  QSTINEILEVEDRNILIEPVHKGDHLNLSVKVKDDRDENDYDSSGSESDRAESSSPDASMADIIPLLDELHPLLDSETPLPIHRSNEESDASSEQSHKSD

Query:  AECVMSDDEAENQGEEGGAVEHDEDEDDDDDEGMQEEKEDESKSAIKWTEDDQKNLMDLGSLELERNQRLENLIARRRARNNLRMLAGKNLIDFDGFDLP
         ECVMSDDEAENQGEEGG VEHDEDEDDDDDEGMQEEKEDESKSAIKWTEDDQKNLMDLGSLELERNQRLENLIARRRARNNLRMLAGKNLID DGF+LP
Subjt:  AECVMSDDEAENQGEEGGAVEHDEDEDDDDDEGMQEEKEDESKSAIKWTEDDQKNLMDLGSLELERNQRLENLIARRRARNNLRMLAGKNLIDFDGFDLP

Query:  VNVPPIFTAKRNPFDLPYDSYSNMGLPPIPGSAPSILLPRRNPFDLPYDPNEEKPDLKSDEFEQEFLPPLHKDMFRRHESFSVGPSNFAVPKPEQQNIRW
         NVPPI TA+RNPFDLPYDSYSNMGLPPIPGSAPSILLPRRNPFDLPYD NEEKPDLKSD+FEQEFL P  KDMFRRHESFSVGPSNFAVPK EQQNIRW
Subjt:  VNVPPIFTAKRNPFDLPYDSYSNMGLPPIPGSAPSILLPRRNPFDLPYDPNEEKPDLKSDEFEQEFLPPLHKDMFRRHESFSVGPSNFAVPKPEQQNIRW

Query:  KPYFMPEKVAAEETSYSPLERQFSEVSESKLSSVSDTESMSSIADQDDKKPDESQPFLETTAVSYLDATASSIEHGNGSWEDIGSEDYVQEDRDVHHEVI
        KPYFMPEK+AAE TSYSPLERQFSEVSESK+SSVSDTESMSSIADQDDKKPDESQ FLETTAVSYL  TAS IEHGNG WEDIGSEDYVQE+RDVHHEVI
Subjt:  KPYFMPEKVAAEETSYSPLERQFSEVSESKLSSVSDTESMSSIADQDDKKPDESQPFLETTAVSYLDATASSIEHGNGSWEDIGSEDYVQEDRDVHHEVI

Query:  EITLGASENHFESQSGSSEIGATETPVETNASEIHSKNILVETDFSSNSSLSSLSEVD-ETPFEVKTDEVRPSSPRVEESHIDTTSRSMPAALE-ADFKI
        EITLG++E+HFESQSGSS I   +TP+E NASEIHSKN+LVETDFSSNSSLSSLSE + ET FEVKTDEV+PSS   EES IDTT+ S+PA  E  DFK 
Subjt:  EITLGASENHFESQSGSSEIGATETPVETNASEIHSKNILVETDFSSNSSLSSLSEVD-ETPFEVKTDEVRPSSPRVEESHIDTTSRSMPAALE-ADFKI

Query:  SSEVLDDNHPKEPVYDSSPSAEGKFS
        +SEVLDDN  +EPVYDSSPSAEGK S
Subjt:  SSEVLDDNHPKEPVYDSSPSAEGKFS

XP_008442050.1 PREDICTED: uncharacterized protein LOC103486029 [Cucumis melo] | 0.0e+00 | 84.15
Query:  RSVRERGEGYGIGCG----NGSCPSLFSLLVSASPVLICTAVLLGTLLSFGQPNIPEIETEEKVSREVASLRSGILDNATVVAKEEDSFTVERFDGNKVE
        RSVR     +G+ C       SCP LFSLLVSASPVLICTAVLLGTLLS+GQPNIPEIET EKVSR+VASLRSGILDNATVVAKE+DSFTVERF+GN+VE
Subjt:  RSVRERGEGYGIGCG----NGSCPSLFSLLVSASPVLICTAVLLGTLLSFGQPNIPEIETEEKVSREVASLRSGILDNATVVAKEEDSFTVERFDGNKVE

Query:  NSYVERGSEDDRKTSGLDEHAGFVDFVPVIHERDREIQFEKGGV------------GKVEEFEKGEVEQAAAERELHSSELEERTEIYERDLDVKSLAMD
        NSYVERGSE++RKTS  DEHAGFVDFVPVIHER REIQFEKG V            G VEEFEKGEVE+AAAE+ELH+SELEER EIYERDLDV+SLA D
Subjt:  NSYVERGSEDDRKTSGLDEHAGFVDFVPVIHERDREIQFEKGGV------------GKVEEFEKGEVEQAAAERELHSSELEERTEIYERDLDVKSLAMD

Query:  GENAVENQLLAAQSTINEILEVEDRNILIEPVHKGDHLNLSVKVKDDRDENDYDSSGSESDRAESSSPDASMADIIPLLDELHPLLDSETPLPIHRSNEE
         ENA+ENQLLAAQS  NEILEV DRNI IEPVHKGDHL+LS+  KDD DEN YDSSGSESDRAESSSPDASMADIIPLLDELHPLLDSETPLP HRSNEE
Subjt:  GENAVENQLLAAQSTINEILEVEDRNILIEPVHKGDHLNLSVKVKDDRDENDYDSSGSESDRAESSSPDASMADIIPLLDELHPLLDSETPLPIHRSNEE

Query:  SDASSEQSHKSDAECVMSDDEAENQGEEGGAVEHDEDEDDDDDEGMQEEKEDESKSAIKWTEDDQKNLMDLGSLELERNQRLENLIARRRARNNLRMLAG
        SDASSEQSHKSD ECVMSDDEAENQGEEGG VEHDEDED+DDDEGMQEEKEDESKSAIKWTEDDQKNLMDLGSLELERNQRLENLIARRRARNNLRMLAG
Subjt:  SDASSEQSHKSDAECVMSDDEAENQGEEGGAVEHDEDEDDDDDEGMQEEKEDESKSAIKWTEDDQKNLMDLGSLELERNQRLENLIARRRARNNLRMLAG

Query:  KNLIDFDGFDLPVNVPPIFTAKRNPFDLPYDSYSNMGLPPIPGSAPSILLPRRNPFDLPYDPNEEKPDLKSDEFEQEFLPPLHKDMFRRHESFSVGPSNF
        KNLID DGF+LP NVPPI TA+RNPFDLPYDSYSNMGLPPIPGSAPSILLPRRNPFDLPYDPNEEKPDLKSD+FEQEFL P  KDMFRRHESFSVGPSNF
Subjt:  KNLIDFDGFDLPVNVPPIFTAKRNPFDLPYDSYSNMGLPPIPGSAPSILLPRRNPFDLPYDPNEEKPDLKSDEFEQEFLPPLHKDMFRRHESFSVGPSNF

Query:  AVPKPEQQNIRWKPYFMPEKVAAEETSYSPLERQFSEVSESKLSSVSDTESMSSIADQDDKKPDESQPFLETTAVSYLDATASSIEHGNGSWEDIGSEDY
        AVPK EQQNIRWKPYFMPEK+AAE TSYSPLERQFSEVSESK+SSVSDTESMSSIADQDDKKPDESQ FLETTAVSYLD TA  IEHGNG WEDIGSEDY
Subjt:  AVPKPEQQNIRWKPYFMPEKVAAEETSYSPLERQFSEVSESKLSSVSDTESMSSIADQDDKKPDESQPFLETTAVSYLDATASSIEHGNGSWEDIGSEDY

Query:  VQEDRDVHHEVIEITLGASENHFESQSGSSEIGATETPVETNASEIHSKNILVETDFSSNSSLSSLSEVD-ETPFEVKTDEVRPSSPRVEESHIDTTSRS
        VQE+RDVHHEVIEITLG++E+HFES SGSS I   +TP+E NASEIHSK++LVETDFSSNSSLSSLSE + ET FEVKTDEV+PSS   EES IDTT+ S
Subjt:  VQEDRDVHHEVIEITLGASENHFESQSGSSEIGATETPVETNASEIHSKNILVETDFSSNSSLSSLSEVD-ETPFEVKTDEVRPSSPRVEESHIDTTSRS

Query:  MPAALE-ADFKISSEVLDDNHPKEPVYDSSPSAEGKFS
        +PA  E  DFK++SEVLDDN  +EPVYDSSPSAEGK S
Subjt:  MPAALE-ADFKISSEVLDDNHPKEPVYDSSPSAEGKFS

XP_022154469.1 uncharacterized protein LOC111021742 [Momordica charantia] | 0.0e+00 | 82.44
Query:  RSVRERGEGYGIGCG----NGSCPSLFSLLVSASPVLICTAVLLGTLLSFGQPNIPEIETEEKVSREVASLRSGILDNATVVAKEEDSFTVERFDGNKVE
        RSVR     +G+ C       SCP LFSLLVSASPVLICTAVLLGTLLSFGQPNIPEIE EEKVS++VASLRSGILDNATVVAKE+DSFTVE+F+GNKV 
Subjt:  RSVRERGEGYGIGCG----NGSCPSLFSLLVSASPVLICTAVLLGTLLSFGQPNIPEIETEEKVSREVASLRSGILDNATVVAKEEDSFTVERFDGNKVE

Query:  NSYVERGSEDDRKTSGLDEHAGFVDFVPVIHERDREIQFEKGGVGKVEEF-EKGEVEQAAAERELHSSELEERTEIYERDLDVKSLAMDGENAVENQLLA
        +SYVE GSE+D+KTS LDEH GFVDFVP+IHERDREI FEKG  G VEEF EKG+VE+ A    LHSSELEE+ EIYERDLDVKS+A D ENAVENQLLA
Subjt:  NSYVERGSEDDRKTSGLDEHAGFVDFVPVIHERDREIQFEKGGVGKVEEF-EKGEVEQAAAERELHSSELEERTEIYERDLDVKSLAMDGENAVENQLLA

Query:  AQSTINEILEVEDRNILIEPVHKGD-HLNLSVKVKDDRDENDYDSSGSESDRAESSSPDASMADIIPLLDELHPLLDSETPLPIHRSNEESDASSEQSHK
        AQS  NEILEVEDRNI IEPVHKGD HLNL+    +D DE+DYDSSGSESDRAESSSPDASMADIIPLLDELHPLLDSETPLP++RSNE SDASSEQSHK
Subjt:  AQSTINEILEVEDRNILIEPVHKGD-HLNLSVKVKDDRDENDYDSSGSESDRAESSSPDASMADIIPLLDELHPLLDSETPLPIHRSNEESDASSEQSHK

Query:  SDAECVMSDDEAENQGEEGGAVEHDEDEDDDDDEGMQEEKEDESKSAIKWTEDDQKNLMDLGSLELERNQRLENLIARRRARNNLRMLAGKNLIDFDGFD
        SD ECVMSDD+AENQGEE GAVE+D+++DD+DDEGMQEEKEDESKSAIKWTEDDQKNLMDLGSLELERNQRLENLIARRRARNN+RMLAGKNLID DGFD
Subjt:  SDAECVMSDDEAENQGEEGGAVEHDEDEDDDDDEGMQEEKEDESKSAIKWTEDDQKNLMDLGSLELERNQRLENLIARRRARNNLRMLAGKNLIDFDGFD

Query:  LPVNVPPIFTAKRNPFDLPYDSYSNMGLPPIPGSAPSILLPRRNPFDLPYDPNEEKPDLKSDEFEQEFLPPLHKDM-FRRHESFSVGPSNFAVPKPEQQN
        LP NVPPI T++RNPFDLPYDSY+NMGLPPIPGSAPSILLPRRNPFDLPYDPNEEKPDLKSD+FEQEFLP   KD+ FRRHESFSVGPS+FAVPK EQQN
Subjt:  LPVNVPPIFTAKRNPFDLPYDSYSNMGLPPIPGSAPSILLPRRNPFDLPYDPNEEKPDLKSDEFEQEFLPPLHKDM-FRRHESFSVGPSNFAVPKPEQQN

Query:  IRWKPYFMPEKVAAEETSYSPLERQFSEVSESKLSSVSDTESMSSIADQDDKKPDESQPFLETTAVSYLDATASSIEHGNGSWEDIGSEDYVQEDRDVHH
        IRW+PYFMPEK+A+E TSYSPL+ QFSEVSESKLSSVSDTESMSSIADQD+KKPDES+ FLETTAVSYLD TASSIEHGNGSWED+GSEDYVQE RDVHH
Subjt:  IRWKPYFMPEKVAAEETSYSPLERQFSEVSESKLSSVSDTESMSSIADQDDKKPDESQPFLETTAVSYLDATASSIEHGNGSWEDIGSEDYVQEDRDVHH

Query:  EVIEITLGASENHFESQSGSSEIGATETPVETNASEIHSKNILVETDFSSNSSLSSLSEVDETPFEVKTDEVRPSSPRVEESHIDTTSRSMPAALE--AD
        EVIEITLG++E+HFE QSGSSEIGA E PVE NA+EIH K+++VETDFSS+SSLSSLSEV+ETPFEVKTDEV+PSSPR+ ES +     SM AALE  AD
Subjt:  EVIEITLGASENHFESQSGSSEIGATETPVETNASEIHSKNILVETDFSSNSSLSSLSEVDETPFEVKTDEVRPSSPRVEESHIDTTSRSMPAALE--AD

Query:  FKISSEVLDDNHPKEPVYDSSPSAEGKFS
        FKI+SEVLDDN  KEPVYDSSPSAEGK S
Subjt:  FKISSEVLDDNHPKEPVYDSSPSAEGKFS

XP_038883254.1 uncharacterized protein LOC120074258 isoform X1 [Benincasa hispida] | 0.0e+00 | 83.42
Query:  RSVRERGEGYGIGCG----NGSCPSLFSLLVSASPVLICTAVLLGTLLSFGQPNIPEIETEEKVSREVASLRSGILDNATVVAKEEDSFTVERFDGNKVE
        RSVR     +G+ C       SCP LFSLLVSASPVLICTAVLLGTLLS+GQPNIPEIETEEK+SR+VASLRSGILDNATVVAK++DSFTVERF+GN+VE
Subjt:  RSVRERGEGYGIGCG----NGSCPSLFSLLVSASPVLICTAVLLGTLLSFGQPNIPEIETEEKVSREVASLRSGILDNATVVAKEEDSFTVERFDGNKVE

Query:  NSYVERGSEDDRKTSGLDEHAGFVDFVPVIHERDREIQF-------EKGGV---------------GKVEEFEKGEVEQAAAERELHSSELEERTEIYER
        NSYVERGSE++RKTS LDEHAGFVDFVPVIHER+REIQF       EKGGV               G VEEFEKGE+E+AAAE+E HSSEL+ER EIYER
Subjt:  NSYVERGSEDDRKTSGLDEHAGFVDFVPVIHERDREIQF-------EKGGV---------------GKVEEFEKGEVEQAAAERELHSSELEERTEIYER

Query:  DLDVKSLAMDGENAVENQLLAAQSTINEILEVEDRNILIEPVHKGDHLNLSVKVKDDRDENDYDSSGSESDRAESSSPDASMADIIPLLDELHPLLDSET
        DLDV+SLA D ENA+ENQLLAAQS  NEILEVED NI IEPVHKGDHLNLS+  KDD DENDYDSSGSESDRAESSSPDASMADIIPLLDELHPLLDSET
Subjt:  DLDVKSLAMDGENAVENQLLAAQSTINEILEVEDRNILIEPVHKGDHLNLSVKVKDDRDENDYDSSGSESDRAESSSPDASMADIIPLLDELHPLLDSET

Query:  PLPIHRSNEESDASSEQSHKSDAECVMSDDEAENQGEEGGAVEHDEDEDDDDDEGMQEEKEDESKSAIKWTEDDQKNLMDLGSLELERNQRLENLIARRR
        PLP HRSNEESDASSEQSHKSD ECVMS+DEAENQGEEGG VEHDED+DDDDDEGMQEEKEDESKSAIKWTEDDQKNLMDLGSLELERNQRLENLIARRR
Subjt:  PLPIHRSNEESDASSEQSHKSDAECVMSDDEAENQGEEGGAVEHDEDEDDDDDEGMQEEKEDESKSAIKWTEDDQKNLMDLGSLELERNQRLENLIARRR

Query:  ARNNLRMLAGKNLIDFDGFDLPVNVPPIFTAKRNPFDLPYDSYSNMGLPPIPGSAPSILLPRRNPFDLPYDPNEEKPDLKSDEFEQEFLPPLHKDMFRRH
        ARNNLRMLAGKNLID DGFDLPVNVPPI TA+RNPFDLPYDSYSNMGLPPIPGSAPSILLPRRNPFDLPYDPNEEKPDLK+D+FEQEFLPP  KDMFRRH
Subjt:  ARNNLRMLAGKNLIDFDGFDLPVNVPPIFTAKRNPFDLPYDSYSNMGLPPIPGSAPSILLPRRNPFDLPYDPNEEKPDLKSDEFEQEFLPPLHKDMFRRH

Query:  ESFSVGPSNFAVPKPEQQNIRWKPYFMPEKVAAEETSYSPLERQFSEVSESKLSSVSDTESMSSIADQDDKKPDESQPFLETTAVSYLDATASSIEHGNG
        ESFSVGPSNF VPK EQQNIRWKPYFMPEK+AAE TSYSPLERQFSEVS+SK+SSVSDTESMSSIADQDDKKPDESQ FLETTA+SYLD TAS IEHGNG
Subjt:  ESFSVGPSNFAVPKPEQQNIRWKPYFMPEKVAAEETSYSPLERQFSEVSESKLSSVSDTESMSSIADQDDKKPDESQPFLETTAVSYLDATASSIEHGNG

Query:  SWEDIGSEDYVQEDRDVHHEVIEITLGASENHFESQSGSSEIGATETPVETNASEIHSKNILVETDFSSNSSLSSLSEVDETPFEVKTDEVRPSSPRVEE
         WEDIGSEDYVQE+RDVHHEVIEITLG++E+HFESQSGSS+I   ++P+E NA+EIHSKN+LVETDFSSNSSLSSLSEV+ETPFEVKTDE++PSS + +E
Subjt:  SWEDIGSEDYVQEDRDVHHEVIEITLGASENHFESQSGSSEIGATETPVETNASEIHSKNILVETDFSSNSSLSSLSEVDETPFEVKTDEVRPSSPRVEE

Query:  SHIDTTSRSMPAALE--ADFKISSEVLDDNHPKEPVYDSSPSAEGKFS
        S ID+TS S+ AALE  ADFKI SEVLDDN  +EPVYDSSPSAEGK S
Subjt:  SHIDTTSRSMPAALE--ADFKISSEVLDDNHPKEPVYDSSPSAEGKFS

XP_038883255.1 uncharacterized protein LOC120074258 isoform X2 [Benincasa hispida] | 0.0e+00 | 83.42
Query:  RSVRERGEGYGIGCG----NGSCPSLFSLLVSASPVLICTAVLLGTLLSFGQPNIPEIETEEKVSREVASLRSGILDNATVVAKEEDSFTVERFDGNKVE
        RSVR     +G+ C       SCP LFSLLVSASPVLICTAVLLGTLLS+GQPNIPEIETEEK+SR+VASLRSGILDNATVVAK++DSFTVERF+GN+VE
Subjt:  RSVRERGEGYGIGCG----NGSCPSLFSLLVSASPVLICTAVLLGTLLSFGQPNIPEIETEEKVSREVASLRSGILDNATVVAKEEDSFTVERFDGNKVE

Query:  NSYVERGSEDDRKTSGLDEHAGFVDFVPVIHERDREIQF-------EKGGV---------------GKVEEFEKGEVEQAAAERELHSSELEERTEIYER
        NSYVERGSE++RKTS LDEHAGFVDFVPVIHER+REIQF       EKGGV               G VEEFEKGE+E+AAAE+E HSSEL+ER EIYER
Subjt:  NSYVERGSEDDRKTSGLDEHAGFVDFVPVIHERDREIQF-------EKGGV---------------GKVEEFEKGEVEQAAAERELHSSELEERTEIYER

Query:  DLDVKSLAMDGENAVENQLLAAQSTINEILEVEDRNILIEPVHKGDHLNLSVKVKDDRDENDYDSSGSESDRAESSSPDASMADIIPLLDELHPLLDSET
        DLDV+SLA D ENA+ENQLLAAQS  NEILEVED NI IEPVHKGDHLNLS+  KDD DENDYDSSGSESDRAESSSPDASMADIIPLLDELHPLLDSET
Subjt:  DLDVKSLAMDGENAVENQLLAAQSTINEILEVEDRNILIEPVHKGDHLNLSVKVKDDRDENDYDSSGSESDRAESSSPDASMADIIPLLDELHPLLDSET

Query:  PLPIHRSNEESDASSEQSHKSDAECVMSDDEAENQGEEGGAVEHDEDEDDDDDEGMQEEKEDESKSAIKWTEDDQKNLMDLGSLELERNQRLENLIARRR
        PLP HRSNEESDASSEQSHKSD ECVMS+DEAENQGEEGG VEHDED+DDDDDEGMQEEKEDESKSAIKWTEDDQKNLMDLGSLELERNQRLENLIARRR
Subjt:  PLPIHRSNEESDASSEQSHKSDAECVMSDDEAENQGEEGGAVEHDEDEDDDDDEGMQEEKEDESKSAIKWTEDDQKNLMDLGSLELERNQRLENLIARRR

Query:  ARNNLRMLAGKNLIDFDGFDLPVNVPPIFTAKRNPFDLPYDSYSNMGLPPIPGSAPSILLPRRNPFDLPYDPNEEKPDLKSDEFEQEFLPPLHKDMFRRH
        ARNNLRMLAGKNLID DGFDLPVNVPPI TA+RNPFDLPYDSYSNMGLPPIPGSAPSILLPRRNPFDLPYDPNEEKPDLK+D+FEQEFLPP  KDMFRRH
Subjt:  ARNNLRMLAGKNLIDFDGFDLPVNVPPIFTAKRNPFDLPYDSYSNMGLPPIPGSAPSILLPRRNPFDLPYDPNEEKPDLKSDEFEQEFLPPLHKDMFRRH

Query:  ESFSVGPSNFAVPKPEQQNIRWKPYFMPEKVAAEETSYSPLERQFSEVSESKLSSVSDTESMSSIADQDDKKPDESQPFLETTAVSYLDATASSIEHGNG
        ESFSVGPSNF VPK EQQNIRWKPYFMPEK+AAE TSYSPLERQFSEVS+SK+SSVSDTESMSSIADQDDKKPDESQ FLETTA+SYLD TAS IEHGNG
Subjt:  ESFSVGPSNFAVPKPEQQNIRWKPYFMPEKVAAEETSYSPLERQFSEVSESKLSSVSDTESMSSIADQDDKKPDESQPFLETTAVSYLDATASSIEHGNG

Query:  SWEDIGSEDYVQEDRDVHHEVIEITLGASENHFESQSGSSEIGATETPVETNASEIHSKNILVETDFSSNSSLSSLSEVDETPFEVKTDEVRPSSPRVEE
         WEDIGSEDYVQE+RDVHHEVIEITLG++E+HFESQSGSS+I   ++P+E NA+EIHSKN+LVETDFSSNSSLSSLSEV+ETPFEVKTDE++PSS + +E
Subjt:  SWEDIGSEDYVQEDRDVHHEVIEITLGASENHFESQSGSSEIGATETPVETNASEIHSKNILVETDFSSNSSLSSLSEVDETPFEVKTDEVRPSSPRVEE

Query:  SHIDTTSRSMPAALE--ADFKISSEVLDDNHPKEPVYDSSPSAEGKFS
        S ID+TS S+ AALE  ADFKI SEVLDDN  +EPVYDSSPSAEGK S
Subjt:  SHIDTTSRSMPAALE--ADFKISSEVLDDNHPKEPVYDSSPSAEGKFS
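
In the alignment rows above, the unlabelled middle line appears to follow the usual BLAST convention: a letter where the two sequences carry the same residue, `+` for a conservative substitution, and a space for a mismatch or gap. A minimal sketch of how identical positions could be tallied from one Query/midline pair under that assumption is given below; `midline_identity` is a hypothetical helper name, and the example row is the final row of the first GenBank hit.

```python
def midline_identity(query: str, midline: str) -> tuple[int, int]:
    """Return (identical_positions, aligned_length) for one alignment row.

    Assumes BLAST-style conventions: the midline repeats the residue letter
    where query and subject match, shows '+' for a conservative substitution,
    and a space for a mismatch or gap.
    """
    # Pad the midline in case trailing spaces were stripped during extraction.
    midline = midline.ljust(len(query))
    identical = sum(1 for q, m in zip(query, midline) if m.isalpha() and m == q)
    return identical, len(query)


# Final row of the XP_004144685.2 alignment, copied from the record.
query = "SSEVLDDNHPKEPVYDSSPSAEGKFS"
midline = "+SEVLDDN  +EPVYDSSPSAEGK S"

identical, length = midline_identity(query, midline)
print(f"{identical}/{length} identical ({100 * identical / length:.2f}%)")
```

For this single row the tally is 21 of 26 positions. The % identity column is reported for the whole alignment, so a per-row figure is not expected to match it.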

TrEMBL top hits | e value | % identity
A0A0A0KYZ8 Uncharacterized protein | 0.0e+00 | 84.71
Query:  RSVRERGEGYGIGCG----NGSCPSLFSLLVSASPVLICTAVLLGTLLSFGQPNIPEIETEEKVSREVASLRSGILDNATVVAKEEDSFTVERFDGNKVE
        RSVR     +G+ C       SCP LFSLLVSASPVLICTAVLLGTLLS+GQPNIPEIETEEKVSR+VASLRSGILDNATVVAKE+DSFTVERF+GN+VE
Subjt:  RSVRERGEGYGIGCG----NGSCPSLFSLLVSASPVLICTAVLLGTLLSFGQPNIPEIETEEKVSREVASLRSGILDNATVVAKEEDSFTVERFDGNKVE

Query:  NSYVERGSEDDRKTSGLDEHAGFVDFVPVIHERDREIQFEKGGVGKVEEFEKGEVEQAAAERELHSSELEERTEIYERDLDVKSLAMDGENAVENQLLAA
        NSYV RG E++RKT  LDEHAGFVDFV VIHER+REIQFEKGG+ + EEFEKGEVE+AA E+E H+SELEER EIY++DLD+++LA D ENAVENQLLAA
Subjt:  NSYVERGSEDDRKTSGLDEHAGFVDFVPVIHERDREIQFEKGGVGKVEEFEKGEVEQAAAERELHSSELEERTEIYERDLDVKSLAMDGENAVENQLLAA

Query:  QSTINEILEVEDRNILIEPVHKGDHLNLSVKVKDDRDENDYDSSGSESDRAESSSPDASMADIIPLLDELHPLLDSETPLPIHRSNEESDASSEQSHKSD
        QS  NEILEVEDRNI IEPVHKGDHL+LS+  KDD DEN YDSSGSESDRAESSSPDASMADIIPLLDELHPLLDSETPLP HRSNEESDASSEQSHKSD
Subjt:  QSTINEILEVEDRNILIEPVHKGDHLNLSVKVKDDRDENDYDSSGSESDRAESSSPDASMADIIPLLDELHPLLDSETPLPIHRSNEESDASSEQSHKSD

Query:  AECVMSDDEAENQGEEGGAVEHDEDEDDDDDEGMQEEKEDESKSAIKWTEDDQKNLMDLGSLELERNQRLENLIARRRARNNLRMLAGKNLIDFDGFDLP
         ECVMSDDEAENQGEEGG VEHDEDEDDDDDEGMQEEKEDESKSAIKWTEDDQKNLMDLGSLELERNQRLENLIARRRARNNLRMLAGKNLID DGF+LP
Subjt:  AECVMSDDEAENQGEEGGAVEHDEDEDDDDDEGMQEEKEDESKSAIKWTEDDQKNLMDLGSLELERNQRLENLIARRRARNNLRMLAGKNLIDFDGFDLP

Query:  VNVPPIFTAKRNPFDLPYDSYSNMGLPPIPGSAPSILLPRRNPFDLPYDPNEEKPDLKSDEFEQEFLPPLHKDMFRRHESFSVGPSNFAVPKPEQQNIRW
         NVPPI TA+RNPFDLPYDSYSNMGLPPIPGSAPSILLPRRNPFDLPYD NEEKPDLKSD+FEQEFL P  KDMFRRHESFSVGPSNFAVPK EQQNIRW
Subjt:  VNVPPIFTAKRNPFDLPYDSYSNMGLPPIPGSAPSILLPRRNPFDLPYDPNEEKPDLKSDEFEQEFLPPLHKDMFRRHESFSVGPSNFAVPKPEQQNIRW

Query:  KPYFMPEKVAAEETSYSPLERQFSEVSESKLSSVSDTESMSSIADQDDKKPDESQPFLETTAVSYLDATASSIEHGNGSWEDIGSEDYVQEDRDVHHEVI
        KPYFMPEK+AAE TSYSPLERQFSEVSESK+SSVSDTESMSSIADQDDKKPDESQ FLETTAVSYL  TAS IEHGNG WEDIGSEDYVQE+RDVHHEVI
Subjt:  KPYFMPEKVAAEETSYSPLERQFSEVSESKLSSVSDTESMSSIADQDDKKPDESQPFLETTAVSYLDATASSIEHGNGSWEDIGSEDYVQEDRDVHHEVI

Query:  EITLGASENHFESQSGSSEIGATETPVETNASEIHSKNILVETDFSSNSSLSSLSEVD-ETPFEVKTDEVRPSSPRVEESHIDTTSRSMPAALE-ADFKI
        EITLG++E+HFESQSGSS I   +TP+E NASEIHSKN+LVETDFSSNSSLSSLSE + ET FEVKTDEV+PSS   EES IDTT+ S+PA  E  DFK 
Subjt:  EITLGASENHFESQSGSSEIGATETPVETNASEIHSKNILVETDFSSNSSLSSLSEVD-ETPFEVKTDEVRPSSPRVEESHIDTTSRSMPAALE-ADFKI

Query:  SSEVLDDNHPKEPVYDSSPSAEGKFS
        +SEVLDDN  +EPVYDSSPSAEGK S
Subjt:  SSEVLDDNHPKEPVYDSSPSAEGKFS

A0A1S3B4T0 uncharacterized protein LOC103486029 | 0.0e+00 | 84.15
Query:  RSVRERGEGYGIGCG----NGSCPSLFSLLVSASPVLICTAVLLGTLLSFGQPNIPEIETEEKVSREVASLRSGILDNATVVAKEEDSFTVERFDGNKVE
        RSVR     +G+ C       SCP LFSLLVSASPVLICTAVLLGTLLS+GQPNIPEIET EKVSR+VASLRSGILDNATVVAKE+DSFTVERF+GN+VE
Subjt:  RSVRERGEGYGIGCG----NGSCPSLFSLLVSASPVLICTAVLLGTLLSFGQPNIPEIETEEKVSREVASLRSGILDNATVVAKEEDSFTVERFDGNKVE

Query:  NSYVERGSEDDRKTSGLDEHAGFVDFVPVIHERDREIQFEKGGV------------GKVEEFEKGEVEQAAAERELHSSELEERTEIYERDLDVKSLAMD
        NSYVERGSE++RKTS  DEHAGFVDFVPVIHER REIQFEKG V            G VEEFEKGEVE+AAAE+ELH+SELEER EIYERDLDV+SLA D
Subjt:  NSYVERGSEDDRKTSGLDEHAGFVDFVPVIHERDREIQFEKGGV------------GKVEEFEKGEVEQAAAERELHSSELEERTEIYERDLDVKSLAMD

Query:  GENAVENQLLAAQSTINEILEVEDRNILIEPVHKGDHLNLSVKVKDDRDENDYDSSGSESDRAESSSPDASMADIIPLLDELHPLLDSETPLPIHRSNEE
         ENA+ENQLLAAQS  NEILEV DRNI IEPVHKGDHL+LS+  KDD DEN YDSSGSESDRAESSSPDASMADIIPLLDELHPLLDSETPLP HRSNEE
Subjt:  GENAVENQLLAAQSTINEILEVEDRNILIEPVHKGDHLNLSVKVKDDRDENDYDSSGSESDRAESSSPDASMADIIPLLDELHPLLDSETPLPIHRSNEE

Query:  SDASSEQSHKSDAECVMSDDEAENQGEEGGAVEHDEDEDDDDDEGMQEEKEDESKSAIKWTEDDQKNLMDLGSLELERNQRLENLIARRRARNNLRMLAG
        SDASSEQSHKSD ECVMSDDEAENQGEEGG VEHDEDED+DDDEGMQEEKEDESKSAIKWTEDDQKNLMDLGSLELERNQRLENLIARRRARNNLRMLAG
Subjt:  SDASSEQSHKSDAECVMSDDEAENQGEEGGAVEHDEDEDDDDDEGMQEEKEDESKSAIKWTEDDQKNLMDLGSLELERNQRLENLIARRRARNNLRMLAG

Query:  KNLIDFDGFDLPVNVPPIFTAKRNPFDLPYDSYSNMGLPPIPGSAPSILLPRRNPFDLPYDPNEEKPDLKSDEFEQEFLPPLHKDMFRRHESFSVGPSNF
        KNLID DGF+LP NVPPI TA+RNPFDLPYDSYSNMGLPPIPGSAPSILLPRRNPFDLPYDPNEEKPDLKSD+FEQEFL P  KDMFRRHESFSVGPSNF
Subjt:  KNLIDFDGFDLPVNVPPIFTAKRNPFDLPYDSYSNMGLPPIPGSAPSILLPRRNPFDLPYDPNEEKPDLKSDEFEQEFLPPLHKDMFRRHESFSVGPSNF

Query:  AVPKPEQQNIRWKPYFMPEKVAAEETSYSPLERQFSEVSESKLSSVSDTESMSSIADQDDKKPDESQPFLETTAVSYLDATASSIEHGNGSWEDIGSEDY
        AVPK EQQNIRWKPYFMPEK+AAE TSYSPLERQFSEVSESK+SSVSDTESMSSIADQDDKKPDESQ FLETTAVSYLD TA  IEHGNG WEDIGSEDY
Subjt:  AVPKPEQQNIRWKPYFMPEKVAAEETSYSPLERQFSEVSESKLSSVSDTESMSSIADQDDKKPDESQPFLETTAVSYLDATASSIEHGNGSWEDIGSEDY

Query:  VQEDRDVHHEVIEITLGASENHFESQSGSSEIGATETPVETNASEIHSKNILVETDFSSNSSLSSLSEVD-ETPFEVKTDEVRPSSPRVEESHIDTTSRS
        VQE+RDVHHEVIEITLG++E+HFES SGSS I   +TP+E NASEIHSK++LVETDFSSNSSLSSLSE + ET FEVKTDEV+PSS   EES IDTT+ S
Subjt:  VQEDRDVHHEVIEITLGASENHFESQSGSSEIGATETPVETNASEIHSKNILVETDFSSNSSLSSLSEVD-ETPFEVKTDEVRPSSPRVEESHIDTTSRS

Query:  MPAALE-ADFKISSEVLDDNHPKEPVYDSSPSAEGKFS
        +PA  E  DFK++SEVLDDN  +EPVYDSSPSAEGK S
Subjt:  MPAALE-ADFKISSEVLDDNHPKEPVYDSSPSAEGKFS

A0A5A7TJW0 Uncharacterized protein | 0.0e+00 | 84.15
Query:  RSVRERGEGYGIGCG----NGSCPSLFSLLVSASPVLICTAVLLGTLLSFGQPNIPEIETEEKVSREVASLRSGILDNATVVAKEEDSFTVERFDGNKVE
        RSVR     +G+ C       SCP LFSLLVSASPVLICTAVLLGTLLS+GQPNIPEIET EKVSR+VASLRSGILDNATVVAKE+DSFTVERF+GN+VE
Subjt:  RSVRERGEGYGIGCG----NGSCPSLFSLLVSASPVLICTAVLLGTLLSFGQPNIPEIETEEKVSREVASLRSGILDNATVVAKEEDSFTVERFDGNKVE

Query:  NSYVERGSEDDRKTSGLDEHAGFVDFVPVIHERDREIQFEKGGV------------GKVEEFEKGEVEQAAAERELHSSELEERTEIYERDLDVKSLAMD
        NSYVERGSE++RKTS  DEHAGFVDFVPVIHER REIQFEKG V            G VEEFEKGEVE+AAAE+ELH+SELEER EIYERDLDV+SLA D
Subjt:  NSYVERGSEDDRKTSGLDEHAGFVDFVPVIHERDREIQFEKGGV------------GKVEEFEKGEVEQAAAERELHSSELEERTEIYERDLDVKSLAMD

Query:  GENAVENQLLAAQSTINEILEVEDRNILIEPVHKGDHLNLSVKVKDDRDENDYDSSGSESDRAESSSPDASMADIIPLLDELHPLLDSETPLPIHRSNEE
         ENA+ENQLLAAQS  NEILEV DRNI IEPVHKGDHL+LS+  KDD DEN YDSSGSESDRAESSSPDASMADIIPLLDELHPLLDSETPLP HRSNEE
Subjt:  GENAVENQLLAAQSTINEILEVEDRNILIEPVHKGDHLNLSVKVKDDRDENDYDSSGSESDRAESSSPDASMADIIPLLDELHPLLDSETPLPIHRSNEE

Query:  SDASSEQSHKSDAECVMSDDEAENQGEEGGAVEHDEDEDDDDDEGMQEEKEDESKSAIKWTEDDQKNLMDLGSLELERNQRLENLIARRRARNNLRMLAG
        SDASSEQSHKSD ECVMSDDEAENQGEEGG VEHDEDED+DDDEGMQEEKEDESKSAIKWTEDDQKNLMDLGSLELERNQRLENLIARRRARNNLRMLAG
Subjt:  SDASSEQSHKSDAECVMSDDEAENQGEEGGAVEHDEDEDDDDDEGMQEEKEDESKSAIKWTEDDQKNLMDLGSLELERNQRLENLIARRRARNNLRMLAG

Query:  KNLIDFDGFDLPVNVPPIFTAKRNPFDLPYDSYSNMGLPPIPGSAPSILLPRRNPFDLPYDPNEEKPDLKSDEFEQEFLPPLHKDMFRRHESFSVGPSNF
        KNLID DGF+LP NVPPI TA+RNPFDLPYDSYSNMGLPPIPGSAPSILLPRRNPFDLPYDPNEEKPDLKSD+FEQEFL P  KDMFRRHESFSVGPSNF
Subjt:  KNLIDFDGFDLPVNVPPIFTAKRNPFDLPYDSYSNMGLPPIPGSAPSILLPRRNPFDLPYDPNEEKPDLKSDEFEQEFLPPLHKDMFRRHESFSVGPSNF

Query:  AVPKPEQQNIRWKPYFMPEKVAAEETSYSPLERQFSEVSESKLSSVSDTESMSSIADQDDKKPDESQPFLETTAVSYLDATASSIEHGNGSWEDIGSEDY
        AVPK EQQNIRWKPYFMPEK+AAE TSYSPLERQFSEVSESK+SSVSDTESMSSIADQDDKKPDESQ FLETTAVSYLD TA  IEHGNG WEDIGSEDY
Subjt:  AVPKPEQQNIRWKPYFMPEKVAAEETSYSPLERQFSEVSESKLSSVSDTESMSSIADQDDKKPDESQPFLETTAVSYLDATASSIEHGNGSWEDIGSEDY

Query:  VQEDRDVHHEVIEITLGASENHFESQSGSSEIGATETPVETNASEIHSKNILVETDFSSNSSLSSLSEVD-ETPFEVKTDEVRPSSPRVEESHIDTTSRS
        VQE+RDVHHEVIEITLG++E+HFES SGSS I   +TP+E NASEIHSK++LVETDFSSNSSLSSLSE + ET FEVKTDEV+PSS   EES IDTT+ S
Subjt:  VQEDRDVHHEVIEITLGASENHFESQSGSSEIGATETPVETNASEIHSKNILVETDFSSNSSLSSLSEVD-ETPFEVKTDEVRPSSPRVEESHIDTTSRS

Query:  MPAALE-ADFKISSEVLDDNHPKEPVYDSSPSAEGKFS
        +PA  E  DFK++SEVLDDN  +EPVYDSSPSAEGK S
Subjt:  MPAALE-ADFKISSEVLDDNHPKEPVYDSSPSAEGKFS

A0A6J1DM73 uncharacterized protein LOC111021742 | 0.0e+00 | 82.44
Query:  RSVRERGEGYGIGCG----NGSCPSLFSLLVSASPVLICTAVLLGTLLSFGQPNIPEIETEEKVSREVASLRSGILDNATVVAKEEDSFTVERFDGNKVE
        RSVR     +G+ C       SCP LFSLLVSASPVLICTAVLLGTLLSFGQPNIPEIE EEKVS++VASLRSGILDNATVVAKE+DSFTVE+F+GNKV 
Subjt:  RSVRERGEGYGIGCG----NGSCPSLFSLLVSASPVLICTAVLLGTLLSFGQPNIPEIETEEKVSREVASLRSGILDNATVVAKEEDSFTVERFDGNKVE

Query:  NSYVERGSEDDRKTSGLDEHAGFVDFVPVIHERDREIQFEKGGVGKVEEF-EKGEVEQAAAERELHSSELEERTEIYERDLDVKSLAMDGENAVENQLLA
        +SYVE GSE+D+KTS LDEH GFVDFVP+IHERDREI FEKG  G VEEF EKG+VE+ A    LHSSELEE+ EIYERDLDVKS+A D ENAVENQLLA
Subjt:  NSYVERGSEDDRKTSGLDEHAGFVDFVPVIHERDREIQFEKGGVGKVEEF-EKGEVEQAAAERELHSSELEERTEIYERDLDVKSLAMDGENAVENQLLA

Query:  AQSTINEILEVEDRNILIEPVHKGD-HLNLSVKVKDDRDENDYDSSGSESDRAESSSPDASMADIIPLLDELHPLLDSETPLPIHRSNEESDASSEQSHK
        AQS  NEILEVEDRNI IEPVHKGD HLNL+    +D DE+DYDSSGSESDRAESSSPDASMADIIPLLDELHPLLDSETPLP++RSNE SDASSEQSHK
Subjt:  AQSTINEILEVEDRNILIEPVHKGD-HLNLSVKVKDDRDENDYDSSGSESDRAESSSPDASMADIIPLLDELHPLLDSETPLPIHRSNEESDASSEQSHK

Query:  SDAECVMSDDEAENQGEEGGAVEHDEDEDDDDDEGMQEEKEDESKSAIKWTEDDQKNLMDLGSLELERNQRLENLIARRRARNNLRMLAGKNLIDFDGFD
        SD ECVMSDD+AENQGEE GAVE+D+++DD+DDEGMQEEKEDESKSAIKWTEDDQKNLMDLGSLELERNQRLENLIARRRARNN+RMLAGKNLID DGFD
Subjt:  SDAECVMSDDEAENQGEEGGAVEHDEDEDDDDDEGMQEEKEDESKSAIKWTEDDQKNLMDLGSLELERNQRLENLIARRRARNNLRMLAGKNLIDFDGFD

Query:  LPVNVPPIFTAKRNPFDLPYDSYSNMGLPPIPGSAPSILLPRRNPFDLPYDPNEEKPDLKSDEFEQEFLPPLHKDM-FRRHESFSVGPSNFAVPKPEQQN
        LP NVPPI T++RNPFDLPYDSY+NMGLPPIPGSAPSILLPRRNPFDLPYDPNEEKPDLKSD+FEQEFLP   KD+ FRRHESFSVGPS+FAVPK EQQN
Subjt:  LPVNVPPIFTAKRNPFDLPYDSYSNMGLPPIPGSAPSILLPRRNPFDLPYDPNEEKPDLKSDEFEQEFLPPLHKDM-FRRHESFSVGPSNFAVPKPEQQN

Query:  IRWKPYFMPEKVAAEETSYSPLERQFSEVSESKLSSVSDTESMSSIADQDDKKPDESQPFLETTAVSYLDATASSIEHGNGSWEDIGSEDYVQEDRDVHH
        IRW+PYFMPEK+A+E TSYSPL+ QFSEVSESKLSSVSDTESMSSIADQD+KKPDES+ FLETTAVSYLD TASSIEHGNGSWED+GSEDYVQE RDVHH
Subjt:  IRWKPYFMPEKVAAEETSYSPLERQFSEVSESKLSSVSDTESMSSIADQDDKKPDESQPFLETTAVSYLDATASSIEHGNGSWEDIGSEDYVQEDRDVHH

Query:  EVIEITLGASENHFESQSGSSEIGATETPVETNASEIHSKNILVETDFSSNSSLSSLSEVDETPFEVKTDEVRPSSPRVEESHIDTTSRSMPAALE--AD
        EVIEITLG++E+HFE QSGSSEIGA E PVE NA+EIH K+++VETDFSS+SSLSSLSEV+ETPFEVKTDEV+PSSPR+ ES +     SM AALE  AD
Subjt:  EVIEITLGASENHFESQSGSSEIGATETPVETNASEIHSKNILVETDFSSNSSLSSLSEVDETPFEVKTDEVRPSSPRVEESHIDTTSRSMPAALE--AD

Query:  FKISSEVLDDNHPKEPVYDSSPSAEGKFS
        FKI+SEVLDDN  KEPVYDSSPSAEGK S
Subjt:  FKISSEVLDDNHPKEPVYDSSPSAEGKFS

A0A6J1ILQ6 uncharacterized protein LOC111478159 isoform X3 | 7.1e-304 | 80.78
Query:  SCPSLFSLLVSASPVLICTAVLLGTLLSFGQPNIPEIETEEKVSREVASLRSGILDNATVVAKEEDSFTV-------------ERFDGNKVENSYVERGS
        SCP LFSLLVSASPVLICTA LLGTLLSFGQPNIPEIETEEKVSR+VA   S ILDNATVVAKE+DSFTV             ERF+GN+V NSYVERGS
Subjt:  SCPSLFSLLVSASPVLICTAVLLGTLLSFGQPNIPEIETEEKVSREVASLRSGILDNATVVAKEEDSFTV-------------ERFDGNKVENSYVERGS

Query:  EDDRKTSGLDEHAGFVDFVPVIHERDREIQFEKGGVGKVEEFEKGEVEQAAAERELHSSELEERTEIYERDLDVKSLAMDGENAVENQLLAAQSTINEIL
        E++RKTS LDEHAGFV  VPVI+E +REIQFEK   G VEEFEKGE+E+AA ERE  SSELEER EIYE+DLDVKSL  DGEN VENQLLAA+ST NE+ 
Subjt:  EDDRKTSGLDEHAGFVDFVPVIHERDREIQFEKGGVGKVEEFEKGEVEQAAAERELHSSELEERTEIYERDLDVKSLAMDGENAVENQLLAAQSTINEIL

Query:  EVEDRNILIEPVHKGDHLNLSVKVKDDRDENDYDSSGSESDRAESSSPDASMADIIPLLDELHPLLDSETPLPIHRSNEESDASSEQSHKSDAECVMSDD
        EVED NI IE  HKGD L+LS+  KDD  ENDY+S  SESDRAESSSPDASM DIIPLLDELHPLLDSETP P   SNEESDA SE  HKSD ECVMSDD
Subjt:  EVEDRNILIEPVHKGDHLNLSVKVKDDRDENDYDSSGSESDRAESSSPDASMADIIPLLDELHPLLDSETPLPIHRSNEESDASSEQSHKSDAECVMSDD

Query:  EAENQGEEGGAVEHDEDEDDDDDEGMQEEKEDESKSAIKWTEDDQKNLMDLGSLELERNQRLENLIARRRARNNLRMLAGKNLIDFDGFDLPVNVPPIFT
        EAENQGEE G VE DED++DDDDEGMQEEKEDESKSAIKWTEDDQKNLMDLGSLELERNQRLENLIARRRARNNLRMLAG NL+D DGFDLP NVPPI T
Subjt:  EAENQGEEGGAVEHDEDEDDDDDEGMQEEKEDESKSAIKWTEDDQKNLMDLGSLELERNQRLENLIARRRARNNLRMLAGKNLIDFDGFDLPVNVPPIFT

Query:  AKRNPFDLPYDSYSNMGLPPIPGSAPSILLPRRNPFDLPYDPNEEKPDLKSDEFEQEFLPPLHKDMFRRHESFSVGPSNFAVPKPEQQNIRWKPYFMPEK
         +RNPFDLPYDSY+NMGLPPIPGSAPSILLPRRNPFDLPYDPNEEKPDLKSD+FE EFLPP  KDMFRRHESFSVGPSNF++PK EQQNIRWKPYFMPEK
Subjt:  AKRNPFDLPYDSYSNMGLPPIPGSAPSILLPRRNPFDLPYDPNEEKPDLKSDEFEQEFLPPLHKDMFRRHESFSVGPSNFAVPKPEQQNIRWKPYFMPEK

Query:  VAAEETSYSPLERQFSEVSESKLSSVSDTESMSSIADQDDKKPDESQPFLETTAVSYLDATASSIEHGNGSWEDIGSEDYVQEDRDVHHEVIEITLGASE
        VAAEET+YSPLERQ SE SESKLS VSDTESMSSIADQDDKKPDES  FLETTAVS+LD  AS IEHGNG WEDIGSE+YVQE+R VHHEVIEITLG++E
Subjt:  VAAEETSYSPLERQFSEVSESKLSSVSDTESMSSIADQDDKKPDESQPFLETTAVSYLDATASSIEHGNGSWEDIGSEDYVQEDRDVHHEVIEITLGASE

Query:  NHFESQSGSSEIGATETPVETNASEIHSKNILVETDFSSNSSLSSLSEVDETPFEVKTDEVRPSSPRVEESHIDTTSRSMPAALE--ADFKISSEVLDDN
        +HFESQSGSSEIGA + PVE NASEIHSKN+LVETD SS+SSLSSLSEV+ET  EVKTDE +P+SP+ EES IDTTS +M  A E  ADFKI SEVLDDN
Subjt:  NHFESQSGSSEIGATETPVETNASEIHSKNILVETDFSSNSSLSSLSEVDETPFEVKTDEVRPSSPRVEESHIDTTSRSMPAALE--ADFKISSEVLDDN

Query:  HPKEPVYDSSPSAEGKFS
           EPVYDSSPSAEGK S
Subjt:  HPKEPVYDSSPSAEGKFS

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
AT1G07330.1 unknown protein | 3.6e-26 | 33.25
Query:  DEAENQGEEGGAVEHDEDEDDDDDEGMQEEKEDESKSAIKWTEDDQKNLMDLGSLELERNQRLENLIARRRARNNLRMLAGKNLIDFDGFDLPVNVPPIF
        D +E     GG  E +     + +E  +EE   E K  + WTEDDQKNLMDLG+ E+ERN+RLE+LI RRR R  +R+ A  +L+D +       VPP+ 
Subjt:  DEAENQGEEGGAVEHDEDEDDDDDEGMQEEKEDESKSAIKWTEDDQKNLMDLGSLELERNQRLENLIARRRARNNLRMLAGKNLIDFDGFDLPVNVPPIF

Query:  TAKRNPFDLPYDSYSNMGLPPIPGSAPSILLPRRNPFDLPYDPNEEKPDLKSDEFEQEFLPPLHKDMFRRHESFSVGPSNFAVPKPEQQNIRWKPYFMPE
           RN F L  ++Y   GL  +P SAPS+LLP +NPFD+PYDP EEKP+L  D F+QEF    +   F RHESF         P   Q + +W+P+   +
Subjt:  TAKRNPFDLPYDSYSNMGLPPIPGSAPSILLPRRNPFDLPYDPNEEKPDLKSDEFEQEFLPPLHKDMFRRHESFSVGPSNFAVPKPEQQNIRWKPYFMPE

Query:  KVAAEETSYSPLERQFSEVSESK---LSSVSDTES--MSSIADQDDK---KPDESQPFLETTAVSYLDATASSIEHGNGSWE--------------DIGS
        K   ++ S   L  +   V + K      V+D ES  M+ I   D      P++ +   + +  +Y   T+     GNG                  + S
Subjt:  KVAAEETSYSPLERQFSEVSESK---LSSVSDTES--MSSIADQDDK---KPDESQPFLETTAVSYLDATASSIEHGNGSWE--------------DIGS

Query:  EDYVQEDRDVHHEVIEITLGASEN-HFESQSGSSEIGATETPVETNASEIHSKNILV-------ETDFSSNSSLSSLSE
            +  R V H       G   +   + Q   SEIG+  T V+ N S    K+ +V       ET FS   S+   +E
Subjt:  EDYVQEDRDVHHEVIEITLGASEN-HFESQSGSSEIGATETPVETNASEIHSKNILV-------ETDFSSNSSLSSLSE

AT2G29620.1 unknown protein | 8.9e-25 | 35.4
Query:  LHSSELEERTEIYERDLDVKSLAMDGENAVENQLLAAQSTINEILEVEDRNILIEPVHKGDHLNLSVKVKDDRDENDYDSSGSESDRAESSSPDASMADI
        L SS L      Y R+ D+ S   D +   +  L    STI++    E R    +  H+   +  + + K +    D+DSS +  D            ++
Subjt:  LHSSELEERTEIYERDLDVKSLAMDGENAVENQLLAAQSTINEILEVEDRNILIEPVHKGDHLNLSVKVKDDRDENDYDSSGSESDRAESSSPDASMADI

Query:  IPLLDELHPLLDS---ETPLPIHRSNEESDASSEQSHKSDAECVMSDDEAENQGEEGGAVEHDEDEDDDDDEGMQEEKEDESKSAIKWTEDDQKNLMDLG
        +P  + + P ++    E  L +   N             + E ++S D     G++   VE       + ++  +E +ED SK  + WTEDDQKNLMDLG
Subjt:  IPLLDELHPLLDS---ETPLPIHRSNEESDASSEQSHKSDAECVMSDDEAENQGEEGGAVEHDEDEDDDDDEGMQEEKEDESKSAIKWTEDDQKNLMDLG

Query:  SLELERNQRLENLIARRRARNNLRMLAGKNLIDFDGFDLPVNVPPIFTAKRNPFDLPYDSYSNMGLPPIPGSAPSILLPRRNPFDLPYDPNEEKPDLKSD
        + E+ERN+RLENLI+RRR+R    + A  +L+D    D+ V   P     RN +     +Y   GL  +PGSAPS+LLPRRNPFDLPYDP EEKP+L  D
Subjt:  SLELERNQRLENLIARRRARNNLRMLAGKNLIDFDGFDLPVNVPPIFTAKRNPFDLPYDSYSNMGLPPIPGSAPSILLPRRNPFDLPYDPNEEKPDLKSD

Query:  EFEQEFLPPLHKDM-FRRHESF
         F+QEF     KD+ F RHESF
Subjt:  EFEQEFLPPLHKDM-FRRHESF

AT5G17910.1 unknown protein | 1.5e-120 | 42.29
Query:  CPSLFSLLVSASPVLICTAVLLGTLLSFGQPNIPEIETEEKVSREVASLRSGILDNATVVAKE---EDSFTVERFDGNKVENSYVERGSED-----DRKT
        CP LF+ LV+ASPVL+CT VLLGT+LSFG+PNIPEIE + ++  E A LR+ +  +A V   +   ++SFTVE F G   E   +E G++D     D + 
Subjt:  CPSLFSLLVSASPVLICTAVLLGTLLSFGQPNIPEIETEEKVSREVASLRSGILDNATVVAKE---EDSFTVERFDGNKVENSYVERGSED-----DRKT

Query:  SGLDEHAGFVDFVPVIHE------RDREIQF-EKGGVGKVEEFEKGEVEQAAAERELHSSELEERTEIYERDLDVKSLAMDGENAVENQLLAAQSTINEI
        S +++     D+ P++ E      RD  ++F EK  +  VE  +KG+ E                        D K +  DG  A +++      T   +
Subjt:  SGLDEHAGFVDFVPVIHE------RDREIQF-EKGGVGKVEEFEKGEVEQAAAERELHSSELEERTEIYERDLDVKSLAMDGENAVENQLLAAQSTINEI

Query:  LEVEDRNILIEPVHKGDHLNLSVKVKDDRDENDYDSSGSESDRAESSSPDASMADIIPLLDELHPLLDSETPLPIHRSNEESDASSEQSHKSDA-ECVMS
         E  D  + + PV     +       DD D +D   SG  SD AESSSPDASM DIIP+LDELHPLL SE P       E SDA+SE  H+S + E + S
Subjt:  LEVEDRNILIEPVHKGDHLNLSVKVKDDRDENDYDSSGSESDRAESSSPDASMADIIPLLDELHPLLDSETPLPIHRSNEESDASSEQSHKSDA-ECVMS

Query:  DDEAENQGEEGGAVEHD--EDEDDDDDEGMQEEKE---DESKSAIKWTEDDQKNLMDLGSLELERNQRLENLIARRRARNNLRMLAGKNLIDFDGFDLPV
        D ++E+ GEEG     D  EDE+++D+E  QE+KE   DESKSAIKWTE DQ+N+MDLGSLELERNQRLENLIARRRAR+N+R++A +NLIDFD  D+P 
Subjt:  DDEAENQGEEGGAVEHD--EDEDDDDDEGMQEEKE---DESKSAIKWTEDDQKNLMDLGSLELERNQRLENLIARRRARNNLRMLAGKNLIDFDGFDLPV

Query:  NVPPIFTAKRNPFDLPYDSYSNMGLPPIPGSAPSILLPRRNPFDLPYDPNEEKPDLKSDEFEQEFLPPLHKD-MFRRHESFSVGPSNFAVPKPEQQNIRW
        N+PPI TA+ NPFD+ YDSY +M   PIPGSAPSI+  RRNPFDLPY+PNEEKPDLK D F++EF     KD MFRRHESFSVGPS    P+ +    R 
Subjt:  NVPPIFTAKRNPFDLPYDSYSNMGLPPIPGSAPSILLPRRNPFDLPYDPNEEKPDLKSDEFEQEFLPPLHKD-MFRRHESFSVGPSNFAVPKPEQQNIRW

Query:  KPYFMPEKVAAEETSYSPLERQFSEVSESKLSSVSDTESMSSIADQDDKKPDESQPFLETTAVSYLDATASSIEHGNGSWEDIGSED------------Y
        +P+F+ E++A E TSY P ERQ SEVSESK+SS+ DTES+ ++ + D+KK DE+    E T ++ +D  + + E  N S  D   E+            +
Subjt:  KPYFMPEKVAAEETSYSPLERQFSEVSESKLSSVSDTESMSSIADQDDKKPDESQPFLETTAVSYLDATASSIEHGNGSWEDIGSED------------Y

Query:  VQEDRD---------VHHEVIEITLGASENHFESQSGSSEIGATETPVETNASEIHSKNILVETDFSSNSSLSSLSEVDETPFEVKTDEVRPSSPRVEES
          ED D         +HH+V EI LG+ E H E     S++   ET  +    E+             + S SSLSE +E   ++  DE    S +V + 
Subjt:  VQEDRD---------VHHEVIEITLGASENHFESQSGSSEIGATETPVETNASEIHSKNILVETDFSSNSSLSSLSEVDETPFEVKTDEVRPSSPRVEES

Query:  HIDTTSRSMPAALEADFKISSEVLDDNH--------------------------------PKEPVYDSSPSAEGKF-SFSPVSSD
        H +  + S+P+  E +  ++  V DD H                                 +EPVYDSSP +  +F SFS VSSD
Subjt:  HIDTTSRSMPAALEADFKISSEVLDDNH--------------------------------PKEPVYDSSPSAEGKF-SFSPVSSD

AT5G17910.2 unknown protein | 1.5e-120 | 42.29
Query:  CPSLFSLLVSASPVLICTAVLLGTLLSFGQPNIPEIETEEKVSREVASLRSGILDNATVVAKE---EDSFTVERFDGNKVENSYVERGSED-----DRKT
        CP LF+ LV+ASPVL+CT VLLGT+LSFG+PNIPEIE + ++  E A LR+ +  +A V   +   ++SFTVE F G   E   +E G++D     D + 
Subjt:  CPSLFSLLVSASPVLICTAVLLGTLLSFGQPNIPEIETEEKVSREVASLRSGILDNATVVAKE---EDSFTVERFDGNKVENSYVERGSED-----DRKT

Query:  SGLDEHAGFVDFVPVIHE------RDREIQF-EKGGVGKVEEFEKGEVEQAAAERELHSSELEERTEIYERDLDVKSLAMDGENAVENQLLAAQSTINEI
        S +++     D+ P++ E      RD  ++F EK  +  VE  +KG+ E                        D K +  DG  A +++      T   +
Subjt:  SGLDEHAGFVDFVPVIHE------RDREIQF-EKGGVGKVEEFEKGEVEQAAAERELHSSELEERTEIYERDLDVKSLAMDGENAVENQLLAAQSTINEI

Query:  LEVEDRNILIEPVHKGDHLNLSVKVKDDRDENDYDSSGSESDRAESSSPDASMADIIPLLDELHPLLDSETPLPIHRSNEESDASSEQSHKSDA-ECVMS
         E  D  + + PV     +       DD D +D   SG  SD AESSSPDASM DIIP+LDELHPLL SE P       E SDA+SE  H+S + E + S
Subjt:  LEVEDRNILIEPVHKGDHLNLSVKVKDDRDENDYDSSGSESDRAESSSPDASMADIIPLLDELHPLLDSETPLPIHRSNEESDASSEQSHKSDA-ECVMS

Query:  DDEAENQGEEGGAVEHD--EDEDDDDDEGMQEEKE---DESKSAIKWTEDDQKNLMDLGSLELERNQRLENLIARRRARNNLRMLAGKNLIDFDGFDLPV
        D ++E+ GEEG     D  EDE+++D+E  QE+KE   DESKSAIKWTE DQ+N+MDLGSLELERNQRLENLIARRRAR+N+R++A +NLIDFD  D+P 
Subjt:  DDEAENQGEEGGAVEHD--EDEDDDDDEGMQEEKE---DESKSAIKWTEDDQKNLMDLGSLELERNQRLENLIARRRARNNLRMLAGKNLIDFDGFDLPV

Query:  NVPPIFTAKRNPFDLPYDSYSNMGLPPIPGSAPSILLPRRNPFDLPYDPNEEKPDLKSDEFEQEFLPPLHKD-MFRRHESFSVGPSNFAVPKPEQQNIRW
        N+PPI TA+ NPFD+ YDSY +M   PIPGSAPSI+  RRNPFDLPY+PNEEKPDLK D F++EF     KD MFRRHESFSVGPS    P+ +    R 
Subjt:  NVPPIFTAKRNPFDLPYDSYSNMGLPPIPGSAPSILLPRRNPFDLPYDPNEEKPDLKSDEFEQEFLPPLHKD-MFRRHESFSVGPSNFAVPKPEQQNIRW

Query:  KPYFMPEKVAAEETSYSPLERQFSEVSESKLSSVSDTESMSSIADQDDKKPDESQPFLETTAVSYLDATASSIEHGNGSWEDIGSED------------Y
        +P+F+ E++A E TSY P ERQ SEVSESK+SS+ DTES+ ++ + D+KK DE+    E T ++ +D  + + E  N S  D   E+            +
Subjt:  KPYFMPEKVAAEETSYSPLERQFSEVSESKLSSVSDTESMSSIADQDDKKPDESQPFLETTAVSYLDATASSIEHGNGSWEDIGSED------------Y

Query:  VQEDRD---------VHHEVIEITLGASENHFESQSGSSEIGATETPVETNASEIHSKNILVETDFSSNSSLSSLSEVDETPFEVKTDEVRPSSPRVEES
          ED D         +HH+V EI LG+ E H E     S++   ET  +    E+             + S SSLSE +E   ++  DE    S +V + 
Subjt:  VQEDRD---------VHHEVIEITLGASENHFESQSGSSEIGATETPVETNASEIHSKNILVETDFSSNSSLSSLSEVDETPFEVKTDEVRPSSPRVEES

Query:  HIDTTSRSMPAALEADFKISSEVLDDNH--------------------------------PKEPVYDSSPSAEGKF-SFSPVSSD
        H +  + S+P+  E +  ++  V DD H                                 +EPVYDSSP +  +F SFS VSSD
Subjt:  HIDTTSRSMPAALEADFKISSEVLDDNH--------------------------------PKEPVYDSSPSAEGKF-SFSPVSSD

AT5G58880.1 unknown protein    e value: 1.1e-11    %identity: 36.94
Query:  SNEESDASSEQSHKSDAECVMSDDEAENQGEEGGAVEHDEDEDDD--------DDEGMQEEKEDESKSAIKWTEDDQK--------NLMDLGSLELERNQ
        S EE D     S  +D        E   +  E   VE +ED++ +        D   +  E+       IK+ E D K        N  + G  E+ERN+
Subjt:  SNEESDASSEQSHKSDAECVMSDDEAENQGEEGGAVEHDEDEDDD--------DDEGMQEEKEDESKSAIKWTEDDQK--------NLMDLGSLELERNQ

Query:  RLENLIARRRARNNLRM-LAGKNLIDFDGFDLP--VNVPPI-FTAKRNPFDLPYDSYSN----MGLPPIPGSAPSILLPRRNPFDLPYDPNEEKPDLKSD
        RLE+LIARRRAR   R+ L  KN +  +    P   N   +  T  RN  +   ++ S+     GL  IPGSAPS++L  RNPFD+PYDP EE+P+L  D
Subjt:  RLENLIARRRARNNLRM-LAGKNLIDFDGFDLP--VNVPPI-FTAKRNPFDLPYDSYSN----MGLPPIPGSAPSILLPRRNPFDLPYDPNEEKPDLKSD

Query:  EFEQEFLPPLHKDM-FRRHESF
         F+QEF     KD+ F RHESF
Subjt:  EFEQEFLPPLHKDM-FRRHESF


Sequences
CDS sequence
ATGTCAGATGTTCGAATTCTACAAGTAGTTCAACTAGTAAGTCAAAGTAAACATAACGATATAATTGATTGGGATAGAAGCGTTCGAGAACGGGGAGAAGGATATGGGAT
TGGCTGTGGAAATGGATCATGTCCTTCTTTGTTTTCCCTTTTGGTGTCTGCGTCCCCTGTTTTGATTTGCACCGCTGTTCTGCTTGGAACCCTTCTGAGTTTTGGACAGC
CTAATATACCTGAAATTGAAACGGAGGAGAAGGTTTCGCGTGAGGTTGCTTCTTTGAGATCTGGGATTTTGGACAATGCTACTGTTGTTGCTAAGGAGGAAGATAGTTTT
ACTGTAGAGAGATTTGATGGAAATAAAGTAGAGAATTCTTATGTGGAAAGGGGTTCTGAAGATGACAGGAAAACAAGCGGGCTTGACGAACATGCTGGTTTTGTTGACTT
TGTTCCAGTGATCCACGAGCGTGATCGCGAAATTCAGTTTGAGAAGGGAGGAGTTGGCAAAGTTGAAGAGTTTGAGAAGGGTGAAGTTGAGCAGGCAGCAGCAGAGAGAG
AATTGCATAGTTCAGAGTTGGAGGAGAGGACAGAGATTTATGAGAGGGATTTGGATGTTAAGAGTTTGGCAATGGATGGTGAAAATGCTGTTGAGAACCAGCTTTTGGCT
GCCCAAAGCACGATAAATGAGATTCTTGAAGTAGAAGATCGTAACATCTTAATAGAACCTGTACATAAAGGTGATCATCTGAACTTATCTGTCAAAGTTAAAGATGATCG
TGATGAAAATGATTATGATTCTTCGGGTTCTGAGTCTGATAGAGCAGAAAGTTCTTCACCTGATGCCTCAATGGCCGATATCATACCGTTGCTGGATGAGTTGCATCCTC
TCTTGGACTCAGAAACTCCACTACCTATTCATAGGTCAAATGAGGAGTCAGATGCTTCTTCAGAACAGTCTCATAAAAGTGATGCTGAATGTGTGATGTCGGATGATGAG
GCTGAAAACCAGGGAGAAGAAGGCGGTGCTGTTGAGCATGATGAGGACGAGGATGACGATGATGATGAGGGGATGCAGGAAGAGAAAGAGGATGAGAGCAAATCTGCTAT
CAAGTGGACTGAGGATGATCAAAAGAATCTCATGGATTTGGGAAGTTTAGAGCTAGAAAGGAATCAGCGTTTGGAGAATCTCATTGCAAGGAGAAGAGCAAGGAACAACC
TGAGAATGTTGGCTGGGAAGAATTTGATAGACTTTGATGGTTTCGATCTTCCTGTAAATGTACCACCCATATTTACAGCCAAACGTAACCCATTTGATCTCCCTTATGAT
TCATATAGTAATATGGGATTACCACCGATTCCTGGGTCTGCTCCATCCATTTTGTTGCCAAGACGTAATCCATTTGATCTCCCATATGACCCAAATGAAGAAAAACCAGA
CCTCAAGAGTGACGAGTTTGAACAAGAATTTTTGCCACCTCTACACAAAGATATGTTCCGAAGACATGAAAGTTTCAGTGTGGGCCCTTCAAACTTTGCAGTTCCCAAGC
CAGAGCAGCAGAATATTAGATGGAAACCATATTTCATGCCTGAAAAAGTAGCTGCTGAAGAAACGAGCTACTCTCCATTAGAAAGGCAATTCAGTGAAGTCAGTGAATCA
AAACTAAGTTCTGTTTCTGATACTGAATCAATGAGTTCCATTGCAGATCAGGATGACAAGAAGCCTGATGAATCACAACCTTTTCTGGAAACAACAGCAGTTTCCTACCT
TGACGCCACAGCTAGTAGTATTGAGCATGGAAATGGGTCATGGGAGGATATAGGCTCTGAAGATTACGTACAAGAAGATAGAGATGTGCATCATGAAGTCATTGAGATAA
CTTTGGGAGCTTCCGAGAATCACTTTGAAAGCCAATCTGGATCATCTGAAATTGGAGCTACAGAAACCCCAGTGGAGACTAATGCTAGTGAAATTCACTCCAAAAATATA
TTAGTTGAAACGGATTTCAGCAGCAATTCCAGCTTGTCTTCATTATCAGAAGTCGATGAAACACCATTTGAGGTCAAAACAGATGAGGTGAGACCAAGTAGCCCTCGTGT
AGAGGAATCTCATATTGATACTACTAGCAGATCAATGCCAGCTGCCCTTGAAGCAGACTTCAAGATTTCCAGTGAAGTGTTGGATGACAATCATCCTAAGGAGCCTGTTT
ATGACTCGAGCCCTTCAGCAGAAGGCAAGTTCTCTTTTTCACCTGTTTCTTCTGATTTGTATGCATAA
mRNA sequence
ATGTCAGATGTTCGAATTCTACAAGTAGTTCAACTAGTAAGTCAAAGTAAACATAACGATATAATTGATTGGGATAGAAGCGTTCGAGAACGGGGAGAAGGATATGGGAT
TGGCTGTGGAAATGGATCATGTCCTTCTTTGTTTTCCCTTTTGGTGTCTGCGTCCCCTGTTTTGATTTGCACCGCTGTTCTGCTTGGAACCCTTCTGAGTTTTGGACAGC
CTAATATACCTGAAATTGAAACGGAGGAGAAGGTTTCGCGTGAGGTTGCTTCTTTGAGATCTGGGATTTTGGACAATGCTACTGTTGTTGCTAAGGAGGAAGATAGTTTT
ACTGTAGAGAGATTTGATGGAAATAAAGTAGAGAATTCTTATGTGGAAAGGGGTTCTGAAGATGACAGGAAAACAAGCGGGCTTGACGAACATGCTGGTTTTGTTGACTT
TGTTCCAGTGATCCACGAGCGTGATCGCGAAATTCAGTTTGAGAAGGGAGGAGTTGGCAAAGTTGAAGAGTTTGAGAAGGGTGAAGTTGAGCAGGCAGCAGCAGAGAGAG
AATTGCATAGTTCAGAGTTGGAGGAGAGGACAGAGATTTATGAGAGGGATTTGGATGTTAAGAGTTTGGCAATGGATGGTGAAAATGCTGTTGAGAACCAGCTTTTGGCT
GCCCAAAGCACGATAAATGAGATTCTTGAAGTAGAAGATCGTAACATCTTAATAGAACCTGTACATAAAGGTGATCATCTGAACTTATCTGTCAAAGTTAAAGATGATCG
TGATGAAAATGATTATGATTCTTCGGGTTCTGAGTCTGATAGAGCAGAAAGTTCTTCACCTGATGCCTCAATGGCCGATATCATACCGTTGCTGGATGAGTTGCATCCTC
TCTTGGACTCAGAAACTCCACTACCTATTCATAGGTCAAATGAGGAGTCAGATGCTTCTTCAGAACAGTCTCATAAAAGTGATGCTGAATGTGTGATGTCGGATGATGAG
GCTGAAAACCAGGGAGAAGAAGGCGGTGCTGTTGAGCATGATGAGGACGAGGATGACGATGATGATGAGGGGATGCAGGAAGAGAAAGAGGATGAGAGCAAATCTGCTAT
CAAGTGGACTGAGGATGATCAAAAGAATCTCATGGATTTGGGAAGTTTAGAGCTAGAAAGGAATCAGCGTTTGGAGAATCTCATTGCAAGGAGAAGAGCAAGGAACAACC
TGAGAATGTTGGCTGGGAAGAATTTGATAGACTTTGATGGTTTCGATCTTCCTGTAAATGTACCACCCATATTTACAGCCAAACGTAACCCATTTGATCTCCCTTATGAT
TCATATAGTAATATGGGATTACCACCGATTCCTGGGTCTGCTCCATCCATTTTGTTGCCAAGACGTAATCCATTTGATCTCCCATATGACCCAAATGAAGAAAAACCAGA
CCTCAAGAGTGACGAGTTTGAACAAGAATTTTTGCCACCTCTACACAAAGATATGTTCCGAAGACATGAAAGTTTCAGTGTGGGCCCTTCAAACTTTGCAGTTCCCAAGC
CAGAGCAGCAGAATATTAGATGGAAACCATATTTCATGCCTGAAAAAGTAGCTGCTGAAGAAACGAGCTACTCTCCATTAGAAAGGCAATTCAGTGAAGTCAGTGAATCA
AAACTAAGTTCTGTTTCTGATACTGAATCAATGAGTTCCATTGCAGATCAGGATGACAAGAAGCCTGATGAATCACAACCTTTTCTGGAAACAACAGCAGTTTCCTACCT
TGACGCCACAGCTAGTAGTATTGAGCATGGAAATGGGTCATGGGAGGATATAGGCTCTGAAGATTACGTACAAGAAGATAGAGATGTGCATCATGAAGTCATTGAGATAA
CTTTGGGAGCTTCCGAGAATCACTTTGAAAGCCAATCTGGATCATCTGAAATTGGAGCTACAGAAACCCCAGTGGAGACTAATGCTAGTGAAATTCACTCCAAAAATATA
TTAGTTGAAACGGATTTCAGCAGCAATTCCAGCTTGTCTTCATTATCAGAAGTCGATGAAACACCATTTGAGGTCAAAACAGATGAGGTGAGACCAAGTAGCCCTCGTGT
AGAGGAATCTCATATTGATACTACTAGCAGATCAATGCCAGCTGCCCTTGAAGCAGACTTCAAGATTTCCAGTGAAGTGTTGGATGACAATCATCCTAAGGAGCCTGTTT
ATGACTCGAGCCCTTCAGCAGAAGGCAAGTTCTCTTTTTCACCTGTTTCTTCTGATTTGTATGCATAA
Protein sequence
MSDVRILQVVQLVSQSKHNDIIDWDRSVRERGEGYGIGCGNGSCPSLFSLLVSASPVLICTAVLLGTLLSFGQPNIPEIETEEKVSREVASLRSGILDNATVVAKEEDSF
TVERFDGNKVENSYVERGSEDDRKTSGLDEHAGFVDFVPVIHERDREIQFEKGGVGKVEEFEKGEVEQAAAERELHSSELEERTEIYERDLDVKSLAMDGENAVENQLLA
AQSTINEILEVEDRNILIEPVHKGDHLNLSVKVKDDRDENDYDSSGSESDRAESSSPDASMADIIPLLDELHPLLDSETPLPIHRSNEESDASSEQSHKSDAECVMSDDE
AENQGEEGGAVEHDEDEDDDDDEGMQEEKEDESKSAIKWTEDDQKNLMDLGSLELERNQRLENLIARRRARNNLRMLAGKNLIDFDGFDLPVNVPPIFTAKRNPFDLPYD
SYSNMGLPPIPGSAPSILLPRRNPFDLPYDPNEEKPDLKSDEFEQEFLPPLHKDMFRRHESFSVGPSNFAVPKPEQQNIRWKPYFMPEKVAAEETSYSPLERQFSEVSES
KLSSVSDTESMSSIADQDDKKPDESQPFLETTAVSYLDATASSIEHGNGSWEDIGSEDYVQEDRDVHHEVIEITLGASENHFESQSGSSEIGATETPVETNASEIHSKNI
LVETDFSSNSSLSSLSEVDETPFEVKTDEVRPSSPRVEESHIDTTSRSMPAALEADFKISSEVLDDNHPKEPVYDSSPSAEGKFSFSPVSSDLYA