; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0011465 (gene) of Snake gourd v1 genome

Gene IDTan0011465
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionembryonic flower 1 (EMF1)
Genome locationLG02:1065308..1071378
RNA-Seq ExpressionTan0011465
SyntenyTan0011465
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0003677 - DNA binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022148072.1 uncharacterized protein LOC111016842 isoform X1 [Momordica charantia]0.0e+0073.7Show/hide
Query:  FSIREYALKMRGKDLRRSWPFSENVKEEVAEALLPPISVKKFRWWFHEMEIQKSNN---------CVREKEEMKV---EKICPVCGVFVTATVNAMNAHI
        FSIREYAL MRG+DL R WPF +NVK+EVAEA+LPPISV KFRWW HE+E  KS+N           +++EE KV   EKICPVCGVFVTATVNAMNAHI
Subjt:  FSIREYALKMRGKDLRRSWPFSENVKEEVAEALLPPISVKKFRWWFHEMEIQKSNN---------CVREKEEMKV---EKICPVCGVFVTATVNAMNAHI

Query:  DNCLAQT-TKEKRRN------KAKSRTPKKRSIAEIFAVAPPVETMIIVNDCDRENVVGKQKIPDKLKATSLARTLVSAMKTIKA-NNTKNKYNNN-HKN
        D+CLAQT T +KR+N      K KSRTPKKRSIAEIFAVAPPVET++       E+  G  +   +LKATSLARTLV+AMKTIKA  N ++K   +  KN
Subjt:  DNCLAQT-TKEKRRN------KAKSRTPKKRSIAEIFAVAPPVETMIIVNDCDRENVVGKQKIPDKLKATSLARTLVSAMKTIKA-NNTKNKYNNN-HKN

Query:  KDFGHEQLCKKGQRNHKDVSVRRCKKPCFKRLSRQKKRKLVKKSNVVAKQQRPVPPIRSILKHSVKVVSETNPSSTNLTGSNQVINNGGQNKSDRRVSFS
        KDFGHE L KKG+RNHKDVSV RCKKPCFKRLSRQKK+KLVKKSNV AKQQRPVP IRSILK SVKVVSET+PS  NL GS QVINNGG+ +SDRRVSF 
Subjt:  KDFGHEQLCKKGQRNHKDVSVRRCKKPCFKRLSRQKKRKLVKKSNVVAKQQRPVPPIRSILKHSVKVVSETNPSSTNLTGSNQVINNGGQNKSDRRVSFS

Query:  DKDDVLGPSTRAISDTFEQNDGSPFEASEGDTNSGETNKEVDSME-VGVNDDVFVSFSTRHEVDSQHMKGKIQLPNIHDQVNAQ-CSMRPHPCWDNANHS
        DKDDVLGP TRA SDTFEQ+ G+PF+ SEG+T SGE+NK V SME VG+NDD+ VSFSTRH VDSQ +KGKIQLPNIHDQVNAQ  SMRPHPCW N  H 
Subjt:  DKDDVLGPSTRAISDTFEQNDGSPFEASEGDTNSGETNKEVDSME-VGVNDDVFVSFSTRHEVDSQHMKGKIQLPNIHDQVNAQ-CSMRPHPCWDNANHS

Query:  AEKLIPANRVIPQENNLHLFDHVYVDAPQKLPSVDSAIP-----QEERQYGHVRTQCGSSFPRAHSFYGKSVDHLINPINGVAALSSMASTVPSFSSSEN
         E+ I ANRV+P E+N HLFDHVY+DAPQ+ P V SAIP     Q+ERQYG VRTQ GS+FP AH+F GKSVDHL+NPINGVA L SM STVP+F+ +EN
Subjt:  AEKLIPANRVIPQENNLHLFDHVYVDAPQKLPSVDSAIP-----QEERQYGHVRTQCGSSFPRAHSFYGKSVDHLINPINGVAALSSMASTVPSFSSSEN

Query:  AVGRFLNLAESPAKDTRCHFPNWEQSPVAYKEKGTNDGFFCLPLNSKGELIQLNSSMINRFDQMNEASNNMACSSRIPVCGLVLPRSTRDYFIDNETLLV
         VGR  NLAES AKD R  FPN EQ  VAYKEKG NDGFFCLPLNSKGELIQLNS ++NR+DQMNEA NNMACSSRIPVCGLV PRSTRDYFIDNE +L+
Subjt:  AVGRFLNLAESPAKDTRCHFPNWEQSPVAYKEKGTNDGFFCLPLNSKGELIQLNSSMINRFDQMNEASNNMACSSRIPVCGLVLPRSTRDYFIDNETLLV

Query:  DTELAGNQLTLFPLHSNMQENQNRLLSGRFHLAEPGTSETADIRLLNSERGTESGRFFHSNLMDRPFNRCRYYGKLQNQNVSAEIYPENSSSMLSNPARQ
        DTEL  NQLTLFPLHS MQEN+N+ LS RF + EPGTS   DIRLLNSERGT+SG   HSNLMD PFNRCRYYGKL NQNVS EIYPENSS+M +NPARQ
Subjt:  DTELAGNQLTLFPLHSNMQENQNRLLSGRFHLAEPGTSETADIRLLNSERGTESGRFFHSNLMDRPFNRCRYYGKLQNQNVSAEIYPENSSSMLSNPARQ

Query:  TMRLMGKDVAVGGNGKEVQEPEVINFWKNSTLIENCLTNPIQENPMRKRNFLQERV----------FYPAGFHGNQVAQRNLLPNAPQVRYPHSRLDKKN
        TMRLMGKDVAVGGNGKEVQEPE INFWKNS+LIENCLTN IQENPMRKRNFLQ+RV          FYPAGFH  QVAQ NLLPNAPQVRYPH RL++KN
Subjt:  TMRLMGKDVAVGGNGKEVQEPEVINFWKNSTLIENCLTNPIQENPMRKRNFLQERV----------FYPAGFHGNQVAQRNLLPNAPQVRYPHSRLDKKN

Query:  SIMNQRSNSVINLNERFNNIHAFSPLSTEAFNMEPNFQAPFISGPETL
         +M QRS+SVINLNERF+NI+AF P STEAFNM PNFQAPFISGP TL
Subjt:  SIMNQRSNSVINLNERFNNIHAFSPLSTEAFNMEPNFQAPFISGPETL

XP_022148073.1 uncharacterized protein LOC111016842 isoform X2 [Momordica charantia]0.0e+0073.44Show/hide
Query:  KDLRRSWPFSENVKEEVAEALLPPISVKKFRWWFHEMEIQKSNN---------CVREKEEMKV---EKICPVCGVFVTATVNAMNAHIDNCLAQT-TKEK
        +DL R WPF +NVK+EVAEA+LPPISV KFRWW HE+E  KS+N           +++EE KV   EKICPVCGVFVTATVNAMNAHID+CLAQT T +K
Subjt:  KDLRRSWPFSENVKEEVAEALLPPISVKKFRWWFHEMEIQKSNN---------CVREKEEMKV---EKICPVCGVFVTATVNAMNAHIDNCLAQT-TKEK

Query:  RRN------KAKSRTPKKRSIAEIFAVAPPVETMIIVNDCDRENVVGKQKIPDKLKATSLARTLVSAMKTIKA-NNTKNKYNNN-HKNKDFGHEQLCKKG
        R+N      K KSRTPKKRSIAEIFAVAPPVET++       E+  G  +   +LKATSLARTLV+AMKTIKA  N ++K   +  KNKDFGHE L KKG
Subjt:  RRN------KAKSRTPKKRSIAEIFAVAPPVETMIIVNDCDRENVVGKQKIPDKLKATSLARTLVSAMKTIKA-NNTKNKYNNN-HKNKDFGHEQLCKKG

Query:  QRNHKDVSVRRCKKPCFKRLSRQKKRKLVKKSNVVAKQQRPVPPIRSILKHSVKVVSETNPSSTNLTGSNQVINNGGQNKSDRRVSFSDKDDVLGPSTRA
        +RNHKDVSV RCKKPCFKRLSRQKK+KLVKKSNV AKQQRPVP IRSILK SVKVVSET+PS  NL GS QVINNGG+ +SDRRVSF DKDDVLGP TRA
Subjt:  QRNHKDVSVRRCKKPCFKRLSRQKKRKLVKKSNVVAKQQRPVPPIRSILKHSVKVVSETNPSSTNLTGSNQVINNGGQNKSDRRVSFSDKDDVLGPSTRA

Query:  ISDTFEQNDGSPFEASEGDTNSGETNKEVDSME-VGVNDDVFVSFSTRHEVDSQHMKGKIQLPNIHDQVNAQ-CSMRPHPCWDNANHSAEKLIPANRVIP
         SDTFEQ+ G+PF+ SEG+T SGE+NK V SME VG+NDD+ VSFSTRH VDSQ +KGKIQLPNIHDQVNAQ  SMRPHPCW N  H  E+ I ANRV+P
Subjt:  ISDTFEQNDGSPFEASEGDTNSGETNKEVDSME-VGVNDDVFVSFSTRHEVDSQHMKGKIQLPNIHDQVNAQ-CSMRPHPCWDNANHSAEKLIPANRVIP

Query:  QENNLHLFDHVYVDAPQKLPSVDSAIP-----QEERQYGHVRTQCGSSFPRAHSFYGKSVDHLINPINGVAALSSMASTVPSFSSSENAVGRFLNLAESP
         E+N HLFDHVY+DAPQ+ P V SAIP     Q+ERQYG VRTQ GS+FP AH+F GKSVDHL+NPINGVA L SM STVP+F+ +EN VGR  NLAES 
Subjt:  QENNLHLFDHVYVDAPQKLPSVDSAIP-----QEERQYGHVRTQCGSSFPRAHSFYGKSVDHLINPINGVAALSSMASTVPSFSSSENAVGRFLNLAESP

Query:  AKDTRCHFPNWEQSPVAYKEKGTNDGFFCLPLNSKGELIQLNSSMINRFDQMNEASNNMACSSRIPVCGLVLPRSTRDYFIDNETLLVDTELAGNQLTLF
        AKD R  FPN EQ  VAYKEKG NDGFFCLPLNSKGELIQLNS ++NR+DQMNEA NNMACSSRIPVCGLV PRSTRDYFIDNE +L+DTEL  NQLTLF
Subjt:  AKDTRCHFPNWEQSPVAYKEKGTNDGFFCLPLNSKGELIQLNSSMINRFDQMNEASNNMACSSRIPVCGLVLPRSTRDYFIDNETLLVDTELAGNQLTLF

Query:  PLHSNMQENQNRLLSGRFHLAEPGTSETADIRLLNSERGTESGRFFHSNLMDRPFNRCRYYGKLQNQNVSAEIYPENSSSMLSNPARQTMRLMGKDVAVG
        PLHS MQEN+N+ LS RF + EPGTS   DIRLLNSERGT+SG   HSNLMD PFNRCRYYGKL NQNVS EIYPENSS+M +NPARQTMRLMGKDVAVG
Subjt:  PLHSNMQENQNRLLSGRFHLAEPGTSETADIRLLNSERGTESGRFFHSNLMDRPFNRCRYYGKLQNQNVSAEIYPENSSSMLSNPARQTMRLMGKDVAVG

Query:  GNGKEVQEPEVINFWKNSTLIENCLTNPIQENPMRKRNFLQERV----------FYPAGFHGNQVAQRNLLPNAPQVRYPHSRLDKKNSIMNQRSNSVIN
        GNGKEVQEPE INFWKNS+LIENCLTN IQENPMRKRNFLQ+RV          FYPAGFH  QVAQ NLLPNAPQVRYPH RL++KN +M QRS+SVIN
Subjt:  GNGKEVQEPEVINFWKNSTLIENCLTNPIQENPMRKRNFLQERV----------FYPAGFHGNQVAQRNLLPNAPQVRYPHSRLDKKNSIMNQRSNSVIN

Query:  LNERFNNIHAFSPLSTEAFNMEPNFQAPFISGPETL
        LNERF+NI+AF P STEAFNM PNFQAPFISGP TL
Subjt:  LNERFNNIHAFSPLSTEAFNMEPNFQAPFISGPETL

XP_022969330.1 uncharacterized protein LOC111468375 isoform X3 [Cucurbita maxima]1.6e-29865.99Show/hide
Query:  FSIREYALKMRGKDLRRSWPFSENVKEEVAEALLPPISVKKFRWWFHEMEIQKSNNCVREKE-----EMKVEKICPVCGVFVTATVNAMNAHIDNCLAQT
        FSIREYAL MRG DL+RSWPFSENVK+EVA+ALLPP+ V+KFRWW H+         V EKE      ++++KICPVCGVFV ATVNAMNAHI +CLAQT
Subjt:  FSIREYALKMRGKDLRRSWPFSENVKEEVAEALLPPISVKKFRWWFHEMEIQKSNNCVREKE-----EMKVEKICPVCGVFVTATVNAMNAHIDNCLAQT

Query:  TKEKRRNK----AKSRTPKKRSIAEIFAVAPPVETMIIVNDCDRENVVGKQKIPDKLKATSLARTLVSAMKTIKANNTKN----------KYNNNHKNKD
        TKE+RRNK    AKSRTPKKRSIAEIFAVAPPV+TMII NDC+ E  +GKQ I DKLKATSLAR+LVSAMKTIKA NT+N          +     KNK+
Subjt:  TKEKRRNK----AKSRTPKKRSIAEIFAVAPPVETMIIVNDCDRENVVGKQKIPDKLKATSLARTLVSAMKTIKANNTKN----------KYNNNHKNKD

Query:  FGHEQLCKKGQRNHKDVSVRRCKKPCFKRLSRQKKRKLVKKSNVVAKQQRPVPPIRSILKHSVKVVSETNPSSTNLTGSNQVINNGGQNKSDRRVSFSDK
        FGHEQLCK G+RNHKDVS R CKKPCFKRLSRQK++KLVKKSNVV +QQRP+ P+RSILKHSVK +SET        GSNQ  NNGGQ K  +RVSF DK
Subjt:  FGHEQLCKKGQRNHKDVSVRRCKKPCFKRLSRQKKRKLVKKSNVVAKQQRPVPPIRSILKHSVKVVSETNPSSTNLTGSNQVINNGGQNKSDRRVSFSDK

Query:  DDVLGPSTRAISDTFEQNDGSPFEASEGDTNSGETNKEVDSMEVGVNDDVFVSFSTRHEVDSQHMKGKIQLPNIHDQVNAQCSMRPHPCWDNANHSAEKL
        DDVLGP+T A+SDTFEQ+  +PF+ASEG + SGE++K V SMEVGV DDV VS S RH+VDSQ                          WDNA HS EKL
Subjt:  DDVLGPSTRAISDTFEQNDGSPFEASEGDTNSGETNKEVDSMEVGVNDDVFVSFSTRHEVDSQHMKGKIQLPNIHDQVNAQCSMRPHPCWDNANHSAEKL

Query:  IPANRVIP-QENNLHLFDHVYVDAPQKLPSVDSAIP------QEERQYGHVRTQCGSSFPRAHSFYGKSVDHLINPINGVAALSSMASTVPSFSSSENAV
        I  NRVIP  +N+LHLFDHVYVDAPQKLP VDSA P      QEERQYGHVRTQC     RAHS YG                 S  S VPS S SENA 
Subjt:  IPANRVIP-QENNLHLFDHVYVDAPQKLPSVDSAIP------QEERQYGHVRTQCGSSFPRAHSFYGKSVDHLINPINGVAALSSMASTVPSFSSSENAV

Query:  GRFLNLAESPAKDTRCHFPNWEQSPVAYKEKGTNDGFFCLPLNSKGELIQLNSSMINRFDQMNEASNNMACSSRIPVCGLVLPRSTRDYFIDNETLLVDT
        GRFLNLA+S  KD RC FPN EQS VAYKEKG NDGFFCLPLNSKGELIQLNS ++NRF QMNEA+N MACSSRIPVC  VLPR TRDYFIDNE LLVDT
Subjt:  GRFLNLAESPAKDTRCHFPNWEQSPVAYKEKGTNDGFFCLPLNSKGELIQLNSSMINRFDQMNEASNNMACSSRIPVCGLVLPRSTRDYFIDNETLLVDT

Query:  ELAGNQLTLFPLHSNMQENQNRLLSGRFHLAEPGTSETADIRLLNSERGTESGRFFHSNLMDRPFNRCRYYGKLQNQNVSAEIYPENSSSMLSNPARQTM
        EL  NQLTLFPLHSN+QENQN+ LS RF + EPGT          SERGTESG F HSNLMD PF R RYYGKLQNQN S EI PE+SSS+ +NPARQTM
Subjt:  ELAGNQLTLFPLHSNMQENQNRLLSGRFHLAEPGTSETADIRLLNSERGTESGRFFHSNLMDRPFNRCRYYGKLQNQNVSAEIYPENSSSMLSNPARQTM

Query:  RLMGKDVAVGGNGKEVQEPEVINFWKNSTLIENCLTNPIQENPMRKRNFLQER-----------VFYPAGFHGNQVAQRNLLPNAPQVRYPHSRLDKKNS
        RLMGKDVAVG +GKE+QEPEVINFWKNSTLI+NCLTNPIQENP RKRNFLQ+R            ++PAGFH           NAPQVRYPH  L++   
Subjt:  RLMGKDVAVGGNGKEVQEPEVINFWKNSTLIENCLTNPIQENPMRKRNFLQER-----------VFYPAGFHGNQVAQRNLLPNAPQVRYPHSRLDKKNS

Query:  IMNQRSNSVINLNERF-NNIHAFSPLSTEAFNMEPNFQAPFISGPETLSQM-MKIMSSSLGFTVLRGFPHGCYTVTNGKKHRSQI
         M QR  SVINLNERF NN+H    +ST+AFNM PNFQAPFISGPETLSQM M+++S SLGF VLR F HGCY +TNGK+ + Q+
Subjt:  IMNQRSNSVINLNERF-NNIHAFSPLSTEAFNMEPNFQAPFISGPETLSQM-MKIMSSSLGFTVLRGFPHGCYTVTNGKKHRSQI

XP_023511520.1 uncharacterized protein LOC111776324 isoform X3 [Cucurbita pepo subsp. pepo]1.9e-29965.88Show/hide
Query:  FSIREYALKMRGKDLRRSWPFSENVKEEVAEALLPPISVKKFRWWFHEMEIQKSNNCVREKE------EMKVEKICPVCGVFVTATVNAMNAHIDNCLAQ
        FSIREYAL MRG DL+RSWPFSENVK+EVA+ALLPP+ V+KFRWW H+         V EKE       ++++KIC VCGVFV ATVNAMNAHID+CLAQ
Subjt:  FSIREYALKMRGKDLRRSWPFSENVKEEVAEALLPPISVKKFRWWFHEMEIQKSNNCVREKE------EMKVEKICPVCGVFVTATVNAMNAHIDNCLAQ

Query:  TTKEKRRNK---------AKSRTPKKRSIAEIFAVAPPVETMIIVNDCDR-ENVVGKQKIPDKLKATSLARTLVSAMKTIKANNTKN------------K
        TTKE+RRNK         AKSRTPKKRSIAEIFAVAPPV+TMII NDC+  E  +GKQ I DKLKATSLAR+LVSAMKTIKA NT+N            K
Subjt:  TTKEKRRNK---------AKSRTPKKRSIAEIFAVAPPVETMIIVNDCDR-ENVVGKQKIPDKLKATSLARTLVSAMKTIKANNTKN------------K

Query:  YNNNHKNKDFGHEQLCKKGQRNHKDVSVRRCKKPCFKRLSRQKKRKLVKKSNVVAKQQRPVPPIRSILKHSVKVVSETNPSSTNLTGSNQVINNGGQNKS
             KNK+FGHEQLCKKG+RNHKDVS R CKKPCFKRLSRQK++KLVKKSNVV +QQRP+ P+RSILKHSVK +SET        GSNQ  NNGGQ K 
Subjt:  YNNNHKNKDFGHEQLCKKGQRNHKDVSVRRCKKPCFKRLSRQKKRKLVKKSNVVAKQQRPVPPIRSILKHSVKVVSETNPSSTNLTGSNQVINNGGQNKS

Query:  DRRVSFSDKDDVLGPSTRAISDTFEQNDGSPFEASEGDTNSGETNKEVDSMEVGVNDDVFVSFSTRHEVDSQHMKGKIQLPNIHDQVNAQCSMRPHPCWD
         RRVSF DKDDVLGP+T A+SDTFEQ+  +PF+ASEG + SGE++K V SMEVGV DDV VSFS RH+VDSQ                          WD
Subjt:  DRRVSFSDKDDVLGPSTRAISDTFEQNDGSPFEASEGDTNSGETNKEVDSMEVGVNDDVFVSFSTRHEVDSQHMKGKIQLPNIHDQVNAQCSMRPHPCWD

Query:  NANHSAEKLIPANRVIPQ-ENNLHLFDHVYVDAPQKLPSVDSAIP------QEERQYGHVRTQCGSSFPRAHSFYGKSVDHLINPINGVAALSSMASTVP
        N  HS EKLI  NRVIP+ +N+LHLFD VYVDAPQKLP VDSA P      QEERQYGHVRTQC     RAHS YG                 S  S VP
Subjt:  NANHSAEKLIPANRVIPQ-ENNLHLFDHVYVDAPQKLPSVDSAIP------QEERQYGHVRTQCGSSFPRAHSFYGKSVDHLINPINGVAALSSMASTVP

Query:  SFSSSENAVGRFLNLAESPAKDTRCHFPNWEQSPVAYKEKGTNDGFFCLPLNSKGELIQLNSSMINRFDQMNEASNNMACSSRIPVCGLVLPRSTRDYFI
        S S SENA GRFLNLA+S  KD RC FPNWEQS VAYKEKG NDGFFCLPLNSKGELIQLNS ++NRF QMNEA+N MACSSRIPVC LVLPR TRDYFI
Subjt:  SFSSSENAVGRFLNLAESPAKDTRCHFPNWEQSPVAYKEKGTNDGFFCLPLNSKGELIQLNSSMINRFDQMNEASNNMACSSRIPVCGLVLPRSTRDYFI

Query:  DNETLLVDTELAGNQLTLFPLHSNMQENQNRLLSGRFHLAEPGTSETADIRLLNSERGTESGRFFHSNLMDRPFNRCRYYGKLQNQNVSAEIYPENSSSM
        DNE LLVDTEL  NQLTLFPLHSN+QENQN+ LS RF + EPGT          SERGTESGRF HSNLMD PF R RYYGKLQNQN S EI PE+SSS+
Subjt:  DNETLLVDTELAGNQLTLFPLHSNMQENQNRLLSGRFHLAEPGTSETADIRLLNSERGTESGRFFHSNLMDRPFNRCRYYGKLQNQNVSAEIYPENSSSM

Query:  LSNPARQTMRLMGKDVAVGGNGKEVQEPEVINFWKNSTLIENCLTNPIQENPMRKRNFLQER-----------VFYPAGFHGNQVAQRNLLPNAPQVRYP
         +NPARQTMRLMGKDVAVG +GKE+QEPEVINFWKNSTLI+NCLTNPIQENPMRKRNFLQ+R            ++PAGFH           NAPQVRYP
Subjt:  LSNPARQTMRLMGKDVAVGGNGKEVQEPEVINFWKNSTLIENCLTNPIQENPMRKRNFLQER-----------VFYPAGFHGNQVAQRNLLPNAPQVRYP

Query:  HSRLDKKNSIMNQRSNSVINLNERF-NNIHAFSPLSTEAFNMEPNFQAPFISGPETLSQM-MKIMSSSLGFTVLRGFPHGCYTVTNGKKHRSQI
        H  L++      QR +SVINLNERF NN+H    +ST+AFNM PNFQAPFISGPETLSQM M+++S SLGF VLR F HGCY +TNGK  + Q+
Subjt:  HSRLDKKNSIMNQRSNSVINLNERF-NNIHAFSPLSTEAFNMEPNFQAPFISGPETLSQM-MKIMSSSLGFTVLRGFPHGCYTVTNGKKHRSQI

XP_038888639.1 uncharacterized protein LOC120078436 [Benincasa hispida]2.4e-30269.06Show/hide
Query:  AVFSIREYALKMRGKDLRR-SWPFSENVKEEVAEALLPPISVKKFRWWFHEMEIQKSNNCVREKEEMKVEKICPVCGVFVTATVNAMNAHIDNCLAQ--T
        + FSIREYAL  R  DL R SWPFSE VK+EVAEALLPP+ VKKFRWW  E  I +    + E+  +K++KICPVCGVFV ATVNA+NAHID+CL    T
Subjt:  AVFSIREYALKMRGKDLRR-SWPFSENVKEEVAEALLPPISVKKFRWWFHEMEIQKSNNCVREKEEMKVEKICPVCGVFVTATVNAMNAHIDNCLAQ--T

Query:  TKEKRRN-KAKSRTPKKRSIAEIFAVAPPVETMIIVNDC----DRENVVGKQKI-----PDKLKATSLARTLVSAMKTIKANNTKNKYNNNH--KNKDFG
        +KE R+  KAKSRTPKKRSIA+IFAVAPPV+TMII NDC    + +  VGKQ I      + LK TSLA +LVS +KTI     + + +  H  K KDFG
Subjt:  TKEKRRN-KAKSRTPKKRSIAEIFAVAPPVETMIIVNDC----DRENVVGKQKI-----PDKLKATSLARTLVSAMKTIKANNTKNKYNNNH--KNKDFG

Query:  HEQLCKKGQ-RNHKDVSVRRCKKPCFKRLSRQKKRKLVKKSNVVAKQQRPVPPIRSILKHSVKVVSETNPSSTNLTG-SNQVINNGGQNKSDRRVSFSDK
        H QLC+KG+ RNHKDVS   CKKPCFKRL RQK++KLVKKSNVVAKQQRP+P +RSILKHSVK  SETN SS NL G +NQV NNGG  KSDRRVSF DK
Subjt:  HEQLCKKGQ-RNHKDVSVRRCKKPCFKRLSRQKKRKLVKKSNVVAKQQRPVPPIRSILKHSVKVVSETNPSSTNLTG-SNQVINNGGQNKSDRRVSFSDK

Query:  DDVLGPSTRAISDTFEQNDGSPFEASEGDTNSGETNKEVDSMEVGVNDDVFVSFSTRHEVDSQHMKGKIQLPNIHDQVNAQCSMRPHPCWDNANHSAEKL
        DDVLG ST   SDTFEQN G+PF+ASE  TNSGE+NKEV  +E  +NDD  V FST+HEVD QH KGKIQLPN H+QVNA+        WDNA HS E L
Subjt:  DDVLGPSTRAISDTFEQNDGSPFEASEGDTNSGETNKEVDSMEVGVNDDVFVSFSTRHEVDSQHMKGKIQLPNIHDQVNAQCSMRPHPCWDNANHSAEKL

Query:  IPANRVIP-QENNLHLFDHVYVDAPQKLPSVDSAIP-----QEERQYGHVRTQCG-SSFPRAHSFYGKSVDHLINPI-NGVAALSSMASTVPSFSSSENA
        I  N+ IP  +N+L LFDHVYVD  QKL  V SAIP     QEERQYGHVRTQCG +S  +AHS YGKS DHLINP  NGVAAL S+ S VPS S SEN 
Subjt:  IPANRVIP-QENNLHLFDHVYVDAPQKLPSVDSAIP-----QEERQYGHVRTQCG-SSFPRAHSFYGKSVDHLINPI-NGVAALSSMASTVPSFSSSENA

Query:  VGRFLNLAESPAKDTRCHFPNWEQSPVAYKEKGTNDGFFCLPLNSKGELIQLNSSMINRFDQMNEASNNMACSSRIPVCGLVLPRSTRDYFIDNETLLVD
        V RFLNLAES  KDT   F N E+S V+YKEKG NDGFFCLPLNSKGELIQLNS +INRFDQMNEASN +ACSSRIPVC LVLPRS RDYFIDNE LLVD
Subjt:  VGRFLNLAESPAKDTRCHFPNWEQSPVAYKEKGTNDGFFCLPLNSKGELIQLNSSMINRFDQMNEASNNMACSSRIPVCGLVLPRSTRDYFIDNETLLVD

Query:  TELAGNQLTLFPLHSNMQENQNRLLSGRFHLAEPG-TSETADIRLLNSERGTESGRFFHSNLMDRPFNRCRYYGKLQNQNVSAEIYPENSSSMLSNPARQ
        TEL GNQLTLFPLHS++ ENQNR     F ++EPG TSETADIRL+NSERGTESGRFFH NLMD P+NRCRYYGK QNQNVS + YPENSSSM +NP +Q
Subjt:  TELAGNQLTLFPLHSNMQENQNRLLSGRFHLAEPG-TSETADIRLLNSERGTESGRFFHSNLMDRPFNRCRYYGKLQNQNVSAEIYPENSSSMLSNPARQ

Query:  TMRLMGKDVAVGGNGKEVQEPEVINFWKNSTLIENCLTNPIQENPMRKRNFLQER-----------VFYPAGFHGNQVAQRNLLPNAPQVRYPHSRLDKK
        TMRLMGKDVAVGGN +EVQEPEVINFWKNSTLI NCLTNPIQE  MRKRNFLQ+R            ++PAGFHGNQVAQ N   NA QVRYPH  L++K
Subjt:  TMRLMGKDVAVGGNGKEVQEPEVINFWKNSTLIENCLTNPIQENPMRKRNFLQER-----------VFYPAGFHGNQVAQRNLLPNAPQVRYPHSRLDKK

Query:  NSIMNQRSNSVINLNERF-NNIHAFSPLSTEAFNMEPNFQAPFISGPETL
        +SIM QR +SVINLNE F NNIHAFSP ST+ FNM  NFQ PFISGPETL
Subjt:  NSIMNQRSNSVINLNERF-NNIHAFSPLSTEAFNMEPNFQAPFISGPETL

TrEMBL top hitse value%identityAlignment
A0A0A0KJS6 Uncharacterized protein8.0e-29666.86Show/hide
Query:  AVFSIREYALKMRGKDLRR-SWPFSENVKEEVAEALLPPISVKKFRWWF-------HEMEIQKSNNCVREKEEMKVEKICPVCGVFVTATVNAMNAHIDN
        + FSIREYAL  R   L   SWPFSE VK+EVAE+LLPP+ VKKFRWW         E E ++        E +K++KICPVCGVFV ATV A+NAHID 
Subjt:  AVFSIREYALKMRGKDLRR-SWPFSENVKEEVAEALLPPISVKKFRWWF-------HEMEIQKSNNCVREKEEMKVEKICPVCGVFVTATVNAMNAHIDN

Query:  CLAQTT-KEKRRN---KAKSRTPKKRSIAEIFAVAPPVETMIIVNDC----DRENVVGKQKI--PDKLKATSLARTLVSAMKTIK-------------AN
        CLAQTT KE RR    KAKSRTPKKRSIAEIFAVAPPV+TMI+VNDC    + +  VGKQ I     LK TSLA +LVSA+KTIK             A 
Subjt:  CLAQTT-KEKRRN---KAKSRTPKKRSIAEIFAVAPPVETMIIVNDC----DRENVVGKQKI--PDKLKATSLARTLVSAMKTIK-------------AN

Query:  NTKNKYNNNHKNKDFGHEQLCKKGQ-RNHKDVSVRRCKKPCFKRLSRQKKRKLVKKSNVVAKQQRPVPPIRSILKHSVKVVSETNPSSTNLTGSNQVINN
          K K     KNKDF H +LCKKG  RNHKDVS    ++PCFKRLS+QKK+KL KKS VVAKQQRP+PP+RSILKHSVK +SETN S  NL GSNQ  NN
Subjt:  NTKNKYNNNHKNKDFGHEQLCKKGQ-RNHKDVSVRRCKKPCFKRLSRQKKRKLVKKSNVVAKQQRPVPPIRSILKHSVKVVSETNPSSTNLTGSNQVINN

Query:  GGQNKSDRRVSFSDKDDVLGPSTRAISDTFEQNDGSPFEASEGDTNSGETNKEVDSMEVGVNDDVFVSFSTRHEVDSQHMKGKIQLPNIHDQVNAQCSMR
        GGQ KSDRRVSF DKDDVLGPSTR ISDTFEQN G+PF+ASE  TNSGE+NKEV SME  +NDDV    STRH+VDSQH+KGKIQLPN H+QVNAQ    
Subjt:  GGQNKSDRRVSFSDKDDVLGPSTRAISDTFEQNDGSPFEASEGDTNSGETNKEVDSMEVGVNDDVFVSFSTRHEVDSQHMKGKIQLPNIHDQVNAQCSMR

Query:  PHPCWDNANHSAEKLIPANRVIPQE-NNLHLFDHVYVDAPQKLPSVDSAIP-----QEERQYGHVRTQCG-SSFPRAHSFYGKSVDHLI---NPINGVAA
            W+N  HS EKLI  +R IP + N+LHLFDHVYVDA QKLP   SAIP     QEER YGHVRTQCG +  P+AHS YGKSVDHLI   N  NGVAA
Subjt:  PHPCWDNANHSAEKLIPANRVIPQE-NNLHLFDHVYVDAPQKLPSVDSAIP-----QEERQYGHVRTQCG-SSFPRAHSFYGKSVDHLI---NPINGVAA

Query:  LSSMASTVPSFSSSENAVGRFLNLAESPAKDT-RCHFPNWEQSPVAYKEKGTNDGFFCLPLNSKGELIQLNSSMINRFDQMNEASNNMACSSRIPVCGLV
        L S+ S VPS S +EN V RFLNLAES A+D+ R    N EQ  V YKEKG NDGFFCLPLNS+GELIQLNS + +RFDQMNEA+  +A SSRIPVC  V
Subjt:  LSSMASTVPSFSSSENAVGRFLNLAESPAKDT-RCHFPNWEQSPVAYKEKGTNDGFFCLPLNSKGELIQLNSSMINRFDQMNEASNNMACSSRIPVCGLV

Query:  LPRSTRDYFIDNETLLVDTELAGNQLTLFPLHSNMQENQNRLLSGRFHLAEPGTSETADIRLLNSERGTESGRFFHSNLMDRPFNRCRYYGKLQNQNVSA
        +PRS RDYF+DNE L +DT+L GNQLTLFPLHS+MQENQNR L   F + EPGTSETADIRL+NSERGTE+GRFFH NLMD PFNRCRYY K QNQNVSA
Subjt:  LPRSTRDYFIDNETLLVDTELAGNQLTLFPLHSNMQENQNRLLSGRFHLAEPGTSETADIRLLNSERGTESGRFFHSNLMDRPFNRCRYYGKLQNQNVSA

Query:  EIYPENSSSMLSNPARQTMRLMGKDVAVGGNGKEVQEPEVINFWKNSTLIENCLTNPIQENPMRKRNFLQER-----------VFYPAGFHGNQVAQRNL
        + YPENSSSM +NP RQTMRLMGKDVAVGGNGK+VQEPEVINFWKNS LI NCLTNPIQE  MRKRNFLQ+R            ++PAGFHGNQVAQ NL
Subjt:  EIYPENSSSMLSNPARQTMRLMGKDVAVGGNGKEVQEPEVINFWKNSTLIENCLTNPIQENPMRKRNFLQER-----------VFYPAGFHGNQVAQRNL

Query:  LPNAPQ-VRYPHSRLDKKNSIMNQRSNSVINLNERFNNIHAFSPLSTEAFNMEPNFQAPFISGPET
        L NAPQ VRYPH   ++K+S++  R  SVINLNERFNNIH+F   ST+  NM  NFQAPF+SG ET
Subjt:  LPNAPQ-VRYPHSRLDKKNSIMNQRSNSVINLNERFNNIHAFSPLSTEAFNMEPNFQAPFISGPET

A0A6J1D325 uncharacterized protein LOC111016842 isoform X20.0e+0073.44Show/hide
Query:  KDLRRSWPFSENVKEEVAEALLPPISVKKFRWWFHEMEIQKSNN---------CVREKEEMKV---EKICPVCGVFVTATVNAMNAHIDNCLAQT-TKEK
        +DL R WPF +NVK+EVAEA+LPPISV KFRWW HE+E  KS+N           +++EE KV   EKICPVCGVFVTATVNAMNAHID+CLAQT T +K
Subjt:  KDLRRSWPFSENVKEEVAEALLPPISVKKFRWWFHEMEIQKSNN---------CVREKEEMKV---EKICPVCGVFVTATVNAMNAHIDNCLAQT-TKEK

Query:  RRN------KAKSRTPKKRSIAEIFAVAPPVETMIIVNDCDRENVVGKQKIPDKLKATSLARTLVSAMKTIKA-NNTKNKYNNN-HKNKDFGHEQLCKKG
        R+N      K KSRTPKKRSIAEIFAVAPPVET++       E+  G  +   +LKATSLARTLV+AMKTIKA  N ++K   +  KNKDFGHE L KKG
Subjt:  RRN------KAKSRTPKKRSIAEIFAVAPPVETMIIVNDCDRENVVGKQKIPDKLKATSLARTLVSAMKTIKA-NNTKNKYNNN-HKNKDFGHEQLCKKG

Query:  QRNHKDVSVRRCKKPCFKRLSRQKKRKLVKKSNVVAKQQRPVPPIRSILKHSVKVVSETNPSSTNLTGSNQVINNGGQNKSDRRVSFSDKDDVLGPSTRA
        +RNHKDVSV RCKKPCFKRLSRQKK+KLVKKSNV AKQQRPVP IRSILK SVKVVSET+PS  NL GS QVINNGG+ +SDRRVSF DKDDVLGP TRA
Subjt:  QRNHKDVSVRRCKKPCFKRLSRQKKRKLVKKSNVVAKQQRPVPPIRSILKHSVKVVSETNPSSTNLTGSNQVINNGGQNKSDRRVSFSDKDDVLGPSTRA

Query:  ISDTFEQNDGSPFEASEGDTNSGETNKEVDSME-VGVNDDVFVSFSTRHEVDSQHMKGKIQLPNIHDQVNAQ-CSMRPHPCWDNANHSAEKLIPANRVIP
         SDTFEQ+ G+PF+ SEG+T SGE+NK V SME VG+NDD+ VSFSTRH VDSQ +KGKIQLPNIHDQVNAQ  SMRPHPCW N  H  E+ I ANRV+P
Subjt:  ISDTFEQNDGSPFEASEGDTNSGETNKEVDSME-VGVNDDVFVSFSTRHEVDSQHMKGKIQLPNIHDQVNAQ-CSMRPHPCWDNANHSAEKLIPANRVIP

Query:  QENNLHLFDHVYVDAPQKLPSVDSAIP-----QEERQYGHVRTQCGSSFPRAHSFYGKSVDHLINPINGVAALSSMASTVPSFSSSENAVGRFLNLAESP
         E+N HLFDHVY+DAPQ+ P V SAIP     Q+ERQYG VRTQ GS+FP AH+F GKSVDHL+NPINGVA L SM STVP+F+ +EN VGR  NLAES 
Subjt:  QENNLHLFDHVYVDAPQKLPSVDSAIP-----QEERQYGHVRTQCGSSFPRAHSFYGKSVDHLINPINGVAALSSMASTVPSFSSSENAVGRFLNLAESP

Query:  AKDTRCHFPNWEQSPVAYKEKGTNDGFFCLPLNSKGELIQLNSSMINRFDQMNEASNNMACSSRIPVCGLVLPRSTRDYFIDNETLLVDTELAGNQLTLF
        AKD R  FPN EQ  VAYKEKG NDGFFCLPLNSKGELIQLNS ++NR+DQMNEA NNMACSSRIPVCGLV PRSTRDYFIDNE +L+DTEL  NQLTLF
Subjt:  AKDTRCHFPNWEQSPVAYKEKGTNDGFFCLPLNSKGELIQLNSSMINRFDQMNEASNNMACSSRIPVCGLVLPRSTRDYFIDNETLLVDTELAGNQLTLF

Query:  PLHSNMQENQNRLLSGRFHLAEPGTSETADIRLLNSERGTESGRFFHSNLMDRPFNRCRYYGKLQNQNVSAEIYPENSSSMLSNPARQTMRLMGKDVAVG
        PLHS MQEN+N+ LS RF + EPGTS   DIRLLNSERGT+SG   HSNLMD PFNRCRYYGKL NQNVS EIYPENSS+M +NPARQTMRLMGKDVAVG
Subjt:  PLHSNMQENQNRLLSGRFHLAEPGTSETADIRLLNSERGTESGRFFHSNLMDRPFNRCRYYGKLQNQNVSAEIYPENSSSMLSNPARQTMRLMGKDVAVG

Query:  GNGKEVQEPEVINFWKNSTLIENCLTNPIQENPMRKRNFLQERV----------FYPAGFHGNQVAQRNLLPNAPQVRYPHSRLDKKNSIMNQRSNSVIN
        GNGKEVQEPE INFWKNS+LIENCLTN IQENPMRKRNFLQ+RV          FYPAGFH  QVAQ NLLPNAPQVRYPH RL++KN +M QRS+SVIN
Subjt:  GNGKEVQEPEVINFWKNSTLIENCLTNPIQENPMRKRNFLQERV----------FYPAGFHGNQVAQRNLLPNAPQVRYPHSRLDKKNSIMNQRSNSVIN

Query:  LNERFNNIHAFSPLSTEAFNMEPNFQAPFISGPETL
        LNERF+NI+AF P STEAFNM PNFQAPFISGP TL
Subjt:  LNERFNNIHAFSPLSTEAFNMEPNFQAPFISGPETL

A0A6J1D428 uncharacterized protein LOC111016842 isoform X10.0e+0073.7Show/hide
Query:  FSIREYALKMRGKDLRRSWPFSENVKEEVAEALLPPISVKKFRWWFHEMEIQKSNN---------CVREKEEMKV---EKICPVCGVFVTATVNAMNAHI
        FSIREYAL MRG+DL R WPF +NVK+EVAEA+LPPISV KFRWW HE+E  KS+N           +++EE KV   EKICPVCGVFVTATVNAMNAHI
Subjt:  FSIREYALKMRGKDLRRSWPFSENVKEEVAEALLPPISVKKFRWWFHEMEIQKSNN---------CVREKEEMKV---EKICPVCGVFVTATVNAMNAHI

Query:  DNCLAQT-TKEKRRN------KAKSRTPKKRSIAEIFAVAPPVETMIIVNDCDRENVVGKQKIPDKLKATSLARTLVSAMKTIKA-NNTKNKYNNN-HKN
        D+CLAQT T +KR+N      K KSRTPKKRSIAEIFAVAPPVET++       E+  G  +   +LKATSLARTLV+AMKTIKA  N ++K   +  KN
Subjt:  DNCLAQT-TKEKRRN------KAKSRTPKKRSIAEIFAVAPPVETMIIVNDCDRENVVGKQKIPDKLKATSLARTLVSAMKTIKA-NNTKNKYNNN-HKN

Query:  KDFGHEQLCKKGQRNHKDVSVRRCKKPCFKRLSRQKKRKLVKKSNVVAKQQRPVPPIRSILKHSVKVVSETNPSSTNLTGSNQVINNGGQNKSDRRVSFS
        KDFGHE L KKG+RNHKDVSV RCKKPCFKRLSRQKK+KLVKKSNV AKQQRPVP IRSILK SVKVVSET+PS  NL GS QVINNGG+ +SDRRVSF 
Subjt:  KDFGHEQLCKKGQRNHKDVSVRRCKKPCFKRLSRQKKRKLVKKSNVVAKQQRPVPPIRSILKHSVKVVSETNPSSTNLTGSNQVINNGGQNKSDRRVSFS

Query:  DKDDVLGPSTRAISDTFEQNDGSPFEASEGDTNSGETNKEVDSME-VGVNDDVFVSFSTRHEVDSQHMKGKIQLPNIHDQVNAQ-CSMRPHPCWDNANHS
        DKDDVLGP TRA SDTFEQ+ G+PF+ SEG+T SGE+NK V SME VG+NDD+ VSFSTRH VDSQ +KGKIQLPNIHDQVNAQ  SMRPHPCW N  H 
Subjt:  DKDDVLGPSTRAISDTFEQNDGSPFEASEGDTNSGETNKEVDSME-VGVNDDVFVSFSTRHEVDSQHMKGKIQLPNIHDQVNAQ-CSMRPHPCWDNANHS

Query:  AEKLIPANRVIPQENNLHLFDHVYVDAPQKLPSVDSAIP-----QEERQYGHVRTQCGSSFPRAHSFYGKSVDHLINPINGVAALSSMASTVPSFSSSEN
         E+ I ANRV+P E+N HLFDHVY+DAPQ+ P V SAIP     Q+ERQYG VRTQ GS+FP AH+F GKSVDHL+NPINGVA L SM STVP+F+ +EN
Subjt:  AEKLIPANRVIPQENNLHLFDHVYVDAPQKLPSVDSAIP-----QEERQYGHVRTQCGSSFPRAHSFYGKSVDHLINPINGVAALSSMASTVPSFSSSEN

Query:  AVGRFLNLAESPAKDTRCHFPNWEQSPVAYKEKGTNDGFFCLPLNSKGELIQLNSSMINRFDQMNEASNNMACSSRIPVCGLVLPRSTRDYFIDNETLLV
         VGR  NLAES AKD R  FPN EQ  VAYKEKG NDGFFCLPLNSKGELIQLNS ++NR+DQMNEA NNMACSSRIPVCGLV PRSTRDYFIDNE +L+
Subjt:  AVGRFLNLAESPAKDTRCHFPNWEQSPVAYKEKGTNDGFFCLPLNSKGELIQLNSSMINRFDQMNEASNNMACSSRIPVCGLVLPRSTRDYFIDNETLLV

Query:  DTELAGNQLTLFPLHSNMQENQNRLLSGRFHLAEPGTSETADIRLLNSERGTESGRFFHSNLMDRPFNRCRYYGKLQNQNVSAEIYPENSSSMLSNPARQ
        DTEL  NQLTLFPLHS MQEN+N+ LS RF + EPGTS   DIRLLNSERGT+SG   HSNLMD PFNRCRYYGKL NQNVS EIYPENSS+M +NPARQ
Subjt:  DTELAGNQLTLFPLHSNMQENQNRLLSGRFHLAEPGTSETADIRLLNSERGTESGRFFHSNLMDRPFNRCRYYGKLQNQNVSAEIYPENSSSMLSNPARQ

Query:  TMRLMGKDVAVGGNGKEVQEPEVINFWKNSTLIENCLTNPIQENPMRKRNFLQERV----------FYPAGFHGNQVAQRNLLPNAPQVRYPHSRLDKKN
        TMRLMGKDVAVGGNGKEVQEPE INFWKNS+LIENCLTN IQENPMRKRNFLQ+RV          FYPAGFH  QVAQ NLLPNAPQVRYPH RL++KN
Subjt:  TMRLMGKDVAVGGNGKEVQEPEVINFWKNSTLIENCLTNPIQENPMRKRNFLQERV----------FYPAGFHGNQVAQRNLLPNAPQVRYPHSRLDKKN

Query:  SIMNQRSNSVINLNERFNNIHAFSPLSTEAFNMEPNFQAPFISGPETL
         +M QRS+SVINLNERF+NI+AF P STEAFNM PNFQAPFISGP TL
Subjt:  SIMNQRSNSVINLNERFNNIHAFSPLSTEAFNMEPNFQAPFISGPETL

A0A6J1HZM3 uncharacterized protein LOC111468375 isoform X37.7e-29965.99Show/hide
Query:  FSIREYALKMRGKDLRRSWPFSENVKEEVAEALLPPISVKKFRWWFHEMEIQKSNNCVREKE-----EMKVEKICPVCGVFVTATVNAMNAHIDNCLAQT
        FSIREYAL MRG DL+RSWPFSENVK+EVA+ALLPP+ V+KFRWW H+         V EKE      ++++KICPVCGVFV ATVNAMNAHI +CLAQT
Subjt:  FSIREYALKMRGKDLRRSWPFSENVKEEVAEALLPPISVKKFRWWFHEMEIQKSNNCVREKE-----EMKVEKICPVCGVFVTATVNAMNAHIDNCLAQT

Query:  TKEKRRNK----AKSRTPKKRSIAEIFAVAPPVETMIIVNDCDRENVVGKQKIPDKLKATSLARTLVSAMKTIKANNTKN----------KYNNNHKNKD
        TKE+RRNK    AKSRTPKKRSIAEIFAVAPPV+TMII NDC+ E  +GKQ I DKLKATSLAR+LVSAMKTIKA NT+N          +     KNK+
Subjt:  TKEKRRNK----AKSRTPKKRSIAEIFAVAPPVETMIIVNDCDRENVVGKQKIPDKLKATSLARTLVSAMKTIKANNTKN----------KYNNNHKNKD

Query:  FGHEQLCKKGQRNHKDVSVRRCKKPCFKRLSRQKKRKLVKKSNVVAKQQRPVPPIRSILKHSVKVVSETNPSSTNLTGSNQVINNGGQNKSDRRVSFSDK
        FGHEQLCK G+RNHKDVS R CKKPCFKRLSRQK++KLVKKSNVV +QQRP+ P+RSILKHSVK +SET        GSNQ  NNGGQ K  +RVSF DK
Subjt:  FGHEQLCKKGQRNHKDVSVRRCKKPCFKRLSRQKKRKLVKKSNVVAKQQRPVPPIRSILKHSVKVVSETNPSSTNLTGSNQVINNGGQNKSDRRVSFSDK

Query:  DDVLGPSTRAISDTFEQNDGSPFEASEGDTNSGETNKEVDSMEVGVNDDVFVSFSTRHEVDSQHMKGKIQLPNIHDQVNAQCSMRPHPCWDNANHSAEKL
        DDVLGP+T A+SDTFEQ+  +PF+ASEG + SGE++K V SMEVGV DDV VS S RH+VDSQ                          WDNA HS EKL
Subjt:  DDVLGPSTRAISDTFEQNDGSPFEASEGDTNSGETNKEVDSMEVGVNDDVFVSFSTRHEVDSQHMKGKIQLPNIHDQVNAQCSMRPHPCWDNANHSAEKL

Query:  IPANRVIP-QENNLHLFDHVYVDAPQKLPSVDSAIP------QEERQYGHVRTQCGSSFPRAHSFYGKSVDHLINPINGVAALSSMASTVPSFSSSENAV
        I  NRVIP  +N+LHLFDHVYVDAPQKLP VDSA P      QEERQYGHVRTQC     RAHS YG                 S  S VPS S SENA 
Subjt:  IPANRVIP-QENNLHLFDHVYVDAPQKLPSVDSAIP------QEERQYGHVRTQCGSSFPRAHSFYGKSVDHLINPINGVAALSSMASTVPSFSSSENAV

Query:  GRFLNLAESPAKDTRCHFPNWEQSPVAYKEKGTNDGFFCLPLNSKGELIQLNSSMINRFDQMNEASNNMACSSRIPVCGLVLPRSTRDYFIDNETLLVDT
        GRFLNLA+S  KD RC FPN EQS VAYKEKG NDGFFCLPLNSKGELIQLNS ++NRF QMNEA+N MACSSRIPVC  VLPR TRDYFIDNE LLVDT
Subjt:  GRFLNLAESPAKDTRCHFPNWEQSPVAYKEKGTNDGFFCLPLNSKGELIQLNSSMINRFDQMNEASNNMACSSRIPVCGLVLPRSTRDYFIDNETLLVDT

Query:  ELAGNQLTLFPLHSNMQENQNRLLSGRFHLAEPGTSETADIRLLNSERGTESGRFFHSNLMDRPFNRCRYYGKLQNQNVSAEIYPENSSSMLSNPARQTM
        EL  NQLTLFPLHSN+QENQN+ LS RF + EPGT          SERGTESG F HSNLMD PF R RYYGKLQNQN S EI PE+SSS+ +NPARQTM
Subjt:  ELAGNQLTLFPLHSNMQENQNRLLSGRFHLAEPGTSETADIRLLNSERGTESGRFFHSNLMDRPFNRCRYYGKLQNQNVSAEIYPENSSSMLSNPARQTM

Query:  RLMGKDVAVGGNGKEVQEPEVINFWKNSTLIENCLTNPIQENPMRKRNFLQER-----------VFYPAGFHGNQVAQRNLLPNAPQVRYPHSRLDKKNS
        RLMGKDVAVG +GKE+QEPEVINFWKNSTLI+NCLTNPIQENP RKRNFLQ+R            ++PAGFH           NAPQVRYPH  L++   
Subjt:  RLMGKDVAVGGNGKEVQEPEVINFWKNSTLIENCLTNPIQENPMRKRNFLQER-----------VFYPAGFHGNQVAQRNLLPNAPQVRYPHSRLDKKNS

Query:  IMNQRSNSVINLNERF-NNIHAFSPLSTEAFNMEPNFQAPFISGPETLSQM-MKIMSSSLGFTVLRGFPHGCYTVTNGKKHRSQI
         M QR  SVINLNERF NN+H    +ST+AFNM PNFQAPFISGPETLSQM M+++S SLGF VLR F HGCY +TNGK+ + Q+
Subjt:  IMNQRSNSVINLNERF-NNIHAFSPLSTEAFNMEPNFQAPFISGPETLSQM-MKIMSSSLGFTVLRGFPHGCYTVTNGKKHRSQI

A0A6J1JPI0 uncharacterized protein LOC111486332 isoform X11.7e-29067.93Show/hide
Query:  FSIREYALKMRGKDL-RRSWPFSENVKEEVAEALLPPISVKKFRWWFHEMEIQKSNNCVREKEEMKVEKICPVCGVFVTATVNAMNAHIDNCLAQTTKEK
        FSIREYALKMRGKDL RRSWPFSE VKEEVAEALLPPISV KFRWW  E++I KSN  V   E+ KV+KICPVCGVFVTATVNAM+AHID CLA TTKEK
Subjt:  FSIREYALKMRGKDL-RRSWPFSENVKEEVAEALLPPISVKKFRWWFHEMEIQKSNNCVREKEEMKVEKICPVCGVFVTATVNAMNAHIDNCLAQTTKEK

Query:  RRN-----------KAKSRTPKKRSIAEIFAVAPPVETMIIVNDCDRENVVGKQKIPDKLKATSLARTLVSAMKTIKANNTKNKYNNNHKNKDFGHEQLC
        R+N           KAKSR PKKRSIAEIFAVAPPVETM +++DC+ E V GKQ+  DK+KATSLA TLVSAMKT+KANN     NNN+KNK+FGHEQLC
Subjt:  RRN-----------KAKSRTPKKRSIAEIFAVAPPVETMIIVNDCDRENVVGKQKIPDKLKATSLARTLVSAMKTIKANNTKNKYNNNHKNKDFGHEQLC

Query:  KKGQRNHKDVSVRRCKKPCFKRLSRQKKRKLVKKSNVVAKQQRPVPPIRSILKHSVKVVSETNPSSTNLTGSNQVINNGGQNKSDRRVSFSDKDDVLGPS
        KKG RNHK V V  CKKPCFKRLSRQK +K VKKSNVVAKQQR VPPIRSILKHSV     TN SSTN   S+QVINNG + KSDRRVSFSDK DVLGPS
Subjt:  KKGQRNHKDVSVRRCKKPCFKRLSRQKKRKLVKKSNVVAKQQRPVPPIRSILKHSVKVVSETNPSSTNLTGSNQVINNGGQNKSDRRVSFSDKDDVLGPS

Query:  TRAISDTFEQNDGSPFEASEGDTNSGETNKEVDSMEVGVNDDVFVSFSTRHEVDSQHMKGKIQLPNIHDQVNAQCSMRPHPCWDNANHSAEKLIPANRVI
        T  +     Q  GSPF+ SEG+TNSGE+N  VDSMEVG+N+D                                   R HPCWD  NHSAEK I  NRVI
Subjt:  TRAISDTFEQNDGSPFEASEGDTNSGETNKEVDSMEVGVNDDVFVSFSTRHEVDSQHMKGKIQLPNIHDQVNAQCSMRPHPCWDNANHSAEKLIPANRVI

Query:  PQENNLHLFDHVYVDAPQKLPSVDSAIP-----QEERQYGHVRTQCGSSFPRAHSFYGKSVDHLINPINGVAALSSMASTVPSFSSSENAVGRFLNLAES
        P EN+LHLFDH     PQKLPSV SAIP     QEERQYGH           AHSF GKSVD+LI P+NGVAAL            SENA GRFLNLAES
Subjt:  PQENNLHLFDHVYVDAPQKLPSVDSAIP-----QEERQYGHVRTQCGSSFPRAHSFYGKSVDHLINPINGVAALSSMASTVPSFSSSENAVGRFLNLAES

Query:  PAKDTRCHFPNWEQSPVAYKEKGTNDGFFCLPLNSKGELIQLNSSMINRFDQMNEASNNMACSSRIPVCGLVLPRSTRDYFIDNETLLVDTELAGNQLTL
         AKDTR   PNWEQS VAYKEKG NDGFFCLPLNSKGELIQLNS +IN FDQMN+ SN M CSSRIP CGLVLPRS RD FIDN+ LLVDTEL GNQL+L
Subjt:  PAKDTRCHFPNWEQSPVAYKEKGTNDGFFCLPLNSKGELIQLNSSMINRFDQMNEASNNMACSSRIPVCGLVLPRSTRDYFIDNETLLVDTELAGNQLTL

Query:  FPLHSNMQENQNRLLSGRFHLAEPGTSETADIRLLNSERGTESGRFFHSNLMDRPFNRCRYYGKLQNQNVSAEIYPENSSSMLSNPARQTMRLMGKDVAV
        FPLHSNMQENQ R LS  F + E G S TADIRL NSERGTE GRFFHSNLMD PFN                  PENSSS+L NPARQTMRLMGKDVAV
Subjt:  FPLHSNMQENQNRLLSGRFHLAEPGTSETADIRLLNSERGTESGRFFHSNLMDRPFNRCRYYGKLQNQNVSAEIYPENSSSMLSNPARQTMRLMGKDVAV

Query:  GGNGKEVQEPEVINFWKNSTLIENCLTNPIQENPMRKRNFLQERVFYPAGFHGNQVAQRNLLPNAPQVRYPHSRLDKKNSIMNQRSNSVINLNERFNNIH
        GGNGK+V EPEVINFWKN++L ENCLTN IQENPMRKRN+L++ +FYPAGFH NQVAQR+LLPNAPQ RYPH R+D+KNSIM  RS+SVINLNERFNNIH
Subjt:  GGNGKEVQEPEVINFWKNSTLIENCLTNPIQENPMRKRNFLQERVFYPAGFHGNQVAQRNLLPNAPQVRYPHSRLDKKNSIMNQRSNSVINLNERFNNIH

Query:  AFSPLST-EAFNMEPNFQAPFISGPET--LSQMMKIMSSS--LGF
        +FSPL T +AFNM  NF+APF SG +   LS      S+S  LGF
Subjt:  AFSPLST-EAFNMEPNFQAPFISGPET--LSQMMKIMSSS--LGF

SwissProt top hitse value%identityAlignment
Q9LYD9 Protein EMBRYONIC FLOWER 17.4e-0446.81Show/hide
Query:  FSIREYALKMRGKDLRRSWPFSENVKEEVAEA--LLPPISVKKFRWW
        FS+R +  + R +DLR+ WPFSE     V +    LP +SV KFRWW
Subjt:  FSIREYALKMRGKDLRRSWPFSENVKEEVAEA--LLPPISVKKFRWW

Arabidopsis top hitse value%identityAlignment
AT3G58770.1 unknown protein1.6e-0932.64Show/hide
Query:  FSIREYALKMRGKDLRRSWPFSENVKEEVAEALLPPISVKKFRWWFHEMEIQKSNNCVREKEEMKVEKICPVCGVFVTATVNAMNAHIDNCLAQTTKEKR
        FSIREY  K+R  + R+ WPF+     ++ ++ LPPI+V KFRWW HE+        +  K  + V+   P                           +R
Subjt:  FSIREYALKMRGKDLRRSWPFSENVKEEVAEALLPPISVKKFRWWFHEMEIQKSNNCVREKEEMKVEKICPVCGVFVTATVNAMNAHIDNCLAQTTKEKR

Query:  RNKAKSRTPKKRSIAEIFAVAPPVETMIIVNDCDRENVVGKQKI
        + KAK+R  KKRSI EI A AP ++          + VV K+KI
Subjt:  RNKAKSRTPKKRSIAEIFAVAPPVETMIIVNDCDRENVVGKQKI

AT5G11530.1 embryonic flower 1 (EMF1)5.2e-0546.81Show/hide
Query:  FSIREYALKMRGKDLRRSWPFSENVKEEVAEA--LLPPISVKKFRWW
        FS+R +  + R +DLR+ WPFSE     V +    LP +SV KFRWW
Subjt:  FSIREYALKMRGKDLRRSWPFSENVKEEVAEA--LLPPISVKKFRWW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGTTTTCTCTATTCGAGAGTATGCTTTGAAAATGAGGGGGAAGGATTTGAGGAGAAGTTGGCCGTTTAGTGAGAACGTGAAGGAAGAAGTGGCAGAAGCTTTGCT
GCCACCAATTTCTGTAAAGAAATTCCGATGGTGGTTTCACGAGATGGAGATTCAGAAATCGAATAATTGCGTAAGGGAAAAAGAAGAAATGAAAGTGGAGAAAATTTGTC
CGGTTTGTGGAGTTTTTGTTACGGCTACGGTGAACGCCATGAATGCTCATATTGATAATTGTTTGGCTCAAACAACAAAGGAAAAGAGAAGAAACAAAGCGAAATCAAGA
ACCCCAAAAAAGAGATCAATTGCAGAAATCTTCGCAGTCGCTCCGCCAGTAGAAACAATGATTATTGTTAATGATTGTGACCGAGAAAATGTCGTTGGGAAACAAAAAAT
TCCAGACAAGCTCAAAGCGACGTCGTTGGCTAGGACTCTTGTCTCCGCTATGAAGACAATCAAAGCCAACAACACCAAAAACAAATACAACAACAACCACAAAAATAAGG
ATTTTGGACATGAGCAACTTTGCAAGAAGGGCCAGAGGAATCACAAGGATGTTTCGGTCCGACGTTGCAAGAAACCGTGTTTTAAACGCTTGTCGAGACAAAAAAAGCGA
AAACTAGTTAAAAAATCCAATGTAGTTGCCAAGCAACAGAGGCCAGTGCCTCCAATTAGGAGCATTTTGAAGCATAGTGTAAAAGTAGTTTCTGAGACAAATCCTTCATC
CACCAACTTAACAGGCAGTAATCAAGTGATTAACAATGGCGGTCAGAATAAGTCGGATCGGCGTGTTAGCTTCTCGGATAAGGATGATGTTCTTGGTCCAAGCACTAGAG
CCATTTCAGATACTTTTGAACAAAATGATGGCAGTCCATTTGAAGCCTCAGAAGGAGACACTAATTCCGGTGAAACTAATAAAGAAGTTGATTCAATGGAGGTTGGTGTA
AATGATGATGTTTTCGTTAGCTTTAGCACTCGACATGAAGTTGATAGTCAACACATGAAAGGAAAGATTCAGTTGCCTAATATCCATGATCAAGTCAATGCTCAATGTTC
AATGAGGCCTCATCCTTGTTGGGACAATGCGAATCATTCGGCCGAGAAGTTGATACCAGCAAATCGGGTTATTCCACAGGAAAATAATTTGCACTTGTTTGATCATGTCT
ATGTAGATGCACCTCAGAAGCTGCCATCAGTAGATTCTGCCATTCCTCAAGAAGAAAGGCAATATGGCCATGTAAGAACTCAATGTGGTTCAAGTTTTCCTCGAGCGCAT
TCTTTCTATGGAAAATCAGTTGACCATTTGATAAATCCTATCAATGGAGTAGCTGCCTTAAGCTCAATGGCAAGCACAGTGCCTTCTTTTTCTTCAAGTGAAAATGCAGT
TGGCAGATTTCTTAATTTGGCTGAATCTCCTGCTAAGGACACTCGATGCCACTTTCCGAATTGGGAGCAAAGTCCGGTCGCCTACAAAGAGAAGGGCACGAATGATGGAT
TTTTCTGCCTGCCATTGAACTCAAAGGGTGAACTGATACAGCTAAATTCGAGTATGATTAATAGGTTTGATCAAATGAATGAAGCCAGTAACAATATGGCATGTTCTAGC
AGGATACCGGTATGCGGTCTCGTCCTGCCAAGAAGCACCCGGGATTATTTCATAGATAATGAGACGCTCCTTGTTGATACAGAACTTGCAGGAAACCAGTTGACTTTATT
TCCATTACATAGTAATATGCAAGAAAATCAAAATCGACTTTTGTCCGGTAGATTCCATCTAGCTGAGCCTGGAACTTCAGAAACAGCTGATATTAGACTGCTAAATTCAG
AAAGGGGAACTGAATCTGGTAGGTTTTTTCACTCGAACTTGATGGATCGTCCATTTAACAGATGCAGGTATTATGGAAAGTTGCAGAACCAAAATGTAAGTGCAGAGATT
TATCCTGAAAATTCGAGTAGCATGTTGTCGAATCCTGCCCGACAAACGATGCGGTTGATGGGCAAGGATGTAGCTGTTGGTGGAAATGGGAAAGAAGTTCAAGAACCTGA
AGTTATAAACTTTTGGAAGAACTCAACCTTAATTGAGAACTGCCTAACCAATCCTATCCAAGAGAATCCCATGAGAAAAAGAAACTTTCTGCAAGAGAGGGTGTTTTATC
CTGCAGGCTTTCATGGAAATCAAGTGGCACAAAGAAATTTATTGCCAAATGCTCCACAAGTTAGGTACCCCCATTCGCGCCTCGATAAAAAAAACAGTATAATGAATCAA
AGATCCAACTCTGTCATCAACTTAAACGAAAGGTTCAACAACATCCATGCCTTTTCGCCTTTGTCGACCGAAGCGTTTAATATGGAACCAAACTTTCAAGCACCCTTTAT
TTCTGGTCCTGAAACACTAAGCCAAATGATGAAAATCATGTCCAGCTCCCTTGGTTTCACAGTTCTAAGAGGCTTCCCCCATGGATGTTACACGGTCACCAACGGGAAGA
AGCACCGATCGCAAATTCTAAACTCGCTGACATAA
mRNA sequenceShow/hide mRNA sequence
ATAAAAACATCTCTTTTTTTTTTCTCTCTCTCTGTCACCGCTAGAGAGAAAAGCTCTTCATTCATACCATACTTCATCATCTTCTTCCACTGTCTCCACTGCTTGTCTGT
CTCTCTGTCCCATTCTCCTCTCAATTTCACACCATTTTTTCTCATCCTTTTCCAAATTAAAACCCAACCCATTTCCCAATTTTCAATCTTCTCACTTGGGTCTCTCTCCG
TTTCTCTTTTGGATGTGAAAATCATGGCCGTTTTCTCTATTCGAGAGTATGCTTTGAAAATGAGGGGGAAGGATTTGAGGAGAAGTTGGCCGTTTAGTGAGAACGTGAAG
GAAGAAGTGGCAGAAGCTTTGCTGCCACCAATTTCTGTAAAGAAATTCCGATGGTGGTTTCACGAGATGGAGATTCAGAAATCGAATAATTGCGTAAGGGAAAAAGAAGA
AATGAAAGTGGAGAAAATTTGTCCGGTTTGTGGAGTTTTTGTTACGGCTACGGTGAACGCCATGAATGCTCATATTGATAATTGTTTGGCTCAAACAACAAAGGAAAAGA
GAAGAAACAAAGCGAAATCAAGAACCCCAAAAAAGAGATCAATTGCAGAAATCTTCGCAGTCGCTCCGCCAGTAGAAACAATGATTATTGTTAATGATTGTGACCGAGAA
AATGTCGTTGGGAAACAAAAAATTCCAGACAAGCTCAAAGCGACGTCGTTGGCTAGGACTCTTGTCTCCGCTATGAAGACAATCAAAGCCAACAACACCAAAAACAAATA
CAACAACAACCACAAAAATAAGGATTTTGGACATGAGCAACTTTGCAAGAAGGGCCAGAGGAATCACAAGGATGTTTCGGTCCGACGTTGCAAGAAACCGTGTTTTAAAC
GCTTGTCGAGACAAAAAAAGCGAAAACTAGTTAAAAAATCCAATGTAGTTGCCAAGCAACAGAGGCCAGTGCCTCCAATTAGGAGCATTTTGAAGCATAGTGTAAAAGTA
GTTTCTGAGACAAATCCTTCATCCACCAACTTAACAGGCAGTAATCAAGTGATTAACAATGGCGGTCAGAATAAGTCGGATCGGCGTGTTAGCTTCTCGGATAAGGATGA
TGTTCTTGGTCCAAGCACTAGAGCCATTTCAGATACTTTTGAACAAAATGATGGCAGTCCATTTGAAGCCTCAGAAGGAGACACTAATTCCGGTGAAACTAATAAAGAAG
TTGATTCAATGGAGGTTGGTGTAAATGATGATGTTTTCGTTAGCTTTAGCACTCGACATGAAGTTGATAGTCAACACATGAAAGGAAAGATTCAGTTGCCTAATATCCAT
GATCAAGTCAATGCTCAATGTTCAATGAGGCCTCATCCTTGTTGGGACAATGCGAATCATTCGGCCGAGAAGTTGATACCAGCAAATCGGGTTATTCCACAGGAAAATAA
TTTGCACTTGTTTGATCATGTCTATGTAGATGCACCTCAGAAGCTGCCATCAGTAGATTCTGCCATTCCTCAAGAAGAAAGGCAATATGGCCATGTAAGAACTCAATGTG
GTTCAAGTTTTCCTCGAGCGCATTCTTTCTATGGAAAATCAGTTGACCATTTGATAAATCCTATCAATGGAGTAGCTGCCTTAAGCTCAATGGCAAGCACAGTGCCTTCT
TTTTCTTCAAGTGAAAATGCAGTTGGCAGATTTCTTAATTTGGCTGAATCTCCTGCTAAGGACACTCGATGCCACTTTCCGAATTGGGAGCAAAGTCCGGTCGCCTACAA
AGAGAAGGGCACGAATGATGGATTTTTCTGCCTGCCATTGAACTCAAAGGGTGAACTGATACAGCTAAATTCGAGTATGATTAATAGGTTTGATCAAATGAATGAAGCCA
GTAACAATATGGCATGTTCTAGCAGGATACCGGTATGCGGTCTCGTCCTGCCAAGAAGCACCCGGGATTATTTCATAGATAATGAGACGCTCCTTGTTGATACAGAACTT
GCAGGAAACCAGTTGACTTTATTTCCATTACATAGTAATATGCAAGAAAATCAAAATCGACTTTTGTCCGGTAGATTCCATCTAGCTGAGCCTGGAACTTCAGAAACAGC
TGATATTAGACTGCTAAATTCAGAAAGGGGAACTGAATCTGGTAGGTTTTTTCACTCGAACTTGATGGATCGTCCATTTAACAGATGCAGGTATTATGGAAAGTTGCAGA
ACCAAAATGTAAGTGCAGAGATTTATCCTGAAAATTCGAGTAGCATGTTGTCGAATCCTGCCCGACAAACGATGCGGTTGATGGGCAAGGATGTAGCTGTTGGTGGAAAT
GGGAAAGAAGTTCAAGAACCTGAAGTTATAAACTTTTGGAAGAACTCAACCTTAATTGAGAACTGCCTAACCAATCCTATCCAAGAGAATCCCATGAGAAAAAGAAACTT
TCTGCAAGAGAGGGTGTTTTATCCTGCAGGCTTTCATGGAAATCAAGTGGCACAAAGAAATTTATTGCCAAATGCTCCACAAGTTAGGTACCCCCATTCGCGCCTCGATA
AAAAAAACAGTATAATGAATCAAAGATCCAACTCTGTCATCAACTTAAACGAAAGGTTCAACAACATCCATGCCTTTTCGCCTTTGTCGACCGAAGCGTTTAATATGGAA
CCAAACTTTCAAGCACCCTTTATTTCTGGTCCTGAAACACTAAGCCAAATGATGAAAATCATGTCCAGCTCCCTTGGTTTCACAGTTCTAAGAGGCTTCCCCCATGGATG
TTACACGGTCACCAACGGGAAGAAGCACCGATCGCAAATTCTAAACTCGCTGACATAAATGGATACTATTATCCATTCATTTCTTCTGGTACAGATGTTCTCATCAGTCC
TCCTACGCATCACCGGCACGAGGCTGTGTATCCTTGCAGTACAATGCCATCTAACTTACAAATGAAGCATAATATACCTGGCTCAACATCTTTTTTTCAACCAATTCCTG
TTGCTCCTCGAGTTCAAATGCCATCTATTAGAATGAAAACTTTGAGTGTCAAGGACTCTGATCTTTCAAGTAAAAAGCGACCTGCTGGAGAGTTCGTCGATTCGAGGAAG
CGTCAAAAGATATCGAGTTTAGAAATGAACAATAATGCTGGTGTTGTACCAGGGTGGACAAGAGGAGAATTCATTGATGACGTGCAATCTAACCTGGGGACGGCGGCGAA
AATCCATGCTAACTGTAACTGGGACAAAGCTGTTAATTCAGCTGGAAATATCACAAATGTGACTCAAACTGATGGAGTAGTGATTTCTACCACCAATGAACCTCCTAAAG
TTGAATGTATGGCAAGATCAGGCCCCATTAAGTTGACAGCAGGAGCAAAACACATACTGAAACCAAGTCAGAGCATGGACCTAGACAATACCAAGCCTACTTATTCATCA
ATTCCTTCTGCTGGATTAGCTCATAGTGTTAGCTTAGCAGAATCTCAGAAGAAGTCAACTAAAGTATACAGTTTTTGAAGTAAGTATTGTAGTTATCTTGTAATTATTTG
CTAAATATAATCCTAATCTACTTGTGGTAGCTGATATGAGCAAATGAACTTATCTGCATGACAGGAAGGAATCTCCTCTCATCTTTGTAACCACTGACATGAGAGTTATT
GTACTTTCAAGACGACTCGTCGTTGTTCGCAGTTTTTTGGTATGCGGAAGCATGTTATCATGAACGGAAACTAAA
Protein sequenceShow/hide protein sequence
MAVFSIREYALKMRGKDLRRSWPFSENVKEEVAEALLPPISVKKFRWWFHEMEIQKSNNCVREKEEMKVEKICPVCGVFVTATVNAMNAHIDNCLAQTTKEKRRNKAKSR
TPKKRSIAEIFAVAPPVETMIIVNDCDRENVVGKQKIPDKLKATSLARTLVSAMKTIKANNTKNKYNNNHKNKDFGHEQLCKKGQRNHKDVSVRRCKKPCFKRLSRQKKR
KLVKKSNVVAKQQRPVPPIRSILKHSVKVVSETNPSSTNLTGSNQVINNGGQNKSDRRVSFSDKDDVLGPSTRAISDTFEQNDGSPFEASEGDTNSGETNKEVDSMEVGV
NDDVFVSFSTRHEVDSQHMKGKIQLPNIHDQVNAQCSMRPHPCWDNANHSAEKLIPANRVIPQENNLHLFDHVYVDAPQKLPSVDSAIPQEERQYGHVRTQCGSSFPRAH
SFYGKSVDHLINPINGVAALSSMASTVPSFSSSENAVGRFLNLAESPAKDTRCHFPNWEQSPVAYKEKGTNDGFFCLPLNSKGELIQLNSSMINRFDQMNEASNNMACSS
RIPVCGLVLPRSTRDYFIDNETLLVDTELAGNQLTLFPLHSNMQENQNRLLSGRFHLAEPGTSETADIRLLNSERGTESGRFFHSNLMDRPFNRCRYYGKLQNQNVSAEI
YPENSSSMLSNPARQTMRLMGKDVAVGGNGKEVQEPEVINFWKNSTLIENCLTNPIQENPMRKRNFLQERVFYPAGFHGNQVAQRNLLPNAPQVRYPHSRLDKKNSIMNQ
RSNSVINLNERFNNIHAFSPLSTEAFNMEPNFQAPFISGPETLSQMMKIMSSSLGFTVLRGFPHGCYTVTNGKKHRSQILNSLT