; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr006460 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr006460
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionLate embryogenesis abundant protein-related / LEA protein-related protein
Genome locationtig00004627:57822..66454
RNA-Seq ExpressionSgr006460
SyntenySgr006460
Gene Ontology termsGO:0019441 - tryptophan catabolic process to kynurenine (biological process)
GO:0004061 - arylformamidase activity (molecular function)
InterPro domainsIPR007325 - Kynurenine formamidase/cyclase-like
IPR009646 - Root cap
IPR037175 - Kynurenine formamidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF3435454.1 hypothetical protein FNV43_RR22543 [Rhamnella rubrinervis]1.9e-23855.06Show/hide
Query:  ASNIAYPSVVDTGATDCSLSDGGDGGLTPVRREVYDNGRIIDISHRFTADMPSWESDKGLGQFLWLPKSMKNGSLANNSEMKLPAHTGTHVDAPGHVFDH
        ++N+AYPS+   G+ DCSL D    G+ PVR+EVY NGRI DI+HR+T DMP W SD GLG FLWL  SMKNGSL N SEMK   H+GTHVDAPGH FDH
Subjt:  ASNIAYPSVVDTGATDCSLSDGGDGGLTPVRREVYDNGRIIDISHRFTADMPSWESDKGLGQFLWLPKSMKNGSLANNSEMKLPAHTGTHVDAPGHVFDH

Query:  YFDAGFDVDTLDLEVLNGPGLLIDVPRDKNITAEVMKSLNIPKGVRRVLFRTLNTDRRLMWKREFDTSYVGFMKDGAKWLVENTDIKLVGIDYLSVAAFD
        YFDAGFDVDTLDLEVLNG  LL+DVPRD NITAEVMK+LNIPKGVRRVLFRTLNTDRRLMWK++FDTS+VGFM+DGAKWLVENTDIKLVG DYLSVAA+ 
Subjt:  YFDAGFDVDTLDLEVLNGPGLLIDVPRDKNITAEVMKSLNIPKGVRRVLFRTLNTDRRLMWKREFDTSYVGFMKDGAKWLVENTDIKLVGIDYLSVAAFD

Query:  DLIPSHLVFLEGREIIIVEGLKLDDVQPGLYSIHCLPLRLLGAEGSPIRCILIKYFLLWPPQTREEVEVPVFLGSSSSSSSSSSSLCLEMAYSSGFSLLA
        +  P+H VFL+ REII+VEGLKLDD+Q G YS+HCLPL                    W      E      L    SS     S+ L     S   LL 
Subjt:  DLIPSHLVFLEGREIIIVEGLKLDDVQPGLYSIHCLPLRLLGAEGSPIRCILIKYFLLWPPQTREEVEVPVFLGSSSSSSSSSSSLCLEMAYSSGFSLLA

Query:  PLVVAVVMAAMAEATPPGIAKNPSHATCKIKKYKHCYNLVHVCPKFCPDQCTVECASCKPICGGDANPPPEEDPTPATQSPPSPPSETYSPPTPSPPPST
         L++ ++   +   TPPGIAKNPSHA C   KYK+C NL HVCPKFCP++CTV C SCKP C G ++ PP ED  P  Q+P             +PPPS 
Subjt:  PLVVAVVMAAMAEATPPGIAKNPSHATCKIKKYKHCYNLVHVCPKFCPDQCTVECASCKPICGGDANPPPEEDPTPATQSPPSPPSETYSPPTPSPPPST

Query:  PTNPNSPSNSYSPPTPATPSSPPSPSAGTTPSPPTISSPPPVTSTPPPSPVQTPPKPTPPVTSPPPYSHFPSTPPANPNPPPTPSTPNSPSPSLSSPPSP
        PT P        P TP+ PS P SP   T+PSP    +PP  T  PP  P  TP  P+PP+T+       PS PP  P+P   P TP+ PSP  ++PPSP
Subjt:  PTNPNSPSNSYSPPTPATPSSPPSPSAGTTPSPPTISSPPPVTSTPPPSPVQTPPKPTPPVTSPPPYSHFPSTPPANPNPPPTPSTPNSPSPSLSSPPSP

Query:  PKYPPTAPSPPAPSAGTPSPPTVSPSTPPATTPSTPNSPSLSPPPTPSETPSSPTTNTPPPPSPTPSPPRSPPASPPRPPPTGVAGQPPAPLSPQSSSAG
                        T SPP+ +P                         PSSPTT+T              P+SPP               S QSS AG
Subjt:  PKYPPTAPSPPAPSAGTPSPPTVSPSTPPATTPSTPNSPSLSPPPTPSETPSSPTTNTPPPPSPTPSPPRSPPASPPRPPPTGVAGQPPAPLSPQSSSAG

Query:  AAKRVRCKNANYPQCYNMIHTCPSACPGGCEVDCVTCKPVCHCDRPGAVCQDPRFIGGDGITFYFHGKKDRNFCLVSNPNLHINAHLIGKRNQNLKRDFT
          K+ RCKN  Y +CY+M H CPS+CPGGCEVDCVTCKPVC CD+PGAVCQDPRFIGGDGITFYFHGKKD +FCL+S+ N+HINAH IG+RNQN+KRDFT
Subjt:  AAKRVRCKNANYPQCYNMIHTCPSACPGGCEVDCVTCKPVCHCDRPGAVCQDPRFIGGDGITFYFHGKKDRNFCLVSNPNLHINAHLIGKRNQNLKRDFT

Query:  WVQSLGILFGRYQLFIGAQERAAWDDSVDSLAVALNGQLVALPEAEGSQWQYPIENPSIFIVRLAPTNHVIVQAKGLFRITVKVVPITEEDSRIHNYGIT
        WVQS+G+LFG +QL+IGA + A WDDSVD LA+  +G  ++ P+++G+  Q     PS+ + R+A TN V+V+ +G  R+T KVVPIT+EDSRIHNYGIT
Subjt:  WVQSLGILFGRYQLFIGAQERAAWDDSVDSLAVALNGQLVALPEAEGSQWQYPIENPSIFIVRLAPTNHVIVQAKGLFRITVKVVPITEEDSRIHNYGIT

Query:  KENSFAHLDVGFKFFSLSSGVSGVLGQTYSPEYTSRINVKATMPVMGREDEFETSSLFAADCAVARF---GGGGSEEEA
         E+SFAHLD+ FKFFSLS+ VSGVLGQTY P+Y SR+N+ A M VMG + EF+TS+L   DCAVARF    G  S EEA
Subjt:  KENSFAHLDVGFKFFSLSSGVSGVLGQTYSPEYTSRINVKATMPVMGREDEFETSSLFAADCAVARF---GGGGSEEEA

KAG6571952.1 hypothetical protein SDJN03_28680, partial [Cucurbita argyrosperma subsp. sororia]1.0e-17863.67Show/hide
Query:  MAYSSGFSLLAPLVVAVVMAAMAEATPPGIAKNPSHATCKIKKYKHCYNLVHVCPKFCPDQCTVECASCKPICGGDANPPPEEDPTPATQSPPS------
        MA  S F +L PLVVAV++  MA+ATPPGIAKNPSHA+CKIKKYKHCYNL HVCPKFCPDQCTVECASCKPICGGDA+PPPE+DPTPAT SPPS      
Subjt:  MAYSSGFSLLAPLVVAVVMAAMAEATPPGIAKNPSHATCKIKKYKHCYNLVHVCPKFCPDQCTVECASCKPICGGDANPPPEEDPTPATQSPPS------

Query:  ----------PPSE---TYSPPTPSPPPSTPTNPNSPSNSYSPPTPATPSSPPS-----PSAGTTPSPPTISSPPPVTSTPPPSPVQTPPKPTPPVTSPP
                  PPSE   +YSPP PSP P TP +P+ P+N   P TP T S PP       S  T+P+PPT S+P    STPP SP   PP   P  + PP
Subjt:  ----------PPSE---TYSPPTPSPPPSTPTNPNSPSNSYSPPTPATPSSPPS-----PSAGTTPSPPTISSPPPVTSTPPPSPVQTPPKPTPPVTSPP

Query:  PYSHFPSTPPANPNP--PPTPSTPNSPS--PSLSSPP---SPPKYPPTAPSPPAPSA-GTPSPPTVSP--STPPA--TTPSTPNSPSLSPPPTPSETPSS
           + PS+PP +PNP  P TPS PN PS  P+ S PP   +PP  PPT+P+PP PS   +P+PP+  P  S PP     PS PN P  S PP     PS+
Subjt:  PYSHFPSTPPANPNP--PPTPSTPNSPS--PSLSSPP---SPPKYPPTAPSPPAPSA-GTPSPPTVSP--STPPA--TTPSTPNSPSLSPPPTPSETPSS

Query:  PTTNTPPPPSPTPSPPRSPPASPPRPPPTGVAGQPPAPLSPQSSSAGAAKRVRCKNANYPQCYNMIHTCPSACPGGCEVDCVTCKPVCHCDRPGAVCQDP
        P   + PP SP P  P +P   P    P G    P  P  P SSS GAAKRVRCKNANYPQCYNMIHTCPSACP GC+VDCVTCKPVCHCDRPGAVCQDP
Subjt:  PTTNTPPPPSPTPSPPRSPPASPPRPPPTGVAGQPPAPLSPQSSSAGAAKRVRCKNANYPQCYNMIHTCPSACPGGCEVDCVTCKPVCHCDRPGAVCQDP

Query:  RFIGGDGITFYFHGKKDRNFCLVSNPNLHINAHLIGKRNQNLKRDFTWVQSLGILFGRYQLFIGAQERAAWDDSVDSLAVALNGQLVALPEAEGSQWQYP
        RFIGGDGITFYFHG+KD++FCLVS+PNLHINAH IGKRN +L RDFTWVQSLGILF  ++L I AQ+   WDDS+D L +AL+   VALPE+EGSQWQ+P
Subjt:  RFIGGDGITFYFHGKKDRNFCLVSNPNLHINAHLIGKRNQNLKRDFTWVQSLGILFGRYQLFIGAQERAAWDDSVDSLAVALNGQLVALPEAEGSQWQYP

Query:  IENPSIFIVRLAPTNHVIVQAKGLFRITVKVVPITEEDSRIHNYGITKENSFAHLDVGFKFFSLSSGVSGVLGQTYSPEYTSRINVKATMPVMGREDEFE
         ENP++ IVRL   NHV+V+AKGLFRIT KVVPITEEDSR+H+YGI + +SFAHLDVGFKFF LS GV+GVLGQTY   Y S +N+KA MPVMGRE EFE
Subjt:  IENPSIFIVRLAPTNHVIVQAKGLFRITVKVVPITEEDSRIHNYGITKENSFAHLDVGFKFFSLSSGVSGVLGQTYSPEYTSRINVKATMPVMGREDEFE

Query:  TSSLFAADCAVARFGGGGSEEE
        TSSLFAADCAVARFG  G +++
Subjt:  TSSLFAADCAVARFGGGGSEEE

XP_022147747.1 formin-like protein 20 [Momordica charantia]4.9e-19762Show/hide
Query:  LCLEMAYS------SGFSLLAPLVVAVVMAAMAEATPPGIAKNPSHATCKIKKYKHCYNLVHVCPKFCPDQCTVECASCKPICGGDANPPPEEDPTPATQ
        LCLE   S      S F LL PL V V MA M EATPPGIA NPSHATCKIKKYKHCYNLVHVCPKFCPDQCTVECASCKPICGGDAN PPE+DPTPAT 
Subjt:  LCLEMAYS------SGFSLLAPLVVAVVMAAMAEATPPGIAKNPSHATCKIKKYKHCYNLVHVCPKFCPDQCTVECASCKPICGGDANPPPEEDPTPATQ

Query:  SPPSPPSET-YSPPTPSPPPSTPTNPNSPSNSYSPPT------------------PATPSSP------PSPSAGTTPSPPTISSPPPVT--------STP
        SPPSPPSET YSPP PS PP  P  P +P  + +PP+                  PA P+SP      P+P    TPS P  S  PP T        STP
Subjt:  SPPSPPSET-YSPPTPSPPPSTPTNPNSPSNSYSPPT------------------PATPSSP------PSPSAGTTPSPPTISSPPPVT--------STP

Query:  P--PSPVQTPP-----------KPTPPVTSPPPYSHFPSTPPANPNPPPTPST---------------------PNSPSPSLSSP---------------
        P  P+P  TPP           KP PP + PP   + PSTPPA+PNPP TP T                     PN PS   +SP               
Subjt:  P--PSPVQTPP-----------KPTPPVTSPPPYSHFPSTPPANPNPPPTPST---------------------PNSPSPSLSSP---------------

Query:  ----------PSPPKYPPTAPSPPAPSAGTPSPPTV-----------------SPSTPPAT-------------TPSTPNSPSLSPPP-TPSETPSSPTT
                  P+PP  PPT+P+PP+     P+PP+                   PSTPPA+             +PSTPNSPSLSPPP TPSETPSSP+T
Subjt:  ----------PSPPKYPPTAPSPPAPSAGTPSPPTV-----------------SPSTPPAT-------------TPSTPNSPSLSPPP-TPSETPSSPTT

Query:  NTPPPPSPTPSPPRSPPASPPRPPPTGVAGQPPAPLSPQSSSAGAAKRVRCKNANYPQCYNMIHTCPSACPGGCEVDCVTCKPVCHCDRPGAVCQDPRFI
        NT  PP P PSPP S PASPPR  P G AG+P A   P SS+  A K+VRCKN NYPQCYNM+HTCPSACP GCEVDCVTCKPVCHCDRPGAVCQDPRFI
Subjt:  NTPPPPSPTPSPPRSPPASPPRPPPTGVAGQPPAPLSPQSSSAGAAKRVRCKNANYPQCYNMIHTCPSACPGGCEVDCVTCKPVCHCDRPGAVCQDPRFI

Query:  GGDGITFYFHGKKDRNFCLVSNPNLHINAHLIGKRNQNLKRDFTWVQSLGILFGRYQLFIGAQERAAWDDSVDSLAVALNGQLVALPEAEGSQWQYPIEN
        GGDGITFYFHGKKDR+FCLVS+ NLHINAHLIGKRN NLKRDFTWVQSLGIL   +Q+FIGAQ+ AAWDDSVD LAVA+NGQ VALPE+ GSQWQYP EN
Subjt:  GGDGITFYFHGKKDRNFCLVSNPNLHINAHLIGKRNQNLKRDFTWVQSLGILFGRYQLFIGAQERAAWDDSVDSLAVALNGQLVALPEAEGSQWQYPIEN

Query:  PSIFIVRLAPTNHVIVQAKGLFRITVKVVPITEEDSRIHNYGITKENSFAHLDVGFKFFSLSSGVSGVLGQTYSPEYTSRINVKATMPVMGREDEFETSS
        P+I +VRLAP N V+V+AKG+FRIT KVVPITE+DSRIHNYGITKE+SFAHLD+GFKFFSLS  VSGVLGQTY PEY SR+N+KA MPVMGRE EFETSS
Subjt:  PSIFIVRLAPTNHVIVQAKGLFRITVKVVPITEEDSRIHNYGITKENSFAHLDVGFKFFSLSSGVSGVLGQTYSPEYTSRINVKATMPVMGREDEFETSS

Query:  LFAADCAVARFG-GGGSEEEA
        LFAADCAVARFG  GGS  EA
Subjt:  LFAADCAVARFG-GGGSEEEA

XP_022952949.1 basic proline-rich protein [Cucurbita moschata]2.2e-18164.3Show/hide
Query:  MAYSSGFSLLAPLVVAVVMAAMAEATPPGIAKNPSHATCKIKKYKHCYNLVHVCPKFCPDQCTVECASCKPICGGDANPPPEEDPTPATQSPPSPPSETY
        MA  S F +L PLVVAV++ AMA+ATPPGIAKNPSHA+CKIKKYKHCYNL HVCPKFCPDQCTVECASCKPICGGDA+PPPE+DPTPAT SPPS     Y
Subjt:  MAYSSGFSLLAPLVVAVVMAAMAEATPPGIAKNPSHATCKIKKYKHCYNLVHVCPKFCPDQCTVECASCKPICGGDANPPPEEDPTPATQSPPSPPSETY

Query:  SPP-----TPSPPPSTPTNPNSPSNSYSPPTPATPSSPPSPSAGTTPSPPT---ISSPPPVTSTPPPSPVQTPPKPTPPVTSPPPYSHFP-------STP
        SPP     TPSPPPS PT       SYSPP P+     PSPS  T P+PP+     S PP    PP SP  +P  PTP   S PP S  P       S P
Subjt:  SPP-----TPSPPPSTPTNPNSPSNSYSPPTPATPSSPPSPSAGTTPSPPT---ISSPPPVTSTPPPSPVQTPPKPTPPVTSPPPYSHFP-------STP

Query:  PANPNPPPTPST-PNSPSPSLSSPPSPPKYPPTAPSPPAPSAGTPSPPTVSPSTPPATTPSTPNSPS-LSPPPTPSETPSS---PTTNTPPP---PSPTP
        P N NPP +P T PN P+PS  S P+PP  PPT   PP    G  +PP+  P++P   TPSTP+SP+  S PPT S  P +   P+   PPP   P   P
Subjt:  PANPNPPPTPST-PNSPSPSLSSPPSPPKYPPTAPSPPAPSAGTPSPPTVSPSTPPATTPSTPNSPS-LSPPPTPSETPSS---PTTNTPPP---PSPTP

Query:  SPPRSPPASPPRPPPTGVAGQP----------PAPLSPQSSSAGAAKRVRCKNANYPQCYNMIHTCPSACPGGCEVDCVTCKPVCHCDRPGAVCQDPRFI
        +PP +PP SP  P P+  +  P          P    P SSSAGAAKRVRCKNANYPQCYNMIHTCPSACP GC+VDCVTCKPVCHCDRPGAVCQDPRFI
Subjt:  SPPRSPPASPPRPPPTGVAGQP----------PAPLSPQSSSAGAAKRVRCKNANYPQCYNMIHTCPSACPGGCEVDCVTCKPVCHCDRPGAVCQDPRFI

Query:  GGDGITFYFHGKKDRNFCLVSNPNLHINAHLIGKRNQNLKRDFTWVQSLGILFGRYQLFIGAQERAAWDDSVDSLAVALNGQLVALPEAEGSQWQYPIEN
        GGDGITFYFHG+KD++FCLVS+PNLHINAH IGKRN +L RDFTWVQSLGILF  ++L I AQ+ A WDDS+D L +AL+   VALPE+EGSQWQ+P EN
Subjt:  GGDGITFYFHGKKDRNFCLVSNPNLHINAHLIGKRNQNLKRDFTWVQSLGILFGRYQLFIGAQERAAWDDSVDSLAVALNGQLVALPEAEGSQWQYPIEN

Query:  PSIFIVRLAPTNHVIVQAKGLFRITVKVVPITEEDSRIHNYGITKENSFAHLDVGFKFFSLSSGVSGVLGQTYSPEYTSRINVKATMPVMGREDEFETSS
        P+I IVRL   NHV+V+AKGLFRIT KVVPITEEDSR+H+YGI + +SFAHLDVGFKFF LS GV+GVLGQTY   Y S +N+KA MPVMGRE EFETSS
Subjt:  PSIFIVRLAPTNHVIVQAKGLFRITVKVVPITEEDSRIHNYGITKENSFAHLDVGFKFFSLSSGVSGVLGQTYSPEYTSRINVKATMPVMGREDEFETSS

Query:  LFAADCAVARFGGGGSEEE
        LFAADCAVARFG  G +++
Subjt:  LFAADCAVARFGGGGSEEE

XP_022972442.1 mucin-2 [Cucurbita maxima]5.8e-18263.62Show/hide
Query:  MAYSSGFSLLAPLVVAVVMAAMAEATPPGIAKNPSHATCKIKKYKHCYNLVHVCPKFCPDQCTVECASCKPICGGDANPPPEEDPTPATQSPPSPPSETY
        MA  S F +L PLVVAV++  MA+ATPPGIAKNPSHA+CKIKKYKHCYNL HVCPKFCPDQCTVECASCKPICGGDANPPPE+DPTPAT SPPS     Y
Subjt:  MAYSSGFSLLAPLVVAVVMAAMAEATPPGIAKNPSHATCKIKKYKHCYNLVHVCPKFCPDQCTVECASCKPICGGDANPPPEEDPTPATQSPPSPPSETY

Query:  SPP-----TPSPPPSTPTNPNSPSNSYSPPTPATPSSPPSPSAGTTPSPPTISSPPPVTSTPPPSPVQTPPKPTPPVTSPPPYSHFPSTPPANPNPPPTP
        SPP     TPSPPPS PT       SYSPP P+     PSPS  T P+PP   S PP  S PP +  Q PP  +PP +  PP    PSTPPANPNPP +P
Subjt:  SPP-----TPSPPPSTPTNPNSPSNSYSPPTPATPSSPPSPSAGTTPSPPTISSPPPVTSTPPPSPVQTPPKPTPPVTSPPPYSHFPSTPPANPNPPPTP

Query:  ST-----------------PNSPSPSLSSPPSPPKYPPTAPSPPAPSAGTPSPPTVSPSTPPATTPSTPNSPSLSP----PPTPSETPSSPT----TNTP
         T                 PN P+PS  S P+PP  PPT   PP  +   PS P  SP+ P  +TPS PN PS  P    PP     PS+P     +N P
Subjt:  ST-----------------PNSPSPSLSSPPSPPKYPPTAPSPPAPSAGTPSPPTVSPSTPPATTPSTPNSPSLSP----PPTPSETPSSPT----TNTP

Query:  PPPSPTPSPPRSPPASPPRPPPTGVAGQP----------PAPLSPQSSSAGAAKRVRCKNANYPQCYNMIHTCPSACPGGCEVDCVTCKPVCHCDRPGAV
          P+P  +PP +PP SP  P P+  +  P          P    P SSS GAAKRVRCKNANYPQCYNMIHTCPSACP GC+VDCVTCKPVCHCDRPGAV
Subjt:  PPPSPTPSPPRSPPASPPRPPPTGVAGQP----------PAPLSPQSSSAGAAKRVRCKNANYPQCYNMIHTCPSACPGGCEVDCVTCKPVCHCDRPGAV

Query:  CQDPRFIGGDGITFYFHGKKDRNFCLVSNPNLHINAHLIGKRNQNLKRDFTWVQSLGILFGRYQLFIGAQERAAWDDSVDSLAVALNGQLVALPEAEGSQ
        CQDPRFIGGDGITFYFHG+KD++FCLVS+PNLHINAH IGKRN +L RDFTWVQSLGILF  ++L I AQ+ A WDDS+D L +ALN   VALPE+EGSQ
Subjt:  CQDPRFIGGDGITFYFHGKKDRNFCLVSNPNLHINAHLIGKRNQNLKRDFTWVQSLGILFGRYQLFIGAQERAAWDDSVDSLAVALNGQLVALPEAEGSQ

Query:  WQYPIENPSIFIVRLAPTNHVIVQAKGLFRITVKVVPITEEDSRIHNYGITKENSFAHLDVGFKFFSLSSGVSGVLGQTYSPEYTSRINVKATMPVMGRE
        WQ+P ENP++ IVRL   NHV+V+AKGLFRIT KVVPITEEDSR+H+YGI + +SFAHLDVGFKFF LS GV+GVLGQTY   Y S +N+KA MPVMGRE
Subjt:  WQYPIENPSIFIVRLAPTNHVIVQAKGLFRITVKVVPITEEDSRIHNYGITKENSFAHLDVGFKFFSLSSGVSGVLGQTYSPEYTSRINVKATMPVMGRE

Query:  DEFETSSLFAADCAVARFGGGGSE
         EFETSSLFAADCAVA+FGG G +
Subjt:  DEFETSSLFAADCAVARFGGGGSE

TrEMBL top hitse value%identityAlignment
A0A5A7SMM6 Proline-rich protein 36-like1.0e-16861.48Show/hide
Query:  VVAVVMAAMAEATPPGIAKNPSHATCKIKKYKHCYNLVHVCPKFCPDQCTVECASCKPICGG---DANPPPEEDPTPATQSPPSPPSET-YSPP-----T
        VV V++  M E TPPGIA NPSHATCKIKKYKHCYNLVHVCPKFCP+QC VECASCKPICG    DANPPPE+       + P+PPS+T YSPP     T
Subjt:  VVAVVMAAMAEATPPGIAKNPSHATCKIKKYKHCYNLVHVCPKFCPDQCTVECASCKPICGG---DANPPPEEDPTPATQSPPSPPSET-YSPP-----T

Query:  PSPPPSTPTNPNSPSNSYSPPTPATPS-SPPSPSAGTTPSPPTISSPPPVTSTPPP---SPVQTPP-----------------KPTPPVTSPPPYSHFPS
        PSPP S P+  +SP      PTP TPS SPP P + T  +PPTIS PPPVTSTPPP   +P ++PP                  PTPP  SPPP     S
Subjt:  PSPPPSTPTNPNSPSNSYSPPTPATPS-SPPSPSAGTTPSPPTISSPPPVTSTPPP---SPVQTPP-----------------KPTPPVTSPPPYSHFPS

Query:  TPPA-NPNP---PPTPSTPNSPSPSLSSPPSPPKYPPTAPSPP---APSAGTPSPPTVSP-----------------STPPATTPSTPNSPSLSP-----
        TPP+ NPNP   PPT  TP+ P P+ + PPS P  P  +P PP    P +  P+PPT  P                 S PP+T P+ PN+PS  P     
Subjt:  TPPA-NPNP---PPTPSTPNSPSPSLSSPPSPPKYPPTAPSPP---APSAGTPSPPTVSP-----------------STPPATTPSTPNSPSLSP-----

Query:  --PPTPSETPSSPTTNTP-PPPS----PTPSPPRSPPASPPRPPPTGVAGQPPAPLS-PQSSSAGAAKRVRCKNANYPQCYNMIHTCPSACPGGCEVDCV
          P  PSETP+SP  NTP  PPS    P+P+PP   P SPP   PT     P  P S P SSSAGA K VRCKN NYPQCYNMIH CPSACP GC+VDCV
Subjt:  --PPTPSETPSSPTTNTP-PPPS----PTPSPPRSPPASPPRPPPTGVAGQPPAPLS-PQSSSAGAAKRVRCKNANYPQCYNMIHTCPSACPGGCEVDCV

Query:  TCKPVCHCDRPGAVCQDPRFIGGDGITFYFHGKKDRNFCLVSNPNLHINAHLIGKRNQNLKRDFTWVQSLGILFGRYQLFIGAQERAAWDDSVDSLAVAL
        TCKPVCHCDRPGAVCQDPR +GGDGITFYFHGKKD++FCLVS+PNLHINAH IGKRN +LKRDFTWVQSL ILF  ++L I AQ+   WDDS+D L + L
Subjt:  TCKPVCHCDRPGAVCQDPRFIGGDGITFYFHGKKDRNFCLVSNPNLHINAHLIGKRNQNLKRDFTWVQSLGILFGRYQLFIGAQERAAWDDSVDSLAVAL

Query:  NGQLVALPEAEGSQWQYPIENPSIFIVRLAPTNHVIVQAKGLFRITVKVVPITEEDSRIHNYGITKENSFAHLDVGFKFFSLSSGVSGVLGQTYSPEYTS
        +   +ALP +EGSQ Q+PIENP+I IVRLA TNHV+V+AKGLFRIT KVVPIT+EDSRIHNYGI + +SFAHLDVGFKFF LS  V+GVLGQTY   Y S
Subjt:  NGQLVALPEAEGSQWQYPIENPSIFIVRLAPTNHVIVQAKGLFRITVKVVPITEEDSRIHNYGITKENSFAHLDVGFKFFSLSSGVSGVLGQTYSPEYTS

Query:  RINVKATMPVMGREDEFETSSLFAADCAVARFGGGG
         INVKA M VMGR +EFETSSLFAADCAV+RFGG G
Subjt:  RINVKATMPVMGREDEFETSSLFAADCAVARFGGGG

A0A5D3C8U8 Proline-rich protein 36-like6.7e-16863.39Show/hide
Query:  VVAVVMAAMAEATPPGIAKNPSHATCKIKKYKHCYNLVHVCPKFCPDQCTVECASCKPICGG---DANPPPEEDPTPATQSPPSPPSET-YSPP-----T
        VV V++  M E TPPGIA NPSHATCKIKKYKHCYNLVHVCPKFCP+QC VECASCKPICG    DANPPPE+       + P+PPS+T YSPP     T
Subjt:  VVAVVMAAMAEATPPGIAKNPSHATCKIKKYKHCYNLVHVCPKFCPDQCTVECASCKPICGG---DANPPPEEDPTPATQSPPSPPSET-YSPP-----T

Query:  PSPPPSTPTNPNSPSNSYSPPTPATPS-SPPSPSAGTTPSPPTISSPPPVTSTPPPS--------PVQTPPKPTPPVTSPPPYSHFPSTPPANPNPPPTP
        PSPP S P+  +SP      PTP TPS SPP P + T  +PPTIS PPPVTSTPPP+        P    P P PP  + PP +    TPP    PPP  
Subjt:  PSPPPSTPTNPNSPSNSYSPPTPATPS-SPPSPSAGTTPSPPTISSPPPVTSTPPPS--------PVQTPPKPTPPVTSPPPYSHFPSTPPANPNPPPTP

Query:  STPNSPSPS-LSSPPS--PPKYPPTAPSPPAPSAGTPSPPTVSPSTPPATTPSTPNSPSLSPPPTPSETPSSPTTNTPPPPSPTPSPPRSPPASPPRPPP
        STP S +P+  +SPP+  PP  PP  P+PP  ++  PS P  +P+T P+T PSTPN PS   P  PSETP+SP  NTP  P  TPSPP S          
Subjt:  STPNSPSPS-LSSPPS--PPKYPPTAPSPPAPSAGTPSPPTVSPSTPPATTPSTPNSPSLSPPPTPSETPSSPTTNTPPPPSPTPSPPRSPPASPPRPPP

Query:  TGVAGQPPAPLSPQSSSAGAAKRVRCKNANYPQCYNMIHTCPSACPGGCEVDCVTCKPVCHCDRPGAVCQDPRFIGGDGITFYFHGKKDRNFCLVSNPNL
                   +P SSSAGA K VRCKN NYPQCYNMIH CPSACP GC+VDCVTCKPVCHCDRPGAVCQDPR +GGDGITFYFHGKKD++FCLVS+PNL
Subjt:  TGVAGQPPAPLSPQSSSAGAAKRVRCKNANYPQCYNMIHTCPSACPGGCEVDCVTCKPVCHCDRPGAVCQDPRFIGGDGITFYFHGKKDRNFCLVSNPNL

Query:  HINAHLIGKRNQNLKRDFTWVQSLGILFGRYQLFIGAQERAAWDDSVDSLAVALNGQLVALPEAEGSQWQYPIENPSIFIVRLAPTNHVIVQAKGLFRIT
        HINAH IGKRN +LKRDFTWVQSL ILF  ++L I AQ+   WDDS+D L + L+   +ALP +EGSQ Q+PIENP+I IVRLA TNHV+V+AKGLFRIT
Subjt:  HINAHLIGKRNQNLKRDFTWVQSLGILFGRYQLFIGAQERAAWDDSVDSLAVALNGQLVALPEAEGSQWQYPIENPSIFIVRLAPTNHVIVQAKGLFRIT

Query:  VKVVPITEEDSRIHNYGITKENSFAHLDVGFKFFSLSSGVSGVLGQTYSPEYTSRINVKATMPVMGREDEFETSSLFAADCAVARFGGGG
         KVVPIT+EDSRIHNYGI + +SFAHLDVGFKFF LS  V+GVLGQTY   Y S INVKA M VMGR +EFETSSLFAADCAV+RFGG G
Subjt:  VKVVPITEEDSRIHNYGITKENSFAHLDVGFKFFSLSSGVSGVLGQTYSPEYTSRINVKATMPVMGREDEFETSSLFAADCAVARFGGGG

A0A6J1D382 formin-like protein 202.4e-19762Show/hide
Query:  LCLEMAYS------SGFSLLAPLVVAVVMAAMAEATPPGIAKNPSHATCKIKKYKHCYNLVHVCPKFCPDQCTVECASCKPICGGDANPPPEEDPTPATQ
        LCLE   S      S F LL PL V V MA M EATPPGIA NPSHATCKIKKYKHCYNLVHVCPKFCPDQCTVECASCKPICGGDAN PPE+DPTPAT 
Subjt:  LCLEMAYS------SGFSLLAPLVVAVVMAAMAEATPPGIAKNPSHATCKIKKYKHCYNLVHVCPKFCPDQCTVECASCKPICGGDANPPPEEDPTPATQ

Query:  SPPSPPSET-YSPPTPSPPPSTPTNPNSPSNSYSPPT------------------PATPSSP------PSPSAGTTPSPPTISSPPPVT--------STP
        SPPSPPSET YSPP PS PP  P  P +P  + +PP+                  PA P+SP      P+P    TPS P  S  PP T        STP
Subjt:  SPPSPPSET-YSPPTPSPPPSTPTNPNSPSNSYSPPT------------------PATPSSP------PSPSAGTTPSPPTISSPPPVT--------STP

Query:  P--PSPVQTPP-----------KPTPPVTSPPPYSHFPSTPPANPNPPPTPST---------------------PNSPSPSLSSP---------------
        P  P+P  TPP           KP PP + PP   + PSTPPA+PNPP TP T                     PN PS   +SP               
Subjt:  P--PSPVQTPP-----------KPTPPVTSPPPYSHFPSTPPANPNPPPTPST---------------------PNSPSPSLSSP---------------

Query:  ----------PSPPKYPPTAPSPPAPSAGTPSPPTV-----------------SPSTPPAT-------------TPSTPNSPSLSPPP-TPSETPSSPTT
                  P+PP  PPT+P+PP+     P+PP+                   PSTPPA+             +PSTPNSPSLSPPP TPSETPSSP+T
Subjt:  ----------PSPPKYPPTAPSPPAPSAGTPSPPTV-----------------SPSTPPAT-------------TPSTPNSPSLSPPP-TPSETPSSPTT

Query:  NTPPPPSPTPSPPRSPPASPPRPPPTGVAGQPPAPLSPQSSSAGAAKRVRCKNANYPQCYNMIHTCPSACPGGCEVDCVTCKPVCHCDRPGAVCQDPRFI
        NT  PP P PSPP S PASPPR  P G AG+P A   P SS+  A K+VRCKN NYPQCYNM+HTCPSACP GCEVDCVTCKPVCHCDRPGAVCQDPRFI
Subjt:  NTPPPPSPTPSPPRSPPASPPRPPPTGVAGQPPAPLSPQSSSAGAAKRVRCKNANYPQCYNMIHTCPSACPGGCEVDCVTCKPVCHCDRPGAVCQDPRFI

Query:  GGDGITFYFHGKKDRNFCLVSNPNLHINAHLIGKRNQNLKRDFTWVQSLGILFGRYQLFIGAQERAAWDDSVDSLAVALNGQLVALPEAEGSQWQYPIEN
        GGDGITFYFHGKKDR+FCLVS+ NLHINAHLIGKRN NLKRDFTWVQSLGIL   +Q+FIGAQ+ AAWDDSVD LAVA+NGQ VALPE+ GSQWQYP EN
Subjt:  GGDGITFYFHGKKDRNFCLVSNPNLHINAHLIGKRNQNLKRDFTWVQSLGILFGRYQLFIGAQERAAWDDSVDSLAVALNGQLVALPEAEGSQWQYPIEN

Query:  PSIFIVRLAPTNHVIVQAKGLFRITVKVVPITEEDSRIHNYGITKENSFAHLDVGFKFFSLSSGVSGVLGQTYSPEYTSRINVKATMPVMGREDEFETSS
        P+I +VRLAP N V+V+AKG+FRIT KVVPITE+DSRIHNYGITKE+SFAHLD+GFKFFSLS  VSGVLGQTY PEY SR+N+KA MPVMGRE EFETSS
Subjt:  PSIFIVRLAPTNHVIVQAKGLFRITVKVVPITEEDSRIHNYGITKENSFAHLDVGFKFFSLSSGVSGVLGQTYSPEYTSRINVKATMPVMGREDEFETSS

Query:  LFAADCAVARFG-GGGSEEEA
        LFAADCAVARFG  GGS  EA
Subjt:  LFAADCAVARFG-GGGSEEEA

A0A6J1GN96 basic proline-rich protein1.1e-18164.3Show/hide
Query:  MAYSSGFSLLAPLVVAVVMAAMAEATPPGIAKNPSHATCKIKKYKHCYNLVHVCPKFCPDQCTVECASCKPICGGDANPPPEEDPTPATQSPPSPPSETY
        MA  S F +L PLVVAV++ AMA+ATPPGIAKNPSHA+CKIKKYKHCYNL HVCPKFCPDQCTVECASCKPICGGDA+PPPE+DPTPAT SPPS     Y
Subjt:  MAYSSGFSLLAPLVVAVVMAAMAEATPPGIAKNPSHATCKIKKYKHCYNLVHVCPKFCPDQCTVECASCKPICGGDANPPPEEDPTPATQSPPSPPSETY

Query:  SPP-----TPSPPPSTPTNPNSPSNSYSPPTPATPSSPPSPSAGTTPSPPT---ISSPPPVTSTPPPSPVQTPPKPTPPVTSPPPYSHFP-------STP
        SPP     TPSPPPS PT       SYSPP P+     PSPS  T P+PP+     S PP    PP SP  +P  PTP   S PP S  P       S P
Subjt:  SPP-----TPSPPPSTPTNPNSPSNSYSPPTPATPSSPPSPSAGTTPSPPT---ISSPPPVTSTPPPSPVQTPPKPTPPVTSPPPYSHFP-------STP

Query:  PANPNPPPTPST-PNSPSPSLSSPPSPPKYPPTAPSPPAPSAGTPSPPTVSPSTPPATTPSTPNSPS-LSPPPTPSETPSS---PTTNTPPP---PSPTP
        P N NPP +P T PN P+PS  S P+PP  PPT   PP    G  +PP+  P++P   TPSTP+SP+  S PPT S  P +   P+   PPP   P   P
Subjt:  PANPNPPPTPST-PNSPSPSLSSPPSPPKYPPTAPSPPAPSAGTPSPPTVSPSTPPATTPSTPNSPS-LSPPPTPSETPSS---PTTNTPPP---PSPTP

Query:  SPPRSPPASPPRPPPTGVAGQP----------PAPLSPQSSSAGAAKRVRCKNANYPQCYNMIHTCPSACPGGCEVDCVTCKPVCHCDRPGAVCQDPRFI
        +PP +PP SP  P P+  +  P          P    P SSSAGAAKRVRCKNANYPQCYNMIHTCPSACP GC+VDCVTCKPVCHCDRPGAVCQDPRFI
Subjt:  SPPRSPPASPPRPPPTGVAGQP----------PAPLSPQSSSAGAAKRVRCKNANYPQCYNMIHTCPSACPGGCEVDCVTCKPVCHCDRPGAVCQDPRFI

Query:  GGDGITFYFHGKKDRNFCLVSNPNLHINAHLIGKRNQNLKRDFTWVQSLGILFGRYQLFIGAQERAAWDDSVDSLAVALNGQLVALPEAEGSQWQYPIEN
        GGDGITFYFHG+KD++FCLVS+PNLHINAH IGKRN +L RDFTWVQSLGILF  ++L I AQ+ A WDDS+D L +AL+   VALPE+EGSQWQ+P EN
Subjt:  GGDGITFYFHGKKDRNFCLVSNPNLHINAHLIGKRNQNLKRDFTWVQSLGILFGRYQLFIGAQERAAWDDSVDSLAVALNGQLVALPEAEGSQWQYPIEN

Query:  PSIFIVRLAPTNHVIVQAKGLFRITVKVVPITEEDSRIHNYGITKENSFAHLDVGFKFFSLSSGVSGVLGQTYSPEYTSRINVKATMPVMGREDEFETSS
        P+I IVRL   NHV+V+AKGLFRIT KVVPITEEDSR+H+YGI + +SFAHLDVGFKFF LS GV+GVLGQTY   Y S +N+KA MPVMGRE EFETSS
Subjt:  PSIFIVRLAPTNHVIVQAKGLFRITVKVVPITEEDSRIHNYGITKENSFAHLDVGFKFFSLSSGVSGVLGQTYSPEYTSRINVKATMPVMGREDEFETSS

Query:  LFAADCAVARFGGGGSEEE
        LFAADCAVARFG  G +++
Subjt:  LFAADCAVARFGGGGSEEE

A0A6J1I4T7 mucin-22.8e-18263.62Show/hide
Query:  MAYSSGFSLLAPLVVAVVMAAMAEATPPGIAKNPSHATCKIKKYKHCYNLVHVCPKFCPDQCTVECASCKPICGGDANPPPEEDPTPATQSPPSPPSETY
        MA  S F +L PLVVAV++  MA+ATPPGIAKNPSHA+CKIKKYKHCYNL HVCPKFCPDQCTVECASCKPICGGDANPPPE+DPTPAT SPPS     Y
Subjt:  MAYSSGFSLLAPLVVAVVMAAMAEATPPGIAKNPSHATCKIKKYKHCYNLVHVCPKFCPDQCTVECASCKPICGGDANPPPEEDPTPATQSPPSPPSETY

Query:  SPP-----TPSPPPSTPTNPNSPSNSYSPPTPATPSSPPSPSAGTTPSPPTISSPPPVTSTPPPSPVQTPPKPTPPVTSPPPYSHFPSTPPANPNPPPTP
        SPP     TPSPPPS PT       SYSPP P+     PSPS  T P+PP   S PP  S PP +  Q PP  +PP +  PP    PSTPPANPNPP +P
Subjt:  SPP-----TPSPPPSTPTNPNSPSNSYSPPTPATPSSPPSPSAGTTPSPPTISSPPPVTSTPPPSPVQTPPKPTPPVTSPPPYSHFPSTPPANPNPPPTP

Query:  ST-----------------PNSPSPSLSSPPSPPKYPPTAPSPPAPSAGTPSPPTVSPSTPPATTPSTPNSPSLSP----PPTPSETPSSPT----TNTP
         T                 PN P+PS  S P+PP  PPT   PP  +   PS P  SP+ P  +TPS PN PS  P    PP     PS+P     +N P
Subjt:  ST-----------------PNSPSPSLSSPPSPPKYPPTAPSPPAPSAGTPSPPTVSPSTPPATTPSTPNSPSLSP----PPTPSETPSSPT----TNTP

Query:  PPPSPTPSPPRSPPASPPRPPPTGVAGQP----------PAPLSPQSSSAGAAKRVRCKNANYPQCYNMIHTCPSACPGGCEVDCVTCKPVCHCDRPGAV
          P+P  +PP +PP SP  P P+  +  P          P    P SSS GAAKRVRCKNANYPQCYNMIHTCPSACP GC+VDCVTCKPVCHCDRPGAV
Subjt:  PPPSPTPSPPRSPPASPPRPPPTGVAGQP----------PAPLSPQSSSAGAAKRVRCKNANYPQCYNMIHTCPSACPGGCEVDCVTCKPVCHCDRPGAV

Query:  CQDPRFIGGDGITFYFHGKKDRNFCLVSNPNLHINAHLIGKRNQNLKRDFTWVQSLGILFGRYQLFIGAQERAAWDDSVDSLAVALNGQLVALPEAEGSQ
        CQDPRFIGGDGITFYFHG+KD++FCLVS+PNLHINAH IGKRN +L RDFTWVQSLGILF  ++L I AQ+ A WDDS+D L +ALN   VALPE+EGSQ
Subjt:  CQDPRFIGGDGITFYFHGKKDRNFCLVSNPNLHINAHLIGKRNQNLKRDFTWVQSLGILFGRYQLFIGAQERAAWDDSVDSLAVALNGQLVALPEAEGSQ

Query:  WQYPIENPSIFIVRLAPTNHVIVQAKGLFRITVKVVPITEEDSRIHNYGITKENSFAHLDVGFKFFSLSSGVSGVLGQTYSPEYTSRINVKATMPVMGRE
        WQ+P ENP++ IVRL   NHV+V+AKGLFRIT KVVPITEEDSR+H+YGI + +SFAHLDVGFKFF LS GV+GVLGQTY   Y S +N+KA MPVMGRE
Subjt:  WQYPIENPSIFIVRLAPTNHVIVQAKGLFRITVKVVPITEEDSRIHNYGITKENSFAHLDVGFKFFSLSSGVSGVLGQTYSPEYTSRINVKATMPVMGRE

Query:  DEFETSSLFAADCAVARFGGGGSE
         EFETSSLFAADCAVA+FGG G +
Subjt:  DEFETSSLFAADCAVARFGGGGSE

SwissProt top hitse value%identityAlignment
Q0J6H8 Cyclase-like protein 31.1e-10669.85Show/hide
Query:  ILPRPLMLFALLQVLLAGASNIAYPSVVDTGATDCSLSDGGDGGLTPVRREVYDNGRIIDISHRFTADMPSWESDKGLGQFLWLPKSMKNGSLANNSEMK
        + P  L+L  LL  L A A+  A+P+  +     C+ +        P RRE +  GRI+DI+H +  DMPSWESD G+GQFLWLP SM+NGS ANNSEM+
Subjt:  ILPRPLMLFALLQVLLAGASNIAYPSVVDTGATDCSLSDGGDGGLTPVRREVYDNGRIIDISHRFTADMPSWESDKGLGQFLWLPKSMKNGSLANNSEMK

Query:  LPAHTGTHVDAPGHVFDHYFDAGFDVDTLDLEVLNGPGLLIDVPRDKNITAEVMKSLNIPKGVRRVLFRTLNTDRRLMWKREFDTSYVGFMKDGAKWLVE
        LP HTGTHVDAPGHVF HYFDAGFDVD+LDLEVLNG  LL+DVPRD NITA++M+SL+IPKG++RVLFRTLNTDR+LMWK+EFDTSYVGFM+DGA+WLV+
Subjt:  LPAHTGTHVDAPGHVFDHYFDAGFDVDTLDLEVLNGPGLLIDVPRDKNITAEVMKSLNIPKGVRRVLFRTLNTDRRLMWKREFDTSYVGFMKDGAKWLVE

Query:  NTDIKLVGIDYLSVAAFDDLIPSHLVFLEGREIIIVEGLKLDDVQPGLYSIHCLPLRLLGAEGSPIRCILIK
        NTDIKLVGIDYLSVAAFDDLIPSHLV L+ R+II+VEGLKL+++ PG+YS+HCLPLRL GAEGSPIRCILIK
Subjt:  NTDIKLVGIDYLSVAAFDDLIPSHLVFLEGREIIIVEGLKLDDVQPGLYSIHCLPLRLLGAEGSPIRCILIK

Q6YX89 Cyclase-like protein 45.0e-9664.34Show/hide
Query:  PRPLMLFALLQVLLAGASNIAYPSVVDTGATDCSLSDGGDGGLTPVRREVYDNGRIIDISHRFTADMPSWESDKGL-GQFLWLPKSMKNGS-LANNSEMK
        P PL L  LL  + A     A+P   +   + C ++          RRE +D GRI+DISH +  +MP WES  G  G FL L +SM+NGS +AN SE++
Subjt:  PRPLMLFALLQVLLAGASNIAYPSVVDTGATDCSLSDGGDGGLTPVRREVYDNGRIIDISHRFTADMPSWESDKGL-GQFLWLPKSMKNGS-LANNSEMK

Query:  LPAHTGTHVDAPGHVFDHYFDAGFDVDTLDLEVLNGPGLLIDVPRDKNITAEVMKSLNIPKGVRRVLFRTLNTDRRLMWKREFDTSYVGFMKDGAKWLVE
        L AH+GTHVDAPGHVFDHY+ AGFDVDTLDL +LNGP LL+DVPRD NITA VM+SL+IPKGVRRVLFRTLNTDR+LMWK+EFDTSYVGFMKDGA+WL++
Subjt:  LPAHTGTHVDAPGHVFDHYFDAGFDVDTLDLEVLNGPGLLIDVPRDKNITAEVMKSLNIPKGVRRVLFRTLNTDRRLMWKREFDTSYVGFMKDGAKWLVE

Query:  NTDIKLVGIDYLSVAAFDDLIPSHLVFLEGREIIIVEGLKLDDVQPGLYSIHCLPLRLLGAEGSPIRCILIK
        NTDI+LVG+DYLSV AFD+ IP+HLVFLE RE+I+VE L L+ V PG+Y++HCLPLRL G+EGSP RCILIK
Subjt:  NTDIKLVGIDYLSVAAFDDLIPSHLVFLEGREIIIVEGLKLDDVQPGLYSIHCLPLRLLGAEGSPIRCILIK

Q93V74 Cyclase-like protein 12.0e-9270.93Show/hide
Query:  PVRREVYDNGRIIDISHRFTADMPSWESDKGLGQ-FLWLPKSMKNGSLANNSEMKLPAHTGTHVDAPGHVFDHYFDAGFDVDTLDLEVLNGPGLLIDVPR
        P+RREVY+ G+I DISHR+T ++P+WES +GLG+ FL L  SMKNGS AN SEMKL  H+GTHVDAPGH +D+Y+DAGFD D+LDL+VLNGP LL+DVPR
Subjt:  PVRREVYDNGRIIDISHRFTADMPSWESDKGLGQ-FLWLPKSMKNGSLANNSEMKLPAHTGTHVDAPGHVFDHYFDAGFDVDTLDLEVLNGPGLLIDVPR

Query:  DKNITAEVMKSLNIPKGVRRVLFRTLNTDRRLMWKREFDTSYVGFMKDGAKWLVENTDIKLVGIDYLSVAAFDDLIPSHLVFLEGREIIIVEGLKLDDVQ
        DKNITAEVM+SL+I +GVRRVLFRT NTD+RLM+K+EFD+S+ GFM DGAKWLVENTDIKL+G+DYLS AAF++   +H V L+GR+II VE LKLD V+
Subjt:  DKNITAEVMKSLNIPKGVRRVLFRTLNTDRRLMWKREFDTSYVGFMKDGAKWLVENTDIKLVGIDYLSVAAFDDLIPSHLVFLEGREIIIVEGLKLDDVQ

Query:  PGLYSIHCLPLRLLGAEGSPIRCILIK
         G YS+HCLPLRL+GAEG+P RCILIK
Subjt:  PGLYSIHCLPLRLLGAEGSPIRCILIK

Q94JT5 Cyclase-like protein 21.5e-11172.83Show/hide
Query:  MILPRPLMLFALLQ----VLLAGASNIAYPSVVDTGATDCSLSDGGDGGLTPVRREVYDNGRIIDISHRFTADMPSWESDKGLGQFLWLPKSMKNGSLAN
        M +P    L  LL     ++ AGASN AYPS+  T   D   +D     L P+RREVY NG+I DISHR+T +MPSW+S +G+G+FLWL  SMKNGSLAN
Subjt:  MILPRPLMLFALLQ----VLLAGASNIAYPSVVDTGATDCSLSDGGDGGLTPVRREVYDNGRIIDISHRFTADMPSWESDKGLGQFLWLPKSMKNGSLAN

Query:  NSEMKLPAHTGTHVDAPGHVFDHYFDAGFDVDTLDLEVLNGPGLLIDVPRDKNITAEVMKSLNIPKGVRRVLFRTLNTDRRLMWKREFDTSYVGFMKDGA
        NSEMK+P HTGTHVD+PGHV+D Y+DAGFDVD+LDL+VLNG  LL+DVP+DKNITAEVMKSL+IPKGV RVLFRTLNTDRRLM+K+EFDTSYVGFMKDGA
Subjt:  NSEMKLPAHTGTHVDAPGHVFDHYFDAGFDVDTLDLEVLNGPGLLIDVPRDKNITAEVMKSLNIPKGVRRVLFRTLNTDRRLMWKREFDTSYVGFMKDGA

Query:  KWLVENTDIKLVGIDYLSVAAFDDLIPSHLVFLEGREIIIVEGLKLDDVQPGLYSIHCLPLRLLGAEGSPIRCILI
        +WLV+NTDIKLVGIDYLSVAA+DDLIPSHLVFL+ RE I+VEGLKLD V+ GLYS+HCLPLRL+GAEGSPIRCILI
Subjt:  KWLVENTDIKLVGIDYLSVAAFDDLIPSHLVFLEGREIIIVEGLKLDDVQPGLYSIHCLPLRLLGAEGSPIRCILI

Q94LA9 Cyclase-like protein 31.4e-9064.54Show/hide
Query:  AYPSVVDTGATDCSLSDGGDGGLTPVRREVYDNGR-IIDISHRFTADMPSWESDKGLGQFLWLPKSMKNGSLANNSEMKLPAHTGTHVDAPGHVFDHYFD
        A+PS+     T  S++      + P+  EVYD  R I DISH++T ++P WES +GLG FL L  SMKNGS AN S+M+L  H+GTHVDAPGH  DHY++
Subjt:  AYPSVVDTGATDCSLSDGGDGGLTPVRREVYDNGR-IIDISHRFTADMPSWESDKGLGQFLWLPKSMKNGSLANNSEMKLPAHTGTHVDAPGHVFDHYFD

Query:  AGFDVDTLDLEVLNGPGLLIDVPRDKNITAEVMKSLNIPKGVRRVLFRTLNTDRRLMWKREFDTSYVGFMKDGAKWLVENTDIKLVGIDYLSVAAFDDLI
        +GFD D+LDL++LNGP LL+DVPRDKNI+AEVMKSL+IP+G+RRVLF+TLNTDRRLM+K+EFD+S+VGFM DGAKWLVENTDIKLVG+DYLS AA+D+  
Subjt:  AGFDVDTLDLEVLNGPGLLIDVPRDKNITAEVMKSLNIPKGVRRVLFRTLNTDRRLMWKREFDTSYVGFMKDGAKWLVENTDIKLVGIDYLSVAAFDDLI

Query:  PSHLVFLEGREIIIVEGLKLDDVQPGLYSIHCLPLRLLGAEGSPIRCILIK
         +H   LE R+II VE LKLDDV+ G+Y++HCLPLRL+GAEG+P RCILIK
Subjt:  PSHLVFLEGREIIIVEGLKLDDVQPGLYSIHCLPLRLLGAEGSPIRCILIK

Arabidopsis top hitse value%identityAlignment
AT1G44542.1 Cyclase family protein1.0e-9164.54Show/hide
Query:  AYPSVVDTGATDCSLSDGGDGGLTPVRREVYDNGR-IIDISHRFTADMPSWESDKGLGQFLWLPKSMKNGSLANNSEMKLPAHTGTHVDAPGHVFDHYFD
        A+PS+     T  S++      + P+  EVYD  R I DISH++T ++P WES +GLG FL L  SMKNGS AN S+M+L  H+GTHVDAPGH  DHY++
Subjt:  AYPSVVDTGATDCSLSDGGDGGLTPVRREVYDNGR-IIDISHRFTADMPSWESDKGLGQFLWLPKSMKNGSLANNSEMKLPAHTGTHVDAPGHVFDHYFD

Query:  AGFDVDTLDLEVLNGPGLLIDVPRDKNITAEVMKSLNIPKGVRRVLFRTLNTDRRLMWKREFDTSYVGFMKDGAKWLVENTDIKLVGIDYLSVAAFDDLI
        +GFD D+LDL++LNGP LL+DVPRDKNI+AEVMKSL+IP+G+RRVLF+TLNTDRRLM+K+EFD+S+VGFM DGAKWLVENTDIKLVG+DYLS AA+D+  
Subjt:  AGFDVDTLDLEVLNGPGLLIDVPRDKNITAEVMKSLNIPKGVRRVLFRTLNTDRRLMWKREFDTSYVGFMKDGAKWLVENTDIKLVGIDYLSVAAFDDLI

Query:  PSHLVFLEGREIIIVEGLKLDDVQPGLYSIHCLPLRLLGAEGSPIRCILIK
         +H   LE R+II VE LKLDDV+ G+Y++HCLPLRL+GAEG+P RCILIK
Subjt:  PSHLVFLEGREIIIVEGLKLDDVQPGLYSIHCLPLRLLGAEGSPIRCILIK

AT3G19430.1 late embryogenesis abundant protein-related / LEA protein-related1.2e-12954.16Show/hide
Query:  TPPGIAKNPSHATCKIKKYKHCYNLVHVCPKFCPDQCTVECASCKPICGGDANPPPEEDPTPATQSPPSPPSETYSPPTPSPPPSTPTNPNSPSNSYSPP
        TPPGIAKNPSHATCKIKKYKHCYNL HVCPKFCPD C VECASCKPICG    PP   D      S        Y+PP P PP S P  P +P    S P
Subjt:  TPPGIAKNPSHATCKIKKYKHCYNLVHVCPKFCPDQCTVECASCKPICGGDANPPPEEDPTPATQSPPSPPSETYSPPTPSPPPSTPTNPNSPSNSYSPP

Query:  TPATPSSPPSPSAGTTPSPPTISSPPPVTSTPPPSPVQTPPKPTPPVTSPPPYSHFPS----TPPANPNPP-PTPSTPNSPSPSLSSPPSPPKYPPTAPS
        +P  P SPP P    TP+ P++ SP P  S PPP+P  + P PTPPV SPPP +  PS    TPP +P PP PTPS P SP+P + + P P   PP +P 
Subjt:  TPATPSSPPSPSAGTTPSPPTISSPPPVTSTPPPSPVQTPPKPTPPVTSPPPYSHFPS----TPPANPNPP-PTPSTPNSPSPSLSSPPSPPKYPPTAPS

Query:  PPAPSAGTPSPPTVSPSTPPATTPSTPNSPSLSP-PPTPSETPSSPTTNTPPPPSPTPSPPRSPPASPPRPPPTGVAGQPPAPLSPQSSSAGAAKRVRCK
        PP P+   PSPP V+P TPP  TPS P+ P ++P PPTPS       T TPP P   P+P  SPP  PP               S +  +AG AKRVRCK
Subjt:  PPAPSAGTPSPPTVSPSTPPATTPSTPNSPSLSP-PPTPSETPSSPTTNTPPPPSPTPSPPRSPPASPPRPPPTGVAGQPPAPLSPQSSSAGAAKRVRCK

Query:  NANYPQCYNMIHTCPSACPGGCEVDCVTCKPVCHCDRPGAVCQDPRFIGGDGITFYFHGKKDRNFCLVSNPNLHINAHLIGKRNQNLKRDFTWVQSLGIL
            P CY + +TCP+ CP  C+VDCVTCKPVC+CD+PG+VCQDPRFIGGDG+TFYFHGKKD NFCL+S+PNLHINAH IGKR   + RDFTWVQS+ IL
Subjt:  NANYPQCYNMIHTCPSACPGGCEVDCVTCKPVCHCDRPGAVCQDPRFIGGDGITFYFHGKKDRNFCLVSNPNLHINAHLIGKRNQNLKRDFTWVQSLGIL

Query:  FGRYQLFIGAQERAAWDDSVDSLAVALNGQLVALPEAEGSQW-QYPIENPSIFIVRL-APTNHVIVQAKGLFRITVKVVPITEEDSRIHNYGITKENSFA
        FG ++L++GA + A WDDSVD +AV+ +G +++LP+ +G++W   P   P + + R+   TN++ V+ +GL +IT +VVPIT EDSRIH Y + +++  A
Subjt:  FGRYQLFIGAQERAAWDDSVDSLAVALNGQLVALPEAEGSQW-QYPIENPSIFIVRL-APTNHVIVQAKGLFRITVKVVPITEEDSRIHNYGITKENSFA

Query:  HLDVGFKFFSLSSGVSGVLGQTYSPEYTSRINVKATMPVMGREDEFETSSLFAADCAVARFGGGG
        HLD+GFKF  LS  V GVLGQTY   Y SR+ +   MPVMG + EF+T+ LFA DC+ ARF G G
Subjt:  HLDVGFKFFSLSSGVSGVLGQTYSPEYTSRINVKATMPVMGREDEFETSSLFAADCAVARFGGGG

AT4G34180.1 Cyclase family protein1.4e-9370.93Show/hide
Query:  PVRREVYDNGRIIDISHRFTADMPSWESDKGLGQ-FLWLPKSMKNGSLANNSEMKLPAHTGTHVDAPGHVFDHYFDAGFDVDTLDLEVLNGPGLLIDVPR
        P+RREVY+ G+I DISHR+T ++P+WES +GLG+ FL L  SMKNGS AN SEMKL  H+GTHVDAPGH +D+Y+DAGFD D+LDL+VLNGP LL+DVPR
Subjt:  PVRREVYDNGRIIDISHRFTADMPSWESDKGLGQ-FLWLPKSMKNGSLANNSEMKLPAHTGTHVDAPGHVFDHYFDAGFDVDTLDLEVLNGPGLLIDVPR

Query:  DKNITAEVMKSLNIPKGVRRVLFRTLNTDRRLMWKREFDTSYVGFMKDGAKWLVENTDIKLVGIDYLSVAAFDDLIPSHLVFLEGREIIIVEGLKLDDVQ
        DKNITAEVM+SL+I +GVRRVLFRT NTD+RLM+K+EFD+S+ GFM DGAKWLVENTDIKL+G+DYLS AAF++   +H V L+GR+II VE LKLD V+
Subjt:  DKNITAEVMKSLNIPKGVRRVLFRTLNTDRRLMWKREFDTSYVGFMKDGAKWLVENTDIKLVGIDYLSVAAFDDLIPSHLVFLEGREIIIVEGLKLDDVQ

Query:  PGLYSIHCLPLRLLGAEGSPIRCILIK
         G YS+HCLPLRL+GAEG+P RCILIK
Subjt:  PGLYSIHCLPLRLLGAEGSPIRCILIK

AT4G35220.1 Cyclase family protein1.0e-11272.83Show/hide
Query:  MILPRPLMLFALLQ----VLLAGASNIAYPSVVDTGATDCSLSDGGDGGLTPVRREVYDNGRIIDISHRFTADMPSWESDKGLGQFLWLPKSMKNGSLAN
        M +P    L  LL     ++ AGASN AYPS+  T   D   +D     L P+RREVY NG+I DISHR+T +MPSW+S +G+G+FLWL  SMKNGSLAN
Subjt:  MILPRPLMLFALLQ----VLLAGASNIAYPSVVDTGATDCSLSDGGDGGLTPVRREVYDNGRIIDISHRFTADMPSWESDKGLGQFLWLPKSMKNGSLAN

Query:  NSEMKLPAHTGTHVDAPGHVFDHYFDAGFDVDTLDLEVLNGPGLLIDVPRDKNITAEVMKSLNIPKGVRRVLFRTLNTDRRLMWKREFDTSYVGFMKDGA
        NSEMK+P HTGTHVD+PGHV+D Y+DAGFDVD+LDL+VLNG  LL+DVP+DKNITAEVMKSL+IPKGV RVLFRTLNTDRRLM+K+EFDTSYVGFMKDGA
Subjt:  NSEMKLPAHTGTHVDAPGHVFDHYFDAGFDVDTLDLEVLNGPGLLIDVPRDKNITAEVMKSLNIPKGVRRVLFRTLNTDRRLMWKREFDTSYVGFMKDGA

Query:  KWLVENTDIKLVGIDYLSVAAFDDLIPSHLVFLEGREIIIVEGLKLDDVQPGLYSIHCLPLRLLGAEGSPIRCILI
        +WLV+NTDIKLVGIDYLSVAA+DDLIPSHLVFL+ RE I+VEGLKLD V+ GLYS+HCLPLRL+GAEGSPIRCILI
Subjt:  KWLVENTDIKLVGIDYLSVAAFDDLIPSHLVFLEGREIIIVEGLKLDDVQPGLYSIHCLPLRLLGAEGSPIRCILI

AT5G60520.1 Late embryogenesis abundant (LEA) protein-related2.3e-6443.46Show/hide
Query:  KRVRCKNANYPQCYNMIHTCPSACP----------GGCEVDC-----VTCK-PVCHCDRPGAVCQDPRFIGGDGITFYFHGKKDRNFCLVSNPNLHINAH
        +RV+C       C   I TCP  CP            C +DC     VTCK    +C+  G++C DPRF+GGDG+ FYFHG KD NF +VS+ NL INAH
Subjt:  KRVRCKNANYPQCYNMIHTCPSACP----------GGCEVDC-----VTCK-PVCHCDRPGAVCQDPRFIGGDGITFYFHGKKDRNFCLVSNPNLHINAH

Query:  LIGKRNQNLKRDFTWVQSLGILFGRYQLFIGAQERAAWDDSVDSLAVALNGQLVALPEAEGSQWQYPIENPSIFIVRLAPTNHVIVQAKGLFRITVKVVP
         IG R     RDFTWVQ+  ++F  + L I A++ A+WDDSVDSL V  NG+ V +P    ++W+  ++   + + R    N+V V   G+ +I ++V P
Subjt:  LIGKRNQNLKRDFTWVQSLGILFGRYQLFIGAQERAAWDDSVDSLAVALNGQLVALPEAEGSQWQYPIENPSIFIVRLAPTNHVIVQAKGLFRITVKVVP

Query:  ITEEDSRIHNYGITKENSFAHLDVGFKFFSLSSGVSGVLGQTYSPEYTSRINVKATMPVMGREDEFETSSLFAADCAVARFGG
        I +E+ R+H Y + K+++FAHL+  FKFF+LS  V GVLG+TY P Y S +     MP+MG ED+++T SLF+  C V RF G
Subjt:  ITEEDSRIHNYGITKENSFAHLDVGFKFFSLSSGVSGVLGQTYSPEYTSRINVKATMPVMGREDEFETSSLFAADCAVARFGG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTCTCCCACGTCCTTTGATGCTCTTCGCCTTGCTTCAGGTCCTTCTCGCCGGCGCTTCCAACATAGCCTACCCCTCCGTGGTCGATACCGGAGCCACCGATTGCTC
TCTTTCCGACGGCGGCGATGGCGGCCTCACTCCTGTTCGTCGAGAGGTGTATGATAATGGACGAATCATCGACATCAGTCATAGGTTCACTGCAGATATGCCGTCATGGG
AATCTGACAAAGGGCTGGGCCAGTTCCTCTGGCTTCCGAAGAGCATGAAGAACGGCTCGCTTGCTAACAATTCGGAAATGAAGCTACCAGCTCACACTGGAACCCACGTC
GATGCGCCTGGTCATGTTTTCGATCACTACTTCGACGCCGGCTTCGATGTCGACACACTCGACCTCGAAGTCCTTAATGGTCCTGGACTGTTAATAGACGTTCCAAGGGA
TAAGAACATTACTGCTGAGGTCATGAAGTCTTTGAATATTCCCAAAGGAGTGCGTCGTGTACTCTTCAGAACATTAAATACAGACAGACGTCTCATGTGGAAAAGAGAGT
TTGACACGAGCTATGTGGGATTTATGAAGGATGGAGCAAAATGGCTGGTAGAGAACACTGATATCAAACTTGTTGGAATTGACTACTTATCAGTTGCTGCCTTTGACGAT
CTTATTCCATCTCATCTAGTATTTCTAGAAGGCAGGGAAATCATTATCGTCGAAGGTTTAAAGCTTGACGATGTCCAGCCGGGGTTATATTCGATCCATTGCTTACCTCT
TAGGTTGCTTGGTGCCGAGGGGTCGCCAATAAGATGCATTCTAATTAAGTATTTTTTGCTATGGCCACCTCAAACTCGTGAAGAAGTTGAAGTTCCTGTGTTTTTAGGGT
CTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTCTGTGCTTGGAAATGGCGTATTCATCTGGGTTTTCGCTTCTGGCGCCGCTGGTTGTGGCGGTGGTGATGGCGGCAATG
GCGGAGGCAACGCCGCCGGGCATTGCTAAGAATCCGAGCCATGCAACGTGCAAGATTAAGAAGTATAAACACTGTTATAATTTGGTTCATGTTTGCCCCAAGTTTTGCCC
TGATCAATGTACTGTTGAATGTGCCTCTTGTAAGCCTATTTGTGGTGGCGATGCCAATCCTCCGCCGGAGGAAGATCCTACTCCGGCCACCCAGTCGCCGCCGTCTCCTC
CCTCAGAAACTTACTCGCCTCCTACTCCGAGCCCCCCACCTTCAACTCCGACGAACCCAAATTCTCCTTCCAATTCGTATTCTCCACCAACACCGGCGACTCCGTCGTCT
CCGCCGTCTCCATCTGCCGGAACTACTCCAAGCCCCCCAACTATTTCATCACCTCCTCCAGTTACTTCAACGCCACCGCCATCACCTGTTCAAACCCCGCCTAAACCTAC
CCCGCCTGTGACCTCACCGCCGCCCTATTCACATTTTCCCTCAACGCCTCCGGCGAATCCCAACCCTCCGCCGACACCATCCACTCCAAACTCTCCATCGCCTTCACTGT
CGTCTCCACCTTCTCCACCAAAGTATCCTCCAACTGCACCATCTCCGCCAGCTCCCTCGGCCGGAACTCCAAGCCCTCCTACTGTTTCTCCGTCAACGCCTCCAGCTACA
ACCCCATCCACTCCAAATTCTCCATCGCTTTCGCCTCCACCGACACCATCAGAAACTCCCAGCTCCCCAACGACTAATACTCCTCCTCCGCCTTCTCCAACACCATCTCC
ACCACGTTCTCCACCAGCTTCTCCACCACGTCCACCGCCTACCGGTGTTGCTGGCCAACCACCAGCTCCATTATCTCCACAGTCTTCTTCGGCCGGCGCAGCTAAGAGAG
TCAGATGCAAAAATGCGAATTATCCTCAATGTTATAACATGATTCACACTTGTCCCAGCGCTTGCCCTGGTGGATGCGAAGTTGATTGCGTGACTTGCAAACCTGTCTGC
CATTGTGACAGACCAGGGGCAGTGTGCCAAGATCCTCGTTTCATCGGTGGCGATGGCATCACCTTCTACTTCCACGGCAAGAAAGACCGTAATTTCTGTCTTGTTTCCAA
TCCCAACCTCCATATCAACGCCCATCTGATCGGAAAACGAAACCAGAACTTAAAAAGAGACTTCACCTGGGTTCAATCCCTCGGAATCCTCTTCGGCAGATACCAGCTCT
TCATAGGCGCCCAAGAAAGGGCCGCCTGGGATGATTCCGTCGACAGCCTCGCCGTCGCCCTGAACGGGCAGCTGGTCGCCCTCCCGGAAGCTGAAGGCAGCCAGTGGCAG
TACCCCATCGAAAATCCGAGCATCTTCATCGTCCGGCTTGCTCCGACCAACCACGTCATTGTGCAAGCCAAAGGACTGTTCAGAATCACCGTCAAGGTGGTGCCGATAAC
CGAAGAAGACTCACGGATTCACAACTATGGAATAACGAAAGAAAATTCGTTTGCCCATTTGGACGTGGGGTTCAAATTCTTCTCGTTGAGCAGCGGAGTGAGCGGCGTGT
TGGGCCAGACGTACAGCCCTGAGTACACAAGTCGCATAAATGTGAAGGCCACGATGCCGGTGATGGGCAGGGAGGATGAGTTTGAGACGTCGAGCCTGTTTGCGGCGGAC
TGCGCGGTGGCCAGATTTGGCGGCGGCGGCTCTGAGGAGGAGGCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGATTCTCCCACGTCCTTTGATGCTCTTCGCCTTGCTTCAGGTCCTTCTCGCCGGCGCTTCCAACATAGCCTACCCCTCCGTGGTCGATACCGGAGCCACCGATTGCTC
TCTTTCCGACGGCGGCGATGGCGGCCTCACTCCTGTTCGTCGAGAGGTGTATGATAATGGACGAATCATCGACATCAGTCATAGGTTCACTGCAGATATGCCGTCATGGG
AATCTGACAAAGGGCTGGGCCAGTTCCTCTGGCTTCCGAAGAGCATGAAGAACGGCTCGCTTGCTAACAATTCGGAAATGAAGCTACCAGCTCACACTGGAACCCACGTC
GATGCGCCTGGTCATGTTTTCGATCACTACTTCGACGCCGGCTTCGATGTCGACACACTCGACCTCGAAGTCCTTAATGGTCCTGGACTGTTAATAGACGTTCCAAGGGA
TAAGAACATTACTGCTGAGGTCATGAAGTCTTTGAATATTCCCAAAGGAGTGCGTCGTGTACTCTTCAGAACATTAAATACAGACAGACGTCTCATGTGGAAAAGAGAGT
TTGACACGAGCTATGTGGGATTTATGAAGGATGGAGCAAAATGGCTGGTAGAGAACACTGATATCAAACTTGTTGGAATTGACTACTTATCAGTTGCTGCCTTTGACGAT
CTTATTCCATCTCATCTAGTATTTCTAGAAGGCAGGGAAATCATTATCGTCGAAGGTTTAAAGCTTGACGATGTCCAGCCGGGGTTATATTCGATCCATTGCTTACCTCT
TAGGTTGCTTGGTGCCGAGGGGTCGCCAATAAGATGCATTCTAATTAAGTATTTTTTGCTATGGCCACCTCAAACTCGTGAAGAAGTTGAAGTTCCTGTGTTTTTAGGGT
CTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTCTGTGCTTGGAAATGGCGTATTCATCTGGGTTTTCGCTTCTGGCGCCGCTGGTTGTGGCGGTGGTGATGGCGGCAATG
GCGGAGGCAACGCCGCCGGGCATTGCTAAGAATCCGAGCCATGCAACGTGCAAGATTAAGAAGTATAAACACTGTTATAATTTGGTTCATGTTTGCCCCAAGTTTTGCCC
TGATCAATGTACTGTTGAATGTGCCTCTTGTAAGCCTATTTGTGGTGGCGATGCCAATCCTCCGCCGGAGGAAGATCCTACTCCGGCCACCCAGTCGCCGCCGTCTCCTC
CCTCAGAAACTTACTCGCCTCCTACTCCGAGCCCCCCACCTTCAACTCCGACGAACCCAAATTCTCCTTCCAATTCGTATTCTCCACCAACACCGGCGACTCCGTCGTCT
CCGCCGTCTCCATCTGCCGGAACTACTCCAAGCCCCCCAACTATTTCATCACCTCCTCCAGTTACTTCAACGCCACCGCCATCACCTGTTCAAACCCCGCCTAAACCTAC
CCCGCCTGTGACCTCACCGCCGCCCTATTCACATTTTCCCTCAACGCCTCCGGCGAATCCCAACCCTCCGCCGACACCATCCACTCCAAACTCTCCATCGCCTTCACTGT
CGTCTCCACCTTCTCCACCAAAGTATCCTCCAACTGCACCATCTCCGCCAGCTCCCTCGGCCGGAACTCCAAGCCCTCCTACTGTTTCTCCGTCAACGCCTCCAGCTACA
ACCCCATCCACTCCAAATTCTCCATCGCTTTCGCCTCCACCGACACCATCAGAAACTCCCAGCTCCCCAACGACTAATACTCCTCCTCCGCCTTCTCCAACACCATCTCC
ACCACGTTCTCCACCAGCTTCTCCACCACGTCCACCGCCTACCGGTGTTGCTGGCCAACCACCAGCTCCATTATCTCCACAGTCTTCTTCGGCCGGCGCAGCTAAGAGAG
TCAGATGCAAAAATGCGAATTATCCTCAATGTTATAACATGATTCACACTTGTCCCAGCGCTTGCCCTGGTGGATGCGAAGTTGATTGCGTGACTTGCAAACCTGTCTGC
CATTGTGACAGACCAGGGGCAGTGTGCCAAGATCCTCGTTTCATCGGTGGCGATGGCATCACCTTCTACTTCCACGGCAAGAAAGACCGTAATTTCTGTCTTGTTTCCAA
TCCCAACCTCCATATCAACGCCCATCTGATCGGAAAACGAAACCAGAACTTAAAAAGAGACTTCACCTGGGTTCAATCCCTCGGAATCCTCTTCGGCAGATACCAGCTCT
TCATAGGCGCCCAAGAAAGGGCCGCCTGGGATGATTCCGTCGACAGCCTCGCCGTCGCCCTGAACGGGCAGCTGGTCGCCCTCCCGGAAGCTGAAGGCAGCCAGTGGCAG
TACCCCATCGAAAATCCGAGCATCTTCATCGTCCGGCTTGCTCCGACCAACCACGTCATTGTGCAAGCCAAAGGACTGTTCAGAATCACCGTCAAGGTGGTGCCGATAAC
CGAAGAAGACTCACGGATTCACAACTATGGAATAACGAAAGAAAATTCGTTTGCCCATTTGGACGTGGGGTTCAAATTCTTCTCGTTGAGCAGCGGAGTGAGCGGCGTGT
TGGGCCAGACGTACAGCCCTGAGTACACAAGTCGCATAAATGTGAAGGCCACGATGCCGGTGATGGGCAGGGAGGATGAGTTTGAGACGTCGAGCCTGTTTGCGGCGGAC
TGCGCGGTGGCCAGATTTGGCGGCGGCGGCTCTGAGGAGGAGGCTTGA
Protein sequenceShow/hide protein sequence
MILPRPLMLFALLQVLLAGASNIAYPSVVDTGATDCSLSDGGDGGLTPVRREVYDNGRIIDISHRFTADMPSWESDKGLGQFLWLPKSMKNGSLANNSEMKLPAHTGTHV
DAPGHVFDHYFDAGFDVDTLDLEVLNGPGLLIDVPRDKNITAEVMKSLNIPKGVRRVLFRTLNTDRRLMWKREFDTSYVGFMKDGAKWLVENTDIKLVGIDYLSVAAFDD
LIPSHLVFLEGREIIIVEGLKLDDVQPGLYSIHCLPLRLLGAEGSPIRCILIKYFLLWPPQTREEVEVPVFLGSSSSSSSSSSSLCLEMAYSSGFSLLAPLVVAVVMAAM
AEATPPGIAKNPSHATCKIKKYKHCYNLVHVCPKFCPDQCTVECASCKPICGGDANPPPEEDPTPATQSPPSPPSETYSPPTPSPPPSTPTNPNSPSNSYSPPTPATPSS
PPSPSAGTTPSPPTISSPPPVTSTPPPSPVQTPPKPTPPVTSPPPYSHFPSTPPANPNPPPTPSTPNSPSPSLSSPPSPPKYPPTAPSPPAPSAGTPSPPTVSPSTPPAT
TPSTPNSPSLSPPPTPSETPSSPTTNTPPPPSPTPSPPRSPPASPPRPPPTGVAGQPPAPLSPQSSSAGAAKRVRCKNANYPQCYNMIHTCPSACPGGCEVDCVTCKPVC
HCDRPGAVCQDPRFIGGDGITFYFHGKKDRNFCLVSNPNLHINAHLIGKRNQNLKRDFTWVQSLGILFGRYQLFIGAQERAAWDDSVDSLAVALNGQLVALPEAEGSQWQ
YPIENPSIFIVRLAPTNHVIVQAKGLFRITVKVVPITEEDSRIHNYGITKENSFAHLDVGFKFFSLSSGVSGVLGQTYSPEYTSRINVKATMPVMGREDEFETSSLFAAD
CAVARFGGGGSEEEA