; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC01G013070 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC01G013070
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionProtein of Unknown Function (DUF239)
Genome locationCiama_Chr01:25994376..25996978
RNA-Seq ExpressionCaUC01G013070
SyntenyCaUC01G013070
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004314 - Neprosin
IPR025521 - Neprosin activation peptide


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573073.1 hypothetical protein SDJN03_26960, partial [Cucurbita argyrosperma subsp. sororia]5.1e-21385.58Show/hide
Query:  SSSSSSSSSSFVVFLLLFTSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGHTPILE
        SSSS SS S FVV LL+FTS +SVFSTSI+HQ+P KNQTLFHPTKELKKLKHIR+YLRKINKPPIKTI+SSDGDVIDCVLSHLQPAFDHPELKGH+P LE
Subjt:  SSSSSSSSSSFVVFLLLFTSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGHTPILE

Query:  PGERPRGNNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYE
        P ERPR N SMEEVA+N QLWSASG+FCPEGTIPIRRTTEKDIFRA+S RRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYE
Subjt:  PGERPRGNNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYE

Query:  FSLSQIWVISGSFGNDLNTIEAGWQI---LYVLDSK-----FHTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKVNSFLIHS
        FSLSQIW+ISGSFGNDLNTIEAGWQ+   LY  ++      + TDAYQATGCYNLLCSGFVQTNNRIAIGAAISP+SSYRGKQFDIGLMVWK        
Subjt:  FSLSQIWVISGSFGNDLNTIEAGWQI---LYVLDSK-----FHTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKVNSFLIHS

Query:  FEFVEDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADH
             DPKHGHWWLEYGSG+LVGYWPAFLFSHLRSHASMVQFGGE+VN R+SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLH+LADH
Subjt:  FEFVEDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADH

Query:  SDCYDIRQATNNVWGTYFYYGGPGRNVKCP
        SDCYDIRQ +NNVWGTYFYYGGPGR V+CP
Subjt:  SDCYDIRQATNNVWGTYFYYGGPGRNVKCP

XP_022994621.1 uncharacterized protein LOC111490277 [Cucurbita maxima]8.7e-21385.48Show/hide
Query:  MASS-SSSSSSSSSFVVFLLLFTSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGHT
        MASS SSS SS S FVVFLL+FTS +SVFSTSI+HQ+P KNQTLFHPTKELKKLK+IR+YLRKINKPPIKTI+SSDGDVIDCVLSHLQPAFDHPELKGH+
Subjt:  MASS-SSSSSSSSSFVVFLLLFTSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGHT

Query:  PILEPGERPRGNNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT
        P LEP ERPR N SMEEVA+N QLWSASG+FCPEGTIPIRRTTEKDIFRA+S RRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT
Subjt:  PILEPGERPRGNNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT

Query:  DQYEFSLSQIWVISGSFGNDLNTIEAGWQI---LYVLDSK-----FHTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKVNSF
        DQYEFSLSQIW+ISGSFGNDLNTIEAGWQ+   LY  ++      + TDAYQATGCYNLLCSGFVQTNNRIAIGAAISP+SSYRGKQFDIGLMVWK    
Subjt:  DQYEFSLSQIWVISGSFGNDLNTIEAGWQI---LYVLDSK-----FHTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKVNSF

Query:  LIHSFEFVEDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHL
                 DPKHGHWWLEYGSG+LVGYWPAFLFSHLRSHASMVQFGGE+VN R+SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLH+
Subjt:  LIHSFEFVEDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHL

Query:  LADHSDCYDIRQATNNVWGTYFYYGGPGRNVKCP
        LADHSDCYDIRQ +NNVWGTYFYYGGPGR V+CP
Subjt:  LADHSDCYDIRQATNNVWGTYFYYGGPGRNVKCP

XP_023542314.1 uncharacterized protein LOC111802247 [Cucurbita pepo subsp. pepo]1.9e-21285.42Show/hide
Query:  ASSSSSSSSSSSFVVFLLLFTSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGHTPI
        +SSSSSSSS S FVV LL+FTS +S FSTSI+HQ+P KNQTLFHPTKELKKLK+IR+YLRKINKPPIKTI+SSDGDVIDCVLSHLQPAFDHPELKGH+P 
Subjt:  ASSSSSSSSSSSFVVFLLLFTSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGHTPI

Query:  LEPGERPRGNNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQ
        LEP ERPR N SMEEVA+N QLWSASG+FCPEGTIPIRRTTEKDIFRA+S RRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQ
Subjt:  LEPGERPRGNNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQ

Query:  YEFSLSQIWVISGSFGNDLNTIEAGWQI---LYVLDSK-----FHTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKVNSFLI
        YEFSLSQIWVISGSFGNDLNTIEAGWQ+   LY  ++      + TDAYQATGCYNLLCSGFVQTNNRIAIGAAISP+SSYRGKQFDIGLMVWK      
Subjt:  YEFSLSQIWVISGSFGNDLNTIEAGWQI---LYVLDSK-----FHTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKVNSFLI

Query:  HSFEFVEDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLA
               DPKHGHWWLEYGSG LVGYWPAFLFSHLRSHASMVQFGGE+VN R+SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLH+LA
Subjt:  HSFEFVEDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLA

Query:  DHSDCYDIRQATNNVWGTYFYYGGPGRNVKCP
        DHSDCYDIRQ +NNVWGTYFYYGGPGR V+CP
Subjt:  DHSDCYDIRQATNNVWGTYFYYGGPGRNVKCP

XP_038895687.1 uncharacterized protein LOC120083860 isoform X1 [Benincasa hispida]8.1e-21989.2Show/hide
Query:  MASSSSSSS-SSSSFVVFLLLF-TSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGH
        MASSSSSSS S S FVVFLL+F TSFSSVFS+SISHQIPPKNQT FHP KELKKLKHIR+YLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGH
Subjt:  MASSSSSSS-SSSSFVVFLLLF-TSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGH

Query:  TPILEPGERPRGNNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRV
        TP LEP ERPRGNNS EEVAEN QLWSASGDFCPEGTIPIRRTTE+DIFRASSFRRFGRKPIR VRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRV
Subjt:  TPILEPGERPRGNNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRV

Query:  TDQYEFSLSQIWVISGSFGNDLNTIEAGWQI---LYVLDSK-----FHTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKVNS
        TDQYEFSLSQIWVISGSFGNDLNTIEAGWQ+   LY  ++      + TDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSY GKQFDIGLMVWK   
Subjt:  TDQYEFSLSQIWVISGSFGNDLNTIEAGWQI---LYVLDSK-----FHTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKVNS

Query:  FLIHSFEFVEDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLH
                  DPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHT TQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLH
Subjt:  FLIHSFEFVEDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLH

Query:  LLADHSDCYDIRQATNNVWGTYFYYGGPGRNVKCP
        LLADHSDCYDIRQA NNVWGTYFYYGGPGRNVKCP
Subjt:  LLADHSDCYDIRQATNNVWGTYFYYGGPGRNVKCP

XP_038895688.1 uncharacterized protein LOC120083860 isoform X2 [Benincasa hispida]6.2e-21990.4Show/hide
Query:  MASSSSSSS-SSSSFVVFLLLF-TSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGH
        MASSSSSSS S S FVVFLL+F TSFSSVFS+SISHQIPPKNQT FHP KELKKLKHIR+YLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGH
Subjt:  MASSSSSSS-SSSSFVVFLLLF-TSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGH

Query:  TPILEPGERPRGNNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRV
        TP LEP ERPRGNNS EEVAEN QLWSASGDFCPEGTIPIRRTTE+DIFRASSFRRFGRKPIR VRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRV
Subjt:  TPILEPGERPRGNNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRV

Query:  TDQYEFSLSQIWVISGSFGNDLNTIEAGWQILYVLDSKFHTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKVNSFLIHSFEF
        TDQYEFSLSQIWVISGSFGNDLNTIEAGWQ          TDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSY GKQFDIGLMVWK           
Subjt:  TDQYEFSLSQIWVISGSFGNDLNTIEAGWQILYVLDSKFHTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKVNSFLIHSFEF

Query:  VEDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDC
          DPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHT TQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDC
Subjt:  VEDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDC

Query:  YDIRQATNNVWGTYFYYGGPGRNVKCP
        YDIRQA NNVWGTYFYYGGPGRNVKCP
Subjt:  YDIRQATNNVWGTYFYYGGPGRNVKCP

TrEMBL top hitse value%identityAlignment
A0A0A0LTK3 Uncharacterized protein7.5e-21085.09Show/hide
Query:  MASSSSSSSSS-SSFVVFLLLFTSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGHT
        MASSSSSSSSS S FVV LL+FTSFSSV S+SISHQIP KNQTLFHP KELKKLKHIR+YLRKINKPPIK I+SSDGDVIDCVLSHLQPAFDHP+LKGH+
Subjt:  MASSSSSSSSS-SSFVVFLLLFTSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGHT

Query:  PILEPGERPRGN-NSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRV
        P LEP ERPRGN NS EE  EN QLWS SG+FCPEGTIPIRRTTEKDI+RASS+RR+GRKPI+HV+RDSSGNGHEHAVV+VNGEQYYGAKASLNIWAPRV
Subjt:  PILEPGERPRGN-NSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRV

Query:  TDQYEFSLSQIWVISGSFGNDLNTIEAGWQI---LYVLDSK-----FHTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKVNS
        TDQYEFS+SQIWVISGSF NDLNTIEAGWQ+   LY  ++      + TDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWK   
Subjt:  TDQYEFSLSQIWVISGSFGNDLNTIEAGWQI---LYVLDSK-----FHTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKVNS

Query:  FLIHSFEFVEDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRS-SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNL
                  DPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGE+VNSRS SGFHT TQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNL
Subjt:  FLIHSFEFVEDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRS-SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNL

Query:  HLLADHSDCYDIRQATNNVWGTYFYYGGPGRNVKCP
         +LADHSDCYDIRQ TNNVWGTYFYYGGPGRNVKCP
Subjt:  HLLADHSDCYDIRQATNNVWGTYFYYGGPGRNVKCP

A0A1S3AXP9 uncharacterized protein LOC1034837239.7e-21084.16Show/hide
Query:  MASSSSSSS-------SSSSFVVFLLLFTSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHP
        MASSSSSSS       S S FVV LL+FTSFSSVFS+SISHQIP KNQT FHP +ELKKLKHIR+YLRKINKPPIKTI+SSDGDVIDCVLSHLQPAFDHP
Subjt:  MASSSSSSS-------SSSSFVVFLLLFTSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHP

Query:  ELKGHTPILEPGERPRG-NNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPI-RHVRRDSSGNGHEHAVVFVNGEQYYGAKASL
        +LKGHTP LEP ERPRG NNS+EE  EN QLWS SG+FCPEGTIPIRRTTEKDI+RASS+RR+GRKPI RHVRRDSSGNGHEHAVV+VNGEQYYGAKASL
Subjt:  ELKGHTPILEPGERPRG-NNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPI-RHVRRDSSGNGHEHAVVFVNGEQYYGAKASL

Query:  NIWAPRVTDQYEFSLSQIWVISGSFGNDLNTIEAGWQI---LYVLDSK-----FHTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGL
        NIWAPRVTDQYEFS+SQIWVISGSF NDLNTIEAGWQ+   LY  ++      + TDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGL
Subjt:  NIWAPRVTDQYEFSLSQIWVISGSFGNDLNTIEAGWQI---LYVLDSK-----FHTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGL

Query:  MVWKVNSFLIHSFEFVEDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNL
        MVWK             DPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGE+VNSRSSGFHT TQMGSGHFAEEGFGKASYFRNLQVVDWDNNL
Subjt:  MVWKVNSFLIHSFEFVEDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNL

Query:  LPLTNLHLLADHSDCYDIRQATNNVWGTYFYYGGPGRNVKCP
        LPLTNL +LADHSDCYDIRQ TN+VWGTYFYYGGPGRNVKCP
Subjt:  LPLTNLHLLADHSDCYDIRQATNNVWGTYFYYGGPGRNVKCP

A0A6J1GRA4 uncharacterized protein LOC1114568102.1e-21285.35Show/hide
Query:  SSSSSSSSSSFVVFLLLFTSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGHTPILE
        SSSS SS S FVV LL+FTS +SVFSTSI+HQ+P KNQTLFHPTKELKKLKHIR+YLRKINKPPIKTI+SSDGDVIDCVLSHLQPAFDHPELKGH+P LE
Subjt:  SSSSSSSSSSFVVFLLLFTSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGHTPILE

Query:  PGERPRGNNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYE
        P ERPR N SMEEVA+N QLWSASG+FCPEGTIPIRRTTEKDIFRA+S RRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYE
Subjt:  PGERPRGNNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYE

Query:  FSLSQIWVISGSFGNDLNTIEAGWQI---LYVLDSK-----FHTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKVNSFLIHS
        FSLSQIW+ISGSFGNDLNTIEAGWQ+   LY  ++      + TDAYQATGCYNLLCSGFVQTN+RIAIGAAISP+SSYRGKQFDIGLMVWK        
Subjt:  FSLSQIWVISGSFGNDLNTIEAGWQI---LYVLDSK-----FHTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKVNSFLIHS

Query:  FEFVEDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADH
             DPKHGHWWLEYGSG LVGYWPAFLFSHLRSHASMVQFGGE+VN R+SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLH+LADH
Subjt:  FEFVEDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADH

Query:  SDCYDIRQATNNVWGTYFYYGGPGRNVKCP
        SDCYDIRQ +NNVWGTYFYYGGPGR V+CP
Subjt:  SDCYDIRQATNNVWGTYFYYGGPGRNVKCP

A0A6J1JZN8 uncharacterized protein LOC1114902774.2e-21385.48Show/hide
Query:  MASS-SSSSSSSSSFVVFLLLFTSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGHT
        MASS SSS SS S FVVFLL+FTS +SVFSTSI+HQ+P KNQTLFHPTKELKKLK+IR+YLRKINKPPIKTI+SSDGDVIDCVLSHLQPAFDHPELKGH+
Subjt:  MASS-SSSSSSSSSFVVFLLLFTSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGHT

Query:  PILEPGERPRGNNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT
        P LEP ERPR N SMEEVA+N QLWSASG+FCPEGTIPIRRTTEKDIFRA+S RRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT
Subjt:  PILEPGERPRGNNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT

Query:  DQYEFSLSQIWVISGSFGNDLNTIEAGWQI---LYVLDSK-----FHTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKVNSF
        DQYEFSLSQIW+ISGSFGNDLNTIEAGWQ+   LY  ++      + TDAYQATGCYNLLCSGFVQTNNRIAIGAAISP+SSYRGKQFDIGLMVWK    
Subjt:  DQYEFSLSQIWVISGSFGNDLNTIEAGWQI---LYVLDSK-----FHTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKVNSF

Query:  LIHSFEFVEDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHL
                 DPKHGHWWLEYGSG+LVGYWPAFLFSHLRSHASMVQFGGE+VN R+SGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLH+
Subjt:  LIHSFEFVEDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHL

Query:  LADHSDCYDIRQATNNVWGTYFYYGGPGRNVKCP
        LADHSDCYDIRQ +NNVWGTYFYYGGPGR V+CP
Subjt:  LADHSDCYDIRQATNNVWGTYFYYGGPGRNVKCP

A0A6J1KJ26 uncharacterized protein LOC1114950545.7e-21084.71Show/hide
Query:  SSSSSFVVFLLLFTSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGHTPILEPGERP
        SSSS FVV LL+FTSFSSVF TSI+H+ PPKNQT FHP+KEL +LKHIR+YLRKINKPP KTI+SSDGDVIDCVLSHLQPAFDHP LKGHTP L P ERP
Subjt:  SSSSSFVVFLLLFTSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGHTPILEPGERP

Query:  RGNNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQ
        RGNNS EEVAEN QLWSASGDFCPEGTIPIRRTTE+DI+RASSFRRFGRKPIRH+RRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQ EFSLSQ
Subjt:  RGNNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQ

Query:  IWVISGSFGNDLNTIEAGWQI---LYVLDSK-----FHTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKVNSFLIHSFEFVE
        IWVISGSFGNDLNTIEAGWQ+   LY  ++      + TDAYQATGCYNLLCSGFVQTNNRIAIGAAISP+SSY GKQFD+G+MVWK             
Subjt:  IWVISGSFGNDLNTIEAGWQI---LYVLDSK-----FHTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKVNSFLIHSFEFVE

Query:  DPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYD
        DPKHGHWWLEYGSGLLVGYWPAFLFSHLRSH SMVQFGGEIVNSR SGFHTAT+MGSGHF EEGF KASYFRNLQVVDWDNNLLPLTNLH+LADHSDCYD
Subjt:  DPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYD

Query:  IRQATNNVWGTYFYYGGPGRNVKCP
        IRQ TN+VWGTYFYYGGPGRNVKCP
Subjt:  IRQATNNVWGTYFYYGGPGRNVKCP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10750.1 Protein of Unknown Function (DUF239)9.1e-16065.41Show/hide
Query:  SSSSSFVVFLLLFTSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGHTPILEPGERP
        SS+  F+  LLL +SFSSV S ++S    P+NQTL  P  EL KLK I  +LRKINKP IKTI S DGD+IDCVL H QPAFDHP L+G  P L+P ERP
Subjt:  SSSSSFVVFLLLFTSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGHTPILEPGERP

Query:  RGNNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQ
        RG+N      ++ QLW   G+ CPEGT+PIRRT E+DI RA+S   FG+K +RH RRD+S NGHEHAV +V+GE+YYGAKAS+N+WAP+V +QYEFSLSQ
Subjt:  RGNNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQ

Query:  IWVISGSFGNDLNTIEAGWQILYVLDS----KFHT----DAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKVNSFLIHSFEFVE
        IW+ISGSFGNDLNTIEAGWQ+   L      +F T    DAYQATGCYNLLCSGFVQTN+ IAIGAAISP SSY+G QFDI L++WK             
Subjt:  IWVISGSFGNDLNTIEAGWQILYVLDS----KFHT----DAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKVNSFLIHSFEFVE

Query:  DPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYD
        DPKHG+WWLE+GSG+LVGYWP+FLF+HL+ HASMVQ+GGEIVNS   G HT+TQMGSGHFAEEGF K+SYFRN+QVVDWDNNL+P  NL +LADH +CYD
Subjt:  DPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYD

Query:  IRQATNNVWGTYFYYGGPGRNVKCP
        I+  +N  WG+YFYYGGPG+N KCP
Subjt:  IRQATNNVWGTYFYYGGPGRNVKCP

AT1G23340.1 Protein of Unknown Function (DUF239)5.9e-15965.12Show/hide
Query:  SSSSSSSSSSFVVFLLLFTSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGHTPILE
        SSSSS    +F++ L LF+S++S  S S S  +P        P +E++K+K IR  L+KINKP IKTI SSDGD IDCV SH QPAFDHP L+G  P ++
Subjt:  SSSSSSSSSSFVVFLLLFTSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGHTPILE

Query:  PGERPRGNNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYE
        P E P G +   E  EN QLWS  G+ CPEGTIPIRRTTE+D+ RA+S RRFGRK IR VRRDSS NGHEHAV +V+G QYYGAKAS+N+W PRV  QYE
Subjt:  PGERPRGNNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYE

Query:  FSLSQIWVISGSFGNDLNTIEAGWQILYVL----DSKFHT----DAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKVNSFLIHS
        FSLSQIW+I+GSF  DLNTIEAGWQI   L    + +F T    DAYQATGCYNLLCSGFVQTNNRIAIGAAISP+SSY+G QFDI L++WK        
Subjt:  FSLSQIWVISGSFGNDLNTIEAGWQILYVL----DSKFHT----DAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKVNSFLIHS

Query:  FEFVEDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADH
             DPKHGHWWL++GSG LVGYWP  LF+HLR H +MVQFGGEIVN+R  G HT+TQMGSGHFA EGFGKASYFRNLQ+VDWDN L+P++NL +LADH
Subjt:  FEFVEDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADH

Query:  SDCYDIRQATNNVWGTYFYYGGPGRNVKCP
         +CYDIR   N VWG +FYYGGPG+N KCP
Subjt:  SDCYDIRQATNNVWGTYFYYGGPGRNVKCP

AT1G23340.2 Protein of Unknown Function (DUF239)5.9e-15965.12Show/hide
Query:  SSSSSSSSSSFVVFLLLFTSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGHTPILE
        SSSSS    +F++ L LF+S++S  S S S  +P        P +E++K+K IR  L+KINKP IKTI SSDGD IDCV SH QPAFDHP L+G  P ++
Subjt:  SSSSSSSSSSFVVFLLLFTSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGHTPILE

Query:  PGERPRGNNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYE
        P E P G +   E  EN QLWS  G+ CPEGTIPIRRTTE+D+ RA+S RRFGRK IR VRRDSS NGHEHAV +V+G QYYGAKAS+N+W PRV  QYE
Subjt:  PGERPRGNNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYE

Query:  FSLSQIWVISGSFGNDLNTIEAGWQILYVL----DSKFHT----DAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKVNSFLIHS
        FSLSQIW+I+GSF  DLNTIEAGWQI   L    + +F T    DAYQATGCYNLLCSGFVQTNNRIAIGAAISP+SSY+G QFDI L++WK        
Subjt:  FSLSQIWVISGSFGNDLNTIEAGWQILYVL----DSKFHT----DAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKVNSFLIHS

Query:  FEFVEDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADH
             DPKHGHWWL++GSG LVGYWP  LF+HLR H +MVQFGGEIVN+R  G HT+TQMGSGHFA EGFGKASYFRNLQ+VDWDN L+P++NL +LADH
Subjt:  FEFVEDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADH

Query:  SDCYDIRQATNNVWGTYFYYGGPGRNVKCP
         +CYDIR   N VWG +FYYGGPG+N KCP
Subjt:  SDCYDIRQATNNVWGTYFYYGGPGRNVKCP

AT1G70550.2 Protein of Unknown Function (DUF239)8.5e-15865.25Show/hide
Query:  SSSFVVFLLLFTSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGHTPILEPGERPRG
        SSSF+  +LL    SS FS++ S            P +EL+KL  IR  L KINKP +KTI+SSDGD IDCV +H QPAFDHP L+G  P L+P E P+G
Subjt:  SSSFVVFLLLFTSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGHTPILEPGERPRG

Query:  NNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIW
         +  +   EN QLWS SG+ CPEGTIPIRRTTE+D+ RASS +RFGRK IR V+RDS+ NGHEHAV +V G QYYGAKAS+N+W+PRVT QYEFSLSQIW
Subjt:  NNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIW

Query:  VISGSFGNDLNTIEAGWQILYVLDS----KFHT----DAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKVNSFLIHSFEFVEDP
        VI+GSF +DLNTIEAGWQI   L      +F T    DAY+ TGCYNLLCSGFVQTN RIAIGAAISP SSY+G QFDI L++WK             DP
Subjt:  VISGSFGNDLNTIEAGWQILYVLDS----KFHT----DAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKVNSFLIHSFEFVEDP

Query:  KHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIR
        KHGHWWL++GSG LVGYWPAFLF+HL+ H SMVQFGGEIVN+R  G HT TQMGSGHFA EGFGKASYFRNLQ+VDWDN L+P +NL +LADH +CYDIR
Subjt:  KHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIR

Query:  QATNNVWGTYFYYGGPGRNVKCP
          TN VWG YFYYGGPG+N +CP
Subjt:  QATNNVWGTYFYYGGPGRNVKCP

AT5G50150.1 Protein of Unknown Function (DUF239)4.0e-17169.12Show/hide
Query:  MASSSSSSSSSSSFV-VFLLLFTSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGHT
        MASSSSSSS++S+    F+ L    S      +   I  KNQT F P +E++KL+ + +YL KINKP IKTI S DGDVI+CV SHLQPAFDHP+L+G  
Subjt:  MASSSSSSSSSSSFV-VFLLLFTSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGHT

Query:  PILEPGERPRGNNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT
        P+  P    +GN +  E + N QLWS SG+ CP G+IPIR+TT+ D+ RA+S RRFGRK  R +RRDSSG GHEHAVVFVNGEQYYGAKAS+N+WAPRVT
Subjt:  PILEPGERPRGNNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVT

Query:  DQYEFSLSQIWVISGSFGNDLNTIEAGWQI---LY-----VLDSKFHTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKVNSF
        D YEFSLSQIW+ISGSFG+DLNTIEAGWQ+   LY        + + TDAYQATGCYNLLCSGFVQTNN+IAIGAAISP SSY G+QFDIGLM+WK    
Subjt:  DQYEFSLSQIWVISGSFGNDLNTIEAGWQI---LY-----VLDSKFHTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKVNSF

Query:  LIHSFEFVEDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHL
                 DPKHGHWWLE G+GLLVGYWPAFLFSHLRSHASMVQFGGE+VNSRSSG HT TQMGSGHFA+EGF KA+YFRNLQVVDWDNNLLPL NLH+
Subjt:  LIHSFEFVEDPKHGHWWLEYGSGLLVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHL

Query:  LADHSDCYDIRQATNNVWGTYFYYGGPGRNVKCP
        LADH  CYDIRQ  NNVWGTYFYYGGPGRN +CP
Subjt:  LADHSDCYDIRQATNNVWGTYFYYGGPGRNVKCP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTTTGTTGTTTTCCTTCTGCTTTTTACTTCTTTCTCCTCTGTTTTCTCAACTTCCATTTCCCATCAA
ATCCCACCAAAAAACCAAACTCTTTTCCATCCAACCAAAGAGCTGAAGAAACTAAAACACATCAGATCTTATTTACGCAAAATCAACAAGCCTCCAATCAAGACA
ATTCGGAGTTCAGATGGTGATGTTATAGACTGTGTTCTTTCTCATCTTCAGCCTGCTTTTGACCATCCTGAGCTCAAAGGGCATACTCCAATATTGGAACCAGGT
GAGAGGCCAAGAGGGAACAACTCCATGGAAGAAGTAGCAGAAAATTTGCAATTATGGTCAGCTTCAGGGGATTTTTGCCCAGAAGGAACAATTCCAATAAGAAGA
ACAACAGAAAAAGACATTTTCAGAGCAAGTTCTTTTCGAAGATTTGGAAGAAAACCCATTAGACATGTGAGAAGAGACTCTTCTGGCAATGGCCATGAGCATGCT
GTTGTATTTGTGAATGGAGAACAATATTATGGAGCAAAGGCGAGTTTAAACATATGGGCGCCACGTGTAACGGATCAATATGAATTTAGTTTATCACAAATATGG
GTAATTTCAGGGTCTTTTGGGAATGATTTGAACACCATTGAAGCTGGATGGCAGATTCTTTACGTATTGGACAGTAAGTTTCATACCGATGCTTATCAAGCGACT
GGGTGTTATAATTTACTTTGTTCTGGGTTCGTTCAAACCAACAATAGGATCGCCATTGGAGCAGCAATCTCGCCTATATCTTCTTATAGAGGAAAGCAATTCGAT
ATTGGTTTAATGGTTTGGAAGGTAAATTCTTTTTTAATTCATTCATTCGAGTTCGTGGAAGATCCGAAGCACGGGCATTGGTGGTTGGAATACGGTTCGGGTTTG
CTAGTCGGGTATTGGCCAGCATTTCTGTTCAGCCATTTAAGAAGTCATGCTAGCATGGTACAATTTGGAGGAGAAATAGTGAACAGCAGATCATCAGGGTTTCAC
ACCGCCACACAGATGGGGAGTGGTCATTTTGCTGAAGAAGGGTTTGGAAAAGCTTCTTATTTCAGGAACTTGCAAGTGGTTGATTGGGATAATAATTTGCTTCCT
CTTACAAATCTTCATCTCTTGGCTGACCATTCTGATTGTTATGATATTAGACAAGCCACTAATAATGTTTGGGGCACTTATTTTTACTATGGAGGGCCTGGTAGA
AATGTCAAATGCCCTTAG
mRNA sequenceShow/hide mRNA sequence
AAAACATTCTAACAAAATGGCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTTTGTTGTTTTCCTTCTGCTTTTTACTTCTTTCTCCTCTGTTTTCTCAAC
TTCCATTTCCCATCAAATCCCACCAAAAAACCAAACTCTTTTCCATCCAACCAAAGAGCTGAAGAAACTAAAACACATCAGATCTTATTTACGCAAAATCAACAA
GCCTCCAATCAAGACAATTCGGAGTTCAGATGGTGATGTTATAGACTGTGTTCTTTCTCATCTTCAGCCTGCTTTTGACCATCCTGAGCTCAAAGGGCATACTCC
AATATTGGAACCAGGTGAGAGGCCAAGAGGGAACAACTCCATGGAAGAAGTAGCAGAAAATTTGCAATTATGGTCAGCTTCAGGGGATTTTTGCCCAGAAGGAAC
AATTCCAATAAGAAGAACAACAGAAAAAGACATTTTCAGAGCAAGTTCTTTTCGAAGATTTGGAAGAAAACCCATTAGACATGTGAGAAGAGACTCTTCTGGCAA
TGGCCATGAGCATGCTGTTGTATTTGTGAATGGAGAACAATATTATGGAGCAAAGGCGAGTTTAAACATATGGGCGCCACGTGTAACGGATCAATATGAATTTAG
TTTATCACAAATATGGGTAATTTCAGGGTCTTTTGGGAATGATTTGAACACCATTGAAGCTGGATGGCAGATTCTTTACGTATTGGACAGTAAGTTTCATACCGA
TGCTTATCAAGCGACTGGGTGTTATAATTTACTTTGTTCTGGGTTCGTTCAAACCAACAATAGGATCGCCATTGGAGCAGCAATCTCGCCTATATCTTCTTATAG
AGGAAAGCAATTCGATATTGGTTTAATGGTTTGGAAGGTAAATTCTTTTTTAATTCATTCATTCGAGTTCGTGGAAGATCCGAAGCACGGGCATTGGTGGTTGGA
ATACGGTTCGGGTTTGCTAGTCGGGTATTGGCCAGCATTTCTGTTCAGCCATTTAAGAAGTCATGCTAGCATGGTACAATTTGGAGGAGAAATAGTGAACAGCAG
ATCATCAGGGTTTCACACCGCCACACAGATGGGGAGTGGTCATTTTGCTGAAGAAGGGTTTGGAAAAGCTTCTTATTTCAGGAACTTGCAAGTGGTTGATTGGGA
TAATAATTTGCTTCCTCTTACAAATCTTCATCTCTTGGCTGACCATTCTGATTGTTATGATATTAGACAAGCCACTAATAATGTTTGGGGCACTTATTTTTACTA
TGGAGGGCCTGGTAGAAATGTCAAATGCCCTTAGAATTATTTCTTCTTTTTTATTATTATTATTATTATTATTATT
Protein sequenceShow/hide protein sequence
MASSSSSSSSSSSFVVFLLLFTSFSSVFSTSISHQIPPKNQTLFHPTKELKKLKHIRSYLRKINKPPIKTIRSSDGDVIDCVLSHLQPAFDHPELKGHTPILEPG
ERPRGNNSMEEVAENLQLWSASGDFCPEGTIPIRRTTEKDIFRASSFRRFGRKPIRHVRRDSSGNGHEHAVVFVNGEQYYGAKASLNIWAPRVTDQYEFSLSQIW
VISGSFGNDLNTIEAGWQILYVLDSKFHTDAYQATGCYNLLCSGFVQTNNRIAIGAAISPISSYRGKQFDIGLMVWKVNSFLIHSFEFVEDPKHGHWWLEYGSGL
LVGYWPAFLFSHLRSHASMVQFGGEIVNSRSSGFHTATQMGSGHFAEEGFGKASYFRNLQVVDWDNNLLPLTNLHLLADHSDCYDIRQATNNVWGTYFYYGGPGR
NVKCP