CuGenDBv2

Carg19744 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg19744
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: GPI-anchored adhesin-like protein
Genome location: Carg_Chr15:10218036..10219854
RNA-Seq Expression: Carg19744
Synteny: Carg19744
Gene Ontology terms: NA
InterPro domains: NA
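The genome location field follows a `sequence:start..end` pattern. A minimal Python sketch for splitting it into parts is given here. The names `Location` and `parse_location` are hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed 'seqid:start..end' location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'Carg_Chr15:10218036..10219854'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Location(seqid, start, end)


loc = parse_location("Carg_Chr15:10218036..10219854")
print(loc.seqid, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the span for this gene comes out at 1819 bp.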


Homology
GenBank top hits (e value, % identity, alignment)
KAG7017172.1 hypothetical protein SDJN02_22284, partial [Cucurbita argyrosperma subsp. argyrosperma]; e value: 0.0e+00; % identity: 100
Query:  MRPSSSSSSSPSSQIHAANNKLKQQQHHQCPRGPFRVLNGISFSPACNTASVGSDASSTSTDAPRGCLRFFLSHSSSSSKTQTPANKLKSSSKNPKSTSH
        MRPSSSSSSSPSSQIHAANNKLKQQQHHQCPRGPFRVLNGISFSPACNTASVGSDASSTSTDAPRGCLRFFLSHSSSSSKTQTPANKLKSSSKNPKSTSH
Subjt:  MRPSSSSSSSPSSQIHAANNKLKQQQHHQCPRGPFRVLNGISFSPACNTASVGSDASSTSTDAPRGCLRFFLSHSSSSSKTQTPANKLKSSSKNPKSTSH

Query:  VRPLKPLRSKPLKENAPKRALTHNSRAPAEPSSTKLEPLKKNSPSLYRWPSGKKPCSLGTHKPKILASDGEELERHGAHSVVRMVDDPNKCEPSDFNFTP
        VRPLKPLRSKPLKENAPKRALTHNSRAPAEPSSTKLEPLKKNSPSLYRWPSGKKPCSLGTHKPKILASDGEELERHGAHSVVRMVDDPNKCEPSDFNFTP
Subjt:  VRPLKPLRSKPLKENAPKRALTHNSRAPAEPSSTKLEPLKKNSPSLYRWPSGKKPCSLGTHKPKILASDGEELERHGAHSVVRMVDDPNKCEPSDFNFTP

Query:  LREIENGSGLDPTADKVVALEASNKDHTKTPPVQASVSPELQCGSALMPTLTPVCYGAGYVVSGISDKRKCRPRGVLIVGDNTPSISNVKPIQNFEEEDG
        LREIENGSGLDPTADKVVALEASNKDHTKTPPVQASVSPELQCGSALMPTLTPVCYGAGYVVSGISDKRKCRPRGVLIVGDNTPSISNVKPIQNFEEEDG
Subjt:  LREIENGSGLDPTADKVVALEASNKDHTKTPPVQASVSPELQCGSALMPTLTPVCYGAGYVVSGISDKRKCRPRGVLIVGDNTPSISNVKPIQNFEEEDG

Query:  SNKRETSNSEDEDHQDKSENASSHFEGFLEPLSFEDISPSCAPNCLDVILTEGRGQPRYEVNGENSPFSIDSLSSDNVIRTPQSDSNSAPKHFLPWLTAD
        SNKRETSNSEDEDHQDKSENASSHFEGFLEPLSFEDISPSCAPNCLDVILTEGRGQPRYEVNGENSPFSIDSLSSDNVIRTPQSDSNSAPKHFLPWLTAD
Subjt:  SNKRETSNSEDEDHQDKSENASSHFEGFLEPLSFEDISPSCAPNCLDVILTEGRGQPRYEVNGENSPFSIDSLSSDNVIRTPQSDSNSAPKHFLPWLTAD

Query:  SCDKHDQDSASDSHKAMTSITDLSFQFDCLATISNSMDLNQFQKLLEDQAFRNSNSSCENLSTSQMRVSWREGLMSRIYEMDEFDSCRCLSDDEEENADT
        SCDKHDQDSASDSHKAMTSITDLSFQFDCLATISNSMDLNQFQKLLEDQAFRNSNSSCENLSTSQMRVSWREGLMSRIYEMDEFDSCRCLSDDEEENADT
Subjt:  SCDKHDQDSASDSHKAMTSITDLSFQFDCLATISNSMDLNQFQKLLEDQAFRNSNSSCENLSTSQMRVSWREGLMSRIYEMDEFDSCRCLSDDEEENADT

Query:  CSISLSDILKKTPLKHNDCEADPIICNSSCSSRLLVNEEAEEYEKSNEVACCCAESISTDGGGLMASGDSDWHLCYKNGLFDC
        CSISLSDILKKTPLKHNDCEADPIICNSSCSSRLLVNEEAEEYEKSNEVACCCAESISTDGGGLMASGDSDWHLCYKNGLFDC
Subjt:  CSISLSDILKKTPLKHNDCEADPIICNSSCSSRLLVNEEAEEYEKSNEVACCCAESISTDGGGLMASGDSDWHLCYKNGLFDC

XP_022928860.1 uncharacterized protein LOC111435650 [Cucurbita moschata]; e value: 0.0e+00; % identity: 91.71
Query:  MRP-SSSSSSSPSSQIHAANNKLK---------QQQHHQCPRGPFRVLNGISFSPACNTASVGSDASSTSTDAPRGCLRFFLSHSSSSSKTQTPANKLKS
        MRP SSSSSSSPSSQIHAAN KLK         QQQ HQ PRGPFRVLNGISFSPACNTASVGSDASSTSTDAPRGCLRFFLSHSSSSSKTQTPANKLKS
Subjt:  MRP-SSSSSSSPSSQIHAANNKLK---------QQQHHQCPRGPFRVLNGISFSPACNTASVGSDASSTSTDAPRGCLRFFLSHSSSSSKTQTPANKLKS

Query:  SSKNPKSTSHVRPLKPLRSKPLKENAPKRALTHNSRAPAEPSSTKLEPLKKNSPSLYRWPSGKKPCSLGTHKPKILASDGEELERHGAHSVVRMVDDPNK
        SSKNPKSTSHVRPLKPLRSKPLKENAPKRALTHNSRAPA+P STKLEPLKKNSPSLYRWPSGKKPCSLG HKPKILASDGEELERHGAH VVRMVDD NK
Subjt:  SSKNPKSTSHVRPLKPLRSKPLKENAPKRALTHNSRAPAEPSSTKLEPLKKNSPSLYRWPSGKKPCSLGTHKPKILASDGEELERHGAHSVVRMVDDPNK

Query:  CEPSDFNFTPLREIENGSGLDPTADKVVALEASNKDHTKTPPVQASVSPELQCGSALMPTLTPVCYGAGYVVSGISDKRKCRPRGVLIVGDNTPSISNVK
        CEPSDF+FTP+REI+NGSGLDPTA KVVALEASNKDHTKTPPVQASVSPELQCGSALMPTLTPVCYGAGYVVSGISDKRKCRPRGVLIVG+NTPSISNVK
Subjt:  CEPSDFNFTPLREIENGSGLDPTADKVVALEASNKDHTKTPPVQASVSPELQCGSALMPTLTPVCYGAGYVVSGISDKRKCRPRGVLIVGDNTPSISNVK

Query:  PIQNFEEEDGSNKRETSNS----------------------EDEDHQDKSENASSHFEGFLEPLSFEDISPSCAPNCLDVILTEGRGQPRYEVNGENSPF
        PIQNFEE DGSN+RETSNS                      EDEDHQDKSENASSHFEGFLEPLSFEDISPSCAPNCLDVILTEGRGQPRYEVNGENSPF
Subjt:  PIQNFEEEDGSNKRETSNS----------------------EDEDHQDKSENASSHFEGFLEPLSFEDISPSCAPNCLDVILTEGRGQPRYEVNGENSPF

Query:  SIDSLSSDNVIRTPQSDSNSAPKHFLPWLTADSCDKHDQDSASDSHKAMTSITDLSFQFDCLATISNSMDLNQFQKLLEDQAFRNSNSSCENLSTSQMRV
        SIDSLSSDNVIRTPQSDSNSAPKHF PWLTADSCDKHDQ+SASDSHKAMTSITDLSFQFDCLATISNSMDLNQFQKLLEDQAFRNSNSSCENLS SQMRV
Subjt:  SIDSLSSDNVIRTPQSDSNSAPKHFLPWLTADSCDKHDQDSASDSHKAMTSITDLSFQFDCLATISNSMDLNQFQKLLEDQAFRNSNSSCENLSTSQMRV

Query:  SWREGLMSRIYEMDEFDSCRCLSDDEEENADTCSISLSDILKKTPLKHNDCEADPIICNSSCSSRLLVNEEAEEYEKSNEVACCCAESISTDGGGLMASG
        SWREGLMSRIYEMDEFDSCRCLSDDEEENADTCSI+LSDILKKTPLKHNDCEADPIICNSSCSSRLLVNEEAEEYEKSNEVACCCAESISTDGGGLMASG
Subjt:  SWREGLMSRIYEMDEFDSCRCLSDDEEENADTCSISLSDILKKTPLKHNDCEADPIICNSSCSSRLLVNEEAEEYEKSNEVACCCAESISTDGGGLMASG

Query:  DSDWHLCYKNGLFDC
        DSDWHLCYKNGLFDC
Subjt:  DSDWHLCYKNGLFDC

XP_022969790.1 uncharacterized protein LOC111468888 [Cucurbita maxima]; e value: 0.0e+00; % identity: 92.62
Query:  MRP-SSSSSSSPSSQIHAANNKLK----QQQHHQCPRGPFRVLNGISFSPACNTASVGSDASSTSTDAPRGCLRFFLSHSSSSSKTQTPANKLKSSSKNP
        MRP SSSSSSSPSSQIHA N KLK    QQ HHQ PR PFRVLNGISFSPACNTASVGSDASSTSTDAPRGCLRFFLSHSSSSSKTQTPANKLKSSSKNP
Subjt:  MRP-SSSSSSSPSSQIHAANNKLK----QQQHHQCPRGPFRVLNGISFSPACNTASVGSDASSTSTDAPRGCLRFFLSHSSSSSKTQTPANKLKSSSKNP

Query:  KSTSHVRPLKPLRSKPLKENAPKRALTHNSRAPAEPSSTKLEPLKKNSPSLYRWPSGKKPCSLGTHKPKILASDGEELERHGAHSVVRMVDDPNKCEPSD
        KSTSHVRPLKPLRSKPLKENAPKRALTHNSRAPA+PSSTKLEPLKKNSPSLYRWPSGKKPCSLGTHKPKILASDG+ELER GAHSVVRMVDD NKC+PSD
Subjt:  KSTSHVRPLKPLRSKPLKENAPKRALTHNSRAPAEPSSTKLEPLKKNSPSLYRWPSGKKPCSLGTHKPKILASDGEELERHGAHSVVRMVDDPNKCEPSD

Query:  FNFTPLREIENGSGLDPTADKVVALEASNKDHTKTPPVQASVSPELQCGSALMPTLTPVCYGAGYVVSGISDKRKCRPRGVLIVGDNTPSISNVKPIQNF
        FNFTP+REIENGSGLDPTADKVVALEASNKDHTKTPPVQASVSPELQCGSAL+PTLTPVCYGAGYVVSGISDKRKCRPRGVLIVGDNTPSISNVKPIQNF
Subjt:  FNFTPLREIENGSGLDPTADKVVALEASNKDHTKTPPVQASVSPELQCGSALMPTLTPVCYGAGYVVSGISDKRKCRPRGVLIVGDNTPSISNVKPIQNF

Query:  EEEDGSNKRETSNS----------------------EDEDHQDKSENASSHFEGFLEPLSFEDISPSCAPNCLDVILTEGRGQPRYEVNGENSPFSIDSL
        EEEDGSNKRETSNS                      EDED QDKSENASSHFEGFLEPLSFEDISPSCAPNCLDVILTEGRGQPRYEVNGENSPFSIDSL
Subjt:  EEEDGSNKRETSNS----------------------EDEDHQDKSENASSHFEGFLEPLSFEDISPSCAPNCLDVILTEGRGQPRYEVNGENSPFSIDSL

Query:  SSDNVIRTPQSDSNSAPKHFLPWLTADSCDKHDQDSASDSHKAMTSITDLSFQFDCLATISNSMDLNQFQKLLEDQAFRNSNSSCENLSTSQMRVSWREG
        SSDNVIRTPQSDSNSAPKHF PWLTADSC KHDQ+SASDSHKAMTSITDLSFQFDCLATISNSMDLNQFQKLLEDQAFRNSNSSCENLS SQMRVSWREG
Subjt:  SSDNVIRTPQSDSNSAPKHFLPWLTADSCDKHDQDSASDSHKAMTSITDLSFQFDCLATISNSMDLNQFQKLLEDQAFRNSNSSCENLSTSQMRVSWREG

Query:  LMSRIYEMDEFDSCRCLSDDEEENADTCSISLSDILKKTPLKHNDCEADPIICNSSCSSRLLVNEEAEEYEKSNEVACCCAESISTDGGGLMASGDSDWH
        LMSRIYEMDEFDSCRCLSD+EEENADTCSISLSDILKKTPLKHNDCEADPIICNSSCSSRLLVNEEAEEYEKSNEVACCCAESISTDGGGLMASGDSDWH
Subjt:  LMSRIYEMDEFDSCRCLSDDEEENADTCSISLSDILKKTPLKHNDCEADPIICNSSCSSRLLVNEEAEEYEKSNEVACCCAESISTDGGGLMASGDSDWH

Query:  LCYKNGLFDC
        LCYKNGLFDC
Subjt:  LCYKNGLFDC

XP_023550090.1 uncharacterized protein LOC111808388 [Cucurbita pepo subsp. pepo]; e value: 0.0e+00; % identity: 95.05
Query:  MRPSSSSSSSPSSQIHAANNKLK-QQQHHQCPRGPFRVLNGISFSPACNTASVGSDASSTSTDAPRGCLRFFLSHSSSSSKTQTPANKLKSSSKNPKSTS
        MRPSSSSSSSPSSQIHAAN KLK QQQ HQ PRGPFRVLNGISFSPACNTASVGSDASSTSTDAPRGCLRFFLSHSSSSSKTQTPANKLKSSSKNPKSTS
Subjt:  MRPSSSSSSSPSSQIHAANNKLK-QQQHHQCPRGPFRVLNGISFSPACNTASVGSDASSTSTDAPRGCLRFFLSHSSSSSKTQTPANKLKSSSKNPKSTS

Query:  HVRPLKPLRSKPLKENAPKRALTHNSRAPAEPSSTKLEPLKKNSPSLYRWPSGKKPCSLGTHKPKILASDGEELERHGAHSVVRMVDDPNKCEPSDFNFT
        HVRPLKPLRSKPLKENAPKRALTHNSRAPAEPSSTKLEPLKKNSPSLYRWPSGKKPCSLGTHKPKILASDGEELERHGAHSVVRMVDDPNKCEPSDFNFT
Subjt:  HVRPLKPLRSKPLKENAPKRALTHNSRAPAEPSSTKLEPLKKNSPSLYRWPSGKKPCSLGTHKPKILASDGEELERHGAHSVVRMVDDPNKCEPSDFNFT

Query:  PLREIENGSGLDPTADKVVALEASNKDHTKTPPVQASVSPELQCGSALMPTLTPVCYGAGYVVSGISDKRKCRPRGVLIVGDNTPSISNVKPIQNFEEED
        PLREIENGSGLDPTADKVVALEASNKDHTKTPPVQASVSPELQCGSALMPTLTPVCYGAGYVVSGISDKRKCRP+GVLIVGDNTPSISNVKPIQNFEEED
Subjt:  PLREIENGSGLDPTADKVVALEASNKDHTKTPPVQASVSPELQCGSALMPTLTPVCYGAGYVVSGISDKRKCRPRGVLIVGDNTPSISNVKPIQNFEEED

Query:  GSNKRETSNS----------------------EDEDHQDKSENASSHFEGFLEPLSFEDISPSCAPNCLDVILTEGRGQPRYEVNGENSPFSIDSLSSDN
        GSNKRETSNS                      EDEDHQDKSENASSHFEGFLEPLSFEDISPSCAPNCLDVILTEGRGQPRYEVNGENSPFSIDSLSSDN
Subjt:  GSNKRETSNS----------------------EDEDHQDKSENASSHFEGFLEPLSFEDISPSCAPNCLDVILTEGRGQPRYEVNGENSPFSIDSLSSDN

Query:  VIRTPQSDSNSAPKHFLPWLTADSCDKHDQDSASDSHKAMTSITDLSFQFDCLATISNSMDLNQFQKLLEDQAFRNSNSSCENLSTSQMRVSWREGLMSR
        VIRTPQSDS+SAPKHFLPWLTADSCDKHDQDSASDSHKAMTSITDLSFQFDCLATISNSMDLNQFQKLLEDQAFRNSNSSCENLS SQMRVSWREGLMSR
Subjt:  VIRTPQSDSNSAPKHFLPWLTADSCDKHDQDSASDSHKAMTSITDLSFQFDCLATISNSMDLNQFQKLLEDQAFRNSNSSCENLSTSQMRVSWREGLMSR

Query:  IYEMDEFDSCRCLSDDEEENADTCSISLSDILKKTPLKHNDCEADPIICNSSCSSRLLVNEEAEEYEKSNEVACCCAESISTDGGGLMASGDSDWHLCYK
        IYEMDEFDSCRCLSDDEEENADTCSISLSDILKKTPLKHNDCEADPIICNSSCSSRLLVNEEAEEYEKSNEVACCCAESISTDGGGLMASGDSDWHLCY+
Subjt:  IYEMDEFDSCRCLSDDEEENADTCSISLSDILKKTPLKHNDCEADPIICNSSCSSRLLVNEEAEEYEKSNEVACCCAESISTDGGGLMASGDSDWHLCYK

Query:  NGLFDC
        NGLFDC
Subjt:  NGLFDC

XP_038906910.1 uncharacterized protein LOC120092781 [Benincasa hispida]; e value: 2.7e-228; % identity: 69.47
Query:  SSSSPSSQIHAANNKLK-QQQHHQCPRGPFRVLNGISFSPACNTASVGSDAS--STSTDAPRGCLRFFLSHSSSSSKTQTPANKLKSSSKNPKSTSHVRP
        SSSS SSQIHA NNK K +QQ  Q  R PF VLNGISFS ACN++S+ SDAS  STST+APRGCLRFFLSHS++SSKT  PANK K SSK PKSTS++RP
Subjt:  SSSSPSSQIHAANNKLK-QQQHHQCPRGPFRVLNGISFSPACNTASVGSDAS--STSTDAPRGCLRFFLSHSSSSSKTQTPANKLKSSSKNPKSTSHVRP

Query:  LKPLRSKPLKENAPKRALTHNSRAPAEPSSTKLEPLKKNSPSLYRWPSGKKPCSLGTHKPKILASDGEELERHGAHSVVRMVDDPNKCE-------PSDF
        +KPLRSKPLKENAPKRA+ H SRA A+PSSTKL+PLKKNSP LYRWPSGKKPCSLGTHK K+LAS GEELE++G H VVRMVDD  KCE       PSDF
Subjt:  LKPLRSKPLKENAPKRALTHNSRAPAEPSSTKLEPLKKNSPSLYRWPSGKKPCSLGTHKPKILASDGEELERHGAHSVVRMVDDPNKCE-------PSDF

Query:  NFTPLREIENG-SGLDPTADKVVALEASNKDHTKTPPVQASVSPELQCGSALMPTLTPVCYGAGYVVSGISDKRKCRPRGVLIVGDNTPSISNVKPIQNF
        NFTP+R++E G SGLDPT DKV ALE SN D +KTPPVQASVSPELQCGSA+MPT+TP+CYGAGYVVSG+SDKRKCRPRG+LIVGDNT SIS VKPIQ F
Subjt:  NFTPLREIENG-SGLDPTADKVVALEASNKDHTKTPPVQASVSPELQCGSALMPTLTPVCYGAGYVVSGISDKRKCRPRGVLIVGDNTPSISNVKPIQNF

Query:  EEEDGSNKRETSNS----------------------EDEDHQDKSENAS-------------------------------SHFEGFLEPLSFEDISPSCA
         EEDG+  R+TSNS                      EDEDH+ KS NAS                                 F+GFLEPLSFE+ S SCA
Subjt:  EEEDGSNKRETSNS----------------------EDEDHQDKSENAS-------------------------------SHFEGFLEPLSFEDISPSCA

Query:  PNCLDVILTEGRGQPRYEVNGENSPFSIDSLSSDNVIRTPQSDSNSAPKHFLPWLTADSCDKHDQDSAS----------DSHKAMTSITDLSFQFDCLAT
        PNCLDVIL EGRGQ RY+VNGENSPFSIDSLSSDNVIRTP SDS+ A K F PWLTADSC K DQ+SAS          DS  A+TSITDLSFQFDCLAT
Subjt:  PNCLDVILTEGRGQPRYEVNGENSPFSIDSLSSDNVIRTPQSDSNSAPKHFLPWLTADSCDKHDQDSAS----------DSHKAMTSITDLSFQFDCLAT

Query:  ISNSMDLNQFQKLLEDQAFRNSNSSCENLSTSQMRVSWREGLMSRIYEMDEFDSCRCLSDDEEENADTCSISLSDILKKTPLKHNDCEADPIICNSSCSS
        I NSMDL+QFQK+LEDQAF NSNSSCE+L  S+MRVSWREGLMSRIYEMDEFD+CRCLS DEEEN D+C   LSDIL KTPL+HNDCEADPI+ N  CS 
Subjt:  ISNSMDLNQFQKLLEDQAFRNSNSSCENLSTSQMRVSWREGLMSRIYEMDEFDSCRCLSDDEEENADTCSISLSDILKKTPLKHNDCEADPIICNSSCSS

Query:  RLLVNEEAEEYEK-----SNEVACCCAESISTDGGGLMASGDSDWHLCYKNGLFD
         LLV+EEA+EY+K     S++V C CAESISTDGGGL+ASGDSDW+LCYKNGLFD
Subjt:  RLLVNEEAEEYEK-----SNEVACCCAESISTDGGGLMASGDSDWHLCYKNGLFD

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LHN8 Uncharacterized protein; e value: 1.0e-225; % identity: 68.97
Query:  SSSSSPSSQIHAANNKLK-QQQHHQCPRGPFRVLNGISFSPACNTASVGSDASSTSTDAPRGCLRFFLSHSSSSSKTQTPANKLKSSSKNPKSTSHVRPL
        +SSSSPSS     N K K QQQ  Q PR PF VLN ISF  ACNT+S+GSDASSTST+APRGCLRFFL HSS+SSK  TPANKLK SSK PKS S+VRP+
Subjt:  SSSSSPSSQIHAANNKLK-QQQHHQCPRGPFRVLNGISFSPACNTASVGSDASSTSTDAPRGCLRFFLSHSSSSSKTQTPANKLKSSSKNPKSTSHVRPL

Query:  KPLRSKPLKENAPKRALTHNSRAPAEPSSTKLEPLKKNSPSLYRWPSGKKPCSLGTHKPKILASDGEELERHGAHSVVRMVDDPNKCEPS-------DFN
        KPLRSKPLKENAPK  +  +SRA A P+STKL+PLKKNSP LYRWPSGKKP SL THK K+LAS GEE  +HGAHSVVRMVDD  KCEPS       DFN
Subjt:  KPLRSKPLKENAPKRALTHNSRAPAEPSSTKLEPLKKNSPSLYRWPSGKKPCSLGTHKPKILASDGEELERHGAHSVVRMVDDPNKCEPS-------DFN

Query:  FTPLREIENGSGLDPTADKVVALEASNKDHTKTPPVQASVSPELQCGSALMPTLTPVCYGAGYVVSGISDKRKCRPRGVLIVGDNTPSISNVKPIQNFEE
        FTP+R++ENGSG DPT D VVALE SN DH+KTPPVQAS+SPELQCGSA+MP +TPVCYGAGYVVSGISDKRKCRPRG+LIVGDN  SIS VKPIQ F E
Subjt:  FTPLREIENGSGLDPTADKVVALEASNKDHTKTPPVQASVSPELQCGSALMPTLTPVCYGAGYVVSGISDKRKCRPRGVLIVGDNTPSISNVKPIQNFEE

Query:  EDGSNKRETSNS----------------------EDEDHQDKSENASS-------------------------------HFEGFLEPLSFEDISPSCAPN
        ED S  ++TSNS                      EDEDH++ S+NAS+                                F+GF+EPLSFED SPSCA N
Subjt:  EDGSNKRETSNS----------------------EDEDHQDKSENASS-------------------------------HFEGFLEPLSFEDISPSCAPN

Query:  CLDVILTEGRGQPRYEVNGENSPFSIDSLSSDNVIRTPQSDSNSAPKHFLPWLTADSCDKHDQDSAS--------DSHKAMTSITDLSFQFDCLATISNS
         L+VIL EGRGQ RY+VNGENSPFSIDSLSSDNVI+TPQSDSNSA K F PWL+ADSC+K+DQ+SAS        DS  A+TSITDLSFQFDCLATISNS
Subjt:  CLDVILTEGRGQPRYEVNGENSPFSIDSLSSDNVIRTPQSDSNSAPKHFLPWLTADSCDKHDQDSAS--------DSHKAMTSITDLSFQFDCLATISNS

Query:  MDLNQFQKLLEDQAFRNSNSSCENLSTSQMRVSWREGLMSRIYEMDEFDSCRCLSDDEEENADTCSISLSDILKKTPLKHNDCEADPIICNSSCSSRLLV
        MDL+QFQK+LEDQAFRN+NSSCE+L  S+MRVSWREGLMSR+YEMDEFD+CRCLS DEEEN D+CSISLSDI+ KTPL+H DCE DPI+ NSSCS  LLV
Subjt:  MDLNQFQKLLEDQAFRNSNSSCENLSTSQMRVSWREGLMSRIYEMDEFDSCRCLSDDEEENADTCSISLSDILKKTPLKHNDCEADPIICNSSCSSRLLV

Query:  NEEAEEYEK-----SNEVACCCAESISTDGGGLMASGDSDWHLCYKNGLFD
        NEEAEEY K     S++V C CAESISTDGGGL+ASGDSDW+LCY+NGLFD
Subjt:  NEEAEEYEK-----SNEVACCCAESISTDGGGLMASGDSDWHLCYKNGLFD

A0A1S4E1G9 uncharacterized protein LOC103496888; e value: 3.3e-216; % identity: 67.23
Query:  SSSSSPSSQIHAANNKLKQQQHHQCPRGPFRVLNGISFSPACNTASVGSDASSTSTDAPRGCLRFFLSHSSSSSKTQTPANKLKSSSKNPKSTSHVRPLK
        +SSSSPS  I     + +QQ+  QC   PF VLN ISF  ACNT+S+GSDASSTST+APRGCLRFFL HSS+SSK  TPANKLK SSK PKS S+VR +K
Subjt:  SSSSSPSSQIHAANNKLKQQQHHQCPRGPFRVLNGISFSPACNTASVGSDASSTSTDAPRGCLRFFLSHSSSSSKTQTPANKLKSSSKNPKSTSHVRPLK

Query:  PLRSKPLKENAPKRALTHNSRAPAEPSSTKLEPLKKNSPSLYRWPSGKKPCSLGTHKPKILASDGEELERHGAHSVVRMVDDPNKCE-------PSDFNF
        PLRSKPLKE APK A+  +SRA A P+STKL+PLKKNSP LYRWPSGKKP SL THK K+LAS GEEL  HGAHSVVRMVDD  KCE       PSDFNF
Subjt:  PLRSKPLKENAPKRALTHNSRAPAEPSSTKLEPLKKNSPSLYRWPSGKKPCSLGTHKPKILASDGEELERHGAHSVVRMVDDPNKCE-------PSDFNF

Query:  TPLREIENGSGLDPTADKVVALEASNKDHTKTPPVQASVSPELQCGSALMPTLTPVCYGAGYVVSGISDKRKCRPRGVLIVGDNTPSISNVKPIQNFEEE
        TP+R++ENGSG DPT D VVALE SN DH+KTPPVQAS+SPELQCGSA+MP +TPVCYGAGYVVSGISDKRKCRPRG+LIVGDN  SIS VKPIQ F EE
Subjt:  TPLREIENGSGLDPTADKVVALEASNKDHTKTPPVQASVSPELQCGSALMPTLTPVCYGAGYVVSGISDKRKCRPRGVLIVGDNTPSISNVKPIQNFEEE

Query:  DGSNKRETSNS----------------------EDEDHQDKSENASS-------------------------------HFEGFLEPLSFEDISPSCAPNC
        DGS  ++TSNS                      EDEDH++ S+NAS+                                F+GFLEPLS ED S SCA N 
Subjt:  DGSNKRETSNS----------------------EDEDHQDKSENASS-------------------------------HFEGFLEPLSFEDISPSCAPNC

Query:  LDVILTEGRGQPRYEVNGENSPFSIDSLSSDNVIRTPQSDSNSAPKHFLPWLTADSCDKHDQDSAS--------DSHKAMTSITDLSFQFDCLATISNSM
        L+VIL E RGQ RY+VNGENSPFS+DSLSSDNVI+TPQSDSNSA K F PWL+ADS +KH+Q+SAS        DS   +TSITDLSFQFDCLATISNSM
Subjt:  LDVILTEGRGQPRYEVNGENSPFSIDSLSSDNVIRTPQSDSNSAPKHFLPWLTADSCDKHDQDSAS--------DSHKAMTSITDLSFQFDCLATISNSM

Query:  DLNQFQKLLEDQAFRNSNSSCENLSTSQMRVSWREGLMSRIYEMDEFDSCRCLSDDEEENADTCSISLSDILKKTPLKHNDCEADPIICNSSCSSRLLVN
        DL+QFQK+LEDQAFRN+NSSCE+L  S+MRVSWREGLMSR+YEMDEFD+CRCLS DEEEN D+CSISLSDIL KTPL+  D E DPI+  S CS  LLVN
Subjt:  DLNQFQKLLEDQAFRNSNSSCENLSTSQMRVSWREGLMSRIYEMDEFDSCRCLSDDEEENADTCSISLSDILKKTPLKHNDCEADPIICNSSCSSRLLVN

Query:  EEAEEYEK-----SNEVACCCAESISTDGGGLMASGDSDWHLCYKNGLFD
        EEAEEY K     S++V C CAESISTDGGGL+ASGDSDW+LCY+NGLFD
Subjt:  EEAEEYEK-----SNEVACCCAESISTDGGGLMASGDSDWHLCYKNGLFD

A0A5D3BQU9 Uncharacterized protein; e value: 3.3e-216; % identity: 67.23
Query:  SSSSSPSSQIHAANNKLKQQQHHQCPRGPFRVLNGISFSPACNTASVGSDASSTSTDAPRGCLRFFLSHSSSSSKTQTPANKLKSSSKNPKSTSHVRPLK
        +SSSSPS  I     + +QQ+  QC   PF VLN ISF  ACNT+S+GSDASSTST+APRGCLRFFL HSS+SSK  TPANKLK SSK PKS S+VR +K
Subjt:  SSSSSPSSQIHAANNKLKQQQHHQCPRGPFRVLNGISFSPACNTASVGSDASSTSTDAPRGCLRFFLSHSSSSSKTQTPANKLKSSSKNPKSTSHVRPLK

Query:  PLRSKPLKENAPKRALTHNSRAPAEPSSTKLEPLKKNSPSLYRWPSGKKPCSLGTHKPKILASDGEELERHGAHSVVRMVDDPNKCE-------PSDFNF
        PLRSKPLKE APK A+  +SRA A P+STKL+PLKKNSP LYRWPSGKKP SL THK K+LAS GEEL  HGAHSVVRMVDD  KCE       PSDFNF
Subjt:  PLRSKPLKENAPKRALTHNSRAPAEPSSTKLEPLKKNSPSLYRWPSGKKPCSLGTHKPKILASDGEELERHGAHSVVRMVDDPNKCE-------PSDFNF

Query:  TPLREIENGSGLDPTADKVVALEASNKDHTKTPPVQASVSPELQCGSALMPTLTPVCYGAGYVVSGISDKRKCRPRGVLIVGDNTPSISNVKPIQNFEEE
        TP+R++ENGSG DPT D VVALE SN DH+KTPPVQAS+SPELQCGSA+MP +TPVCYGAGYVVSGISDKRKCRPRG+LIVGDN  SIS VKPIQ F EE
Subjt:  TPLREIENGSGLDPTADKVVALEASNKDHTKTPPVQASVSPELQCGSALMPTLTPVCYGAGYVVSGISDKRKCRPRGVLIVGDNTPSISNVKPIQNFEEE

Query:  DGSNKRETSNS----------------------EDEDHQDKSENASS-------------------------------HFEGFLEPLSFEDISPSCAPNC
        DGS  ++TSNS                      EDEDH++ S+NAS+                                F+GFLEPLS ED S SCA N 
Subjt:  DGSNKRETSNS----------------------EDEDHQDKSENASS-------------------------------HFEGFLEPLSFEDISPSCAPNC

Query:  LDVILTEGRGQPRYEVNGENSPFSIDSLSSDNVIRTPQSDSNSAPKHFLPWLTADSCDKHDQDSAS--------DSHKAMTSITDLSFQFDCLATISNSM
        L+VIL E RGQ RY+VNGENSPFS+DSLSSDNVI+TPQSDSNSA K F PWL+ADS +KH+Q+SAS        DS   +TSITDLSFQFDCLATISNSM
Subjt:  LDVILTEGRGQPRYEVNGENSPFSIDSLSSDNVIRTPQSDSNSAPKHFLPWLTADSCDKHDQDSAS--------DSHKAMTSITDLSFQFDCLATISNSM

Query:  DLNQFQKLLEDQAFRNSNSSCENLSTSQMRVSWREGLMSRIYEMDEFDSCRCLSDDEEENADTCSISLSDILKKTPLKHNDCEADPIICNSSCSSRLLVN
        DL+QFQK+LEDQAFRN+NSSCE+L  S+MRVSWREGLMSR+YEMDEFD+CRCLS DEEEN D+CSISLSDIL KTPL+  D E DPI+  S CS  LLVN
Subjt:  DLNQFQKLLEDQAFRNSNSSCENLSTSQMRVSWREGLMSRIYEMDEFDSCRCLSDDEEENADTCSISLSDILKKTPLKHNDCEADPIICNSSCSSRLLVN

Query:  EEAEEYEK-----SNEVACCCAESISTDGGGLMASGDSDWHLCYKNGLFD
        EEAEEY K     S++V C CAESISTDGGGL+ASGDSDW+LCY+NGLFD
Subjt:  EEAEEYEK-----SNEVACCCAESISTDGGGLMASGDSDWHLCYKNGLFD

A0A6J1ELH1 uncharacterized protein LOC111435650; e value: 0.0e+00; % identity: 91.71
Query:  MRP-SSSSSSSPSSQIHAANNKLK---------QQQHHQCPRGPFRVLNGISFSPACNTASVGSDASSTSTDAPRGCLRFFLSHSSSSSKTQTPANKLKS
        MRP SSSSSSSPSSQIHAAN KLK         QQQ HQ PRGPFRVLNGISFSPACNTASVGSDASSTSTDAPRGCLRFFLSHSSSSSKTQTPANKLKS
Subjt:  MRP-SSSSSSSPSSQIHAANNKLK---------QQQHHQCPRGPFRVLNGISFSPACNTASVGSDASSTSTDAPRGCLRFFLSHSSSSSKTQTPANKLKS

Query:  SSKNPKSTSHVRPLKPLRSKPLKENAPKRALTHNSRAPAEPSSTKLEPLKKNSPSLYRWPSGKKPCSLGTHKPKILASDGEELERHGAHSVVRMVDDPNK
        SSKNPKSTSHVRPLKPLRSKPLKENAPKRALTHNSRAPA+P STKLEPLKKNSPSLYRWPSGKKPCSLG HKPKILASDGEELERHGAH VVRMVDD NK
Subjt:  SSKNPKSTSHVRPLKPLRSKPLKENAPKRALTHNSRAPAEPSSTKLEPLKKNSPSLYRWPSGKKPCSLGTHKPKILASDGEELERHGAHSVVRMVDDPNK

Query:  CEPSDFNFTPLREIENGSGLDPTADKVVALEASNKDHTKTPPVQASVSPELQCGSALMPTLTPVCYGAGYVVSGISDKRKCRPRGVLIVGDNTPSISNVK
        CEPSDF+FTP+REI+NGSGLDPTA KVVALEASNKDHTKTPPVQASVSPELQCGSALMPTLTPVCYGAGYVVSGISDKRKCRPRGVLIVG+NTPSISNVK
Subjt:  CEPSDFNFTPLREIENGSGLDPTADKVVALEASNKDHTKTPPVQASVSPELQCGSALMPTLTPVCYGAGYVVSGISDKRKCRPRGVLIVGDNTPSISNVK

Query:  PIQNFEEEDGSNKRETSNS----------------------EDEDHQDKSENASSHFEGFLEPLSFEDISPSCAPNCLDVILTEGRGQPRYEVNGENSPF
        PIQNFEE DGSN+RETSNS                      EDEDHQDKSENASSHFEGFLEPLSFEDISPSCAPNCLDVILTEGRGQPRYEVNGENSPF
Subjt:  PIQNFEEEDGSNKRETSNS----------------------EDEDHQDKSENASSHFEGFLEPLSFEDISPSCAPNCLDVILTEGRGQPRYEVNGENSPF

Query:  SIDSLSSDNVIRTPQSDSNSAPKHFLPWLTADSCDKHDQDSASDSHKAMTSITDLSFQFDCLATISNSMDLNQFQKLLEDQAFRNSNSSCENLSTSQMRV
        SIDSLSSDNVIRTPQSDSNSAPKHF PWLTADSCDKHDQ+SASDSHKAMTSITDLSFQFDCLATISNSMDLNQFQKLLEDQAFRNSNSSCENLS SQMRV
Subjt:  SIDSLSSDNVIRTPQSDSNSAPKHFLPWLTADSCDKHDQDSASDSHKAMTSITDLSFQFDCLATISNSMDLNQFQKLLEDQAFRNSNSSCENLSTSQMRV

Query:  SWREGLMSRIYEMDEFDSCRCLSDDEEENADTCSISLSDILKKTPLKHNDCEADPIICNSSCSSRLLVNEEAEEYEKSNEVACCCAESISTDGGGLMASG
        SWREGLMSRIYEMDEFDSCRCLSDDEEENADTCSI+LSDILKKTPLKHNDCEADPIICNSSCSSRLLVNEEAEEYEKSNEVACCCAESISTDGGGLMASG
Subjt:  SWREGLMSRIYEMDEFDSCRCLSDDEEENADTCSISLSDILKKTPLKHNDCEADPIICNSSCSSRLLVNEEAEEYEKSNEVACCCAESISTDGGGLMASG

Query:  DSDWHLCYKNGLFDC
        DSDWHLCYKNGLFDC
Subjt:  DSDWHLCYKNGLFDC

A0A6J1I200 uncharacterized protein LOC111468888; e value: 0.0e+00; % identity: 92.62
Query:  MRP-SSSSSSSPSSQIHAANNKLK----QQQHHQCPRGPFRVLNGISFSPACNTASVGSDASSTSTDAPRGCLRFFLSHSSSSSKTQTPANKLKSSSKNP
        MRP SSSSSSSPSSQIHA N KLK    QQ HHQ PR PFRVLNGISFSPACNTASVGSDASSTSTDAPRGCLRFFLSHSSSSSKTQTPANKLKSSSKNP
Subjt:  MRP-SSSSSSSPSSQIHAANNKLK----QQQHHQCPRGPFRVLNGISFSPACNTASVGSDASSTSTDAPRGCLRFFLSHSSSSSKTQTPANKLKSSSKNP

Query:  KSTSHVRPLKPLRSKPLKENAPKRALTHNSRAPAEPSSTKLEPLKKNSPSLYRWPSGKKPCSLGTHKPKILASDGEELERHGAHSVVRMVDDPNKCEPSD
        KSTSHVRPLKPLRSKPLKENAPKRALTHNSRAPA+PSSTKLEPLKKNSPSLYRWPSGKKPCSLGTHKPKILASDG+ELER GAHSVVRMVDD NKC+PSD
Subjt:  KSTSHVRPLKPLRSKPLKENAPKRALTHNSRAPAEPSSTKLEPLKKNSPSLYRWPSGKKPCSLGTHKPKILASDGEELERHGAHSVVRMVDDPNKCEPSD

Query:  FNFTPLREIENGSGLDPTADKVVALEASNKDHTKTPPVQASVSPELQCGSALMPTLTPVCYGAGYVVSGISDKRKCRPRGVLIVGDNTPSISNVKPIQNF
        FNFTP+REIENGSGLDPTADKVVALEASNKDHTKTPPVQASVSPELQCGSAL+PTLTPVCYGAGYVVSGISDKRKCRPRGVLIVGDNTPSISNVKPIQNF
Subjt:  FNFTPLREIENGSGLDPTADKVVALEASNKDHTKTPPVQASVSPELQCGSALMPTLTPVCYGAGYVVSGISDKRKCRPRGVLIVGDNTPSISNVKPIQNF

Query:  EEEDGSNKRETSNS----------------------EDEDHQDKSENASSHFEGFLEPLSFEDISPSCAPNCLDVILTEGRGQPRYEVNGENSPFSIDSL
        EEEDGSNKRETSNS                      EDED QDKSENASSHFEGFLEPLSFEDISPSCAPNCLDVILTEGRGQPRYEVNGENSPFSIDSL
Subjt:  EEEDGSNKRETSNS----------------------EDEDHQDKSENASSHFEGFLEPLSFEDISPSCAPNCLDVILTEGRGQPRYEVNGENSPFSIDSL

Query:  SSDNVIRTPQSDSNSAPKHFLPWLTADSCDKHDQDSASDSHKAMTSITDLSFQFDCLATISNSMDLNQFQKLLEDQAFRNSNSSCENLSTSQMRVSWREG
        SSDNVIRTPQSDSNSAPKHF PWLTADSC KHDQ+SASDSHKAMTSITDLSFQFDCLATISNSMDLNQFQKLLEDQAFRNSNSSCENLS SQMRVSWREG
Subjt:  SSDNVIRTPQSDSNSAPKHFLPWLTADSCDKHDQDSASDSHKAMTSITDLSFQFDCLATISNSMDLNQFQKLLEDQAFRNSNSSCENLSTSQMRVSWREG

Query:  LMSRIYEMDEFDSCRCLSDDEEENADTCSISLSDILKKTPLKHNDCEADPIICNSSCSSRLLVNEEAEEYEKSNEVACCCAESISTDGGGLMASGDSDWH
        LMSRIYEMDEFDSCRCLSD+EEENADTCSISLSDILKKTPLKHNDCEADPIICNSSCSSRLLVNEEAEEYEKSNEVACCCAESISTDGGGLMASGDSDWH
Subjt:  LMSRIYEMDEFDSCRCLSDDEEENADTCSISLSDILKKTPLKHNDCEADPIICNSSCSSRLLVNEEAEEYEKSNEVACCCAESISTDGGGLMASGDSDWH

Query:  LCYKNGLFDC
        LCYKNGLFDC
Subjt:  LCYKNGLFDC

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT2G43990.1 unknown protein; e value: 3.1e-33; % identity: 28.3
Query:  RPSSSSSSS----PSSQIHAANNKLKQQQHHQCPRGPFRVLNGISFSPACNTASVGSDASSTSTDAPRGCLRFFLSHSSSSSKTQT-------PANKLKS
        R +SSS++     P SQ        K+ +  + P  P R  +  + S   ++ S  S + STS +A  GC RF LSHS SSS + +       P   ++S
Subjt:  RPSSSSSSS----PSSQIHAANNKLKQQQHHQCPRGPFRVLNGISFSPACNTASVGSDASSTSTDAPRGCLRFFLSHSSSSSKTQT-------PANKLKS

Query:  SSKNPKSTSHVRPLKPLRSKPLKENAPK--RALTHNSRAPAEPSSTKLEPLKKNSPSLYRWPSGKKPCSLGTHKP------KILASDGEELERHGAHSVV
         +KNPKS        P+ SKPL    P     +   S    +P+  K +  K N        SGK+P    T KP      K  +S    ++   + + +
Subjt:  SSKNPKSTSHVRPLKPLRSKPLKENAPK--RALTHNSRAPAEPSSTKLEPLKKNSPSLYRWPSGKKPCSLGTHKP------KILASDGEELERHGAHSVV

Query:  RMVDDPNKCEPSDFNFTPLREIENGSGL------DPTADKVVALEASNKDHTKTPPVQASVSPELQCGSAL---MPTLTPVCYGAGYVVSGISDKRKCRP
        R VDD      S    TP+ ++E GS L      + T D  ++  +S+    +TPPVQASVSPE+QCGS++       +  CY AG+++SG+SDKRKC+P
Subjt:  RMVDDPNKCEPSDFNFTPLREIENGSGL------DPTADKVVALEASNKDHTKTPPVQASVSPELQCGSAL---MPTLTPVCYGAGYVVSGISDKRKCRP

Query:  RGVLIVGDNTPSISNVKPIQN---FEEEDGSNKRETSN-------------------SEDEDHQDK-SENASSHFEGFLEPLSFEDISP-----------
        +G+L VG+N   +   K + +   F+E D  N     +                    E+++H+ + S++  S F+  +E +  E  SP           
Subjt:  RGVLIVGDNTPSISNVKPIQN---FEEEDGSNKRETSN-------------------SEDEDHQDK-SENASSHFEGFLEPLSFEDISP-----------

Query:  -----------------------SCAPNCL-----DVILTEGRGQ-------------PRYEVNGE-NSPFSIDSLSSDNVIRTPQSDSNSAPKHFLPWL
                               S +PN L      + L+   G+             P   + G+ +SP S+D+L S+NVI+TP+S+S+      L   
Subjt:  -----------------------SCAPNCL-----DVILTEGRGQ-------------PRYEVNGE-NSPFSIDSLSSDNVIRTPQSDSNSAPKHFLPWL

Query:  TADSCDKHDQDSASDSHKAMTSITDL--------------SFQFDCLATISNSMDLNQFQKLLEDQAFRNSNSSCENLSTSQMRVSWREGLMSRIYEMDE
         A+   KHD  S  +S         L              SF FD LAT S+S+DL+QFQ+ L D++  + + + + +S + +RV               
Subjt:  TADSCDKHDQDSASDSHKAMTSITDL--------------SFQFDCLATISNSMDLNQFQKLLEDQAFRNSNSSCENLSTSQMRVSWREGLMSRIYEMDE

Query:  FDSCRCLSDDEEENADTCSISLSDILKKTPLKHNDCEADPIICNSSCSSRLLVNEEAEEYEKSNE-VACCCAESISTDGGGLMASGDSDWHLCYKN
                  E+ N+    +    I         D E D  I N          E A    K  E + C  AESISTDGGGL+ S DS+W  CYKN
Subjt:  FDSCRCLSDDEEENADTCSISLSDILKKTPLKHNDCEADPIICNSSCSSRLLVNEEAEEYEKSNE-VACCCAESISTDGGGLMASGDSDWHLCYKN


Sequences
CDS sequence
ATGAGACCTTCCTCTTCCTCTTCTTCTTCGCCTTCGTCTCAAATCCACGCCGCTAACAACAAGCTCAAGCAGCAGCAGCACCACCAGTGTCCTCGGGGCCCCTTCCGTGT
TCTTAATGGCATTTCCTTTTCCCCCGCCTGCAACACCGCCAGCGTCGGCAGTGACGCTTCCTCCACCTCCACCGATGCTCCCAGAGGCTGTCTCAGGTTCTTTCTCTCTC
ATTCCTCTTCATCTTCTAAAACACAGACCCCTGCTAATAAGCTTAAATCATCTTCCAAAAACCCTAAATCTACTTCTCATGTTCGCCCCCTCAAGCCCCTTAGATCTAAG
CCGCTTAAGGAGAATGCTCCCAAACGCGCTCTTACACACAATTCTAGGGCGCCCGCAGAACCTTCCTCCACAAAGTTGGAGCCATTGAAGAAAAACTCCCCCTCCCTGTA
TAGATGGCCGTCCGGGAAGAAGCCCTGTTCATTAGGTACCCACAAACCTAAAATCTTGGCATCTGATGGTGAGGAGTTGGAGAGACATGGGGCACATAGTGTAGTGAGGA
TGGTTGATGATCCCAACAAATGTGAACCGTCCGATTTCAACTTCACTCCCCTGCGTGAAATCGAAAACGGCTCTGGTTTGGATCCAACAGCTGACAAGGTTGTAGCCCTA
GAGGCTTCAAACAAAGATCATACTAAAACGCCTCCTGTTCAAGCCTCTGTGTCACCTGAATTACAATGTGGTTCAGCACTTATGCCTACACTTACTCCTGTCTGCTATGG
TGCTGGATATGTCGTTTCTGGGATCTCTGACAAGAGAAAGTGTAGACCCAGAGGAGTTCTTATTGTGGGAGACAATACTCCATCCATTTCTAATGTCAAGCCCATTCAGA
ATTTTGAGGAAGAAGATGGAAGCAACAAAAGGGAGACTTCCAATTCTGAGGATGAGGATCACCAAGACAAGTCTGAAAACGCTTCATCTCACTTTGAAGGTTTCTTGGAG
CCATTATCCTTTGAGGACATCTCTCCCTCATGCGCTCCTAATTGCTTGGATGTGATATTAACGGAGGGAAGAGGACAACCGAGGTATGAAGTCAATGGGGAGAATTCTCC
ATTCTCAATTGACTCATTAAGCAGTGACAATGTCATCCGAACACCACAATCAGACTCAAATTCAGCTCCAAAACATTTCCTTCCATGGTTAACTGCTGACAGTTGTGATA
AACATGATCAGGATTCAGCGTCTGACAGCCATAAAGCAATGACAAGTATAACAGATTTAAGTTTCCAATTTGATTGTCTGGCCACAATATCCAATTCCATGGATCTTAAC
CAATTTCAAAAGCTTCTTGAAGATCAGGCTTTTAGGAATAGCAATTCTTCGTGTGAGAATTTGTCAACATCCCAAATGAGAGTATCATGGAGGGAAGGGTTAATGAGCCG
GATCTACGAGATGGACGAATTTGATAGTTGTCGATGCTTATCAGACGACGAAGAAGAGAATGCTGACACTTGCAGCATTAGCTTGTCAGATATCCTTAAGAAGACTCCTC
TGAAACATAATGATTGTGAGGCTGATCCTATAATTTGCAACAGTTCTTGTTCTTCTCGATTATTAGTGAATGAGGAAGCCGAAGAATATGAGAAAAGTAATGAAGTAGCA
TGTTGTTGTGCGGAATCCATTAGCACGGATGGAGGTGGGTTGATGGCATCAGGGGACTCAGATTGGCATTTATGCTACAAAAATGGATTGTTTGATTGCTAA
mRNA sequence
ATGAGACCTTCCTCTTCCTCTTCTTCTTCGCCTTCGTCTCAAATCCACGCCGCTAACAACAAGCTCAAGCAGCAGCAGCACCACCAGTGTCCTCGGGGCCCCTTCCGTGT
TCTTAATGGCATTTCCTTTTCCCCCGCCTGCAACACCGCCAGCGTCGGCAGTGACGCTTCCTCCACCTCCACCGATGCTCCCAGAGGCTGTCTCAGGTTCTTTCTCTCTC
ATTCCTCTTCATCTTCTAAAACACAGACCCCTGCTAATAAGCTTAAATCATCTTCCAAAAACCCTAAATCTACTTCTCATGTTCGCCCCCTCAAGCCCCTTAGATCTAAG
CCGCTTAAGGAGAATGCTCCCAAACGCGCTCTTACACACAATTCTAGGGCGCCCGCAGAACCTTCCTCCACAAAGTTGGAGCCATTGAAGAAAAACTCCCCCTCCCTGTA
TAGATGGCCGTCCGGGAAGAAGCCCTGTTCATTAGGTACCCACAAACCTAAAATCTTGGCATCTGATGGTGAGGAGTTGGAGAGACATGGGGCACATAGTGTAGTGAGGA
TGGTTGATGATCCCAACAAATGTGAACCGTCCGATTTCAACTTCACTCCCCTGCGTGAAATCGAAAACGGCTCTGGTTTGGATCCAACAGCTGACAAGGTTGTAGCCCTA
GAGGCTTCAAACAAAGATCATACTAAAACGCCTCCTGTTCAAGCCTCTGTGTCACCTGAATTACAATGTGGTTCAGCACTTATGCCTACACTTACTCCTGTCTGCTATGG
TGCTGGATATGTCGTTTCTGGGATCTCTGACAAGAGAAAGTGTAGACCCAGAGGAGTTCTTATTGTGGGAGACAATACTCCATCCATTTCTAATGTCAAGCCCATTCAGA
ATTTTGAGGAAGAAGATGGAAGCAACAAAAGGGAGACTTCCAATTCTGAGGATGAGGATCACCAAGACAAGTCTGAAAACGCTTCATCTCACTTTGAAGGTTTCTTGGAG
CCATTATCCTTTGAGGACATCTCTCCCTCATGCGCTCCTAATTGCTTGGATGTGATATTAACGGAGGGAAGAGGACAACCGAGGTATGAAGTCAATGGGGAGAATTCTCC
ATTCTCAATTGACTCATTAAGCAGTGACAATGTCATCCGAACACCACAATCAGACTCAAATTCAGCTCCAAAACATTTCCTTCCATGGTTAACTGCTGACAGTTGTGATA
AACATGATCAGGATTCAGCGTCTGACAGCCATAAAGCAATGACAAGTATAACAGATTTAAGTTTCCAATTTGATTGTCTGGCCACAATATCCAATTCCATGGATCTTAAC
CAATTTCAAAAGCTTCTTGAAGATCAGGCTTTTAGGAATAGCAATTCTTCGTGTGAGAATTTGTCAACATCCCAAATGAGAGTATCATGGAGGGAAGGGTTAATGAGCCG
GATCTACGAGATGGACGAATTTGATAGTTGTCGATGCTTATCAGACGACGAAGAAGAGAATGCTGACACTTGCAGCATTAGCTTGTCAGATATCCTTAAGAAGACTCCTC
TGAAACATAATGATTGTGAGGCTGATCCTATAATTTGCAACAGTTCTTGTTCTTCTCGATTATTAGTGAATGAGGAAGCCGAAGAATATGAGAAAAGTAATGAAGTAGCA
TGTTGTTGTGCGGAATCCATTAGCACGGATGGAGGTGGGTTGATGGCATCAGGGGACTCAGATTGGCATTTATGCTACAAAAATGGATTGTTTGATTGCTAA
Protein sequence
MRPSSSSSSSPSSQIHAANNKLKQQQHHQCPRGPFRVLNGISFSPACNTASVGSDASSTSTDAPRGCLRFFLSHSSSSSKTQTPANKLKSSSKNPKSTSHVRPLKPLRSK
PLKENAPKRALTHNSRAPAEPSSTKLEPLKKNSPSLYRWPSGKKPCSLGTHKPKILASDGEELERHGAHSVVRMVDDPNKCEPSDFNFTPLREIENGSGLDPTADKVVAL
EASNKDHTKTPPVQASVSPELQCGSALMPTLTPVCYGAGYVVSGISDKRKCRPRGVLIVGDNTPSISNVKPIQNFEEEDGSNKRETSNSEDEDHQDKSENASSHFEGFLE
PLSFEDISPSCAPNCLDVILTEGRGQPRYEVNGENSPFSIDSLSSDNVIRTPQSDSNSAPKHFLPWLTADSCDKHDQDSASDSHKAMTSITDLSFQFDCLATISNSMDLN
QFQKLLEDQAFRNSNSSCENLSTSQMRVSWREGLMSRIYEMDEFDSCRCLSDDEEENADTCSISLSDILKKTPLKHNDCEADPIICNSSCSSRLLVNEEAEEYEKSNEVA
CCCAESISTDGGGLMASGDSDWHLCYKNGLFDC
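
The CDS and protein sequences above are related by translation. The following is a minimal Python sketch of that step, assuming the standard nuclear genetic code (the record does not name a translation table). The names `CODON_TABLE` and `translate` are hypothetical helpers and not part of CuGenDB.

```python
# Standard genetic code in compact form: codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        # Codons containing ambiguous bases are reported as 'X'.
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 18 nucleotides of the CDS sequence listed above.
print(translate("ATGAGACCTTCCTCTTCC"))  # MRPSSS
```

Passing the full CDS text (line breaks included) to `translate` should give a string that can be compared against the listed protein sequence.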