; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr028757 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr028757
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionNeuronal PAS domain protein
Genome locationtig00153206:1694494..1701076
RNA-Seq ExpressionSgr028757
SyntenySgr028757
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7030639.1 hypothetical protein SDJN02_04676, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0082.17Show/hide
Query:  MASCSFPDVYSWIQNLPPLSQWKTTSFSKSICSSSSTNSTLNVVAAKNLHSPTITLSVIADFSLPISLWTSKPLKTDTKSSNLLDEESMSNLLLNCVHDV
        MASCSFPDVYSWIQ LPPLSQWKT+S S SIC+S+S +S+L +VAAKNLHSPTITLS+IADFS PISLWTSKPLKT T SSNL DEE+MS LLLNCVHDV
Subjt:  MASCSFPDVYSWIQNLPPLSQWKTTSFSKSICSSSSTNSTLNVVAAKNLHSPTITLSVIADFSLPISLWTSKPLKTDTKSSNLLDEESMSNLLLNCVHDV

Query:  LHYGSNQQKNPCLNFLKLDATFNLKEIFNLAFLTLIFLICIYEAPTDLRSDCLTTLKHHLANCRSRQTSTVLMKLLGSNLEEQWMRSLNLAITNWILELK
        L+YGSNQ+KN     LKLD T + +EIFNLAFLTLIFLICIYEAPTDLRS+CL TLKHHLAN  SRQ S VLMKLLGSNLEEQWMRS+NLAITNW+LELK
Subjt:  LHYGSNQQKNPCLNFLKLDATFNLKEIFNLAFLTLIFLICIYEAPTDLRSDCLTTLKHHLANCRSRQTSTVLMKLLGSNLEEQWMRSLNLAITNWILELK

Query:  ANSGTLKTPSPLFSYSFSTDGLWKVQLYCPIIAMDYIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAVVREKWIDMRVHVDNIRCDIIRLVNETLLSER
        AN  TLKTPSPL+SYSFST GLWKVQLYCPIIAMD IENSSNPSTDERLQFSLNYHQLEGVLQFNY+ VVR+KWIDMRVHVDNIRCDI+RLVNETLLSER
Subjt:  ANSGTLKTPSPLFSYSFSTDGLWKVQLYCPIIAMDYIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAVVREKWIDMRVHVDNIRCDIIRLVNETLLSER

Query:  GVGGSEKHFPSRISLQLTPILQTNIMSVSVSKSSDNPRIEVGNEKSFEAGFEP--PNPGLKLAIGETVTMSLKPWKFEQFVHGNAATLNWYLHDSSDGKE
        GVGGSEKHFPSRISLQLTP   TNIMSVSVSKSS+NP+IEVG E++FEAGFEP  P PGLKL++GET  +SLKPWKFEQFVHGNAA LNWYLHDSSDGKE
Subjt:  GVGGSEKHFPSRISLQLTPILQTNIMSVSVSKSSDNPRIEVGNEKSFEAGFEP--PNPGLKLAIGETVTMSLKPWKFEQFVHGNAATLNWYLHDSSDGKE

Query:  VASTKPSMLALIKPKAWFRDRYSSAYRPFNKQGGVIFARDKYGESVWWKIDAKAKGKTMEW-ELRGWIWNLPPLSQWKTTSISISICSSSSTNSSLNVVA
        VASTKPS L LI PKAWFRDRYSSA+RPFNKQGG+IFA D+YGE+VWWKID KA+GKTM++ ++  WI NLPPLSQWKTTSIS SICSSSS+NSSLNVVA
Subjt:  VASTKPSMLALIKPKAWFRDRYSSAYRPFNKQGGVIFARDKYGESVWWKIDAKAKGKTMEW-ELRGWIWNLPPLSQWKTTSISISICSSSSTNSSLNVVA

Query:  TVNLHFPTITFSVIADLSFPISLWTSKPLKISTKSTSLL-DEESMSSLLLNFVHDVLHYGSNQQNNSSLNFLKLDITFNLNEIFNLAFLTLIFLICIYEA
          +LH  TIT SVIAD S PISLW+S+PLK STKS++LL D+ES+SSLLLN + DVLHYGS+QQ N S  FLKL+ITFN  EIFN+ FL L+FLICIYEA
Subjt:  TVNLHFPTITFSVIADLSFPISLWTSKPLKISTKSTSLL-DEESMSSLLLNFVHDVLHYGSNQQNNSSLNFLKLDITFNLNEIFNLAFLTLIFLICIYEA

Query:  PTNLRSNCLMTLKHHLANCQSRQTSKVLMKLLGSNLEEQWMRSLNLAVTNWILELKANSCALKTPSPLFSYSFSTHGLWKVQLYCPITAMDNIENSSNPS
        PT LR +CL TLK+HL N QSRQ SK+LMKLLGSN+EEQWMRS+NLA+TNWI+ELKANSC LKTPSPLFSYSFSTHGLWKVQLYCP+ AMD IENS NPS
Subjt:  PTNLRSNCLMTLKHHLANCQSRQTSKVLMKLLGSNLEEQWMRSLNLAVTNWILELKANSCALKTPSPLFSYSFSTHGLWKVQLYCPITAMDNIENSSNPS

Query:  TDERLQFSLNYHQLEGVLQFNYKAEVREKWVDLRVHVDNIRCDIIRLVNETLLSKRGVGGSEKHFPSRISLQITPALQTNVMSVSVSKSSYNPAIDIETE
        TDERLQ SLNYHQLEG+LQFNYKAEVREKW++LRVHVDNIRC+II LVN+ LLSKRGVGGSEK+FPSRISLQ+TP LQTN+MSVSVSKSS NP I++ TE
Subjt:  TDERLQFSLNYHQLEGVLQFNYKAEVREKWVDLRVHVDNIRCDIIRLVNETLLSKRGVGGSEKHFPSRISLQITPALQTNVMSVSVSKSSYNPAIDIETE

Query:  KTFEAGFQPATPYPGLKLAVGGTVTASLKPWKFEQFVYGNTAILNWYLHDSSDGKEVASTKPSKLALINPRAWFRDRYTSAFRPFNKQGGVIFAGDEYGE
        KT EAGF+P+ PYPGLKLAVG TVTASLKPWKFEQ VYGNT ILNWYLHDSSDGKEVAS KPSKLALINPRAWFRDRY+SAFRPFN+QGGVIFAGDEYGE
Subjt:  KTFEAGFQPATPYPGLKLAVGGTVTASLKPWKFEQFVYGNTAILNWYLHDSSDGKEVASTKPSKLALINPRAWFRDRYTSAFRPFNKQGGVIFAGDEYGE

Query:  SVWWKIEGEARGKTMEWEIR
        SVWWKI+G AR KT+EWEIR
Subjt:  SVWWKIEGEARGKTMEWEIR

XP_022146787.1 uncharacterized protein LOC111015909 [Momordica charantia]2.6e-23886.91Show/hide
Query:  MASCSFPDVYSWIQNLPPLSQWKTTSFSKSICSSSSTNSTLNVVAAKNLHSPTITLSVIADFSLPISLWTSKPLKTDTKSSNLLDEESMSNLLLNCVHDV
        MASCSF DVYSWI+NLPPLSQWKTTS S +IC+SSSTNS+L+V+AAK+LHSPTIT S++A FS PISLWTSKPL   TKSSNLLDEES+S+LLLNCVHDV
Subjt:  MASCSFPDVYSWIQNLPPLSQWKTTSFSKSICSSSSTNSTLNVVAAKNLHSPTITLSVIADFSLPISLWTSKPLKTDTKSSNLLDEESMSNLLLNCVHDV

Query:  LHYGSNQQKNPCLNFLKLDATFNLKEIFNLAFLTLIFLICIYEAPTDLRSDCLTTLKHHLANCRSRQTSTVLMKLLGSNLEEQWMRSLNLAITNWILELK
        L+YGSNQQK+  LNFLK +  FN KE FNLAFLTL+FLICIYEAPTDLRSDCLTTLKHHLANCRSRQ S VLMKLLGSNLEEQWMRSLNL+ITNWI ELK
Subjt:  LHYGSNQQKNPCLNFLKLDATFNLKEIFNLAFLTLIFLICIYEAPTDLRSDCLTTLKHHLANCRSRQTSTVLMKLLGSNLEEQWMRSLNLAITNWILELK

Query:  ANSGTLKTPSPLFSYSFSTDGLWKVQLYCPIIAMDYIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAVVREKWIDMRVHVDNIRCDIIRLVNETLLSER
         NS T+KTPSPLFSYSFSTDGLWKVQLYCPIIAMD +ENSSNPS DERLQFSLNYHQLEGVLQFN+KAVVREKW D+RVHVDNIRCDIIRLVNETLLSER
Subjt:  ANSGTLKTPSPLFSYSFSTDGLWKVQLYCPIIAMDYIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAVVREKWIDMRVHVDNIRCDIIRLVNETLLSER

Query:  GVGGSEKHFPSRISLQLTPILQTNIMSVSVSKSSDNPRIEVGNEKSFEAGFEPPNPGLKLAIGETVTMSLKPWKFEQFVHGNAATLNWYLHDSSDGKEVA
        G GGSEKHFPSRISL++TP +QTNIMSVSVSKSSDNPRI+VGNEK+FEAGFEPPNP LKLAIGETV+MSLKPWKFEQFVHGNAATLNWYLHDSSDGKEVA
Subjt:  GVGGSEKHFPSRISLQLTPILQTNIMSVSVSKSSDNPRIEVGNEKSFEAGFEPPNPGLKLAIGETVTMSLKPWKFEQFVHGNAATLNWYLHDSSDGKEVA

Query:  STKPSMLALIKPKAWFRDRYSSAYRPFNKQGGVIFARDKYGESVWWKIDAKAKGKTMEWELRGWIW
        ST+PS LALI PKAWFRDRYSSAYRPFNKQGGVIFA D+YGESVWWKIDAKA+GK MEWE+RGWIW
Subjt:  STKPSMLALIKPKAWFRDRYSSAYRPFNKQGGVIFARDKYGESVWWKIDAKAKGKTMEWELRGWIW

XP_022146788.1 uncharacterized protein LOC111015910 [Momordica charantia]3.7e-23289.4Show/hide
Query:  WIWNLPPLSQWKTTSISISICSSSSTNSSLNVVATVNLHFPTITFSVIADLSFPISLWTSKPLKISTKSTSLLDEESMSSLLLNFVHDVLHYGSNQQNNS
        WI NLPPLSQWK TS S SICSSSSTNSSLN VAT NLH PT+TFSVIAD+SFPISLWTSKPLKISTKSTSL+DEES+S LLLNFVHDVLHYGSNQQ N 
Subjt:  WIWNLPPLSQWKTTSISISICSSSSTNSSLNVVATVNLHFPTITFSVIADLSFPISLWTSKPLKISTKSTSLLDEESMSSLLLNFVHDVLHYGSNQQNNS

Query:  SLNFLKLDITFNLNEIFNLAFLTLIFLICIYEAPTNLRSNCLMTLKHHLANCQSRQTSKVLMKLLGSNLEEQWMRSLNLAVTNWILELKANSCALKTPSP
        SLNFL+L+ITFN  EIFNLAFLTLIFLICIYEAPT LRS+CL TLKHHLANCQSRQTSK+LMKLLGSNLEEQWMRS+NLA+TNWILELKANSC LKTPSP
Subjt:  SLNFLKLDITFNLNEIFNLAFLTLIFLICIYEAPTNLRSNCLMTLKHHLANCQSRQTSKVLMKLLGSNLEEQWMRSLNLAVTNWILELKANSCALKTPSP

Query:  LFSYSFSTHGLWKVQLYCPITAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAEVREKWVDLRVHVDNIRCDIIRLVNETLLSKRGVGGSEKHFPS
        LFSYSFST GLWKVQLYCP+ AMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAEVREKWVDLRVHVDNIRCDII+LVN+TLLSKRGVG SEKHFPS
Subjt:  LFSYSFSTHGLWKVQLYCPITAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAEVREKWVDLRVHVDNIRCDIIRLVNETLLSKRGVGGSEKHFPS

Query:  RISLQITPALQTNVMSVSVSKSSYNPAIDIETEKTFEAGFQPATPYPGLKLAVGGTVTASLKPWKFEQFVYGNTAILNWYLHDSSDGKEVASTKPSKLAL
        RISLQ+TP LQTN+MSVSVSKSS NP ID+ TEKTFEAGF+PA  YPGLKLAVG +VT SLKPWKFEQFVYGNTAILNWYLHDSSDGKEVASTKPSKLAL
Subjt:  RISLQITPALQTNVMSVSVSKSSYNPAIDIETEKTFEAGFQPATPYPGLKLAVGGTVTASLKPWKFEQFVYGNTAILNWYLHDSSDGKEVASTKPSKLAL

Query:  INPRAWFRDRYTSAFRPFNKQGGVIFAGDEYGESVWWKIEGEARGKTMEWEIR
        INPRAWFRDRY+SA RPFNKQGGVIFAGDEYGESVWWKIE +ARGKTMEWEIR
Subjt:  INPRAWFRDRYTSAFRPFNKQGGVIFAGDEYGESVWWKIEGEARGKTMEWEIR

XP_022995522.1 uncharacterized protein LOC111491028 [Cucurbita maxima]1.3e-22984.62Show/hide
Query:  MASCSFPDVYSWIQNLPPLSQWKTTSFSKSICSSSSTNSTLNVVAAKNLHSPTITLSVIADFSLPISLWTSKPLKTDTKSSNLLDEESMSNLLLNCVHDV
        MASCSFPDVYSWIQ LPPLSQWKT+S S SIC+S+S +S+L +VAAKNLHSPTITLS+IADFS PISLWTSKPLKT T SSNL DEE+MS LLLNCVHDV
Subjt:  MASCSFPDVYSWIQNLPPLSQWKTTSFSKSICSSSSTNSTLNVVAAKNLHSPTITLSVIADFSLPISLWTSKPLKTDTKSSNLLDEESMSNLLLNCVHDV

Query:  LHYGSNQQKNPCLNFLKLDATFNLKEIFNLAFLTLIFLICIYEAPTDLRSDCLTTLKHHLANCRSRQTSTVLMKLLGSNLEEQWMRSLNLAITNWILELK
        L+YGSNQ+KN     LKLD T + K+IFNLAFLTLIFLICIYEAPTDLRS+CL TLKHHLAN  SRQ S VLMKLLGSNLEEQWMRS+NLAITNW+LELK
Subjt:  LHYGSNQQKNPCLNFLKLDATFNLKEIFNLAFLTLIFLICIYEAPTDLRSDCLTTLKHHLANCRSRQTSTVLMKLLGSNLEEQWMRSLNLAITNWILELK

Query:  ANSGTLKTPSPLFSYSFSTDGLWKVQLYCPIIAMDYIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAVVREKWIDMRVHVDNIRCDIIRLVNETLLSER
        AN  TLKTPSPL+SYSFST GLWKVQLYCPIIAMD IENSSNPSTDERLQFSLNYHQLEGVLQFNY+ VVR+KWIDMRVHVDNIRCDIIRLVNETLLSER
Subjt:  ANSGTLKTPSPLFSYSFSTDGLWKVQLYCPIIAMDYIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAVVREKWIDMRVHVDNIRCDIIRLVNETLLSER

Query:  GVGGSEKHFPSRISLQLTPILQTNIMSVSVSKSSDNPRIEVGNEKSFEAGFEP--PNPGLKLAIGETVTMSLKPWKFEQFVHGNAATLNWYLHDSSDGKE
        GVGGSEKHFPSRISLQLTP   TNIMSVSVSKSS+NP+IEVG E++FEAGFEP  P PGLKL++GET  +SLKPWKFEQFVHGNAATLNWYLHDSSDGKE
Subjt:  GVGGSEKHFPSRISLQLTPILQTNIMSVSVSKSSDNPRIEVGNEKSFEAGFEP--PNPGLKLAIGETVTMSLKPWKFEQFVHGNAATLNWYLHDSSDGKE

Query:  VASTKPSMLALIKPKAWFRDRYSSAYRPFNKQGGVIFARDKYGESVWWKIDAKAKGKTMEWELRGWIW
        VASTKPS LALI PKAWFRDRYSSA+RPFNKQGGVIFA D+YGE+VWWKID KA+GKTMEWE++GWIW
Subjt:  VASTKPSMLALIKPKAWFRDRYSSAYRPFNKQGGVIFARDKYGESVWWKIDAKAKGKTMEWELRGWIW

XP_023544514.1 uncharacterized protein LOC111804061 [Cucurbita pepo subsp. pepo]1.0e-22984.62Show/hide
Query:  MASCSFPDVYSWIQNLPPLSQWKTTSFSKSICSSSSTNSTLNVVAAKNLHSPTITLSVIADFSLPISLWTSKPLKTDTKSSNLLDEESMSNLLLNCVHDV
        MASCSFPDVYSWIQ LPPLSQWKTTS S SIC+S+S +S+L +VAAKNLHSPTITLS+IADFS PISLWTSKPLKT T SSNL DEE+MS LLLNCVHDV
Subjt:  MASCSFPDVYSWIQNLPPLSQWKTTSFSKSICSSSSTNSTLNVVAAKNLHSPTITLSVIADFSLPISLWTSKPLKTDTKSSNLLDEESMSNLLLNCVHDV

Query:  LHYGSNQQKNPCLNFLKLDATFNLKEIFNLAFLTLIFLICIYEAPTDLRSDCLTTLKHHLANCRSRQTSTVLMKLLGSNLEEQWMRSLNLAITNWILELK
        L+YGSNQ+KN     LKLD T + +EIFNLAFLTLIFLICIYEAPTDLRS+CL TLKHHLAN  SRQ S VLMKLLGSNLE+QWMRS+NLAITNW+LELK
Subjt:  LHYGSNQQKNPCLNFLKLDATFNLKEIFNLAFLTLIFLICIYEAPTDLRSDCLTTLKHHLANCRSRQTSTVLMKLLGSNLEEQWMRSLNLAITNWILELK

Query:  ANSGTLKTPSPLFSYSFSTDGLWKVQLYCPIIAMDYIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAVVREKWIDMRVHVDNIRCDIIRLVNETLLSER
        AN  TLKTPSPL+SYSFS+ GLWKVQLYCPIIAMD IENSSNPSTDERLQFSLNYHQLEGVLQFNY+ VVR+KWIDMRVHVDNIRCDIIRLVNETLLSER
Subjt:  ANSGTLKTPSPLFSYSFSTDGLWKVQLYCPIIAMDYIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAVVREKWIDMRVHVDNIRCDIIRLVNETLLSER

Query:  GVGGSEKHFPSRISLQLTPILQTNIMSVSVSKSSDNPRIEVGNEKSFEAGFEP--PNPGLKLAIGETVTMSLKPWKFEQFVHGNAATLNWYLHDSSDGKE
        GVGGSEKHFPSRISLQLTP   TNIMSVSVSKSS+NP+IEVG E++FEAGFEP  P PGLKL++GET  +SLKPWKFEQFVHGNAATLNWYLHDSSDGKE
Subjt:  GVGGSEKHFPSRISLQLTPILQTNIMSVSVSKSSDNPRIEVGNEKSFEAGFEP--PNPGLKLAIGETVTMSLKPWKFEQFVHGNAATLNWYLHDSSDGKE

Query:  VASTKPSMLALIKPKAWFRDRYSSAYRPFNKQGGVIFARDKYGESVWWKIDAKAKGKTMEWELRGWIW
        VASTKPS LALI PKAWFRDRYSSA+RPFNKQGGVIFA D+YGE+VWWKID KA+GKTMEWE+RGWIW
Subjt:  VASTKPSMLALIKPKAWFRDRYSSAYRPFNKQGGVIFARDKYGESVWWKIDAKAKGKTMEWELRGWIW

TrEMBL top hitse value%identityAlignment
A0A6J1CZH7 uncharacterized protein LOC1110159101.8e-23289.4Show/hide
Query:  WIWNLPPLSQWKTTSISISICSSSSTNSSLNVVATVNLHFPTITFSVIADLSFPISLWTSKPLKISTKSTSLLDEESMSSLLLNFVHDVLHYGSNQQNNS
        WI NLPPLSQWK TS S SICSSSSTNSSLN VAT NLH PT+TFSVIAD+SFPISLWTSKPLKISTKSTSL+DEES+S LLLNFVHDVLHYGSNQQ N 
Subjt:  WIWNLPPLSQWKTTSISISICSSSSTNSSLNVVATVNLHFPTITFSVIADLSFPISLWTSKPLKISTKSTSLLDEESMSSLLLNFVHDVLHYGSNQQNNS

Query:  SLNFLKLDITFNLNEIFNLAFLTLIFLICIYEAPTNLRSNCLMTLKHHLANCQSRQTSKVLMKLLGSNLEEQWMRSLNLAVTNWILELKANSCALKTPSP
        SLNFL+L+ITFN  EIFNLAFLTLIFLICIYEAPT LRS+CL TLKHHLANCQSRQTSK+LMKLLGSNLEEQWMRS+NLA+TNWILELKANSC LKTPSP
Subjt:  SLNFLKLDITFNLNEIFNLAFLTLIFLICIYEAPTNLRSNCLMTLKHHLANCQSRQTSKVLMKLLGSNLEEQWMRSLNLAVTNWILELKANSCALKTPSP

Query:  LFSYSFSTHGLWKVQLYCPITAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAEVREKWVDLRVHVDNIRCDIIRLVNETLLSKRGVGGSEKHFPS
        LFSYSFST GLWKVQLYCP+ AMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAEVREKWVDLRVHVDNIRCDII+LVN+TLLSKRGVG SEKHFPS
Subjt:  LFSYSFSTHGLWKVQLYCPITAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAEVREKWVDLRVHVDNIRCDIIRLVNETLLSKRGVGGSEKHFPS

Query:  RISLQITPALQTNVMSVSVSKSSYNPAIDIETEKTFEAGFQPATPYPGLKLAVGGTVTASLKPWKFEQFVYGNTAILNWYLHDSSDGKEVASTKPSKLAL
        RISLQ+TP LQTN+MSVSVSKSS NP ID+ TEKTFEAGF+PA  YPGLKLAVG +VT SLKPWKFEQFVYGNTAILNWYLHDSSDGKEVASTKPSKLAL
Subjt:  RISLQITPALQTNVMSVSVSKSSYNPAIDIETEKTFEAGFQPATPYPGLKLAVGGTVTASLKPWKFEQFVYGNTAILNWYLHDSSDGKEVASTKPSKLAL

Query:  INPRAWFRDRYTSAFRPFNKQGGVIFAGDEYGESVWWKIEGEARGKTMEWEIR
        INPRAWFRDRY+SA RPFNKQGGVIFAGDEYGESVWWKIE +ARGKTMEWEIR
Subjt:  INPRAWFRDRYTSAFRPFNKQGGVIFAGDEYGESVWWKIEGEARGKTMEWEIR

A0A6J1D0J9 uncharacterized protein LOC1110159091.3e-23886.91Show/hide
Query:  MASCSFPDVYSWIQNLPPLSQWKTTSFSKSICSSSSTNSTLNVVAAKNLHSPTITLSVIADFSLPISLWTSKPLKTDTKSSNLLDEESMSNLLLNCVHDV
        MASCSF DVYSWI+NLPPLSQWKTTS S +IC+SSSTNS+L+V+AAK+LHSPTIT S++A FS PISLWTSKPL   TKSSNLLDEES+S+LLLNCVHDV
Subjt:  MASCSFPDVYSWIQNLPPLSQWKTTSFSKSICSSSSTNSTLNVVAAKNLHSPTITLSVIADFSLPISLWTSKPLKTDTKSSNLLDEESMSNLLLNCVHDV

Query:  LHYGSNQQKNPCLNFLKLDATFNLKEIFNLAFLTLIFLICIYEAPTDLRSDCLTTLKHHLANCRSRQTSTVLMKLLGSNLEEQWMRSLNLAITNWILELK
        L+YGSNQQK+  LNFLK +  FN KE FNLAFLTL+FLICIYEAPTDLRSDCLTTLKHHLANCRSRQ S VLMKLLGSNLEEQWMRSLNL+ITNWI ELK
Subjt:  LHYGSNQQKNPCLNFLKLDATFNLKEIFNLAFLTLIFLICIYEAPTDLRSDCLTTLKHHLANCRSRQTSTVLMKLLGSNLEEQWMRSLNLAITNWILELK

Query:  ANSGTLKTPSPLFSYSFSTDGLWKVQLYCPIIAMDYIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAVVREKWIDMRVHVDNIRCDIIRLVNETLLSER
         NS T+KTPSPLFSYSFSTDGLWKVQLYCPIIAMD +ENSSNPS DERLQFSLNYHQLEGVLQFN+KAVVREKW D+RVHVDNIRCDIIRLVNETLLSER
Subjt:  ANSGTLKTPSPLFSYSFSTDGLWKVQLYCPIIAMDYIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAVVREKWIDMRVHVDNIRCDIIRLVNETLLSER

Query:  GVGGSEKHFPSRISLQLTPILQTNIMSVSVSKSSDNPRIEVGNEKSFEAGFEPPNPGLKLAIGETVTMSLKPWKFEQFVHGNAATLNWYLHDSSDGKEVA
        G GGSEKHFPSRISL++TP +QTNIMSVSVSKSSDNPRI+VGNEK+FEAGFEPPNP LKLAIGETV+MSLKPWKFEQFVHGNAATLNWYLHDSSDGKEVA
Subjt:  GVGGSEKHFPSRISLQLTPILQTNIMSVSVSKSSDNPRIEVGNEKSFEAGFEPPNPGLKLAIGETVTMSLKPWKFEQFVHGNAATLNWYLHDSSDGKEVA

Query:  STKPSMLALIKPKAWFRDRYSSAYRPFNKQGGVIFARDKYGESVWWKIDAKAKGKTMEWELRGWIW
        ST+PS LALI PKAWFRDRYSSAYRPFNKQGGVIFA D+YGESVWWKIDAKA+GK MEWE+RGWIW
Subjt:  STKPSMLALIKPKAWFRDRYSSAYRPFNKQGGVIFARDKYGESVWWKIDAKAKGKTMEWELRGWIW

A0A6J1FUY0 uncharacterized protein LOC1114470811.8e-22984.19Show/hide
Query:  MASCSFPDVYSWIQNLPPLSQWKTTSFSKSICSSSSTNSTLNVVAAKNLHSPTITLSVIADFSLPISLWTSKPLKTDTKSSNLLDEESMSNLLLNCVHDV
        MASCSFPDVYSWIQ LPPLSQWKT+S S SIC+S+S +S+L +VAAKNLHSPTITLS+IADFS PISLWTSKPLKT T SSNL DEE+MS LLLNCVHDV
Subjt:  MASCSFPDVYSWIQNLPPLSQWKTTSFSKSICSSSSTNSTLNVVAAKNLHSPTITLSVIADFSLPISLWTSKPLKTDTKSSNLLDEESMSNLLLNCVHDV

Query:  LHYGSNQQKNPCLNFLKLDATFNLKEIFNLAFLTLIFLICIYEAPTDLRSDCLTTLKHHLANCRSRQTSTVLMKLLGSNLEEQWMRSLNLAITNWILELK
        L+YGSNQ+KN     LKLD T + +EIFNLAFLTLIFLICIYEAPTDLRS+CL TLKHHLAN  SRQ S VLMKLLGSNLEEQWMRS+NLAITNW+LELK
Subjt:  LHYGSNQQKNPCLNFLKLDATFNLKEIFNLAFLTLIFLICIYEAPTDLRSDCLTTLKHHLANCRSRQTSTVLMKLLGSNLEEQWMRSLNLAITNWILELK

Query:  ANSGTLKTPSPLFSYSFSTDGLWKVQLYCPIIAMDYIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAVVREKWIDMRVHVDNIRCDIIRLVNETLLSER
        AN  TLKTPSPL+SYSFST GLWKVQLYCPIIAMD IENSSNPSTDERLQFSLNYHQLEGVLQFNY+ VVR+KWIDMRVHVDNIRCDI+RLVNETLLSER
Subjt:  ANSGTLKTPSPLFSYSFSTDGLWKVQLYCPIIAMDYIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAVVREKWIDMRVHVDNIRCDIIRLVNETLLSER

Query:  GVGGSEKHFPSRISLQLTPILQTNIMSVSVSKSSDNPRIEVGNEKSFEAGFEP--PNPGLKLAIGETVTMSLKPWKFEQFVHGNAATLNWYLHDSSDGKE
        GVGGSEKHFPSRISLQLTP   TNIMSVSVSKSS+NP+IE+G E++FEAGFEP  P PGLKL++GET  +SLKPWKFEQFVHGNAATLNWYLHDSSDGKE
Subjt:  GVGGSEKHFPSRISLQLTPILQTNIMSVSVSKSSDNPRIEVGNEKSFEAGFEP--PNPGLKLAIGETVTMSLKPWKFEQFVHGNAATLNWYLHDSSDGKE

Query:  VASTKPSMLALIKPKAWFRDRYSSAYRPFNKQGGVIFARDKYGESVWWKIDAKAKGKTMEWELRGWIW
        VASTKPS L LI PKAWFRDRYSSA+RPFNKQGGVIFA D+YGE+VWWKID KA+GKTMEWE+RGWIW
Subjt:  VASTKPSMLALIKPKAWFRDRYSSAYRPFNKQGGVIFARDKYGESVWWKIDAKAKGKTMEWELRGWIW

A0A6J1GXT2 uncharacterized protein LOC1114581753.5e-22884.4Show/hide
Query:  MASCSFPDVYSWIQNLPPLSQWKTTSFSKSICSSSSTNSTLNVVAAKNLHSPTITLSVIADFSLPISLWTSKPLKTDTKSSNLLDEESMSNLLLNCVHDV
        MASCSFPDVYSWIQNLPPLSQWKTTS S SICSSSSTNS+L +VAAK LHSPTIT SV ADFS  ISLWTS+PLKT TK+SNLL++ESMS LLLNCV DV
Subjt:  MASCSFPDVYSWIQNLPPLSQWKTTSFSKSICSSSSTNSTLNVVAAKNLHSPTITLSVIADFSLPISLWTSKPLKTDTKSSNLLDEESMSNLLLNCVHDV

Query:  LHYGSNQQKNPCLNFLKLDATFNLKEIFNLAFLTLIFLICIYEAPTDLRSDCLTTLKHHLANCRSRQTSTVLMKLLGSNLEEQWMRSLNLAITNWILELK
        L+YGSN ++N   N LKLD T +LKEIFN  FLTLIFLIC YEAP DLRS+CL TLKHHLANC SRQTS VLMKLLGSNLE+QWMRS+NLAITNWILELK
Subjt:  LHYGSNQQKNPCLNFLKLDATFNLKEIFNLAFLTLIFLICIYEAPTDLRSDCLTTLKHHLANCRSRQTSTVLMKLLGSNLEEQWMRSLNLAITNWILELK

Query:  ANSGTLKTPSPLFSYSFSTDGLWKVQLYCPIIAMDYIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAVVREKWIDMRVHVDNIRCDIIRLVNETLLSER
        A   TLKTPSPLFSYSFST GLWKVQLYCPIIAMD IENSSNPSTDERLQFSLNYHQLEGVLQFNY+AV REKWID+RVHVDNIRCDIIRLV+ETLLSER
Subjt:  ANSGTLKTPSPLFSYSFSTDGLWKVQLYCPIIAMDYIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAVVREKWIDMRVHVDNIRCDIIRLVNETLLSER

Query:  GVGGSEKHFPSRISLQLTPILQTNIMSVSVSKSSDNPRIEVGNEKSFEAGFE--PPNPGLKLAIGETVTMSLKPWKFEQFVHGNAATLNWYLHDSSDGKE
        GVGGSEKHFPSRISLQLTP   TNIMSVSVSKSS+NP++E+G EK+FEAGFE   P PGLKLA+GETV +SLKPWKFEQFVHGNAATLNWYLHDSSDGKE
Subjt:  GVGGSEKHFPSRISLQLTPILQTNIMSVSVSKSSDNPRIEVGNEKSFEAGFE--PPNPGLKLAIGETVTMSLKPWKFEQFVHGNAATLNWYLHDSSDGKE

Query:  VASTKPSMLALIKPKAWFRDRYSSAYRPFNKQGGVIFARDKYGESVWWKIDAKAKGKTMEWELRGWIW
        VASTKPS L LI PKAWFRDRYSSA RPFNKQGGV+FA D+YGESVWWKID KA+GKTMEWE+RGWIW
Subjt:  VASTKPSMLALIKPKAWFRDRYSSAYRPFNKQGGVIFARDKYGESVWWKIDAKAKGKTMEWELRGWIW

A0A6J1JZ56 uncharacterized protein LOC1114910286.3e-23084.62Show/hide
Query:  MASCSFPDVYSWIQNLPPLSQWKTTSFSKSICSSSSTNSTLNVVAAKNLHSPTITLSVIADFSLPISLWTSKPLKTDTKSSNLLDEESMSNLLLNCVHDV
        MASCSFPDVYSWIQ LPPLSQWKT+S S SIC+S+S +S+L +VAAKNLHSPTITLS+IADFS PISLWTSKPLKT T SSNL DEE+MS LLLNCVHDV
Subjt:  MASCSFPDVYSWIQNLPPLSQWKTTSFSKSICSSSSTNSTLNVVAAKNLHSPTITLSVIADFSLPISLWTSKPLKTDTKSSNLLDEESMSNLLLNCVHDV

Query:  LHYGSNQQKNPCLNFLKLDATFNLKEIFNLAFLTLIFLICIYEAPTDLRSDCLTTLKHHLANCRSRQTSTVLMKLLGSNLEEQWMRSLNLAITNWILELK
        L+YGSNQ+KN     LKLD T + K+IFNLAFLTLIFLICIYEAPTDLRS+CL TLKHHLAN  SRQ S VLMKLLGSNLEEQWMRS+NLAITNW+LELK
Subjt:  LHYGSNQQKNPCLNFLKLDATFNLKEIFNLAFLTLIFLICIYEAPTDLRSDCLTTLKHHLANCRSRQTSTVLMKLLGSNLEEQWMRSLNLAITNWILELK

Query:  ANSGTLKTPSPLFSYSFSTDGLWKVQLYCPIIAMDYIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAVVREKWIDMRVHVDNIRCDIIRLVNETLLSER
        AN  TLKTPSPL+SYSFST GLWKVQLYCPIIAMD IENSSNPSTDERLQFSLNYHQLEGVLQFNY+ VVR+KWIDMRVHVDNIRCDIIRLVNETLLSER
Subjt:  ANSGTLKTPSPLFSYSFSTDGLWKVQLYCPIIAMDYIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAVVREKWIDMRVHVDNIRCDIIRLVNETLLSER

Query:  GVGGSEKHFPSRISLQLTPILQTNIMSVSVSKSSDNPRIEVGNEKSFEAGFEP--PNPGLKLAIGETVTMSLKPWKFEQFVHGNAATLNWYLHDSSDGKE
        GVGGSEKHFPSRISLQLTP   TNIMSVSVSKSS+NP+IEVG E++FEAGFEP  P PGLKL++GET  +SLKPWKFEQFVHGNAATLNWYLHDSSDGKE
Subjt:  GVGGSEKHFPSRISLQLTPILQTNIMSVSVSKSSDNPRIEVGNEKSFEAGFEP--PNPGLKLAIGETVTMSLKPWKFEQFVHGNAATLNWYLHDSSDGKE

Query:  VASTKPSMLALIKPKAWFRDRYSSAYRPFNKQGGVIFARDKYGESVWWKIDAKAKGKTMEWELRGWIW
        VASTKPS LALI PKAWFRDRYSSA+RPFNKQGGVIFA D+YGE+VWWKID KA+GKTMEWE++GWIW
Subjt:  VASTKPSMLALIKPKAWFRDRYSSAYRPFNKQGGVIFARDKYGESVWWKIDAKAKGKTMEWELRGWIW

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G15020.1 unknown protein1.8e-3525.76Show/hide
Query:  TITFSVIAD---LSFPISLWTSKPLKISTKSTSLLDEESMSSLLLNFVHDVLHYGSNQQNNSSLNFLKLDI-------------TFNLNEIFNLAFLTLI
        ++TF+V+A+   L    ++W S    +S+       E+    L+L  + +++       + +   F +L+I               + + +FNL  LT +
Subjt:  TITFSVIAD---LSFPISLWTSKPLKISTKSTSLLDEESMSSLLLNFVHDVLHYGSNQQNNSSLNFLKLDI-------------TFNLNEIFNLAFLTLI

Query:  FLICIYEAPTNLRSNCLMTLKHHLAN---CQSRQTSKVLMKLLGSNLEEQWMRSLNLAVTNWI---------LELKANSCALKTPSPL-FSYSFSTHGLW
        F +C+++AP+ + S     L     N   CQ     +  +  LG + E   +R+ + A++ W+         L LK  S +L     L FSY+   HGLW
Subjt:  FLICIYEAPTNLRSNCLMTLKHHLAN---CQSRQTSKVLMKLLGSNLEEQWMRSLNLAVTNWI---------LELKANSCALKTPSPL-FSYSFSTHGLW

Query:  KVQLYCPITAMDNIENSSN---------PSTDER---LQFSLNYHQLEGVLQFNYKAEVREKWVDLRVHVDNIRCDIIRLVNETLLSKRGVG--------
         ++ Y PI +M+   NSSN         P  + +   L+++L++ Q E ++QF Y  +  E ++ +   VDNIR  + +L       K GVG        
Subjt:  KVQLYCPITAMDNIENSSN---------PSTDER---LQFSLNYHQLEGVLQFNYKAEVREKWVDLRVHVDNIRCDIIRLVNETLLSKRGVG--------

Query:  -GSEKHFPSRISLQITPAL-QTNVMSVSVSKSSYNPAIDIETEKTFEAGFQPATPYPGLKLAVGGTVTASLKPWKFEQFVYGNTAILNWYLHDSSDGKEV
           E++FPSR+ + + P L  ++V  +S+ +S+ N   DIE  +  +  F      P +K          +K W+ EQ   GN A+ +  L+D   G+EV
Subjt:  -GSEKHFPSRISLQITPAL-QTNVMSVSVSKSSYNPAIDIETEKTFEAGFQPATPYPGLKLAVGGTVTASLKPWKFEQFVYGNTAILNWYLHDSSDGKEV

Query:  ASTKPSKLALINPRAWFRDRYTSAFRPFNKQGGVIFAGDEYGESVWWKIEGEARGKTMEWEI
         + KP            +         F K GG++F  DEYG+ V W++  E  G  ++W +
Subjt:  ASTKPSKLALINPRAWFRDRYTSAFRPFNKQGGVIFAGDEYGESVWWKIEGEARGKTMEWEI

AT2G40390.1 unknown protein1.4e-15257.32Show/hide
Query:  MASCSFPDVYSWIQNLPPLSQWKTTSFSKSICSSSSTNSTLNVVAAKNLHSPT-ITLSVIADFSLPISLWTSKPLKT-DTKSSNLLDEESMSNLLLNCVH
        MASC  PD ++W+Q LPPLS WK    S  ICS +S++ +LN    +   SP   T S++A+F  PI+L+ SK  +T  T S+  L+E  +S LL+  V 
Subjt:  MASCSFPDVYSWIQNLPPLSQWKTTSFSKSICSSSSTNSTLNVVAAKNLHSPT-ITLSVIADFSLPISLWTSKPLKT-DTKSSNLLDEESMSNLLLNCVH

Query:  DVLHYGSNQQKNPC-LNFLKLDATFNLKEIFNLAFLTLIFLICIYEAPTDLRSDCLTTLKHHLANCRSRQTSTVLMKLLGSNLEEQWMRSLNLAITNWIL
         VL+Y  N ++  C +    L +T NLK++FNLAF T +FLICIYEAPT LR+ CL T+K  L  CRSRQ S +LM  LGSNLEEQWMRSLNLAITNWI+
Subjt:  DVLHYGSNQQKNPC-LNFLKLDATFNLKEIFNLAFLTLIFLICIYEAPTDLRSDCLTTLKHHLANCRSRQTSTVLMKLLGSNLEEQWMRSLNLAITNWIL

Query:  ELKANSGTLKTPSPLFSYSFSTDGLWKVQLYCPIIAMDYIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAVVREKWIDMRVHVDNIRCDIIRLVNETLL
        E+KA    LK+PSPLFSY+FST GLWKV +YCP++AM+ +E+ ++   DERL FSLNYHQLEGV+Q N++  VREKW ++ V++DN+RCDIIRLVNE LL
Subjt:  ELKANSGTLKTPSPLFSYSFSTDGLWKVQLYCPIIAMDYIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAVVREKWIDMRVHVDNIRCDIIRLVNETLL

Query:  SERGVGGSEKHFPSRISLQLTPILQTNIMSVSVSKSSDNPRIEVGNEKSFEAGFEPPNP--GLKLAIGETVTMSLKPWKFEQFVHGNAATLNWYLHDSSD
        SERG+G  EKHFPSRISLQLTP  Q+NI+ VSV KSS+NP  E   EK  EA  +PPN   GLK++  ET T S+KPWKFE++VHG +A L W+LHD  D
Subjt:  SERGVGGSEKHFPSRISLQLTPILQTNIMSVSVSKSSDNPRIEVGNEKSFEAGFEPPNP--GLKLAIGETVTMSLKPWKFEQFVHGNAATLNWYLHDSSD

Query:  GKEVASTKPSMLALIKPKAWFRDRYSSAYRPFNKQGGVIFARDKYGESVWWKIDAKAKGKTMEWELRGWIW
        G+EV+S+KPS ++++ P+AWF++RYSSA+RPF KQGGV+FA D YG+SV WK+D  A GK ME+E++G +W
Subjt:  GKEVASTKPSMLALIKPKAWFRDRYSSAYRPFNKQGGVIFARDKYGESVWWKIDAKAKGKTMEWELRGWIW

AT5G64190.1 unknown protein2.8e-13754.04Show/hide
Query:  FPDVYSWIQNLPPLSQWKTTSFSKSICSSSS--TNSTLNVVAAKNLHSPTITLSVIADFS--LPISLWTSK-PLKTDTKSSNLLDEESMSNLLLNCVHDV
        FPDV++WIQN+P +++W+TTS    IC S+S   NSTLN+ A K+     +T S+I   +   P+ LWT+K  L  +  S N  DE ++ +LL N V  +
Subjt:  FPDVYSWIQNLPPLSQWKTTSFSKSICSSSS--TNSTLNVVAAKNLHSPTITLSVIADFS--LPISLWTSK-PLKTDTKSSNLLDEESMSNLLLNCVHDV

Query:  LHYGSNQQKNPCLNFLKLDAT--FNLKEIFNLAFLTLIFLICIYEAPTDLRSDCLTTLKHHLANCRSRQTSTVLMKLLGSNLEEQWMRSLNLAITNWILE
        L Y SN      +     D++    LK+I N   LTL F++C+YEAP  LR +CL TLK+HL  C +R+ +  LMKLLGSNLEEQWMR++NLA TNWI+E
Subjt:  LHYGSNQQKNPCLNFLKLDAT--FNLKEIFNLAFLTLIFLICIYEAPTDLRSDCLTTLKHHLANCRSRQTSTVLMKLLGSNLEEQWMRSLNLAITNWILE

Query:  LKANSGTLKTPSPLFSYSFSTDGLWKVQLYCPIIAMDYIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAVVREKWIDMRVHVDNIRCDIIRLVNETLLS
         + +  T  T +PLFSY+ S  GLWKVQLYCP+ AM+ +E SSNP+ D RL FSL ++QLEGV+QFN+K VVR+ WID+ V +DNIR D+I+LVNE L+S
Subjt:  LKANSGTLKTPSPLFSYSFSTDGLWKVQLYCPIIAMDYIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAVVREKWIDMRVHVDNIRCDIIRLVNETLLS

Query:  ERGVGGSEKHFPSRISLQLTPILQTNIMSVSVSKSSDNPRIEVGNEKSFEAGFEPPNP-GLKLAIGETVTMSLKPWKFEQFVHGNAATLNWYLHDSS-DG
         RG G  EKHFPSRISLQLTP LQT+ +SVSVSKSS+NP  E   E+S E  F+PPN  GL++A  E  TM++ PWK EQ V G  A LNW L+DSS  G
Subjt:  ERGVGGSEKHFPSRISLQLTPILQTNIMSVSVSKSSDNPRIEVGNEKSFEAGFEPPNP-GLKLAIGETVTMSLKPWKFEQFVHGNAATLNWYLHDSS-DG

Query:  KEVASTKPSMLALIKPKAWFRDRYSSAYRPFNKQGGVIFARDKYGESVWWKIDAKAKGKTMEWELRGWIW
        +EV STKPS  +++ P++WF+DRY+ AYR F ++GGVIFA D+YGESV WKI   A G TMEWE++G+IW
Subjt:  KEVASTKPSMLALIKPKAWFRDRYSSAYRPFNKQGGVIFARDKYGESVWWKIDAKAKGKTMEWELRGWIW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTTGTAGCTTTCCTGATGTTTATTCCTGGATACAGAACCTACCACCACTTTCTCAATGGAAGACAACTTCCTTTTCTAAATCCATATGCTCTTCAAGCTCAAC
CAACTCCACTCTGAATGTTGTTGCTGCCAAAAACCTTCATTCCCCAACCATTACTTTATCAGTTATTGCAGATTTCAGCCTTCCTATCTCCCTTTGGACATCAAAACCCT
TGAAGACCGACACCAAATCCTCAAATTTATTAGATGAAGAAAGCATGTCCAATCTCTTGCTTAACTGTGTTCATGATGTTCTTCATTATGGCTCAAACCAACAAAAGAAT
CCTTGCCTTAATTTTCTCAAACTAGATGCTACTTTCAACTTGAAAGAAATCTTCAATCTTGCATTTCTTACCCTCATATTCCTAATTTGCATCTATGAAGCCCCGACTGA
TCTCCGTTCTGATTGTCTTACAACTCTCAAGCATCACCTGGCAAATTGTCGGTCAAGGCAGACATCAACGGTGCTTATGAAACTGTTGGGGTCTAATCTAGAAGAGCAAT
GGATGAGGTCCCTAAACCTTGCAATCACCAACTGGATATTGGAGCTCAAAGCCAACAGCGGTACTTTAAAAACGCCCTCACCTTTGTTCTCTTATTCATTTTCAACAGAT
GGGTTGTGGAAAGTTCAACTCTATTGCCCTATCATTGCAATGGATTATATTGAGAACTCAAGCAATCCTTCAACTGATGAAAGATTGCAATTCTCTTTAAATTATCACCA
GCTTGAAGGGGTTCTGCAGTTCAATTACAAGGCTGTGGTTCGAGAAAAGTGGATTGATATGAGGGTACATGTTGATAACATAAGGTGTGACATCATTCGACTTGTGAACG
AGACGCTCTTGTCTGAACGAGGAGTTGGAGGATCGGAAAAGCATTTTCCTTCACGAATCTCACTGCAACTTACTCCAATCTTGCAGACAAACATAATGAGCGTCTCAGTA
AGCAAATCCTCAGACAACCCTAGAATAGAAGTTGGAAATGAAAAATCATTTGAAGCTGGATTTGAACCCCCAAACCCTGGCCTCAAATTAGCAATTGGAGAGACTGTAAC
CATGAGCTTGAAGCCATGGAAATTTGAGCAATTCGTCCATGGCAATGCTGCAACCCTTAATTGGTACCTCCACGACAGTTCGGATGGGAAAGAGGTGGCCTCCACCAAGC
CATCAATGCTTGCACTTATAAAACCTAAAGCTTGGTTTCGAGACCGTTACTCAAGTGCTTACAGGCCTTTTAACAAACAGGGAGGGGTGATATTCGCAAGAGACAAGTAT
GGAGAGAGTGTGTGGTGGAAGATTGATGCAAAGGCGAAAGGGAAAACCATGGAGTGGGAGTTGAGAGGTTGGATTTGGAACCTACCACCACTTTCTCAATGGAAAACAAC
TTCCATTTCTATATCCATATGCTCTTCAAGCTCAACCAACTCCTCTTTGAATGTTGTTGCTACCGTAAACCTTCATTTCCCAACCATTACTTTCTCAGTTATTGCAGATT
TGAGCTTTCCTATCTCCCTTTGGACATCAAAACCCTTGAAGATCAGCACCAAATCCACAAGTTTATTAGATGAAGAAAGCATGTCCAGTCTCTTGCTTAACTTTGTTCAT
GATGTTCTTCATTATGGCTCAAACCAACAAAATAATTCTAGCCTTAATTTCCTCAAACTCGACATCACTTTCAACTTGAACGAAATCTTCAATCTTGCATTTCTTACCCT
CATATTCCTAATCTGCATCTATGAAGCTCCAACCAACCTCCGTTCGAATTGTCTTATGACTCTCAAGCATCATTTGGCAAATTGTCAGTCAAGACAAACATCAAAAGTGC
TTATGAAACTGTTGGGGTCTAATCTAGAAGAGCAATGGATGAGGTCCCTAAACCTTGCAGTCACCAACTGGATATTAGAGCTCAAGGCCAACAGCTGCGCCCTCAAAACA
CCCTCACCATTGTTCTCTTATTCATTTTCAACACATGGGTTGTGGAAAGTTCAACTCTATTGCCCTATCACTGCAATGGATAATATTGAGAACTCAAGCAATCCTTCAAC
TGATGAAAGATTGCAATTCTCTTTAAATTATCACCAGCTTGAAGGGGTTCTGCAGTTCAATTACAAGGCTGAGGTTCGAGAAAAGTGGGTTGATCTGAGGGTTCACGTTG
ATAACATAAGGTGCGACATCATCCGGCTAGTGAACGAAACGCTCTTATCCAAACGAGGAGTTGGCGGATCAGAAAAGCATTTTCCATCAAGAATCTCACTGCAAATCACT
CCAGCTCTGCAGACAAACGTAATGAGCGTCTCAGTAAGCAAATCATCATACAACCCTGCAATCGACATCGAAACTGAAAAAACATTCGAAGCCGGATTCCAACCCGCAAC
ACCTTACCCAGGCCTCAAATTAGCAGTCGGAGGAACCGTAACGGCGAGCTTGAAGCCATGGAAATTCGAGCAGTTCGTCTATGGCAACACCGCGATCCTCAACTGGTACC
TCCACGACAGTTCAGACGGGAAAGAGGTGGCCTCCACGAAGCCTTCGAAACTTGCGCTCATAAACCCTAGAGCTTGGTTCCGGGACCGTTACACGAGCGCTTTCCGGCCA
TTCAACAAACAGGGAGGGGTGATATTCGCCGGAGACGAGTACGGAGAGAGTGTTTGGTGGAAGATTGAGGGAGAGGCGAGAGGGAAAACCATGGAATGGGAGATCAGAGA
CCATCATGAACAAGGCCGCTATTCGCCATCTTTTGGCCGGCGAATCTCCCTCTTAACCGCTGCTGCTGCACCACCAAGACCGTCGACGTGCTCCCGAAGTCCGGTGATGC
AAGCATGTCGCCGACCACCCCGAGCTCCGGACACTCGCTCCACTGGTGAAGCCCGCCAATCCTACCCCATCTCTCACCACCTCTTCCACAACCACAAAATGCTCATTCCC
CAAGTTGGCATGCCGATACTCATTGATCAGCTCGGTGTCATGCTTCCTATCCTTGCTGTTTTCCTCTCCAAACAGAAGGAACCGCACGACCGTCAAGTCGACAATGTGGT
GCCTTGCCATGCGAGCTCCAAACGCCAGCGATTCCGCATCATCCGGGCCGCCAATGAACAGCACTGCAATATGATACGGAGCTTGCGCGGTCAAGACAGGCACCTGCTTG
CTCAAAACTCCCCTATCTACCAGAATCCCAATGGAACAAGGCGCCATTTCAAGGATCTGTATGTTCATATTTTGGATGGCTCTGTTCACCTTCCCAATCGTGCCATCGAT
CGCCCACTGCTTATGGAAAGGAAGGATGACAATAGTGGCCCTCTTGTCAAACGCGACTCTACAAATGTCGTCGTGCATAAGATTAGCGTTGATGATATGGATTGCTTTGG
ATGTGCTTCTCTCAAGTGTGCAATCGGGTTGGTGTGCTATGAGAATCGGGTTGCTTCTGCCAACGAGCTCAACCAAGATGAGAGCTATGGCAGCCAAGGGGCTGTCTTTC
GAAGCATGGGACACTTCGAGGAGGTTAATGATGGTGGGGATGTTGTCGTTGTGGTGGATGCAGACCAAGACTCGAAACTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCTTGTAGCTTTCCTGATGTTTATTCCTGGATACAGAACCTACCACCACTTTCTCAATGGAAGACAACTTCCTTTTCTAAATCCATATGCTCTTCAAGCTCAAC
CAACTCCACTCTGAATGTTGTTGCTGCCAAAAACCTTCATTCCCCAACCATTACTTTATCAGTTATTGCAGATTTCAGCCTTCCTATCTCCCTTTGGACATCAAAACCCT
TGAAGACCGACACCAAATCCTCAAATTTATTAGATGAAGAAAGCATGTCCAATCTCTTGCTTAACTGTGTTCATGATGTTCTTCATTATGGCTCAAACCAACAAAAGAAT
CCTTGCCTTAATTTTCTCAAACTAGATGCTACTTTCAACTTGAAAGAAATCTTCAATCTTGCATTTCTTACCCTCATATTCCTAATTTGCATCTATGAAGCCCCGACTGA
TCTCCGTTCTGATTGTCTTACAACTCTCAAGCATCACCTGGCAAATTGTCGGTCAAGGCAGACATCAACGGTGCTTATGAAACTGTTGGGGTCTAATCTAGAAGAGCAAT
GGATGAGGTCCCTAAACCTTGCAATCACCAACTGGATATTGGAGCTCAAAGCCAACAGCGGTACTTTAAAAACGCCCTCACCTTTGTTCTCTTATTCATTTTCAACAGAT
GGGTTGTGGAAAGTTCAACTCTATTGCCCTATCATTGCAATGGATTATATTGAGAACTCAAGCAATCCTTCAACTGATGAAAGATTGCAATTCTCTTTAAATTATCACCA
GCTTGAAGGGGTTCTGCAGTTCAATTACAAGGCTGTGGTTCGAGAAAAGTGGATTGATATGAGGGTACATGTTGATAACATAAGGTGTGACATCATTCGACTTGTGAACG
AGACGCTCTTGTCTGAACGAGGAGTTGGAGGATCGGAAAAGCATTTTCCTTCACGAATCTCACTGCAACTTACTCCAATCTTGCAGACAAACATAATGAGCGTCTCAGTA
AGCAAATCCTCAGACAACCCTAGAATAGAAGTTGGAAATGAAAAATCATTTGAAGCTGGATTTGAACCCCCAAACCCTGGCCTCAAATTAGCAATTGGAGAGACTGTAAC
CATGAGCTTGAAGCCATGGAAATTTGAGCAATTCGTCCATGGCAATGCTGCAACCCTTAATTGGTACCTCCACGACAGTTCGGATGGGAAAGAGGTGGCCTCCACCAAGC
CATCAATGCTTGCACTTATAAAACCTAAAGCTTGGTTTCGAGACCGTTACTCAAGTGCTTACAGGCCTTTTAACAAACAGGGAGGGGTGATATTCGCAAGAGACAAGTAT
GGAGAGAGTGTGTGGTGGAAGATTGATGCAAAGGCGAAAGGGAAAACCATGGAGTGGGAGTTGAGAGGTTGGATTTGGAACCTACCACCACTTTCTCAATGGAAAACAAC
TTCCATTTCTATATCCATATGCTCTTCAAGCTCAACCAACTCCTCTTTGAATGTTGTTGCTACCGTAAACCTTCATTTCCCAACCATTACTTTCTCAGTTATTGCAGATT
TGAGCTTTCCTATCTCCCTTTGGACATCAAAACCCTTGAAGATCAGCACCAAATCCACAAGTTTATTAGATGAAGAAAGCATGTCCAGTCTCTTGCTTAACTTTGTTCAT
GATGTTCTTCATTATGGCTCAAACCAACAAAATAATTCTAGCCTTAATTTCCTCAAACTCGACATCACTTTCAACTTGAACGAAATCTTCAATCTTGCATTTCTTACCCT
CATATTCCTAATCTGCATCTATGAAGCTCCAACCAACCTCCGTTCGAATTGTCTTATGACTCTCAAGCATCATTTGGCAAATTGTCAGTCAAGACAAACATCAAAAGTGC
TTATGAAACTGTTGGGGTCTAATCTAGAAGAGCAATGGATGAGGTCCCTAAACCTTGCAGTCACCAACTGGATATTAGAGCTCAAGGCCAACAGCTGCGCCCTCAAAACA
CCCTCACCATTGTTCTCTTATTCATTTTCAACACATGGGTTGTGGAAAGTTCAACTCTATTGCCCTATCACTGCAATGGATAATATTGAGAACTCAAGCAATCCTTCAAC
TGATGAAAGATTGCAATTCTCTTTAAATTATCACCAGCTTGAAGGGGTTCTGCAGTTCAATTACAAGGCTGAGGTTCGAGAAAAGTGGGTTGATCTGAGGGTTCACGTTG
ATAACATAAGGTGCGACATCATCCGGCTAGTGAACGAAACGCTCTTATCCAAACGAGGAGTTGGCGGATCAGAAAAGCATTTTCCATCAAGAATCTCACTGCAAATCACT
CCAGCTCTGCAGACAAACGTAATGAGCGTCTCAGTAAGCAAATCATCATACAACCCTGCAATCGACATCGAAACTGAAAAAACATTCGAAGCCGGATTCCAACCCGCAAC
ACCTTACCCAGGCCTCAAATTAGCAGTCGGAGGAACCGTAACGGCGAGCTTGAAGCCATGGAAATTCGAGCAGTTCGTCTATGGCAACACCGCGATCCTCAACTGGTACC
TCCACGACAGTTCAGACGGGAAAGAGGTGGCCTCCACGAAGCCTTCGAAACTTGCGCTCATAAACCCTAGAGCTTGGTTCCGGGACCGTTACACGAGCGCTTTCCGGCCA
TTCAACAAACAGGGAGGGGTGATATTCGCCGGAGACGAGTACGGAGAGAGTGTTTGGTGGAAGATTGAGGGAGAGGCGAGAGGGAAAACCATGGAATGGGAGATCAGAGA
CCATCATGAACAAGGCCGCTATTCGCCATCTTTTGGCCGGCGAATCTCCCTCTTAACCGCTGCTGCTGCACCACCAAGACCGTCGACGTGCTCCCGAAGTCCGGTGATGC
AAGCATGTCGCCGACCACCCCGAGCTCCGGACACTCGCTCCACTGGTGAAGCCCGCCAATCCTACCCCATCTCTCACCACCTCTTCCACAACCACAAAATGCTCATTCCC
CAAGTTGGCATGCCGATACTCATTGATCAGCTCGGTGTCATGCTTCCTATCCTTGCTGTTTTCCTCTCCAAACAGAAGGAACCGCACGACCGTCAAGTCGACAATGTGGT
GCCTTGCCATGCGAGCTCCAAACGCCAGCGATTCCGCATCATCCGGGCCGCCAATGAACAGCACTGCAATATGATACGGAGCTTGCGCGGTCAAGACAGGCACCTGCTTG
CTCAAAACTCCCCTATCTACCAGAATCCCAATGGAACAAGGCGCCATTTCAAGGATCTGTATGTTCATATTTTGGATGGCTCTGTTCACCTTCCCAATCGTGCCATCGAT
CGCCCACTGCTTATGGAAAGGAAGGATGACAATAGTGGCCCTCTTGTCAAACGCGACTCTACAAATGTCGTCGTGCATAAGATTAGCGTTGATGATATGGATTGCTTTGG
ATGTGCTTCTCTCAAGTGTGCAATCGGGTTGGTGTGCTATGAGAATCGGGTTGCTTCTGCCAACGAGCTCAACCAAGATGAGAGCTATGGCAGCCAAGGGGCTGTCTTTC
GAAGCATGGGACACTTCGAGGAGGTTAATGATGGTGGGGATGTTGTCGTTGTGGTGGATGCAGACCAAGACTCGAAACTCTGA
Protein sequenceShow/hide protein sequence
MASCSFPDVYSWIQNLPPLSQWKTTSFSKSICSSSSTNSTLNVVAAKNLHSPTITLSVIADFSLPISLWTSKPLKTDTKSSNLLDEESMSNLLLNCVHDVLHYGSNQQKN
PCLNFLKLDATFNLKEIFNLAFLTLIFLICIYEAPTDLRSDCLTTLKHHLANCRSRQTSTVLMKLLGSNLEEQWMRSLNLAITNWILELKANSGTLKTPSPLFSYSFSTD
GLWKVQLYCPIIAMDYIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAVVREKWIDMRVHVDNIRCDIIRLVNETLLSERGVGGSEKHFPSRISLQLTPILQTNIMSVSV
SKSSDNPRIEVGNEKSFEAGFEPPNPGLKLAIGETVTMSLKPWKFEQFVHGNAATLNWYLHDSSDGKEVASTKPSMLALIKPKAWFRDRYSSAYRPFNKQGGVIFARDKY
GESVWWKIDAKAKGKTMEWELRGWIWNLPPLSQWKTTSISISICSSSSTNSSLNVVATVNLHFPTITFSVIADLSFPISLWTSKPLKISTKSTSLLDEESMSSLLLNFVH
DVLHYGSNQQNNSSLNFLKLDITFNLNEIFNLAFLTLIFLICIYEAPTNLRSNCLMTLKHHLANCQSRQTSKVLMKLLGSNLEEQWMRSLNLAVTNWILELKANSCALKT
PSPLFSYSFSTHGLWKVQLYCPITAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAEVREKWVDLRVHVDNIRCDIIRLVNETLLSKRGVGGSEKHFPSRISLQIT
PALQTNVMSVSVSKSSYNPAIDIETEKTFEAGFQPATPYPGLKLAVGGTVTASLKPWKFEQFVYGNTAILNWYLHDSSDGKEVASTKPSKLALINPRAWFRDRYTSAFRP
FNKQGGVIFAGDEYGESVWWKIEGEARGKTMEWEIRDHHEQGRYSPSFGRRISLLTAAAAPPRPSTCSRSPVMQACRRPPRAPDTRSTGEARQSYPISHHLFHNHKMLIP
QVGMPILIDQLGVMLPILAVFLSKQKEPHDRQVDNVVPCHASSKRQRFRIIRAANEQHCNMIRSLRGQDRHLLAQNSPIYQNPNGTRRHFKDLYVHILDGSVHLPNRAID
RPLLMERKDDNSGPLVKRDSTNVVVHKISVDDMDCFGCASLKCAIGLVCYENRVASANELNQDESYGSQGAVFRSMGHFEEVNDGGDVVVVVDADQDSKL