; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg029837 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg029837
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
Descriptionspindle pole body component 110-like isoform X1
Genome locationscaffold6:8242535..8246545
RNA-Seq ExpressionSpg029837
SyntenySpg029837
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7031171.1 hypothetical protein SDJN02_05211, partial [Cucurbita argyrosperma subsp. argyrosperma]5.6e-20870.41Show/hide
Query:  ISSNFEMAIESHSVPAAFNSIDDDDLESRSISELVSILRTAFRSKEFDKVEEVLVAKEVKMRKDIENKNKEYELLQSKYEFLRLDSMTQESTLEEDKVDP
        ISS F+MA +SHSV +   SIDDD+LE RSISELVSILRTAFR+++FDKVE VLVAKEVKMRK+IENKNKEYELLQSKYEFLRLD +T ES LE+DKVDP
Subjt:  ISSNFEMAIESHSVPAAFNSIDDDDLESRSISELVSILRTAFRSKEFDKVEEVLVAKEVKMRKDIENKNKEYELLQSKYEFLRLDSMTQESTLEEDKVDP

Query:  KGFEKWKETYEELKERESEILKLKELIVKVDEDREKKKSALERFEKLLEVVKKTQEDDRLTMEKLNHKNSELECEIEVIKKTKEECEKTVEELRSKNSKL
        KGFEKWKE YEELKE+ESEI +LK+LI KV+EDREKKKSALE FEKLLE VKKTQED R+T+EKL HKNSELEC +EV+KK KE+  KT+EELR KN KL
Subjt:  KGFEKWKETYEELKERESEILKLKELIVKVDEDREKKKSALERFEKLLEVVKKTQEDDRLTMEKLNHKNSELECEIEVIKKTKEECEKTVEELRSKNSKL

Query:  ECAVEELRSKNSKLECAIEVVKKTE-------EELKCKNSNLECAKREVEHNYELCRRKYDELARRVSQLENNRTMVKNGEPIAPNRNDSMSGGSRKLAG
        EC++EEL  + S LE  +EVVKKTE       EELKCKNS LE AKRE EHNYELCRRKYDEL RRVSQLEN    +  G  +         GGSRKL G
Subjt:  ECAVEELRSKNSKLECAIEVVKKTE-------EELKCKNSNLECAKREVEHNYELCRRKYDELARRVSQLENNRTMVKNGEPIAPNRNDSMSGGSRKLAG

Query:  KRGAENDKITGTGGCIVEILSDDDHAPAENLSRARRNQKRKWDLLLNDWEDYDAEK-----MILPCETKGKESLKKVGAMFTTPPHSRPDNHVLKRSFSP
         + AEN   TGT G +VEI+SDDDHAP +N SRARR +KR WDLLLND EDYD E+       LP ET GKE+LKKVGAM+ TPPH RPDNHVLKR  S 
Subjt:  KRGAENDKITGTGGCIVEILSDDDHAPAENLSRARRNQKRKWDLLLNDWEDYDAEK-----MILPCETKGKESLKKVGAMFTTPPHSRPDNHVLKRSFSP

Query:  GTNDSKTVTSSSRAAAVLLRED----------------YYN-DCAILSEFDGNIQNRYDSSHLKSKDQGKRCNKKWESEAEMRAAFVENDMLCMEAVCIL
         TNDSK V SSSRA AV+LRE+                Y+N DC+  S+ D  IQNRYDS+HLKSK QGKR NKKW SEAEMRAAF ENDMLCMEAVCIL
Subjt:  GTNDSKTVTSSSRAAAVLLRED----------------YYN-DCAILSEFDGNIQNRYDSSHLKSKDQGKRCNKKWESEAEMRAAFVENDMLCMEAVCIL

Query:  YRLSSLTGKSRSAYFPSRRRGFNEADMLRGSTLALFLISGDSQGRLKKSVMELAKFDISGLIDCRRIAIEHLKQLFEIYKNNEDQFIF
        YR SSL GK RSAY PSR RGF+E D+LRGSTLALFL  GDSQG+LK+SVMEL KFDI GLIDCRRI+IEHL+QLFEIYKN+EDQF+F
Subjt:  YRLSSLTGKSRSAYFPSRRRGFNEADMLRGSTLALFLISGDSQGRLKKSVMELAKFDISGLIDCRRIAIEHLKQLFEIYKNNEDQFIF

XP_022942684.1 uncharacterized protein LOC111447644 isoform X1 [Cucurbita moschata]6.8e-20670.45Show/hide
Query:  MAIESHSVPAAFNSIDDDDLESRSISELVSILRTAFRSKEFDKVEEVLVAKEVKMRKDIENKNKEYELLQSKYEFLRLDSMTQESTLEEDKVDPKGFEKW
        MA +SHSV +   SIDDD+LE RSISELVSILRTAFR+++FDKVE VLVAKEVKMRK+IENKNKEYELLQSKYEFLRLD +T ES LE+DKVDPKGFEKW
Subjt:  MAIESHSVPAAFNSIDDDDLESRSISELVSILRTAFRSKEFDKVEEVLVAKEVKMRKDIENKNKEYELLQSKYEFLRLDSMTQESTLEEDKVDPKGFEKW

Query:  KETYEELKERESEILKLKELIVKVDEDREKKKSALERFEKLLEVVKKTQEDDRLTMEKLNHKNSELECEIEVIKKTKEECEKTVEELRSKNSKLECAVEE
        KE YEELKE+ESEI +LK+LI KV+EDREKKKSALE FEKLLE VKKTQED R+T+EKL HKNSELEC +EV+KK KE+  KT+EELR KN KLEC++EE
Subjt:  KETYEELKERESEILKLKELIVKVDEDREKKKSALERFEKLLEVVKKTQEDDRLTMEKLNHKNSELECEIEVIKKTKEECEKTVEELRSKNSKLECAVEE

Query:  LRSKNSKLECAIEVVKKTE-------EELKCKNSNLECAKREVEHNYELCRRKYDELARRVSQLENNRTMVKNGEPIAPNRNDSMSGGSRKLAGKRGAEN
        L  + S LE  +EVVKKTE       EELKCKNS LE AKRE EHNYELCRRKYDEL RRVSQLEN    +  G  +         GGSRKL G + AEN
Subjt:  LRSKNSKLECAIEVVKKTE-------EELKCKNSNLECAKREVEHNYELCRRKYDELARRVSQLENNRTMVKNGEPIAPNRNDSMSGGSRKLAGKRGAEN

Query:  DKITGTGGCIVEILSDDDHAPAENLSRARRNQKRKWDLLLNDWEDYDAEK-----MILPCETKGKESLKKVGAMFTTPPHSRPDNHVLKRSFSPGTNDSK
           TGT G +VEI+SDDDHAP +N SRARR +KR WDLLLND EDYD E+       LP ET GKE+LKKVGAM+ TPPH RPDNHVLKR  S  TNDSK
Subjt:  DKITGTGGCIVEILSDDDHAPAENLSRARRNQKRKWDLLLNDWEDYDAEK-----MILPCETKGKESLKKVGAMFTTPPHSRPDNHVLKRSFSPGTNDSK

Query:  TVTSSSRAAAVLLRED----------------YYN-DCAILSEFDGNIQNRYDSSHLKSKDQGKRCNKKWESEAEMRAAFVENDMLCMEAVCILYRLSSL
         V SSSRA AV+LRE+                Y+N DC+  S+ D  IQNRYDS+HLKSK QGKR NKKW SEAEMRAAF ENDMLCMEAVCILYR SSL
Subjt:  TVTSSSRAAAVLLRED----------------YYN-DCAILSEFDGNIQNRYDSSHLKSKDQGKRCNKKWESEAEMRAAFVENDMLCMEAVCILYRLSSL

Query:  TGKSRSAYFPSRRRGFNEADMLRGSTLALFLISGDSQGRLKKSVMELAKFDISGLIDCRRIAIEHLKQLFEIYKNNEDQFIF
         GK RSAY PSR RGF+E D+LRGSTLALFL  GDSQG+LK+SVMEL KFDI GLIDCRRI+IEHL+QLFEIYKN+EDQF+F
Subjt:  TGKSRSAYFPSRRRGFNEADMLRGSTLALFLISGDSQGRLKKSVMELAKFDISGLIDCRRIAIEHLKQLFEIYKNNEDQFIF

XP_022980041.1 uncharacterized protein LOC111479556 isoform X1 [Cucurbita maxima]6.8e-20670.84Show/hide
Query:  MAIESHSV-PAAFNSIDDDDLESRSISELVSILRTAFRSKEFDKVEEVLVAKEVKMRKDIENKNKEYELLQSKYEFLRLDSMTQESTLEEDKVDPKGFEK
        M+I+SH V  AA  SIDDD+LE RSISELVSILRTAFR+++FDKVE VLVAKEVKMRK+IENKNKEYELLQSKYEFLRLD +T ES LE+DKVDPKGFEK
Subjt:  MAIESHSV-PAAFNSIDDDDLESRSISELVSILRTAFRSKEFDKVEEVLVAKEVKMRKDIENKNKEYELLQSKYEFLRLDSMTQESTLEEDKVDPKGFEK

Query:  WKETYEELKERESEILKLKELIVKVDEDREKKKSALERFEKLLEVVKKTQEDDRLTMEKLNHKNSELECEIEVIKKTKEECEKTVEELRSKNSKLECAVE
        WKE YEELKE+ESEI +LK+LI KV+EDREKKKSALE FEKLLE VKKTQED R+T+EKL HKNSELEC +EV+KK KE+  KT+EELR KN KLEC++E
Subjt:  WKETYEELKERESEILKLKELIVKVDEDREKKKSALERFEKLLEVVKKTQEDDRLTMEKLNHKNSELECEIEVIKKTKEECEKTVEELRSKNSKLECAVE

Query:  ELRSKNSKLECAIEVVKKTE-------EELKCKNSNLECAKREVEHNYELCRRKYDELARRVSQLENNRTMVKNGEPIAPNRNDSMSGGSRKLAGKRGAE
        EL  + S+LE  +EVVKKTE       EELK KNS LE AKRE EHNYELCRRKYDEL+RRVSQLEN    +  G  +         GGSRKL G + AE
Subjt:  ELRSKNSKLECAIEVVKKTE-------EELKCKNSNLECAKREVEHNYELCRRKYDELARRVSQLENNRTMVKNGEPIAPNRNDSMSGGSRKLAGKRGAE

Query:  NDKITGTGGCIVEILSDDDHAPAENLSRARRNQKRKWDLLLNDWEDYDAEK-----MILPCETKGKESLKKVGAMFTTPPHSRPDNHVLKRSFSPGTNDS
        N   TGT G +VEI+SDDDHAP +N SRARR QKR WDLLLND EDYD E+       LP ET GKE+LKKVGAM+ TPPH RPDNHVLKR FS  TNDS
Subjt:  NDKITGTGGCIVEILSDDDHAPAENLSRARRNQKRKWDLLLNDWEDYDAEK-----MILPCETKGKESLKKVGAMFTTPPHSRPDNHVLKRSFSPGTNDS

Query:  KTVTSSSRAAAVLLRED----------------YYN-DCAILSEFDGNIQNRYDSSHLKSKDQGKRCNKKWESEAEMRAAFVENDMLCMEAVCILYRLSS
        K V SSSRA AV+LRE+                Y+N DC+  S+ D  IQNRYDS+HLKSK QGKR NKKWESEAEMRAAF ENDMLCMEAVCILYR SS
Subjt:  KTVTSSSRAAAVLLRED----------------YYN-DCAILSEFDGNIQNRYDSSHLKSKDQGKRCNKKWESEAEMRAAFVENDMLCMEAVCILYRLSS

Query:  LTGKSRSAYFPSRRRGFNEADMLRGSTLALFLISGDSQGRLKKSVMELAKFDISGLIDCRRIAIEHLKQLFEIYKNNEDQFIF
        L GK RSAY PSR RGF+E D+LRGSTLALFL  GDSQG+LK+SVMEL KFDI GLIDCRRI+IEHL+QLFEIYKN+EDQF+F
Subjt:  LTGKSRSAYFPSRRRGFNEADMLRGSTLALFLISGDSQGRLKKSVMELAKFDISGLIDCRRIAIEHLKQLFEIYKNNEDQFIF

XP_023550780.1 uncharacterized protein LOC111808815 isoform X1 [Cucurbita pepo subsp. pepo]5.2e-20670.27Show/hide
Query:  MAIESHSVPAAFNSIDDDDLESRSISELVSILRTAFRSKEFDKVEEVLVAKEVKMRKDIENKNKEYELLQSKYEFLRLDSMTQESTLEEDKVDPKGFEKW
        MA +SHSV +   SIDDD+LE RSISELVSILRTAFR+++FDKVE VLVAKEVKMRK+IENKNKEYELLQSKYEFLRLD +T ES LE+DKVDPKGFEKW
Subjt:  MAIESHSVPAAFNSIDDDDLESRSISELVSILRTAFRSKEFDKVEEVLVAKEVKMRKDIENKNKEYELLQSKYEFLRLDSMTQESTLEEDKVDPKGFEKW

Query:  KETYEELKERESEILKLKELIVKVDEDREKKKSALERFEKLLEVVKKTQEDDRLTMEKLNHKNSELECEIEVIKKTKEECEKTVEELRSKNSKLECAVEE
        KE YE+LKE+ESEI +LK+LI KV+EDREKKKSALE FEKLLE VKK QED R+T+EKL HKNSELEC +EV+KK KE+  KT+EELR KN KLEC++EE
Subjt:  KETYEELKERESEILKLKELIVKVDEDREKKKSALERFEKLLEVVKKTQEDDRLTMEKLNHKNSELECEIEVIKKTKEECEKTVEELRSKNSKLECAVEE

Query:  LRSKNSKLECAIEVVKKTE-------EELKCKNSNLECAKREVEHNYELCRRKYDELARRVSQLENNRTMVKNGEPIAPNRNDSMSGGSRKLAGKRGAEN
        L  + S+LE  +EVVKKTE       EELKCKNS LE AKRE EHNYELCRRKYDEL RRVSQLEN    +  G  +         GGSRKL G + AEN
Subjt:  LRSKNSKLECAIEVVKKTE-------EELKCKNSNLECAKREVEHNYELCRRKYDELARRVSQLENNRTMVKNGEPIAPNRNDSMSGGSRKLAGKRGAEN

Query:  DKITGTGGCIVEILSDDDHAPAENLSRARRNQKRKWDLLLNDWEDYDAEK-----MILPCETKGKESLKKVGAMFTTPPHSRPDNHVLKRSFSPGTNDSK
           TGT G +VEI+SDDDHAP +N SRARR +KR WDLLLND EDYD E+       LP ET GKE+LKKVGAM+ TPPH RPDNHVLKR  S  TNDSK
Subjt:  DKITGTGGCIVEILSDDDHAPAENLSRARRNQKRKWDLLLNDWEDYDAEK-----MILPCETKGKESLKKVGAMFTTPPHSRPDNHVLKRSFSPGTNDSK

Query:  TVTSSSRAAAVLLRED----------------YYN-DCAILSEFDGNIQNRYDSSHLKSKDQGKRCNKKWESEAEMRAAFVENDMLCMEAVCILYRLSSL
         V SSSRA AV+LRE+                Y+N DC+  S+ D  IQNRYDS+HLKSK QGKR NKKWESEAEMRAAF ENDMLCMEAVCILYR SSL
Subjt:  TVTSSSRAAAVLLRED----------------YYN-DCAILSEFDGNIQNRYDSSHLKSKDQGKRCNKKWESEAEMRAAFVENDMLCMEAVCILYRLSSL

Query:  TGKSRSAYFPSRRRGFNEADMLRGSTLALFLISGDSQGRLKKSVMELAKFDISGLIDCRRIAIEHLKQLFEIYKNNEDQFIF
         GK RSAY PSR RGF+E D+LRGSTLALFL  GDSQG+LK+SVMEL KFDI GLIDCRRI+IEHL+QLFEIYKN+EDQF+F
Subjt:  TGKSRSAYFPSRRRGFNEADMLRGSTLALFLISGDSQGRLKKSVMELAKFDISGLIDCRRIAIEHLKQLFEIYKNNEDQFIF

XP_038904233.1 uncharacterized protein LOC120090576 [Benincasa hispida]1.3e-22071.88Show/hide
Query:  ISSNFEMAIESHSVPAAFNSIDDDDLESRSISELVSILRTAFRSKEFDKVEEVLVAKEVKMRKDIENKNKEYELLQSKYEFLRLDSMTQESTLEEDK--V
        +SSNFEMAIES    A  +S D+D+LESRSISELVS LRTAFR+K+FDKVEEVLV++EVKMRK+IE+KNKEYELLQS+YEFLRLDS+T+ES +E+DK  V
Subjt:  ISSNFEMAIESHSVPAAFNSIDDDDLESRSISELVSILRTAFRSKEFDKVEEVLVAKEVKMRKDIENKNKEYELLQSKYEFLRLDSMTQESTLEEDK--V

Query:  DPKGFEKWKETYEELKERESEILKLKELIVKVDEDREKKKSALERFEKLLEVVKKTQEDDRLTMEKLNHKNSELECEIEVIKKTKEECEKTVEELRSKNS
        DPKGFEKWKETYEELKE+ESEI +LKELIVKV+EDREKKKS LE+FE++LE+VKKTQEDDRL +EKLNHKNSEL+  IEV+KK K++ EKT+EELR KN 
Subjt:  DPKGFEKWKETYEELKERESEILKLKELIVKVDEDREKKKSALERFEKLLEVVKKTQEDDRLTMEKLNHKNSELECEIEVIKKTKEECEKTVEELRSKNS

Query:  KLECAVEELRSKNSKLECAIEVVKKTE-------EELKCKNSNLECAKREVEHNYELCRRKYDELARRVSQLENNRTMVKNGEPIAPNRNDSMSGGSRKL
        KLECA+EEL  K S+LE A+ +VKKTE       E+LKCKNS LECAKREVEHNYELCRRK++EL RR+SQL+N  T+V+ GEPIAPNRND   GGSRKL
Subjt:  KLECAVEELRSKNSKLECAIEVVKKTE-------EELKCKNSNLECAKREVEHNYELCRRKYDELARRVSQLENNRTMVKNGEPIAPNRNDSMSGGSRKL

Query:  AGKRGAENDKI---TGTGGCIVEILSDDDHAPAENLSRARRNQKRKWDLLLNDWEDYDAEK-----MILPCETKGKESLKKVGAMFTTPPHSRPDNHVLK
         GKRGAENDK+   TGTGGC+VEI+SDDDHAPAENLSR++RNQ RK   LLND EDYDAE+      I P  +KGKESLKKVGAMF+TPPH R D H LK
Subjt:  AGKRGAENDKI---TGTGGCIVEILSDDDHAPAENLSRARRNQKRKWDLLLNDWEDYDAEK-----MILPCETKGKESLKKVGAMFTTPPHSRPDNHVLK

Query:  RSFSPGTNDSKTVTSSSRAAAVLLRE-----------------------DYYND-CAILSEFDGNIQNRYDSSHLKSKDQGKRCNKKWESEAEMRAAFVE
          FSP T+D K VT+SSRAAAV+LR+                       DY+ND C I S+ +G+IQNRY SSHLKS DQGK+CNKKWE EAEMRAAF E
Subjt:  RSFSPGTNDSKTVTSSSRAAAVLLRE-----------------------DYYND-CAILSEFDGNIQNRYDSSHLKSKDQGKRCNKKWESEAEMRAAFVE

Query:  NDMLCMEAVCILYRLSSLTGKSRSAYFPSRRRGFNEADMLRGSTLALFLISGDSQGRLKKSVMELAKFDISGLIDCRRIAIEHLKQLFEIYKNNEDQFIF
        NDMLCMEAVCILYR +SL GK  S Y PSR RGFNEAD+LRGSTLALFL +GD  GRLKKSVMEL KFDISGLIDCRRIAI+HLKQLFEIYKNNEDQFIF
Subjt:  NDMLCMEAVCILYRLSSLTGKSRSAYFPSRRRGFNEADMLRGSTLALFLISGDSQGRLKKSVMELAKFDISGLIDCRRIAIEHLKQLFEIYKNNEDQFIF

Query:  H
        H
Subjt:  H

TrEMBL top hitse value%identityAlignment
A0A6J1FS14 uncharacterized protein LOC111447644 isoform X13.3e-20670.45Show/hide
Query:  MAIESHSVPAAFNSIDDDDLESRSISELVSILRTAFRSKEFDKVEEVLVAKEVKMRKDIENKNKEYELLQSKYEFLRLDSMTQESTLEEDKVDPKGFEKW
        MA +SHSV +   SIDDD+LE RSISELVSILRTAFR+++FDKVE VLVAKEVKMRK+IENKNKEYELLQSKYEFLRLD +T ES LE+DKVDPKGFEKW
Subjt:  MAIESHSVPAAFNSIDDDDLESRSISELVSILRTAFRSKEFDKVEEVLVAKEVKMRKDIENKNKEYELLQSKYEFLRLDSMTQESTLEEDKVDPKGFEKW

Query:  KETYEELKERESEILKLKELIVKVDEDREKKKSALERFEKLLEVVKKTQEDDRLTMEKLNHKNSELECEIEVIKKTKEECEKTVEELRSKNSKLECAVEE
        KE YEELKE+ESEI +LK+LI KV+EDREKKKSALE FEKLLE VKKTQED R+T+EKL HKNSELEC +EV+KK KE+  KT+EELR KN KLEC++EE
Subjt:  KETYEELKERESEILKLKELIVKVDEDREKKKSALERFEKLLEVVKKTQEDDRLTMEKLNHKNSELECEIEVIKKTKEECEKTVEELRSKNSKLECAVEE

Query:  LRSKNSKLECAIEVVKKTE-------EELKCKNSNLECAKREVEHNYELCRRKYDELARRVSQLENNRTMVKNGEPIAPNRNDSMSGGSRKLAGKRGAEN
        L  + S LE  +EVVKKTE       EELKCKNS LE AKRE EHNYELCRRKYDEL RRVSQLEN    +  G  +         GGSRKL G + AEN
Subjt:  LRSKNSKLECAIEVVKKTE-------EELKCKNSNLECAKREVEHNYELCRRKYDELARRVSQLENNRTMVKNGEPIAPNRNDSMSGGSRKLAGKRGAEN

Query:  DKITGTGGCIVEILSDDDHAPAENLSRARRNQKRKWDLLLNDWEDYDAEK-----MILPCETKGKESLKKVGAMFTTPPHSRPDNHVLKRSFSPGTNDSK
           TGT G +VEI+SDDDHAP +N SRARR +KR WDLLLND EDYD E+       LP ET GKE+LKKVGAM+ TPPH RPDNHVLKR  S  TNDSK
Subjt:  DKITGTGGCIVEILSDDDHAPAENLSRARRNQKRKWDLLLNDWEDYDAEK-----MILPCETKGKESLKKVGAMFTTPPHSRPDNHVLKRSFSPGTNDSK

Query:  TVTSSSRAAAVLLRED----------------YYN-DCAILSEFDGNIQNRYDSSHLKSKDQGKRCNKKWESEAEMRAAFVENDMLCMEAVCILYRLSSL
         V SSSRA AV+LRE+                Y+N DC+  S+ D  IQNRYDS+HLKSK QGKR NKKW SEAEMRAAF ENDMLCMEAVCILYR SSL
Subjt:  TVTSSSRAAAVLLRED----------------YYN-DCAILSEFDGNIQNRYDSSHLKSKDQGKRCNKKWESEAEMRAAFVENDMLCMEAVCILYRLSSL

Query:  TGKSRSAYFPSRRRGFNEADMLRGSTLALFLISGDSQGRLKKSVMELAKFDISGLIDCRRIAIEHLKQLFEIYKNNEDQFIF
         GK RSAY PSR RGF+E D+LRGSTLALFL  GDSQG+LK+SVMEL KFDI GLIDCRRI+IEHL+QLFEIYKN+EDQF+F
Subjt:  TGKSRSAYFPSRRRGFNEADMLRGSTLALFLISGDSQGRLKKSVMELAKFDISGLIDCRRIAIEHLKQLFEIYKNNEDQFIF

A0A6J1FWR7 uncharacterized protein LOC111447644 isoform X23.1e-20469.93Show/hide
Query:  MAIESHSVPAAFNSIDDDDLESRSISELVSILRTAFRSKEFDKVEEVLVAKEVKMRKDIENKNKEYELLQSKYEFLRLDSMTQESTLEEDKVDPKGFEKW
        MA +SHSV +   SIDDD+LE RSISELVSILRTAFR+++FDKVE VLVAKEVKMRK+IENKNKEYELLQSKYEFLRLD +T ES LE+DKVDPKGFEKW
Subjt:  MAIESHSVPAAFNSIDDDDLESRSISELVSILRTAFRSKEFDKVEEVLVAKEVKMRKDIENKNKEYELLQSKYEFLRLDSMTQESTLEEDKVDPKGFEKW

Query:  KETYEELKERESEILKLKELIVKVDEDREKKKSALERFEKLLEVVKKTQEDDRLTMEKLNHKNSELECEIEVIKKTKEECEKTVEELRSKNSKLECAVEE
        KE YEELKE+ESEI +LK+LI KV+EDREKKKSALE FEKLLE VKKTQED R+T+EKL HKNSELEC +EV+KK KE+  KT+EELR KN KLEC++EE
Subjt:  KETYEELKERESEILKLKELIVKVDEDREKKKSALERFEKLLEVVKKTQEDDRLTMEKLNHKNSELECEIEVIKKTKEECEKTVEELRSKNSKLECAVEE

Query:  LRSKNSKLECAIEVVKKTE-------EELKCKNSNLECAKREVEHNYELCRRKYDELARRVSQLENNRTMVKNGEPIAPNRNDSMSGGSRKLAGKRGAEN
        L  + S LE  +EVVKKTE       EELKCKNS LE AKRE EHNYELCRRKYDEL RRVSQLEN    +  G  +         GGSRKL G + AEN
Subjt:  LRSKNSKLECAIEVVKKTE-------EELKCKNSNLECAKREVEHNYELCRRKYDELARRVSQLENNRTMVKNGEPIAPNRNDSMSGGSRKLAGKRGAEN

Query:  DKITGTGGCIVEILSDDDHAPAENLSRARRNQKRKWDLLLNDWEDYDAEK-----MILPCETKGKESLKKVGAMFTTPPHSRPDNHVLKRSFSPGTNDSK
               G +VEI+SDDDHAP +N SRARR +KR WDLLLND EDYD E+       LP ET GKE+LKKVGAM+ TPPH RPDNHVLKR  S  TNDSK
Subjt:  DKITGTGGCIVEILSDDDHAPAENLSRARRNQKRKWDLLLNDWEDYDAEK-----MILPCETKGKESLKKVGAMFTTPPHSRPDNHVLKRSFSPGTNDSK

Query:  TVTSSSRAAAVLLRED----------------YYN-DCAILSEFDGNIQNRYDSSHLKSKDQGKRCNKKWESEAEMRAAFVENDMLCMEAVCILYRLSSL
         V SSSRA AV+LRE+                Y+N DC+  S+ D  IQNRYDS+HLKSK QGKR NKKW SEAEMRAAF ENDMLCMEAVCILYR SSL
Subjt:  TVTSSSRAAAVLLRED----------------YYN-DCAILSEFDGNIQNRYDSSHLKSKDQGKRCNKKWESEAEMRAAFVENDMLCMEAVCILYRLSSL

Query:  TGKSRSAYFPSRRRGFNEADMLRGSTLALFLISGDSQGRLKKSVMELAKFDISGLIDCRRIAIEHLKQLFEIYKNNEDQFIF
         GK RSAY PSR RGF+E D+LRGSTLALFL  GDSQG+LK+SVMEL KFDI GLIDCRRI+IEHL+QLFEIYKN+EDQF+F
Subjt:  TGKSRSAYFPSRRRGFNEADMLRGSTLALFLISGDSQGRLKKSVMELAKFDISGLIDCRRIAIEHLKQLFEIYKNNEDQFIF

A0A6J1ISG5 uncharacterized protein LOC111479556 isoform X13.3e-20670.84Show/hide
Query:  MAIESHSV-PAAFNSIDDDDLESRSISELVSILRTAFRSKEFDKVEEVLVAKEVKMRKDIENKNKEYELLQSKYEFLRLDSMTQESTLEEDKVDPKGFEK
        M+I+SH V  AA  SIDDD+LE RSISELVSILRTAFR+++FDKVE VLVAKEVKMRK+IENKNKEYELLQSKYEFLRLD +T ES LE+DKVDPKGFEK
Subjt:  MAIESHSV-PAAFNSIDDDDLESRSISELVSILRTAFRSKEFDKVEEVLVAKEVKMRKDIENKNKEYELLQSKYEFLRLDSMTQESTLEEDKVDPKGFEK

Query:  WKETYEELKERESEILKLKELIVKVDEDREKKKSALERFEKLLEVVKKTQEDDRLTMEKLNHKNSELECEIEVIKKTKEECEKTVEELRSKNSKLECAVE
        WKE YEELKE+ESEI +LK+LI KV+EDREKKKSALE FEKLLE VKKTQED R+T+EKL HKNSELEC +EV+KK KE+  KT+EELR KN KLEC++E
Subjt:  WKETYEELKERESEILKLKELIVKVDEDREKKKSALERFEKLLEVVKKTQEDDRLTMEKLNHKNSELECEIEVIKKTKEECEKTVEELRSKNSKLECAVE

Query:  ELRSKNSKLECAIEVVKKTE-------EELKCKNSNLECAKREVEHNYELCRRKYDELARRVSQLENNRTMVKNGEPIAPNRNDSMSGGSRKLAGKRGAE
        EL  + S+LE  +EVVKKTE       EELK KNS LE AKRE EHNYELCRRKYDEL+RRVSQLEN    +  G  +         GGSRKL G + AE
Subjt:  ELRSKNSKLECAIEVVKKTE-------EELKCKNSNLECAKREVEHNYELCRRKYDELARRVSQLENNRTMVKNGEPIAPNRNDSMSGGSRKLAGKRGAE

Query:  NDKITGTGGCIVEILSDDDHAPAENLSRARRNQKRKWDLLLNDWEDYDAEK-----MILPCETKGKESLKKVGAMFTTPPHSRPDNHVLKRSFSPGTNDS
        N   TGT G +VEI+SDDDHAP +N SRARR QKR WDLLLND EDYD E+       LP ET GKE+LKKVGAM+ TPPH RPDNHVLKR FS  TNDS
Subjt:  NDKITGTGGCIVEILSDDDHAPAENLSRARRNQKRKWDLLLNDWEDYDAEK-----MILPCETKGKESLKKVGAMFTTPPHSRPDNHVLKRSFSPGTNDS

Query:  KTVTSSSRAAAVLLRED----------------YYN-DCAILSEFDGNIQNRYDSSHLKSKDQGKRCNKKWESEAEMRAAFVENDMLCMEAVCILYRLSS
        K V SSSRA AV+LRE+                Y+N DC+  S+ D  IQNRYDS+HLKSK QGKR NKKWESEAEMRAAF ENDMLCMEAVCILYR SS
Subjt:  KTVTSSSRAAAVLLRED----------------YYN-DCAILSEFDGNIQNRYDSSHLKSKDQGKRCNKKWESEAEMRAAFVENDMLCMEAVCILYRLSS

Query:  LTGKSRSAYFPSRRRGFNEADMLRGSTLALFLISGDSQGRLKKSVMELAKFDISGLIDCRRIAIEHLKQLFEIYKNNEDQFIF
        L GK RSAY PSR RGF+E D+LRGSTLALFL  GDSQG+LK+SVMEL KFDI GLIDCRRI+IEHL+QLFEIYKN+EDQF+F
Subjt:  LTGKSRSAYFPSRRRGFNEADMLRGSTLALFLISGDSQGRLKKSVMELAKFDISGLIDCRRIAIEHLKQLFEIYKNNEDQFIF

A0A6J1ISI0 uncharacterized protein LOC111479556 isoform X31.9e-20171.38Show/hide
Query:  MAIESHSV-PAAFNSIDDDDLESRSISELVSILRTAFRSKEFDKVEEVLVAKEVKMRKDIENKNKEYELLQSKYEFLRLDSMTQESTLEEDKVDPKGFEK
        M+I+SH V  AA  SIDDD+LE RSISELVSILRTAFR+++FDKVE VLVAKEVKMRK+IENKNKEYELLQSKYEFLRLD +T ES LE+DKVDPKGFEK
Subjt:  MAIESHSV-PAAFNSIDDDDLESRSISELVSILRTAFRSKEFDKVEEVLVAKEVKMRKDIENKNKEYELLQSKYEFLRLDSMTQESTLEEDKVDPKGFEK

Query:  WKETYEELKERESEILKLKELIVKVDEDREKKKSALERFEKLLEVVKKTQEDDRLTMEKLNHKNSELECEIEVIKKTKEECEKTVEELRSKNSKLECAVE
        WKE YEELKE+ESEI +LK+LI KV+EDREKKKSALE FEKLLE VKKTQED R+T+EKL HKNSELEC +EV+KK KE+  KT+EELR KN KLEC++E
Subjt:  WKETYEELKERESEILKLKELIVKVDEDREKKKSALERFEKLLEVVKKTQEDDRLTMEKLNHKNSELECEIEVIKKTKEECEKTVEELRSKNSKLECAVE

Query:  ELRSKNSKLECAIEVVKKTE-------EELKCKNSNLECAKREVEHNYELCRRKYDELARRVSQLENNRTMVKNGEPIAPNRNDSMSGGSRKLAGKRGAE
        EL  + S+LE  +EVVKKTE       EELK KNS LE AKRE EHNYELCRRKYDEL+RRVSQLEN    +  G  +         GGSRKL G + AE
Subjt:  ELRSKNSKLECAIEVVKKTE-------EELKCKNSNLECAKREVEHNYELCRRKYDELARRVSQLENNRTMVKNGEPIAPNRNDSMSGGSRKLAGKRGAE

Query:  NDKITGTGGCIVEILSDDDHAPAENLSRARRNQKRKWDLLLNDWEDYDAEK-----MILPCETKGKESLKKVGAMFTTPPHSRPDNHVLKRSFSPGTNDS
        N   TGT G +VEI+SDDDHAP +N SRARR QKR WDLLLND EDYD E+       LP ET GKE+LKKVGAM+ TPPH RPDNHVLKR FS  TNDS
Subjt:  NDKITGTGGCIVEILSDDDHAPAENLSRARRNQKRKWDLLLNDWEDYDAEK-----MILPCETKGKESLKKVGAMFTTPPHSRPDNHVLKRSFSPGTNDS

Query:  KTVTSSSRAAAVLLREDYYNDCAILSEFDGNIQNRYDSSHLKSKDQGKRCNKKWESEAEMRAAFVENDMLCMEAVCILYRLSSLTGKSRSAYFPSRRRGF
        K V SSSRA AV+LRE+  +          N   +YDS+HLKSK QGKR NKKWESEAEMRAAF ENDMLCMEAVCILYR SSL GK RSAY PSR RGF
Subjt:  KTVTSSSRAAAVLLREDYYNDCAILSEFDGNIQNRYDSSHLKSKDQGKRCNKKWESEAEMRAAFVENDMLCMEAVCILYRLSSLTGKSRSAYFPSRRRGF

Query:  NEADMLRGSTLALFLISGDSQGRLKKSVMELAKFDISGLIDCRRIAIEHLKQLFEIYKNNEDQFIF
        +E D+LRGSTLALFL  GDSQG+LK+SVMEL KFDI GLIDCRRI+IEHL+QLFEIYKN+EDQF+F
Subjt:  NEADMLRGSTLALFLISGDSQGRLKKSVMELAKFDISGLIDCRRIAIEHLKQLFEIYKNNEDQFIF

A0A6J1IY94 uncharacterized protein LOC111479556 isoform X23.1e-20470.33Show/hide
Query:  MAIESHSV-PAAFNSIDDDDLESRSISELVSILRTAFRSKEFDKVEEVLVAKEVKMRKDIENKNKEYELLQSKYEFLRLDSMTQESTLEEDKVDPKGFEK
        M+I+SH V  AA  SIDDD+LE RSISELVSILRTAFR+++FDKVE VLVAKEVKMRK+IENKNKEYELLQSKYEFLRLD +T ES LE+DKVDPKGFEK
Subjt:  MAIESHSV-PAAFNSIDDDDLESRSISELVSILRTAFRSKEFDKVEEVLVAKEVKMRKDIENKNKEYELLQSKYEFLRLDSMTQESTLEEDKVDPKGFEK

Query:  WKETYEELKERESEILKLKELIVKVDEDREKKKSALERFEKLLEVVKKTQEDDRLTMEKLNHKNSELECEIEVIKKTKEECEKTVEELRSKNSKLECAVE
        WKE YEELKE+ESEI +LK+LI KV+EDREKKKSALE FEKLLE VKKTQED R+T+EKL HKNSELEC +EV+KK KE+  KT+EELR KN KLEC++E
Subjt:  WKETYEELKERESEILKLKELIVKVDEDREKKKSALERFEKLLEVVKKTQEDDRLTMEKLNHKNSELECEIEVIKKTKEECEKTVEELRSKNSKLECAVE

Query:  ELRSKNSKLECAIEVVKKTE-------EELKCKNSNLECAKREVEHNYELCRRKYDELARRVSQLENNRTMVKNGEPIAPNRNDSMSGGSRKLAGKRGAE
        EL  + S+LE  +EVVKKTE       EELK KNS LE AKRE EHNYELCRRKYDEL+RRVSQLEN    +  G  +         GGSRKL G + AE
Subjt:  ELRSKNSKLECAIEVVKKTE-------EELKCKNSNLECAKREVEHNYELCRRKYDELARRVSQLENNRTMVKNGEPIAPNRNDSMSGGSRKLAGKRGAE

Query:  NDKITGTGGCIVEILSDDDHAPAENLSRARRNQKRKWDLLLNDWEDYDAEK-----MILPCETKGKESLKKVGAMFTTPPHSRPDNHVLKRSFSPGTNDS
        N       G +VEI+SDDDHAP +N SRARR QKR WDLLLND EDYD E+       LP ET GKE+LKKVGAM+ TPPH RPDNHVLKR FS  TNDS
Subjt:  NDKITGTGGCIVEILSDDDHAPAENLSRARRNQKRKWDLLLNDWEDYDAEK-----MILPCETKGKESLKKVGAMFTTPPHSRPDNHVLKRSFSPGTNDS

Query:  KTVTSSSRAAAVLLRED----------------YYN-DCAILSEFDGNIQNRYDSSHLKSKDQGKRCNKKWESEAEMRAAFVENDMLCMEAVCILYRLSS
        K V SSSRA AV+LRE+                Y+N DC+  S+ D  IQNRYDS+HLKSK QGKR NKKWESEAEMRAAF ENDMLCMEAVCILYR SS
Subjt:  KTVTSSSRAAAVLLRED----------------YYN-DCAILSEFDGNIQNRYDSSHLKSKDQGKRCNKKWESEAEMRAAFVENDMLCMEAVCILYRLSS

Query:  LTGKSRSAYFPSRRRGFNEADMLRGSTLALFLISGDSQGRLKKSVMELAKFDISGLIDCRRIAIEHLKQLFEIYKNNEDQFIF
        L GK RSAY PSR RGF+E D+LRGSTLALFL  GDSQG+LK+SVMEL KFDI GLIDCRRI+IEHL+QLFEIYKN+EDQF+F
Subjt:  LTGKSRSAYFPSRRRGFNEADMLRGSTLALFLISGDSQGRLKKSVMELAKFDISGLIDCRRIAIEHLKQLFEIYKNNEDQFIF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G53220.1 unknown protein6.9e-2341.43Show/hide
Query:  LSEFDGNIQNRYDSSHLKSKDQGKRCNKKWESEAEMRAAFVENDMLCMEAVCILYRLSSLTGKSRSAYFPSRRRGFNEADMLRGSTLALFLISGDSQGRL
        + E DG +      S L+ + + ++  +KWE EA+M A F ++  LCM AVC+L+R  +   K   +   S  RGF++ D +RG+++ALFL  GDS G +
Subjt:  LSEFDGNIQNRYDSSHLKSKDQGKRCNKKWESEAEMRAAFVENDMLCMEAVCILYRLSSLTGKSRSAYFPSRRRGFNEADMLRGSTLALFLISGDSQGRL

Query:  KKSVMELAKFDISGLIDCRRIAIEHLKQLFEIYKNNEDQF
        KKSV EL  FD  G+  C  +A ++ KQLF+IY N ED F
Subjt:  KKSVMELAKFDISGLIDCRRIAIEHLKQLFEIYKNNEDQF

AT5G53220.2 unknown protein6.9e-2341.43Show/hide
Query:  LSEFDGNIQNRYDSSHLKSKDQGKRCNKKWESEAEMRAAFVENDMLCMEAVCILYRLSSLTGKSRSAYFPSRRRGFNEADMLRGSTLALFLISGDSQGRL
        + E DG +      S L+ + + ++  +KWE EA+M A F ++  LCM AVC+L+R  +   K   +   S  RGF++ D +RG+++ALFL  GDS G +
Subjt:  LSEFDGNIQNRYDSSHLKSKDQGKRCNKKWESEAEMRAAFVENDMLCMEAVCILYRLSSLTGKSRSAYFPSRRRGFNEADMLRGSTLALFLISGDSQGRL

Query:  KKSVMELAKFDISGLIDCRRIAIEHLKQLFEIYKNNEDQF
        KKSV EL  FD  G+  C  +A ++ KQLF+IY N ED F
Subjt:  KKSVMELAKFDISGLIDCRRIAIEHLKQLFEIYKNNEDQF

AT5G53220.3 unknown protein6.9e-2341.43Show/hide
Query:  LSEFDGNIQNRYDSSHLKSKDQGKRCNKKWESEAEMRAAFVENDMLCMEAVCILYRLSSLTGKSRSAYFPSRRRGFNEADMLRGSTLALFLISGDSQGRL
        + E DG +      S L+ + + ++  +KWE EA+M A F ++  LCM AVC+L+R  +   K   +   S  RGF++ D +RG+++ALFL  GDS G +
Subjt:  LSEFDGNIQNRYDSSHLKSKDQGKRCNKKWESEAEMRAAFVENDMLCMEAVCILYRLSSLTGKSRSAYFPSRRRGFNEADMLRGSTLALFLISGDSQGRL

Query:  KKSVMELAKFDISGLIDCRRIAIEHLKQLFEIYKNNEDQF
        KKSV EL  FD  G+  C  +A ++ KQLF+IY N ED F
Subjt:  KKSVMELAKFDISGLIDCRRIAIEHLKQLFEIYKNNEDQF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
TTCTCTCTTCACTTCCCTCACTATGATTCTGCAACTTCAGCATTTAAACCCTCCTCTTTCGCGTTCTTGAAGAAGAACATCGAACGATTCGCCTCCTTCTCAGAGCTTGT
GGGAAACTCCTCGGACAAGACTCTCAACTCTGTATATTTTGTTGTTCAAGAAACCGCATTTGTCCACTCTTCGAGAGCCATTTCTTCCAATTTTGAAATGGCGATCGAGT
CGCATTCCGTGCCTGCTGCTTTCAATTCTATCGACGACGATGACCTAGAAAGTCGGAGCATTTCAGAGTTGGTTTCGATTCTGCGGACGGCTTTCCGGTCAAAAGAATTC
GACAAGGTGGAGGAGGTTTTGGTCGCGAAAGAAGTAAAAATGAGGAAAGATATTGAGAACAAGAACAAAGAGTATGAATTGCTCCAAAGTAAATATGAGTTTCTAAGACT
CGATAGCATGACTCAAGAATCCACGCTCGAGGAGGATAAGGTTGACCCTAAAGGATTCGAGAAGTGGAAGGAAACGTACGAGGAGTTGAAGGAGAGAGAGAGTGAGATTC
TAAAGCTCAAGGAATTGATTGTTAAAGTAGATGAGGATAGGGAGAAGAAGAAAAGCGCTTTGGAGAGATTTGAGAAATTGTTAGAAGTGGTAAAAAAAACGCAAGAAGAT
GATAGATTGACGATGGAGAAGCTTAACCACAAGAATTCAGAATTAGAATGTGAAATAGAAGTGATCAAGAAAACGAAGGAAGAGTGTGAGAAGACTGTAGAGGAGCTCAG
AAGCAAAAATTCAAAATTAGAATGTGCAGTAGAGGAGCTTAGGAGCAAAAATTCAAAATTAGAATGTGCAATAGAAGTGGTCAAGAAAACAGAGGAAGAGCTTAAATGCA
AGAACTCAAACTTAGAATGTGCAAAGAGAGAAGTTGAGCATAACTACGAGTTGTGTAGAAGGAAATATGATGAACTTGCACGCCGAGTCTCACAATTAGAGAACAACAGA
ACAATGGTAAAAAATGGGGAGCCCATTGCTCCAAACAGAAACGACTCTATGTCAGGAGGTTCCAGAAAATTGGCTGGGAAGAGAGGGGCAGAAAATGACAAAATAACTGG
TACTGGAGGATGCATTGTTGAGATCCTTAGTGATGATGATCATGCTCCAGCTGAAAATTTATCTAGAGCACGCAGAAATCAAAAGAGAAAATGGGATTTATTATTAAATG
ATTGGGAAGATTATGATGCTGAAAAGATGATTCTGCCCTGTGAAACGAAGGGTAAGGAGTCATTGAAGAAAGTAGGGGCAATGTTTACAACACCTCCACATAGTCGTCCG
GACAATCATGTCTTGAAAAGGAGTTTTTCTCCTGGTACTAATGATTCTAAGACGGTCACGTCCTCTTCAAGGGCCGCTGCAGTCTTGTTGAGGGAAGACTACTACAATGA
CTGTGCCATTTTATCAGAATTTGATGGCAATATTCAAAATAGATATGACTCGAGTCATCTCAAGTCAAAAGACCAAGGGAAGAGGTGTAATAAAAAATGGGAGTCGGAGG
CTGAAATGCGTGCTGCATTTGTTGAAAACGATATGCTTTGCATGGAGGCTGTTTGTATTCTCTATAGACTATCAAGTTTAACAGGAAAGTCTCGTAGTGCATATTTTCCT
TCCAGACGTAGAGGATTTAATGAGGCTGATATGCTCAGGGGTTCCACATTGGCATTGTTTCTAATAAGCGGAGATTCACAAGGGAGATTGAAGAAATCTGTGATGGAGTT
AGCGAAATTTGACATAAGTGGTCTTATTGACTGCAGAAGAATCGCGATCGAGCATTTGAAGCAGTTGTTTGAAATATATAAGAACAATGAAGATCAATTCATATTCCATT
AA
mRNA sequenceShow/hide mRNA sequence
TTCTCTCTTCACTTCCCTCACTATGATTCTGCAACTTCAGCATTTAAACCCTCCTCTTTCGCGTTCTTGAAGAAGAACATCGAACGATTCGCCTCCTTCTCAGAGCTTGT
GGGAAACTCCTCGGACAAGACTCTCAACTCTGTATATTTTGTTGTTCAAGAAACCGCATTTGTCCACTCTTCGAGAGCCATTTCTTCCAATTTTGAAATGGCGATCGAGT
CGCATTCCGTGCCTGCTGCTTTCAATTCTATCGACGACGATGACCTAGAAAGTCGGAGCATTTCAGAGTTGGTTTCGATTCTGCGGACGGCTTTCCGGTCAAAAGAATTC
GACAAGGTGGAGGAGGTTTTGGTCGCGAAAGAAGTAAAAATGAGGAAAGATATTGAGAACAAGAACAAAGAGTATGAATTGCTCCAAAGTAAATATGAGTTTCTAAGACT
CGATAGCATGACTCAAGAATCCACGCTCGAGGAGGATAAGGTTGACCCTAAAGGATTCGAGAAGTGGAAGGAAACGTACGAGGAGTTGAAGGAGAGAGAGAGTGAGATTC
TAAAGCTCAAGGAATTGATTGTTAAAGTAGATGAGGATAGGGAGAAGAAGAAAAGCGCTTTGGAGAGATTTGAGAAATTGTTAGAAGTGGTAAAAAAAACGCAAGAAGAT
GATAGATTGACGATGGAGAAGCTTAACCACAAGAATTCAGAATTAGAATGTGAAATAGAAGTGATCAAGAAAACGAAGGAAGAGTGTGAGAAGACTGTAGAGGAGCTCAG
AAGCAAAAATTCAAAATTAGAATGTGCAGTAGAGGAGCTTAGGAGCAAAAATTCAAAATTAGAATGTGCAATAGAAGTGGTCAAGAAAACAGAGGAAGAGCTTAAATGCA
AGAACTCAAACTTAGAATGTGCAAAGAGAGAAGTTGAGCATAACTACGAGTTGTGTAGAAGGAAATATGATGAACTTGCACGCCGAGTCTCACAATTAGAGAACAACAGA
ACAATGGTAAAAAATGGGGAGCCCATTGCTCCAAACAGAAACGACTCTATGTCAGGAGGTTCCAGAAAATTGGCTGGGAAGAGAGGGGCAGAAAATGACAAAATAACTGG
TACTGGAGGATGCATTGTTGAGATCCTTAGTGATGATGATCATGCTCCAGCTGAAAATTTATCTAGAGCACGCAGAAATCAAAAGAGAAAATGGGATTTATTATTAAATG
ATTGGGAAGATTATGATGCTGAAAAGATGATTCTGCCCTGTGAAACGAAGGGTAAGGAGTCATTGAAGAAAGTAGGGGCAATGTTTACAACACCTCCACATAGTCGTCCG
GACAATCATGTCTTGAAAAGGAGTTTTTCTCCTGGTACTAATGATTCTAAGACGGTCACGTCCTCTTCAAGGGCCGCTGCAGTCTTGTTGAGGGAAGACTACTACAATGA
CTGTGCCATTTTATCAGAATTTGATGGCAATATTCAAAATAGATATGACTCGAGTCATCTCAAGTCAAAAGACCAAGGGAAGAGGTGTAATAAAAAATGGGAGTCGGAGG
CTGAAATGCGTGCTGCATTTGTTGAAAACGATATGCTTTGCATGGAGGCTGTTTGTATTCTCTATAGACTATCAAGTTTAACAGGAAAGTCTCGTAGTGCATATTTTCCT
TCCAGACGTAGAGGATTTAATGAGGCTGATATGCTCAGGGGTTCCACATTGGCATTGTTTCTAATAAGCGGAGATTCACAAGGGAGATTGAAGAAATCTGTGATGGAGTT
AGCGAAATTTGACATAAGTGGTCTTATTGACTGCAGAAGAATCGCGATCGAGCATTTGAAGCAGTTGTTTGAAATATATAAGAACAATGAAGATCAATTCATATTCCATT
AA
Protein sequenceShow/hide protein sequence
FSLHFPHYDSATSAFKPSSFAFLKKNIERFASFSELVGNSSDKTLNSVYFVVQETAFVHSSRAISSNFEMAIESHSVPAAFNSIDDDDLESRSISELVSILRTAFRSKEF
DKVEEVLVAKEVKMRKDIENKNKEYELLQSKYEFLRLDSMTQESTLEEDKVDPKGFEKWKETYEELKERESEILKLKELIVKVDEDREKKKSALERFEKLLEVVKKTQED
DRLTMEKLNHKNSELECEIEVIKKTKEECEKTVEELRSKNSKLECAVEELRSKNSKLECAIEVVKKTEEELKCKNSNLECAKREVEHNYELCRRKYDELARRVSQLENNR
TMVKNGEPIAPNRNDSMSGGSRKLAGKRGAENDKITGTGGCIVEILSDDDHAPAENLSRARRNQKRKWDLLLNDWEDYDAEKMILPCETKGKESLKKVGAMFTTPPHSRP
DNHVLKRSFSPGTNDSKTVTSSSRAAAVLLREDYYNDCAILSEFDGNIQNRYDSSHLKSKDQGKRCNKKWESEAEMRAAFVENDMLCMEAVCILYRLSSLTGKSRSAYFP
SRRRGFNEADMLRGSTLALFLISGDSQGRLKKSVMELAKFDISGLIDCRRIAIEHLKQLFEIYKNNEDQFIFH