CuGenDBv2

CmoCh19G010570 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh19G010570
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: trihelix transcription factor ASIL1-like
Genome location: Cmo_Chr19:9174264..9179792
RNA-Seq Expression: CmoCh19G010570
Synteny: CmoCh19G010570
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR007789 - Protein of unknown function DUF688
InterPro domains: IPR044822 - Myb/SANT-like DNA-binding domain 4


Homology
GenBank top hits (e value, % identity, alignment)
KAG6572395.1 hypothetical protein SDJN03_29123, partial [Cucurbita argyrosperma subsp. sororia] (e value: 0.0e+00; identity: 98.74%)
Query:  MISLRNLMEEKQLDLNQPLLSVRRFTSTGTSSKTNEKARPEPEIPRPPVYKSELKSGPVRNPGTVPFVWEQTPGKPKDESTVKTQSSKRPPLVPKLPPGR
        MISLRNLMEEKQLDLNQPLLSVRRFTSTGTSSKTNEKAR EPEIPRPPVYKSELKSGPVRNPGTVPFVWEQTPGKPKDESTV TQSSKRPPLVPKLPPGR
Subjt:  MISLRNLMEEKQLDLNQPLLSVRRFTSTGTSSKTNEKARPEPEIPRPPVYKSELKSGPVRNPGTVPFVWEQTPGKPKDESTVKTQSSKRPPLVPKLPPGR

Query:  AQKMNQEVPTSINASLDENARETVEELTSCKSGNDDEEEDDEVYRDANDIFSRSESFFLNCSISGVSGLDDSEIKPSGVSSMDPHTRNFMMDRFLPAAKA
        AQKMNQEVPTSINASLDENARETVEE+TSCKSGNDDEEEDDEVYRDANDIFSRSESFFLNCSISGVSGLDDSEIKPSGVSSMDPHTRNFMMDRFLPAAKA
Subjt:  AQKMNQEVPTSINASLDENARETVEELTSCKSGNDDEEEDDEVYRDANDIFSRSESFFLNCSISGVSGLDDSEIKPSGVSSMDPHTRNFMMDRFLPAAKA

Query:  MASETPPHTIRKQTVERPREIKLVTNGDRQSRPNLHVQTQHVKEIFREESDDEDDDFDESEYASTRGCGFLPRFCLKGSLVLLNPVPGMRMQATSVRRVR
        MASETPPHTIRKQTVERPREIKLVTNGDRQSR NLHVQTQHVKEIFREESDDEDDDFD+SEYAS RGCGFLPRFCLKGSLVLLNPVPGMRMQATSVRRVR
Subjt:  MASETPPHTIRKQTVERPREIKLVTNGDRQSRPNLHVQTQHVKEIFREESDDEDDDFDESEYASTRGCGFLPRFCLKGSLVLLNPVPGMRMQATSVRRVR

Query:  NSSTGSSKDAVNERRSSHGQGITRQKLEENASNIQEIDTFSLYRHLQGEGISKYPNEPSQAVHGNVNPSLGYTVKATNSVTNEYDEPRRRSLNSFHALLG
        NSSTGSSKDAVNERRSSHGQGITRQKLEENASNIQEIDTFSLYRHLQGEGISKYPNEPSQAVHGNVNPSLGYTVKAT+SV NEYDEPRRRSLNSFHALLG
Subjt:  NSSTGSSKDAVNERRSSHGQGITRQKLEENASNIQEIDTFSLYRHLQGEGISKYPNEPSQAVHGNVNPSLGYTVKATNSVTNEYDEPRRRSLNSFHALLG

Query:  DESGSASPVEKTLYIDSVHKITSPDSSSNSLDRKGISYSGDMIDDASIKSTEIKELCKLDSETQDVKNLNAVGEKSTLRPDSLKSMDSCLLTCSNRSLFE
        DESGSASPVEKTLYIDSVHKITSPDSSSNSLDRKGISYSGDMIDDASIKSTEIKELCKLDSETQDVKNLNAVGEKSTLRPDSLKSMDSCLLTCSNRSLFE
Subjt:  DESGSASPVEKTLYIDSVHKITSPDSSSNSLDRKGISYSGDMIDDASIKSTEIKELCKLDSETQDVKNLNAVGEKSTLRPDSLKSMDSCLLTCSNRSLFE

Query:  VKVDSNFSRLKPEHTQDAAKSTSSQFANNKKFNLENQFPLKPSSRGDANGLAKDNTTLVSSYGIHSEKVNLDRKQPEKSVYGRNNVIVPEYGKKHESDGE
        VKVDSNFSRLKPEHTQDAAKSTSSQFANNKKFNLENQFPLKPSSRGDANGLAKDNTTLVSSYGIHSEKVNLDRKQPEKSVYGRNNVIVPEYGKKHESDGE
Subjt:  VKVDSNFSRLKPEHTQDAAKSTSSQFANNKKFNLENQFPLKPSSRGDANGLAKDNTTLVSSYGIHSEKVNLDRKQPEKSVYGRNNVIVPEYGKKHESDGE

Query:  KLAPVEDSPGLRTSERATNGEKDSRNQLLKRVGNEDGSRGGYSQSRLRFAPPPPKSPSESWLKRTLPTSSRNTVFLQSSFAMQVNPVSTTASPDRKTPST
        KLAPVEDSPGLRTSERATNGEKDSRNQLLKRVGNEDGSRGGYSQSRLRFAPPPPKSPSESWLKRTLPTSSRNTVFLQSSFAMQVNPVSTTASPDRKTPST
Subjt:  KLAPVEDSPGLRTSERATNGEKDSRNQLLKRVGNEDGSRGGYSQSRLRFAPPPPKSPSESWLKRTLPTSSRNTVFLQSSFAMQVNPVSTTASPDRKTPST

Query:  VKSSDTNHLHLQFSKTL
        VKSSDTNHLHLQFSK L
Subjt:  VKSSDTNHLHLQFSKTL

KAG7011999.1 hypothetical protein SDJN02_26907 [Cucurbita argyrosperma subsp. argyrosperma] (e value: 0.0e+00; identity: 99.26%)
Query:  MISLRNLMEEKQLDLNQPLLSVRRFTSTGTSSKTNEKARPEPEIPRPPVYKSELKSGPVRNPGTVPFVWEQTPGKPKDESTVKTQSSKRPPLVPKLPPGR
        MISLRNLMEEKQLDLNQPLLSVRRFTSTGTSSKTNEKARPEPEIPRPPVYKSELKSGPVRNPGTVPFVWEQTPGKPKDESTV TQSSKRPPLVPKLPPGR
Subjt:  MISLRNLMEEKQLDLNQPLLSVRRFTSTGTSSKTNEKARPEPEIPRPPVYKSELKSGPVRNPGTVPFVWEQTPGKPKDESTVKTQSSKRPPLVPKLPPGR

Query:  AQKMNQEVPTSINASLDENARETVEELTSCKSGNDDEEEDDEVYRDANDIFSRSESFFLNCSISGVSGLDDSEIKPSGVSSMDPHTRNFMMDRFLPAAKA
        AQKMNQEVPTSINASLDENARETVEE+TSCKSGNDDEEEDDEVYRDANDIFSRSESFFLNCSISGVSGLDDSEIKPSGVSSMDPHTRNFMMDRFLPAAKA
Subjt:  AQKMNQEVPTSINASLDENARETVEELTSCKSGNDDEEEDDEVYRDANDIFSRSESFFLNCSISGVSGLDDSEIKPSGVSSMDPHTRNFMMDRFLPAAKA

Query:  MASETPPHTIRKQTVERPREIKLVTNGDRQSRPNLHVQTQHVKEIFREESDDEDDDFDESEYASTRGCGFLPRFCLKGSLVLLNPVPGMRMQATSVRRVR
        MASETPPHTIRKQTVERPREIKLVTNGDRQSRPNLHVQTQHVKEIFREESDDEDDDFDESEYASTRGCGFLPRFCLKGSLVLLNPVPGMRMQATSVRRVR
Subjt:  MASETPPHTIRKQTVERPREIKLVTNGDRQSRPNLHVQTQHVKEIFREESDDEDDDFDESEYASTRGCGFLPRFCLKGSLVLLNPVPGMRMQATSVRRVR

Query:  NSSTGSSKDAVNERRSSHGQGITRQKLEENASNIQEIDTFSLYRHLQGEGISKYPNEPSQAVHGNVNPSLGYTVKATNSVTNEYDEPRRRSLNSFHALLG
        NSSTGSSKDAVNERRSSHGQGITRQKLEENASNIQEIDTFSLYRHLQGEGISKYPNEPSQAVHGNVNPSLGYTVKATNSVTNEYDEPRRRSLNSFHALLG
Subjt:  NSSTGSSKDAVNERRSSHGQGITRQKLEENASNIQEIDTFSLYRHLQGEGISKYPNEPSQAVHGNVNPSLGYTVKATNSVTNEYDEPRRRSLNSFHALLG

Query:  DESGSASPVEKTLYIDSVHKITSPDSSSNSLDRKGISYSGDMIDDASIKSTEIKELCKLDSETQDVKNLNAVGEKSTLRPDSLKSMDSCLLTCSNRSLFE
        DESGSASPVEKTLYIDSVHKITSPDSSSNSLDRKGISYSGDMIDDASIKSTEIKELCKLDSETQDVKNLNAVGEKSTLRPDSLKSMDSCLLTCSNRSLFE
Subjt:  DESGSASPVEKTLYIDSVHKITSPDSSSNSLDRKGISYSGDMIDDASIKSTEIKELCKLDSETQDVKNLNAVGEKSTLRPDSLKSMDSCLLTCSNRSLFE

Query:  VKVDSNFSRLKPEHTQDAAKSTSSQFANNKKFNLENQFPLKPSSRGDANGLAKDNTTLVSSYGIHSEKVNLDRKQPEKSVYGRNNVIVPEYGKKHESDGE
        VKVDSNFSRLKPEHTQDAAK+TSSQFANNKKFNLENQFPLKPSSRGDANGLAKDNTT VSSYGIHSEKVNLDRKQPEKSVYGRNNVIVPEYGKKHESDGE
Subjt:  VKVDSNFSRLKPEHTQDAAKSTSSQFANNKKFNLENQFPLKPSSRGDANGLAKDNTTLVSSYGIHSEKVNLDRKQPEKSVYGRNNVIVPEYGKKHESDGE

Query:  KLAPVEDSPGLRTSERATNGEKDSRNQLLKRVGNEDGSRGGYSQSRLRFAPPPPKSPSESWLKRTLPTSSRNTVFLQSSF
        KLAPVEDSPGLRTSERATNGEKDSRNQLLKRVGNEDGSRGGYSQSRLRFAPPPPKSPSESWLKRTLPTSSRNTVFLQS F
Subjt:  KLAPVEDSPGLRTSERATNGEKDSRNQLLKRVGNEDGSRGGYSQSRLRFAPPPPKSPSESWLKRTLPTSSRNTVFLQSSF

XP_022952581.1 uncharacterized protein LOC111455232 [Cucurbita moschata] (e value: 0.0e+00; identity: 99.86%)
Query:  MISLRNLMEEKQLDLNQPLLSVRRFTSTGTSSKTNEKARPEPEIPRPPVYKSELKSGPVRNPGTVPFVWEQTPGKPKDESTVKTQSSKRPPLVPKLPPGR
        MISLRNLMEEKQLDLNQPLLSVRRFTSTGTSSKTNEKARPEPEIPRPPVYKSELKSGPVRNPGTVPFVWEQTPGKPKDESTVKTQSSKRPPLVPKLPPGR
Subjt:  MISLRNLMEEKQLDLNQPLLSVRRFTSTGTSSKTNEKARPEPEIPRPPVYKSELKSGPVRNPGTVPFVWEQTPGKPKDESTVKTQSSKRPPLVPKLPPGR

Query:  AQKMNQEVPTSINASLDENARETVEELTSCKSGNDDEEEDDEVYRDANDIFSRSESFFLNCSISGVSGLDDSEIKPSGVSSMDPHTRNFMMDRFLPAAKA
        AQKMNQEVPTSINASLDENARETVEELTSCKSGNDDEEEDDEVYRDANDIFSRSESFFLNCSISGVSGLDDSEIKPSGVSSMDPHTRNFMMDRFLPAAKA
Subjt:  AQKMNQEVPTSINASLDENARETVEELTSCKSGNDDEEEDDEVYRDANDIFSRSESFFLNCSISGVSGLDDSEIKPSGVSSMDPHTRNFMMDRFLPAAKA

Query:  MASETPPHTIRKQTVERPREIKLVTNGDRQSRPNLHVQTQHVKEIFREESDDEDDDFDESEYASTRGCGFLPRFCLKGSLVLLNPVPGMRMQATSVRRVR
        MASETPPHTIRKQTVERPREIKLVTNGDRQSRPNLHVQTQHVKEIFREESDDEDDDFDESEYASTRGCGFLPRFCLKGSLVLLNPVPGMRMQATSVRRVR
Subjt:  MASETPPHTIRKQTVERPREIKLVTNGDRQSRPNLHVQTQHVKEIFREESDDEDDDFDESEYASTRGCGFLPRFCLKGSLVLLNPVPGMRMQATSVRRVR

Query:  NSSTGSSKDAVNERRSSHGQGITRQKLEENASNIQEIDTFSLYRHLQGEGISKYPNEPSQAVHGNVNPSLGYTVKATNSVTNEYDEPRRRSLNSFHALLG
        NSSTGSSKDAVNERRSSHGQGITRQKLEENASNIQEIDTFSLYRHLQGEGISKYPNEPSQAVHGNVNPSLGYTVKATNSVTNEYDEPRRRSLNSFHALLG
Subjt:  NSSTGSSKDAVNERRSSHGQGITRQKLEENASNIQEIDTFSLYRHLQGEGISKYPNEPSQAVHGNVNPSLGYTVKATNSVTNEYDEPRRRSLNSFHALLG

Query:  DESGSASPVEKTLYIDSVHKITSPDSSSNSLDRKGISYSGDMIDDASIKSTEIKELCKLDSETQDVKNLNAVGEKSTLRPDSLKSMDSCLLTCSNRSLFE
        DESGSASPVEKTLYIDSVHKITSPDSSSNSLDRKGISYSGDMIDDASIKSTEIKELCKLDSETQDVKNLNAVGEKSTLRPDSLKSMDSCLLTCSNRSLFE
Subjt:  DESGSASPVEKTLYIDSVHKITSPDSSSNSLDRKGISYSGDMIDDASIKSTEIKELCKLDSETQDVKNLNAVGEKSTLRPDSLKSMDSCLLTCSNRSLFE

Query:  VKVDSNFSRLKPEHTQDAAKSTSSQFANNKKFNLENQFPLKPSSRGDANGLAKDNTTLVSSYGIHSEKVNLDRKQPEKSVYGRNNVIVPEYGKKHESDGE
        VKVDSNFSRLKPEHTQDAAKSTSSQFANNKKFNLENQFPLKPSSRGDANGLAKDNTTLVSSYGIHSEKVNLDRKQPEKSVYGRNNVIVPEYGKKHESDGE
Subjt:  VKVDSNFSRLKPEHTQDAAKSTSSQFANNKKFNLENQFPLKPSSRGDANGLAKDNTTLVSSYGIHSEKVNLDRKQPEKSVYGRNNVIVPEYGKKHESDGE

Query:  KLAPVEDSPGLRTSERATNGEKDSRNQLLKRVGNEDGSRGGYSQSRLRFAPPPPKSPSESWLKRTLPTSSRNTVFLQSSFAMQVNPVSTTASPDRKTPST
        KLAPVEDSPGLRTSERATNGEKDSRNQLLKRVGNEDGSRGGYSQSRLRFAPPPPKSPSESWLKRTLPTSSRNTVFLQSSFAMQVNPVSTTASPDRKTPST
Subjt:  KLAPVEDSPGLRTSERATNGEKDSRNQLLKRVGNEDGSRGGYSQSRLRFAPPPPKSPSESWLKRTLPTSSRNTVFLQSSFAMQVNPVSTTASPDRKTPST

Query:  VKSSDTNHLHLQFSKTL
        VKSSDTNHLHLQFSK L
Subjt:  VKSSDTNHLHLQFSKTL

XP_022968993.1 uncharacterized protein LOC111468130 [Cucurbita maxima] (e value: 0.0e+00; identity: 96.37%)
Query:  MISLRNLMEEKQLDLNQPLLSVRRFTSTGTSSKTNEKARPEPEIPRPPVYKSELKSGPVRNPGTVPFVWEQTPGKPKDESTVKTQSSKRPPLVPKLPPGR
        MISLRNLMEEKQLDLNQPLLSVRRFTSTGTSSKTNEKARPEPEIPRPPVYKSELKSGPVRNPGTVPFVWEQTPGKPKDESTVKTQ SKRPPLVPKLPPGR
Subjt:  MISLRNLMEEKQLDLNQPLLSVRRFTSTGTSSKTNEKARPEPEIPRPPVYKSELKSGPVRNPGTVPFVWEQTPGKPKDESTVKTQSSKRPPLVPKLPPGR

Query:  AQKMNQEVPTSINASLDENARETVEELTSCKSGNDDEEEDDEVYRDANDIFSRSESFFLNCSISGVSGLDDSEIKPSGVSSMDPHTRNFMMDRFLPAAKA
        AQKMNQEVPTSINASLDEN RETVEE+TSCKSGNDDEEEDDEVYRDANDIFSRSESFFLNCSISGVSGLDDSEIKPSGVSSMDPH+R+FMMDRFLPAAKA
Subjt:  AQKMNQEVPTSINASLDENARETVEELTSCKSGNDDEEEDDEVYRDANDIFSRSESFFLNCSISGVSGLDDSEIKPSGVSSMDPHTRNFMMDRFLPAAKA

Query:  MASETPPHTIRKQTVERPREIKLVTNGDRQSRPNLHVQTQHVKEIFREESDDEDDDFDESEYASTRGCGFLPRFCLKGSLVLLNPVPGMRMQATSVRRVR
        MASETPPHTIRKQTVERPREIKLV NGDRQSR NLHVQTQHVKEIFREESD EDDDFDES YASTRGCGFLPRFCLKGSLVL+NPVPGMRMQATSVRRVR
Subjt:  MASETPPHTIRKQTVERPREIKLVTNGDRQSRPNLHVQTQHVKEIFREESDDEDDDFDESEYASTRGCGFLPRFCLKGSLVLLNPVPGMRMQATSVRRVR

Query:  NSSTGSSKDAVNERRSSHGQGITRQKLEENASNIQEIDTFSLYRHLQGEGISKYPNEPSQAVHGNVNPSLGYTVKATNSVTNEYDEPRRRSLNSFHALLG
        NSSTGSSKDAVNERRSSHGQGITRQKLEENASNIQEIDTFSLYRHLQGEGISKYPNEPSQAVHGNVNPSLGYTVKAT+SVTNEYDEP RRSLNSFHALLG
Subjt:  NSSTGSSKDAVNERRSSHGQGITRQKLEENASNIQEIDTFSLYRHLQGEGISKYPNEPSQAVHGNVNPSLGYTVKATNSVTNEYDEPRRRSLNSFHALLG

Query:  DESGSASPVEKTLYIDSVHKITSPDSSSNSLDRKGISYSGDMIDDASIKSTEIKELCKLDSETQDVKNLNAVGEKSTLRPDSLKSMDSCLLTCSNRSLFE
        DESGSASPVEKTLYIDSVHKITSPDSSSNSLDRKGISY+GD+IDDASIKSTEIKELCKLDS TQDVKNLNAVGEK+TLRPDSLKSMDSCLLTCSNRSLFE
Subjt:  DESGSASPVEKTLYIDSVHKITSPDSSSNSLDRKGISYSGDMIDDASIKSTEIKELCKLDSETQDVKNLNAVGEKSTLRPDSLKSMDSCLLTCSNRSLFE

Query:  VKVDSNFSRLKPEHTQDAAKSTSSQFANNKKFNLENQFPLKPSSRGDANGLAKDNTTLVSSYGIHSEKVNLDRKQPEKSVYGRNNVIVPEYGKKHESDGE
        VKVDSNFSRLKPEHTQDAAKSTSSQF NNKKFNLENQFPLKPSSRGD NGLAKDNTTLVSSYGI SEKVNLD+KQPEKS+YGRN+VIVPEY KKHESDGE
Subjt:  VKVDSNFSRLKPEHTQDAAKSTSSQFANNKKFNLENQFPLKPSSRGDANGLAKDNTTLVSSYGIHSEKVNLDRKQPEKSVYGRNNVIVPEYGKKHESDGE

Query:  KLAPVEDSPGLRTSERATNGEKDSRNQLLKRVGNEDGSRGGYSQSRLRFAPPPPKSPSESWLKRTLPTSSRNTVFLQSSFAMQVNPVSTTASPDRKTPST
        KL+PVEDSPGLRTSERATNGEKDSRNQLLKRVGNEDGSRGGYSQSRLRFAPPPPKSPSESWLKRTLPTSSRNTVFLQSSFAMQVNPVSTTASPDRKTPST
Subjt:  KLAPVEDSPGLRTSERATNGEKDSRNQLLKRVGNEDGSRGGYSQSRLRFAPPPPKSPSESWLKRTLPTSSRNTVFLQSSFAMQVNPVSTTASPDRKTPST

Query:  VKSSDTNHLHLQFSKTL
        VKSSDT HLHLQFSK L
Subjt:  VKSSDTNHLHLQFSKTL

XP_023511564.1 uncharacterized protein LOC111776366 [Cucurbita pepo subsp. pepo] (e value: 0.0e+00; identity: 97.21%)
Query:  MISLRNLMEEKQLDLNQPLLSVRRFTSTGTSSKTNEKARPEPEIPRPPVYKSELKSGPVRNPGTVPFVWEQTPGKPKDESTVKTQSSKRPPLVPKLPPGR
        MISLRNLMEEKQLDLNQPLLSVRRFTSTGTSSKTNEK RPEPEIPRPPVYKSELKSGPVRNPGTVPFVWEQTPGKPKDESTVKTQ SKRPPLVPKLPPGR
Subjt:  MISLRNLMEEKQLDLNQPLLSVRRFTSTGTSSKTNEKARPEPEIPRPPVYKSELKSGPVRNPGTVPFVWEQTPGKPKDESTVKTQSSKRPPLVPKLPPGR

Query:  AQKMNQEVPTSINASLDENARETVEELTSCKSGN-DDEEEDDEVYRDANDIFSRSESFFLNCSISGVSGLDDSEIKPSGVSSMDPHTRNFMMDRFLPAAK
        AQKMNQEVPTSINASLDENARETVEE+TSCKSGN DD+EEDDEVYRDANDIFSRSESFFLNCSISGVSGLDDSEIKPSGVSSMDPHTR+FMMDRFLPAAK
Subjt:  AQKMNQEVPTSINASLDENARETVEELTSCKSGN-DDEEEDDEVYRDANDIFSRSESFFLNCSISGVSGLDDSEIKPSGVSSMDPHTRNFMMDRFLPAAK

Query:  AMASETPPHTIRKQTVERPREIKLVTNGDRQSRPNLHVQTQHVKEIFREESDDEDDDFDESEYASTRGCGFLPRFCLKGSLVLLNPVPGMRMQATSVRRV
        AMASETPPHTIRKQTVERPREIKLVTNGDRQSRPNLHVQTQHVKEIFREESDDEDDDFDES YASTRGCGFLPRFCLKGSLVLLNPVPGMRMQATSVRRV
Subjt:  AMASETPPHTIRKQTVERPREIKLVTNGDRQSRPNLHVQTQHVKEIFREESDDEDDDFDESEYASTRGCGFLPRFCLKGSLVLLNPVPGMRMQATSVRRV

Query:  RNSSTGSSKDAVNERRSSHGQGITRQKLEENASNIQEIDTFSLYRHLQGEGISKYPNEPSQAVHGNVNPSLGYTVKATNSVTNEYDEPRRRSLNSFHALL
        RNSSTGSSKDAVNERRSSHGQGITRQKLEENASNIQEIDTF LYRHLQGEGISKYP EPSQAVHGNVNPSLGYTVKAT+SVTNEYDEPRRRSLNSFHALL
Subjt:  RNSSTGSSKDAVNERRSSHGQGITRQKLEENASNIQEIDTFSLYRHLQGEGISKYPNEPSQAVHGNVNPSLGYTVKATNSVTNEYDEPRRRSLNSFHALL

Query:  GDESGSASPVEKTLYIDSVHKITSPDSSSNSLDRKGISYSGDMIDDASIKSTEIKELCKLDSETQDVKNLNAVGEKSTLRPDSLKSMDSCLLTCSNRSLF
        GDESGSASPVEKTLYIDSVHKITSPDSSSNSLDRKGISYSGDMIDDASIKSTEIKELCKLDS TQDVKNLN VG K+TLRPDSLKSMDSCLLTCSNRSLF
Subjt:  GDESGSASPVEKTLYIDSVHKITSPDSSSNSLDRKGISYSGDMIDDASIKSTEIKELCKLDSETQDVKNLNAVGEKSTLRPDSLKSMDSCLLTCSNRSLF

Query:  EVKVDSNFSRLKPEHTQDAAKSTSSQFANNKKFNLENQFPLKPSSRGDANGLAKDNTTLVSSYGIHSEKVNLDRKQPEKSVYGRNNVIVPEYGKKHESDG
        EVKVDSNFSRLKPEHTQDAAKSTSSQFANNKKFNLENQFPLKPSSRGDANGLAKDNTTLVSSY IHSEKVNLDRKQPEKSVYGRNNVIVPEYGKKHESDG
Subjt:  EVKVDSNFSRLKPEHTQDAAKSTSSQFANNKKFNLENQFPLKPSSRGDANGLAKDNTTLVSSYGIHSEKVNLDRKQPEKSVYGRNNVIVPEYGKKHESDG

Query:  EKLAPVEDSPGLRTSERATNGEKDSRNQLLKRVGNEDGSRGGYSQSRLRFAPPPPKSPSESWLKRTLPTSSRNTVFLQSSFAMQVNPVSTTASPDRKTPS
        EK APVEDSPGLRTSERATNGEKDSRNQ LKRVGNEDGSRGGYSQSRLRFAPPPPKSPSESWLKRTLPTSSRNTVFLQSSFAMQ+NPVSTTASPDRKTPS
Subjt:  EKLAPVEDSPGLRTSERATNGEKDSRNQLLKRVGNEDGSRGGYSQSRLRFAPPPPKSPSESWLKRTLPTSSRNTVFLQSSFAMQVNPVSTTASPDRKTPS

Query:  TVKSSDTNHLHLQFSKTL
        TVKSSD NHLHLQFSK L
Subjt:  TVKSSDTNHLHLQFSKTL

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K1U5 Uncharacterized protein (e value: 2.9e-305; identity: 78.71%)
Query:  MISLRNLMEEKQLDLNQPLLSVRRFTSTGTSSKTNEKARPEPEIPRPPVYKSELKSGPVRNPGTVPFVWEQTPGKPKDESTVKTQSSKRPPLVPKLPPGR
        MISLRNLMEEKQLD NQPLLSVRRFTST TS +TNEK RPE +IP  PVYKSELKSGPVR PGTVPF+WE+TPGKPKDES+  TQ  KRPPLVPKLPPGR
Subjt:  MISLRNLMEEKQLDLNQPLLSVRRFTSTGTSSKTNEKARPEPEIPRPPVYKSELKSGPVRNPGTVPFVWEQTPGKPKDESTVKTQSSKRPPLVPKLPPGR

Query:  AQKMNQEVPTSINASLDENARETVEELTSCKSGNDDEEEDDEVYRDANDIFSRSESFFLNCSISGVSGLDDSEIKPSGVSSMDPHTRNFMMDRFLPAAKA
         QK+NQEV TS+NASLDENARE VEE+TSCKSGNDDEEE++EVYRDAND FSRSESFFLNCSISGVSGLDDSEIKPS  SSMDPHTR+FMMDRFLPAAKA
Subjt:  AQKMNQEVPTSINASLDENARETVEELTSCKSGNDDEEEDDEVYRDANDIFSRSESFFLNCSISGVSGLDDSEIKPSGVSSMDPHTRNFMMDRFLPAAKA

Query:  MASETPPHTIRKQTV--ERPREIKLVTNGDRQSRPNLHVQTQHVKEIFREESDDEDDDFDESEYASTRGCGFLPRFCLKGSLVLLNPVPGMRMQATSVRR
        MASETPPHTIRKQTV  ERPRE+KLVTN DRQSRPNLHVQT+HVKEIF EESDDEDDD+DES Y+ST+GCGFLPRFCLKGS  LLNPVPGMRMQATSVRR
Subjt:  MASETPPHTIRKQTV--ERPREIKLVTNGDRQSRPNLHVQTQHVKEIFREESDDEDDDFDESEYASTRGCGFLPRFCLKGSLVLLNPVPGMRMQATSVRR

Query:  VRNSSTGSSKDAVNERRSSHGQGITRQKLEENA---------SNIQEIDTFSLYRHLQGEGISKYPNEPSQAVHGNVNPSLGYTVKATNSVTNEYDEPRR
        +RNSS G SKDAVNERR  HGQGIT+Q+LEENA         SNIQE D FSLYRHLQ E +S YPNEPSQAVH NVNPSL Y  KAT SVTNEY+E RR
Subjt:  VRNSSTGSSKDAVNERRSSHGQGITRQKLEENA---------SNIQEIDTFSLYRHLQGEGISKYPNEPSQAVHGNVNPSLGYTVKATNSVTNEYDEPRR

Query:  RSLNSFHALLGDESGSASPVEKTLYIDSVHKITSPDSSSNSLDRKGISYSGDMIDDASIKSTEIKELCKLDSETQDVKNLNAVGEKSTLRPDSLKSMDSC
        RSLNSF ALL DESGSASPVEKTLYIDSVHKI SP SSSNSLD KGISYSGDMIDD  IKSTE+KELC LDS T DVKN+N VGEK+  RPDSLKS+DSC
Subjt:  RSLNSFHALLGDESGSASPVEKTLYIDSVHKITSPDSSSNSLDRKGISYSGDMIDDASIKSTEIKELCKLDSETQDVKNLNAVGEKSTLRPDSLKSMDSC

Query:  LLTCSNRSLFEVKVDSNFSRLKPEHTQDAAKSTSSQFANNKKFNLENQFPLKPSSRGDANGLAKDNTTLVSSYGIHSEKVNLDRKQPEKSVYGRNNVIVP
        L TCS+ SLF+VK+D+N+SRLKPEHTQDAAK TSS+FA NKKF+LENQFPLKPSSR D+N L KDNT                RKQPEKS Y  NNV  P
Subjt:  LLTCSNRSLFEVKVDSNFSRLKPEHTQDAAKSTSSQFANNKKFNLENQFPLKPSSRGDANGLAKDNTTLVSSYGIHSEKVNLDRKQPEKSVYGRNNVIVP

Query:  EYGKKHESDGEKLAPVEDSPGLRTSERATNGEKDSRNQLLKRVGNEDGSRGGYSQSRLRFAPPPPKSPSESWLKRTLPTSSRNTVFLQSSFAMQVNPVST
        EYGKKHE D EK+ PVE S  LRTSE ATNG KDSRN+  KRVGN DGS  GYSQ RL FAPPPPKSPSESWLKRTLPTSSRNT FLQSSFAM+VNP+S 
Subjt:  EYGKKHESDGEKLAPVEDSPGLRTSERATNGEKDSRNQLLKRVGNEDGSRGGYSQSRLRFAPPPPKSPSESWLKRTLPTSSRNTVFLQSSFAMQVNPVST

Query:  TASPDRKTPSTVKSSDTNHLHLQFSKTL
        T SP+    STV++ DTN+LHLQFSK L
Subjt:  TASPDRKTPSTVKSSDTNHLHLQFSKTL

A0A1S3C0L4 uncharacterized protein LOC103495498 (e value: 1.2e-303; identity: 78.85%)
Query:  MISLRNLMEEKQLDLNQPLLSVRRFTSTGTSSKTNEKARPEPEIPRPPVYKSELKSGPVRNPGTVPFVWEQTPGKPKDESTVKTQSSKRPPLVPKLPPGR
        MISLRNLMEEKQLD NQPLLSVRRFTST TSS+TNEK RPE +IP  PVYKSELKSGPVR PGTVPFVWE+TPGKPKDES+  TQ SKRPPLVPKLPPGR
Subjt:  MISLRNLMEEKQLDLNQPLLSVRRFTSTGTSSKTNEKARPEPEIPRPPVYKSELKSGPVRNPGTVPFVWEQTPGKPKDESTVKTQSSKRPPLVPKLPPGR

Query:  AQKMNQEVPTSINASLDENARETVEELTSCKSGNDDEEEDDEVYRDANDIFSRSESFFLNCSISGVSGLDDSEIKPSGVSSMDPHTRNFMMDRFLPAAKA
         QK+NQEVPTS+NAS DENARE VEE+ SC SGND+EEE+DEVYRDAND FSRSESFFLNCSISGVSGLDDSEIKPS  SSMDPHTR+FMMDRFLPAAKA
Subjt:  AQKMNQEVPTSINASLDENARETVEELTSCKSGNDDEEEDDEVYRDANDIFSRSESFFLNCSISGVSGLDDSEIKPSGVSSMDPHTRNFMMDRFLPAAKA

Query:  MASETPPHTIRKQT--VERPREIKLVTNGDRQSRPNLHVQTQHVKEIFREESDDEDDDFDESEYASTRGCGFLPRFCLKGSLVLLNPVPGMRMQATSVRR
        MASETPPHTIRKQT  VERPRE+KLVTN DRQSRPNLHVQT+HVKEIFREESDDEDDD+DES Y+ST+GCGFLPRFCLKGS  LLNPVPGMRMQATSVRR
Subjt:  MASETPPHTIRKQT--VERPREIKLVTNGDRQSRPNLHVQTQHVKEIFREESDDEDDDFDESEYASTRGCGFLPRFCLKGSLVLLNPVPGMRMQATSVRR

Query:  VRNSSTGSSKDAVNERRSSHGQGITRQKLEENA---------SNIQEIDTFSLYRHLQGEGISKYPNEPSQAVHGNVNPSLGYTVKATNSVTNEYDEPRR
        +RNSS GSSKDAVNER+  HGQGIT+Q+LEENA         SNIQE D FSLYRHLQGE +S YPNEPSQAVH NVN SLG+T KAT SVTNE++E RR
Subjt:  VRNSSTGSSKDAVNERRSSHGQGITRQKLEENA---------SNIQEIDTFSLYRHLQGEGISKYPNEPSQAVHGNVNPSLGYTVKATNSVTNEYDEPRR

Query:  RSLNSFHALLGDESGSASPVEKTLYIDSVHKITSPDSSSNSLDRKGISYSGDMIDDASIKSTEIKELCKLDSETQDVKNLNAVGEKSTLRPDSLKSMDSC
        RSLNSF ALL DESGS SPVEKTLYIDSVHKI SP SSSNSLD KGISYSGDMIDD  IKSTE+KELC LDS T DVKN+N VGE++  RPDSLKS+DSC
Subjt:  RSLNSFHALLGDESGSASPVEKTLYIDSVHKITSPDSSSNSLDRKGISYSGDMIDDASIKSTEIKELCKLDSETQDVKNLNAVGEKSTLRPDSLKSMDSC

Query:  LLTCSNRSLFEVKVDSNFSRLKPEHTQDAAKSTSSQFANNKKFNLENQFPLKPSSRGDANGLAKDNTTLVSSYGIHSEKVNLDRKQPEKSVYGRNNVIVP
        L TCS+ SLF+VK+D+++SRLKPEHTQDAAK TSS+FA NKKF+LENQFPLKP SR D+N L KDNT                RKQPEKS Y  NNVI P
Subjt:  LLTCSNRSLFEVKVDSNFSRLKPEHTQDAAKSTSSQFANNKKFNLENQFPLKPSSRGDANGLAKDNTTLVSSYGIHSEKVNLDRKQPEKSVYGRNNVIVP

Query:  EYGKKHESDGEKLAPVEDSPGLRTSERATNGEKDSRNQLLKRVGNEDGSRGGYSQSRLRFAPPPPKSPSESWLKRTLPTSSRNTVFLQSSFAMQVNPVST
        EYG KHE D EKL PVE S  LRTSE ATNG  DSRN   KRVGNEDGS  GYSQ RL FAPPPPKSPSESWLKRTLPTSSRNT FLQSSFAM+VN VS 
Subjt:  EYGKKHESDGEKLAPVEDSPGLRTSERATNGEKDSRNQLLKRVGNEDGSRGGYSQSRLRFAPPPPKSPSESWLKRTLPTSSRNTVFLQSSFAMQVNPVST

Query:  TASPDRKTPSTVKSSDTNHLHLQFSKTL
        TASP+    STVK+ DTN+LHLQFSK L
Subjt:  TASPDRKTPSTVKSSDTNHLHLQFSKTL

A0A5A7SPV3 Uncharacterized protein (e value: 5.5e-304; identity: 78.85%)
Query:  MISLRNLMEEKQLDLNQPLLSVRRFTSTGTSSKTNEKARPEPEIPRPPVYKSELKSGPVRNPGTVPFVWEQTPGKPKDESTVKTQSSKRPPLVPKLPPGR
        MISLRNLMEEKQLD NQPLLSVRRFTST TSS+TNEK RPE +IP  PVYKSELKSGPVR PGTVPFVWE+TPGKPKDES+  TQ SKRPPLVPKLPPGR
Subjt:  MISLRNLMEEKQLDLNQPLLSVRRFTSTGTSSKTNEKARPEPEIPRPPVYKSELKSGPVRNPGTVPFVWEQTPGKPKDESTVKTQSSKRPPLVPKLPPGR

Query:  AQKMNQEVPTSINASLDENARETVEELTSCKSGNDDEEEDDEVYRDANDIFSRSESFFLNCSISGVSGLDDSEIKPSGVSSMDPHTRNFMMDRFLPAAKA
         QK+NQEV TS+NASLDEN RE VEE+ SC SGND+EEE+DEVYRDAND FSRSESFFLNCSISGVSGLDDSEIKPS  SSMDPHTR+FMMDRFLPAAKA
Subjt:  AQKMNQEVPTSINASLDENARETVEELTSCKSGNDDEEEDDEVYRDANDIFSRSESFFLNCSISGVSGLDDSEIKPSGVSSMDPHTRNFMMDRFLPAAKA

Query:  MASETPPHTIRKQTV--ERPREIKLVTNGDRQSRPNLHVQTQHVKEIFREESDDEDDDFDESEYASTRGCGFLPRFCLKGSLVLLNPVPGMRMQATSVRR
        MASETPPHTIRKQTV  ERPRE+KLVTN DRQSRPNLHVQT+HVKEIFREESDDEDDD+DES Y+ST+GCGFLPRFCLKGS  LLNPVPGMRMQATSVRR
Subjt:  MASETPPHTIRKQTV--ERPREIKLVTNGDRQSRPNLHVQTQHVKEIFREESDDEDDDFDESEYASTRGCGFLPRFCLKGSLVLLNPVPGMRMQATSVRR

Query:  VRNSSTGSSKDAVNERRSSHGQGITRQKLEENA---------SNIQEIDTFSLYRHLQGEGISKYPNEPSQAVHGNVNPSLGYTVKATNSVTNEYDEPRR
        +RNSS GSSKDAVNER+  HGQGIT+Q+LEENA         SNIQE D FSLYRHLQGE +S YPNEPSQAVH NVN SLG+T KAT SVTNE++E RR
Subjt:  VRNSSTGSSKDAVNERRSSHGQGITRQKLEENA---------SNIQEIDTFSLYRHLQGEGISKYPNEPSQAVHGNVNPSLGYTVKATNSVTNEYDEPRR

Query:  RSLNSFHALLGDESGSASPVEKTLYIDSVHKITSPDSSSNSLDRKGISYSGDMIDDASIKSTEIKELCKLDSETQDVKNLNAVGEKSTLRPDSLKSMDSC
        RSLNSF ALL DESGS SPVEKTLYIDSVHKI SP SSSNSLD KGISYSGDMIDD  IKSTE+KELC LDS T DVKN+N VGE++  RPDSLK++DSC
Subjt:  RSLNSFHALLGDESGSASPVEKTLYIDSVHKITSPDSSSNSLDRKGISYSGDMIDDASIKSTEIKELCKLDSETQDVKNLNAVGEKSTLRPDSLKSMDSC

Query:  LLTCSNRSLFEVKVDSNFSRLKPEHTQDAAKSTSSQFANNKKFNLENQFPLKPSSRGDANGLAKDNTTLVSSYGIHSEKVNLDRKQPEKSVYGRNNVIVP
        L TCS+ SLF+VK+D+++SRLKPEHTQDAAK TSS+FA NKKF+LENQFPLKPSSR D+N L KDNT                RKQPEKS Y  NNVI P
Subjt:  LLTCSNRSLFEVKVDSNFSRLKPEHTQDAAKSTSSQFANNKKFNLENQFPLKPSSRGDANGLAKDNTTLVSSYGIHSEKVNLDRKQPEKSVYGRNNVIVP

Query:  EYGKKHESDGEKLAPVEDSPGLRTSERATNGEKDSRNQLLKRVGNEDGSRGGYSQSRLRFAPPPPKSPSESWLKRTLPTSSRNTVFLQSSFAMQVNPVST
        EYG KHE D EKL PVE S  LRTSE ATNG  DSRN   KRVGNEDGS  GYSQ RL FAPPPPKSPSESWLKRTLPTSSRNT FLQSSFAM+VNPVS 
Subjt:  EYGKKHESDGEKLAPVEDSPGLRTSERATNGEKDSRNQLLKRVGNEDGSRGGYSQSRLRFAPPPPKSPSESWLKRTLPTSSRNTVFLQSSFAMQVNPVST

Query:  TASPDRKTPSTVKSSDTNHLHLQFSKTL
        TASP+    STVK+ DTN+LHLQFSK L
Subjt:  TASPDRKTPSTVKSSDTNHLHLQFSKTL

A0A6J1GL08 uncharacterized protein LOC111455232 (e value: 0.0e+00; identity: 99.86%)
Query:  MISLRNLMEEKQLDLNQPLLSVRRFTSTGTSSKTNEKARPEPEIPRPPVYKSELKSGPVRNPGTVPFVWEQTPGKPKDESTVKTQSSKRPPLVPKLPPGR
        MISLRNLMEEKQLDLNQPLLSVRRFTSTGTSSKTNEKARPEPEIPRPPVYKSELKSGPVRNPGTVPFVWEQTPGKPKDESTVKTQSSKRPPLVPKLPPGR
Subjt:  MISLRNLMEEKQLDLNQPLLSVRRFTSTGTSSKTNEKARPEPEIPRPPVYKSELKSGPVRNPGTVPFVWEQTPGKPKDESTVKTQSSKRPPLVPKLPPGR

Query:  AQKMNQEVPTSINASLDENARETVEELTSCKSGNDDEEEDDEVYRDANDIFSRSESFFLNCSISGVSGLDDSEIKPSGVSSMDPHTRNFMMDRFLPAAKA
        AQKMNQEVPTSINASLDENARETVEELTSCKSGNDDEEEDDEVYRDANDIFSRSESFFLNCSISGVSGLDDSEIKPSGVSSMDPHTRNFMMDRFLPAAKA
Subjt:  AQKMNQEVPTSINASLDENARETVEELTSCKSGNDDEEEDDEVYRDANDIFSRSESFFLNCSISGVSGLDDSEIKPSGVSSMDPHTRNFMMDRFLPAAKA

Query:  MASETPPHTIRKQTVERPREIKLVTNGDRQSRPNLHVQTQHVKEIFREESDDEDDDFDESEYASTRGCGFLPRFCLKGSLVLLNPVPGMRMQATSVRRVR
        MASETPPHTIRKQTVERPREIKLVTNGDRQSRPNLHVQTQHVKEIFREESDDEDDDFDESEYASTRGCGFLPRFCLKGSLVLLNPVPGMRMQATSVRRVR
Subjt:  MASETPPHTIRKQTVERPREIKLVTNGDRQSRPNLHVQTQHVKEIFREESDDEDDDFDESEYASTRGCGFLPRFCLKGSLVLLNPVPGMRMQATSVRRVR

Query:  NSSTGSSKDAVNERRSSHGQGITRQKLEENASNIQEIDTFSLYRHLQGEGISKYPNEPSQAVHGNVNPSLGYTVKATNSVTNEYDEPRRRSLNSFHALLG
        NSSTGSSKDAVNERRSSHGQGITRQKLEENASNIQEIDTFSLYRHLQGEGISKYPNEPSQAVHGNVNPSLGYTVKATNSVTNEYDEPRRRSLNSFHALLG
Subjt:  NSSTGSSKDAVNERRSSHGQGITRQKLEENASNIQEIDTFSLYRHLQGEGISKYPNEPSQAVHGNVNPSLGYTVKATNSVTNEYDEPRRRSLNSFHALLG

Query:  DESGSASPVEKTLYIDSVHKITSPDSSSNSLDRKGISYSGDMIDDASIKSTEIKELCKLDSETQDVKNLNAVGEKSTLRPDSLKSMDSCLLTCSNRSLFE
        DESGSASPVEKTLYIDSVHKITSPDSSSNSLDRKGISYSGDMIDDASIKSTEIKELCKLDSETQDVKNLNAVGEKSTLRPDSLKSMDSCLLTCSNRSLFE
Subjt:  DESGSASPVEKTLYIDSVHKITSPDSSSNSLDRKGISYSGDMIDDASIKSTEIKELCKLDSETQDVKNLNAVGEKSTLRPDSLKSMDSCLLTCSNRSLFE

Query:  VKVDSNFSRLKPEHTQDAAKSTSSQFANNKKFNLENQFPLKPSSRGDANGLAKDNTTLVSSYGIHSEKVNLDRKQPEKSVYGRNNVIVPEYGKKHESDGE
        VKVDSNFSRLKPEHTQDAAKSTSSQFANNKKFNLENQFPLKPSSRGDANGLAKDNTTLVSSYGIHSEKVNLDRKQPEKSVYGRNNVIVPEYGKKHESDGE
Subjt:  VKVDSNFSRLKPEHTQDAAKSTSSQFANNKKFNLENQFPLKPSSRGDANGLAKDNTTLVSSYGIHSEKVNLDRKQPEKSVYGRNNVIVPEYGKKHESDGE

Query:  KLAPVEDSPGLRTSERATNGEKDSRNQLLKRVGNEDGSRGGYSQSRLRFAPPPPKSPSESWLKRTLPTSSRNTVFLQSSFAMQVNPVSTTASPDRKTPST
        KLAPVEDSPGLRTSERATNGEKDSRNQLLKRVGNEDGSRGGYSQSRLRFAPPPPKSPSESWLKRTLPTSSRNTVFLQSSFAMQVNPVSTTASPDRKTPST
Subjt:  KLAPVEDSPGLRTSERATNGEKDSRNQLLKRVGNEDGSRGGYSQSRLRFAPPPPKSPSESWLKRTLPTSSRNTVFLQSSFAMQVNPVSTTASPDRKTPST

Query:  VKSSDTNHLHLQFSKTL
        VKSSDTNHLHLQFSK L
Subjt:  VKSSDTNHLHLQFSKTL

A0A6J1HZQ3 uncharacterized protein LOC111468130 (e value: 0.0e+00; identity: 96.37%)
Query:  MISLRNLMEEKQLDLNQPLLSVRRFTSTGTSSKTNEKARPEPEIPRPPVYKSELKSGPVRNPGTVPFVWEQTPGKPKDESTVKTQSSKRPPLVPKLPPGR
        MISLRNLMEEKQLDLNQPLLSVRRFTSTGTSSKTNEKARPEPEIPRPPVYKSELKSGPVRNPGTVPFVWEQTPGKPKDESTVKTQ SKRPPLVPKLPPGR
Subjt:  MISLRNLMEEKQLDLNQPLLSVRRFTSTGTSSKTNEKARPEPEIPRPPVYKSELKSGPVRNPGTVPFVWEQTPGKPKDESTVKTQSSKRPPLVPKLPPGR

Query:  AQKMNQEVPTSINASLDENARETVEELTSCKSGNDDEEEDDEVYRDANDIFSRSESFFLNCSISGVSGLDDSEIKPSGVSSMDPHTRNFMMDRFLPAAKA
        AQKMNQEVPTSINASLDEN RETVEE+TSCKSGNDDEEEDDEVYRDANDIFSRSESFFLNCSISGVSGLDDSEIKPSGVSSMDPH+R+FMMDRFLPAAKA
Subjt:  AQKMNQEVPTSINASLDENARETVEELTSCKSGNDDEEEDDEVYRDANDIFSRSESFFLNCSISGVSGLDDSEIKPSGVSSMDPHTRNFMMDRFLPAAKA

Query:  MASETPPHTIRKQTVERPREIKLVTNGDRQSRPNLHVQTQHVKEIFREESDDEDDDFDESEYASTRGCGFLPRFCLKGSLVLLNPVPGMRMQATSVRRVR
        MASETPPHTIRKQTVERPREIKLV NGDRQSR NLHVQTQHVKEIFREESD EDDDFDES YASTRGCGFLPRFCLKGSLVL+NPVPGMRMQATSVRRVR
Subjt:  MASETPPHTIRKQTVERPREIKLVTNGDRQSRPNLHVQTQHVKEIFREESDDEDDDFDESEYASTRGCGFLPRFCLKGSLVLLNPVPGMRMQATSVRRVR

Query:  NSSTGSSKDAVNERRSSHGQGITRQKLEENASNIQEIDTFSLYRHLQGEGISKYPNEPSQAVHGNVNPSLGYTVKATNSVTNEYDEPRRRSLNSFHALLG
        NSSTGSSKDAVNERRSSHGQGITRQKLEENASNIQEIDTFSLYRHLQGEGISKYPNEPSQAVHGNVNPSLGYTVKAT+SVTNEYDEP RRSLNSFHALLG
Subjt:  NSSTGSSKDAVNERRSSHGQGITRQKLEENASNIQEIDTFSLYRHLQGEGISKYPNEPSQAVHGNVNPSLGYTVKATNSVTNEYDEPRRRSLNSFHALLG

Query:  DESGSASPVEKTLYIDSVHKITSPDSSSNSLDRKGISYSGDMIDDASIKSTEIKELCKLDSETQDVKNLNAVGEKSTLRPDSLKSMDSCLLTCSNRSLFE
        DESGSASPVEKTLYIDSVHKITSPDSSSNSLDRKGISY+GD+IDDASIKSTEIKELCKLDS TQDVKNLNAVGEK+TLRPDSLKSMDSCLLTCSNRSLFE
Subjt:  DESGSASPVEKTLYIDSVHKITSPDSSSNSLDRKGISYSGDMIDDASIKSTEIKELCKLDSETQDVKNLNAVGEKSTLRPDSLKSMDSCLLTCSNRSLFE

Query:  VKVDSNFSRLKPEHTQDAAKSTSSQFANNKKFNLENQFPLKPSSRGDANGLAKDNTTLVSSYGIHSEKVNLDRKQPEKSVYGRNNVIVPEYGKKHESDGE
        VKVDSNFSRLKPEHTQDAAKSTSSQF NNKKFNLENQFPLKPSSRGD NGLAKDNTTLVSSYGI SEKVNLD+KQPEKS+YGRN+VIVPEY KKHESDGE
Subjt:  VKVDSNFSRLKPEHTQDAAKSTSSQFANNKKFNLENQFPLKPSSRGDANGLAKDNTTLVSSYGIHSEKVNLDRKQPEKSVYGRNNVIVPEYGKKHESDGE

Query:  KLAPVEDSPGLRTSERATNGEKDSRNQLLKRVGNEDGSRGGYSQSRLRFAPPPPKSPSESWLKRTLPTSSRNTVFLQSSFAMQVNPVSTTASPDRKTPST
        KL+PVEDSPGLRTSERATNGEKDSRNQLLKRVGNEDGSRGGYSQSRLRFAPPPPKSPSESWLKRTLPTSSRNTVFLQSSFAMQVNPVSTTASPDRKTPST
Subjt:  KLAPVEDSPGLRTSERATNGEKDSRNQLLKRVGNEDGSRGGYSQSRLRFAPPPPKSPSESWLKRTLPTSSRNTVFLQSSFAMQVNPVSTTASPDRKTPST

Query:  VKSSDTNHLHLQFSKTL
        VKSSDT HLHLQFSK L
Subjt:  VKSSDTNHLHLQFSKTL

SwissProt top hits (e value, % identity, alignment)
Q9LJG8 Trihelix transcription factor ASIL2 (e value: 4.7e-26; identity: 29.86%)
Query:  RRQPSGLTVALMALSL----------PSPAPVREDCWTLYDTSTLIEAWGERHIDLNRGSLRLKHWQEIANAVNSSHGHERKSIRTAIQCKNRIDTLKRK
        ++ P  L +AL+ +            P+    REDCW+   T+ LI+AWGER+++L+RG+L+ KHW+E+A  V+S   +  K  +T IQCKNRIDT+K+K
Subjt:  RRQPSGLTVALMALSL----------PSPAPVREDCWTLYDTSTLIEAWGERHIDLNRGSLRLKHWQEIANAVNSSHGHERKSIRTAIQCKNRIDTLKRK

Query:  YKIEKARIQESGGSYDCAWPFFSCLDDLIGNNHKAST---------------PVAV----------SNCKAAPV----------TTPRLSLFS-------
        YK EK RI   GG     W FF  LD LIG+  K  T               P+ +             KAA             T R+S  S       
Subjt:  YKIEKARIQESGGSYDCAWPFFSCLDDLIGNNHKAST---------------PVAV----------SNCKAAPV----------TTPRLSLFS-------

Query:  ----------KVPVAPRS---GTKKRRSTHVYRSF------------CDS-----YLRRDVISNENEGKNSMESDNSLS------SSRFKDREA------
                   +P++ RS   G + R      R+             C       + +R+   +++E + +M  D+  S      S R K  E       
Subjt:  ----------KVPVAPRS---GTKKRRSTHVYRSF------------CDS-----YLRRDVISNENEGKNSMESDNSLS------SSRFKDREA------

Query:  ----GYRKLAEAIGTITDIYERVEVAKQRHMIELEMQRMQFVKDLEYQRMQLLMEMQLQIQKIKR
             +R+L  AI    + YE+ E AK + ++E+E +RM+F+K+LE QRMQ  ++ QL+I ++K+
Subjt:  ----GYRKLAEAIGTITDIYERVEVAKQRHMIELEMQRMQFVKDLEYQRMQLLMEMQLQIQKIKR

Q9SYG2 Trihelix transcription factor ASIL1 (e value: 4.9e-23; identity: 34.94%)
Query:  REDCWTLYDTSTLIEAWGERHIDLNRGSLRLKHWQEIANAVNSSHGHERKSIRTAIQCKNRIDTLKRKYKIEKARIQESGGSYDCAWPFFSCLDDLIGNN
        R+DCW+   T  LIEAWG+R  +  +G+L+ +HW+E+A  VN S   + K  +T IQCKNRIDT+K+KYK EKA+I  + G     W FF  L+ LIG  
Subjt:  REDCWTLYDTSTLIEAWGERHIDLNRGSLRLKHWQEIANAVNSSHGHERKSIRTAIQCKNRIDTLKRKYKIEKARIQESGGSYDCAWPFFSCLDDLIGNN

Query:  HK--ASTPVAVSNCKAAPVTTPRLSLF---SKVPVAPRSGTKKRRSTHVYRSFCDSYLRRDVISNENEGKNSMESDNSLSSS------------------
            AS+  +        +   R S+F   +K     +   +KR S  +   F      R   ++E E ++  E + S   S                  
Subjt:  HK--ASTPVAVSNCKAAPVTTPRLSLF---SKVPVAPRSGTKKRRSTHVYRSFCDSYLRRDVISNENEGKNSMESDNSLSSS------------------

Query:  -RFK-DRE----AGYRKLAEAIGTITDIYERVEVAKQRHMIELEMQRMQFVKDLEYQRMQLLMEMQLQI
         R K D+     +G   +A AI   T+ YE+ E AK + M ELE +RM+F K++E QRMQ L + QL+I
Subjt:  -RFK-DRE----AGYRKLAEAIGTITDIYERVEVAKQRHMIELEMQRMQFVKDLEYQRMQLLMEMQLQI

Arabidopsis top hits (e value, % identity, alignment)
AT2G30990.1 Protein of unknown function (DUF688) (e value: 1.0e-55; identity: 39.68%)
Query:  LMEEKQLDLNQPLLSVRRFTSTGTS-SKTNEKARPEPEI-PRPPVYKSELKSGPVRNPGTVPFVWEQTPGKPKDESTVKTQSSKRPPLVPKLPPGRAQKM
        +MEEKQLD N+PL+S+RR T T  S SKT         I P PPVYKS++KSGPVRNPGTVPF WE  PGKPKDE     QS   P  VPKLPPGR + +
Subjt:  LMEEKQLDLNQPLLSVRRFTSTGTS-SKTNEKARPEPEI-PRPPVYKSELKSGPVRNPGTVPFVWEQTPGKPKDESTVKTQSSKRPPLVPKLPPGRAQKM

Query:  N-QEVPTSINA-----SLDENARETVEELTSCKSGNDDEEED-DEVYRDANDIFSRSESFFLNCS-ISGVSGLDDSEI--KPSGVSSMDPHTRNFMMDRF
             P S  A     ++  + +  VE+  S  S  DD+++D D  Y DA D  SR+ESFF NCS +SG SGLD S I  +P G  S D  T++ MM RF
Subjt:  N-QEVPTSINA-----SLDENARETVEELTSCKSGNDDEEED-DEVYRDANDIFSRSESFFLNCS-ISGVSGLDDSEI--KPSGVSSMDPHTRNFMMDRF

Query:  LPAAKAMASETPPHTIRKQTVERPREIKLVTNGDRQSRPNLHVQTQHVKEIFREESDDEDDDFDESEYASTRGCGFLPRFCLKGSLVLLNPVPGMRMQ--
        LPAAKA+ SE+PPH  RK         +L+     +   N +         FR   D E++D + S   ++  CG LP+ CL+ SL LLNPVP +RMQ  
Subjt:  LPAAKAMASETPPHTIRKQTVERPREIKLVTNGDRQSRPNLHVQTQHVKEIFREESDDEDDDFDESEYASTRGCGFLPRFCLKGSLVLLNPVPGMRMQ--

Query:  -ATSVRRVRNSSTGSSKDAVNERRSSHGQGITRQKLEENASNIQEIDTFSLYRHLQGEGISKYPNEPSQAVHGNVNPSLGYTVKATNSVTNEYDEPRRRS
         A SVRR+R+    S+    NE  +   +   + KL E+ +              QGE +S       +    NV  +      + + ++  + E     
Subjt:  -ATSVRRVRNSSTGSSKDAVNERRSSHGQGITRQKLEENASNIQEIDTFSLYRHLQGEGISKYPNEPSQAVHGNVNPSLGYTVKATNSVTNEYDEPRRRS

Query:  LNSFHALLGDESGSASPV-EKTLYIDSVHKI
         N++      E  S +PV EKTLY+D VH +
Subjt:  LNSFHALLGDESGSASPV-EKTLYIDSVHKI

AT2G30990.2 Protein of unknown function (DUF688) (e value: 1.0e-55; identity: 39.68%)
Query:  LMEEKQLDLNQPLLSVRRFTSTGTS-SKTNEKARPEPEI-PRPPVYKSELKSGPVRNPGTVPFVWEQTPGKPKDESTVKTQSSKRPPLVPKLPPGRAQKM
        +MEEKQLD N+PL+S+RR T T  S SKT         I P PPVYKS++KSGPVRNPGTVPF WE  PGKPKDE     QS   P  VPKLPPGR + +
Subjt:  LMEEKQLDLNQPLLSVRRFTSTGTS-SKTNEKARPEPEI-PRPPVYKSELKSGPVRNPGTVPFVWEQTPGKPKDESTVKTQSSKRPPLVPKLPPGRAQKM

Query:  N-QEVPTSINA-----SLDENARETVEELTSCKSGNDDEEED-DEVYRDANDIFSRSESFFLNCS-ISGVSGLDDSEI--KPSGVSSMDPHTRNFMMDRF
             P S  A     ++  + +  VE+  S  S  DD+++D D  Y DA D  SR+ESFF NCS +SG SGLD S I  +P G  S D  T++ MM RF
Subjt:  N-QEVPTSINA-----SLDENARETVEELTSCKSGNDDEEED-DEVYRDANDIFSRSESFFLNCS-ISGVSGLDDSEI--KPSGVSSMDPHTRNFMMDRF

Query:  LPAAKAMASETPPHTIRKQTVERPREIKLVTNGDRQSRPNLHVQTQHVKEIFREESDDEDDDFDESEYASTRGCGFLPRFCLKGSLVLLNPVPGMRMQ--
        LPAAKA+ SE+PPH  RK         +L+     +   N +         FR   D E++D + S   ++  CG LP+ CL+ SL LLNPVP +RMQ  
Subjt:  LPAAKAMASETPPHTIRKQTVERPREIKLVTNGDRQSRPNLHVQTQHVKEIFREESDDEDDDFDESEYASTRGCGFLPRFCLKGSLVLLNPVPGMRMQ--

Query:  -ATSVRRVRNSSTGSSKDAVNERRSSHGQGITRQKLEENASNIQEIDTFSLYRHLQGEGISKYPNEPSQAVHGNVNPSLGYTVKATNSVTNEYDEPRRRS
         A SVRR+R+    S+    NE  +   +   + KL E+ +              QGE +S       +    NV  +      + + ++  + E     
Subjt:  -ATSVRRVRNSSTGSSKDAVNERRSSHGQGITRQKLEENASNIQEIDTFSLYRHLQGEGISKYPNEPSQAVHGNVNPSLGYTVKATNSVTNEYDEPRRRS

Query:  LNSFHALLGDESGSASPV-EKTLYIDSVHKI
         N++      E  S +PV EKTLY+D VH +
Subjt:  LNSFHALLGDESGSASPV-EKTLYIDSVHKI

AT2G30990.3 Protein of unknown function (DUF688) (e value: 1.6e-45; identity: 36.64%)
Query:  LMEEKQLDLNQPLLSVRRFTSTGTS-SKTNEKARPEPEI-PRPPVYKSELKSGPVRNPGTVPFVWEQTPGKPKDESTVKTQSSKRPPLVPKLPPGRAQKM
        +MEEKQLD N+PL+S+RR T T  S SKT         I P PPVYKS++KSGPVRNPGTVPF WE  PGKPKDE     QS   P  VPKLPPGR + +
Subjt:  LMEEKQLDLNQPLLSVRRFTSTGTS-SKTNEKARPEPEI-PRPPVYKSELKSGPVRNPGTVPFVWEQTPGKPKDESTVKTQSSKRPPLVPKLPPGRAQKM

Query:  NQEVPTSINASLDENARETVEELTSCKSGNDDEEEDDEVYRDANDIFSRSESFFLNCSISGVSGLDDSEI--KPSGVSSMDPHTRNFMMDRFLPAAKAMA
          E+     ++  ++  +TV               D  +  DA    SR        ++SG SGLD S I  +P G  S D  T++ MM RFLPAAKA+ 
Subjt:  NQEVPTSINASLDENARETVEELTSCKSGNDDEEEDDEVYRDANDIFSRSESFFLNCSISGVSGLDDSEI--KPSGVSSMDPHTRNFMMDRFLPAAKAMA

Query:  SETPPHTIRKQTVERPREIKLVTNGDRQSRPNLHVQTQHVKEIFREESDDEDDDFDESEYASTRGCGFLPRFCLKGSLVLLNPVPGMRMQ---ATSVRRV
        SE+PPH  RK         +L+     +   N +         FR   D E++D + S   ++  CG LP+ CL+ SL LLNPVP +RMQ   A SVRR+
Subjt:  SETPPHTIRKQTVERPREIKLVTNGDRQSRPNLHVQTQHVKEIFREESDDEDDDFDESEYASTRGCGFLPRFCLKGSLVLLNPVPGMRMQ---ATSVRRV

Query:  RNSSTGSSKDAVNERRSSHGQGITRQKLEENASNIQEIDTFSLYRHLQGEGISKYPNEPSQAVHGNVNPSLGYTVKATNSVTNEYDEPRRRSLNSFHALL
        R+    S+    NE  +   +   + KL E+ +              QGE +S       +    NV  +      + + ++  + E      N++    
Subjt:  RNSSTGSSKDAVNERRSSHGQGITRQKLEENASNIQEIDTFSLYRHLQGEGISKYPNEPSQAVHGNVNPSLGYTVKATNSVTNEYDEPRRRSLNSFHALL

Query:  GDESGSASPV-EKTLYIDSVHKI
          E  S +PV EKTLY+D VH +
Subjt:  GDESGSASPV-EKTLYIDSVHKI

AT3G11100.1 | sequence-specific DNA binding transcription factors | e value: 6.8e-36 | %identity: 37.69
Query:  REDCWTLYDTSTLIEAWGERHIDLNRGSLRLKHWQEIANAVNSSHGHERKSIRTAIQCKNRIDTLKRKYKIEKARIQESGGSYDCAWPFFSCLDDLIGNN
        RED W+   T+TLIEAWG+R+++LNRG+LR   W+E+A+AVNSSHG+ R   +T +QCKNRIDTLK+KYK EKA+   +       W FF  LD LIG  
Subjt:  REDCWTLYDTSTLIEAWGERHIDLNRGSLRLKHWQEIANAVNSSHGHERKSIRTAIQCKNRIDTLKRKYKIEKARIQESGGSYDCAWPFFSCLDDLIGNN

Query:  HKASTPVAVSNCKAAPVTTPRLSLFSKVPVAPRSGTKKRRSTHVYRSFCDSYLRRDVISNENEGKNSMESDNSLSSSRFKDRE------------AGYRK
         K S+   V +    P   P             +G+K   S+              +  ++++  +  E D+      F  R+            + +R+
Subjt:  HKASTPVAVSNCKAAPVTTPRLSLFSKVPVAPRSGTKKRRSTHVYRSFCDSYLRRDVISNENEGKNSMESDNSLSSSRFKDRE------------AGYRK

Query:  LAEAIGTITDIYERVEVAKQRHMIELEMQRMQFVKDLEYQRMQLLMEMQLQIQKIKRARRASGAGESL
        LA +I  + + +ER+E  KQ+ MIELE QRM+  K+LE QRM +LMEMQL+++K K  +R + +G+ L
Subjt:  LAEAIGTITDIYERVEVAKQRHMIELEMQRMQFVKDLEYQRMQLLMEMQLQIQKIKRARRASGAGESL

AT3G58630.1 | sequence-specific DNA binding transcription factors | e value: 1.8e-49 | %identity: 41.29
Query:  SLPSPAPV-REDCWTLYDTSTLIEAWGERHIDLNRGSLRLKHWQEIANAVNSSHGHERKSI--------RTAIQCKNRIDTLKRKYKIEKARIQESG-GS
        S PSPA + REDCW+   T TLI+AWG R++DL+RG+LR KHWQE+ANAVN  H +  +++        RT +QCKNRIDTLK+KYK+EKAR+ ES  G+
Subjt:  SLPSPAPV-REDCWTLYDTSTLIEAWGERHIDLNRGSLRLKHWQEIANAVNSSHGHERKSI--------RTAIQCKNRIDTLKRKYKIEKARIQESG-GS

Query:  YDCAWPFFSCLDDLIGNNHKASTPVAVSNCKAAPVTTPRLSL---FSKVPVAPRSGTKKRRST------HV------YRSFCDSYLRRDVIS------NE
        Y   WPFFS LDDL+    + S P + +      +   RLSL    + VPVAPRS   +R +T      H       +R   +++      +      ++
Subjt:  YDCAWPFFSCLDDLIGNNHKASTPVAVSNCKAAPVTTPRLSL---FSKVPVAPRSGTKKRRST------HV------YRSFCDSYLRRDVIS------NE

Query:  NEGKNSMESDNSLSSSRFK---DREAGYRKLAEAIGTITDIYERVEVAKQRHMIELEMQRMQFVKDLEYQRMQLLMEMQLQIQKIKRARRASGAGESLAS
        +EG  S  S  S S+ + +   +++ GY+++A+AI  +  IYERVE  K++ M+ELE QRM+F K+LE  RMQL  EMQ+++ K++R   + G   S AS
Subjt:  NEGKNSMESDNSLSSSRFK---DREAGYRKLAEAIGTITDIYERVEVAKQRHMIELEMQRMQFVKDLEYQRMQLLMEMQLQIQKIKRARRASGAGESLAS

Query:  NSFYYFLYSF
         +  Y +  F
Subjt:  NSFYYFLYSF


Sequences
CDS sequence
ATGATTTCATTGAGGAATTTGATGGAAGAGAAACAGCTGGATTTGAATCAACCTCTTCTTTCTGTGAGGCGTTTTACATCCACAGGAACATCATCCAAGACTAATGAAAA
AGCCAGACCTGAACCCGAAATTCCTCGTCCTCCTGTATATAAATCAGAATTGAAATCTGGTCCAGTGAGGAATCCTGGAACTGTTCCTTTTGTATGGGAGCAAACTCCAG
GAAAACCCAAGGATGAAAGCACTGTGAAAACTCAGAGTTCCAAACGGCCTCCACTTGTCCCAAAACTCCCACCTGGAAGAGCCCAAAAAATGAATCAAGAAGTTCCTACT
TCTATAAATGCTTCATTAGATGAAAATGCGAGAGAAACAGTTGAAGAGCTGACCAGCTGTAAGTCAGGAAATGACGACGAAGAAGAAGATGACGAGGTTTACAGAGATGC
AAACGATATATTTTCTCGATCCGAATCGTTCTTCTTGAATTGTAGTATTAGTGGAGTGAGTGGATTGGATGATTCTGAAATTAAACCTTCAGGAGTCTCCTCCATGGATC
CACATACTAGGAATTTCATGATGGACCGGTTTCTACCTGCAGCAAAGGCAATGGCATCAGAAACACCTCCACATACTATTAGAAAACAAACTGTCGAGCGACCGAGGGAA
ATAAAATTGGTGACAAATGGGGATAGGCAGAGTCGGCCGAACCTGCACGTTCAGACACAACATGTAAAAGAAATATTCAGGGAAGAAAGTGATGATGAAGATGATGATTT
TGATGAGTCTGAATATGCTTCCACTCGGGGATGTGGTTTTCTTCCTCGGTTTTGCTTAAAAGGTTCACTTGTTCTTTTAAATCCAGTACCTGGAATGAGAATGCAGGCCA
CTTCGGTGCGCCGAGTTCGTAATAGCTCAACTGGGAGTTCCAAGGATGCTGTAAATGAGAGACGATCAAGTCATGGTCAAGGAATAACTCGGCAGAAGCTCGAAGAAAAT
GCAAGTAATATTCAGGAGATCGATACATTTTCTTTATACAGACATTTGCAGGGCGAGGGCATATCCAAATACCCAAATGAACCTTCCCAGGCTGTTCATGGAAATGTAAA
TCCTTCTCTTGGTTACACTGTAAAGGCTACCAATTCTGTGACGAATGAATACGACGAACCACGTCGAAGAAGCCTAAATAGCTTTCATGCATTACTGGGTGACGAGTCGG
GTTCAGCTAGTCCTGTAGAGAAAACTCTGTACATAGACTCTGTTCATAAAATTACATCTCCTGACTCAAGTTCAAATTCTTTGGATAGGAAGGGGATAAGTTACAGTGGA
GATATGATTGATGATGCTTCGATAAAAAGCACCGAAATTAAAGAACTGTGTAAACTGGATTCTGAAACCCAAGATGTCAAGAATTTGAATGCTGTTGGCGAAAAAAGTAC
TCTGAGACCTGACAGTTTGAAATCTATGGATTCCTGCTTGCTCACTTGCTCGAATAGATCATTGTTTGAGGTAAAAGTAGATTCAAATTTCTCCAGGCTCAAACCAGAAC
ATACTCAGGATGCTGCGAAGTCGACAAGCTCACAATTCGCTAACAACAAGAAGTTTAATCTGGAAAACCAATTTCCCTTGAAGCCGAGCAGCCGAGGCGATGCGAATGGT
CTTGCTAAAGATAACACTACATTGGTATCCTCATATGGAATTCACAGTGAGAAGGTTAATTTAGACAGGAAACAACCTGAGAAGTCTGTTTATGGAAGGAATAACGTAAT
TGTTCCAGAATATGGCAAGAAGCACGAGTCGGATGGCGAAAAGCTTGCTCCTGTTGAGGACTCCCCTGGTCTGAGAACTTCAGAACGTGCTACCAATGGAGAGAAGGATT
CAAGAAATCAACTTCTCAAGAGGGTAGGTAATGAAGATGGTTCTCGCGGTGGCTACTCACAGTCGCGGCTGCGTTTTGCTCCGCCTCCGCCAAAATCTCCATCAGAATCT
TGGTTGAAACGTACTTTGCCAACTTCTTCAAGAAACACAGTTTTTTTGCAGTCTTCTTTTGCAATGCAGGTTAACCCTGTTTCCACGACCGCGTCTCCAGATCGAAAGAC
ACCGTCTACTGTTAAAAGCTCCGATACAAACCATCTGCATCTGCAGTTTTCAAAGACTCTCCGCCGGCAACCCTCTGGACTCACCGTCGCCCTCATGGCTCTGTCACTGC
CGTCGCCGGCTCCAGTTCGTGAAGACTGTTGGACTTTGTATGATACTTCCACTCTCATCGAGGCCTGGGGTGAACGTCATATCGACTTGAACCGCGGCAGCCTTAGGCTC
AAACACTGGCAAGAAATCGCCAACGCTGTCAACTCCAGTCACGGTCACGAGAGGAAGTCCATTCGTACGGCTATTCAGTGCAAAAACCGCATCGACACGCTTAAGAGAAA
GTATAAAATCGAGAAGGCGAGAATTCAAGAATCCGGTGGCTCGTACGACTGCGCTTGGCCGTTCTTCTCGTGCCTCGACGATCTTATAGGCAACAACCACAAGGCTTCCA
CGCCGGTCGCCGTATCTAATTGTAAAGCCGCGCCAGTAACGACTCCAAGGTTGTCGCTGTTCTCAAAGGTCCCCGTTGCCCCTCGATCCGGAACTAAGAAGCGTCGTTCA
ACTCACGTCTATAGGAGTTTTTGCGACTCGTACCTTCGCCGGGATGTAATCTCGAATGAAAATGAAGGAAAAAATAGTATGGAATCCGATAATTCGCTCTCGAGTTCGAG
GTTCAAGGACAGAGAGGCAGGTTATAGAAAACTGGCCGAGGCAATAGGGACGATCACTGATATATATGAGAGAGTGGAGGTAGCAAAACAGAGGCACATGATAGAATTGG
AGATGCAACGGATGCAATTCGTGAAGGATTTAGAGTATCAGAGAATGCAACTACTTATGGAGATGCAACTTCAGATTCAAAAGATCAAGCGCGCCAGGCGAGCATCTGGA
GCCGGTGAGTCCTTAGCTTCTAACTCCTTTTATTACTTTCTCTATTCTTTCCTGCTTTAG
mRNA sequence
TTCTCTCTCTATCTGTCCCTTTCTCTTTATTATCGTCTTTTTTTGTTTTAAAAATTAATGCATGGTTTGGTTCAAGATTCGTGGACAAAATCTCTCTCTCTCTCTCTCTC
TCTCTCATTTCATTTTTTTTGCAAATCCAGCTTGTATTCTTTCACTGTCTGCAGGGAGTAGATCGAACTGGTTATCAGGTCGGTGCAAACTTAAAAGAGGCTTCAATATC
TTCATCTGTAATTACGGTACAACCACATGATTTCATTGAGGAATTTGATGGAAGAGAAACAGCTGGATTTGAATCAACCTCTTCTTTCTGTGAGGCGTTTTACATCCACA
GGAACATCATCCAAGACTAATGAAAAAGCCAGACCTGAACCCGAAATTCCTCGTCCTCCTGTATATAAATCAGAATTGAAATCTGGTCCAGTGAGGAATCCTGGAACTGT
TCCTTTTGTATGGGAGCAAACTCCAGGAAAACCCAAGGATGAAAGCACTGTGAAAACTCAGAGTTCCAAACGGCCTCCACTTGTCCCAAAACTCCCACCTGGAAGAGCCC
AAAAAATGAATCAAGAAGTTCCTACTTCTATAAATGCTTCATTAGATGAAAATGCGAGAGAAACAGTTGAAGAGCTGACCAGCTGTAAGTCAGGAAATGACGACGAAGAA
GAAGATGACGAGGTTTACAGAGATGCAAACGATATATTTTCTCGATCCGAATCGTTCTTCTTGAATTGTAGTATTAGTGGAGTGAGTGGATTGGATGATTCTGAAATTAA
ACCTTCAGGAGTCTCCTCCATGGATCCACATACTAGGAATTTCATGATGGACCGGTTTCTACCTGCAGCAAAGGCAATGGCATCAGAAACACCTCCACATACTATTAGAA
AACAAACTGTCGAGCGACCGAGGGAAATAAAATTGGTGACAAATGGGGATAGGCAGAGTCGGCCGAACCTGCACGTTCAGACACAACATGTAAAAGAAATATTCAGGGAA
GAAAGTGATGATGAAGATGATGATTTTGATGAGTCTGAATATGCTTCCACTCGGGGATGTGGTTTTCTTCCTCGGTTTTGCTTAAAAGGTTCACTTGTTCTTTTAAATCC
AGTACCTGGAATGAGAATGCAGGCCACTTCGGTGCGCCGAGTTCGTAATAGCTCAACTGGGAGTTCCAAGGATGCTGTAAATGAGAGACGATCAAGTCATGGTCAAGGAA
TAACTCGGCAGAAGCTCGAAGAAAATGCAAGTAATATTCAGGAGATCGATACATTTTCTTTATACAGACATTTGCAGGGCGAGGGCATATCCAAATACCCAAATGAACCT
TCCCAGGCTGTTCATGGAAATGTAAATCCTTCTCTTGGTTACACTGTAAAGGCTACCAATTCTGTGACGAATGAATACGACGAACCACGTCGAAGAAGCCTAAATAGCTT
TCATGCATTACTGGGTGACGAGTCGGGTTCAGCTAGTCCTGTAGAGAAAACTCTGTACATAGACTCTGTTCATAAAATTACATCTCCTGACTCAAGTTCAAATTCTTTGG
ATAGGAAGGGGATAAGTTACAGTGGAGATATGATTGATGATGCTTCGATAAAAAGCACCGAAATTAAAGAACTGTGTAAACTGGATTCTGAAACCCAAGATGTCAAGAAT
TTGAATGCTGTTGGCGAAAAAAGTACTCTGAGACCTGACAGTTTGAAATCTATGGATTCCTGCTTGCTCACTTGCTCGAATAGATCATTGTTTGAGGTAAAAGTAGATTC
AAATTTCTCCAGGCTCAAACCAGAACATACTCAGGATGCTGCGAAGTCGACAAGCTCACAATTCGCTAACAACAAGAAGTTTAATCTGGAAAACCAATTTCCCTTGAAGC
CGAGCAGCCGAGGCGATGCGAATGGTCTTGCTAAAGATAACACTACATTGGTATCCTCATATGGAATTCACAGTGAGAAGGTTAATTTAGACAGGAAACAACCTGAGAAG
TCTGTTTATGGAAGGAATAACGTAATTGTTCCAGAATATGGCAAGAAGCACGAGTCGGATGGCGAAAAGCTTGCTCCTGTTGAGGACTCCCCTGGTCTGAGAACTTCAGA
ACGTGCTACCAATGGAGAGAAGGATTCAAGAAATCAACTTCTCAAGAGGGTAGGTAATGAAGATGGTTCTCGCGGTGGCTACTCACAGTCGCGGCTGCGTTTTGCTCCGC
CTCCGCCAAAATCTCCATCAGAATCTTGGTTGAAACGTACTTTGCCAACTTCTTCAAGAAACACAGTTTTTTTGCAGTCTTCTTTTGCAATGCAGGTTAACCCTGTTTCC
ACGACCGCGTCTCCAGATCGAAAGACACCGTCTACTGTTAAAAGCTCCGATACAAACCATCTGCATCTGCAGTTTTCAAAGACTCTCCGCCGGCAACCCTCTGGACTCAC
CGTCGCCCTCATGGCTCTGTCACTGCCGTCGCCGGCTCCAGTTCGTGAAGACTGTTGGACTTTGTATGATACTTCCACTCTCATCGAGGCCTGGGGTGAACGTCATATCG
ACTTGAACCGCGGCAGCCTTAGGCTCAAACACTGGCAAGAAATCGCCAACGCTGTCAACTCCAGTCACGGTCACGAGAGGAAGTCCATTCGTACGGCTATTCAGTGCAAA
AACCGCATCGACACGCTTAAGAGAAAGTATAAAATCGAGAAGGCGAGAATTCAAGAATCCGGTGGCTCGTACGACTGCGCTTGGCCGTTCTTCTCGTGCCTCGACGATCT
TATAGGCAACAACCACAAGGCTTCCACGCCGGTCGCCGTATCTAATTGTAAAGCCGCGCCAGTAACGACTCCAAGGTTGTCGCTGTTCTCAAAGGTCCCCGTTGCCCCTC
GATCCGGAACTAAGAAGCGTCGTTCAACTCACGTCTATAGGAGTTTTTGCGACTCGTACCTTCGCCGGGATGTAATCTCGAATGAAAATGAAGGAAAAAATAGTATGGAA
TCCGATAATTCGCTCTCGAGTTCGAGGTTCAAGGACAGAGAGGCAGGTTATAGAAAACTGGCCGAGGCAATAGGGACGATCACTGATATATATGAGAGAGTGGAGGTAGC
AAAACAGAGGCACATGATAGAATTGGAGATGCAACGGATGCAATTCGTGAAGGATTTAGAGTATCAGAGAATGCAACTACTTATGGAGATGCAACTTCAGATTCAAAAGA
TCAAGCGCGCCAGGCGAGCATCTGGAGCCGGTGAGTCCTTAGCTTCTAACTCCTTTTATTACTTTCTCTATTCTTTCCTGCTTTAG
Protein sequence
MISLRNLMEEKQLDLNQPLLSVRRFTSTGTSSKTNEKARPEPEIPRPPVYKSELKSGPVRNPGTVPFVWEQTPGKPKDESTVKTQSSKRPPLVPKLPPGRAQKMNQEVPT
SINASLDENARETVEELTSCKSGNDDEEEDDEVYRDANDIFSRSESFFLNCSISGVSGLDDSEIKPSGVSSMDPHTRNFMMDRFLPAAKAMASETPPHTIRKQTVERPRE
IKLVTNGDRQSRPNLHVQTQHVKEIFREESDDEDDDFDESEYASTRGCGFLPRFCLKGSLVLLNPVPGMRMQATSVRRVRNSSTGSSKDAVNERRSSHGQGITRQKLEEN
ASNIQEIDTFSLYRHLQGEGISKYPNEPSQAVHGNVNPSLGYTVKATNSVTNEYDEPRRRSLNSFHALLGDESGSASPVEKTLYIDSVHKITSPDSSSNSLDRKGISYSG
DMIDDASIKSTEIKELCKLDSETQDVKNLNAVGEKSTLRPDSLKSMDSCLLTCSNRSLFEVKVDSNFSRLKPEHTQDAAKSTSSQFANNKKFNLENQFPLKPSSRGDANG
LAKDNTTLVSSYGIHSEKVNLDRKQPEKSVYGRNNVIVPEYGKKHESDGEKLAPVEDSPGLRTSERATNGEKDSRNQLLKRVGNEDGSRGGYSQSRLRFAPPPPKSPSES
WLKRTLPTSSRNTVFLQSSFAMQVNPVSTTASPDRKTPSTVKSSDTNHLHLQFSKTLRRQPSGLTVALMALSLPSPAPVREDCWTLYDTSTLIEAWGERHIDLNRGSLRL
KHWQEIANAVNSSHGHERKSIRTAIQCKNRIDTLKRKYKIEKARIQESGGSYDCAWPFFSCLDDLIGNNHKASTPVAVSNCKAAPVTTPRLSLFSKVPVAPRSGTKKRRS
THVYRSFCDSYLRRDVISNENEGKNSMESDNSLSSSRFKDREAGYRKLAEAIGTITDIYERVEVAKQRHMIELEMQRMQFVKDLEYQRMQLLMEMQLQIQKIKRARRASG
AGESLASNSFYYFLYSFLL