; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr025915 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr025915
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionU-box domain-containing protein 4
Genome locationtig00153017:1273484..1293646
RNA-Seq ExpressionSgr025915
SyntenySgr025915
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR000225 - Armadillo
IPR011989 - Armadillo-like helical
IPR016024 - Armadillo-type fold
IPR025602 - BCP1 family


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6579255.1 U-box domain-containing protein 12, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0078.02Show/hide
Query:  LLSTRKTVFAFGAVGT-KARGRSVFLYHLRYCSRYPESLSKRVIRRRVSSDGGGAVDSTQQQSAAPDIKDVQNDSSSVGHSYVALFVRMLGLDHDPLDRE
        L STR  +FA GA+G+ K R  S+ L+ LR+ + +     KRV+ RRVSSDGGGA+DSTQQQSAA DI+DV NDSSSVGHSYVALFVRMLGLDHDPLDRE
Subjt:  LLSTRKTVFAFGAVGT-KARGRSVFLYHLRYCSRYPESLSKRVIRRRVSSDGGGAVDSTQQQSAAPDIKDVQNDSSSVGHSYVALFVRMLGLDHDPLDRE

Query:  QAIIALWKYSLGGKKHIDAIMQFPGCINLTRFSCRKWSNR------------------RDNWLAWSTVID---------SRAVQVKEQSICVLWNLSVDE
        QAI+ALWKYSLGGKKHIDAIMQFPGCINLT    R  S                    RD+ +A S  I+         S   +VKEQSICVLWNLSVDE
Subjt:  QAIIALWKYSLGGKKHIDAIMQFPGCINLTRFSCRKWSNR------------------RDNWLAWSTVID---------SRAVQVKEQSICVLWNLSVDE

Query:  KLRMKIANTDILPLLSKNLDDEDMKVKEAAGGVLANLALSQCNHGLIVELGLIPKLAYLLKAEADSSKT--------------DEYYRILVIEEGLVPVP
        +LR KIANTDILPLL KNLDDEDMKVKEAAGGV+ANL LS CNHG+IVE GLIPKLAY LKAEADSSKT              DEYYRILVIEEGLVPVP
Subjt:  KLRMKIANTDILPLLSKNLDDEDMKVKEAAGGVLANLALSQCNHGLIVELGLIPKLAYLLKAEADSSKT--------------DEYYRILVIEEGLVPVP

Query:  IVGAAAYKSFRPGLHSWPSLPDGTDIEQSSKGPSRFGASELLLGLNVDNNANIEEDKINAIVGRTQQQFLARIGAIEFEDMRNSQSESSTSNHLTLLPWI
        ++GAAAYKSF+PGLHSWPSLPDGT+IE+SSK PSRFGASELLLGLNVDNNANIEE KINAI+GR+QQQFLARIGAIE ED++++QSESSTSNHLTLLPW 
Subjt:  IVGAAAYKSFRPGLHSWPSLPDGTDIEQSSKGPSRFGASELLLGLNVDNNANIEEDKINAIVGRTQQQFLARIGAIEFEDMRNSQSESSTSNHLTLLPWI

Query:  DGVARLVLILELEDDNAILRAAESIADASINEHMRISFKEAGAIKHLVKLLDNMNNVVKWATIRALERLSISNVVCQTLENEGALGPLLSILKLSNIPED
        DGVARLVL+LELEDDNA +R AE IADASINEHMR+SFKEAGAIKHLVKLLD+MNN VKWA+++ALERLSISNVVCQT+ENEGAL PLLSILKLS+IPE+
Subjt:  DGVARLVLILELEDDNAILRAAESIADASINEHMRISFKEAGAIKHLVKLLDNMNNVVKWATIRALERLSISNVVCQTLENEGALGPLLSILKLSNIPED

Query:  AMEKTLDILSRILDPSKEMKSKFYNGPANGSHGGQHSERNSQASTSKDVLDAVVVSRLIEILNASSPNLKRKAASILEFISIMDPSMEIINSVEIESGLL
         MEKTLDILSRILDPSKEMKSKFYNGP NGS GGQHSERNS+AST KDVLDAVVVS L+EIL  SSPNLKRKAASILEFISIMDPSM+II+ +EIESGL 
Subjt:  AMEKTLDILSRILDPSKEMKSKFYNGPANGSHGGQHSERNSQASTSKDVLDAVVVSRLIEILNASSPNLKRKAASILEFISIMDPSMEIINSVEIESGLL

Query:  AVFQLGVSIDSDAEDWPPERYALEVEEAGLSISAASRLLTKLLDSEKFQHKINSDHFTKLLRRILKSDIPIHHKDWVAACLVKVSTVSAPNRDSGDPVNM
        AVFQLGVSID+DAEDWPPER+ALEVEEAGL+ISAASRLLTKL+DSEKF +KIN+  FT+ LRRILK DIP+ HKDW+AACL+KVS+V+AP+ DSGDP++M
Subjt:  AVFQLGVSIDSDAEDWPPERYALEVEEAGLSISAASRLLTKLLDSEKFQHKINSDHFTKLLRRILKSDIPIHHKDWVAACLVKVSTVSAPNRDSGDPVNM

Query:  EVTLYETIPRLIQQIKSSFSMEVQESTVVELNGIVSEGIVDATRAVASNGGIFPLVKLIDEGSERAAEAALAILYNLSMDSENHPAILAAGAVPALRRIV
        EV LYETIPRLIQQ+KSS S EVQESTVVELN IVSEG+VDATRAVAS GGIFPLVKLIDEGSERA EAALA+LYNLSMD+ENHPAILAAGAVPALRRIV
Subjt:  EVTLYETIPRLIQQIKSSFSMEVQESTVVELNGIVSEGIVDATRAVASNGGIFPLVKLIDEGSERAAEAALAILYNLSMDSENHPAILAAGAVPALRRIV

Query:  LSQRAQWRQALHLLRTLPT
        LSQR QW+QAL+LLRTLPT
Subjt:  LSQRAQWRQALHLLRTLPT

KAG7016771.1 hypothetical protein SDJN02_21881 [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0076.04Show/hide
Query:  LLSTRKTVFAFGAVGT-KARGRSVFLYHLRYCSRYPESLSKRVIRRRVSSDGGGAVDSTQQQSAAPDIKDVQNDSSSVGHSYVALFVRMLGLDHDPLDRE
        L STR  +FA GA+G+ K R  S+ L+ LR+ + +     KRV+ RRVSSDGGGA+DSTQQQSAA DI+DV NDSSSVGHSYVALFVRMLGLDHDPLDRE
Subjt:  LLSTRKTVFAFGAVGT-KARGRSVFLYHLRYCSRYPESLSKRVIRRRVSSDGGGAVDSTQQQSAAPDIKDVQNDSSSVGHSYVALFVRMLGLDHDPLDRE

Query:  QAIIALWKYSLGGKKHIDAIMQFPGCINLTRFSCRKWSNR------------------RDN----------------------------WLAWSTVID--
        QAI+ALWKYSLGGKKHIDAIMQFPGCINLT    R  S                    RD+                            ++ W  VI+  
Subjt:  QAIIALWKYSLGGKKHIDAIMQFPGCINLTRFSCRKWSNR------------------RDN----------------------------WLAWSTVID--

Query:  ---SRAVQVKEQSICVLWNLSVDEKLRMKIANTDILPLLSKNLDDEDMKVKEAAGGVLANLALSQCNHGLIVELGLIPKLAYLLKAEADSSKT-------
           + AVQVKEQSICVLWNLSVDE+LR KIANTDILPLL KNLDDEDMKVKEAAGGV+ANL LS CNHG+IVE GLIPKLAY LKAEADSSKT       
Subjt:  ---SRAVQVKEQSICVLWNLSVDEKLRMKIANTDILPLLSKNLDDEDMKVKEAAGGVLANLALSQCNHGLIVELGLIPKLAYLLKAEADSSKT-------

Query:  -------DEYYRILVIEEGLVPVPIVGAAAYKSFRPGLHSWPSLPDGTDIEQSSKGPSRFGASELLLGLNVDNNANIEEDKINAIVGRTQQQFLARIGAI
               DEYYRILVIEEGLVPVP++GAAAYKSF+PGLHSWPSLPDGT+IE+SSK PSRFGASELLLGLNVDNNANIEE KINAI+GR+QQQFLARIGAI
Subjt:  -------DEYYRILVIEEGLVPVPIVGAAAYKSFRPGLHSWPSLPDGTDIEQSSKGPSRFGASELLLGLNVDNNANIEEDKINAIVGRTQQQFLARIGAI

Query:  EFEDMRNSQSESSTSNHLTLLPWIDGVARLVLILELEDDNAILRAAESIADASINEHMRISFKEAGAIKHLVKLLDNMNNVVKWATIRALERLSISNVVC
        E ED++++QSESSTSNHLTLLPW DGVARLVL+LELEDDNA +R AE IADASINEHMR+SFKEAGAIKHLVKLLD+MN+ VKWA+++ALERLSISNVVC
Subjt:  EFEDMRNSQSESSTSNHLTLLPWIDGVARLVLILELEDDNAILRAAESIADASINEHMRISFKEAGAIKHLVKLLDNMNNVVKWATIRALERLSISNVVC

Query:  QTLENEGALGPLLSILKLSNIPEDAMEKTLDILSRILDPSKEMKSKFYNGPANGSHGGQHSERNSQASTSKDVLDAVVVSRLIEILNASSPNLKRKAASI
        QT+ENEGAL PLLSILKLS+IPE+ MEKTLDILSRILDPSKEMKSKFYNGP NGS GGQHSERNS+AST KDVLDAVVVS L+EIL  SSPNLKRKAASI
Subjt:  QTLENEGALGPLLSILKLSNIPEDAMEKTLDILSRILDPSKEMKSKFYNGPANGSHGGQHSERNSQASTSKDVLDAVVVSRLIEILNASSPNLKRKAASI

Query:  LEFISIMDPSMEIINSVEIESGLLAVFQLGVSIDSDAEDWPPERYALEVEEAGLSISAASRLLTKLLDSEKFQHKINSDHFTKLLRRILKSDIPIHHKDW
        LEFISIMDPSM+II+ +EIESGL AVFQLGVSID+DAEDWPPER+ALEVEEAGL+ISAASRLLTKL+DSEKF +KIN+  FT+ LRRILK DIP+ HKDW
Subjt:  LEFISIMDPSMEIINSVEIESGLLAVFQLGVSIDSDAEDWPPERYALEVEEAGLSISAASRLLTKLLDSEKFQHKINSDHFTKLLRRILKSDIPIHHKDW

Query:  VAACLVKVSTVSAPNRDSGDPVNMEVTLYETIPRLIQQIKSSFSMEVQESTVVELNGIVSEGIVDATRAVASNGGIFPLVKLIDEGSERAAEAALAILYN
        +AACL+KVS+V+AP+ DSGDP++MEV LYETIPRLIQQ+KSS SMEVQESTVVELN IVSEG+VDATRAVAS GGIFPLVKLIDEGSERA EAALA+LYN
Subjt:  VAACLVKVSTVSAPNRDSGDPVNMEVTLYETIPRLIQQIKSSFSMEVQESTVVELNGIVSEGIVDATRAVASNGGIFPLVKLIDEGSERAAEAALAILYN

Query:  LSMDSENHPAILAAGAVPALRRIVLSQRAQWRQALHLLRTLPT
        LSMD+ENHPAILAAGAVPALRRIVLSQR QW+QAL+LLRTLPT
Subjt:  LSMDSENHPAILAAGAVPALRRIVLSQRAQWRQALHLLRTLPT

XP_022146414.1 uncharacterized protein LOC111015638 [Momordica charantia]0.0e+0082.4Show/hide
Query:  TLLSTRKTVFAFGAVGTKARGRSVFLYHLRYCSRYPESLSKRVIRRRVSSDGGGAVDSTQQQSAAPDIKDVQNDSSSVGHSYVALFVRMLGLDHDPLDRE
        + LSTR  VF   A GTK R  SVFLYHL Y  RYPESLS R +RR VSSDGGGAVDST QQSAAPDIKDVQNDSSSVGHSYVALFVRMLGLDHDPLDRE
Subjt:  TLLSTRKTVFAFGAVGTKARGRSVFLYHLRYCSRYPESLSKRVIRRRVSSDGGGAVDSTQQQSAAPDIKDVQNDSSSVGHSYVALFVRMLGLDHDPLDRE

Query:  QAIIALWKYSLGGKKHIDAIMQFPGCINLTRFSCRKWS-----------------NRRDNWLAWSTVID---------SRAVQVKEQSICVLWNLSVDEK
        QA+IALWKYSLGGKKHIDAIM+FPGCINLT    +  S                 N   + +A S  I+         S A +VKEQSICVLWNLSVDEK
Subjt:  QAIIALWKYSLGGKKHIDAIMQFPGCINLTRFSCRKWS-----------------NRRDNWLAWSTVID---------SRAVQVKEQSICVLWNLSVDEK

Query:  LRMKIANTDILPLLSKNLDDEDMKVKEAAGGVLANLALSQCNHGLIVELGLIPKLAYLLKAEADSSK--------------TDEYYRILVIEEGLVPVPI
        LR+KIA+TDIL LLSKNLDDEDMKVKEAAGGVLANLALS CNHG+IVE GLIPKLAY LKAEADSSK               DEYYRILVIEEGLVPVPI
Subjt:  LRMKIANTDILPLLSKNLDDEDMKVKEAAGGVLANLALSQCNHGLIVELGLIPKLAYLLKAEADSSK--------------TDEYYRILVIEEGLVPVPI

Query:  VGAAAYKSFRPGLHSWPSLPDGTDIEQSSKGPSRFGASELLLGLNVDNNANIEEDKINAIVGRTQQQFLARIGAIEFEDMRNSQSESSTSNHLTLLPWID
        VGAAAYKSFRPGLHSWPSLPDGT+IEQSS GPSRFGASELLLGLNVDN ANIEE KINAIVGRTQQQFLARIGAIEFED+++SQ ESST NHLTLLPWID
Subjt:  VGAAAYKSFRPGLHSWPSLPDGTDIEQSSKGPSRFGASELLLGLNVDNNANIEEDKINAIVGRTQQQFLARIGAIEFEDMRNSQSESSTSNHLTLLPWID

Query:  GVARLVLILELEDDNAILRAAESIADASINEHMRISFKEAGAIKHLVKLLDNMNNVVKWATIRALERLSISNVVCQTLENEGALGPLLSILKLSNIPEDA
        GVARLVLILELEDD A++RAAESIADASINEHMRISFKEAGAIKHLVK+LDNMN+ VKWATIRALERLSISNVVCQT+ENEGALGPLLSIL+LSNIPE+ 
Subjt:  GVARLVLILELEDDNAILRAAESIADASINEHMRISFKEAGAIKHLVKLLDNMNNVVKWATIRALERLSISNVVCQTLENEGALGPLLSILKLSNIPEDA

Query:  MEKTLDILSRILDPSKEMKSKFYNGPANGSHGGQHSERNSQASTSKDVLDAVVVSRLIEILNASSPNLKRKAASILEFISIMDPSMEIINSVEIESGLLA
        MEKTLDIL RILDPSKEMKSKFYNGP NGSHGGQHSE  S+AST KDVLDAV VS L+EILN SSPNLKRKAASILEF+SIMDPSMEII+S+ IESGLLA
Subjt:  MEKTLDILSRILDPSKEMKSKFYNGPANGSHGGQHSERNSQASTSKDVLDAVVVSRLIEILNASSPNLKRKAASILEFISIMDPSMEIINSVEIESGLLA

Query:  VFQLGVSIDSDAEDWPPERYALEVEEAGLSISAASRLLTKLLDSEKFQHKINSDHFTKLLRRILKSDIPIHHKDWVAACLVKVSTVSAPNRDSGDPVNME
        VFQLGVSIDSDAEDW PERYALEVEEAGL+ISAASRLLTKLLDSEKF +KINS HFTKLLR+ILKSDIPIHHKDW+AACLVKVS++SA N DSG+P+NME
Subjt:  VFQLGVSIDSDAEDWPPERYALEVEEAGLSISAASRLLTKLLDSEKFQHKINSDHFTKLLRRILKSDIPIHHKDWVAACLVKVSTVSAPNRDSGDPVNME

Query:  VTLYETIPRLIQQIKSSFSMEVQESTVVELNGIVSEGIVDATRAVASNGGIFPLVKLIDEGSERAAEAALAILYNLSMDSENHPAILAAGAVPALRRIVL
        VTLYETIPRLIQQIK SFSME QES+VVELN I+SEGIVDATRAVAS GGIFPLVKLIDEGSERA EAALAILYNLSMDSENHPAILAAGAVPALRRIVL
Subjt:  VTLYETIPRLIQQIKSSFSMEVQESTVVELNGIVSEGIVDATRAVASNGGIFPLVKLIDEGSERAAEAALAILYNLSMDSENHPAILAAGAVPALRRIVL

Query:  SQRAQWRQALHLLRTLPT
        SQRA+WR+ALHLLRTLPT
Subjt:  SQRAQWRQALHLLRTLPT

XP_022939600.1 uncharacterized protein LOC111445446 [Cucurbita moschata]0.0e+0078.24Show/hide
Query:  LLSTRKTVFAFGAVGT-KARGRSVFLYHLRYCSRYPESLSKRVIRRRVSSDGGGAVDSTQQQSAAPDIKDVQNDSSSVGHSYVALFVRMLGLDHDPLDRE
        L STR  +FA GA GT K R  S+ L+ LR+ + +     KRV+ RRVSSDGGGA+DSTQQQSAA DI+DV NDSSSVGHSYVALFVRMLGLDHDPLDRE
Subjt:  LLSTRKTVFAFGAVGT-KARGRSVFLYHLRYCSRYPESLSKRVIRRRVSSDGGGAVDSTQQQSAAPDIKDVQNDSSSVGHSYVALFVRMLGLDHDPLDRE

Query:  QAIIALWKYSLGGKKHIDAIMQFPGCINLTRFSCRKWS-----------------NRRDNWLAWSTVID---------SRAVQVKEQSICVLWNLSVDEK
        QAI+ALWKYSLGGKKHIDAIMQFPGCINLT    R  S                 N   + +A S  I+         S   +VKEQSICVLWNLSVDEK
Subjt:  QAIIALWKYSLGGKKHIDAIMQFPGCINLTRFSCRKWS-----------------NRRDNWLAWSTVID---------SRAVQVKEQSICVLWNLSVDEK

Query:  LRMKIANTDILPLLSKNLDDEDMKVKEAAGGVLANLALSQCNHGLIVELGLIPKLAYLLKAEADSSKT--------------DEYYRILVIEEGLVPVPI
        LR KIANTDILPLL KNLDDEDMKVKEAAGGV+ANL LS CNHG+IVE GLIPKLAY LKAEADSSKT              DEYYRILVIEEGLVPVP+
Subjt:  LRMKIANTDILPLLSKNLDDEDMKVKEAAGGVLANLALSQCNHGLIVELGLIPKLAYLLKAEADSSKT--------------DEYYRILVIEEGLVPVPI

Query:  VGAAAYKSFRPGLHSWPSLPDGTDIEQSSKGPSRFGASELLLGLNVDNNANIEEDKINAIVGRTQQQFLARIGAIEFEDMRNSQSESSTSNHLTLLPWID
        +GAAAYKSF+PGLHSWPSLPDGT+IE+SSK PSRFGASELLLGLNVDNN NI+E KINAIVGR+QQQFLARIGAIE ED++++QSESSTSNHLTLLPW D
Subjt:  VGAAAYKSFRPGLHSWPSLPDGTDIEQSSKGPSRFGASELLLGLNVDNNANIEEDKINAIVGRTQQQFLARIGAIEFEDMRNSQSESSTSNHLTLLPWID

Query:  GVARLVLILELEDDNAILRAAESIADASINEHMRISFKEAGAIKHLVKLLDNMNNVVKWATIRALERLSISNVVCQTLENEGALGPLLSILKLSNIPEDA
        GVARLVL+LELEDDNA +R AE IADASINEHMR+SFKEAGAIKHLVKLLD+MNN VKWA+I+ALERLSISNVVCQT+ENEGAL PLLSILKLS+IPE+ 
Subjt:  GVARLVLILELEDDNAILRAAESIADASINEHMRISFKEAGAIKHLVKLLDNMNNVVKWATIRALERLSISNVVCQTLENEGALGPLLSILKLSNIPEDA

Query:  MEKTLDILSRILDPSKEMKSKFYNGPANGSHGGQHSERNSQASTSKDVLDAVVVSRLIEILNASSPNLKRKAASILEFISIMDPSMEIINSVEIESGLLA
        MEKTLDILSRILDPSKEMKSKFY+GP NGS GGQHSERNS+AST KDVLDAVVVS L+EIL  SSPNLKRKAASILEFISIMDPSM+II+ +EIESGL A
Subjt:  MEKTLDILSRILDPSKEMKSKFYNGPANGSHGGQHSERNSQASTSKDVLDAVVVSRLIEILNASSPNLKRKAASILEFISIMDPSMEIINSVEIESGLLA

Query:  VFQLGVSIDSDAEDWPPERYALEVEEAGLSISAASRLLTKLLDSEKFQHKINSDHFTKLLRRILKSDIPIHHKDWVAACLVKVSTVSAPNRDSGDPVNME
        VFQLGVSID+DAEDWPPER+ALEVEEAGL+ISAASRLLTKL+DSEKF +KIN+  FT+ LRRILK DIPI HKDW+AACL+KVS+V+AP+ DSGDP++ME
Subjt:  VFQLGVSIDSDAEDWPPERYALEVEEAGLSISAASRLLTKLLDSEKFQHKINSDHFTKLLRRILKSDIPIHHKDWVAACLVKVSTVSAPNRDSGDPVNME

Query:  VTLYETIPRLIQQIKSSFSMEVQESTVVELNGIVSEGIVDATRAVASNGGIFPLVKLIDEGSERAAEAALAILYNLSMDSENHPAILAAGAVPALRRIVL
        V LYETIPRLIQQ+KSS SMEVQES VVELN IVSEG+VDATRAVAS GGIFPLVKLIDEGSERA EAALA+LYNLSMD+ENHPAILAAGAVPALRRIVL
Subjt:  VTLYETIPRLIQQIKSSFSMEVQESTVVELNGIVSEGIVDATRAVASNGGIFPLVKLIDEGSERAAEAALAILYNLSMDSENHPAILAAGAVPALRRIVL

Query:  SQRAQWRQALHLLRTLPT
        SQR QW+QAL+LLRTLPT
Subjt:  SQRAQWRQALHLLRTLPT

XP_038874402.1 uncharacterized protein LOC120067080 [Benincasa hispida]0.0e+0081.12Show/hide
Query:  KTLLSTRKTVF-AFGAVG-TKARGRSVFLYHLRYCSRYPESLSKRVIRRRVSSDGGGAVDSTQQQSAAPDIKDVQNDSSSVGHSYVALFVRMLGLDHDPL
        +TL  T   +F   GA+G  K    S FLYHLR+ S   +SLSKRV+ RRVSSDGGGAVDSTQQQSAAP I DVQNDSSSVGHSYVALFVRMLGLDHDPL
Subjt:  KTLLSTRKTVF-AFGAVG-TKARGRSVFLYHLRYCSRYPESLSKRVIRRRVSSDGGGAVDSTQQQSAAPDIKDVQNDSSSVGHSYVALFVRMLGLDHDPL

Query:  DREQAIIALWKYSLGGKKHIDAIMQFPGCINLTRFSCRKWS-----------------NRRDNWLAWSTVID---------SRAVQVKEQSICVLWNLSV
        DREQAIIALWKYSLGGKKHIDAIMQFPGCINLT    R  S                 N   + +A S  I+         S   +VKEQSICVLWNLSV
Subjt:  DREQAIIALWKYSLGGKKHIDAIMQFPGCINLTRFSCRKWS-----------------NRRDNWLAWSTVID---------SRAVQVKEQSICVLWNLSV

Query:  DEKLRMKIANTDILPLLSKNLDDEDMKVKEAAGGVLANLALSQCNHGLIVELGLIPKLAYLLKAEADSSK--------------TDEYYRILVIEEGLVP
        DEKLR KIAN DILPLLSKNLDDEDMKVKEAAGGVLANLALS CN G+IVE GLIPKLAY LKAEADSSK               DEYYRILVIEEGLVP
Subjt:  DEKLRMKIANTDILPLLSKNLDDEDMKVKEAAGGVLANLALSQCNHGLIVELGLIPKLAYLLKAEADSSK--------------TDEYYRILVIEEGLVP

Query:  VPIVGAAAYKSFRPGLHSWPSLPDGTDIEQSSKGPSRFGASELLLGLNVDNNANIEEDKINAIVGRTQQQFLARIGAIEFEDMRNSQSESSTSNHLTLLP
        VPI+GAAAYKSFRPGLHSWPSLPDGT+IEQSSK PSRFGASELLLGLNVDNNA IEE KINAIVGRTQQQFLARIGAIEFED+++SQSESSTSNHLTLLP
Subjt:  VPIVGAAAYKSFRPGLHSWPSLPDGTDIEQSSKGPSRFGASELLLGLNVDNNANIEEDKINAIVGRTQQQFLARIGAIEFEDMRNSQSESSTSNHLTLLP

Query:  WIDGVARLVLILELEDDNAILRAAESIADASINEHMRISFKEAGAIKHLVKLLDNMNNVVKWATIRALERLSISNVVCQTLENEGALGPLLSILKLSNIP
        WIDGVARLVLILELEDDNAI+RAAESIADASINEHMRISFKEAGAIKHLVK LD MNN VKWA ++ALERLSISNVVCQT+EN+GALGPLLSILKLS+I 
Subjt:  WIDGVARLVLILELEDDNAILRAAESIADASINEHMRISFKEAGAIKHLVKLLDNMNNVVKWATIRALERLSISNVVCQTLENEGALGPLLSILKLSNIP

Query:  EDAMEKTLDILSRILDPSKEMKSKFYNGPANGSHGGQHSERNSQASTSKDVLDAVVVSRLIEILNASSPNLKRKAASILEFISIMDPSMEIINSVEIESG
        E+ MEKTLDILSRILDPSKEMKSKFY+GP NGS GGQHSERN +AST KDVLD  VVSRL+EI N SSPNLKRKAASILEF+SIMDPSMEII+S+E+ESG
Subjt:  EDAMEKTLDILSRILDPSKEMKSKFYNGPANGSHGGQHSERNSQASTSKDVLDAVVVSRLIEILNASSPNLKRKAASILEFISIMDPSMEIINSVEIESG

Query:  LLAVFQLGVSIDSDAEDWPPERYALEVEEAGLSISAASRLLTKLLDSEKFQHKINSDHFTKLLRRILKSDIPIHHKDWVAACLVKVSTVSAPNRDSGDPV
        L AVFQLGVSIDSDAEDW PERYALEVEEAGL+ISAASRLLTKLLDSEKF +KINS HFTKLLRRILKSDIPI+HKDW+AACL KVS+VSA N DSGDP+
Subjt:  LLAVFQLGVSIDSDAEDWPPERYALEVEEAGLSISAASRLLTKLLDSEKFQHKINSDHFTKLLRRILKSDIPIHHKDWVAACLVKVSTVSAPNRDSGDPV

Query:  NMEVTLYETIPRLIQQIKSSFSMEVQESTVVELNGIVSEGIVDATRAVASNGGIFPLVKLIDEGSERAAEAALAILYNLSMDSENHPAILAAGAVPALRR
        NMEVTLYETIPRLIQQIKSSFSMEVQES VVELN IVS GIVDATRAVAS GGIFPLVKLIDEGSERA EAALAILYNLSMD+ENHPAI+AAGAVPALR+
Subjt:  NMEVTLYETIPRLIQQIKSSFSMEVQESTVVELNGIVSEGIVDATRAVASNGGIFPLVKLIDEGSERAAEAALAILYNLSMDSENHPAILAAGAVPALRR

Query:  IVLSQRAQWRQALHLLRTLPT
        IVLSQR QWRQAL+LLRTLPT
Subjt:  IVLSQRAQWRQALHLLRTLPT

TrEMBL top hitse value%identityAlignment
A0A1S3CU26 uncharacterized protein LOC1035045530.0e+0079.11Show/hide
Query:  GAVG--TKARGRSVFLYHLRYCSRYPESLSKRVIRRRVSSD-GGGAVDSTQQQSAAPDIKDVQNDSSSVGHSYVALFVRMLGLDHDPLDREQAIIALWKY
        GA+G   K    S+FLY LR+ S   +SLSKR+I  RVSSD GGGAVDS+Q QSA P IKDVQNDSSS+G SYVALFVRML LD+DPLDREQAIIALWKY
Subjt:  GAVG--TKARGRSVFLYHLRYCSRYPESLSKRVIRRRVSSD-GGGAVDSTQQQSAAPDIKDVQNDSSSVGHSYVALFVRMLGLDHDPLDREQAIIALWKY

Query:  SLGGKKHIDAIMQFPGCINLTRFSCRKWS-----------------NRRDNWLAWSTVID---------SRAVQVKEQSICVLWNLSVDEKLRMKIANTD
        SLGGKKHIDAIMQFPGCINL     R  S                 N     +A S  I+         S   +VKEQSICVLWNLSVDEKLR+KIANTD
Subjt:  SLGGKKHIDAIMQFPGCINLTRFSCRKWS-----------------NRRDNWLAWSTVID---------SRAVQVKEQSICVLWNLSVDEKLRMKIANTD

Query:  ILPLLSKNLDDEDMKVKEAAGGVLANLALSQCNHGLIVELGLIPKLAYLLKAEADSSK--------------TDEYYRILVIEEGLVPVPIVGAAAYKSF
        ILPLLSKNLDDEDMKVKEAAGGVLANLALS CNHG+IVE GLIPKLAY LKAEADSSK               DEYYRILVIEEGLVPVPI+GAAAYKSF
Subjt:  ILPLLSKNLDDEDMKVKEAAGGVLANLALSQCNHGLIVELGLIPKLAYLLKAEADSSK--------------TDEYYRILVIEEGLVPVPIVGAAAYKSF

Query:  RPGLHSWPSLPDGTDIEQSSKGPSRFGASELLLGLNVDNNANIEEDKINAIVGRTQQQFLARIGAIEFEDMRNSQSESSTSNHLTLLPWIDGVARLVLIL
        RPGLHSWPSLPDGT+IEQSS  PSRFGASELLLGLNVDNNANIEE KINAIVGRTQQQFLARIGAIE E+M++SQSESS+SNHLTLLPWIDGVARLVLIL
Subjt:  RPGLHSWPSLPDGTDIEQSSKGPSRFGASELLLGLNVDNNANIEEDKINAIVGRTQQQFLARIGAIEFEDMRNSQSESSTSNHLTLLPWIDGVARLVLIL

Query:  ELEDDNAILRAAESIADASINEHMRISFKEAGAIKHLVKLLDNMNNVVKWATIRALERLSISNVVCQTLENEGALGPLLSILKLSNIPEDAMEKTLDILS
        ELEDDNA++RAAESI DASINEHMRISFKEAGAIKHLV  LD MN+ VKWA ++ALERLSISNVVCQ +ENEGALGPLLSILKLS+IPE+ MEKTL+ILS
Subjt:  ELEDDNAILRAAESIADASINEHMRISFKEAGAIKHLVKLLDNMNNVVKWATIRALERLSISNVVCQTLENEGALGPLLSILKLSNIPEDAMEKTLDILS

Query:  RILDPSKEMKSKFYNGPANGSHGGQHSERNSQASTSKDVLDAVVVSRLIEILNASSPNLKRKAASILEFISIMDPSMEIINSVEIESGLLAVFQLGVSID
        RILDPSKEMKSKFY+GP NGS G QHSE N +AS  KD LDA VVSRL+EILN SSPNLKRKAASILEF+SIMDPSME+I+ VEIE GL AVFQLGVSID
Subjt:  RILDPSKEMKSKFYNGPANGSHGGQHSERNSQASTSKDVLDAVVVSRLIEILNASSPNLKRKAASILEFISIMDPSMEIINSVEIESGLLAVFQLGVSID

Query:  SDAEDWPPERYALEVEEAGLSISAASRLLTKLLDSEKFQHKINSDHFTKLLRRILKSDIPIHHKDWVAACLVKVSTVSAPNRDSGDPVNMEVTLYETIPR
        SDAE W PERYALEVEEAGL+ISAASRLLTKLLDSEKF +KINS  FTKLLR+ILKSDIPI+HKDW+AACL+KVS++S  N DSGDP+NMEVTLYETIPR
Subjt:  SDAEDWPPERYALEVEEAGLSISAASRLLTKLLDSEKFQHKINSDHFTKLLRRILKSDIPIHHKDWVAACLVKVSTVSAPNRDSGDPVNMEVTLYETIPR

Query:  LIQQIKSSFSMEVQESTVVELNGIVSEGIVDATRAVASNGGIFPLVKLIDEGSERAAEAALAILYNLSMDSENHPAILAAGAVPALRRIVLSQRAQWRQA
        LI+QIKSSFS+EVQES VVELN IVSEG+VDATRAVA  GGIFPLVKLIDEGSERA EAALAILYNLSMDSENHPAI+AAGAVPALRRIVLSQR QWRQA
Subjt:  LIQQIKSSFSMEVQESTVVELNGIVSEGIVDATRAVASNGGIFPLVKLIDEGSERAAEAALAILYNLSMDSENHPAILAAGAVPALRRIVLSQRAQWRQA

Query:  LHLLRTLPT
        L+LLRTLPT
Subjt:  LHLLRTLPT

A0A5D3E3M4 U-box domain-containing protein 40.0e+0080.3Show/hide
Query:  GAVG--TKARGRSVFLYHLRYCSRYPESLSKRVIRRRVSSD-GGGAVDSTQQQSAAP-------------DIKDVQNDSSSVGHSYVALFVRMLGLDHDP
        GA+G   K    S+FLY LR+ S   +SLSKR+I  RVSSD GGGAVDS+Q QSA P              IKDVQNDSSS+G SYVALFVRMLGLD+DP
Subjt:  GAVG--TKARGRSVFLYHLRYCSRYPESLSKRVIRRRVSSD-GGGAVDSTQQQSAAP-------------DIKDVQNDSSSVGHSYVALFVRMLGLDHDP

Query:  LDREQAIIALWKYSLGGKKHIDAIMQFPGCINLTRFSCRKWS-NRRDNWLAWSTVIDSRAVQVKEQSICVLWNLSVDEKLRMKIANTDILPLLSKNLDDE
        LDREQAIIALWKYSLGGKKHIDAIMQFPGCINL     R  S + R+   A + ++ S ++ VKEQSICVLWNLSVDEKLR+KIANTDILPLLSKNLDDE
Subjt:  LDREQAIIALWKYSLGGKKHIDAIMQFPGCINLTRFSCRKWS-NRRDNWLAWSTVIDSRAVQVKEQSICVLWNLSVDEKLRMKIANTDILPLLSKNLDDE

Query:  DMKVKEAAGGVLANLALSQCNHGLIVELGLIPKLAYLLKAEADSSK--------------TDEYYRILVIEEGLVPVPIVGAAAYKSFRPGLHSWPSLPD
        DMKVKEAAGGVLANLALS CNHG+IVE GLIPKLAY LKAEADSSK               DEYYRILVIEEGLVPVPI+G AAYKSFRPGLHSWPSLPD
Subjt:  DMKVKEAAGGVLANLALSQCNHGLIVELGLIPKLAYLLKAEADSSK--------------TDEYYRILVIEEGLVPVPIVGAAAYKSFRPGLHSWPSLPD

Query:  GTDIEQSSKGPSRFGASELLLGLNVDNNANIEEDKINAIVGRTQQQFLARIGAIEFEDMRNSQSESSTSNHLTLLPWIDGVARLVLILELEDDNAILRAA
        GT+IEQSSK PSRFGASELLLGLNVDNNANIE+ KINAIVGRTQQQFLARIGAIE EDM++SQSESS+SNHLTLLPWIDGVARLVLILELEDDNA++RAA
Subjt:  GTDIEQSSKGPSRFGASELLLGLNVDNNANIEEDKINAIVGRTQQQFLARIGAIEFEDMRNSQSESSTSNHLTLLPWIDGVARLVLILELEDDNAILRAA

Query:  ESIADASINEHMRISFKEAGAIKHLVKLLDNMNNVVKWATIRALERLSISNVVCQTLENEGALGPLLSILKLSNIPEDAMEKTLDILSRILDPSKEMKSK
        ESI DASINEHMRISFKEAGAIKHLVK LD MN+ VKWA ++ALERLSISNVVCQ +ENEGALGPLLSILKLS+IPE+ MEKTL+ILSR LDPSKEMKSK
Subjt:  ESIADASINEHMRISFKEAGAIKHLVKLLDNMNNVVKWATIRALERLSISNVVCQTLENEGALGPLLSILKLSNIPEDAMEKTLDILSRILDPSKEMKSK

Query:  FYNGPANGSHGGQHSERNSQASTSKDVLDAVVVSRLIEILNASSPNLKRKAASILEFISIMDPSMEIINSVEIESGLLAVFQLGVSIDSDAEDWPPERYA
        FY+GP NGS G QHSE N +AS  KD LDA VVSRL+EILN SSPNLKRKAASILEF+SIMDPSME+I+ VEIE GL AVFQLGVSIDSDAE W PERYA
Subjt:  FYNGPANGSHGGQHSERNSQASTSKDVLDAVVVSRLIEILNASSPNLKRKAASILEFISIMDPSMEIINSVEIESGLLAVFQLGVSIDSDAEDWPPERYA

Query:  LEVEEAGLSISAASRLLTKLLDSEKFQHKINSDHFTKLLRRILKSDIPIHHKDWVAACLVKVSTVSAPNRDSGDPVNMEVTLYETIPRLIQQIKSSFSME
        LEVEEAGL+ISAASRLLTKLLDSEKF +KINS HFTKLLRRILKSDIPI+HKDW+AACL++VS++S  N DSGDP+NMEVTLYETIPRLI+QIKSSFS+E
Subjt:  LEVEEAGLSISAASRLLTKLLDSEKFQHKINSDHFTKLLRRILKSDIPIHHKDWVAACLVKVSTVSAPNRDSGDPVNMEVTLYETIPRLIQQIKSSFSME

Query:  VQESTVVELNGIVSEGIVDATRAVASNGGIFPLVKLIDEGSERAAEAALAILYNLSMDSENHPAILAAGAVPALRRIVLSQRAQWRQALHLLRTLPT
        VQES VVELN IVSEG+VDATRAVA  GGIFPLVKLIDEGSERA EAALAILYNLSMDSENHPAI+AAGAVPALRRIVLSQR QWRQAL+LLRTLPT
Subjt:  VQESTVVELNGIVSEGIVDATRAVASNGGIFPLVKLIDEGSERAAEAALAILYNLSMDSENHPAILAAGAVPALRRIVLSQRAQWRQALHLLRTLPT

A0A6J1CY29 uncharacterized protein LOC1110156380.0e+0082.4Show/hide
Query:  TLLSTRKTVFAFGAVGTKARGRSVFLYHLRYCSRYPESLSKRVIRRRVSSDGGGAVDSTQQQSAAPDIKDVQNDSSSVGHSYVALFVRMLGLDHDPLDRE
        + LSTR  VF   A GTK R  SVFLYHL Y  RYPESLS R +RR VSSDGGGAVDST QQSAAPDIKDVQNDSSSVGHSYVALFVRMLGLDHDPLDRE
Subjt:  TLLSTRKTVFAFGAVGTKARGRSVFLYHLRYCSRYPESLSKRVIRRRVSSDGGGAVDSTQQQSAAPDIKDVQNDSSSVGHSYVALFVRMLGLDHDPLDRE

Query:  QAIIALWKYSLGGKKHIDAIMQFPGCINLTRFSCRKWS-----------------NRRDNWLAWSTVID---------SRAVQVKEQSICVLWNLSVDEK
        QA+IALWKYSLGGKKHIDAIM+FPGCINLT    +  S                 N   + +A S  I+         S A +VKEQSICVLWNLSVDEK
Subjt:  QAIIALWKYSLGGKKHIDAIMQFPGCINLTRFSCRKWS-----------------NRRDNWLAWSTVID---------SRAVQVKEQSICVLWNLSVDEK

Query:  LRMKIANTDILPLLSKNLDDEDMKVKEAAGGVLANLALSQCNHGLIVELGLIPKLAYLLKAEADSSK--------------TDEYYRILVIEEGLVPVPI
        LR+KIA+TDIL LLSKNLDDEDMKVKEAAGGVLANLALS CNHG+IVE GLIPKLAY LKAEADSSK               DEYYRILVIEEGLVPVPI
Subjt:  LRMKIANTDILPLLSKNLDDEDMKVKEAAGGVLANLALSQCNHGLIVELGLIPKLAYLLKAEADSSK--------------TDEYYRILVIEEGLVPVPI

Query:  VGAAAYKSFRPGLHSWPSLPDGTDIEQSSKGPSRFGASELLLGLNVDNNANIEEDKINAIVGRTQQQFLARIGAIEFEDMRNSQSESSTSNHLTLLPWID
        VGAAAYKSFRPGLHSWPSLPDGT+IEQSS GPSRFGASELLLGLNVDN ANIEE KINAIVGRTQQQFLARIGAIEFED+++SQ ESST NHLTLLPWID
Subjt:  VGAAAYKSFRPGLHSWPSLPDGTDIEQSSKGPSRFGASELLLGLNVDNNANIEEDKINAIVGRTQQQFLARIGAIEFEDMRNSQSESSTSNHLTLLPWID

Query:  GVARLVLILELEDDNAILRAAESIADASINEHMRISFKEAGAIKHLVKLLDNMNNVVKWATIRALERLSISNVVCQTLENEGALGPLLSILKLSNIPEDA
        GVARLVLILELEDD A++RAAESIADASINEHMRISFKEAGAIKHLVK+LDNMN+ VKWATIRALERLSISNVVCQT+ENEGALGPLLSIL+LSNIPE+ 
Subjt:  GVARLVLILELEDDNAILRAAESIADASINEHMRISFKEAGAIKHLVKLLDNMNNVVKWATIRALERLSISNVVCQTLENEGALGPLLSILKLSNIPEDA

Query:  MEKTLDILSRILDPSKEMKSKFYNGPANGSHGGQHSERNSQASTSKDVLDAVVVSRLIEILNASSPNLKRKAASILEFISIMDPSMEIINSVEIESGLLA
        MEKTLDIL RILDPSKEMKSKFYNGP NGSHGGQHSE  S+AST KDVLDAV VS L+EILN SSPNLKRKAASILEF+SIMDPSMEII+S+ IESGLLA
Subjt:  MEKTLDILSRILDPSKEMKSKFYNGPANGSHGGQHSERNSQASTSKDVLDAVVVSRLIEILNASSPNLKRKAASILEFISIMDPSMEIINSVEIESGLLA

Query:  VFQLGVSIDSDAEDWPPERYALEVEEAGLSISAASRLLTKLLDSEKFQHKINSDHFTKLLRRILKSDIPIHHKDWVAACLVKVSTVSAPNRDSGDPVNME
        VFQLGVSIDSDAEDW PERYALEVEEAGL+ISAASRLLTKLLDSEKF +KINS HFTKLLR+ILKSDIPIHHKDW+AACLVKVS++SA N DSG+P+NME
Subjt:  VFQLGVSIDSDAEDWPPERYALEVEEAGLSISAASRLLTKLLDSEKFQHKINSDHFTKLLRRILKSDIPIHHKDWVAACLVKVSTVSAPNRDSGDPVNME

Query:  VTLYETIPRLIQQIKSSFSMEVQESTVVELNGIVSEGIVDATRAVASNGGIFPLVKLIDEGSERAAEAALAILYNLSMDSENHPAILAAGAVPALRRIVL
        VTLYETIPRLIQQIK SFSME QES+VVELN I+SEGIVDATRAVAS GGIFPLVKLIDEGSERA EAALAILYNLSMDSENHPAILAAGAVPALRRIVL
Subjt:  VTLYETIPRLIQQIKSSFSMEVQESTVVELNGIVSEGIVDATRAVASNGGIFPLVKLIDEGSERAAEAALAILYNLSMDSENHPAILAAGAVPALRRIVL

Query:  SQRAQWRQALHLLRTLPT
        SQRA+WR+ALHLLRTLPT
Subjt:  SQRAQWRQALHLLRTLPT

A0A6J1FN67 uncharacterized protein LOC1114454460.0e+0078.24Show/hide
Query:  LLSTRKTVFAFGAVGT-KARGRSVFLYHLRYCSRYPESLSKRVIRRRVSSDGGGAVDSTQQQSAAPDIKDVQNDSSSVGHSYVALFVRMLGLDHDPLDRE
        L STR  +FA GA GT K R  S+ L+ LR+ + +     KRV+ RRVSSDGGGA+DSTQQQSAA DI+DV NDSSSVGHSYVALFVRMLGLDHDPLDRE
Subjt:  LLSTRKTVFAFGAVGT-KARGRSVFLYHLRYCSRYPESLSKRVIRRRVSSDGGGAVDSTQQQSAAPDIKDVQNDSSSVGHSYVALFVRMLGLDHDPLDRE

Query:  QAIIALWKYSLGGKKHIDAIMQFPGCINLTRFSCRKWS-----------------NRRDNWLAWSTVID---------SRAVQVKEQSICVLWNLSVDEK
        QAI+ALWKYSLGGKKHIDAIMQFPGCINLT    R  S                 N   + +A S  I+         S   +VKEQSICVLWNLSVDEK
Subjt:  QAIIALWKYSLGGKKHIDAIMQFPGCINLTRFSCRKWS-----------------NRRDNWLAWSTVID---------SRAVQVKEQSICVLWNLSVDEK

Query:  LRMKIANTDILPLLSKNLDDEDMKVKEAAGGVLANLALSQCNHGLIVELGLIPKLAYLLKAEADSSKT--------------DEYYRILVIEEGLVPVPI
        LR KIANTDILPLL KNLDDEDMKVKEAAGGV+ANL LS CNHG+IVE GLIPKLAY LKAEADSSKT              DEYYRILVIEEGLVPVP+
Subjt:  LRMKIANTDILPLLSKNLDDEDMKVKEAAGGVLANLALSQCNHGLIVELGLIPKLAYLLKAEADSSKT--------------DEYYRILVIEEGLVPVPI

Query:  VGAAAYKSFRPGLHSWPSLPDGTDIEQSSKGPSRFGASELLLGLNVDNNANIEEDKINAIVGRTQQQFLARIGAIEFEDMRNSQSESSTSNHLTLLPWID
        +GAAAYKSF+PGLHSWPSLPDGT+IE+SSK PSRFGASELLLGLNVDNN NI+E KINAIVGR+QQQFLARIGAIE ED++++QSESSTSNHLTLLPW D
Subjt:  VGAAAYKSFRPGLHSWPSLPDGTDIEQSSKGPSRFGASELLLGLNVDNNANIEEDKINAIVGRTQQQFLARIGAIEFEDMRNSQSESSTSNHLTLLPWID

Query:  GVARLVLILELEDDNAILRAAESIADASINEHMRISFKEAGAIKHLVKLLDNMNNVVKWATIRALERLSISNVVCQTLENEGALGPLLSILKLSNIPEDA
        GVARLVL+LELEDDNA +R AE IADASINEHMR+SFKEAGAIKHLVKLLD+MNN VKWA+I+ALERLSISNVVCQT+ENEGAL PLLSILKLS+IPE+ 
Subjt:  GVARLVLILELEDDNAILRAAESIADASINEHMRISFKEAGAIKHLVKLLDNMNNVVKWATIRALERLSISNVVCQTLENEGALGPLLSILKLSNIPEDA

Query:  MEKTLDILSRILDPSKEMKSKFYNGPANGSHGGQHSERNSQASTSKDVLDAVVVSRLIEILNASSPNLKRKAASILEFISIMDPSMEIINSVEIESGLLA
        MEKTLDILSRILDPSKEMKSKFY+GP NGS GGQHSERNS+AST KDVLDAVVVS L+EIL  SSPNLKRKAASILEFISIMDPSM+II+ +EIESGL A
Subjt:  MEKTLDILSRILDPSKEMKSKFYNGPANGSHGGQHSERNSQASTSKDVLDAVVVSRLIEILNASSPNLKRKAASILEFISIMDPSMEIINSVEIESGLLA

Query:  VFQLGVSIDSDAEDWPPERYALEVEEAGLSISAASRLLTKLLDSEKFQHKINSDHFTKLLRRILKSDIPIHHKDWVAACLVKVSTVSAPNRDSGDPVNME
        VFQLGVSID+DAEDWPPER+ALEVEEAGL+ISAASRLLTKL+DSEKF +KIN+  FT+ LRRILK DIPI HKDW+AACL+KVS+V+AP+ DSGDP++ME
Subjt:  VFQLGVSIDSDAEDWPPERYALEVEEAGLSISAASRLLTKLLDSEKFQHKINSDHFTKLLRRILKSDIPIHHKDWVAACLVKVSTVSAPNRDSGDPVNME

Query:  VTLYETIPRLIQQIKSSFSMEVQESTVVELNGIVSEGIVDATRAVASNGGIFPLVKLIDEGSERAAEAALAILYNLSMDSENHPAILAAGAVPALRRIVL
        V LYETIPRLIQQ+KSS SMEVQES VVELN IVSEG+VDATRAVAS GGIFPLVKLIDEGSERA EAALA+LYNLSMD+ENHPAILAAGAVPALRRIVL
Subjt:  VTLYETIPRLIQQIKSSFSMEVQESTVVELNGIVSEGIVDATRAVASNGGIFPLVKLIDEGSERAAEAALAILYNLSMDSENHPAILAAGAVPALRRIVL

Query:  SQRAQWRQALHLLRTLPT
        SQR QW+QAL+LLRTLPT
Subjt:  SQRAQWRQALHLLRTLPT

A0A6J1IEY0 uncharacterized protein LOC1114721020.0e+0077.53Show/hide
Query:  LLSTRKTVFAFGAVGT-KARGRSVFLYHLRYCSRYPESLSKRVIRRRVSSDGGGAVDSTQQQSAAPDIKDVQNDSSSVGHSYVALFVRMLGLDHDPLDRE
        L STR  +FA GA+GT K R  S+ L+ LR+ + +     KRV+R RVSSDGGGA+DSTQQQSA  DI+DV NDSSSVGHSYVALF+RMLGLDHDPLDRE
Subjt:  LLSTRKTVFAFGAVGT-KARGRSVFLYHLRYCSRYPESLSKRVIRRRVSSDGGGAVDSTQQQSAAPDIKDVQNDSSSVGHSYVALFVRMLGLDHDPLDRE

Query:  QAIIALWKYSLGGKKHIDAIMQFPGCINLTRFSCRKWSNR------------------RDNWLAWSTVID---------SRAVQVKEQSICVLWNLSVDE
        QAI+ALWKYSLGGKKHIDAIMQFPGC+NLT    R  S                    RD+ +A S  I+         S   +VKEQSICVLWNLSVDE
Subjt:  QAIIALWKYSLGGKKHIDAIMQFPGCINLTRFSCRKWSNR------------------RDNWLAWSTVID---------SRAVQVKEQSICVLWNLSVDE

Query:  KLRMKIANTDILPLLSKNLDDEDMKVKEAAGGVLANLALSQCNHGLIVELGLIPKLAYLLKAEADSSKT--------------DEYYRILVIEEGLVPVP
        KLR KIANTDILPLL KNLDDEDMKVKEAAGGV+ANL LS CNHG+IVE GLIPKLAY LKAEADSSKT              DEYYRILVIEEGLVPVP
Subjt:  KLRMKIANTDILPLLSKNLDDEDMKVKEAAGGVLANLALSQCNHGLIVELGLIPKLAYLLKAEADSSKT--------------DEYYRILVIEEGLVPVP

Query:  IVGAAAYKSFRPGLHSWPSLPDGTDIEQSSKGPSRFGASELLLGLNVDNNANIEEDKINAIVGRTQQQFLARIGAIEFEDMRNSQSESSTSNHLTLLPWI
        ++GAAAYKSF+PGLHSWPSLPDGT+IE+SSK PSRFGASELLLGLNVDNNANIEE KINAI+GR+QQ FLARIGAIE ED++++QSESSTSNHLTLLPW 
Subjt:  IVGAAAYKSFRPGLHSWPSLPDGTDIEQSSKGPSRFGASELLLGLNVDNNANIEEDKINAIVGRTQQQFLARIGAIEFEDMRNSQSESSTSNHLTLLPWI

Query:  DGVARLVLILELEDDNAILRAAESIADASINEHMRISFKEAGAIKHLVKLLDNMNNVVKWATIRALERLSISNVVCQTLENEGALGPLLSILKLSNIPED
        DGVARLVL+LELEDDNA +R AE IADASINEHMR+SFKEAGAIKHLVKLLD+MNN VKWA+I+ALERLSISNVVCQT+ENEGAL PLLSILKLS+IPE+
Subjt:  DGVARLVLILELEDDNAILRAAESIADASINEHMRISFKEAGAIKHLVKLLDNMNNVVKWATIRALERLSISNVVCQTLENEGALGPLLSILKLSNIPED

Query:  AMEKTLDILSRILDPSKEMKSKFYNGPANGSHGGQHSERNSQASTSKDVLDAVVVSRLIEILNASSPNLKRKAASILEFISIMDPSMEIINSVEIESGLL
         MEKTLDILSRILDPSKEMKSKFY+GP NGS GGQHSERNS+AST KDVLDAVVVS L+EIL  SSPNLKRKAASILEFISIMDPSM+II+ +EIESGL 
Subjt:  AMEKTLDILSRILDPSKEMKSKFYNGPANGSHGGQHSERNSQASTSKDVLDAVVVSRLIEILNASSPNLKRKAASILEFISIMDPSMEIINSVEIESGLL

Query:  AVFQLGVSIDSDAEDWPPERYALEVEEAGLSISAASRLLTKLLDSEKFQHKINSDHFTKLLRRILKSDIPIHHKDWVAACLVKVSTVSAPNRDSGDPVNM
        AVFQLGVSID+DAEDWPPER+ALEVEEAGL+ISAASRLLTKL+DSEKF +KIN+  FT+ LR+ILK DIP+ HKDW+AACL+KVS+V+A + DSGDP++M
Subjt:  AVFQLGVSIDSDAEDWPPERYALEVEEAGLSISAASRLLTKLLDSEKFQHKINSDHFTKLLRRILKSDIPIHHKDWVAACLVKVSTVSAPNRDSGDPVNM

Query:  EVTLYETIPRLIQQIKSSFSMEVQESTVVELNGIVSEGIVDATRAVASNGGIFPLVKLIDEGSERAAEAALAILYNLSMDSENHPAILAAGAVPALRRIV
        EV LYETIPRLIQ++KSS SMEVQES VVELN IVSEG+VDATRAVAS GGIFPLVKLIDEGSERA EAALAILYNLSMD+ENHPAILAAGAVPALRRIV
Subjt:  EVTLYETIPRLIQQIKSSFSMEVQESTVVELNGIVSEGIVDATRAVASNGGIFPLVKLIDEGSERAAEAALAILYNLSMDSENHPAILAAGAVPALRRIV

Query:  LSQRAQWRQALHLLRTLPT
        LSQR QW+QAL+LLRTLPT
Subjt:  LSQRAQWRQALHLLRTLPT

SwissProt top hitse value%identityAlignment
O64885 Protein BCCIP homolog6.3e-9258.15Show/hide
Query:  MPRRPTKGRRLLGTRPLTFSSFARAL------------------------------VWNSPEAQNESSDSPSSDEE--FEEVQADFAFFDPKPDDFHGVK
        MPRRP+ GRR+L  +PLTFS F R L                              + N P    E   S SSDEE   E+VQADF FFDPKP DFHGVK
Subjt:  MPRRPTKGRRLLGTRPLTFSSFARAL------------------------------VWNSPEAQNESSDSPSSDEE--FEEVQADFAFFDPKPDDFHGVK

Query:  ILLQNYLDDIQWDLSGFSDLILGQTTVGTVVKVEGDEDNGVFSFITALNLGRYKDSRCFLELKEYLVKVCQEKDVKNDLKLLLEEQASGVGLLVSQRIVN
        ILLQNYLDD +WDLS F D IL QTTVGTVVKV  DED  VF+ +TALN+ R KD++CF ELKE+L KVC EK++ N+L++LLE++A  VGLLVSQR++N
Subjt:  ILLQNYLDDIQWDLSGFSDLILGQTTVGTVVKVEGDEDNGVFSFITALNLGRYKDSRCFLELKEYLVKVCQEKDVKNDLKLLLEEQASGVGLLVSQRIVN

Query:  LPPQLLPPLYDALFDEISWAIEDEPTKELQDSFRFKMYILVSKIYKLKNTNHKK-RPATETDESIIYVKAEDEIFHKLCSWSFCFPLHIQPQTTNELKGY
        LPPQLLPPLYD LFDE+SWAIEDEPT++L+ SFRFK Y+LV+KIYKLKN   +K R   E  E  +++K EDE+F +L SWSF FP+  Q  T+ E+K Y
Subjt:  LPPQLLPPLYDALFDEISWAIEDEPTKELQDSFRFKMYILVSKIYKLKNTNHKK-RPATETDESIIYVKAEDEIFHKLCSWSFCFPLHIQPQTTNELKGY

Query:  QLMGMVMAVEASK
        QLMG+VMAVEA+K
Subjt:  QLMGMVMAVEASK

Q2U600 Protein bcp13.4e-2935.94Show/hide
Query:  DSPSSDEEFEEVQADFAFFDPKPDDFHGVKILLQNYLD-DIQ-WDLSGFSDLILGQTTVGTVVKVEGDEDNGVFSFITALNLGRYKDSRCFLELKEYLV-
        D   SDE+ + V  DF +FDP+P DFHG+KILL+   D D Q +D+S  SD+IL Q  +G+ VKV+G+E +  ++F+T LNL  +KD     +L  YL  
Subjt:  DSPSSDEEFEEVQADFAFFDPKPDDFHGVKILLQNYLD-DIQ-WDLSGFSDLILGQTTVGTVVKVEGDEDNGVFSFITALNLGRYKDSRCFLELKEYLV-

Query:  KVCQEKDVKNDLKLLLEEQASGVGLLVSQRIVNLPPQLLPPLYDALFDEISWAIEDEPTKELQDSFRFKMYILVSKIYK-------LKNTNHKKRPATET
        K     D+    +LL +     +GL++++R++N+P +++PP+Y  L +EI+WAI+D      ++ ++F  Y++VSK Y+       ++++  +K+     
Subjt:  KVCQEKDVKNDLKLLLEEQASGVGLLVSQRIVNLPPQLLPPLYDALFDEISWAIEDEPTKELQDSFRFKMYILVSKIYK-------LKNTNHKKRPATET

Query:  DE-SIIYVKAEDEIFHK
        D+    Y   EDEI  K
Subjt:  DE-SIIYVKAEDEIFHK

Q4HZK7 Protein BCP12.3e-2533.93Show/hide
Query:  PEAQNESSDSPSSDEEFEEVQADFAFFDPKPD-DFHGVKILLQNY--LDDIQWDLSGFSDLILGQTTVGTVVKVEGDEDNGVFSFITALNLGRYKDSRCF
        P   N   +  S DE+F+ V  DF +F+  P+ DFHG K LL+    +D   +++S  +DL+L Q T+G+ +KV+G + N  ++ +T LN   ++D    
Subjt:  PEAQNESSDSPSSDEEFEEVQADFAFFDPKPD-DFHGVKILLQNY--LDDIQWDLSGFSDLILGQTTVGTVVKVEGDEDNGVFSFITALNLGRYKDSRCF

Query:  LELKEYLVKVCQEKDVKNDLKLLLEEQASG--VGLLVSQRIVNLPPQLLPPLYDALFDEISWAIEDEPTKELQDSFRFKMYILVSKIY-----KLKNTNH
         ++ +YLV+  Q     + L  + +  +SG  VGL+ S+R++N+P +L PPLY  L DE+  A+ED      ++ + F  Y+++SK Y     KL   N 
Subjt:  LELKEYLVKVCQEKDVKNDLKLLLEEQASG--VGLLVSQRIVNLPPQLLPPLYDALFDEISWAIEDEPTKELQDSFRFKMYILVSKIY-----KLKNTNH

Query:  KKRPATETDESIIYVKAEDEIFHK
        K++ A E +  + Y   EDE+ HK
Subjt:  KKRPATETDESIIYVKAEDEIFHK

Q4WVS2 Protein bcp16.5e-2832.66Show/hide
Query:  DSPSSDEEFEEVQADFAFFDPKPD-DFHGVKILLQNYLD-DIQ-WDLSGFSDLILGQTTVGTVVKVEGDEDNGVFSFITALNLGRYKDSRCFLELKEYLV
        +   SDE  + V  DF +FDP+P  DFHG+K LL+   D D Q +DLS  +DLIL Q  +G+ VKV+G+E +  ++F+T LNL  +KD     +L  YL 
Subjt:  DSPSSDEEFEEVQADFAFFDPKPD-DFHGVKILLQNYLD-DIQ-WDLSGFSDLILGQTTVGTVVKVEGDEDNGVFSFITALNLGRYKDSRCFLELKEYLV

Query:  KVCQEKDVKNDL-KLLLEEQASGVGLLVSQRIVNLPPQLLPPLYDALFDEISWAIEDEPTKELQDSFRFKMYILVSKIYK--------LKNTNHKKRPAT
        +          L +LL +     +GL++++R++N+P +++PP+Y  L +EI+WAIED      ++ + F  Y++VSK Y+         ++   KK+  +
Subjt:  KVCQEKDVKNDL-KLLLEEQASGVGLLVSQRIVNLPPQLLPPLYDALFDEISWAIEDEPTKELQDSFRFKMYILVSKIYK--------LKNTNHKKRPAT

Query:  ETDESIIYVKAEDEIF--HKLCSWSFCFPLHIQPQTTNELKGYQLMGM
         ++    +   EDE+F  H +CS S  +        ++  + +Q +G+
Subjt:  ETDESIIYVKAEDEIF--HKLCSWSFCFPLHIQPQTTNELKGYQLMGM

Q5AXW5 Protein bcp16.7e-2531.02Show/hide
Query:  SSDEEFEEVQADFAFFDPKPDDFHGVKILLQNYLD-DIQ-WDLSGFSDLILGQTTVGTVVKVEGDEDNGVFSFITALNLGRYKDSRCFLELKEYLVKVCQ
        SSDE+ + V  DF +FDP+  DFHG+K L++   D D Q  DLSG +D+IL Q  +G+ VK +G +D+  ++F+T LNL  + D      L  Y+ +   
Subjt:  SSDEEFEEVQADFAFFDPKPDDFHGVKILLQNYLD-DIQ-WDLSGFSDLILGQTTVGTVVKVEGDEDNGVFSFITALNLGRYKDSRCFLELKEYLVKVCQ

Query:  EKDVKNDL-KLLLEEQASGVGLLVSQRIVNLPPQLLPPLYDALFDEISWAIEDEPTKELQDSFRFKMYILVSKIY-----KL----KNTNHKKRPATETD
             + L +L  +     +GL++++R++N+P +++PP+Y  L +EI+WAIE+      ++ + F  Y++VSK Y     KL         KK+ A +  
Subjt:  EKDVKNDL-KLLLEEQASGVGLLVSQRIVNLPPQLLPPLYDALFDEISWAIEDEPTKELQDSFRFKMYILVSKIY-----KL----KNTNHKKRPATETD

Query:  ESIIYVKAEDEIF--HKLCSWSFCFPLHIQPQTTNELKGYQLMGM
            +   EDE+   H LC   + +        ++  + +Q +G+
Subjt:  ESIIYVKAEDEIF--HKLCSWSFCFPLHIQPQTTNELKGYQLMGM

Arabidopsis top hitse value%identityAlignment
AT1G23180.1 ARM repeat superfamily protein5.7e-21355.16Show/hide
Query:  TQQQSAAPDIKDVQNDSSS-VGHSYVALFVRMLGLDHDPLDREQAIIALWKYSLGGKKHIDAIMQFPGCINL--------TRFSCR--------------
        T   +   D ++V+++SSS VG SYV LFV MLGLD+DPLDREQAI  LWKYSLGGKK IDAIMQF GC+NL        +  +C               
Subjt:  TQQQSAAPDIKDVQNDSSS-VGHSYVALFVRMLGLDHDPLDREQAIIALWKYSLGGKKHIDAIMQFPGCINL--------TRFSCR--------------

Query:  -KWSNRRDNWLAWSTVIDSR---AVQVKEQSICVLWNLSVDEKLRMKIANTDILPLLSKNLDDEDMKVKEAAGGVLANLALSQCNHGLIVELGLIPKLAY
         + S      L   T + SR   A  VKEQ IC LWNL+VDE++R K+A+ DIL LL   L+D+D+ VKEAAGGVLANLALS+  H ++VE+G+IPKLA 
Subjt:  -KWSNRRDNWLAWSTVIDSR---AVQVKEQSICVLWNLSVDEKLRMKIANTDILPLLSKNLDDEDMKVKEAAGGVLANLALSQCNHGLIVELGLIPKLAY

Query:  LLKAEADSSK-----------------TDEYYRILVIEEGLVPVPIVGAAAYKSFRPGLHSWPSLPDGTDIEQSSKGPSRFGASELLLGLNVDNNA-NIE
        LLKA+   +K                  DEYYRILVIEEG+VP+PI+GA AYKSFRP L+SWPSLPDG +IEQ++K PSRFGASELLLGLNVD N  +++
Subjt:  LLKAEADSSK-----------------TDEYYRILVIEEGLVPVPIVGAAAYKSFRPGLHSWPSLPDGTDIEQSSKGPSRFGASELLLGLNVDNNA-NIE

Query:  EDKINAIVGRTQQQFLARIGAIEFEDMRNSQSESSTSNHLTLLPWIDGVARLVLILELEDDNAILRAAESIADASINEHMRISFKEAGAIKHLVKLLDNM
        E K+ AIVGRT QQFLARIGAIEFE    S+     S  LTLLP +DGVARLVLIL L D+ A  RAAESIADASINE MR+SF EAGA+K LV+LL N 
Subjt:  EDKINAIVGRTQQQFLARIGAIEFEDMRNSQSESSTSNHLTLLPWIDGVARLVLILELEDDNAILRAAESIADASINEHMRISFKEAGAIKHLVKLLDNM

Query:  N-NVVKWATIRALERLSISNVVCQTLENEGALGPLLSILKLSNIPEDAMEKTLDILSRILDPSKEMKSKFYNGPANGSHGGQHSERNSQASTSKDVLDAV
        N   VK   IRAL+ LS+S  VCQ +E EGA+  L+++LK   I  +  E  LDI++ ILDPSKEM+SKFY GP NG          S+A + K+VLDA 
Subjt:  N-NVVKWATIRALERLSISNVVCQTLENEGALGPLLSILKLSNIPEDAMEKTLDILSRILDPSKEMKSKFYNGPANGSHGGQHSERNSQASTSKDVLDAV

Query:  VVSRLIEILNASSPNLKRKAASILEFISIMDPSMEIINSVEIESGLLAVFQLGVSIDSDAEDWPPERYALEVEEAGLSISAASRLLTKLLDSEKFQHKIN
        V SRL++I   +SPNL R A S++EF  I +P+M+ I S +I + L    +  V  + + E    E++ L++EEAGL+ISAASRLLTKLLDSE F+  I+
Subjt:  VVSRLIEILNASSPNLKRKAASILEFISIMDPSMEIINSVEIESGLLAVFQLGVSIDSDAEDWPPERYALEVEEAGLSISAASRLLTKLLDSEKFQHKIN

Query:  SDHFTKLLRRILKSDIPIHHKDWVAACLVKVSTVSAPNRDSGDPVNMEVTLYETIPRLIQQIKSSFSMEVQESTVVELNGIVSEGIVDATRAVASNGGIF
        +  F +L+R+IL+S +P+H+KDWVAACLVK++ +S+P++   +P+N+EVTLY+TIP L++Q+  S S E +E+ V+ELN IVSEG+ ++ + +AS GGI 
Subjt:  SDHFTKLLRRILKSDIPIHHKDWVAACLVKVSTVSAPNRDSGDPVNMEVTLYETIPRLIQQIKSSFSMEVQESTVVELNGIVSEGIVDATRAVASNGGIF

Query:  PLVKLIDEGSERAAEAALAILYNLSMDSENHPAILAAGAVPALRRIVLSQRAQWRQALHLLRTLP
        PLVKL++E +ER  EA+L++LYNL+MDSENH AI+ AGAVP LRRIV+SQR QW +AL LLR LP
Subjt:  PLVKLIDEGSERAAEAALAILYNLSMDSENHPAILAAGAVPALRRIVLSQRAQWRQALHLLRTLP

AT1G70760.1 inorganic carbon transport protein-related9.3e-0666.67Show/hide
Query:  MKYPWSTPQDPSQIKNAYLKFPFAKPEDYD
        MK PW  P DP  IKN YLK+P+A PEDYD
Subjt:  MKYPWSTPQDPSQIKNAYLKFPFAKPEDYD

AT1G77460.1 Armadillo/beta-catenin-like repeat ; C2 calcium/lipid-binding domain (CaLB) protein6.0e-0538.96Show/hide
Query:  SMEVQESTVVELNGIVSEGIVDATRAVASNGGIFPLVKLIDEGSERAAEAALAILYNLSMDSEN-HPAILAAGAVPA
        S E  +   VE+  I++  + D+  AV + GGI PLV+L++ GS++A E A  IL+NL   SE     +  AG +PA
Subjt:  SMEVQESTVVELNGIVSEGIVDATRAVASNGGIFPLVKLIDEGSERAAEAALAILYNLSMDSEN-HPAILAAGAVPA

AT2G44510.1 CDK inhibitor P21 binding protein4.5e-9358.15Show/hide
Query:  MPRRPTKGRRLLGTRPLTFSSFARAL------------------------------VWNSPEAQNESSDSPSSDEE--FEEVQADFAFFDPKPDDFHGVK
        MPRRP+ GRR+L  +PLTFS F R L                              + N P    E   S SSDEE   E+VQADF FFDPKP DFHGVK
Subjt:  MPRRPTKGRRLLGTRPLTFSSFARAL------------------------------VWNSPEAQNESSDSPSSDEE--FEEVQADFAFFDPKPDDFHGVK

Query:  ILLQNYLDDIQWDLSGFSDLILGQTTVGTVVKVEGDEDNGVFSFITALNLGRYKDSRCFLELKEYLVKVCQEKDVKNDLKLLLEEQASGVGLLVSQRIVN
        ILLQNYLDD +WDLS F D IL QTTVGTVVKV  DED  VF+ +TALN+ R KD++CF ELKE+L KVC EK++ N+L++LLE++A  VGLLVSQR++N
Subjt:  ILLQNYLDDIQWDLSGFSDLILGQTTVGTVVKVEGDEDNGVFSFITALNLGRYKDSRCFLELKEYLVKVCQEKDVKNDLKLLLEEQASGVGLLVSQRIVN

Query:  LPPQLLPPLYDALFDEISWAIEDEPTKELQDSFRFKMYILVSKIYKLKNTNHKK-RPATETDESIIYVKAEDEIFHKLCSWSFCFPLHIQPQTTNELKGY
        LPPQLLPPLYD LFDE+SWAIEDEPT++L+ SFRFK Y+LV+KIYKLKN   +K R   E  E  +++K EDE+F +L SWSF FP+  Q  T+ E+K Y
Subjt:  LPPQLLPPLYDALFDEISWAIEDEPTKELQDSFRFKMYILVSKIYKLKNTNHKK-RPATETDESIIYVKAEDEIFHKLCSWSFCFPLHIQPQTTNELKGY

Query:  QLMGMVMAVEASK
        QLMG+VMAVEA+K
Subjt:  QLMGMVMAVEASK

AT5G03830.1 CDK inhibitor P21 binding protein5.4e-5454.17Show/hide
Query:  MPRRPTKGRRLLGTRPLTFSSFARALVWNSPEAQNESSDSPSSDEEFEEVQADFAFFDPKPDDFHGVKILLQNYLDDIQWDL-SGFSDLILGQTTVGTVV
        MPRRP+  RRL   +PL FSSF R+L + S   +       S DE+F  +         K  +   ++    +  +D Q D+ +GF DLIL QT +GTVV
Subjt:  MPRRPTKGRRLLGTRPLTFSSFARALVWNSPEAQNESSDSPSSDEEFEEVQADFAFFDPKPDDFHGVKILLQNYLDDIQWDL-SGFSDLILGQTTVGTVV

Query:  KVEGDEDNGVFSFITALNLGRYKDSRCFLELKEYLVKVCQEKDVKNDLKLLLEEQASGVGLLVSQRIVNLPPQLLPPLYDALFDEISWAIEDEPTKELQD
        KV  DED  +F+ ITALNL RYK+++CF ELKE+L+KVC EK+V +DL+LLLE++A  VGLLVSQR++NLP QLLPPLYD LFDE+SWA EDEPT+EL+ 
Subjt:  KVEGDEDNGVFSFITALNLGRYKDSRCFLELKEYLVKVCQEKDVKNDLKLLLEEQASGVGLLVSQRIVNLPPQLLPPLYDALFDEISWAIEDEPTKELQD

Query:  SFRFKMYILVSKIYKL
        SFRFK YIL++KIYK+
Subjt:  SFRFKMYILVSKIYKL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGTATCCATGGTCGACGCCGCAAGACCCTTCTCAGATAAAAAATGCATACCTGAAGTTCCCATTTGCCAAGCCTGAAGATTACGACTACACACTTGAGATAATGAG
ATCTATACCTCCTTCCAAGAAAGCTACATATGGAGCCATACCTAGGGTAGTTTGCCGCCCCTTCTCTCCCCTCTCCCTCCTTCGGCCGCACGCGGACCCCAGCCCCCTCC
CGCGCGCTACTCTCTCTCTCCCCATCGGACGCACGCCGAACCCCCCAATAACCCCCTCCTCCTCACCGTCTCCCTCCTCTTACTGGCTCCTGCCGGCGACCACCCACCGG
CGACGCCACACCAAGGACGTGACACCACGGCCGGCGCACCCTCCGACGATCCGCGCGCGACCGAAACTACTCGACGGCGGAATATCTCTACGCGACGACCCTTCTCCGGC
GACGACCTGCGACCACGAAACTCCGGCGACGCATTCCGGCTACCACGACGCACGACGCACGACGGACCCACGAGCCACGGCTACTGTGCTACTCTTCCTCACACGATCAG
ATCCAGGTGACCGACTCCTACGGAACAGACTCAAAAATCAGCGCAGCCTTCGGCGACTAACTCGATCCTGTGACAAAACCCACTCGTCGGCGAGCCCGATCTGCAGCACC
AGCGACCGACTTGTTCAATTTTTAAAGATGCCCCGAAGGCCAACAAAGGGGCGCCGACTGCTGGGAACTCGACCTTTAACTTTTTCCTCATTTGCTCGAGCTCTTGTATG
GAATAGTCCTGAGGCTCAGAATGAATCATCTGACTCTCCTTCCAGTGATGAAGAGTTTGAAGAAGTTCAAGCGGATTTTGCCTTTTTTGATCCAAAACCTGACGACTTTC
ATGGGGTGAAGATATTGCTGCAAAACTATTTAGATGACATACAGTGGGATTTAAGTGGATTTTCTGATTTAATATTAGGACAAACCACAGTTGGTACTGTTGTAAAAGTG
GAGGGGGATGAAGACAATGGAGTATTCTCTTTTATCACAGCCCTTAACTTGGGAAGATATAAGGACAGTAGATGTTTCCTGGAGCTCAAGGAGTATCTCGTCAAAGTTTG
CCAGGAAAAGGATGTAAAGAATGATTTGAAGCTGCTTTTGGAGGAGCAAGCATCCGGTGTCGGTCTCCTAGTCTCTCAAAGAATAGTTAATCTTCCTCCACAACTTCTGC
CTCCACTTTATGATGCCCTTTTCGATGAAATCTCATGGGCTATTGAGGATGAGCCAACCAAAGAGCTCCAAGATTCCTTCCGCTTCAAAATGTATATTTTAGTCAGTAAA
ATTTACAAGCTCAAGAATACAAATCATAAGAAAAGACCGGCTACTGAAACAGATGAGTCAATTATCTATGTGAAGGCAGAAGATGAAATATTTCACAAGCTGTGCTCATG
GTCCTTCTGCTTTCCCTTGCACATCCAACCACAGACGACTAATGAGCTAAAAGGTTATCAGTTAATGGGAATGGTAATGGCTGTTGAAGCTAGCAAAACACTCTTAAGCA
CTCGCAAGACGGTTTTTGCATTTGGCGCTGTGGGAACTAAAGCCAGAGGCCGCTCGGTCTTTCTCTATCATCTACGTTATTGCAGTCGCTACCCAGAATCTCTCTCTAAA
CGCGTTATCCGCCGGAGGGTCAGCAGTGATGGCGGTGGTGCTGTTGATTCTACCCAGCAGCAGTCTGCAGCTCCTGACATCAAAGATGTACAGAATGATTCTTCTAGTGT
TGGACACAGTTATGTGGCGTTATTTGTCCGGATGCTCGGCTTAGATCATGATCCTCTCGATAGAGAACAAGCAATAATAGCTTTATGGAAATATTCCCTTGGAGGGAAGA
AGCACATAGATGCCATCATGCAATTTCCTGGATGCATAAATCTAACAAGATTCAGTTGCAGAAAGTGGAGCAATCGAAGAGATAACTGGCTTGCTTGGTCAACCGTCATT
GACTCCCGAGCTGTGCAGGTGAAGGAGCAAAGCATATGCGTTTTGTGGAATTTGTCAGTAGATGAAAAGCTCAGAATGAAGATTGCCAACACTGATATTCTGCCATTGCT
TAGTAAGAATCTCGATGATGAGGACATGAAGGTGAAGGAAGCAGCTGGAGGAGTTTTAGCAAATCTGGCATTGAGTCAATGTAACCATGGACTTATTGTTGAATTAGGCT
TAATTCCCAAACTGGCGTATCTGTTAAAAGCTGAGGCCGACAGCTCAAAAACTGATGAGTATTATAGAATTCTTGTCATAGAGGAAGGACTGGTTCCAGTACCAATTGTT
GGTGCTGCTGCCTATAAATCCTTCAGACCAGGCCTGCATTCATGGCCTAGTTTGCCTGATGGCACAGACATTGAGCAATCTTCCAAAGGTCCTTCAAGATTTGGTGCCTC
TGAATTACTCCTTGGATTAAATGTCGATAACAATGCAAACATAGAGGAAGACAAGATAAATGCAATTGTTGGACGGACACAGCAACAATTTCTGGCTCGAATCGGTGCTA
TAGAATTTGAAGATATGAGGAACTCTCAATCCGAGTCATCTACCAGTAACCATCTTACACTCTTACCTTGGATAGATGGTGTGGCTCGACTAGTTTTGATCCTTGAACTT
GAAGATGATAATGCCATATTGAGAGCTGCAGAGTCAATTGCTGATGCATCTATTAACGAACACATGCGCATTTCGTTCAAGGAAGCCGGAGCAATCAAGCATTTAGTTAA
GCTTTTAGATAATATGAATAATGTAGTTAAATGGGCTACAATTCGAGCTCTAGAAAGACTGTCAATCAGCAATGTTGTTTGCCAGACACTTGAAAATGAAGGTGCACTTG
GTCCTTTGCTTAGTATTTTAAAGCTCTCAAATATCCCTGAAGATGCGATGGAGAAGACCCTCGATATACTTTCTCGGATCTTGGACCCCAGTAAAGAAATGAAATCAAAG
TTTTACAATGGACCAGCAAACGGGTCTCACGGGGGACAACATTCCGAAAGAAATTCTCAAGCTTCTACCAGCAAAGATGTGCTGGATGCTGTTGTTGTTTCTCGCCTCAT
TGAGATTTTGAATGCCTCATCCCCAAACTTGAAACGCAAAGCTGCTTCTATCCTTGAATTTATTTCTATTATGGACCCAAGCATGGAAATAATCAATTCCGTGGAGATTG
AATCTGGTTTGTTGGCTGTTTTCCAGCTAGGAGTTTCAATAGATTCTGATGCTGAAGATTGGCCGCCTGAAAGATATGCTCTTGAAGTTGAGGAAGCTGGTCTTTCAATA
TCGGCAGCCTCCCGTCTACTAACGAAACTTCTCGATTCTGAGAAGTTCCAACACAAAATAAACTCCGACCATTTCACTAAGTTGCTTCGTCGAATTCTGAAGTCGGACAT
TCCCATTCACCACAAAGATTGGGTTGCTGCTTGCCTAGTGAAAGTCAGCACTGTTTCAGCTCCAAACAGAGATTCCGGGGATCCAGTCAACATGGAGGTAACTCTGTACG
AAACTATACCGAGACTTATTCAGCAGATCAAAAGCTCCTTCTCCATGGAAGTTCAAGAATCCACTGTTGTGGAACTGAACGGAATAGTCTCAGAAGGAATAGTCGATGCT
ACCCGAGCTGTTGCTTCTAATGGTGGTATCTTTCCATTGGTAAAACTGATTGATGAAGGAAGTGAGAGGGCTGCAGAGGCGGCCTTAGCCATACTATATAATCTGAGCAT
GGACAGTGAAAATCATCCAGCAATTTTAGCTGCTGGAGCCGTGCCAGCTTTGAGGAGAATCGTTCTATCACAACGAGCACAGTGGCGGCAAGCTCTTCATTTGTTGAGGA
CATTGCCTACATGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGTATCCATGGTCGACGCCGCAAGACCCTTCTCAGATAAAAAATGCATACCTGAAGTTCCCATTTGCCAAGCCTGAAGATTACGACTACACACTTGAGATAATGAG
ATCTATACCTCCTTCCAAGAAAGCTACATATGGAGCCATACCTAGGGTAGTTTGCCGCCCCTTCTCTCCCCTCTCCCTCCTTCGGCCGCACGCGGACCCCAGCCCCCTCC
CGCGCGCTACTCTCTCTCTCCCCATCGGACGCACGCCGAACCCCCCAATAACCCCCTCCTCCTCACCGTCTCCCTCCTCTTACTGGCTCCTGCCGGCGACCACCCACCGG
CGACGCCACACCAAGGACGTGACACCACGGCCGGCGCACCCTCCGACGATCCGCGCGCGACCGAAACTACTCGACGGCGGAATATCTCTACGCGACGACCCTTCTCCGGC
GACGACCTGCGACCACGAAACTCCGGCGACGCATTCCGGCTACCACGACGCACGACGCACGACGGACCCACGAGCCACGGCTACTGTGCTACTCTTCCTCACACGATCAG
ATCCAGGTGACCGACTCCTACGGAACAGACTCAAAAATCAGCGCAGCCTTCGGCGACTAACTCGATCCTGTGACAAAACCCACTCGTCGGCGAGCCCGATCTGCAGCACC
AGCGACCGACTTGTTCAATTTTTAAAGATGCCCCGAAGGCCAACAAAGGGGCGCCGACTGCTGGGAACTCGACCTTTAACTTTTTCCTCATTTGCTCGAGCTCTTGTATG
GAATAGTCCTGAGGCTCAGAATGAATCATCTGACTCTCCTTCCAGTGATGAAGAGTTTGAAGAAGTTCAAGCGGATTTTGCCTTTTTTGATCCAAAACCTGACGACTTTC
ATGGGGTGAAGATATTGCTGCAAAACTATTTAGATGACATACAGTGGGATTTAAGTGGATTTTCTGATTTAATATTAGGACAAACCACAGTTGGTACTGTTGTAAAAGTG
GAGGGGGATGAAGACAATGGAGTATTCTCTTTTATCACAGCCCTTAACTTGGGAAGATATAAGGACAGTAGATGTTTCCTGGAGCTCAAGGAGTATCTCGTCAAAGTTTG
CCAGGAAAAGGATGTAAAGAATGATTTGAAGCTGCTTTTGGAGGAGCAAGCATCCGGTGTCGGTCTCCTAGTCTCTCAAAGAATAGTTAATCTTCCTCCACAACTTCTGC
CTCCACTTTATGATGCCCTTTTCGATGAAATCTCATGGGCTATTGAGGATGAGCCAACCAAAGAGCTCCAAGATTCCTTCCGCTTCAAAATGTATATTTTAGTCAGTAAA
ATTTACAAGCTCAAGAATACAAATCATAAGAAAAGACCGGCTACTGAAACAGATGAGTCAATTATCTATGTGAAGGCAGAAGATGAAATATTTCACAAGCTGTGCTCATG
GTCCTTCTGCTTTCCCTTGCACATCCAACCACAGACGACTAATGAGCTAAAAGGTTATCAGTTAATGGGAATGGTAATGGCTGTTGAAGCTAGCAAAACACTCTTAAGCA
CTCGCAAGACGGTTTTTGCATTTGGCGCTGTGGGAACTAAAGCCAGAGGCCGCTCGGTCTTTCTCTATCATCTACGTTATTGCAGTCGCTACCCAGAATCTCTCTCTAAA
CGCGTTATCCGCCGGAGGGTCAGCAGTGATGGCGGTGGTGCTGTTGATTCTACCCAGCAGCAGTCTGCAGCTCCTGACATCAAAGATGTACAGAATGATTCTTCTAGTGT
TGGACACAGTTATGTGGCGTTATTTGTCCGGATGCTCGGCTTAGATCATGATCCTCTCGATAGAGAACAAGCAATAATAGCTTTATGGAAATATTCCCTTGGAGGGAAGA
AGCACATAGATGCCATCATGCAATTTCCTGGATGCATAAATCTAACAAGATTCAGTTGCAGAAAGTGGAGCAATCGAAGAGATAACTGGCTTGCTTGGTCAACCGTCATT
GACTCCCGAGCTGTGCAGGTGAAGGAGCAAAGCATATGCGTTTTGTGGAATTTGTCAGTAGATGAAAAGCTCAGAATGAAGATTGCCAACACTGATATTCTGCCATTGCT
TAGTAAGAATCTCGATGATGAGGACATGAAGGTGAAGGAAGCAGCTGGAGGAGTTTTAGCAAATCTGGCATTGAGTCAATGTAACCATGGACTTATTGTTGAATTAGGCT
TAATTCCCAAACTGGCGTATCTGTTAAAAGCTGAGGCCGACAGCTCAAAAACTGATGAGTATTATAGAATTCTTGTCATAGAGGAAGGACTGGTTCCAGTACCAATTGTT
GGTGCTGCTGCCTATAAATCCTTCAGACCAGGCCTGCATTCATGGCCTAGTTTGCCTGATGGCACAGACATTGAGCAATCTTCCAAAGGTCCTTCAAGATTTGGTGCCTC
TGAATTACTCCTTGGATTAAATGTCGATAACAATGCAAACATAGAGGAAGACAAGATAAATGCAATTGTTGGACGGACACAGCAACAATTTCTGGCTCGAATCGGTGCTA
TAGAATTTGAAGATATGAGGAACTCTCAATCCGAGTCATCTACCAGTAACCATCTTACACTCTTACCTTGGATAGATGGTGTGGCTCGACTAGTTTTGATCCTTGAACTT
GAAGATGATAATGCCATATTGAGAGCTGCAGAGTCAATTGCTGATGCATCTATTAACGAACACATGCGCATTTCGTTCAAGGAAGCCGGAGCAATCAAGCATTTAGTTAA
GCTTTTAGATAATATGAATAATGTAGTTAAATGGGCTACAATTCGAGCTCTAGAAAGACTGTCAATCAGCAATGTTGTTTGCCAGACACTTGAAAATGAAGGTGCACTTG
GTCCTTTGCTTAGTATTTTAAAGCTCTCAAATATCCCTGAAGATGCGATGGAGAAGACCCTCGATATACTTTCTCGGATCTTGGACCCCAGTAAAGAAATGAAATCAAAG
TTTTACAATGGACCAGCAAACGGGTCTCACGGGGGACAACATTCCGAAAGAAATTCTCAAGCTTCTACCAGCAAAGATGTGCTGGATGCTGTTGTTGTTTCTCGCCTCAT
TGAGATTTTGAATGCCTCATCCCCAAACTTGAAACGCAAAGCTGCTTCTATCCTTGAATTTATTTCTATTATGGACCCAAGCATGGAAATAATCAATTCCGTGGAGATTG
AATCTGGTTTGTTGGCTGTTTTCCAGCTAGGAGTTTCAATAGATTCTGATGCTGAAGATTGGCCGCCTGAAAGATATGCTCTTGAAGTTGAGGAAGCTGGTCTTTCAATA
TCGGCAGCCTCCCGTCTACTAACGAAACTTCTCGATTCTGAGAAGTTCCAACACAAAATAAACTCCGACCATTTCACTAAGTTGCTTCGTCGAATTCTGAAGTCGGACAT
TCCCATTCACCACAAAGATTGGGTTGCTGCTTGCCTAGTGAAAGTCAGCACTGTTTCAGCTCCAAACAGAGATTCCGGGGATCCAGTCAACATGGAGGTAACTCTGTACG
AAACTATACCGAGACTTATTCAGCAGATCAAAAGCTCCTTCTCCATGGAAGTTCAAGAATCCACTGTTGTGGAACTGAACGGAATAGTCTCAGAAGGAATAGTCGATGCT
ACCCGAGCTGTTGCTTCTAATGGTGGTATCTTTCCATTGGTAAAACTGATTGATGAAGGAAGTGAGAGGGCTGCAGAGGCGGCCTTAGCCATACTATATAATCTGAGCAT
GGACAGTGAAAATCATCCAGCAATTTTAGCTGCTGGAGCCGTGCCAGCTTTGAGGAGAATCGTTCTATCACAACGAGCACAGTGGCGGCAAGCTCTTCATTTGTTGAGGA
CATTGCCTACATGA
Protein sequenceShow/hide protein sequence
MKYPWSTPQDPSQIKNAYLKFPFAKPEDYDYTLEIMRSIPPSKKATYGAIPRVVCRPFSPLSLLRPHADPSPLPRATLSLPIGRTPNPPITPSSSPSPSSYWLLPATTHR
RRHTKDVTPRPAHPPTIRARPKLLDGGISLRDDPSPATTCDHETPATHSGYHDARRTTDPRATATVLLFLTRSDPGDRLLRNRLKNQRSLRRLTRSCDKTHSSASPICST
SDRLVQFLKMPRRPTKGRRLLGTRPLTFSSFARALVWNSPEAQNESSDSPSSDEEFEEVQADFAFFDPKPDDFHGVKILLQNYLDDIQWDLSGFSDLILGQTTVGTVVKV
EGDEDNGVFSFITALNLGRYKDSRCFLELKEYLVKVCQEKDVKNDLKLLLEEQASGVGLLVSQRIVNLPPQLLPPLYDALFDEISWAIEDEPTKELQDSFRFKMYILVSK
IYKLKNTNHKKRPATETDESIIYVKAEDEIFHKLCSWSFCFPLHIQPQTTNELKGYQLMGMVMAVEASKTLLSTRKTVFAFGAVGTKARGRSVFLYHLRYCSRYPESLSK
RVIRRRVSSDGGGAVDSTQQQSAAPDIKDVQNDSSSVGHSYVALFVRMLGLDHDPLDREQAIIALWKYSLGGKKHIDAIMQFPGCINLTRFSCRKWSNRRDNWLAWSTVI
DSRAVQVKEQSICVLWNLSVDEKLRMKIANTDILPLLSKNLDDEDMKVKEAAGGVLANLALSQCNHGLIVELGLIPKLAYLLKAEADSSKTDEYYRILVIEEGLVPVPIV
GAAAYKSFRPGLHSWPSLPDGTDIEQSSKGPSRFGASELLLGLNVDNNANIEEDKINAIVGRTQQQFLARIGAIEFEDMRNSQSESSTSNHLTLLPWIDGVARLVLILEL
EDDNAILRAAESIADASINEHMRISFKEAGAIKHLVKLLDNMNNVVKWATIRALERLSISNVVCQTLENEGALGPLLSILKLSNIPEDAMEKTLDILSRILDPSKEMKSK
FYNGPANGSHGGQHSERNSQASTSKDVLDAVVVSRLIEILNASSPNLKRKAASILEFISIMDPSMEIINSVEIESGLLAVFQLGVSIDSDAEDWPPERYALEVEEAGLSI
SAASRLLTKLLDSEKFQHKINSDHFTKLLRRILKSDIPIHHKDWVAACLVKVSTVSAPNRDSGDPVNMEVTLYETIPRLIQQIKSSFSMEVQESTVVELNGIVSEGIVDA
TRAVASNGGIFPLVKLIDEGSERAAEAALAILYNLSMDSENHPAILAAGAVPALRRIVLSQRAQWRQALHLLRTLPT