; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0029003 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0029003
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionU-box domain-containing protein 4
Genome locationchr8:34098853..34108685
RNA-Seq ExpressionLag0029003
SyntenyLag0029003
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR000225 - Armadillo
IPR011989 - Armadillo-like helical
IPR016024 - Armadillo-type fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6579255.1 U-box domain-containing protein 12, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0086.47Show/hide
Query:  MKVFAVGAAGT-RARDLSVFLYRLSYCSRNPESLSRRVVRRRVSCDGGGAVDSTQQQSAAPETKDVQNDSSSVGHSYVALFVRMLGLDHDPLDREQAIIA
        +K+FAVGA G+ + RD S+ L+RL +   + E   +RVV RRVS DGGGA+DSTQQQSAA + +DV NDSSSVGHSYVALFVRMLGLDHDPLDREQAI+A
Subjt:  MKVFAVGAAGT-RARDLSVFLYRLSYCSRNPESLSRRVVRRRVSCDGGGAVDSTQQQSAAPETKDVQNDSSSVGHSYVALFVRMLGLDHDPLDREQAIIA

Query:  LWKYSLGGKKHIDAIMQFPGCINLTVNLLRSESISTCEAAAGLLRSISLVNLYRDSVAESGAIEEITALLSQPSLTPEVKEQSICVLWNLSVDEKLRIKI
        LWKYSLGGKKHIDAIMQFPGCINLTVNLLRSES STCEAAAGLLRSIS+VNL+RDSVAESGAIEEITALLS+PSLTPEVKEQSICVLWNLSVDE+LR KI
Subjt:  LWKYSLGGKKHIDAIMQFPGCINLTVNLLRSESISTCEAAAGLLRSISLVNLYRDSVAESGAIEEITALLSQPSLTPEVKEQSICVLWNLSVDEKLRIKI

Query:  ANTDILPLLSKNLDDEDMKVKEAAGGVLANLALSPCNHGIIVESGLISKLAYQLKAEAESSKIVRKEARNALLELAKDEYYRILVIEEGLVPVPIVGAAA
        ANTDILPLL KNLDDEDMKVKEAAGGV+ANL LSPCNHG+IVESGLI KLAYQLKAEA+SSK VRKEARNALLEL KDEYYRILVIEEGLVPVP++GAAA
Subjt:  ANTDILPLLSKNLDDEDMKVKEAAGGVLANLALSPCNHGIIVESGLISKLAYQLKAEAESSKIVRKEARNALLELAKDEYYRILVIEEGLVPVPIVGAAA

Query:  YKSFRPGLHSWPSLPDGTEIEQSSKEPSRFGASELLLGLNVDNSANIEESKINAIVGRTQQQFLARIGAIDFEDLRDSQSESSTSNHLTLLPWIDGVARL
        YKSF+PGLHSWPSLPDGTEIE+SSK+PSRFGASELLLGLNVDN+ANIEE KINAI+GR+QQQFLARIGAI+ EDL+D+QSESSTSNHLTLLPW DGVARL
Subjt:  YKSFRPGLHSWPSLPDGTEIEQSSKEPSRFGASELLLGLNVDNSANIEESKINAIVGRTQQQFLARIGAIDFEDLRDSQSESSTSNHLTLLPWIDGVARL

Query:  VLILELEDDNAKVRAAESIADASINEHMRISFKEAGAIKHLVKLLDNVNNSVKWATIQALERLSISNVVCQTIENEGALGPLLNILKLSSIPENVMEKTL
        VL+LELEDDNAKVR AE IADASINEHMR+SFKEAGAIKHLVKLLD++NNSVKWA++QALERLSISNVVCQTIENEGAL PLL+ILKLSSIPENVMEKTL
Subjt:  VLILELEDDNAKVRAAESIADASINEHMRISFKEAGAIKHLVKLLDNVNNSVKWATIQALERLSISNVVCQTIENEGALGPLLNILKLSSIPENVMEKTL

Query:  DILSRSLDPSKEMKSKFYNGPVNGSHGGQHSERNSEASTRKDVLDAIVVSRLVEILNTSSPNLKRKAASILEFISIMDPGMEIIDSMEIESGLSAVFQLG
        DILSR LDPSKEMKSKFYNGP+NGS GGQHSERNSEASTRKDVLDA+VVS LVEIL TSSPNLKRKAASILEFISIMDP M+IID MEIESGLSAVFQLG
Subjt:  DILSRSLDPSKEMKSKFYNGPVNGSHGGQHSERNSEASTRKDVLDAIVVSRLVEILNTSSPNLKRKAASILEFISIMDPGMEIIDSMEIESGLSAVFQLG

Query:  VSIDSDAEDWQPERYALEVEEASLAISAASRLLTKLLDSEKFCHKINSTHFTKLLRRTLKSDIPILHKDWIAACLVKVSSVSAPNVDSGDPINMEVTLYE
        VSID+DAEDW PER+ALEVEEA LAISAASRLLTKL+DSEKFC+KIN+  FT+ LRR LK DIP+ HKDWIAACL+KVSSV+AP++DSGDPI+MEV LYE
Subjt:  VSIDSDAEDWQPERYALEVEEASLAISAASRLLTKLLDSEKFCHKINSTHFTKLLRRTLKSDIPILHKDWIAACLVKVSSVSAPNVDSGDPINMEVTLYE

Query:  TIPRLIEQIKSSFSMEVQESAVVELNRIISEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMDTENHPAILAAGAVPVLRRIVLSQRVQ
        TIPRLI+Q+KSS S EVQES VVELNRI+SEG+VDATRAVASKGGIFPLVKLIDEGSERA EAALA+LYNLSMDTENHPAILAAGAVP LRRIVLSQRVQ
Subjt:  TIPRLIEQIKSSFSMEVQESAVVELNRIISEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMDTENHPAILAAGAVPVLRRIVLSQRVQ

Query:  WRQALYLLRTLPT
        W+QALYLLRTLPT
Subjt:  WRQALYLLRTLPT

XP_022146414.1 uncharacterized protein LOC111015638 [Momordica charantia]0.0e+0089.66Show/hide
Query:  MKVFAVGAAGTRARDLSVFLYRLSYCSRNPESLSRRVVRRRVSCDGGGAVDSTQQQSAAPETKDVQNDSSSVGHSYVALFVRMLGLDHDPLDREQAIIAL
        MKVF   AAGT+ R+LSVFLY L Y  R PESLS R VRR VS DGGGAVDST QQSAAP+ KDVQNDSSSVGHSYVALFVRMLGLDHDPLDREQA+IAL
Subjt:  MKVFAVGAAGTRARDLSVFLYRLSYCSRNPESLSRRVVRRRVSCDGGGAVDSTQQQSAAPETKDVQNDSSSVGHSYVALFVRMLGLDHDPLDREQAIIAL

Query:  WKYSLGGKKHIDAIMQFPGCINLTVNLLRSESISTCEAAAGLLRSISLVNLYRDSVAESGAIEEITALLSQPSLTPEVKEQSICVLWNLSVDEKLRIKIA
        WKYSLGGKKHIDAIM+FPGCINLTVNLL+SESI+TCEAAAGLLRSISLVNLYRDSVAESGAIEEIT LLS+PSL PEVKEQSICVLWNLSVDEKLR+KIA
Subjt:  WKYSLGGKKHIDAIMQFPGCINLTVNLLRSESISTCEAAAGLLRSISLVNLYRDSVAESGAIEEITALLSQPSLTPEVKEQSICVLWNLSVDEKLRIKIA

Query:  NTDILPLLSKNLDDEDMKVKEAAGGVLANLALSPCNHGIIVESGLISKLAYQLKAEAESSKIVRKEARNALLELAKDEYYRILVIEEGLVPVPIVGAAAY
        +TDIL LLSKNLDDEDMKVKEAAGGVLANLALSPCNHG+IVESGLI KLAYQLKAEA+SSKIVRKEARN LLELAKDEYYRILVIEEGLVPVPIVGAAAY
Subjt:  NTDILPLLSKNLDDEDMKVKEAAGGVLANLALSPCNHGIIVESGLISKLAYQLKAEAESSKIVRKEARNALLELAKDEYYRILVIEEGLVPVPIVGAAAY

Query:  KSFRPGLHSWPSLPDGTEIEQSSKEPSRFGASELLLGLNVDNSANIEESKINAIVGRTQQQFLARIGAIDFEDLRDSQSESSTSNHLTLLPWIDGVARLV
        KSFRPGLHSWPSLPDGTEIEQSS  PSRFGASELLLGLNVDN ANIEE KINAIVGRTQQQFLARIGAI+FEDL+DSQ ESST NHLTLLPWIDGVARLV
Subjt:  KSFRPGLHSWPSLPDGTEIEQSSKEPSRFGASELLLGLNVDNSANIEESKINAIVGRTQQQFLARIGAIDFEDLRDSQSESSTSNHLTLLPWIDGVARLV

Query:  LILELEDDNAKVRAAESIADASINEHMRISFKEAGAIKHLVKLLDNVNNSVKWATIQALERLSISNVVCQTIENEGALGPLLNILKLSSIPENVMEKTLD
        LILELEDD A +RAAESIADASINEHMRISFKEAGAIKHLVK+LDN+N++VKWATI+ALERLSISNVVCQTIENEGALGPLL+IL+LS+IPENVMEKTLD
Subjt:  LILELEDDNAKVRAAESIADASINEHMRISFKEAGAIKHLVKLLDNVNNSVKWATIQALERLSISNVVCQTIENEGALGPLLNILKLSSIPENVMEKTLD

Query:  ILSRSLDPSKEMKSKFYNGPVNGSHGGQHSERNSEASTRKDVLDAIVVSRLVEILNTSSPNLKRKAASILEFISIMDPGMEIIDSMEIESGLSAVFQLGV
        IL R LDPSKEMKSKFYNGPVNGSHGGQHSE  SEASTRKDVLDA+ VS LVEILNTSSPNLKRKAASILEF+SIMDP MEIIDSM IESGL AVFQLGV
Subjt:  ILSRSLDPSKEMKSKFYNGPVNGSHGGQHSERNSEASTRKDVLDAIVVSRLVEILNTSSPNLKRKAASILEFISIMDPGMEIIDSMEIESGLSAVFQLGV

Query:  SIDSDAEDWQPERYALEVEEASLAISAASRLLTKLLDSEKFCHKINSTHFTKLLRRTLKSDIPILHKDWIAACLVKVSSVSAPNVDSGDPINMEVTLYET
        SIDSDAEDWQPERYALEVEEA LAISAASRLLTKLLDSEKF +KINS HFTKLLR+ LKSDIPI HKDWIAACLVKVSS+SA N+DSG+PINMEVTLYET
Subjt:  SIDSDAEDWQPERYALEVEEASLAISAASRLLTKLLDSEKFCHKINSTHFTKLLRRTLKSDIPILHKDWIAACLVKVSSVSAPNVDSGDPINMEVTLYET

Query:  IPRLIEQIKSSFSMEVQESAVVELNRIISEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMDTENHPAILAAGAVPVLRRIVLSQRVQW
        IPRLI+QIK SFSME QES+VVELNRI+SEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMD+ENHPAILAAGAVP LRRIVLSQR +W
Subjt:  IPRLIEQIKSSFSMEVQESAVVELNRIISEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMDTENHPAILAAGAVPVLRRIVLSQRVQW

Query:  RQALYLLRTLPT
        R+AL+LLRTLPT
Subjt:  RQALYLLRTLPT

XP_022939600.1 uncharacterized protein LOC111445446 [Cucurbita moschata]0.0e+0086.84Show/hide
Query:  MKVFAVGAAGT-RARDLSVFLYRLSYCSRNPESLSRRVVRRRVSCDGGGAVDSTQQQSAAPETKDVQNDSSSVGHSYVALFVRMLGLDHDPLDREQAIIA
        MK+FAVGA GT + RD S+ L+RL +   + E   +RVV RRVS DGGGA+DSTQQQSAA + +DV NDSSSVGHSYVALFVRMLGLDHDPLDREQAI+A
Subjt:  MKVFAVGAAGT-RARDLSVFLYRLSYCSRNPESLSRRVVRRRVSCDGGGAVDSTQQQSAAPETKDVQNDSSSVGHSYVALFVRMLGLDHDPLDREQAIIA

Query:  LWKYSLGGKKHIDAIMQFPGCINLTVNLLRSESISTCEAAAGLLRSISLVNLYRDSVAESGAIEEITALLSQPSLTPEVKEQSICVLWNLSVDEKLRIKI
        LWKYSLGGKKHIDAIMQFPGCINLTVNLLRSES STCEAAAGLLRSIS+VNL+RDSVAESGAIEEITALLS+PSLTPEVKEQSICVLWNLSVDEKLR KI
Subjt:  LWKYSLGGKKHIDAIMQFPGCINLTVNLLRSESISTCEAAAGLLRSISLVNLYRDSVAESGAIEEITALLSQPSLTPEVKEQSICVLWNLSVDEKLRIKI

Query:  ANTDILPLLSKNLDDEDMKVKEAAGGVLANLALSPCNHGIIVESGLISKLAYQLKAEAESSKIVRKEARNALLELAKDEYYRILVIEEGLVPVPIVGAAA
        ANTDILPLL KNLDDEDMKVKEAAGGV+ANL LSPCNHG+IVESGLI KLAYQLKAEA+SSK +RKEARNALLEL KDEYYRILVIEEGLVPVP++GAAA
Subjt:  ANTDILPLLSKNLDDEDMKVKEAAGGVLANLALSPCNHGIIVESGLISKLAYQLKAEAESSKIVRKEARNALLELAKDEYYRILVIEEGLVPVPIVGAAA

Query:  YKSFRPGLHSWPSLPDGTEIEQSSKEPSRFGASELLLGLNVDNSANIEESKINAIVGRTQQQFLARIGAIDFEDLRDSQSESSTSNHLTLLPWIDGVARL
        YKSF+PGLHSWPSLPDGTEIE+SSK+PSRFGASELLLGLNVDN+ NI+E KINAIVGR+QQQFLARIGAI+ EDL+D+QSESSTSNHLTLLPW DGVARL
Subjt:  YKSFRPGLHSWPSLPDGTEIEQSSKEPSRFGASELLLGLNVDNSANIEESKINAIVGRTQQQFLARIGAIDFEDLRDSQSESSTSNHLTLLPWIDGVARL

Query:  VLILELEDDNAKVRAAESIADASINEHMRISFKEAGAIKHLVKLLDNVNNSVKWATIQALERLSISNVVCQTIENEGALGPLLNILKLSSIPENVMEKTL
        VL+LELEDDNAKVR AE IADASINEHMR+SFKEAGAIKHLVKLLD++NNSVKWA+IQALERLSISNVVCQTIENEGAL PLL+ILKLSSIPENVMEKTL
Subjt:  VLILELEDDNAKVRAAESIADASINEHMRISFKEAGAIKHLVKLLDNVNNSVKWATIQALERLSISNVVCQTIENEGALGPLLNILKLSSIPENVMEKTL

Query:  DILSRSLDPSKEMKSKFYNGPVNGSHGGQHSERNSEASTRKDVLDAIVVSRLVEILNTSSPNLKRKAASILEFISIMDPGMEIIDSMEIESGLSAVFQLG
        DILSR LDPSKEMKSKFY+GP+NGS GGQHSERNSEAST KDVLDA+VVS LVEIL TSSPNLKRKAASILEFISIMDP M+IID MEIESGLSAVFQLG
Subjt:  DILSRSLDPSKEMKSKFYNGPVNGSHGGQHSERNSEASTRKDVLDAIVVSRLVEILNTSSPNLKRKAASILEFISIMDPGMEIIDSMEIESGLSAVFQLG

Query:  VSIDSDAEDWQPERYALEVEEASLAISAASRLLTKLLDSEKFCHKINSTHFTKLLRRTLKSDIPILHKDWIAACLVKVSSVSAPNVDSGDPINMEVTLYE
        VSID+DAEDW PER+ALEVEEA LAISAASRLLTKL+DSEKFC+KIN+  FT+ LRR LK DIPI HKDWIAACL+KVSSV+AP++DSGDPI+MEV LYE
Subjt:  VSIDSDAEDWQPERYALEVEEASLAISAASRLLTKLLDSEKFCHKINSTHFTKLLRRTLKSDIPILHKDWIAACLVKVSSVSAPNVDSGDPINMEVTLYE

Query:  TIPRLIEQIKSSFSMEVQESAVVELNRIISEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMDTENHPAILAAGAVPVLRRIVLSQRVQ
        TIPRLI+Q+KSS SMEVQESAVVELNRI+SEG+VDATRAVASKGGIFPLVKLIDEGSERA EAALA+LYNLSMDTENHPAILAAGAVP LRRIVLSQRVQ
Subjt:  TIPRLIEQIKSSFSMEVQESAVVELNRIISEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMDTENHPAILAAGAVPVLRRIVLSQRVQ

Query:  WRQALYLLRTLPT
        W+QALYLLRTLPT
Subjt:  WRQALYLLRTLPT

XP_022973564.1 uncharacterized protein LOC111472102 [Cucurbita maxima]0.0e+0086.35Show/hide
Query:  MKVFAVGAAGT-RARDLSVFLYRLSYCSRNPESLSRRVVRRRVSCDGGGAVDSTQQQSAAPETKDVQNDSSSVGHSYVALFVRMLGLDHDPLDREQAIIA
        MK+FAVGA GT + RD S+ L+RL +   + E   +RVVR RVS DGGGA+DSTQQQSA  + +DV NDSSSVGHSYVALF+RMLGLDHDPLDREQAI+A
Subjt:  MKVFAVGAAGT-RARDLSVFLYRLSYCSRNPESLSRRVVRRRVSCDGGGAVDSTQQQSAAPETKDVQNDSSSVGHSYVALFVRMLGLDHDPLDREQAIIA

Query:  LWKYSLGGKKHIDAIMQFPGCINLTVNLLRSESISTCEAAAGLLRSISLVNLYRDSVAESGAIEEITALLSQPSLTPEVKEQSICVLWNLSVDEKLRIKI
        LWKYSLGGKKHIDAIMQFPGC+NLTVNLLRSES STCEAAAGLLRSIS+VNL+RDSVAESGAIEEITALLS+PSLTPEVKEQSICVLWNLSVDEKLR KI
Subjt:  LWKYSLGGKKHIDAIMQFPGCINLTVNLLRSESISTCEAAAGLLRSISLVNLYRDSVAESGAIEEITALLSQPSLTPEVKEQSICVLWNLSVDEKLRIKI

Query:  ANTDILPLLSKNLDDEDMKVKEAAGGVLANLALSPCNHGIIVESGLISKLAYQLKAEAESSKIVRKEARNALLELAKDEYYRILVIEEGLVPVPIVGAAA
        ANTDILPLL KNLDDEDMKVKEAAGGV+ANL LSPCNHG+IVESGLI KLAYQLKAEA+SSK VRKEARNALLEL KDEYYRILVIEEGLVPVP++GAAA
Subjt:  ANTDILPLLSKNLDDEDMKVKEAAGGVLANLALSPCNHGIIVESGLISKLAYQLKAEAESSKIVRKEARNALLELAKDEYYRILVIEEGLVPVPIVGAAA

Query:  YKSFRPGLHSWPSLPDGTEIEQSSKEPSRFGASELLLGLNVDNSANIEESKINAIVGRTQQQFLARIGAIDFEDLRDSQSESSTSNHLTLLPWIDGVARL
        YKSF+PGLHSWPSLPDGTEIE+SSK+PSRFGASELLLGLNVDN+ANIEE KINAI+GR+QQ FLARIGAI+ EDL+D+QSESSTSNHLTLLPW DGVARL
Subjt:  YKSFRPGLHSWPSLPDGTEIEQSSKEPSRFGASELLLGLNVDNSANIEESKINAIVGRTQQQFLARIGAIDFEDLRDSQSESSTSNHLTLLPWIDGVARL

Query:  VLILELEDDNAKVRAAESIADASINEHMRISFKEAGAIKHLVKLLDNVNNSVKWATIQALERLSISNVVCQTIENEGALGPLLNILKLSSIPENVMEKTL
        VL+LELEDDNAKVR AE IADASINEHMR+SFKEAGAIKHLVKLLD++NNSVKWA+IQALERLSISNVVCQTIENEGAL PLL+ILKLSSIPENVMEKTL
Subjt:  VLILELEDDNAKVRAAESIADASINEHMRISFKEAGAIKHLVKLLDNVNNSVKWATIQALERLSISNVVCQTIENEGALGPLLNILKLSSIPENVMEKTL

Query:  DILSRSLDPSKEMKSKFYNGPVNGSHGGQHSERNSEASTRKDVLDAIVVSRLVEILNTSSPNLKRKAASILEFISIMDPGMEIIDSMEIESGLSAVFQLG
        DILSR LDPSKEMKSKFY+GP+NGS GGQHSERNSEASTRKDVLDA+VVS LVEIL TSSPNLKRKAASILEFISIMDP M+IID MEIESGLSAVFQLG
Subjt:  DILSRSLDPSKEMKSKFYNGPVNGSHGGQHSERNSEASTRKDVLDAIVVSRLVEILNTSSPNLKRKAASILEFISIMDPGMEIIDSMEIESGLSAVFQLG

Query:  VSIDSDAEDWQPERYALEVEEASLAISAASRLLTKLLDSEKFCHKINSTHFTKLLRRTLKSDIPILHKDWIAACLVKVSSVSAPNVDSGDPINMEVTLYE
        VSID+DAEDW PER+ALEVEEA LAISAASRLLTKL+DSEKFC+KIN+T FT+ LR+ LK DIP+ HKDWIAACL+KVSSV+A ++DSGDPI+MEV LYE
Subjt:  VSIDSDAEDWQPERYALEVEEASLAISAASRLLTKLLDSEKFCHKINSTHFTKLLRRTLKSDIPILHKDWIAACLVKVSSVSAPNVDSGDPINMEVTLYE

Query:  TIPRLIEQIKSSFSMEVQESAVVELNRIISEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMDTENHPAILAAGAVPVLRRIVLSQRVQ
        TIPRLI+++KSS SMEVQESAVVELNRI+SEG+VDATRAVASKGGIFPLVKLIDEGSERA EAALAILYNLSMDTENHPAILAAGAVP LRRIVLSQR Q
Subjt:  TIPRLIEQIKSSFSMEVQESAVVELNRIISEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMDTENHPAILAAGAVPVLRRIVLSQRVQ

Query:  WRQALYLLRTLPT
        W+QALYLLRTLPT
Subjt:  WRQALYLLRTLPT

XP_038874402.1 uncharacterized protein LOC120067080 [Benincasa hispida]0.0e+0090.91Show/hide
Query:  MKVF-AVGAAG-TRARDLSVFLYRLSYCSRNPESLSRRVVRRRVSCDGGGAVDSTQQQSAAPETKDVQNDSSSVGHSYVALFVRMLGLDHDPLDREQAII
        MK+F  VGA G  +  DLS FLY L + S   +SLS+RVV RRVS DGGGAVDSTQQQSAAP   DVQNDSSSVGHSYVALFVRMLGLDHDPLDREQAII
Subjt:  MKVF-AVGAAG-TRARDLSVFLYRLSYCSRNPESLSRRVVRRRVSCDGGGAVDSTQQQSAAPETKDVQNDSSSVGHSYVALFVRMLGLDHDPLDREQAII

Query:  ALWKYSLGGKKHIDAIMQFPGCINLTVNLLRSESISTCEAAAGLLRSISLVNLYRDSVAESGAIEEITALLSQPSLTPEVKEQSICVLWNLSVDEKLRIK
        ALWKYSLGGKKHIDAIMQFPGCINLTVNLLRSES+STCEAAAGLLRSISLVNLYRDSVAESGAIEEIT LLSQPSLTPEVKEQSICVLWNLSVDEKLR K
Subjt:  ALWKYSLGGKKHIDAIMQFPGCINLTVNLLRSESISTCEAAAGLLRSISLVNLYRDSVAESGAIEEITALLSQPSLTPEVKEQSICVLWNLSVDEKLRIK

Query:  IANTDILPLLSKNLDDEDMKVKEAAGGVLANLALSPCNHGIIVESGLISKLAYQLKAEAESSKIVRKEARNALLELAKDEYYRILVIEEGLVPVPIVGAA
        IAN DILPLLSKNLDDEDMKVKEAAGGVLANLALSPCN G+IVE+GLI KLAYQLKAEA+SSKI+RKEARNALLEL+KDEYYRILVIEEGLVPVPI+GAA
Subjt:  IANTDILPLLSKNLDDEDMKVKEAAGGVLANLALSPCNHGIIVESGLISKLAYQLKAEAESSKIVRKEARNALLELAKDEYYRILVIEEGLVPVPIVGAA

Query:  AYKSFRPGLHSWPSLPDGTEIEQSSKEPSRFGASELLLGLNVDNSANIEESKINAIVGRTQQQFLARIGAIDFEDLRDSQSESSTSNHLTLLPWIDGVAR
        AYKSFRPGLHSWPSLPDGTEIEQSSKEPSRFGASELLLGLNVDN+A IEE KINAIVGRTQQQFLARIGAI+FEDL+DSQSESSTSNHLTLLPWIDGVAR
Subjt:  AYKSFRPGLHSWPSLPDGTEIEQSSKEPSRFGASELLLGLNVDNSANIEESKINAIVGRTQQQFLARIGAIDFEDLRDSQSESSTSNHLTLLPWIDGVAR

Query:  LVLILELEDDNAKVRAAESIADASINEHMRISFKEAGAIKHLVKLLDNVNNSVKWATIQALERLSISNVVCQTIENEGALGPLLNILKLSSIPENVMEKT
        LVLILELEDDNA VRAAESIADASINEHMRISFKEAGAIKHLVK LD +NNSVKWA +QALERLSISNVVCQTIEN+GALGPLL+ILKLSSI ENVMEKT
Subjt:  LVLILELEDDNAKVRAAESIADASINEHMRISFKEAGAIKHLVKLLDNVNNSVKWATIQALERLSISNVVCQTIENEGALGPLLNILKLSSIPENVMEKT

Query:  LDILSRSLDPSKEMKSKFYNGPVNGSHGGQHSERNSEASTRKDVLDAIVVSRLVEILNTSSPNLKRKAASILEFISIMDPGMEIIDSMEIESGLSAVFQL
        LDILSR LDPSKEMKSKFY+GPVNGS GGQHSERN EASTRKDVLD  VVSRLVEI NTSSPNLKRKAASILEF+SIMDP MEIIDSME+ESGLSAVFQL
Subjt:  LDILSRSLDPSKEMKSKFYNGPVNGSHGGQHSERNSEASTRKDVLDAIVVSRLVEILNTSSPNLKRKAASILEFISIMDPGMEIIDSMEIESGLSAVFQL

Query:  GVSIDSDAEDWQPERYALEVEEASLAISAASRLLTKLLDSEKFCHKINSTHFTKLLRRTLKSDIPILHKDWIAACLVKVSSVSAPNVDSGDPINMEVTLY
        GVSIDSDAEDWQPERYALEVEEA LAISAASRLLTKLLDSEKFC+KINSTHFTKLLRR LKSDIPI HKDWIAACL KVSSVSA N+DSGDPINMEVTLY
Subjt:  GVSIDSDAEDWQPERYALEVEEASLAISAASRLLTKLLDSEKFCHKINSTHFTKLLRRTLKSDIPILHKDWIAACLVKVSSVSAPNVDSGDPINMEVTLY

Query:  ETIPRLIEQIKSSFSMEVQESAVVELNRIISEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMDTENHPAILAAGAVPVLRRIVLSQRV
        ETIPRLI+QIKSSFSMEVQESAVVELNRI+S GIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMD ENHPAI+AAGAVP LR+IVLSQRV
Subjt:  ETIPRLIEQIKSSFSMEVQESAVVELNRIISEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMDTENHPAILAAGAVPVLRRIVLSQRV

Query:  QWRQALYLLRTLPT
        QWRQALYLLRTLPT
Subjt:  QWRQALYLLRTLPT

TrEMBL top hitse value%identityAlignment
A0A0A0KPE7 Uncharacterized protein0.0e+0086.81Show/hide
Query:  SVFLYRLSYCSRNPESLSRRVVRRRVSCDGGGAVDSTQQQSAAPETKDVQNDSSSVGHSYVALFVRMLGLDHDPLDREQAIIALWKYSLGGKKHIDAIMQ
        S+FL RL + S   +SLS+R+V RRVS DGGG  DS+Q QSA P+ KDVQNDSSSVGHSYVALFVRMLGL +DPLDREQAIIALWKYSLGGKKHIDAIMQ
Subjt:  SVFLYRLSYCSRNPESLSRRVVRRRVSCDGGGAVDSTQQQSAAPETKDVQNDSSSVGHSYVALFVRMLGLDHDPLDREQAIIALWKYSLGGKKHIDAIMQ

Query:  FPGCINLTVNLLRSESISTCEAAAGLLRSISLVNLYRDSVAESGAIEEITALLSQPSLTPEVKEQSICVLWNLSVDEKLRIKIANTDILPLLSKNLDDED
        FPGCINL VNLLRSESI TCEAAAGLLRSIS VNLYR+SVAESGAIEEIT LL QPSLTPEVKEQSICVLWNLSVDEKLRIKIANTDILPLLSKNLDDE+
Subjt:  FPGCINLTVNLLRSESISTCEAAAGLLRSISLVNLYRDSVAESGAIEEITALLSQPSLTPEVKEQSICVLWNLSVDEKLRIKIANTDILPLLSKNLDDED

Query:  MKVKEAAGGVLANLALSPCNHGIIVESGLISKLAYQLKAEAESSKIVRKEARNALLELAKDEYYRILVIEEGLVPVPIVGAAAYKSFRPGLHSWPSLPDG
        MKVKEAAGGVLANLALSPCNHG+IVESGLI+KLAYQLKAEA+SSKI+RKEARNALLEL+KD YYRILVIEEGLVPVPI+GAAAYKSFRPGLHSWP LPDG
Subjt:  MKVKEAAGGVLANLALSPCNHGIIVESGLISKLAYQLKAEAESSKIVRKEARNALLELAKDEYYRILVIEEGLVPVPIVGAAAYKSFRPGLHSWPSLPDG

Query:  TEIEQSSKEPSRFGASELLLGLNVDNSANIEESKINAIVGRTQQQFLARIGAIDFEDLRDSQSESSTSNHLTLLPWIDGVARLVLILELEDDNAKVRAAE
         EIEQS+K PSR+GAS+LLLGLNVD +ANIEE KINAIVGRTQQQFLARIGAI+ EDL+DSQSESS+ NHLTLLPWIDGVARLVLILELEDDNA  RAA 
Subjt:  TEIEQSSKEPSRFGASELLLGLNVDNSANIEESKINAIVGRTQQQFLARIGAIDFEDLRDSQSESSTSNHLTLLPWIDGVARLVLILELEDDNAKVRAAE

Query:  SIADASINEHMRISFKEAGAIKHLVKLLDNVNNSVKWATIQALERLSISNVVCQTIENEGALGPLLNILKLSSIPENVMEKTLDILSRSLDPSKEMKSKF
        SIADASINEHMRISFKEAGAIK+LVK LD  N+SVKWA +QALERLSISNVVCQ IENEGALGPLL+ILK S IPENVMEKTL+ILSR LDPSKEMKSKF
Subjt:  SIADASINEHMRISFKEAGAIKHLVKLLDNVNNSVKWATIQALERLSISNVVCQTIENEGALGPLLNILKLSSIPENVMEKTLDILSRSLDPSKEMKSKF

Query:  YNGPVNGSHGGQHSERNSEASTRKDVLDAIVVSRLVEILNTSSPNLKRKAASILEFISIMDPGMEIIDSMEIESGLSAVFQLGVSIDSDAEDWQPERYAL
        Y+GPVNGS GGQHSE N EAS RKDVLDA VVSR VEILNTSSPNLK+KAASILEF+SIMDP ME+ID +EIE GLSAVFQLGVSIDSD E WQPERYAL
Subjt:  YNGPVNGSHGGQHSERNSEASTRKDVLDAIVVSRLVEILNTSSPNLKRKAASILEFISIMDPGMEIIDSMEIESGLSAVFQLGVSIDSDAEDWQPERYAL

Query:  EVEEASLAISAASRLLTKLLDSEKFCHKINSTHFTKLLRRTLKSDIPILHKDWIAACLVKVSSVSAPNVDSGDPINMEVTLYETIPRLIEQIKSSFSMEV
        EVEEA LAISAASRLLTKLLDSEKF +KINSTHFTKLLRR LKSDIPI HKDWIAACL+K+SSV   N D GDPINMEVTLYETIPRLIEQ++SSFS+EV
Subjt:  EVEEASLAISAASRLLTKLLDSEKFCHKINSTHFTKLLRRTLKSDIPILHKDWIAACLVKVSSVSAPNVDSGDPINMEVTLYETIPRLIEQIKSSFSMEV

Query:  QESAVVELNRIISEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMDTENHPAILAAGAVPVLRRIVLSQRVQWRQALYLLRTLPT
        QESAVVELNRI+SEGIV+ATRAVASKGGIFPLVKLIDEGSERA EAALAILYNLSMD+ENHPAI+AAGAVP LRRI LSQRVQW+QALYLLRTLPT
Subjt:  QESAVVELNRIISEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMDTENHPAILAAGAVPVLRRIVLSQRVQWRQALYLLRTLPT

A0A1S3CU26 uncharacterized protein LOC1035045530.0e+0087.41Show/hide
Query:  VGAAGTRARDL--SVFLYRLSYCSRNPESLSRRVVRRRVSCD-GGGAVDSTQQQSAAPETKDVQNDSSSVGHSYVALFVRMLGLDHDPLDREQAIIALWK
        VGA G   +    S+FLYRL + S   +SLS+R++  RVS D GGGAVDS+Q QSA P  KDVQNDSSS+G SYVALFVRML LD+DPLDREQAIIALWK
Subjt:  VGAAGTRARDL--SVFLYRLSYCSRNPESLSRRVVRRRVSCD-GGGAVDSTQQQSAAPETKDVQNDSSSVGHSYVALFVRMLGLDHDPLDREQAIIALWK

Query:  YSLGGKKHIDAIMQFPGCINLTVNLLRSESISTCEAAAGLLRSISLVNLYRDSVAESGAIEEITALLSQPSLTPEVKEQSICVLWNLSVDEKLRIKIANT
        YSLGGKKHIDAIMQFPGCINL VNLLRSES+ST EAAAGLLRSISLVNLYR+SVAESGAIEEIT LL QPSLTPEVKEQSICVLWNLSVDEKLRIKIANT
Subjt:  YSLGGKKHIDAIMQFPGCINLTVNLLRSESISTCEAAAGLLRSISLVNLYRDSVAESGAIEEITALLSQPSLTPEVKEQSICVLWNLSVDEKLRIKIANT

Query:  DILPLLSKNLDDEDMKVKEAAGGVLANLALSPCNHGIIVESGLISKLAYQLKAEAESSKIVRKEARNALLELAKDEYYRILVIEEGLVPVPIVGAAAYKS
        DILPLLSKNLDDEDMKVKEAAGGVLANLALSPCNHG+IVESGLI KLAYQLKAEA+SSKI+RKEARNALLEL KDEYYRILVIEEGLVPVPI+GAAAYKS
Subjt:  DILPLLSKNLDDEDMKVKEAAGGVLANLALSPCNHGIIVESGLISKLAYQLKAEAESSKIVRKEARNALLELAKDEYYRILVIEEGLVPVPIVGAAAYKS

Query:  FRPGLHSWPSLPDGTEIEQSSKEPSRFGASELLLGLNVDNSANIEESKINAIVGRTQQQFLARIGAIDFEDLRDSQSESSTSNHLTLLPWIDGVARLVLI
        FRPGLHSWPSLPDGTEIEQSS EPSRFGASELLLGLNVDN+ANIEE KINAIVGRTQQQFLARIGAI+ E+++DSQSESS+SNHLTLLPWIDGVARLVLI
Subjt:  FRPGLHSWPSLPDGTEIEQSSKEPSRFGASELLLGLNVDNSANIEESKINAIVGRTQQQFLARIGAIDFEDLRDSQSESSTSNHLTLLPWIDGVARLVLI

Query:  LELEDDNAKVRAAESIADASINEHMRISFKEAGAIKHLVKLLDNVNNSVKWATIQALERLSISNVVCQTIENEGALGPLLNILKLSSIPENVMEKTLDIL
        LELEDDNA VRAAESI DASINEHMRISFKEAGAIKHLV  LD +N+SVKWA +QALERLSISNVVCQ IENEGALGPLL+ILKLSSIPENVMEKTL+IL
Subjt:  LELEDDNAKVRAAESIADASINEHMRISFKEAGAIKHLVKLLDNVNNSVKWATIQALERLSISNVVCQTIENEGALGPLLNILKLSSIPENVMEKTLDIL

Query:  SRSLDPSKEMKSKFYNGPVNGSHGGQHSERNSEASTRKDVLDAIVVSRLVEILNTSSPNLKRKAASILEFISIMDPGMEIIDSMEIESGLSAVFQLGVSI
        SR LDPSKEMKSKFY+GPVNGS G QHSE N EAS RKD LDA VVSRLVEILNTSSPNLKRKAASILEF+SIMDP ME+ID +EIE GLSAVFQLGVSI
Subjt:  SRSLDPSKEMKSKFYNGPVNGSHGGQHSERNSEASTRKDVLDAIVVSRLVEILNTSSPNLKRKAASILEFISIMDPGMEIIDSMEIESGLSAVFQLGVSI

Query:  DSDAEDWQPERYALEVEEASLAISAASRLLTKLLDSEKFCHKINSTHFTKLLRRTLKSDIPILHKDWIAACLVKVSSVSAPNVDSGDPINMEVTLYETIP
        DSDAE WQPERYALEVEEA LAISAASRLLTKLLDSEKF +KINST FTKLLR+ LKSDIPI HKDWIAACL+KVSS+S  N DSGDPINMEVTLYETIP
Subjt:  DSDAEDWQPERYALEVEEASLAISAASRLLTKLLDSEKFCHKINSTHFTKLLRRTLKSDIPILHKDWIAACLVKVSSVSAPNVDSGDPINMEVTLYETIP

Query:  RLIEQIKSSFSMEVQESAVVELNRIISEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMDTENHPAILAAGAVPVLRRIVLSQRVQWRQ
        RLIEQIKSSFS+EVQESAVVELNRI+SEG+VDATRAVA KGGIFPLVKLIDEGSERA EAALAILYNLSMD+ENHPAI+AAGAVP LRRIVLSQRVQWRQ
Subjt:  RLIEQIKSSFSMEVQESAVVELNRIISEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMDTENHPAILAAGAVPVLRRIVLSQRVQWRQ

Query:  ALYLLRTLPT
        ALYLLRTLPT
Subjt:  ALYLLRTLPT

A0A6J1CY29 uncharacterized protein LOC1110156380.0e+0089.66Show/hide
Query:  MKVFAVGAAGTRARDLSVFLYRLSYCSRNPESLSRRVVRRRVSCDGGGAVDSTQQQSAAPETKDVQNDSSSVGHSYVALFVRMLGLDHDPLDREQAIIAL
        MKVF   AAGT+ R+LSVFLY L Y  R PESLS R VRR VS DGGGAVDST QQSAAP+ KDVQNDSSSVGHSYVALFVRMLGLDHDPLDREQA+IAL
Subjt:  MKVFAVGAAGTRARDLSVFLYRLSYCSRNPESLSRRVVRRRVSCDGGGAVDSTQQQSAAPETKDVQNDSSSVGHSYVALFVRMLGLDHDPLDREQAIIAL

Query:  WKYSLGGKKHIDAIMQFPGCINLTVNLLRSESISTCEAAAGLLRSISLVNLYRDSVAESGAIEEITALLSQPSLTPEVKEQSICVLWNLSVDEKLRIKIA
        WKYSLGGKKHIDAIM+FPGCINLTVNLL+SESI+TCEAAAGLLRSISLVNLYRDSVAESGAIEEIT LLS+PSL PEVKEQSICVLWNLSVDEKLR+KIA
Subjt:  WKYSLGGKKHIDAIMQFPGCINLTVNLLRSESISTCEAAAGLLRSISLVNLYRDSVAESGAIEEITALLSQPSLTPEVKEQSICVLWNLSVDEKLRIKIA

Query:  NTDILPLLSKNLDDEDMKVKEAAGGVLANLALSPCNHGIIVESGLISKLAYQLKAEAESSKIVRKEARNALLELAKDEYYRILVIEEGLVPVPIVGAAAY
        +TDIL LLSKNLDDEDMKVKEAAGGVLANLALSPCNHG+IVESGLI KLAYQLKAEA+SSKIVRKEARN LLELAKDEYYRILVIEEGLVPVPIVGAAAY
Subjt:  NTDILPLLSKNLDDEDMKVKEAAGGVLANLALSPCNHGIIVESGLISKLAYQLKAEAESSKIVRKEARNALLELAKDEYYRILVIEEGLVPVPIVGAAAY

Query:  KSFRPGLHSWPSLPDGTEIEQSSKEPSRFGASELLLGLNVDNSANIEESKINAIVGRTQQQFLARIGAIDFEDLRDSQSESSTSNHLTLLPWIDGVARLV
        KSFRPGLHSWPSLPDGTEIEQSS  PSRFGASELLLGLNVDN ANIEE KINAIVGRTQQQFLARIGAI+FEDL+DSQ ESST NHLTLLPWIDGVARLV
Subjt:  KSFRPGLHSWPSLPDGTEIEQSSKEPSRFGASELLLGLNVDNSANIEESKINAIVGRTQQQFLARIGAIDFEDLRDSQSESSTSNHLTLLPWIDGVARLV

Query:  LILELEDDNAKVRAAESIADASINEHMRISFKEAGAIKHLVKLLDNVNNSVKWATIQALERLSISNVVCQTIENEGALGPLLNILKLSSIPENVMEKTLD
        LILELEDD A +RAAESIADASINEHMRISFKEAGAIKHLVK+LDN+N++VKWATI+ALERLSISNVVCQTIENEGALGPLL+IL+LS+IPENVMEKTLD
Subjt:  LILELEDDNAKVRAAESIADASINEHMRISFKEAGAIKHLVKLLDNVNNSVKWATIQALERLSISNVVCQTIENEGALGPLLNILKLSSIPENVMEKTLD

Query:  ILSRSLDPSKEMKSKFYNGPVNGSHGGQHSERNSEASTRKDVLDAIVVSRLVEILNTSSPNLKRKAASILEFISIMDPGMEIIDSMEIESGLSAVFQLGV
        IL R LDPSKEMKSKFYNGPVNGSHGGQHSE  SEASTRKDVLDA+ VS LVEILNTSSPNLKRKAASILEF+SIMDP MEIIDSM IESGL AVFQLGV
Subjt:  ILSRSLDPSKEMKSKFYNGPVNGSHGGQHSERNSEASTRKDVLDAIVVSRLVEILNTSSPNLKRKAASILEFISIMDPGMEIIDSMEIESGLSAVFQLGV

Query:  SIDSDAEDWQPERYALEVEEASLAISAASRLLTKLLDSEKFCHKINSTHFTKLLRRTLKSDIPILHKDWIAACLVKVSSVSAPNVDSGDPINMEVTLYET
        SIDSDAEDWQPERYALEVEEA LAISAASRLLTKLLDSEKF +KINS HFTKLLR+ LKSDIPI HKDWIAACLVKVSS+SA N+DSG+PINMEVTLYET
Subjt:  SIDSDAEDWQPERYALEVEEASLAISAASRLLTKLLDSEKFCHKINSTHFTKLLRRTLKSDIPILHKDWIAACLVKVSSVSAPNVDSGDPINMEVTLYET

Query:  IPRLIEQIKSSFSMEVQESAVVELNRIISEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMDTENHPAILAAGAVPVLRRIVLSQRVQW
        IPRLI+QIK SFSME QES+VVELNRI+SEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMD+ENHPAILAAGAVP LRRIVLSQR +W
Subjt:  IPRLIEQIKSSFSMEVQESAVVELNRIISEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMDTENHPAILAAGAVPVLRRIVLSQRVQW

Query:  RQALYLLRTLPT
        R+AL+LLRTLPT
Subjt:  RQALYLLRTLPT

A0A6J1FN67 uncharacterized protein LOC1114454460.0e+0086.84Show/hide
Query:  MKVFAVGAAGT-RARDLSVFLYRLSYCSRNPESLSRRVVRRRVSCDGGGAVDSTQQQSAAPETKDVQNDSSSVGHSYVALFVRMLGLDHDPLDREQAIIA
        MK+FAVGA GT + RD S+ L+RL +   + E   +RVV RRVS DGGGA+DSTQQQSAA + +DV NDSSSVGHSYVALFVRMLGLDHDPLDREQAI+A
Subjt:  MKVFAVGAAGT-RARDLSVFLYRLSYCSRNPESLSRRVVRRRVSCDGGGAVDSTQQQSAAPETKDVQNDSSSVGHSYVALFVRMLGLDHDPLDREQAIIA

Query:  LWKYSLGGKKHIDAIMQFPGCINLTVNLLRSESISTCEAAAGLLRSISLVNLYRDSVAESGAIEEITALLSQPSLTPEVKEQSICVLWNLSVDEKLRIKI
        LWKYSLGGKKHIDAIMQFPGCINLTVNLLRSES STCEAAAGLLRSIS+VNL+RDSVAESGAIEEITALLS+PSLTPEVKEQSICVLWNLSVDEKLR KI
Subjt:  LWKYSLGGKKHIDAIMQFPGCINLTVNLLRSESISTCEAAAGLLRSISLVNLYRDSVAESGAIEEITALLSQPSLTPEVKEQSICVLWNLSVDEKLRIKI

Query:  ANTDILPLLSKNLDDEDMKVKEAAGGVLANLALSPCNHGIIVESGLISKLAYQLKAEAESSKIVRKEARNALLELAKDEYYRILVIEEGLVPVPIVGAAA
        ANTDILPLL KNLDDEDMKVKEAAGGV+ANL LSPCNHG+IVESGLI KLAYQLKAEA+SSK +RKEARNALLEL KDEYYRILVIEEGLVPVP++GAAA
Subjt:  ANTDILPLLSKNLDDEDMKVKEAAGGVLANLALSPCNHGIIVESGLISKLAYQLKAEAESSKIVRKEARNALLELAKDEYYRILVIEEGLVPVPIVGAAA

Query:  YKSFRPGLHSWPSLPDGTEIEQSSKEPSRFGASELLLGLNVDNSANIEESKINAIVGRTQQQFLARIGAIDFEDLRDSQSESSTSNHLTLLPWIDGVARL
        YKSF+PGLHSWPSLPDGTEIE+SSK+PSRFGASELLLGLNVDN+ NI+E KINAIVGR+QQQFLARIGAI+ EDL+D+QSESSTSNHLTLLPW DGVARL
Subjt:  YKSFRPGLHSWPSLPDGTEIEQSSKEPSRFGASELLLGLNVDNSANIEESKINAIVGRTQQQFLARIGAIDFEDLRDSQSESSTSNHLTLLPWIDGVARL

Query:  VLILELEDDNAKVRAAESIADASINEHMRISFKEAGAIKHLVKLLDNVNNSVKWATIQALERLSISNVVCQTIENEGALGPLLNILKLSSIPENVMEKTL
        VL+LELEDDNAKVR AE IADASINEHMR+SFKEAGAIKHLVKLLD++NNSVKWA+IQALERLSISNVVCQTIENEGAL PLL+ILKLSSIPENVMEKTL
Subjt:  VLILELEDDNAKVRAAESIADASINEHMRISFKEAGAIKHLVKLLDNVNNSVKWATIQALERLSISNVVCQTIENEGALGPLLNILKLSSIPENVMEKTL

Query:  DILSRSLDPSKEMKSKFYNGPVNGSHGGQHSERNSEASTRKDVLDAIVVSRLVEILNTSSPNLKRKAASILEFISIMDPGMEIIDSMEIESGLSAVFQLG
        DILSR LDPSKEMKSKFY+GP+NGS GGQHSERNSEAST KDVLDA+VVS LVEIL TSSPNLKRKAASILEFISIMDP M+IID MEIESGLSAVFQLG
Subjt:  DILSRSLDPSKEMKSKFYNGPVNGSHGGQHSERNSEASTRKDVLDAIVVSRLVEILNTSSPNLKRKAASILEFISIMDPGMEIIDSMEIESGLSAVFQLG

Query:  VSIDSDAEDWQPERYALEVEEASLAISAASRLLTKLLDSEKFCHKINSTHFTKLLRRTLKSDIPILHKDWIAACLVKVSSVSAPNVDSGDPINMEVTLYE
        VSID+DAEDW PER+ALEVEEA LAISAASRLLTKL+DSEKFC+KIN+  FT+ LRR LK DIPI HKDWIAACL+KVSSV+AP++DSGDPI+MEV LYE
Subjt:  VSIDSDAEDWQPERYALEVEEASLAISAASRLLTKLLDSEKFCHKINSTHFTKLLRRTLKSDIPILHKDWIAACLVKVSSVSAPNVDSGDPINMEVTLYE

Query:  TIPRLIEQIKSSFSMEVQESAVVELNRIISEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMDTENHPAILAAGAVPVLRRIVLSQRVQ
        TIPRLI+Q+KSS SMEVQESAVVELNRI+SEG+VDATRAVASKGGIFPLVKLIDEGSERA EAALA+LYNLSMDTENHPAILAAGAVP LRRIVLSQRVQ
Subjt:  TIPRLIEQIKSSFSMEVQESAVVELNRIISEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMDTENHPAILAAGAVPVLRRIVLSQRVQ

Query:  WRQALYLLRTLPT
        W+QALYLLRTLPT
Subjt:  WRQALYLLRTLPT

A0A6J1IEY0 uncharacterized protein LOC1114721020.0e+0086.35Show/hide
Query:  MKVFAVGAAGT-RARDLSVFLYRLSYCSRNPESLSRRVVRRRVSCDGGGAVDSTQQQSAAPETKDVQNDSSSVGHSYVALFVRMLGLDHDPLDREQAIIA
        MK+FAVGA GT + RD S+ L+RL +   + E   +RVVR RVS DGGGA+DSTQQQSA  + +DV NDSSSVGHSYVALF+RMLGLDHDPLDREQAI+A
Subjt:  MKVFAVGAAGT-RARDLSVFLYRLSYCSRNPESLSRRVVRRRVSCDGGGAVDSTQQQSAAPETKDVQNDSSSVGHSYVALFVRMLGLDHDPLDREQAIIA

Query:  LWKYSLGGKKHIDAIMQFPGCINLTVNLLRSESISTCEAAAGLLRSISLVNLYRDSVAESGAIEEITALLSQPSLTPEVKEQSICVLWNLSVDEKLRIKI
        LWKYSLGGKKHIDAIMQFPGC+NLTVNLLRSES STCEAAAGLLRSIS+VNL+RDSVAESGAIEEITALLS+PSLTPEVKEQSICVLWNLSVDEKLR KI
Subjt:  LWKYSLGGKKHIDAIMQFPGCINLTVNLLRSESISTCEAAAGLLRSISLVNLYRDSVAESGAIEEITALLSQPSLTPEVKEQSICVLWNLSVDEKLRIKI

Query:  ANTDILPLLSKNLDDEDMKVKEAAGGVLANLALSPCNHGIIVESGLISKLAYQLKAEAESSKIVRKEARNALLELAKDEYYRILVIEEGLVPVPIVGAAA
        ANTDILPLL KNLDDEDMKVKEAAGGV+ANL LSPCNHG+IVESGLI KLAYQLKAEA+SSK VRKEARNALLEL KDEYYRILVIEEGLVPVP++GAAA
Subjt:  ANTDILPLLSKNLDDEDMKVKEAAGGVLANLALSPCNHGIIVESGLISKLAYQLKAEAESSKIVRKEARNALLELAKDEYYRILVIEEGLVPVPIVGAAA

Query:  YKSFRPGLHSWPSLPDGTEIEQSSKEPSRFGASELLLGLNVDNSANIEESKINAIVGRTQQQFLARIGAIDFEDLRDSQSESSTSNHLTLLPWIDGVARL
        YKSF+PGLHSWPSLPDGTEIE+SSK+PSRFGASELLLGLNVDN+ANIEE KINAI+GR+QQ FLARIGAI+ EDL+D+QSESSTSNHLTLLPW DGVARL
Subjt:  YKSFRPGLHSWPSLPDGTEIEQSSKEPSRFGASELLLGLNVDNSANIEESKINAIVGRTQQQFLARIGAIDFEDLRDSQSESSTSNHLTLLPWIDGVARL

Query:  VLILELEDDNAKVRAAESIADASINEHMRISFKEAGAIKHLVKLLDNVNNSVKWATIQALERLSISNVVCQTIENEGALGPLLNILKLSSIPENVMEKTL
        VL+LELEDDNAKVR AE IADASINEHMR+SFKEAGAIKHLVKLLD++NNSVKWA+IQALERLSISNVVCQTIENEGAL PLL+ILKLSSIPENVMEKTL
Subjt:  VLILELEDDNAKVRAAESIADASINEHMRISFKEAGAIKHLVKLLDNVNNSVKWATIQALERLSISNVVCQTIENEGALGPLLNILKLSSIPENVMEKTL

Query:  DILSRSLDPSKEMKSKFYNGPVNGSHGGQHSERNSEASTRKDVLDAIVVSRLVEILNTSSPNLKRKAASILEFISIMDPGMEIIDSMEIESGLSAVFQLG
        DILSR LDPSKEMKSKFY+GP+NGS GGQHSERNSEASTRKDVLDA+VVS LVEIL TSSPNLKRKAASILEFISIMDP M+IID MEIESGLSAVFQLG
Subjt:  DILSRSLDPSKEMKSKFYNGPVNGSHGGQHSERNSEASTRKDVLDAIVVSRLVEILNTSSPNLKRKAASILEFISIMDPGMEIIDSMEIESGLSAVFQLG

Query:  VSIDSDAEDWQPERYALEVEEASLAISAASRLLTKLLDSEKFCHKINSTHFTKLLRRTLKSDIPILHKDWIAACLVKVSSVSAPNVDSGDPINMEVTLYE
        VSID+DAEDW PER+ALEVEEA LAISAASRLLTKL+DSEKFC+KIN+T FT+ LR+ LK DIP+ HKDWIAACL+KVSSV+A ++DSGDPI+MEV LYE
Subjt:  VSIDSDAEDWQPERYALEVEEASLAISAASRLLTKLLDSEKFCHKINSTHFTKLLRRTLKSDIPILHKDWIAACLVKVSSVSAPNVDSGDPINMEVTLYE

Query:  TIPRLIEQIKSSFSMEVQESAVVELNRIISEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMDTENHPAILAAGAVPVLRRIVLSQRVQ
        TIPRLI+++KSS SMEVQESAVVELNRI+SEG+VDATRAVASKGGIFPLVKLIDEGSERA EAALAILYNLSMDTENHPAILAAGAVP LRRIVLSQR Q
Subjt:  TIPRLIEQIKSSFSMEVQESAVVELNRIISEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMDTENHPAILAAGAVPVLRRIVLSQRVQ

Query:  WRQALYLLRTLPT
        W+QALYLLRTLPT
Subjt:  WRQALYLLRTLPT

SwissProt top hitse value%identityAlignment
F4I718 Protein CELLULOSE SYNTHASE INTERACTIVE 34.1e-0436.84Show/hide
Query:  SMEVQESAVVELNRIISEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMDTEN-HPAILAAGAVP
        S E  +   VE+ +I++  + D+  AV + GGI PLV+L++ GS++A E A  IL+NL   +E     +  AG +P
Subjt:  SMEVQESAVVELNRIISEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMDTEN-HPAILAAGAVP

Q5VRH9 U-box domain-containing protein 129.8e-0628.9Show/hide
Query:  GCINLTVNLLRSESISTCEAAAGLLRSISLVNL-YRDSVAESGAIEEITALLSQPSLTPEVKEQSICVLWNLSVDEKLRIKIANTDILPLLSKNLDDEDM
        G ++L +N LRS +     AAAG +R ++  N+  R  +AE+GAI  +  LLS  S  P  +E ++  L NLS+ E  +  I ++  +P + + L    M
Subjt:  GCINLTVNLLRSESISTCEAAAGLLRSISLVNL-YRDSVAESGAIEEITALLSQPSLTPEVKEQSICVLWNLSVDEKLRIKIANTDILPLLSKNLDDEDM

Query:  KVKEAAGGVLANLALSPCNHGIIVESGLISKLAYQLKAEAESSKIVRKEARNALLELAKDEYYRILVIEEGLV
        + +E A   L +L++   N   I  +G I  L   L    + S   +K+A  A+  L   +  ++  ++ G+V
Subjt:  KVKEAAGGVLANLALSPCNHGIIVESGLISKLAYQLKAEAESSKIVRKEARNALLELAKDEYYRILVIEEGLV

Q8VZ40 U-box domain-containing protein 147.5e-0623.51Show/hide
Query:  CDGGGAVDSTQQQSAAPETKDVQNDSSSVGHSYVALFVRMLGLDHDPLDREQAIIALWKYSLGGKKHID--AIMQFPGCINLTVNLLRSESISTCEAAAG
        C+  G ++  Q Q +   TK   + SS    ++V   +  L         EQ   A  +  L  K+++D    +   G I L V LL S    T E +  
Subjt:  CDGGGAVDSTQQQSAAPETKDVQNDSSSVGHSYVALFVRMLGLDHDPLDREQAIIALWKYSLGGKKHID--AIMQFPGCINLTVNLLRSESISTCEAAAG

Query:  LLRSISLVNLYRDSVAESGAIEEITALLSQPSLTPEVKEQSICVLWNLSVDEKLRIKIANTDILPLLSKNLDDEDMKVKEAAGGVLANLALSPCNHGIIV
         L ++S+    + ++ ++GAI +I  +L   S+  E +E +   L++LSV ++ ++ I     +  L   L++   + K+ A   + NL +   N    V
Subjt:  LLRSISLVNLYRDSVAESGAIEEITALLSQPSLTPEVKEQSICVLWNLSVDEKLRIKIANTDILPLLSKNLDDEDMKVKEAAGGVLANLALSPCNHGIIV

Query:  ESGLISKLAYQLKAEAESSKIVRKEARNALLELAKDEYYRILVIEEGLVPV
        + G++  L   LK   ++   +  EA   L  L+ ++  +  + E   +PV
Subjt:  ESGLISKLAYQLKAEAESSKIVRKEARNALLELAKDEYYRILVIEEGLVPV

Q9ZV31 U-box domain-containing protein 126.3e-0523.42Show/hide
Query:  VALFVRMLGLDHDPLDREQAIIALWKYSLGGKKHIDAIMQFPGCINLTVNLLRSESISTCEAAAGLLRSISLVNLYRDSVAESGAIEEITALLSQPSLTP
        + L V +L + +D   +E A+ ++   S+  +++   I+   G +   V++L+  S+   E AA  L S+S+++  + ++  +GAI  +  LLS+ S   
Subjt:  VALFVRMLGLDHDPLDREQAIIALWKYSLGGKKHIDAIMQFPGCINLTVNLLRSESISTCEAAAGLLRSISLVNLYRDSVAESGAIEEITALLSQPSLTP

Query:  EVKEQSICVLWNLSVDEKLRIKIANTDILPLLSKNLDDEDMKVKEAAGGVLANLALSP
          K+ +   L+NL + +  + K     ++P+L + L + +  + + +  +LA L+  P
Subjt:  EVKEQSICVLWNLSVDEKLRIKIANTDILPLLSKNLDDEDMKVKEAAGGVLANLALSP

Arabidopsis top hitse value%identityAlignment
AT1G23180.1 ARM repeat superfamily protein1.1e-24661.62Show/hide
Query:  TQQQSAAPETKDVQNDSSS-VGHSYVALFVRMLGLDHDPLDREQAIIALWKYSLGGKKHIDAIMQFPGCINLTVNLLRSESISTCEAAAGLLRSISLVNL
        T   +   + ++V+++SSS VG SYV LFV MLGLD+DPLDREQAI  LWKYSLGGKK IDAIMQF GC+NL VNLL+SES S CEAAAGL+RSI+ VNL
Subjt:  TQQQSAAPETKDVQNDSSS-VGHSYVALFVRMLGLDHDPLDREQAIIALWKYSLGGKKHIDAIMQFPGCINLTVNLLRSESISTCEAAAGLLRSISLVNL

Query:  YRDSVAESGAIEEITALLSQPSLTPEVKEQSICVLWNLSVDEKLRIKIANTDILPLLSKNLDDEDMKVKEAAGGVLANLALSPCNHGIIVESGLISKLAY
        YR+SVAESGA+EEITALLS+PSL   VKEQ IC LWNL+VDE++R K+A+ DIL LL   L+D+D+ VKEAAGGVLANLALS   H I+VE G+I KLA 
Subjt:  YRDSVAESGAIEEITALLSQPSLTPEVKEQSICVLWNLSVDEKLRIKIANTDILPLLSKNLDDEDMKVKEAAGGVLANLALSPCNHGIIVESGLISKLAY

Query:  QLKA---EAESSKIVRKEARNALLELAKDEYYRILVIEEGLVPVPIVGAAAYKSFRPGLHSWPSLPDGTEIEQSSKEPSRFGASELLLGLNVD-NSANIE
         LKA   E + SK++RKEARN LLELAKDEYYRILVIEEG+VP+PI+GA AYKSFRP L+SWPSLPDG  IEQ++K PSRFGASELLLGLNVD N  +++
Subjt:  QLKA---EAESSKIVRKEARNALLELAKDEYYRILVIEEGLVPVPIVGAAAYKSFRPGLHSWPSLPDGTEIEQSSKEPSRFGASELLLGLNVD-NSANIE

Query:  ESKINAIVGRTQQQFLARIGAIDFEDLRDSQSES-STSNHLTLLPWIDGVARLVLILELEDDNAKVRAAESIADASINEHMRISFKEAGAIKHLVKLLDN
        E+K+ AIVGRT QQFLARIGAI+FE  ++ +SE    S  LTLLP +DGVARLVLIL L D+ A  RAAESIADASINE MR+SF EAGA+K LV+LL N
Subjt:  ESKINAIVGRTQQQFLARIGAIDFEDLRDSQSES-STSNHLTLLPWIDGVARLVLILELEDDNAKVRAAESIADASINEHMRISFKEAGAIKHLVKLLDN

Query:  VN-NSVKWATIQALERLSISNVVCQTIENEGALGPLLNILKLSSIPENVMEKTLDILSRSLDPSKEMKSKFYNGPVNGSHGGQHSERNSEASTRKDVLDA
         N  +VK   I+AL+ LS+S  VCQ IE EGA+  L+N+LK   I  NV E  LDI++  LDPSKEM+SKFY GPVNG          S+A +RK+VLDA
Subjt:  VN-NSVKWATIQALERLSISNVVCQTIENEGALGPLLNILKLSSIPENVMEKTLDILSRSLDPSKEMKSKFYNGPVNGSHGGQHSERNSEASTRKDVLDA

Query:  IVVSRLVEILNTSSPNLKRKAASILEFISIMDPGMEIIDSMEIESGLSAVFQLGVSIDSDAEDWQPERYALEVEEASLAISAASRLLTKLLDSEKFCHKI
         V SRLV+I  T+SPNL R A S++EF  I +P M+ I S +I + L    +  V  + + E  + E++ L++EEA L ISAASRLLTKLLDSE F   I
Subjt:  IVVSRLVEILNTSSPNLKRKAASILEFISIMDPGMEIIDSMEIESGLSAVFQLGVSIDSDAEDWQPERYALEVEEASLAISAASRLLTKLLDSEKFCHKI

Query:  NSTHFTKLLRRTLKSDIPILHKDWIAACLVKVSSVSAPNVDSGDPINMEVTLYETIPRLIEQIKSSFSMEVQESAVVELNRIISEGIVDATRAVASKGGI
        ++  F +L+R+ L+S +P+ +KDW+AACLVK++++S+P+    +PIN+EVTLY+TIP L+EQ+  S S E +E+AV+ELN+I+SEG+ ++ + +AS+GGI
Subjt:  NSTHFTKLLRRTLKSDIPILHKDWIAACLVKVSSVSAPNVDSGDPINMEVTLYETIPRLIEQIKSSFSMEVQESAVVELNRIISEGIVDATRAVASKGGI

Query:  FPLVKLIDEGSERATEAALAILYNLSMDTENHPAILAAGAVPVLRRIVLSQRVQWRQALYLLRTLP
         PLVKL++E +ER  EA+L++LYNL+MD+ENH AI+ AGAVPVLRRIV+SQR QW +AL LLR LP
Subjt:  FPLVKLIDEGSERATEAALAILYNLSMDTENHPAILAAGAVPVLRRIVLSQRVQWRQALYLLRTLP

AT1G77460.1 Armadillo/beta-catenin-like repeat ; C2 calcium/lipid-binding domain (CaLB) protein2.9e-0536.84Show/hide
Query:  SMEVQESAVVELNRIISEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMDTEN-HPAILAAGAVP
        S E  +   VE+ +I++  + D+  AV + GGI PLV+L++ GS++A E A  IL+NL   +E     +  AG +P
Subjt:  SMEVQESAVVELNRIISEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMDTEN-HPAILAAGAVP

AT2G28830.1 PLANT U-BOX 124.5e-0623.42Show/hide
Query:  VALFVRMLGLDHDPLDREQAIIALWKYSLGGKKHIDAIMQFPGCINLTVNLLRSESISTCEAAAGLLRSISLVNLYRDSVAESGAIEEITALLSQPSLTP
        + L V +L + +D   +E A+ ++   S+  +++   I+   G +   V++L+  S+   E AA  L S+S+++  + ++  +GAI  +  LLS+ S   
Subjt:  VALFVRMLGLDHDPLDREQAIIALWKYSLGGKKHIDAIMQFPGCINLTVNLLRSESISTCEAAAGLLRSISLVNLYRDSVAESGAIEEITALLSQPSLTP

Query:  EVKEQSICVLWNLSVDEKLRIKIANTDILPLLSKNLDDEDMKVKEAAGGVLANLALSP
          K+ +   L+NL + +  + K     ++P+L + L + +  + + +  +LA L+  P
Subjt:  EVKEQSICVLWNLSVDEKLRIKIANTDILPLLSKNLDDEDMKVKEAAGGVLANLALSP

AT3G01400.1 ARM repeat superfamily protein3.7e-0827.59Show/hide
Query:  GCINLTVNLLRSESISTCEAAAGLLRSISLVNLYRDSVAESGAIEEITALLSQPSLTPEVKEQSICVLWNLSVDEKLRIKIANTDILPLLSKNLDDEDMK
        G I   ++L+ S  +   E     + ++SL +  ++S+A SGAI+ +   L     TP  KE + C L  LS  E+ ++ I  +  +PLL   L+    +
Subjt:  GCINLTVNLLRSESISTCEAAAGLLRSISLVNLYRDSVAESGAIEEITALLSQPSLTPEVKEQSICVLWNLSVDEKLRIKIANTDILPLLSKNLDDEDMK

Query:  VKEAAGGVLANLALSPCNHGIIVESGLISKLAYQLKAEAESSKIVRKEARNALLELAKDEYYRILVIEEGLVPV
         K+ A   L +L  +  N    V+SG++  L  +L A+  S+ + +     +LL    +   +  ++EEG VPV
Subjt:  VKEAAGGVLANLALSPCNHGIIVESGLISKLAYQLKAEAESSKIVRKEARNALLELAKDEYYRILVIEEGLVPV

AT3G54850.1 plant U-box 145.3e-0723.51Show/hide
Query:  CDGGGAVDSTQQQSAAPETKDVQNDSSSVGHSYVALFVRMLGLDHDPLDREQAIIALWKYSLGGKKHID--AIMQFPGCINLTVNLLRSESISTCEAAAG
        C+  G ++  Q Q +   TK   + SS    ++V   +  L         EQ   A  +  L  K+++D    +   G I L V LL S    T E +  
Subjt:  CDGGGAVDSTQQQSAAPETKDVQNDSSSVGHSYVALFVRMLGLDHDPLDREQAIIALWKYSLGGKKHID--AIMQFPGCINLTVNLLRSESISTCEAAAG

Query:  LLRSISLVNLYRDSVAESGAIEEITALLSQPSLTPEVKEQSICVLWNLSVDEKLRIKIANTDILPLLSKNLDDEDMKVKEAAGGVLANLALSPCNHGIIV
         L ++S+    + ++ ++GAI +I  +L   S+  E +E +   L++LSV ++ ++ I     +  L   L++   + K+ A   + NL +   N    V
Subjt:  LLRSISLVNLYRDSVAESGAIEEITALLSQPSLTPEVKEQSICVLWNLSVDEKLRIKIANTDILPLLSKNLDDEDMKVKEAAGGVLANLALSPCNHGIIV

Query:  ESGLISKLAYQLKAEAESSKIVRKEARNALLELAKDEYYRILVIEEGLVPV
        + G++  L   LK   ++   +  EA   L  L+ ++  +  + E   +PV
Subjt:  ESGLISKLAYQLKAEAESSKIVRKEARNALLELAKDEYYRILVIEEGLVPV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGGTTTTTGCAGTTGGCGCTGCGGGGACTAGAGCCAGAGACCTATCGGTCTTTCTCTATCGTCTAAGTTACTGCAGTCGCAACCCAGAATCTCTCTCTAGACGTGT
TGTCCGCAGGAGGGTCAGCTGTGACGGCGGTGGCGCTGTTGATTCTACGCAGCAGCAGTCTGCCGCTCCTGAAACCAAAGATGTACAAAATGATTCTTCTAGTGTTGGAC
ACAGTTATGTGGCATTATTTGTCAGGATGCTTGGCTTAGATCATGATCCTCTCGATAGAGAACAAGCAATAATAGCTTTATGGAAATATTCACTTGGAGGAAAGAAGCAC
ATAGATGCCATCATGCAGTTTCCTGGATGCATAAATCTAACAGTAAATCTTCTTAGGTCAGAGTCAATCTCTACGTGTGAAGCAGCTGCTGGCCTTCTACGGTCAATTTC
TCTTGTGAACTTGTACCGAGATTCAGTTGCAGAAAGTGGAGCAATTGAAGAGATAACTGCCTTGCTTAGTCAACCCTCCTTGACTCCCGAGGTGAAGGAGCAAAGCATAT
GCGTTTTGTGGAATTTATCAGTAGATGAAAAGCTCAGAATCAAAATTGCCAATACTGATATTCTGCCATTGCTTAGTAAGAATCTTGATGATGAGGACATGAAGGTGAAG
GAAGCAGCGGGAGGGGTTTTAGCAAATCTGGCTTTGAGCCCATGTAACCATGGGATTATTGTTGAATCAGGCTTAATTTCTAAACTGGCGTATCAGTTAAAAGCTGAGGC
AGAGAGCTCAAAAATTGTGAGAAAGGAAGCAAGAAATGCTTTGCTAGAACTTGCTAAAGATGAATATTATAGAATTCTTGTCATAGAGGAAGGGCTGGTTCCAGTACCAA
TTGTTGGTGCTGCTGCCTATAAATCCTTCAGACCAGGCCTGCATTCATGGCCCAGTTTGCCTGATGGCACAGAAATTGAACAATCTTCCAAAGAACCTTCAAGATTTGGT
GCCTCTGAATTACTCCTCGGATTAAATGTTGATAATAGTGCCAACATAGAAGAAAGCAAGATAAATGCAATTGTTGGACGGACACAGCAACAATTTCTGGCTCGAATCGG
TGCCATAGATTTTGAAGATTTGAGGGACTCTCAATCTGAATCATCTACCAGTAACCATCTTACACTCCTACCTTGGATAGATGGTGTGGCTCGACTTGTTTTGATCCTTG
AACTTGAGGATGATAATGCCAAAGTGAGAGCTGCAGAGTCAATTGCTGATGCATCTATCAACGAACACATGCGTATTTCATTCAAGGAAGCTGGAGCAATCAAACATTTA
GTAAAGCTTTTAGATAATGTGAATAATTCAGTCAAATGGGCTACAATTCAAGCTCTGGAGAGACTGTCAATTAGCAATGTTGTTTGCCAGACAATTGAAAATGAAGGTGC
ACTTGGTCCTCTGCTCAACATTTTAAAGCTCTCAAGTATACCTGAAAATGTGATGGAAAAGACCCTCGATATACTTTCTCGGAGCTTGGACCCTAGTAAAGAAATGAAAT
CAAAGTTTTACAATGGACCGGTAAACGGGTCTCATGGGGGACAACATTCAGAAAGAAATTCTGAAGCTTCTACCAGGAAAGATGTGCTGGATGCCATTGTTGTTTCTCGC
CTTGTTGAGATTTTGAACACCTCATCCCCAAACTTGAAACGCAAAGCTGCTTCTATCCTGGAATTTATTTCTATTATGGACCCAGGCATGGAGATAATCGATTCCATGGA
GATAGAATCTGGTTTATCAGCTGTTTTCCAGCTAGGAGTTTCAATAGATTCTGATGCTGAAGATTGGCAGCCTGAAAGATATGCCCTTGAAGTTGAGGAAGCCAGTCTTG
CAATATCAGCAGCCTCCCGACTACTGACGAAACTTCTAGATTCTGAGAAGTTCTGCCACAAAATAAACTCCACCCATTTCACTAAGTTGCTTCGTCGAACCCTGAAGTCA
GATATTCCCATTCTCCACAAAGATTGGATTGCTGCTTGCCTAGTCAAAGTCAGCTCCGTTTCAGCTCCGAATGTAGATTCTGGGGATCCAATCAACATGGAGGTCACTCT
GTACGAAACTATACCAAGACTTATTGAGCAGATCAAAAGCTCCTTCTCCATGGAAGTTCAAGAATCTGCTGTTGTGGAGCTGAACAGAATAATCTCAGAGGGAATAGTCG
ATGCCACCCGAGCTGTTGCCTCTAAAGGTGGTATCTTTCCATTGGTAAAGCTGATTGATGAAGGAAGTGAGAGGGCAACAGAAGCAGCCTTAGCCATACTGTATAATCTG
AGCATGGACACTGAAAATCATCCCGCAATTTTAGCTGCTGGAGCCGTGCCTGTGCTGAGGAGAATTGTTCTGTCGCAACGTGTGCAGTGGCGACAAGCTCTTTATTTGTT
GAGGACGTTGCCTACATGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGGTTTTTGCAGTTGGCGCTGCGGGGACTAGAGCCAGAGACCTATCGGTCTTTCTCTATCGTCTAAGTTACTGCAGTCGCAACCCAGAATCTCTCTCTAGACGTGT
TGTCCGCAGGAGGGTCAGCTGTGACGGCGGTGGCGCTGTTGATTCTACGCAGCAGCAGTCTGCCGCTCCTGAAACCAAAGATGTACAAAATGATTCTTCTAGTGTTGGAC
ACAGTTATGTGGCATTATTTGTCAGGATGCTTGGCTTAGATCATGATCCTCTCGATAGAGAACAAGCAATAATAGCTTTATGGAAATATTCACTTGGAGGAAAGAAGCAC
ATAGATGCCATCATGCAGTTTCCTGGATGCATAAATCTAACAGTAAATCTTCTTAGGTCAGAGTCAATCTCTACGTGTGAAGCAGCTGCTGGCCTTCTACGGTCAATTTC
TCTTGTGAACTTGTACCGAGATTCAGTTGCAGAAAGTGGAGCAATTGAAGAGATAACTGCCTTGCTTAGTCAACCCTCCTTGACTCCCGAGGTGAAGGAGCAAAGCATAT
GCGTTTTGTGGAATTTATCAGTAGATGAAAAGCTCAGAATCAAAATTGCCAATACTGATATTCTGCCATTGCTTAGTAAGAATCTTGATGATGAGGACATGAAGGTGAAG
GAAGCAGCGGGAGGGGTTTTAGCAAATCTGGCTTTGAGCCCATGTAACCATGGGATTATTGTTGAATCAGGCTTAATTTCTAAACTGGCGTATCAGTTAAAAGCTGAGGC
AGAGAGCTCAAAAATTGTGAGAAAGGAAGCAAGAAATGCTTTGCTAGAACTTGCTAAAGATGAATATTATAGAATTCTTGTCATAGAGGAAGGGCTGGTTCCAGTACCAA
TTGTTGGTGCTGCTGCCTATAAATCCTTCAGACCAGGCCTGCATTCATGGCCCAGTTTGCCTGATGGCACAGAAATTGAACAATCTTCCAAAGAACCTTCAAGATTTGGT
GCCTCTGAATTACTCCTCGGATTAAATGTTGATAATAGTGCCAACATAGAAGAAAGCAAGATAAATGCAATTGTTGGACGGACACAGCAACAATTTCTGGCTCGAATCGG
TGCCATAGATTTTGAAGATTTGAGGGACTCTCAATCTGAATCATCTACCAGTAACCATCTTACACTCCTACCTTGGATAGATGGTGTGGCTCGACTTGTTTTGATCCTTG
AACTTGAGGATGATAATGCCAAAGTGAGAGCTGCAGAGTCAATTGCTGATGCATCTATCAACGAACACATGCGTATTTCATTCAAGGAAGCTGGAGCAATCAAACATTTA
GTAAAGCTTTTAGATAATGTGAATAATTCAGTCAAATGGGCTACAATTCAAGCTCTGGAGAGACTGTCAATTAGCAATGTTGTTTGCCAGACAATTGAAAATGAAGGTGC
ACTTGGTCCTCTGCTCAACATTTTAAAGCTCTCAAGTATACCTGAAAATGTGATGGAAAAGACCCTCGATATACTTTCTCGGAGCTTGGACCCTAGTAAAGAAATGAAAT
CAAAGTTTTACAATGGACCGGTAAACGGGTCTCATGGGGGACAACATTCAGAAAGAAATTCTGAAGCTTCTACCAGGAAAGATGTGCTGGATGCCATTGTTGTTTCTCGC
CTTGTTGAGATTTTGAACACCTCATCCCCAAACTTGAAACGCAAAGCTGCTTCTATCCTGGAATTTATTTCTATTATGGACCCAGGCATGGAGATAATCGATTCCATGGA
GATAGAATCTGGTTTATCAGCTGTTTTCCAGCTAGGAGTTTCAATAGATTCTGATGCTGAAGATTGGCAGCCTGAAAGATATGCCCTTGAAGTTGAGGAAGCCAGTCTTG
CAATATCAGCAGCCTCCCGACTACTGACGAAACTTCTAGATTCTGAGAAGTTCTGCCACAAAATAAACTCCACCCATTTCACTAAGTTGCTTCGTCGAACCCTGAAGTCA
GATATTCCCATTCTCCACAAAGATTGGATTGCTGCTTGCCTAGTCAAAGTCAGCTCCGTTTCAGCTCCGAATGTAGATTCTGGGGATCCAATCAACATGGAGGTCACTCT
GTACGAAACTATACCAAGACTTATTGAGCAGATCAAAAGCTCCTTCTCCATGGAAGTTCAAGAATCTGCTGTTGTGGAGCTGAACAGAATAATCTCAGAGGGAATAGTCG
ATGCCACCCGAGCTGTTGCCTCTAAAGGTGGTATCTTTCCATTGGTAAAGCTGATTGATGAAGGAAGTGAGAGGGCAACAGAAGCAGCCTTAGCCATACTGTATAATCTG
AGCATGGACACTGAAAATCATCCCGCAATTTTAGCTGCTGGAGCCGTGCCTGTGCTGAGGAGAATTGTTCTGTCGCAACGTGTGCAGTGGCGACAAGCTCTTTATTTGTT
GAGGACGTTGCCTACATGA
Protein sequenceShow/hide protein sequence
MKVFAVGAAGTRARDLSVFLYRLSYCSRNPESLSRRVVRRRVSCDGGGAVDSTQQQSAAPETKDVQNDSSSVGHSYVALFVRMLGLDHDPLDREQAIIALWKYSLGGKKH
IDAIMQFPGCINLTVNLLRSESISTCEAAAGLLRSISLVNLYRDSVAESGAIEEITALLSQPSLTPEVKEQSICVLWNLSVDEKLRIKIANTDILPLLSKNLDDEDMKVK
EAAGGVLANLALSPCNHGIIVESGLISKLAYQLKAEAESSKIVRKEARNALLELAKDEYYRILVIEEGLVPVPIVGAAAYKSFRPGLHSWPSLPDGTEIEQSSKEPSRFG
ASELLLGLNVDNSANIEESKINAIVGRTQQQFLARIGAIDFEDLRDSQSESSTSNHLTLLPWIDGVARLVLILELEDDNAKVRAAESIADASINEHMRISFKEAGAIKHL
VKLLDNVNNSVKWATIQALERLSISNVVCQTIENEGALGPLLNILKLSSIPENVMEKTLDILSRSLDPSKEMKSKFYNGPVNGSHGGQHSERNSEASTRKDVLDAIVVSR
LVEILNTSSPNLKRKAASILEFISIMDPGMEIIDSMEIESGLSAVFQLGVSIDSDAEDWQPERYALEVEEASLAISAASRLLTKLLDSEKFCHKINSTHFTKLLRRTLKS
DIPILHKDWIAACLVKVSSVSAPNVDSGDPINMEVTLYETIPRLIEQIKSSFSMEVQESAVVELNRIISEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNL
SMDTENHPAILAAGAVPVLRRIVLSQRVQWRQALYLLRTLPT