; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS021503 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS021503
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionU-box domain-containing protein 4
Genome locationscaffold208:203795..209100
RNA-Seq ExpressionMS021503
SyntenyMS021503
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR000225 - Armadillo
IPR011989 - Armadillo-like helical
IPR016024 - Armadillo-type fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6579255.1 U-box domain-containing protein 12, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0084.14Show/hide
Query:  KPRNLSVFLYHLPYRGRYPESLSNRFVRRSVSSDGGGAVDSTHQQSAAPDIKDVQNDSSSVGHSYVALFVRMLGLDHDPLDREQAVIALWKYSLGGKKHI
        KPR+ S+ L+ L +     E    R V R VSSDGGGA+DST QQSAA DI+DV NDSSSVGHSYVALFVRMLGLDHDPLDREQA++ALWKYSLGGKKHI
Subjt:  KPRNLSVFLYHLPYRGRYPESLSNRFVRRSVSSDGGGAVDSTHQQSAAPDIKDVQNDSSSVGHSYVALFVRMLGLDHDPLDREQAVIALWKYSLGGKKHI

Query:  DAIMKFPGCINLTVNLLQSESIATCEAAAGLLRSISLVNLYRDSVAESGAIEEITGLLSRPSLAPEVKEQSICVLWNLSVDEKLRLKIADTDILLLLSKN
        DAIM+FPGCINLTVNLL+SES +TCEAAAGLLRSIS+VNL+RDSVAESGAIEEIT LLSRPSL PEVKEQSICVLWNLSVDE+LR KIA+TDIL LL KN
Subjt:  DAIMKFPGCINLTVNLLQSESIATCEAAAGLLRSISLVNLYRDSVAESGAIEEITGLLSRPSLAPEVKEQSICVLWNLSVDEKLRLKIADTDILLLLSKN

Query:  LDDEDMKVKEAAGGVLANLALSPCNHGVIVESGLIPKLAYQLKAEADSSKIVRKEARNVLLELAKDEYYRILVIEEGLVPVPIVGAAAYKSFRPGLHSWP
        LDDEDMKVKEAAGGV+ANL LSPCNHGVIVESGLIPKLAYQLKAEADSSK VRKEARN LLEL KDEYYRILVIEEGLVPVP++GAAAYKSF+PGLHSWP
Subjt:  LDDEDMKVKEAAGGVLANLALSPCNHGVIVESGLIPKLAYQLKAEADSSKIVRKEARNVLLELAKDEYYRILVIEEGLVPVPIVGAAAYKSFRPGLHSWP

Query:  SLPDGTEIEQSSNGPSRFGASELLLGLNVDNKANIEEGKINAIVGRTQQQFLARIGAIEFEDLKDSQLESSTGNHLTLLPWIDGVARLVLILELEDDTAV
        SLPDGTEIE+SS  PSRFGASELLLGLNVDN ANIEEGKINAI+GR+QQQFLARIGAIE EDLKD+Q ESST NHLTLLPW DGVARLVL+LELEDD A 
Subjt:  SLPDGTEIEQSSNGPSRFGASELLLGLNVDNKANIEEGKINAIVGRTQQQFLARIGAIEFEDLKDSQLESSTGNHLTLLPWIDGVARLVLILELEDDTAV

Query:  MRAAESIADASINEHMRISFKEAGAIKHLVKVLDNMNDAVKWATIRALERLSISNVVCQTIENEGALGPLLSILRLSNIPENVMEKTLDILCRILDPSKE
        +R AE IADASINEHMR+SFKEAGAIKHLVK+LD+MN++VKWA+++ALERLSISNVVCQTIENEGAL PLLSIL+LS+IPENVMEKTLDIL RILDPSKE
Subjt:  MRAAESIADASINEHMRISFKEAGAIKHLVKVLDNMNDAVKWATIRALERLSISNVVCQTIENEGALGPLLSILRLSNIPENVMEKTLDILCRILDPSKE

Query:  MKSKFYNGPVNGSHGGQHSE--SEASTRKDVLDAVAVSRLVEILNTSSPNLKRKAASILEFVSIMDPSMEIIDSMAIESGLLAVFQLGVSIDSDAEDWQP
        MKSKFYNGP+NGS GGQHSE  SEASTRKDVLDAV VS LVEIL TSSPNLKRKAASILEF+SIMDPSM+IID M IESGL AVFQLGVSID+DAEDW P
Subjt:  MKSKFYNGPVNGSHGGQHSE--SEASTRKDVLDAVAVSRLVEILNTSSPNLKRKAASILEFVSIMDPSMEIIDSMAIESGLLAVFQLGVSIDSDAEDWQP

Query:  ERYALEVEEAGLAISAASRLLTKLLDSEKFYNKINSAHFTKLLRQILKSDIPIHHKDWIAACLVKVSSISALNMDSGNPINMEVTLYETIPRLIQQIKGS
        ER+ALEVEEAGLAISAASRLLTKL+DSEKF NKIN+A FT+ LR+ILK DIP+ HKDWIAACL+KVSS++A ++DSG+PI+MEV LYETIPRLIQQ+K S
Subjt:  ERYALEVEEAGLAISAASRLLTKLLDSEKFYNKINSAHFTKLLRQILKSDIPIHHKDWIAACLVKVSSISALNMDSGNPINMEVTLYETIPRLIQQIKGS

Query:  FSMEDQESSVVELNRILSEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMDSENHPAILAAGAVPALRRIVLSQRAEWRRALHLLRTLP
         S E QES+VVELNRI+SEG+VDATRAVASKGGIFPLVKLIDEGSERA EAALA+LYNLSMD+ENHPAILAAGAVPALRRIVLSQR +W++AL+LLRTLP
Subjt:  FSMEDQESSVVELNRILSEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMDSENHPAILAAGAVPALRRIVLSQRAEWRRALHLLRTLP

Query:  T
        T
Subjt:  T

XP_008467122.1 PREDICTED: uncharacterized protein LOC103504553 [Cucumis melo]0.0e+0086.16Show/hide
Query:  KPRNLSVFLYHLPYRGRYPESLSNRFVRRSVSSD-GGGAVDSTHQQSAAPDIKDVQNDSSSVGHSYVALFVRMLGLDHDPLDREQAVIALWKYSLGGKKH
        KP + S+FLY L +     +SLS R +   VSSD GGGAVDS+  QSA P IKDVQNDSSS+G SYVALFVRML LD+DPLDREQA+IALWKYSLGGKKH
Subjt:  KPRNLSVFLYHLPYRGRYPESLSNRFVRRSVSSD-GGGAVDSTHQQSAAPDIKDVQNDSSSVGHSYVALFVRMLGLDHDPLDREQAVIALWKYSLGGKKH

Query:  IDAIMKFPGCINLTVNLLQSESIATCEAAAGLLRSISLVNLYRDSVAESGAIEEITGLLSRPSLAPEVKEQSICVLWNLSVDEKLRLKIADTDILLLLSK
        IDAIM+FPGCINL VNLL+SES++T EAAAGLLRSISLVNLYR+SVAESGAIEEITGLL +PSL PEVKEQSICVLWNLSVDEKLR+KIA+TDIL LLSK
Subjt:  IDAIMKFPGCINLTVNLLQSESIATCEAAAGLLRSISLVNLYRDSVAESGAIEEITGLLSRPSLAPEVKEQSICVLWNLSVDEKLRLKIADTDILLLLSK

Query:  NLDDEDMKVKEAAGGVLANLALSPCNHGVIVESGLIPKLAYQLKAEADSSKIVRKEARNVLLELAKDEYYRILVIEEGLVPVPIVGAAAYKSFRPGLHSW
        NLDDEDMKVKEAAGGVLANLALSPCNHGVIVESGLIPKLAYQLKAEADSSKI+RKEARN LLEL KDEYYRILVIEEGLVPVPI+GAAAYKSFRPGLHSW
Subjt:  NLDDEDMKVKEAAGGVLANLALSPCNHGVIVESGLIPKLAYQLKAEADSSKIVRKEARNVLLELAKDEYYRILVIEEGLVPVPIVGAAAYKSFRPGLHSW

Query:  PSLPDGTEIEQSSNGPSRFGASELLLGLNVDNKANIEEGKINAIVGRTQQQFLARIGAIEFEDLKDSQLESSTGNHLTLLPWIDGVARLVLILELEDDTA
        PSLPDGTEIEQSS  PSRFGASELLLGLNVDN ANIEEGKINAIVGRTQQQFLARIGAIE E++KDSQ ESS+ NHLTLLPWIDGVARLVLILELEDD A
Subjt:  PSLPDGTEIEQSSNGPSRFGASELLLGLNVDNKANIEEGKINAIVGRTQQQFLARIGAIEFEDLKDSQLESSTGNHLTLLPWIDGVARLVLILELEDDTA

Query:  VMRAAESIADASINEHMRISFKEAGAIKHLVKVLDNMNDAVKWATIRALERLSISNVVCQTIENEGALGPLLSILRLSNIPENVMEKTLDILCRILDPSK
        V+RAAESI DASINEHMRISFKEAGAIKHLV  LD MND+VKWA ++ALERLSISNVVCQ IENEGALGPLLSIL+LS+IPENVMEKTL+IL RILDPSK
Subjt:  VMRAAESIADASINEHMRISFKEAGAIKHLVKVLDNMNDAVKWATIRALERLSISNVVCQTIENEGALGPLLSILRLSNIPENVMEKTLDILCRILDPSK

Query:  EMKSKFYNGPVNGSHGGQHSES--EASTRKDVLDAVAVSRLVEILNTSSPNLKRKAASILEFVSIMDPSMEIIDSMAIESGLLAVFQLGVSIDSDAEDWQ
        EMKSKFY+GPVNGS G QHSE   EAS RKD LDA  VSRLVEILNTSSPNLKRKAASILEFVSIMDPSME+ID + IE GL AVFQLGVSIDSDAE WQ
Subjt:  EMKSKFYNGPVNGSHGGQHSES--EASTRKDVLDAVAVSRLVEILNTSSPNLKRKAASILEFVSIMDPSMEIIDSMAIESGLLAVFQLGVSIDSDAEDWQ

Query:  PERYALEVEEAGLAISAASRLLTKLLDSEKFYNKINSAHFTKLLRQILKSDIPIHHKDWIAACLVKVSSISALNMDSGNPINMEVTLYETIPRLIQQIKG
        PERYALEVEEAGLAISAASRLLTKLLDSEKF NKINS  FTKLLR+ILKSDIPI+HKDWIAACL+KVSSIS LN DSG+PINMEVTLYETIPRLI+QIK 
Subjt:  PERYALEVEEAGLAISAASRLLTKLLDSEKFYNKINSAHFTKLLRQILKSDIPIHHKDWIAACLVKVSSISALNMDSGNPINMEVTLYETIPRLIQQIKG

Query:  SFSMEDQESSVVELNRILSEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMDSENHPAILAAGAVPALRRIVLSQRAEWRRALHLLRTL
        SFS+E QES+VVELNRI+SEG+VDATRAVA KGGIFPLVKLIDEGSERA EAALAILYNLSMDSENHPAI+AAGAVPALRRIVLSQR +WR+AL+LLRTL
Subjt:  SFSMEDQESSVVELNRILSEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMDSENHPAILAAGAVPALRRIVLSQRAEWRRALHLLRTL

Query:  PT
        PT
Subjt:  PT

XP_022146414.1 uncharacterized protein LOC111015638 [Momordica charantia]0.0e+0099.88Show/hide
Query:  AAGTKPRNLSVFLYHLPYRGRYPESLSNRFVRRSVSSDGGGAVDSTHQQSAAPDIKDVQNDSSSVGHSYVALFVRMLGLDHDPLDREQAVIALWKYSLGG
        AAGTKPRNLSVFLYHLPYRGRYPESLSNRFVRRSVSSDGGGAVDSTHQQSAAPDIKDVQNDSSSVGHSYVALFVRMLGLDHDPLDREQAVIALWKYSLGG
Subjt:  AAGTKPRNLSVFLYHLPYRGRYPESLSNRFVRRSVSSDGGGAVDSTHQQSAAPDIKDVQNDSSSVGHSYVALFVRMLGLDHDPLDREQAVIALWKYSLGG

Query:  KKHIDAIMKFPGCINLTVNLLQSESIATCEAAAGLLRSISLVNLYRDSVAESGAIEEITGLLSRPSLAPEVKEQSICVLWNLSVDEKLRLKIADTDILLL
        KKHIDAIMKFPGCINLTVNLLQSESIATCEAAAGLLRSISLVNLYRDSVAESGAIEEITGLLSRPSLAPEVKEQSICVLWNLSVDEKLRLKIADTDILLL
Subjt:  KKHIDAIMKFPGCINLTVNLLQSESIATCEAAAGLLRSISLVNLYRDSVAESGAIEEITGLLSRPSLAPEVKEQSICVLWNLSVDEKLRLKIADTDILLL

Query:  LSKNLDDEDMKVKEAAGGVLANLALSPCNHGVIVESGLIPKLAYQLKAEADSSKIVRKEARNVLLELAKDEYYRILVIEEGLVPVPIVGAAAYKSFRPGL
        LSKNLDDEDMKVKEAAGGVLANLALSPCNHGVIVESGLIPKLAYQLKAEADSSKIVRKEARNVLLELAKDEYYRILVIEEGLVPVPIVGAAAYKSFRPGL
Subjt:  LSKNLDDEDMKVKEAAGGVLANLALSPCNHGVIVESGLIPKLAYQLKAEADSSKIVRKEARNVLLELAKDEYYRILVIEEGLVPVPIVGAAAYKSFRPGL

Query:  HSWPSLPDGTEIEQSSNGPSRFGASELLLGLNVDNKANIEEGKINAIVGRTQQQFLARIGAIEFEDLKDSQLESSTGNHLTLLPWIDGVARLVLILELED
        HSWPSLPDGTEIEQSSNGPSRFGASELLLGLNVDNKANIEEGKINAIVGRTQQQFLARIGAIEFEDLKDSQLESSTGNHLTLLPWIDGVARLVLILELED
Subjt:  HSWPSLPDGTEIEQSSNGPSRFGASELLLGLNVDNKANIEEGKINAIVGRTQQQFLARIGAIEFEDLKDSQLESSTGNHLTLLPWIDGVARLVLILELED

Query:  DTAVMRAAESIADASINEHMRISFKEAGAIKHLVKVLDNMNDAVKWATIRALERLSISNVVCQTIENEGALGPLLSILRLSNIPENVMEKTLDILCRILD
        DTAVMRAAESIADASINEHMRISFKEAGAIKHLVKVLDNMNDAVKWATIRALERLSISNVVCQTIENEGALGPLLSILRLSNIPENVMEKTLDILCRILD
Subjt:  DTAVMRAAESIADASINEHMRISFKEAGAIKHLVKVLDNMNDAVKWATIRALERLSISNVVCQTIENEGALGPLLSILRLSNIPENVMEKTLDILCRILD

Query:  PSKEMKSKFYNGPVNGSHGGQHSESEASTRKDVLDAVAVSRLVEILNTSSPNLKRKAASILEFVSIMDPSMEIIDSMAIESGLLAVFQLGVSIDSDAEDW
        PSKEMKSKFYNGPVNGSHGGQHSESEASTRKDVLDAVAVS LVEILNTSSPNLKRKAASILEFVSIMDPSMEIIDSMAIESGLLAVFQLGVSIDSDAEDW
Subjt:  PSKEMKSKFYNGPVNGSHGGQHSESEASTRKDVLDAVAVSRLVEILNTSSPNLKRKAASILEFVSIMDPSMEIIDSMAIESGLLAVFQLGVSIDSDAEDW

Query:  QPERYALEVEEAGLAISAASRLLTKLLDSEKFYNKINSAHFTKLLRQILKSDIPIHHKDWIAACLVKVSSISALNMDSGNPINMEVTLYETIPRLIQQIK
        QPERYALEVEEAGLAISAASRLLTKLLDSEKFYNKINSAHFTKLLRQILKSDIPIHHKDWIAACLVKVSSISALNMDSGNPINMEVTLYETIPRLIQQIK
Subjt:  QPERYALEVEEAGLAISAASRLLTKLLDSEKFYNKINSAHFTKLLRQILKSDIPIHHKDWIAACLVKVSSISALNMDSGNPINMEVTLYETIPRLIQQIK

Query:  GSFSMEDQESSVVELNRILSEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMDSENHPAILAAGAVPALRRIVLSQRAEWRRALHLLRT
        GSFSMEDQESSVVELNRILSEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMDSENHPAILAAGAVPALRRIVLSQRAEWRRALHLLRT
Subjt:  GSFSMEDQESSVVELNRILSEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMDSENHPAILAAGAVPALRRIVLSQRAEWRRALHLLRT

Query:  LPT
        LPT
Subjt:  LPT

XP_022973564.1 uncharacterized protein LOC111472102 [Cucurbita maxima]0.0e+0083.87Show/hide
Query:  AAGT-KPRNLSVFLYHLPYRGRYPESLSNRFVRRSVSSDGGGAVDSTHQQSAAPDIKDVQNDSSSVGHSYVALFVRMLGLDHDPLDREQAVIALWKYSLG
        A GT KPR+ S+ L+ L +     E    R VR  VSSDGGGA+DST QQSA  DI+DV NDSSSVGHSYVALF+RMLGLDHDPLDREQA++ALWKYSLG
Subjt:  AAGT-KPRNLSVFLYHLPYRGRYPESLSNRFVRRSVSSDGGGAVDSTHQQSAAPDIKDVQNDSSSVGHSYVALFVRMLGLDHDPLDREQAVIALWKYSLG

Query:  GKKHIDAIMKFPGCINLTVNLLQSESIATCEAAAGLLRSISLVNLYRDSVAESGAIEEITGLLSRPSLAPEVKEQSICVLWNLSVDEKLRLKIADTDILL
        GKKHIDAIM+FPGC+NLTVNLL+SES +TCEAAAGLLRSIS+VNL+RDSVAESGAIEEIT LLSRPSL PEVKEQSICVLWNLSVDEKLR KIA+TDIL 
Subjt:  GKKHIDAIMKFPGCINLTVNLLQSESIATCEAAAGLLRSISLVNLYRDSVAESGAIEEITGLLSRPSLAPEVKEQSICVLWNLSVDEKLRLKIADTDILL

Query:  LLSKNLDDEDMKVKEAAGGVLANLALSPCNHGVIVESGLIPKLAYQLKAEADSSKIVRKEARNVLLELAKDEYYRILVIEEGLVPVPIVGAAAYKSFRPG
        LL KNLDDEDMKVKEAAGGV+ANL LSPCNHGVIVESGLIPKLAYQLKAEADSSK VRKEARN LLEL KDEYYRILVIEEGLVPVP++GAAAYKSF+PG
Subjt:  LLSKNLDDEDMKVKEAAGGVLANLALSPCNHGVIVESGLIPKLAYQLKAEADSSKIVRKEARNVLLELAKDEYYRILVIEEGLVPVPIVGAAAYKSFRPG

Query:  LHSWPSLPDGTEIEQSSNGPSRFGASELLLGLNVDNKANIEEGKINAIVGRTQQQFLARIGAIEFEDLKDSQLESSTGNHLTLLPWIDGVARLVLILELE
        LHSWPSLPDGTEIE+SS  PSRFGASELLLGLNVDN ANIEEGKINAI+GR+QQ FLARIGAIE EDLKD+Q ESST NHLTLLPW DGVARLVL+LELE
Subjt:  LHSWPSLPDGTEIEQSSNGPSRFGASELLLGLNVDNKANIEEGKINAIVGRTQQQFLARIGAIEFEDLKDSQLESSTGNHLTLLPWIDGVARLVLILELE

Query:  DDTAVMRAAESIADASINEHMRISFKEAGAIKHLVKVLDNMNDAVKWATIRALERLSISNVVCQTIENEGALGPLLSILRLSNIPENVMEKTLDILCRIL
        DD A +R AE IADASINEHMR+SFKEAGAIKHLVK+LD+MN++VKWA+I+ALERLSISNVVCQTIENEGAL PLLSIL+LS+IPENVMEKTLDIL RIL
Subjt:  DDTAVMRAAESIADASINEHMRISFKEAGAIKHLVKVLDNMNDAVKWATIRALERLSISNVVCQTIENEGALGPLLSILRLSNIPENVMEKTLDILCRIL

Query:  DPSKEMKSKFYNGPVNGSHGGQHSE--SEASTRKDVLDAVAVSRLVEILNTSSPNLKRKAASILEFVSIMDPSMEIIDSMAIESGLLAVFQLGVSIDSDA
        DPSKEMKSKFY+GP+NGS GGQHSE  SEASTRKDVLDAV VS LVEIL TSSPNLKRKAASILEF+SIMDPSM+IID M IESGL AVFQLGVSID+DA
Subjt:  DPSKEMKSKFYNGPVNGSHGGQHSE--SEASTRKDVLDAVAVSRLVEILNTSSPNLKRKAASILEFVSIMDPSMEIIDSMAIESGLLAVFQLGVSIDSDA

Query:  EDWQPERYALEVEEAGLAISAASRLLTKLLDSEKFYNKINSAHFTKLLRQILKSDIPIHHKDWIAACLVKVSSISALNMDSGNPINMEVTLYETIPRLIQ
        EDW PER+ALEVEEAGLAISAASRLLTKL+DSEKF NKIN+  FT+ LRQILK DIP+ HKDWIAACL+KVSS++AL++DSG+PI+MEV LYETIPRLIQ
Subjt:  EDWQPERYALEVEEAGLAISAASRLLTKLLDSEKFYNKINSAHFTKLLRQILKSDIPIHHKDWIAACLVKVSSISALNMDSGNPINMEVTLYETIPRLIQ

Query:  QIKGSFSMEDQESSVVELNRILSEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMDSENHPAILAAGAVPALRRIVLSQRAEWRRALHL
        ++K S SME QES+VVELNRI+SEG+VDATRAVASKGGIFPLVKLIDEGSERA EAALAILYNLSMD+ENHPAILAAGAVPALRRIVLSQR +W++AL+L
Subjt:  QIKGSFSMEDQESSVVELNRILSEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMDSENHPAILAAGAVPALRRIVLSQRAEWRRALHL

Query:  LRTLPT
        LRTLPT
Subjt:  LRTLPT

XP_038874402.1 uncharacterized protein LOC120067080 [Benincasa hispida]0.0e+0089.14Show/hide
Query:  KPRNLSVFLYHLPYRGRYPESLSNRFVRRSVSSDGGGAVDSTHQQSAAPDIKDVQNDSSSVGHSYVALFVRMLGLDHDPLDREQAVIALWKYSLGGKKHI
        KP +LS FLYHL +     +SLS R V R VSSDGGGAVDST QQSAAP I DVQNDSSSVGHSYVALFVRMLGLDHDPLDREQA+IALWKYSLGGKKHI
Subjt:  KPRNLSVFLYHLPYRGRYPESLSNRFVRRSVSSDGGGAVDSTHQQSAAPDIKDVQNDSSSVGHSYVALFVRMLGLDHDPLDREQAVIALWKYSLGGKKHI

Query:  DAIMKFPGCINLTVNLLQSESIATCEAAAGLLRSISLVNLYRDSVAESGAIEEITGLLSRPSLAPEVKEQSICVLWNLSVDEKLRLKIADTDILLLLSKN
        DAIM+FPGCINLTVNLL+SES++TCEAAAGLLRSISLVNLYRDSVAESGAIEEITGLLS+PSL PEVKEQSICVLWNLSVDEKLR KIA+ DIL LLSKN
Subjt:  DAIMKFPGCINLTVNLLQSESIATCEAAAGLLRSISLVNLYRDSVAESGAIEEITGLLSRPSLAPEVKEQSICVLWNLSVDEKLRLKIADTDILLLLSKN

Query:  LDDEDMKVKEAAGGVLANLALSPCNHGVIVESGLIPKLAYQLKAEADSSKIVRKEARNVLLELAKDEYYRILVIEEGLVPVPIVGAAAYKSFRPGLHSWP
        LDDEDMKVKEAAGGVLANLALSPCN GVIVE+GLIPKLAYQLKAEADSSKI+RKEARN LLEL+KDEYYRILVIEEGLVPVPI+GAAAYKSFRPGLHSWP
Subjt:  LDDEDMKVKEAAGGVLANLALSPCNHGVIVESGLIPKLAYQLKAEADSSKIVRKEARNVLLELAKDEYYRILVIEEGLVPVPIVGAAAYKSFRPGLHSWP

Query:  SLPDGTEIEQSSNGPSRFGASELLLGLNVDNKANIEEGKINAIVGRTQQQFLARIGAIEFEDLKDSQLESSTGNHLTLLPWIDGVARLVLILELEDDTAV
        SLPDGTEIEQSS  PSRFGASELLLGLNVDN A IEEGKINAIVGRTQQQFLARIGAIEFEDLKDSQ ESST NHLTLLPWIDGVARLVLILELEDD A+
Subjt:  SLPDGTEIEQSSNGPSRFGASELLLGLNVDNKANIEEGKINAIVGRTQQQFLARIGAIEFEDLKDSQLESSTGNHLTLLPWIDGVARLVLILELEDDTAV

Query:  MRAAESIADASINEHMRISFKEAGAIKHLVKVLDNMNDAVKWATIRALERLSISNVVCQTIENEGALGPLLSILRLSNIPENVMEKTLDILCRILDPSKE
        +RAAESIADASINEHMRISFKEAGAIKHLVK LD MN++VKWA ++ALERLSISNVVCQTIEN+GALGPLLSIL+LS+I ENVMEKTLDIL RILDPSKE
Subjt:  MRAAESIADASINEHMRISFKEAGAIKHLVKVLDNMNDAVKWATIRALERLSISNVVCQTIENEGALGPLLSILRLSNIPENVMEKTLDILCRILDPSKE

Query:  MKSKFYNGPVNGSHGGQHSES--EASTRKDVLDAVAVSRLVEILNTSSPNLKRKAASILEFVSIMDPSMEIIDSMAIESGLLAVFQLGVSIDSDAEDWQP
        MKSKFY+GPVNGS GGQHSE   EASTRKDVLD   VSRLVEI NTSSPNLKRKAASILEFVSIMDPSMEIIDSM +ESGL AVFQLGVSIDSDAEDWQP
Subjt:  MKSKFYNGPVNGSHGGQHSES--EASTRKDVLDAVAVSRLVEILNTSSPNLKRKAASILEFVSIMDPSMEIIDSMAIESGLLAVFQLGVSIDSDAEDWQP

Query:  ERYALEVEEAGLAISAASRLLTKLLDSEKFYNKINSAHFTKLLRQILKSDIPIHHKDWIAACLVKVSSISALNMDSGNPINMEVTLYETIPRLIQQIKGS
        ERYALEVEEAGLAISAASRLLTKLLDSEKF NKINS HFTKLLR+ILKSDIPI+HKDWIAACL KVSS+SALNMDSG+PINMEVTLYETIPRLIQQIK S
Subjt:  ERYALEVEEAGLAISAASRLLTKLLDSEKFYNKINSAHFTKLLRQILKSDIPIHHKDWIAACLVKVSSISALNMDSGNPINMEVTLYETIPRLIQQIKGS

Query:  FSMEDQESSVVELNRILSEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMDSENHPAILAAGAVPALRRIVLSQRAEWRRALHLLRTLP
        FSME QES+VVELNRI+S GIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMD+ENHPAI+AAGAVPALR+IVLSQR +WR+AL+LLRTLP
Subjt:  FSMEDQESSVVELNRILSEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMDSENHPAILAAGAVPALRRIVLSQRAEWRRALHLLRTLP

Query:  T
        T
Subjt:  T

TrEMBL top hitse value%identityAlignment
A0A0A0KPE7 Uncharacterized protein0.0e+0084.27Show/hide
Query:  KPRNLSVFLYHLPYRGRYPESLSNRFVRRSVSSDGGGAVDSTHQQSAAPDIKDVQNDSSSVGHSYVALFVRMLGLDHDPLDREQAVIALWKYSLGGKKHI
        KP + S+FL  L +     +SLS R V R VSSDGGG  DS+  QSA PDIKDVQNDSSSVGHSYVALFVRMLGL +DPLDREQA+IALWKYSLGGKKHI
Subjt:  KPRNLSVFLYHLPYRGRYPESLSNRFVRRSVSSDGGGAVDSTHQQSAAPDIKDVQNDSSSVGHSYVALFVRMLGLDHDPLDREQAVIALWKYSLGGKKHI

Query:  DAIMKFPGCINLTVNLLQSESIATCEAAAGLLRSISLVNLYRDSVAESGAIEEITGLLSRPSLAPEVKEQSICVLWNLSVDEKLRLKIADTDILLLLSKN
        DAIM+FPGCINL VNLL+SESI TCEAAAGLLRSIS VNLYR+SVAESGAIEEITGLL +PSL PEVKEQSICVLWNLSVDEKLR+KIA+TDIL LLSKN
Subjt:  DAIMKFPGCINLTVNLLQSESIATCEAAAGLLRSISLVNLYRDSVAESGAIEEITGLLSRPSLAPEVKEQSICVLWNLSVDEKLRLKIADTDILLLLSKN

Query:  LDDEDMKVKEAAGGVLANLALSPCNHGVIVESGLIPKLAYQLKAEADSSKIVRKEARNVLLELAKDEYYRILVIEEGLVPVPIVGAAAYKSFRPGLHSWP
        LDDE+MKVKEAAGGVLANLALSPCNHGVIVESGLI KLAYQLKAEADSSKI+RKEARN LLEL+KD YYRILVIEEGLVPVPI+GAAAYKSFRPGLHSWP
Subjt:  LDDEDMKVKEAAGGVLANLALSPCNHGVIVESGLIPKLAYQLKAEADSSKIVRKEARNVLLELAKDEYYRILVIEEGLVPVPIVGAAAYKSFRPGLHSWP

Query:  SLPDGTEIEQSSNGPSRFGASELLLGLNVDNKANIEEGKINAIVGRTQQQFLARIGAIEFEDLKDSQLESSTGNHLTLLPWIDGVARLVLILELEDDTAV
         LPDG EIEQS+  PSR+GAS+LLLGLNVD  ANIEE KINAIVGRTQQQFLARIGAIE EDLKDSQ ESS+ NHLTLLPWIDGVARLVLILELEDD A+
Subjt:  SLPDGTEIEQSSNGPSRFGASELLLGLNVDNKANIEEGKINAIVGRTQQQFLARIGAIEFEDLKDSQLESSTGNHLTLLPWIDGVARLVLILELEDDTAV

Query:  MRAAESIADASINEHMRISFKEAGAIKHLVKVLDNMNDAVKWATIRALERLSISNVVCQTIENEGALGPLLSILRLSNIPENVMEKTLDILCRILDPSKE
         RAA SIADASINEHMRISFKEAGAIK+LVK LD  ND+VKWA ++ALERLSISNVVCQ IENEGALGPLLSIL+ S IPENVMEKTL+IL RILDPSKE
Subjt:  MRAAESIADASINEHMRISFKEAGAIKHLVKVLDNMNDAVKWATIRALERLSISNVVCQTIENEGALGPLLSILRLSNIPENVMEKTLDILCRILDPSKE

Query:  MKSKFYNGPVNGSHGGQHSES--EASTRKDVLDAVAVSRLVEILNTSSPNLKRKAASILEFVSIMDPSMEIIDSMAIESGLLAVFQLGVSIDSDAEDWQP
        MKSKFY+GPVNGS GGQHSE   EAS RKDVLDA  VSR VEILNTSSPNLK+KAASILEFVSIMDPSME+ID + IE GL AVFQLGVSIDSD E WQP
Subjt:  MKSKFYNGPVNGSHGGQHSES--EASTRKDVLDAVAVSRLVEILNTSSPNLKRKAASILEFVSIMDPSMEIIDSMAIESGLLAVFQLGVSIDSDAEDWQP

Query:  ERYALEVEEAGLAISAASRLLTKLLDSEKFYNKINSAHFTKLLRQILKSDIPIHHKDWIAACLVKVSSISALNMDSGNPINMEVTLYETIPRLIQQIKGS
        ERYALEVEEAGLAISAASRLLTKLLDSEKF NKINS HFTKLLR++LKSDIPI+HKDWIAACL+K+SS+  LN D G+PINMEVTLYETIPRLI+Q++ S
Subjt:  ERYALEVEEAGLAISAASRLLTKLLDSEKFYNKINSAHFTKLLRQILKSDIPIHHKDWIAACLVKVSSISALNMDSGNPINMEVTLYETIPRLIQQIKGS

Query:  FSMEDQESSVVELNRILSEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMDSENHPAILAAGAVPALRRIVLSQRAEWRRALHLLRTLP
        FS+E QES+VVELNRI+SEGIV+ATRAVASKGGIFPLVKLIDEGSERA EAALAILYNLSMDSENHPAI+AAGAVPALRRI LSQR +W++AL+LLRTLP
Subjt:  FSMEDQESSVVELNRILSEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMDSENHPAILAAGAVPALRRIVLSQRAEWRRALHLLRTLP

Query:  T
        T
Subjt:  T

A0A1S3CU26 uncharacterized protein LOC1035045530.0e+0086.16Show/hide
Query:  KPRNLSVFLYHLPYRGRYPESLSNRFVRRSVSSD-GGGAVDSTHQQSAAPDIKDVQNDSSSVGHSYVALFVRMLGLDHDPLDREQAVIALWKYSLGGKKH
        KP + S+FLY L +     +SLS R +   VSSD GGGAVDS+  QSA P IKDVQNDSSS+G SYVALFVRML LD+DPLDREQA+IALWKYSLGGKKH
Subjt:  KPRNLSVFLYHLPYRGRYPESLSNRFVRRSVSSD-GGGAVDSTHQQSAAPDIKDVQNDSSSVGHSYVALFVRMLGLDHDPLDREQAVIALWKYSLGGKKH

Query:  IDAIMKFPGCINLTVNLLQSESIATCEAAAGLLRSISLVNLYRDSVAESGAIEEITGLLSRPSLAPEVKEQSICVLWNLSVDEKLRLKIADTDILLLLSK
        IDAIM+FPGCINL VNLL+SES++T EAAAGLLRSISLVNLYR+SVAESGAIEEITGLL +PSL PEVKEQSICVLWNLSVDEKLR+KIA+TDIL LLSK
Subjt:  IDAIMKFPGCINLTVNLLQSESIATCEAAAGLLRSISLVNLYRDSVAESGAIEEITGLLSRPSLAPEVKEQSICVLWNLSVDEKLRLKIADTDILLLLSK

Query:  NLDDEDMKVKEAAGGVLANLALSPCNHGVIVESGLIPKLAYQLKAEADSSKIVRKEARNVLLELAKDEYYRILVIEEGLVPVPIVGAAAYKSFRPGLHSW
        NLDDEDMKVKEAAGGVLANLALSPCNHGVIVESGLIPKLAYQLKAEADSSKI+RKEARN LLEL KDEYYRILVIEEGLVPVPI+GAAAYKSFRPGLHSW
Subjt:  NLDDEDMKVKEAAGGVLANLALSPCNHGVIVESGLIPKLAYQLKAEADSSKIVRKEARNVLLELAKDEYYRILVIEEGLVPVPIVGAAAYKSFRPGLHSW

Query:  PSLPDGTEIEQSSNGPSRFGASELLLGLNVDNKANIEEGKINAIVGRTQQQFLARIGAIEFEDLKDSQLESSTGNHLTLLPWIDGVARLVLILELEDDTA
        PSLPDGTEIEQSS  PSRFGASELLLGLNVDN ANIEEGKINAIVGRTQQQFLARIGAIE E++KDSQ ESS+ NHLTLLPWIDGVARLVLILELEDD A
Subjt:  PSLPDGTEIEQSSNGPSRFGASELLLGLNVDNKANIEEGKINAIVGRTQQQFLARIGAIEFEDLKDSQLESSTGNHLTLLPWIDGVARLVLILELEDDTA

Query:  VMRAAESIADASINEHMRISFKEAGAIKHLVKVLDNMNDAVKWATIRALERLSISNVVCQTIENEGALGPLLSILRLSNIPENVMEKTLDILCRILDPSK
        V+RAAESI DASINEHMRISFKEAGAIKHLV  LD MND+VKWA ++ALERLSISNVVCQ IENEGALGPLLSIL+LS+IPENVMEKTL+IL RILDPSK
Subjt:  VMRAAESIADASINEHMRISFKEAGAIKHLVKVLDNMNDAVKWATIRALERLSISNVVCQTIENEGALGPLLSILRLSNIPENVMEKTLDILCRILDPSK

Query:  EMKSKFYNGPVNGSHGGQHSES--EASTRKDVLDAVAVSRLVEILNTSSPNLKRKAASILEFVSIMDPSMEIIDSMAIESGLLAVFQLGVSIDSDAEDWQ
        EMKSKFY+GPVNGS G QHSE   EAS RKD LDA  VSRLVEILNTSSPNLKRKAASILEFVSIMDPSME+ID + IE GL AVFQLGVSIDSDAE WQ
Subjt:  EMKSKFYNGPVNGSHGGQHSES--EASTRKDVLDAVAVSRLVEILNTSSPNLKRKAASILEFVSIMDPSMEIIDSMAIESGLLAVFQLGVSIDSDAEDWQ

Query:  PERYALEVEEAGLAISAASRLLTKLLDSEKFYNKINSAHFTKLLRQILKSDIPIHHKDWIAACLVKVSSISALNMDSGNPINMEVTLYETIPRLIQQIKG
        PERYALEVEEAGLAISAASRLLTKLLDSEKF NKINS  FTKLLR+ILKSDIPI+HKDWIAACL+KVSSIS LN DSG+PINMEVTLYETIPRLI+QIK 
Subjt:  PERYALEVEEAGLAISAASRLLTKLLDSEKFYNKINSAHFTKLLRQILKSDIPIHHKDWIAACLVKVSSISALNMDSGNPINMEVTLYETIPRLIQQIKG

Query:  SFSMEDQESSVVELNRILSEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMDSENHPAILAAGAVPALRRIVLSQRAEWRRALHLLRTL
        SFS+E QES+VVELNRI+SEG+VDATRAVA KGGIFPLVKLIDEGSERA EAALAILYNLSMDSENHPAI+AAGAVPALRRIVLSQR +WR+AL+LLRTL
Subjt:  SFSMEDQESSVVELNRILSEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMDSENHPAILAAGAVPALRRIVLSQRAEWRRALHLLRTL

Query:  PT
        PT
Subjt:  PT

A0A6J1CY29 uncharacterized protein LOC1110156380.0e+0099.88Show/hide
Query:  AAGTKPRNLSVFLYHLPYRGRYPESLSNRFVRRSVSSDGGGAVDSTHQQSAAPDIKDVQNDSSSVGHSYVALFVRMLGLDHDPLDREQAVIALWKYSLGG
        AAGTKPRNLSVFLYHLPYRGRYPESLSNRFVRRSVSSDGGGAVDSTHQQSAAPDIKDVQNDSSSVGHSYVALFVRMLGLDHDPLDREQAVIALWKYSLGG
Subjt:  AAGTKPRNLSVFLYHLPYRGRYPESLSNRFVRRSVSSDGGGAVDSTHQQSAAPDIKDVQNDSSSVGHSYVALFVRMLGLDHDPLDREQAVIALWKYSLGG

Query:  KKHIDAIMKFPGCINLTVNLLQSESIATCEAAAGLLRSISLVNLYRDSVAESGAIEEITGLLSRPSLAPEVKEQSICVLWNLSVDEKLRLKIADTDILLL
        KKHIDAIMKFPGCINLTVNLLQSESIATCEAAAGLLRSISLVNLYRDSVAESGAIEEITGLLSRPSLAPEVKEQSICVLWNLSVDEKLRLKIADTDILLL
Subjt:  KKHIDAIMKFPGCINLTVNLLQSESIATCEAAAGLLRSISLVNLYRDSVAESGAIEEITGLLSRPSLAPEVKEQSICVLWNLSVDEKLRLKIADTDILLL

Query:  LSKNLDDEDMKVKEAAGGVLANLALSPCNHGVIVESGLIPKLAYQLKAEADSSKIVRKEARNVLLELAKDEYYRILVIEEGLVPVPIVGAAAYKSFRPGL
        LSKNLDDEDMKVKEAAGGVLANLALSPCNHGVIVESGLIPKLAYQLKAEADSSKIVRKEARNVLLELAKDEYYRILVIEEGLVPVPIVGAAAYKSFRPGL
Subjt:  LSKNLDDEDMKVKEAAGGVLANLALSPCNHGVIVESGLIPKLAYQLKAEADSSKIVRKEARNVLLELAKDEYYRILVIEEGLVPVPIVGAAAYKSFRPGL

Query:  HSWPSLPDGTEIEQSSNGPSRFGASELLLGLNVDNKANIEEGKINAIVGRTQQQFLARIGAIEFEDLKDSQLESSTGNHLTLLPWIDGVARLVLILELED
        HSWPSLPDGTEIEQSSNGPSRFGASELLLGLNVDNKANIEEGKINAIVGRTQQQFLARIGAIEFEDLKDSQLESSTGNHLTLLPWIDGVARLVLILELED
Subjt:  HSWPSLPDGTEIEQSSNGPSRFGASELLLGLNVDNKANIEEGKINAIVGRTQQQFLARIGAIEFEDLKDSQLESSTGNHLTLLPWIDGVARLVLILELED

Query:  DTAVMRAAESIADASINEHMRISFKEAGAIKHLVKVLDNMNDAVKWATIRALERLSISNVVCQTIENEGALGPLLSILRLSNIPENVMEKTLDILCRILD
        DTAVMRAAESIADASINEHMRISFKEAGAIKHLVKVLDNMNDAVKWATIRALERLSISNVVCQTIENEGALGPLLSILRLSNIPENVMEKTLDILCRILD
Subjt:  DTAVMRAAESIADASINEHMRISFKEAGAIKHLVKVLDNMNDAVKWATIRALERLSISNVVCQTIENEGALGPLLSILRLSNIPENVMEKTLDILCRILD

Query:  PSKEMKSKFYNGPVNGSHGGQHSESEASTRKDVLDAVAVSRLVEILNTSSPNLKRKAASILEFVSIMDPSMEIIDSMAIESGLLAVFQLGVSIDSDAEDW
        PSKEMKSKFYNGPVNGSHGGQHSESEASTRKDVLDAVAVS LVEILNTSSPNLKRKAASILEFVSIMDPSMEIIDSMAIESGLLAVFQLGVSIDSDAEDW
Subjt:  PSKEMKSKFYNGPVNGSHGGQHSESEASTRKDVLDAVAVSRLVEILNTSSPNLKRKAASILEFVSIMDPSMEIIDSMAIESGLLAVFQLGVSIDSDAEDW

Query:  QPERYALEVEEAGLAISAASRLLTKLLDSEKFYNKINSAHFTKLLRQILKSDIPIHHKDWIAACLVKVSSISALNMDSGNPINMEVTLYETIPRLIQQIK
        QPERYALEVEEAGLAISAASRLLTKLLDSEKFYNKINSAHFTKLLRQILKSDIPIHHKDWIAACLVKVSSISALNMDSGNPINMEVTLYETIPRLIQQIK
Subjt:  QPERYALEVEEAGLAISAASRLLTKLLDSEKFYNKINSAHFTKLLRQILKSDIPIHHKDWIAACLVKVSSISALNMDSGNPINMEVTLYETIPRLIQQIK

Query:  GSFSMEDQESSVVELNRILSEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMDSENHPAILAAGAVPALRRIVLSQRAEWRRALHLLRT
        GSFSMEDQESSVVELNRILSEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMDSENHPAILAAGAVPALRRIVLSQRAEWRRALHLLRT
Subjt:  GSFSMEDQESSVVELNRILSEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMDSENHPAILAAGAVPALRRIVLSQRAEWRRALHLLRT

Query:  LPT
        LPT
Subjt:  LPT

A0A6J1FN67 uncharacterized protein LOC1114454460.0e+0084Show/hide
Query:  AAGT-KPRNLSVFLYHLPYRGRYPESLSNRFVRRSVSSDGGGAVDSTHQQSAAPDIKDVQNDSSSVGHSYVALFVRMLGLDHDPLDREQAVIALWKYSLG
        A GT KPR+ S+ L+ L +     E    R V R VSSDGGGA+DST QQSAA DI+DV NDSSSVGHSYVALFVRMLGLDHDPLDREQA++ALWKYSLG
Subjt:  AAGT-KPRNLSVFLYHLPYRGRYPESLSNRFVRRSVSSDGGGAVDSTHQQSAAPDIKDVQNDSSSVGHSYVALFVRMLGLDHDPLDREQAVIALWKYSLG

Query:  GKKHIDAIMKFPGCINLTVNLLQSESIATCEAAAGLLRSISLVNLYRDSVAESGAIEEITGLLSRPSLAPEVKEQSICVLWNLSVDEKLRLKIADTDILL
        GKKHIDAIM+FPGCINLTVNLL+SES +TCEAAAGLLRSIS+VNL+RDSVAESGAIEEIT LLSRPSL PEVKEQSICVLWNLSVDEKLR KIA+TDIL 
Subjt:  GKKHIDAIMKFPGCINLTVNLLQSESIATCEAAAGLLRSISLVNLYRDSVAESGAIEEITGLLSRPSLAPEVKEQSICVLWNLSVDEKLRLKIADTDILL

Query:  LLSKNLDDEDMKVKEAAGGVLANLALSPCNHGVIVESGLIPKLAYQLKAEADSSKIVRKEARNVLLELAKDEYYRILVIEEGLVPVPIVGAAAYKSFRPG
        LL KNLDDEDMKVKEAAGGV+ANL LSPCNHGVIVESGLIPKLAYQLKAEADSSK +RKEARN LLEL KDEYYRILVIEEGLVPVP++GAAAYKSF+PG
Subjt:  LLSKNLDDEDMKVKEAAGGVLANLALSPCNHGVIVESGLIPKLAYQLKAEADSSKIVRKEARNVLLELAKDEYYRILVIEEGLVPVPIVGAAAYKSFRPG

Query:  LHSWPSLPDGTEIEQSSNGPSRFGASELLLGLNVDNKANIEEGKINAIVGRTQQQFLARIGAIEFEDLKDSQLESSTGNHLTLLPWIDGVARLVLILELE
        LHSWPSLPDGTEIE+SS  PSRFGASELLLGLNVDN  NI+EGKINAIVGR+QQQFLARIGAIE EDLKD+Q ESST NHLTLLPW DGVARLVL+LELE
Subjt:  LHSWPSLPDGTEIEQSSNGPSRFGASELLLGLNVDNKANIEEGKINAIVGRTQQQFLARIGAIEFEDLKDSQLESSTGNHLTLLPWIDGVARLVLILELE

Query:  DDTAVMRAAESIADASINEHMRISFKEAGAIKHLVKVLDNMNDAVKWATIRALERLSISNVVCQTIENEGALGPLLSILRLSNIPENVMEKTLDILCRIL
        DD A +R AE IADASINEHMR+SFKEAGAIKHLVK+LD+MN++VKWA+I+ALERLSISNVVCQTIENEGAL PLLSIL+LS+IPENVMEKTLDIL RIL
Subjt:  DDTAVMRAAESIADASINEHMRISFKEAGAIKHLVKVLDNMNDAVKWATIRALERLSISNVVCQTIENEGALGPLLSILRLSNIPENVMEKTLDILCRIL

Query:  DPSKEMKSKFYNGPVNGSHGGQHSE--SEASTRKDVLDAVAVSRLVEILNTSSPNLKRKAASILEFVSIMDPSMEIIDSMAIESGLLAVFQLGVSIDSDA
        DPSKEMKSKFY+GP+NGS GGQHSE  SEAST KDVLDAV VS LVEIL TSSPNLKRKAASILEF+SIMDPSM+IID M IESGL AVFQLGVSID+DA
Subjt:  DPSKEMKSKFYNGPVNGSHGGQHSE--SEASTRKDVLDAVAVSRLVEILNTSSPNLKRKAASILEFVSIMDPSMEIIDSMAIESGLLAVFQLGVSIDSDA

Query:  EDWQPERYALEVEEAGLAISAASRLLTKLLDSEKFYNKINSAHFTKLLRQILKSDIPIHHKDWIAACLVKVSSISALNMDSGNPINMEVTLYETIPRLIQ
        EDW PER+ALEVEEAGLAISAASRLLTKL+DSEKF NKIN+A FT+ LR+ILK DIPI HKDWIAACL+KVSS++A ++DSG+PI+MEV LYETIPRLIQ
Subjt:  EDWQPERYALEVEEAGLAISAASRLLTKLLDSEKFYNKINSAHFTKLLRQILKSDIPIHHKDWIAACLVKVSSISALNMDSGNPINMEVTLYETIPRLIQ

Query:  QIKGSFSMEDQESSVVELNRILSEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMDSENHPAILAAGAVPALRRIVLSQRAEWRRALHL
        Q+K S SME QES+VVELNRI+SEG+VDATRAVASKGGIFPLVKLIDEGSERA EAALA+LYNLSMD+ENHPAILAAGAVPALRRIVLSQR +W++AL+L
Subjt:  QIKGSFSMEDQESSVVELNRILSEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMDSENHPAILAAGAVPALRRIVLSQRAEWRRALHL

Query:  LRTLPT
        LRTLPT
Subjt:  LRTLPT

A0A6J1IEY0 uncharacterized protein LOC1114721020.0e+0083.87Show/hide
Query:  AAGT-KPRNLSVFLYHLPYRGRYPESLSNRFVRRSVSSDGGGAVDSTHQQSAAPDIKDVQNDSSSVGHSYVALFVRMLGLDHDPLDREQAVIALWKYSLG
        A GT KPR+ S+ L+ L +     E    R VR  VSSDGGGA+DST QQSA  DI+DV NDSSSVGHSYVALF+RMLGLDHDPLDREQA++ALWKYSLG
Subjt:  AAGT-KPRNLSVFLYHLPYRGRYPESLSNRFVRRSVSSDGGGAVDSTHQQSAAPDIKDVQNDSSSVGHSYVALFVRMLGLDHDPLDREQAVIALWKYSLG

Query:  GKKHIDAIMKFPGCINLTVNLLQSESIATCEAAAGLLRSISLVNLYRDSVAESGAIEEITGLLSRPSLAPEVKEQSICVLWNLSVDEKLRLKIADTDILL
        GKKHIDAIM+FPGC+NLTVNLL+SES +TCEAAAGLLRSIS+VNL+RDSVAESGAIEEIT LLSRPSL PEVKEQSICVLWNLSVDEKLR KIA+TDIL 
Subjt:  GKKHIDAIMKFPGCINLTVNLLQSESIATCEAAAGLLRSISLVNLYRDSVAESGAIEEITGLLSRPSLAPEVKEQSICVLWNLSVDEKLRLKIADTDILL

Query:  LLSKNLDDEDMKVKEAAGGVLANLALSPCNHGVIVESGLIPKLAYQLKAEADSSKIVRKEARNVLLELAKDEYYRILVIEEGLVPVPIVGAAAYKSFRPG
        LL KNLDDEDMKVKEAAGGV+ANL LSPCNHGVIVESGLIPKLAYQLKAEADSSK VRKEARN LLEL KDEYYRILVIEEGLVPVP++GAAAYKSF+PG
Subjt:  LLSKNLDDEDMKVKEAAGGVLANLALSPCNHGVIVESGLIPKLAYQLKAEADSSKIVRKEARNVLLELAKDEYYRILVIEEGLVPVPIVGAAAYKSFRPG

Query:  LHSWPSLPDGTEIEQSSNGPSRFGASELLLGLNVDNKANIEEGKINAIVGRTQQQFLARIGAIEFEDLKDSQLESSTGNHLTLLPWIDGVARLVLILELE
        LHSWPSLPDGTEIE+SS  PSRFGASELLLGLNVDN ANIEEGKINAI+GR+QQ FLARIGAIE EDLKD+Q ESST NHLTLLPW DGVARLVL+LELE
Subjt:  LHSWPSLPDGTEIEQSSNGPSRFGASELLLGLNVDNKANIEEGKINAIVGRTQQQFLARIGAIEFEDLKDSQLESSTGNHLTLLPWIDGVARLVLILELE

Query:  DDTAVMRAAESIADASINEHMRISFKEAGAIKHLVKVLDNMNDAVKWATIRALERLSISNVVCQTIENEGALGPLLSILRLSNIPENVMEKTLDILCRIL
        DD A +R AE IADASINEHMR+SFKEAGAIKHLVK+LD+MN++VKWA+I+ALERLSISNVVCQTIENEGAL PLLSIL+LS+IPENVMEKTLDIL RIL
Subjt:  DDTAVMRAAESIADASINEHMRISFKEAGAIKHLVKVLDNMNDAVKWATIRALERLSISNVVCQTIENEGALGPLLSILRLSNIPENVMEKTLDILCRIL

Query:  DPSKEMKSKFYNGPVNGSHGGQHSE--SEASTRKDVLDAVAVSRLVEILNTSSPNLKRKAASILEFVSIMDPSMEIIDSMAIESGLLAVFQLGVSIDSDA
        DPSKEMKSKFY+GP+NGS GGQHSE  SEASTRKDVLDAV VS LVEIL TSSPNLKRKAASILEF+SIMDPSM+IID M IESGL AVFQLGVSID+DA
Subjt:  DPSKEMKSKFYNGPVNGSHGGQHSE--SEASTRKDVLDAVAVSRLVEILNTSSPNLKRKAASILEFVSIMDPSMEIIDSMAIESGLLAVFQLGVSIDSDA

Query:  EDWQPERYALEVEEAGLAISAASRLLTKLLDSEKFYNKINSAHFTKLLRQILKSDIPIHHKDWIAACLVKVSSISALNMDSGNPINMEVTLYETIPRLIQ
        EDW PER+ALEVEEAGLAISAASRLLTKL+DSEKF NKIN+  FT+ LRQILK DIP+ HKDWIAACL+KVSS++AL++DSG+PI+MEV LYETIPRLIQ
Subjt:  EDWQPERYALEVEEAGLAISAASRLLTKLLDSEKFYNKINSAHFTKLLRQILKSDIPIHHKDWIAACLVKVSSISALNMDSGNPINMEVTLYETIPRLIQ

Query:  QIKGSFSMEDQESSVVELNRILSEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMDSENHPAILAAGAVPALRRIVLSQRAEWRRALHL
        ++K S SME QES+VVELNRI+SEG+VDATRAVASKGGIFPLVKLIDEGSERA EAALAILYNLSMD+ENHPAILAAGAVPALRRIVLSQR +W++AL+L
Subjt:  QIKGSFSMEDQESSVVELNRILSEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMDSENHPAILAAGAVPALRRIVLSQRAEWRRALHL

Query:  LRTLPT
        LRTLPT
Subjt:  LRTLPT

SwissProt top hitse value%identityAlignment
F4I718 Protein CELLULOSE SYNTHASE INTERACTIVE 31.3e-0540.26Show/hide
Query:  SMEDQESSVVELNRILSEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMDSEN-HPAILAAGAVPA
        S E  +   VE+ +IL+  + D+  AV + GGI PLV+L++ GS++A E A  IL+NL   SE     +  AG +PA
Subjt:  SMEDQESSVVELNRILSEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMDSEN-HPAILAAGAVPA

Q5VRH9 U-box domain-containing protein 125.7e-0628.9Show/hide
Query:  GCINLTVNLLQSESIATCEAAAGLLRSISLVNL-YRDSVAESGAIEEITGLLSRPSLAPEVKEQSICVLWNLSVDEKLRLKIADTDILLLLSKNLDDEDM
        G ++L +N L+S +     AAAG +R ++  N+  R  +AE+GAI  +  LLS  S  P  +E ++  L NLS+ E  +  I D+  +  + + L    M
Subjt:  GCINLTVNLLQSESIATCEAAAGLLRSISLVNL-YRDSVAESGAIEEITGLLSRPSLAPEVKEQSICVLWNLSVDEKLRLKIADTDILLLLSKNLDDEDM

Query:  KVKEAAGGVLANLALSPCNHGVIVESGLIPKLAYQLKAEADSSKIVRKEARNVLLELAKDEYYRILVIEEGLV
        + +E A   L +L++   N   I  +G IP L   L    D S   +K+A   +  L   +  ++  ++ G+V
Subjt:  KVKEAAGGVLANLALSPCNHGVIVESGLIPKLAYQLKAEADSSKIVRKEARNVLLELAKDEYYRILVIEEGLV

Q5XEZ8 U-box domain-containing protein 24.1e-0429.61Show/hide
Query:  GCINLTVNLLQSESIATCEA-AAGLLRSISLVNLYRDSVAESGAIEEITGLLSRPSLAPEVKEQSICVLWNLSVDEKLRLKIADTDILLLLSKNLDDEDM
        G I   +++L++  +   +A +A  L S+S++  Y+  + E+GAIE +  LL   SL+   K+ +   L+NLS+  + + K+ +   +  L + L D   
Subjt:  GCINLTVNLLQSESIATCEA-AAGLLRSISLVNLYRDSVAESGAIEEITGLLSRPSLAPEVKEQSICVLWNLSVDEKLRLKIADTDILLLLSKNLDDEDM

Query:  KVKEAAGGVLANLALSPCNHGVIVESGLIPKLAYQLKAEADSSKIVRKEARNVLLEL-AKDEYYRILVIEEGLVPVPIV
         + E A  VLANLA        I E G IP L   ++  +   K   + A   LL+L      +   VI EG++P P+V
Subjt:  KVKEAAGGVLANLALSPCNHGVIVESGLIPKLAYQLKAEADSSKIVRKEARNVLLEL-AKDEYYRILVIEEGLVPVPIV

Q8VZ40 U-box domain-containing protein 141.3e-0524.88Show/hide
Query:  EQAVIALWKYSLGGKKHID--AIMKFPGCINLTVNLLQSESIATCEAAAGLLRSISLVNLYRDSVAESGAIEEITGLLSRPSLAPEVKEQSICVLWNLSV
        EQ   A  +  L  K+++D    +   G I L V LL S    T E +   L ++S+    + ++ ++GAI +I  +L   S+  E +E +   L++LSV
Subjt:  EQAVIALWKYSLGGKKHID--AIMKFPGCINLTVNLLQSESIATCEAAAGLLRSISLVNLYRDSVAESGAIEEITGLLSRPSLAPEVKEQSICVLWNLSV

Query:  DEKLRLKIADTDILLLLSKNLDDEDMKVKEAAGGVLANLALSPCNHGVIVESGLIPKLAYQLKAEADSSKIVRKEARNVLLELAKDEYYRILVIEEGLVP
         ++ ++ I     +  L   L++   + K+ A   + NL +   N    V+ G++  L   LK   D+   +  EA  +L  L+ ++  +  + E   +P
Subjt:  DEKLRLKIADTDILLLLSKNLDDEDMKVKEAAGGVLANLALSPCNHGVIVESGLIPKLAYQLKAEADSSKIVRKEARNVLLELAKDEYYRILVIEEGLVP

Query:  V
        V
Subjt:  V

Q9ZV31 U-box domain-containing protein 124.1e-0422.64Show/hide
Query:  QQSAAPDIKDV--QNDSSSV---GHSYVALFVRMLGLDHDPLDREQAVIALWKYSLGGKKHIDAIMKFPGCINLTVNLLQSESIATCEAAAGLLRSISLV
        ++SAA +I+ +  QN+ + V       + L V +L + +D   +E AV ++   S+  +++   I+   G +   V++LQ  S+   E AA  L S+S++
Subjt:  QQSAAPDIKDV--QNDSSSV---GHSYVALFVRMLGLDHDPLDREQAVIALWKYSLGGKKHIDAIMKFPGCINLTVNLLQSESIATCEAAAGLLRSISLV

Query:  NLYRDSVAESGAIEEITGLLSRPSLAPEVKEQSICVLWNLSVDEKLRLKIADTDILLLLSKNLDDEDMKVKEAAGGVLANLALSPCNHGVIVESGLIPKL
        +  + ++  +GAI  +  LLS  S     K+ +   L+NL + +  + K     ++ +L + L + +  + + +  +LA L+  P     +  +  +P L
Subjt:  NLYRDSVAESGAIEEITGLLSRPSLAPEVKEQSICVLWNLSVDEKLRLKIADTDILLLLSKNLDDEDMKVKEAAGGVLANLALSPCNHGVIVESGLIPKL

Query:  AYQLKAEADSSK
           +++ +  +K
Subjt:  AYQLKAEADSSK

Arabidopsis top hitse value%identityAlignment
AT1G23180.1 ARM repeat superfamily protein1.1e-24961.65Show/hide
Query:  THQQSAAPDIKDVQNDSSS-VGHSYVALFVRMLGLDHDPLDREQAVIALWKYSLGGKKHIDAIMKFPGCINLTVNLLQSESIATCEAAAGLLRSISLVNL
        T   +   D ++V+++SSS VG SYV LFV MLGLD+DPLDREQA+  LWKYSLGGKK IDAIM+F GC+NL VNLL+SES + CEAAAGL+RSI+ VNL
Subjt:  THQQSAAPDIKDVQNDSSS-VGHSYVALFVRMLGLDHDPLDREQAVIALWKYSLGGKKHIDAIMKFPGCINLTVNLLQSESIATCEAAAGLLRSISLVNL

Query:  YRDSVAESGAIEEITGLLSRPSLAPEVKEQSICVLWNLSVDEKLRLKIADTDILLLLSKNLDDEDMKVKEAAGGVLANLALSPCNHGVIVESGLIPKLAY
        YR+SVAESGA+EEIT LLSRPSLA  VKEQ IC LWNL+VDE++R K+AD DIL LL   L+D+D+ VKEAAGGVLANLALS   H ++VE G+IPKLA 
Subjt:  YRDSVAESGAIEEITGLLSRPSLAPEVKEQSICVLWNLSVDEKLRLKIADTDILLLLSKNLDDEDMKVKEAAGGVLANLALSPCNHGVIVESGLIPKLAY

Query:  QLKA---EADSSKIVRKEARNVLLELAKDEYYRILVIEEGLVPVPIVGAAAYKSFRPGLHSWPSLPDGTEIEQSSNGPSRFGASELLLGLNVD-NKANIE
         LKA   E   SK++RKEARNVLLELAKDEYYRILVIEEG+VP+PI+GA AYKSFRP L+SWPSLPDG  IEQ++  PSRFGASELLLGLNVD N  +++
Subjt:  QLKA---EADSSKIVRKEARNVLLELAKDEYYRILVIEEGLVPVPIVGAAAYKSFRPGLHSWPSLPDGTEIEQSSNGPSRFGASELLLGLNVD-NKANIE

Query:  EGKINAIVGRTQQQFLARIGAIEFEDLKDSQLES-STGNHLTLLPWIDGVARLVLILELEDDTAVMRAAESIADASINEHMRISFKEAGAIKHLVKVLDN
        E K+ AIVGRT QQFLARIGAIEFE  K+ + E       LTLLP +DGVARLVLIL L D+ A  RAAESIADASINE MR+SF EAGA+K LV++L N
Subjt:  EGKINAIVGRTQQQFLARIGAIEFEDLKDSQLES-STGNHLTLLPWIDGVARLVLILELEDDTAVMRAAESIADASINEHMRISFKEAGAIKHLVKVLDN

Query:  MN-DAVKWATIRALERLSISNVVCQTIENEGALGPLLSILRLSNIPENVMEKTLDILCRILDPSKEMKSKFYNGPVNGSHGGQHSESEASTRKDVLDAVA
         N + VK   IRAL+ LS+S  VCQ IE EGA+  L+++L+   I  NV E  LDI+  ILDPSKEM+SKFY GPVNG        S+A +RK+VLDA  
Subjt:  MN-DAVKWATIRALERLSISNVVCQTIENEGALGPLLSILRLSNIPENVMEKTLDILCRILDPSKEMKSKFYNGPVNGSHGGQHSESEASTRKDVLDAVA

Query:  VSRLVEILNTSSPNLKRKAASILEFVSIMDPSMEIIDSMAIESGLLAVFQLGVSIDSDAEDWQPERYALEVEEAGLAISAASRLLTKLLDSEKFYNKINS
         SRLV+I  T+SPNL R A S++EF  I +P+M+ I S  I + L    +  V  + + E  + E++ L++EEAGL ISAASRLLTKLLDSE F   I++
Subjt:  VSRLVEILNTSSPNLKRKAASILEFVSIMDPSMEIIDSMAIESGLLAVFQLGVSIDSDAEDWQPERYALEVEEAGLAISAASRLLTKLLDSEKFYNKINS

Query:  AHFTKLLRQILKSDIPIHHKDWIAACLVKVSSISALNMDSGNPINMEVTLYETIPRLIQQIKGSFSMEDQESSVVELNRILSEGIVDATRAVASKGGIFP
        A F +L+R+IL+S +P+H+KDW+AACLVK++++S+ +    NPIN+EVTLY+TIP L++Q+  S S E +E++V+ELN+I+SEG+ ++ + +AS+GGI P
Subjt:  AHFTKLLRQILKSDIPIHHKDWIAACLVKVSSISALNMDSGNPINMEVTLYETIPRLIQQIKGSFSMEDQESSVVELNRILSEGIVDATRAVASKGGIFP

Query:  LVKLIDEGSERATEAALAILYNLSMDSENHPAILAAGAVPALRRIVLSQRAEWRRALHLLRTLP
        LVKL++E +ER  EA+L++LYNL+MDSENH AI+ AGAVP LRRIV+SQR +W +AL LLR LP
Subjt:  LVKLIDEGSERATEAALAILYNLSMDSENHPAILAAGAVPALRRIVLSQRAEWRRALHLLRTLP

AT1G77460.1 Armadillo/beta-catenin-like repeat ; C2 calcium/lipid-binding domain (CaLB) protein9.0e-0740.26Show/hide
Query:  SMEDQESSVVELNRILSEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMDSEN-HPAILAAGAVPA
        S E  +   VE+ +IL+  + D+  AV + GGI PLV+L++ GS++A E A  IL+NL   SE     +  AG +PA
Subjt:  SMEDQESSVVELNRILSEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMDSEN-HPAILAAGAVPA

AT1G77460.2 Armadillo/beta-catenin-like repeat ; C2 calcium/lipid-binding domain (CaLB) protein9.0e-0740.26Show/hide
Query:  SMEDQESSVVELNRILSEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMDSEN-HPAILAAGAVPA
        S E  +   VE+ +IL+  + D+  AV + GGI PLV+L++ GS++A E A  IL+NL   SE     +  AG +PA
Subjt:  SMEDQESSVVELNRILSEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMDSEN-HPAILAAGAVPA

AT2G28830.1 PLANT U-BOX 122.9e-0522.64Show/hide
Query:  QQSAAPDIKDV--QNDSSSV---GHSYVALFVRMLGLDHDPLDREQAVIALWKYSLGGKKHIDAIMKFPGCINLTVNLLQSESIATCEAAAGLLRSISLV
        ++SAA +I+ +  QN+ + V       + L V +L + +D   +E AV ++   S+  +++   I+   G +   V++LQ  S+   E AA  L S+S++
Subjt:  QQSAAPDIKDV--QNDSSSV---GHSYVALFVRMLGLDHDPLDREQAVIALWKYSLGGKKHIDAIMKFPGCINLTVNLLQSESIATCEAAAGLLRSISLV

Query:  NLYRDSVAESGAIEEITGLLSRPSLAPEVKEQSICVLWNLSVDEKLRLKIADTDILLLLSKNLDDEDMKVKEAAGGVLANLALSPCNHGVIVESGLIPKL
        +  + ++  +GAI  +  LLS  S     K+ +   L+NL + +  + K     ++ +L + L + +  + + +  +LA L+  P     +  +  +P L
Subjt:  NLYRDSVAESGAIEEITGLLSRPSLAPEVKEQSICVLWNLSVDEKLRLKIADTDILLLLSKNLDDEDMKVKEAAGGVLANLALSPCNHGVIVESGLIPKL

Query:  AYQLKAEADSSK
           +++ +  +K
Subjt:  AYQLKAEADSSK

AT3G54850.1 plant U-box 149.0e-0724.88Show/hide
Query:  EQAVIALWKYSLGGKKHID--AIMKFPGCINLTVNLLQSESIATCEAAAGLLRSISLVNLYRDSVAESGAIEEITGLLSRPSLAPEVKEQSICVLWNLSV
        EQ   A  +  L  K+++D    +   G I L V LL S    T E +   L ++S+    + ++ ++GAI +I  +L   S+  E +E +   L++LSV
Subjt:  EQAVIALWKYSLGGKKHID--AIMKFPGCINLTVNLLQSESIATCEAAAGLLRSISLVNLYRDSVAESGAIEEITGLLSRPSLAPEVKEQSICVLWNLSV

Query:  DEKLRLKIADTDILLLLSKNLDDEDMKVKEAAGGVLANLALSPCNHGVIVESGLIPKLAYQLKAEADSSKIVRKEARNVLLELAKDEYYRILVIEEGLVP
         ++ ++ I     +  L   L++   + K+ A   + NL +   N    V+ G++  L   LK   D+   +  EA  +L  L+ ++  +  + E   +P
Subjt:  DEKLRLKIADTDILLLLSKNLDDEDMKVKEAAGGVLANLALSPCNHGVIVESGLIPKLAYQLKAEADSSKIVRKEARNVLLELAKDEYYRILVIEEGLVP

Query:  V
        V
Subjt:  V


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GCTGCGGGAACTAAACCCAGAAACCTATCGGTCTTTCTCTACCATCTACCTTATCGCGGTCGCTACCCAGAATCTCTCTCTAATCGTTTTGTCCGCCGGAGTGTCAGCAG
CGATGGCGGCGGCGCTGTTGATTCTACCCACCAGCAGTCTGCAGCTCCCGACATCAAAGATGTACAGAATGATTCTTCTAGTGTTGGACACAGTTATGTGGCATTATTTG
TCCGGATGCTCGGCTTAGATCACGATCCTCTTGATAGAGAACAAGCAGTAATAGCTTTATGGAAATATTCCCTTGGAGGAAAGAAGCACATAGATGCCATCATGAAATTT
CCTGGATGCATAAATCTAACAGTAAATCTTCTTCAGTCAGAATCAATCGCTACATGTGAAGCAGCTGCTGGCCTTCTACGGTCAATTTCTCTTGTGAACTTATACAGAGA
TTCAGTTGCAGAAAGTGGAGCAATTGAAGAGATAACTGGCTTGCTTAGTCGACCGTCGTTGGCTCCCGAGGTGAAGGAGCAAAGCATATGCGTTTTGTGGAATTTATCAG
TAGATGAAAAGCTCAGATTGAAGATTGCCGACACTGATATTCTTCTATTGCTTAGTAAGAATCTTGATGATGAGGACATGAAGGTGAAGGAAGCAGCTGGAGGGGTTTTG
GCAAATCTGGCATTGAGTCCATGTAACCATGGAGTTATTGTTGAATCAGGCTTAATTCCGAAATTGGCTTATCAGTTAAAAGCTGAGGCAGACAGCTCGAAAATTGTGAG
AAAGGAAGCAAGAAATGTGTTGCTAGAACTTGCTAAGGATGAGTATTATAGAATTCTTGTCATAGAGGAAGGACTGGTTCCAGTACCAATTGTTGGTGCTGCTGCCTACA
AGTCCTTCAGACCAGGCCTGCATTCATGGCCTAGTTTGCCTGATGGCACAGAAATTGAACAATCTTCTAACGGTCCTTCAAGATTTGGTGCCTCCGAATTACTCCTTGGA
TTAAATGTCGATAATAAGGCAAACATAGAGGAAGGAAAGATAAATGCAATTGTTGGACGGACGCAGCAGCAATTTCTGGCTCGAATTGGTGCTATAGAATTTGAAGATTT
GAAGGACTCTCAACTTGAGTCATCTACCGGTAACCATCTTACACTCTTACCATGGATAGATGGTGTGGCTCGACTAGTTTTGATCCTTGAACTTGAAGATGATACCGCCG
TAATGAGAGCTGCAGAGTCAATTGCTGACGCATCCATCAACGAACACATGCGTATTTCGTTCAAGGAAGCTGGAGCAATCAAGCATTTAGTTAAGGTTTTAGATAATATG
AATGATGCAGTCAAATGGGCTACAATTCGAGCTCTGGAGAGACTGTCAATCAGCAATGTTGTTTGCCAGACAATTGAAAATGAAGGTGCACTTGGTCCTTTGCTTAGTAT
TTTAAGGCTCTCAAATATCCCTGAAAATGTGATGGAGAAGACGCTCGATATACTTTGTCGAATCCTGGACCCCAGTAAAGAAATGAAATCAAAGTTTTACAATGGACCGG
TAAACGGGTCTCATGGAGGACAACATTCAGAATCTGAAGCTTCTACGAGGAAAGATGTGCTGGATGCTGTTGCAGTTTCTCGCCTCGTTGAGATTTTGAATACCTCATCC
CCAAACTTGAAACGCAAAGCCGCTTCTATCCTGGAATTTGTTTCTATTATGGACCCAAGCATGGAAATAATCGATTCCATGGCGATAGAATCTGGTTTGTTGGCTGTTTT
CCAGTTAGGAGTCTCAATAGATTCTGATGCTGAAGATTGGCAGCCTGAAAGGTACGCCCTTGAAGTTGAGGAAGCTGGTCTTGCCATATCAGCAGCCTCCCGTCTACTAA
CGAAACTTCTCGATTCCGAGAAGTTCTACAACAAAATAAACTCTGCCCATTTCACTAAGTTACTTCGTCAAATCCTAAAGTCAGACATTCCCATTCACCACAAAGACTGG
ATTGCTGCCTGCCTAGTCAAAGTTAGCTCCATTTCAGCTCTGAACATGGATTCAGGGAATCCGATCAACATGGAGGTAACTCTTTACGAAACAATACCAAGACTCATTCA
GCAGATCAAAGGCTCCTTCTCCATGGAAGATCAAGAATCCTCTGTTGTGGAACTGAACAGAATACTCTCAGAAGGAATTGTCGATGCTACCCGAGCTGTTGCTTCTAAAG
GTGGTATCTTTCCATTGGTAAAGCTGATCGATGAAGGAAGCGAGAGGGCCACAGAAGCAGCCCTAGCCATACTGTATAATCTGAGCATGGACAGTGAAAATCATCCAGCA
ATTTTAGCTGCTGGAGCCGTGCCAGCATTGAGGAGAATCGTACTATCACAACGAGCAGAATGGCGACGAGCTCTTCATTTGTTGAGGACATTGCCTACA
mRNA sequenceShow/hide mRNA sequence
GCTGCGGGAACTAAACCCAGAAACCTATCGGTCTTTCTCTACCATCTACCTTATCGCGGTCGCTACCCAGAATCTCTCTCTAATCGTTTTGTCCGCCGGAGTGTCAGCAG
CGATGGCGGCGGCGCTGTTGATTCTACCCACCAGCAGTCTGCAGCTCCCGACATCAAAGATGTACAGAATGATTCTTCTAGTGTTGGACACAGTTATGTGGCATTATTTG
TCCGGATGCTCGGCTTAGATCACGATCCTCTTGATAGAGAACAAGCAGTAATAGCTTTATGGAAATATTCCCTTGGAGGAAAGAAGCACATAGATGCCATCATGAAATTT
CCTGGATGCATAAATCTAACAGTAAATCTTCTTCAGTCAGAATCAATCGCTACATGTGAAGCAGCTGCTGGCCTTCTACGGTCAATTTCTCTTGTGAACTTATACAGAGA
TTCAGTTGCAGAAAGTGGAGCAATTGAAGAGATAACTGGCTTGCTTAGTCGACCGTCGTTGGCTCCCGAGGTGAAGGAGCAAAGCATATGCGTTTTGTGGAATTTATCAG
TAGATGAAAAGCTCAGATTGAAGATTGCCGACACTGATATTCTTCTATTGCTTAGTAAGAATCTTGATGATGAGGACATGAAGGTGAAGGAAGCAGCTGGAGGGGTTTTG
GCAAATCTGGCATTGAGTCCATGTAACCATGGAGTTATTGTTGAATCAGGCTTAATTCCGAAATTGGCTTATCAGTTAAAAGCTGAGGCAGACAGCTCGAAAATTGTGAG
AAAGGAAGCAAGAAATGTGTTGCTAGAACTTGCTAAGGATGAGTATTATAGAATTCTTGTCATAGAGGAAGGACTGGTTCCAGTACCAATTGTTGGTGCTGCTGCCTACA
AGTCCTTCAGACCAGGCCTGCATTCATGGCCTAGTTTGCCTGATGGCACAGAAATTGAACAATCTTCTAACGGTCCTTCAAGATTTGGTGCCTCCGAATTACTCCTTGGA
TTAAATGTCGATAATAAGGCAAACATAGAGGAAGGAAAGATAAATGCAATTGTTGGACGGACGCAGCAGCAATTTCTGGCTCGAATTGGTGCTATAGAATTTGAAGATTT
GAAGGACTCTCAACTTGAGTCATCTACCGGTAACCATCTTACACTCTTACCATGGATAGATGGTGTGGCTCGACTAGTTTTGATCCTTGAACTTGAAGATGATACCGCCG
TAATGAGAGCTGCAGAGTCAATTGCTGACGCATCCATCAACGAACACATGCGTATTTCGTTCAAGGAAGCTGGAGCAATCAAGCATTTAGTTAAGGTTTTAGATAATATG
AATGATGCAGTCAAATGGGCTACAATTCGAGCTCTGGAGAGACTGTCAATCAGCAATGTTGTTTGCCAGACAATTGAAAATGAAGGTGCACTTGGTCCTTTGCTTAGTAT
TTTAAGGCTCTCAAATATCCCTGAAAATGTGATGGAGAAGACGCTCGATATACTTTGTCGAATCCTGGACCCCAGTAAAGAAATGAAATCAAAGTTTTACAATGGACCGG
TAAACGGGTCTCATGGAGGACAACATTCAGAATCTGAAGCTTCTACGAGGAAAGATGTGCTGGATGCTGTTGCAGTTTCTCGCCTCGTTGAGATTTTGAATACCTCATCC
CCAAACTTGAAACGCAAAGCCGCTTCTATCCTGGAATTTGTTTCTATTATGGACCCAAGCATGGAAATAATCGATTCCATGGCGATAGAATCTGGTTTGTTGGCTGTTTT
CCAGTTAGGAGTCTCAATAGATTCTGATGCTGAAGATTGGCAGCCTGAAAGGTACGCCCTTGAAGTTGAGGAAGCTGGTCTTGCCATATCAGCAGCCTCCCGTCTACTAA
CGAAACTTCTCGATTCCGAGAAGTTCTACAACAAAATAAACTCTGCCCATTTCACTAAGTTACTTCGTCAAATCCTAAAGTCAGACATTCCCATTCACCACAAAGACTGG
ATTGCTGCCTGCCTAGTCAAAGTTAGCTCCATTTCAGCTCTGAACATGGATTCAGGGAATCCGATCAACATGGAGGTAACTCTTTACGAAACAATACCAAGACTCATTCA
GCAGATCAAAGGCTCCTTCTCCATGGAAGATCAAGAATCCTCTGTTGTGGAACTGAACAGAATACTCTCAGAAGGAATTGTCGATGCTACCCGAGCTGTTGCTTCTAAAG
GTGGTATCTTTCCATTGGTAAAGCTGATCGATGAAGGAAGCGAGAGGGCCACAGAAGCAGCCCTAGCCATACTGTATAATCTGAGCATGGACAGTGAAAATCATCCAGCA
ATTTTAGCTGCTGGAGCCGTGCCAGCATTGAGGAGAATCGTACTATCACAACGAGCAGAATGGCGACGAGCTCTTCATTTGTTGAGGACATTGCCTACA
Protein sequenceShow/hide protein sequence
AAGTKPRNLSVFLYHLPYRGRYPESLSNRFVRRSVSSDGGGAVDSTHQQSAAPDIKDVQNDSSSVGHSYVALFVRMLGLDHDPLDREQAVIALWKYSLGGKKHIDAIMKF
PGCINLTVNLLQSESIATCEAAAGLLRSISLVNLYRDSVAESGAIEEITGLLSRPSLAPEVKEQSICVLWNLSVDEKLRLKIADTDILLLLSKNLDDEDMKVKEAAGGVL
ANLALSPCNHGVIVESGLIPKLAYQLKAEADSSKIVRKEARNVLLELAKDEYYRILVIEEGLVPVPIVGAAAYKSFRPGLHSWPSLPDGTEIEQSSNGPSRFGASELLLG
LNVDNKANIEEGKINAIVGRTQQQFLARIGAIEFEDLKDSQLESSTGNHLTLLPWIDGVARLVLILELEDDTAVMRAAESIADASINEHMRISFKEAGAIKHLVKVLDNM
NDAVKWATIRALERLSISNVVCQTIENEGALGPLLSILRLSNIPENVMEKTLDILCRILDPSKEMKSKFYNGPVNGSHGGQHSESEASTRKDVLDAVAVSRLVEILNTSS
PNLKRKAASILEFVSIMDPSMEIIDSMAIESGLLAVFQLGVSIDSDAEDWQPERYALEVEEAGLAISAASRLLTKLLDSEKFYNKINSAHFTKLLRQILKSDIPIHHKDW
IAACLVKVSSISALNMDSGNPINMEVTLYETIPRLIQQIKGSFSMEDQESSVVELNRILSEGIVDATRAVASKGGIFPLVKLIDEGSERATEAALAILYNLSMDSENHPA
ILAAGAVPALRRIVLSQRAEWRRALHLLRTLPT