CuGenDBv2

MS006364 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS006364
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: ARM repeat superfamily protein
Genome location: scaffold327:125775..127484
RNA-Seq Expression: MS006364
Synteny: MS006364
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR000225 - Armadillo; IPR011989 - Armadillo-like helical; IPR016024 - Armadillo-type fold


Homology
GenBank top hits (e value, % identity, alignment)
KAG6589272.1 hypothetical protein SDJN03_17837, partial [Cucurbita argyrosperma subsp. sororia] (e value 1.7e-272, 89.01% identity)
Query:  MPPTTAG---PPPSTSSSQALLDLIIHIVSLLLLSSLNVRSFVGRWQLLHSKLSTLHSALSEISDSPHWSENPLVHTILPSLLSTLQRLKSLSAQCSDSA
        MPP TAG   PPPSTSSSQ LLDLII IVSLLLLSSLNVRSFVGRWQ+LHSKLSTLHSAL E+SDSPHWSENPLVHTILPSLLSTLQRLKSLS QCSDSA
Subjt:  MPPTTAG---PPPSTSSSQALLDLIIHIVSLLLLSSLNVRSFVGRWQLLHSKLSTLHSALSEISDSPHWSENPLVHTILPSLLSTLQRLKSLSAQCSDSA

Query:  FSGGKLHMQSDLDMASASLSNQLNDLDLLLRSGVLYQSNALVLSQPAPGSNKDDTEFFIRDLFTRLQIGGTEFKKKALESLVQLLNQDEKSAGLVAKEGN
        FSGGKLHMQSDLDMASASLS+QLNDLDLLLRSGVLYQSNALVLSQP PGSNKDDTEFFIRDLFTRLQIGG EFKKKALESLVQLLNQDEKSAGLVAKEGN
Subjt:  FSGGKLHMQSDLDMASASLSNQLNDLDLLLRSGVLYQSNALVLSQPAPGSNKDDTEFFIRDLFTRLQIGGTEFKKKALESLVQLLNQDEKSAGLVAKEGN

Query:  VGYLVHLLDFNAQPSVRELAASAVSVLSTASDESRKRVFEEGGLGPLLRILETGSMHLKEKAAAAVEAITIDPENAWAVSAYGGVAILIEACRSGTPPVQ
        VGYL+HLLDF++QPSVRELAASAVSVLSTASDESRKRVFEEGGLGPLLRILETGSMHLKEKAAAAVE+ITID ENAWAVSAYGG+++LIEACRSGTP +Q
Subjt:  VGYLVHLLDFNAQPSVRELAASAVSVLSTASDESRKRVFEEGGLGPLLRILETGSMHLKEKAAAAVEAITIDPENAWAVSAYGGVAILIEACRSGTPPVQ

Query:  APAVGAIRNVTAVEDIKTSLVEEGAVPVLLQLLVSSTMATQEKAAICIAVLASSGDYFRSLIIQERGLPRLLHLIHDSPSSDTIENALRALSSLAVSDSV
        A AVGAIRNVTAVEDIKTSLVEEGA+PVLLQLLVSS   TQEKAA+ IAVLASSG+YFRS IIQERGL +LL LIHDSPS +TIEN LRALSSLAVSD +
Subjt:  APAVGAIRNVTAVEDIKTSLVEEGAVPVLLQLLVSSTMATQEKAAICIAVLASSGDYFRSLIIQERGLPRLLHLIHDSPSSDTIENALRALSSLAVSDSV

Query:  ARILSSSTLFVMKLGELVKHGNLMLQQIAASLLANLSISDGNKRAIWSCMASLVKLMEMPKPAGVQEVAVQALASLLTVRSNRKELMRDEKSVMRLMQML
        ARILSSSTLF+MKLGEL+KHG+L+LQQIAASL+ANLSISDGNKRAI SCM SLVKLMEMPKPAGVQE AV ALASLLTVRSNRKELMRDEKSVMRLMQML
Subjt:  ARILSSSTLFVMKLGELVKHGNLMLQQIAASLLANLSISDGNKRAIWSCMASLVKLMEMPKPAGVQEVAVQALASLLTVRSNRKELMRDEKSVMRLMQML

Query:  DPRNEVVVKTFPIAIVSAVLAGGSNGCRKRLIDAGAYQNLQKLSDMNVVGAKKALQRLTGNRLRSIFSRTWRE
        DP+NEV+ KTFP+AIV+AVL+GGSNGCR+RL++AGA+Q+LQ LSDMNV GAKKALQRLTGNRLR+IFSRTWRE
Subjt:  DPRNEVVVKTFPIAIVSAVLAGGSNGCRKRLIDAGAYQNLQKLSDMNVVGAKKALQRLTGNRLRSIFSRTWRE

XP_008449480.1 PREDICTED: armadillo repeat-containing protein 3 [Cucumis melo] (e value 3.5e-273, 89.72% identity)
Query:  MPPTTAG----PPPSTSSSQALLDLIIHIVSLLLLSSLNVRSFVGRWQLLHSKLSTLHSALSEISDSPHWSENPLVHTILPSLLSTLQRLKSLSAQCSDS
        MPPTT G    PPPSTSSSQALLDLII IVSLLLLSSLNVRSFVGRWQ+LHSKL+ LHSAL EI DS HWSENPLVHTILPSLLSTLQRLKSLS QCSD 
Subjt:  MPPTTAG----PPPSTSSSQALLDLIIHIVSLLLLSSLNVRSFVGRWQLLHSKLSTLHSALSEISDSPHWSENPLVHTILPSLLSTLQRLKSLSAQCSDS

Query:  AFSGGKLHMQSDLDMASASLSNQLNDLDLLLRSGVLYQSNALVLSQPAPGSNKDDTEFFIRDLFTRLQIGGTEFKKKALESLVQLLNQDEKSAGLVAKEG
        AFSGGKLHMQSDLDMASASLS+QLNDLDLLLRSGVLYQSNALVLSQP PGSNKDDTEFFIRDLFTRLQIGG EFKKKALESLVQLLNQDEKSAGLVAKEG
Subjt:  AFSGGKLHMQSDLDMASASLSNQLNDLDLLLRSGVLYQSNALVLSQPAPGSNKDDTEFFIRDLFTRLQIGGTEFKKKALESLVQLLNQDEKSAGLVAKEG

Query:  NVGYLVHLLDFNAQPSVRELAASAVSVLSTASDESRKRVFEEGGLGPLLRILETGSMHLKEKAAAAVEAITIDPENAWAVSAYGGVAILIEACRSGTPPV
        NVGYLVHLLDFNAQPSVRELAASA+SVLSTASDESRKRVFEEGGLGPLLRILETGSMHLKEKAAAAVEAITID ENAWAVSAYGG+++LI+ACRSGTP +
Subjt:  NVGYLVHLLDFNAQPSVRELAASAVSVLSTASDESRKRVFEEGGLGPLLRILETGSMHLKEKAAAAVEAITIDPENAWAVSAYGGVAILIEACRSGTPPV

Query:  QAPAVGAIRNVTAVEDIKTSLVEEGAVPVLLQLLVSSTMATQEKAAICIAVLASSGDYFRSLIIQERGLPRLLHLIHDSPSSDTIENALRALSSLAVSDS
        QA AVGAIRNVTAVEDIK SLVEEG +PVLLQLLVSST A+QEKAA+  AVLASSG+YFRSLIIQERGL RLLHLIHDS S DTIE+ALRALSSLAVSDS
Subjt:  QAPAVGAIRNVTAVEDIKTSLVEEGAVPVLLQLLVSSTMATQEKAAICIAVLASSGDYFRSLIIQERGLPRLLHLIHDSPSSDTIENALRALSSLAVSDS

Query:  VARILSSSTLFVMKLGELVKHGNLMLQQIAASLLANLSISDGNKRAIWSCMASLVKLMEMPKPAGVQEVAVQALASLLTVRSNRKELMRDEKSVMRLMQM
        VARILSSSTLFVMKLGELVKHGNL+LQQIAASL++NLSISDGNKRAI SCM SLVKLMEMPKPAGVQEVAV+ALASLLTVRSNRKELM+DEKSVMRLMQM
Subjt:  VARILSSSTLFVMKLGELVKHGNLMLQQIAASLLANLSISDGNKRAIWSCMASLVKLMEMPKPAGVQEVAVQALASLLTVRSNRKELMRDEKSVMRLMQM

Query:  LDPRNEVVVKTFPIAIVSAVLAGGSNGCRKRLIDAGAYQNLQKLSDMNVVGAKKALQRLTGNRLRSIFSRTWRE
        LDP+NEVV K+FPIAIV+AVLAGGS GCRKRL++AGAYQ+LQ L+DMNV GAKKALQRL GNRLRSIF+RTWRE
Subjt:  LDPRNEVVVKTFPIAIVSAVLAGGSNGCRKRLIDAGAYQNLQKLSDMNVVGAKKALQRLTGNRLRSIFSRTWRE

XP_022148067.1 armadillo repeat-containing protein 3 [Momordica charantia] (e value 5.8e-305, 99.82% identity)
Query:  MPPTTAGPPPSTSSSQALLDLIIHIVSLLLLSSLNVRSFVGRWQLLHSKLSTLHSALSEISDSPHWSENPLVHTILPSLLSTLQRLKSLSAQCSDSAFSG
        MPPTTAGPPPSTSSSQALLDLIIHIVSLLLLSSLNVRSFVGRWQLLHSKLSTLHSALSEISDSPHWSENPLVHTILPSLLSTLQRLKSLSAQCSDSAFSG
Subjt:  MPPTTAGPPPSTSSSQALLDLIIHIVSLLLLSSLNVRSFVGRWQLLHSKLSTLHSALSEISDSPHWSENPLVHTILPSLLSTLQRLKSLSAQCSDSAFSG

Query:  GKLHMQSDLDMASASLSNQLNDLDLLLRSGVLYQSNALVLSQPAPGSNKDDTEFFIRDLFTRLQIGGTEFKKKALESLVQLLNQDEKSAGLVAKEGNVGY
        GKLHMQSDLDMASASLSNQLNDLDLLLRSGVLYQSNALVLSQPAPGSNKDDTEFFIRDLFTRLQIGGTEFKKKALESLVQLLNQDEKSAGLVAKEGNVGY
Subjt:  GKLHMQSDLDMASASLSNQLNDLDLLLRSGVLYQSNALVLSQPAPGSNKDDTEFFIRDLFTRLQIGGTEFKKKALESLVQLLNQDEKSAGLVAKEGNVGY

Query:  LVHLLDFNAQPSVRELAASAVSVLSTASDESRKRVFEEGGLGPLLRILETGSMHLKEKAAAAVEAITIDPENAWAVSAYGGVAILIEACRSGTPPVQAPA
        LVHLLDFNAQPSVRELAASAVSVLSTASDESRKRVFEEGGLGPLLRILETGSMHLKEKAAAAVEAITIDPENAWAVSAYGGVAILIEACRSGTPPVQAPA
Subjt:  LVHLLDFNAQPSVRELAASAVSVLSTASDESRKRVFEEGGLGPLLRILETGSMHLKEKAAAAVEAITIDPENAWAVSAYGGVAILIEACRSGTPPVQAPA

Query:  VGAIRNVTAVEDIKTSLVEEGAVPVLLQLLVSSTMATQEKAAICIAVLASSGDYFRSLIIQERGLPRLLHLIHDSPSSDTIENALRALSSLAVSDSVARI
        VGAIRNVTAVEDIKTSLVEEGAVPVLLQLLVSSTMATQEKAAICIAVLASSGDYFRSLIIQERGLPRLLHLIHDSPSSDTIENALRALSSL+VSDSVARI
Subjt:  VGAIRNVTAVEDIKTSLVEEGAVPVLLQLLVSSTMATQEKAAICIAVLASSGDYFRSLIIQERGLPRLLHLIHDSPSSDTIENALRALSSLAVSDSVARI

Query:  LSSSTLFVMKLGELVKHGNLMLQQIAASLLANLSISDGNKRAIWSCMASLVKLMEMPKPAGVQEVAVQALASLLTVRSNRKELMRDEKSVMRLMQMLDPR
        LSSSTLFVMKLGELVKHGNLMLQQIAASLLANLSISDGNKRAIWSCMASLVKLMEMPKPAGVQEVAVQALASLLTVRSNRKELMRDEKSVMRLMQMLDPR
Subjt:  LSSSTLFVMKLGELVKHGNLMLQQIAASLLANLSISDGNKRAIWSCMASLVKLMEMPKPAGVQEVAVQALASLLTVRSNRKELMRDEKSVMRLMQMLDPR

Query:  NEVVVKTFPIAIVSAVLAGGSNGCRKRLIDAGAYQNLQKLSDMNVVGAKKALQRLTGNRLRSIFSRTWRE
        NEVVVKTFPIAIVSAVLAGGSNGCRKRLIDAGAYQNLQKLSDMNVVGAKKALQRLTGNRLRSIFSRTWRE
Subjt:  NEVVVKTFPIAIVSAVLAGGSNGCRKRLIDAGAYQNLQKLSDMNVVGAKKALQRLTGNRLRSIFSRTWRE

XP_023511429.1 armadillo repeat-containing protein 3-like [Cucurbita pepo subsp. pepo] (e value 2.9e-272, 89.02% identity)
Query:  MPPTTAGPP----PSTSSSQALLDLIIHIVSLLLLSSLNVRSFVGRWQLLHSKLSTLHSALSEISDSPHWSENPLVHTILPSLLSTLQRLKSLSAQCSDS
        MPPTTAGPP    PS SSSQALLDLII IVSLLLLSSLNVRSFVGRWQ+LHSKLSTLHSAL EI DSP+WSENPLV TILPSLLSTLQRLKSLS QCSDS
Subjt:  MPPTTAGPP----PSTSSSQALLDLIIHIVSLLLLSSLNVRSFVGRWQLLHSKLSTLHSALSEISDSPHWSENPLVHTILPSLLSTLQRLKSLSAQCSDS

Query:  AFSGGKLHMQSDLDMASASLSNQLNDLDLLLRSGVLYQSNALVLSQPAPGSNKDDTEFFIRDLFTRLQIGGTEFKKKALESLVQLLNQDEKSAGLVAKEG
        AFSGGKLHMQSDLDM SASLS+QLNDLDLLLRSGVLYQSNALVLSQP PGSN DDTEFFIRDLFTRLQIGG EFKKKALESLVQLLNQDEKSA LVAKEG
Subjt:  AFSGGKLHMQSDLDMASASLSNQLNDLDLLLRSGVLYQSNALVLSQPAPGSNKDDTEFFIRDLFTRLQIGGTEFKKKALESLVQLLNQDEKSAGLVAKEG

Query:  NVGYLVHLLDFNAQPSVRELAASAVSVLSTASDESRKRVFEEGGLGPLLRILETGSMHLKEKAAAAVEAITIDPENAWAVSAYGGVAILIEACRSGTPPV
        NVGYL+HLLDFNA PSVRE+AASAVSVLSTASDESR+RVFEEGGLGPLLRILETGS+HLKEKAAAAVEAIT+DPENAWAVSAYGGV +LIEACRSGTPP+
Subjt:  NVGYLVHLLDFNAQPSVRELAASAVSVLSTASDESRKRVFEEGGLGPLLRILETGSMHLKEKAAAAVEAITIDPENAWAVSAYGGVAILIEACRSGTPPV

Query:  QAPAVGAIRNVTAVEDIKTSLVEEGAVPVLLQLLVSSTMATQEKAAICIAVLASSGDYFRSLIIQERGLPRLLHLIHDSPSSDTIENALRALSSLAVSDS
        QA AVGAIRNVTAVEDIKTSLVEEGA+PVLLQ L+SST ATQEKA++ IAVLA+SG+YFRSLIIQE GL +LLHLIHDSPSSDTI NALRAL SLAVSDS
Subjt:  QAPAVGAIRNVTAVEDIKTSLVEEGAVPVLLQLLVSSTMATQEKAAICIAVLASSGDYFRSLIIQERGLPRLLHLIHDSPSSDTIENALRALSSLAVSDS

Query:  VARILSSSTLFVMKLGELVKHGNLMLQQIAASLLANLSISDGNKRAIWSCMASLVKLMEMPKPAGVQEVAVQALASLLTVRSNRKELMRDEKSVMRLMQM
        VARILSSSTLFVMKLGELVKHGNL+LQQIAASL+ANLSISDGNKRAI SCM SLVKLMEMPKPAGVQEVAV ALASLLTVRSNRKELM+DEKSVMRLMQM
Subjt:  VARILSSSTLFVMKLGELVKHGNLMLQQIAASLLANLSISDGNKRAIWSCMASLVKLMEMPKPAGVQEVAVQALASLLTVRSNRKELMRDEKSVMRLMQM

Query:  LDPRNEVVVKTFPIAIVSAVLAGGSNGCRKRLIDAGAYQNLQKLSDMNVVGAKKALQRLTGNRLRSIFSRTWRE
        LDP+NEVV KTFP+AI++AVLAGGSNGCRKRLIDAGAYQ+LQ L++ ++ GAKK LQRL GNRLRSIFSRTWRE
Subjt:  LDPRNEVVVKTFPIAIVSAVLAGGSNGCRKRLIDAGAYQNLQKLSDMNVVGAKKALQRLTGNRLRSIFSRTWRE

XP_038888451.1 armadillo repeat-containing protein 3 [Benincasa hispida] (e value 9.7e-276, 90.59% identity)
Query:  MPPTTAG----PPPSTSSSQALLDLIIHIVSLLLLSSLNVRSFVGRWQLLHSKLSTLHSALSEISDSPHWSENPLVHTILPSLLSTLQRLKSLSAQCSDS
        MPPTTAG    PP STSSSQALLDLII IVS LLLSSLNVRSFVGRWQ+LHSKLS LHSAL EI DSPHWSENPLVHTILPSLLSTLQRLKSLS QCSD 
Subjt:  MPPTTAG----PPPSTSSSQALLDLIIHIVSLLLLSSLNVRSFVGRWQLLHSKLSTLHSALSEISDSPHWSENPLVHTILPSLLSTLQRLKSLSAQCSDS

Query:  AFSGGKLHMQSDLDMASASLSNQLNDLDLLLRSGVLYQSNALVLSQPAPGSNKDDTEFFIRDLFTRLQIGGTEFKKKALESLVQLLNQDEKSAGLVAKEG
        AFSGGKLHMQSDLDMASASLS+QLNDLDLLLRSGVLYQSNALVLSQP PGSNKDDTEFFIRDLFTRLQIGG EFKKKALESLVQLLNQDEKSAGLVAKEG
Subjt:  AFSGGKLHMQSDLDMASASLSNQLNDLDLLLRSGVLYQSNALVLSQPAPGSNKDDTEFFIRDLFTRLQIGGTEFKKKALESLVQLLNQDEKSAGLVAKEG

Query:  NVGYLVHLLDFNAQPSVRELAASAVSVLSTASDESRKRVFEEGGLGPLLRILETGSMHLKEKAAAAVEAITIDPENAWAVSAYGGVAILIEACRSGTPPV
        N+GYLVHLLDFNAQPSVRELAASAVSVLSTA DESRKRVFEEGGLGPLLRILETG MHLKEKAAAAVEAITID ENAWAVSAYGGV++LI+ACRSGTP +
Subjt:  NVGYLVHLLDFNAQPSVRELAASAVSVLSTASDESRKRVFEEGGLGPLLRILETGSMHLKEKAAAAVEAITIDPENAWAVSAYGGVAILIEACRSGTPPV

Query:  QAPAVGAIRNVTAVEDIKTSLVEEGAVPVLLQLLVSSTMATQEKAAICIAVLASSGDYFRSLIIQERGLPRLLHLIHDSPSSDTIENALRALSSLAVSDS
        QA AVGAIRNVTAVEDIK+SLVEEGA+PVLLQLLVSST A QEKAA+ IAVLASSG+Y+RSLIIQERGL RLLHLIHDSPSSDTIENALRALSSLAVSDS
Subjt:  QAPAVGAIRNVTAVEDIKTSLVEEGAVPVLLQLLVSSTMATQEKAAICIAVLASSGDYFRSLIIQERGLPRLLHLIHDSPSSDTIENALRALSSLAVSDS

Query:  VARILSSSTLFVMKLGELVKHGNLMLQQIAASLLANLSISDGNKRAIWSCMASLVKLMEMPKPAGVQEVAVQALASLLTVRSNRKELMRDEKSVMRLMQM
        VARILSSSTLFVMKLGELVKHGNL+LQQIAASL++NLSISDGNKRAI SCM+SLVKLMEMPKPAGVQEVAV+ALASLLTVRSNRKELM+DEKSVMRLMQM
Subjt:  VARILSSSTLFVMKLGELVKHGNLMLQQIAASLLANLSISDGNKRAIWSCMASLVKLMEMPKPAGVQEVAVQALASLLTVRSNRKELMRDEKSVMRLMQM

Query:  LDPRNEVVVKTFPIAIVSAVLAGGSNGCRKRLIDAGAYQNLQKLSDMNVVGAKKALQRLTGNRLRSIFSRTWRE
        LDP+NEVV K+FP+AIV+AVLAGGSNGCRKRLIDAGAYQ+LQ L+DMNV GAKKALQRL GNRLRSIF+RTW+E
Subjt:  LDPRNEVVVKTFPIAIVSAVLAGGSNGCRKRLIDAGAYQNLQKLSDMNVVGAKKALQRLTGNRLRSIFSRTWRE

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BLH4 armadillo repeat-containing protein 3 (e value 1.7e-273, 89.72% identity)
Query:  MPPTTAG----PPPSTSSSQALLDLIIHIVSLLLLSSLNVRSFVGRWQLLHSKLSTLHSALSEISDSPHWSENPLVHTILPSLLSTLQRLKSLSAQCSDS
        MPPTT G    PPPSTSSSQALLDLII IVSLLLLSSLNVRSFVGRWQ+LHSKL+ LHSAL EI DS HWSENPLVHTILPSLLSTLQRLKSLS QCSD 
Subjt:  MPPTTAG----PPPSTSSSQALLDLIIHIVSLLLLSSLNVRSFVGRWQLLHSKLSTLHSALSEISDSPHWSENPLVHTILPSLLSTLQRLKSLSAQCSDS

Query:  AFSGGKLHMQSDLDMASASLSNQLNDLDLLLRSGVLYQSNALVLSQPAPGSNKDDTEFFIRDLFTRLQIGGTEFKKKALESLVQLLNQDEKSAGLVAKEG
        AFSGGKLHMQSDLDMASASLS+QLNDLDLLLRSGVLYQSNALVLSQP PGSNKDDTEFFIRDLFTRLQIGG EFKKKALESLVQLLNQDEKSAGLVAKEG
Subjt:  AFSGGKLHMQSDLDMASASLSNQLNDLDLLLRSGVLYQSNALVLSQPAPGSNKDDTEFFIRDLFTRLQIGGTEFKKKALESLVQLLNQDEKSAGLVAKEG

Query:  NVGYLVHLLDFNAQPSVRELAASAVSVLSTASDESRKRVFEEGGLGPLLRILETGSMHLKEKAAAAVEAITIDPENAWAVSAYGGVAILIEACRSGTPPV
        NVGYLVHLLDFNAQPSVRELAASA+SVLSTASDESRKRVFEEGGLGPLLRILETGSMHLKEKAAAAVEAITID ENAWAVSAYGG+++LI+ACRSGTP +
Subjt:  NVGYLVHLLDFNAQPSVRELAASAVSVLSTASDESRKRVFEEGGLGPLLRILETGSMHLKEKAAAAVEAITIDPENAWAVSAYGGVAILIEACRSGTPPV

Query:  QAPAVGAIRNVTAVEDIKTSLVEEGAVPVLLQLLVSSTMATQEKAAICIAVLASSGDYFRSLIIQERGLPRLLHLIHDSPSSDTIENALRALSSLAVSDS
        QA AVGAIRNVTAVEDIK SLVEEG +PVLLQLLVSST A+QEKAA+  AVLASSG+YFRSLIIQERGL RLLHLIHDS S DTIE+ALRALSSLAVSDS
Subjt:  QAPAVGAIRNVTAVEDIKTSLVEEGAVPVLLQLLVSSTMATQEKAAICIAVLASSGDYFRSLIIQERGLPRLLHLIHDSPSSDTIENALRALSSLAVSDS

Query:  VARILSSSTLFVMKLGELVKHGNLMLQQIAASLLANLSISDGNKRAIWSCMASLVKLMEMPKPAGVQEVAVQALASLLTVRSNRKELMRDEKSVMRLMQM
        VARILSSSTLFVMKLGELVKHGNL+LQQIAASL++NLSISDGNKRAI SCM SLVKLMEMPKPAGVQEVAV+ALASLLTVRSNRKELM+DEKSVMRLMQM
Subjt:  VARILSSSTLFVMKLGELVKHGNLMLQQIAASLLANLSISDGNKRAIWSCMASLVKLMEMPKPAGVQEVAVQALASLLTVRSNRKELMRDEKSVMRLMQM

Query:  LDPRNEVVVKTFPIAIVSAVLAGGSNGCRKRLIDAGAYQNLQKLSDMNVVGAKKALQRLTGNRLRSIFSRTWRE
        LDP+NEVV K+FPIAIV+AVLAGGS GCRKRL++AGAYQ+LQ L+DMNV GAKKALQRL GNRLRSIF+RTWRE
Subjt:  LDPRNEVVVKTFPIAIVSAVLAGGSNGCRKRLIDAGAYQNLQKLSDMNVVGAKKALQRLTGNRLRSIFSRTWRE

A0A5A7V7E1 Armadillo repeat-containing protein 3 (e value 1.7e-273, 89.72% identity)
Query:  MPPTTAG----PPPSTSSSQALLDLIIHIVSLLLLSSLNVRSFVGRWQLLHSKLSTLHSALSEISDSPHWSENPLVHTILPSLLSTLQRLKSLSAQCSDS
        MPPTT G    PPPSTSSSQALLDLII IVSLLLLSSLNVRSFVGRWQ+LHSKL+ LHSAL EI DS HWSENPLVHTILPSLLSTLQRLKSLS QCSD 
Subjt:  MPPTTAG----PPPSTSSSQALLDLIIHIVSLLLLSSLNVRSFVGRWQLLHSKLSTLHSALSEISDSPHWSENPLVHTILPSLLSTLQRLKSLSAQCSDS

Query:  AFSGGKLHMQSDLDMASASLSNQLNDLDLLLRSGVLYQSNALVLSQPAPGSNKDDTEFFIRDLFTRLQIGGTEFKKKALESLVQLLNQDEKSAGLVAKEG
        AFSGGKLHMQSDLDMASASLS+QLNDLDLLLRSGVLYQSNALVLSQP PGSNKDDTEFFIRDLFTRLQIGG EFKKKALESLVQLLNQDEKSAGLVAKEG
Subjt:  AFSGGKLHMQSDLDMASASLSNQLNDLDLLLRSGVLYQSNALVLSQPAPGSNKDDTEFFIRDLFTRLQIGGTEFKKKALESLVQLLNQDEKSAGLVAKEG

Query:  NVGYLVHLLDFNAQPSVRELAASAVSVLSTASDESRKRVFEEGGLGPLLRILETGSMHLKEKAAAAVEAITIDPENAWAVSAYGGVAILIEACRSGTPPV
        NVGYLVHLLDFNAQPSVRELAASA+SVLSTASDESRKRVFEEGGLGPLLRILETGSMHLKEKAAAAVEAITID ENAWAVSAYGG+++LI+ACRSGTP +
Subjt:  NVGYLVHLLDFNAQPSVRELAASAVSVLSTASDESRKRVFEEGGLGPLLRILETGSMHLKEKAAAAVEAITIDPENAWAVSAYGGVAILIEACRSGTPPV

Query:  QAPAVGAIRNVTAVEDIKTSLVEEGAVPVLLQLLVSSTMATQEKAAICIAVLASSGDYFRSLIIQERGLPRLLHLIHDSPSSDTIENALRALSSLAVSDS
        QA AVGAIRNVTAVEDIK SLVEEG +PVLLQLLVSST A+QEKAA+  AVLASSG+YFRSLIIQERGL RLLHLIHDS S DTIE+ALRALSSLAVSDS
Subjt:  QAPAVGAIRNVTAVEDIKTSLVEEGAVPVLLQLLVSSTMATQEKAAICIAVLASSGDYFRSLIIQERGLPRLLHLIHDSPSSDTIENALRALSSLAVSDS

Query:  VARILSSSTLFVMKLGELVKHGNLMLQQIAASLLANLSISDGNKRAIWSCMASLVKLMEMPKPAGVQEVAVQALASLLTVRSNRKELMRDEKSVMRLMQM
        VARILSSSTLFVMKLGELVKHGNL+LQQIAASL++NLSISDGNKRAI SCM SLVKLMEMPKPAGVQEVAV+ALASLLTVRSNRKELM+DEKSVMRLMQM
Subjt:  VARILSSSTLFVMKLGELVKHGNLMLQQIAASLLANLSISDGNKRAIWSCMASLVKLMEMPKPAGVQEVAVQALASLLTVRSNRKELMRDEKSVMRLMQM

Query:  LDPRNEVVVKTFPIAIVSAVLAGGSNGCRKRLIDAGAYQNLQKLSDMNVVGAKKALQRLTGNRLRSIFSRTWRE
        LDP+NEVV K+FPIAIV+AVLAGGS GCRKRL++AGAYQ+LQ L+DMNV GAKKALQRL GNRLRSIF+RTWRE
Subjt:  LDPRNEVVVKTFPIAIVSAVLAGGSNGCRKRLIDAGAYQNLQKLSDMNVVGAKKALQRLTGNRLRSIFSRTWRE

A0A6J1D423 armadillo repeat-containing protein 3 (e value 2.8e-305, 99.82% identity)
Query:  MPPTTAGPPPSTSSSQALLDLIIHIVSLLLLSSLNVRSFVGRWQLLHSKLSTLHSALSEISDSPHWSENPLVHTILPSLLSTLQRLKSLSAQCSDSAFSG
        MPPTTAGPPPSTSSSQALLDLIIHIVSLLLLSSLNVRSFVGRWQLLHSKLSTLHSALSEISDSPHWSENPLVHTILPSLLSTLQRLKSLSAQCSDSAFSG
Subjt:  MPPTTAGPPPSTSSSQALLDLIIHIVSLLLLSSLNVRSFVGRWQLLHSKLSTLHSALSEISDSPHWSENPLVHTILPSLLSTLQRLKSLSAQCSDSAFSG

Query:  GKLHMQSDLDMASASLSNQLNDLDLLLRSGVLYQSNALVLSQPAPGSNKDDTEFFIRDLFTRLQIGGTEFKKKALESLVQLLNQDEKSAGLVAKEGNVGY
        GKLHMQSDLDMASASLSNQLNDLDLLLRSGVLYQSNALVLSQPAPGSNKDDTEFFIRDLFTRLQIGGTEFKKKALESLVQLLNQDEKSAGLVAKEGNVGY
Subjt:  GKLHMQSDLDMASASLSNQLNDLDLLLRSGVLYQSNALVLSQPAPGSNKDDTEFFIRDLFTRLQIGGTEFKKKALESLVQLLNQDEKSAGLVAKEGNVGY

Query:  LVHLLDFNAQPSVRELAASAVSVLSTASDESRKRVFEEGGLGPLLRILETGSMHLKEKAAAAVEAITIDPENAWAVSAYGGVAILIEACRSGTPPVQAPA
        LVHLLDFNAQPSVRELAASAVSVLSTASDESRKRVFEEGGLGPLLRILETGSMHLKEKAAAAVEAITIDPENAWAVSAYGGVAILIEACRSGTPPVQAPA
Subjt:  LVHLLDFNAQPSVRELAASAVSVLSTASDESRKRVFEEGGLGPLLRILETGSMHLKEKAAAAVEAITIDPENAWAVSAYGGVAILIEACRSGTPPVQAPA

Query:  VGAIRNVTAVEDIKTSLVEEGAVPVLLQLLVSSTMATQEKAAICIAVLASSGDYFRSLIIQERGLPRLLHLIHDSPSSDTIENALRALSSLAVSDSVARI
        VGAIRNVTAVEDIKTSLVEEGAVPVLLQLLVSSTMATQEKAAICIAVLASSGDYFRSLIIQERGLPRLLHLIHDSPSSDTIENALRALSSL+VSDSVARI
Subjt:  VGAIRNVTAVEDIKTSLVEEGAVPVLLQLLVSSTMATQEKAAICIAVLASSGDYFRSLIIQERGLPRLLHLIHDSPSSDTIENALRALSSLAVSDSVARI

Query:  LSSSTLFVMKLGELVKHGNLMLQQIAASLLANLSISDGNKRAIWSCMASLVKLMEMPKPAGVQEVAVQALASLLTVRSNRKELMRDEKSVMRLMQMLDPR
        LSSSTLFVMKLGELVKHGNLMLQQIAASLLANLSISDGNKRAIWSCMASLVKLMEMPKPAGVQEVAVQALASLLTVRSNRKELMRDEKSVMRLMQMLDPR
Subjt:  LSSSTLFVMKLGELVKHGNLMLQQIAASLLANLSISDGNKRAIWSCMASLVKLMEMPKPAGVQEVAVQALASLLTVRSNRKELMRDEKSVMRLMQMLDPR

Query:  NEVVVKTFPIAIVSAVLAGGSNGCRKRLIDAGAYQNLQKLSDMNVVGAKKALQRLTGNRLRSIFSRTWRE
        NEVVVKTFPIAIVSAVLAGGSNGCRKRLIDAGAYQNLQKLSDMNVVGAKKALQRLTGNRLRSIFSRTWRE
Subjt:  NEVVVKTFPIAIVSAVLAGGSNGCRKRLIDAGAYQNLQKLSDMNVVGAKKALQRLTGNRLRSIFSRTWRE

A0A6J1EXZ9 armadillo repeat-containing protein 3-like (e value 9.2e-272, 88.66% identity)
Query:  MPPTTAG---PPPSTSSSQALLDLIIHIVSLLLLSSLNVRSFVGRWQLLHSKLSTLHSALSEISDSPHWSENPLVHTILPSLLSTLQRLKSLSAQCSDSA
        MPP TAG   PPPSTSSSQ LLDLII IVSLLLLSSLNVRSFVGRWQ+LHSKLSTLHSAL E+SDSPHWSENPLVHTILPSLLSTLQRLKSLS QCSDSA
Subjt:  MPPTTAG---PPPSTSSSQALLDLIIHIVSLLLLSSLNVRSFVGRWQLLHSKLSTLHSALSEISDSPHWSENPLVHTILPSLLSTLQRLKSLSAQCSDSA

Query:  FSGGKLHMQSDLDMASASLSNQLNDLDLLLRSGVLYQSNALVLSQPAPGSNKDDTEFFIRDLFTRLQIGGTEFKKKALESLVQLLNQDEKSAGLVAKEGN
        FSGGKLHMQSDLDMASASLS+QLNDLDLLLRSGVLYQSNALVLSQP PGSNKDDTEFFIRDLFTRLQIGG EFKKKALESLVQLLNQDEKSAGLVAKEGN
Subjt:  FSGGKLHMQSDLDMASASLSNQLNDLDLLLRSGVLYQSNALVLSQPAPGSNKDDTEFFIRDLFTRLQIGGTEFKKKALESLVQLLNQDEKSAGLVAKEGN

Query:  VGYLVHLLDFNAQPSVRELAASAVSVLSTASDESRKRVFEEGGLGPLLRILETGSMHLKEKAAAAVEAITIDPENAWAVSAYGGVAILIEACRSGTPPVQ
        VGYL+HLLDF++QPSVRELAASAVSVLSTASDESRKRVFEEGGLGPLLRILETGSMHLKEKAAAAVE+ITID ENAWAVSAYGG+++LIEACRSGTP +Q
Subjt:  VGYLVHLLDFNAQPSVRELAASAVSVLSTASDESRKRVFEEGGLGPLLRILETGSMHLKEKAAAAVEAITIDPENAWAVSAYGGVAILIEACRSGTPPVQ

Query:  APAVGAIRNVTAVEDIKTSLVEEGAVPVLLQLLVSSTMATQEKAAICIAVLASSGDYFRSLIIQERGLPRLLHLIHDSPSSDTIENALRALSSLAVSDSV
        A AVGAIRNVTAVEDIKTSLVEEGA+PVLLQLLVSS   TQEKAA+ IAVLASSG+YFRS IIQERGL +LL LIHDSPS +TIEN LRALSSLAVSD +
Subjt:  APAVGAIRNVTAVEDIKTSLVEEGAVPVLLQLLVSSTMATQEKAAICIAVLASSGDYFRSLIIQERGLPRLLHLIHDSPSSDTIENALRALSSLAVSDSV

Query:  ARILSSSTLFVMKLGELVKHGNLMLQQIAASLLANLSISDGNKRAIWSCMASLVKLMEMPKPAGVQEVAVQALASLLTVRSNRKELMRDEKSVMRLMQML
        ARILSSSTLF+MKLGEL+KHG+L+LQQIAASL+ANLSISDGNKRAI SCM SLVKLMEMPKPAGVQE AV ALASLLTVRSNRKELMRDEKSVMRLMQML
Subjt:  ARILSSSTLFVMKLGELVKHGNLMLQQIAASLLANLSISDGNKRAIWSCMASLVKLMEMPKPAGVQEVAVQALASLLTVRSNRKELMRDEKSVMRLMQML

Query:  DPRNEVVVKTFPIAIVSAVLAGGSNGCRKRLIDAGAYQNLQKLSDMNVVGAKKALQRLTGNRLRSIFSRTWRE
        DP+NEV+ KTFP+AIV+AVL+GGSNGCR+RL++AGA+Q+LQ L+D NV GAKKALQRLTGNRLR+IFSRTWRE
Subjt:  DPRNEVVVKTFPIAIVSAVLAGGSNGCRKRLIDAGAYQNLQKLSDMNVVGAKKALQRLTGNRLRSIFSRTWRE

A0A6J1HXP3 armadillo repeat-containing protein 3-like (e value 4.1e-272, 89.37% identity)
Query:  MPPTTAGPP----PSTSSSQALLDLIIHIVSLLLLSSLNVRSFVGRWQLLHSKLSTLHSALSEISDSPHWSENPLVHTILPSLLSTLQRLKSLSAQCSDS
        MPPTTAGPP    PS SSSQALLDLII IVSLLLLSSLNVRSFVGRWQ+LHSKLSTLHSAL EI DSP+WSENPLV TILPSLLSTLQRLKSLS QCSDS
Subjt:  MPPTTAGPP----PSTSSSQALLDLIIHIVSLLLLSSLNVRSFVGRWQLLHSKLSTLHSALSEISDSPHWSENPLVHTILPSLLSTLQRLKSLSAQCSDS

Query:  AFSGGKLHMQSDLDMASASLSNQLNDLDLLLRSGVLYQSNALVLSQPAPGSNKDDTEFFIRDLFTRLQIGGTEFKKKALESLVQLLNQDEKSAGLVAKEG
        AFSGGKLHMQSDLDM SASLS+QLNDLDLLLRSGVLYQSNALVLSQP PGSN DDTEFFIRDLFTRLQIGG EFKKKALESLVQLLNQDEKSA LVAKEG
Subjt:  AFSGGKLHMQSDLDMASASLSNQLNDLDLLLRSGVLYQSNALVLSQPAPGSNKDDTEFFIRDLFTRLQIGGTEFKKKALESLVQLLNQDEKSAGLVAKEG

Query:  NVGYLVHLLDFNAQPSVRELAASAVSVLSTASDESRKRVFEEGGLGPLLRILETGSMHLKEKAAAAVEAITIDPENAWAVSAYGGVAILIEACRSGTPPV
        NVGYL+HLLDFNA PSVRE+AASAVSVLSTASDESR+RVFEEGGLGPLLRILETGS+HLKEKAAAAVEAIT+DPENAWAVSAYGGV +LIEACRSGTPP+
Subjt:  NVGYLVHLLDFNAQPSVRELAASAVSVLSTASDESRKRVFEEGGLGPLLRILETGSMHLKEKAAAAVEAITIDPENAWAVSAYGGVAILIEACRSGTPPV

Query:  QAPAVGAIRNVTAVEDIKTSLVEEGAVPVLLQLLVSSTMATQEKAAICIAVLASSGDYFRSLIIQERGLPRLLHLIHDSPSSDTIENALRALSSLAVSDS
        QA AVGAIRNVTAVEDIKTSLVEEGA+PVLLQ L+SST ATQEKA++ IAVLA+SG+YFRSLIIQE GL RLLHLIH SPSSDTI NALRALSSLAVSDS
Subjt:  QAPAVGAIRNVTAVEDIKTSLVEEGAVPVLLQLLVSSTMATQEKAAICIAVLASSGDYFRSLIIQERGLPRLLHLIHDSPSSDTIENALRALSSLAVSDS

Query:  VARILSSSTLFVMKLGELVKHGNLMLQQIAASLLANLSISDGNKRAIWSCMASLVKLMEMPKPAGVQEVAVQALASLLTVRSNRKELMRDEKSVMRLMQM
        VARILSSS LFVMKLGELVKHGNL+LQQIAASL+ANLSISDGNKRAI SCM SLVKLMEMPKPAGVQEVAV ALASLLTVRSNRKELM+DEKSVMRLMQM
Subjt:  VARILSSSTLFVMKLGELVKHGNLMLQQIAASLLANLSISDGNKRAIWSCMASLVKLMEMPKPAGVQEVAVQALASLLTVRSNRKELMRDEKSVMRLMQM

Query:  LDPRNEVVVKTFPIAIVSAVLAGGSNGCRKRLIDAGAYQNLQKLSDMNVVGAKKALQRLTGNRLRSIFSRTWRE
        LDP+NEVV KTFP+AIV+AVLAGGSNGCRKRLIDAGAYQ+LQ L++ +V GAKK LQRL GNRLRSIFSRTWRE
Subjt:  LDPRNEVVVKTFPIAIVSAVLAGGSNGCRKRLIDAGAYQNLQKLSDMNVVGAKKALQRLTGNRLRSIFSRTWRE

SwissProt top hits (e value, % identity, alignment)
O22193 U-box domain-containing protein 4 (e value 1.1e-14, 26.36% identity)
Query:  VLSQPAPGSNKD--DTEFFIRDLFTRLQIGGTEFKKKALESLVQLLNQDEKSAGLVAKEGNVGYLVHLLDFNAQPSVRELAASAVSVLSTASDESRKRVF
        ++S P+  + +D  + E  ++ L   L+    + +++A   L  L   +  +  ++   G +  LV LL ++   + +E A +A+  LS  +D ++K + 
Subjt:  VLSQPAPGSNKD--DTEFFIRDLFTRLQIGGTEFKKKALESLVQLLNQDEKSAGLVAKEGNVGYLVHLLDFNAQPSVRELAASAVSVLSTASDESRKRVF

Query:  EEGGLGPLLRILETGSMHLKEKAAAAVEAITIDPENAWAVSAYGGVAILIEACRSGTPPVQAPAVGAIRNVTAVEDIKTSLVEEGAVPVLLQLLVSSTMA
        + G + PL+ +LE GS   KE +AA + ++++  EN   +   G +  L++   +GTP  +  A  A+ N++  ++ K  +V+ GAV  L+ L+      
Subjt:  EEGGLGPLLRILETGSMHLKEKAAAAVEAITIDPENAWAVSAYGGVAILIEACRSGTPPVQAPAVGAIRNVTAVEDIKTSLVEEGAVPVLLQLLVSSTMA

Query:  TQEKAAICIAVLASSGDYFRSLIIQERGLPRLLHLIHDSPSSDTIENALRALSSLAVS
          +KA   +A LA+  +  R+ I QE G+P L+ ++ +  S+   ENA  AL  L+ +
Subjt:  TQEKAAICIAVLASSGDYFRSLIIQERGLPRLLHLIHDSPSSDTIENALRALSSLAVS

Q2GW27 Vacuolar protein 8 (e value 2.2e-12, 24.19% identity)
Query:  NKDDTEFF----IRDLFTRLQIGGTEFKKKALESLVQLLNQDEKSAGLVAKEGNVGYLVHLLDFNAQPSVRELAASAVSVLSTASDESRKRVFEEGGLGP
        N+ +T+FF    +R L T +     + ++ A  +  ++  +D ++         +G ++ LL+ N+   V+  A++A+  L+  +D ++  + + GGL P
Subjt:  NKDDTEFF----IRDLFTRLQIGGTEFKKKALESLVQLLNQDEKSAGLVAKEGNVGYLVHLLDFNAQPSVRELAASAVSVLSTASDESRKRVFEEGGLGP

Query:  LLRILETGSMHLKEKAAAAVEAITIDPENAWAVSAYGGVAILIEACRSGTPPVQAPAVGAIRNVTAVEDIKTSLVEEGAVPVLLQLLVSSTMATQEKAAI
        L++ + + ++ ++  A   +  +    EN   ++  G +  L    +S    VQ  A GA+ N+T  ++ +  LV  GA+PVL+QLL SS +  Q     
Subjt:  LLRILETGSMHLKEKAAAAVEAITIDPENAWAVSAYGGVAILIEACRSGTPPVQAPAVGAIRNVTAVEDIKTSLVEEGAVPVLLQLLVSSTMATQEKAAI

Query:  CIAVLASSGDYFRSLIIQERGLPRLLHLIHDSPSSDTIENALRALSSLAVSDSVARILSSSTLFVMKLGELVKHGNLMLQQIAASLLANLSISDGNKRAI
         ++ +A   +  R L   E+ L + L  + +S S      A  AL +LA SD   ++       +  L  L++   L L   A + + N+SI   N+  I
Subjt:  CIAVLASSGDYFRSLIIQERGLPRLLHLIHDSPSSDTIENALRALSSLAVSDSVARILSSSTLFVMKLGELVKHGNLMLQQIAASLLANLSISDGNKRAI

Query:  WSC--MASLVKLMEMPKPAGVQEVAVQALASLLTVRSNRKELMRDEKSVMRLMQMLDPRNEVVVKTFPIAIVSAVLAGGSNGCRKRLIDAGAYQNLQKLS
             +  LV L+       +Q  A+  L +L       K L+ +  +V +  Q++      V      AI  AVLA  S+  +  L++ G +  L  L+
Subjt:  WSC--MASLVKLMEMPKPAGVQEVAVQALASLLTVRSNRKELMRDEKSVMRLMQMLDPRNEVVVKTFPIAIVSAVLAGGSNGCRKRLIDAGAYQNLQKLS

Query:  ---DMNVVGAKKALQRLTGNRL--RSIFSRTWRE
            + V G   A      +++   SIF + W E
Subjt:  ---DMNVVGAKKALQRLTGNRL--RSIFSRTWRE

Q8GUG9 U-box domain-containing protein 11 (e value 7.6e-13, 25.82% identity)
Query:  IRDLFTRLQIGGTEFKKKALESLVQLLNQDEKSAGLVAKEGNVGYLVHLLDFNAQPSVRELAASAVSVLSTASDESRKRVFEEGGLGPLLRILETGSMHL
        IR L  RL    TE ++ A+  +  L  +   +  L+A+ G +  LV+LL  +   + +E A + V  LS   + +++ +   G +  ++++L  G+M  
Subjt:  IRDLFTRLQIGGTEFKKKALESLVQLLNQDEKSAGLVAKEGNVGYLVHLLDFNAQPSVRELAASAVSVLSTASDESRKRVFEEGGLGPLLRILETGSMHL

Query:  KEKAAAAVEAITIDPENAWAVSAYGGVAILIEACRSGTPPVQAPAVGAIRNVTAVEDIKTSLVEEGAVPVLLQLLVSSTMATQEKAAICIAVLASSGDYF
        +E AAA + ++++  EN   +   G +  L++   +GTP  +  A  A+ N+      K   V  G V  L+++L  ST       A+ I  + ++    
Subjt:  KEKAAAAVEAITIDPENAWAVSAYGGVAILIEACRSGTPPVQAPAVGAIRNVTAVEDIKTSLVEEGAVPVLLQLLVSSTMATQEKAAICIAVLASSGDYF

Query:  RSLIIQERGLPRLLHLIHDSPSSDTIENALRALSSLAVSDSVARILSSSTLFVMKLGELVKHGNLMLQQIAASLL
        +S I++   LP L+ ++    + +  ENA   L SL   D+   I       V+ L +L K+G    ++ A SLL
Subjt:  RSLIIQERGLPRLLHLIHDSPSSDTIENALRALSSLAVSDSVARILSSSTLFVMKLGELVKHGNLMLQQIAASLL

Q8VZ40 U-box domain-containing protein 14 (e value 3.4e-13, 26.6% identity)
Query:  SNKDDTEFFIRDLFTRLQIGGTEFKKKALESLVQLLNQDEKSAGLVAKEGNVGYLVHLLDFNAQPSVRELAASAVSVLSTASDESRK-RVFEEGGLGPLL
        S+ D    F+  L  +L  G TE ++ A   L  L  ++  +   +A+ G +  LV LL   + P  R    S  ++L+ + +E  K  + + G +  ++
Subjt:  SNKDDTEFFIRDLFTRLQIGGTEFKKKALESLVQLLNQDEKSAGLVAKEGNVGYLVHLLDFNAQPSVRELAASAVSVLSTASDESRK-RVFEEGGLGPLL

Query:  RILETGSMHLKEKAAAAVEAITIDPENAWAVSAYGGVAILIEACRSGTPPVQAPAVGAIRNVTAVEDIKTSLVEEGAVPVLLQLLVSSTMATQEKAAICI
         +L+ GSM  +E AAA + ++++  EN  A+ A G +  LI     GT   +  A  AI N+   +  K+  V+ G V  L +LL  +     ++A   +
Subjt:  RILETGSMHLKEKAAAAVEAITIDPENAWAVSAYGGVAILIEACRSGTPPVQAPAVGAIRNVTAVEDIKTSLVEEGAVPVLLQLLVSSTMATQEKAAICI

Query:  AVLASSGDYFRSLIIQERGLPRLLHLIHDSPSSDTIENALRALSSLAVSD----SVARILSSSTLFVMKLGELVKHGNLMLQQIAASLLANLSISDG
        A+L S+    ++ I +   +P L+ +I  + S    ENA   L  L + +    +VAR + +     + L EL ++G    ++ AASLL  +  ++G
Subjt:  AVLASSGDYFRSLIIQERGLPRLLHLIHDSPSSDTIENALRALSSLAVSD----SVARILSSSTLFVMKLGELVKHGNLMLQQIAASLLANLSISDG

Q9ZV31 U-box domain-containing protein 12 (e value 2.2e-12, 25.58% identity)
Query:  LSQP-----APGSNKDDTEFFIRDLFTRLQIGGTEFKKKALESLVQLLNQDEKSAGLVAKEGNVGYLVHLLDFNAQPSVRELAASAVSVLSTASDESRKR
        +SQP     +  S  DD    I +L  +L     E ++ A   +  L  Q+  +   +A  G +  LV+LL  +     +E A +++  LS   +   K 
Subjt:  LSQP-----APGSNKDDTEFFIRDLFTRLQIGGTEFKKKALESLVQLLNQDEKSAGLVAKEGNVGYLVHLLDFNAQPSVRELAASAVSVLSTASDESRKR

Query:  VFEEGGLGPLLRILETGSMHLKEKAAAAVEAITIDPENAWAVSAYGGVAILIEACRSGTPPVQAPAVGAIRNVTAVEDIKTSLVEEGAVPVLLQLLVSST
        V+  G +  ++ +L+ GSM  +E AAA + ++++  EN   + A G +  L+     G+   +  A  A+ N+   +  K   V  G VPVL++LL    
Subjt:  VFEEGGLGPLLRILETGSMHLKEKAAAAVEAITIDPENAWAVSAYGGVAILIEACRSGTPPVQAPAVGAIRNVTAVEDIKTSLVEEGAVPVLLQLLVSST

Query:  MATQEKAAICIAVLASSGDYFRSLIIQERGLPRLLHLIHDSPSSDTIENALRALSSLAVSDSVARILSSSTLFVMK-LGELVKHGNLMLQQIAASLLANL
            +++   +A+L+S  D  +S +     +P L+  I  S S    EN+   L  L  S +   ++ +  L +M  L E+ ++G    ++ AA LL   
Subjt:  MATQEKAAICIAVLASSGDYFRSLIIQERGLPRLLHLIHDSPSSDTIENALRALSSLAVSDSVARILSSSTLFVMK-LGELVKHGNLMLQQIAASLLANL

Query:  S
        S
Subjt:  S

Arabidopsis top hits (e value, % identity, alignment)
AT2G05810.1 ARM repeat superfamily protein (e value 5.3e-195, 63.72% identity)
Query:  PSTSSSQALLDLIIHIVSLLLLSSLNVRSFVGRWQLLHSKLSTLHSALSEISDSPHWSENPLVHTILPSLLSTLQRLKSLSAQCSDSAFSGGKLHMQSDL
        P T+  Q L+DLI +++SLLLLSSL VRSF+GRWQ+L SKL TL+S+LS +S+SPHWS+NPL+HT+LPSLLS LQRL SLS QCS ++FSGGKL MQSDL
Subjt:  PSTSSSQALLDLIIHIVSLLLLSSLNVRSFVGRWQLLHSKLSTLHSALSEISDSPHWSENPLVHTILPSLLSTLQRLKSLSAQCSDSAFSGGKLHMQSDL

Query:  DMASASLSNQLNDLDLLLRSGVLYQSNALVLSQPAPGSNKDDTEFFIRDLFTRLQIGGTEFKKKALESLVQLLNQDEKSAGLVAKEGNVGYLVHLLDFNA
        D+AS+SLS  ++DLDLLLRSGVL+Q NA+VLS P P S+KDD  FFIRDLFTRLQIGG EFKKK+LESL+QLL  +EKSA ++AKEGNVGYLV LLD + 
Subjt:  DMASASLSNQLNDLDLLLRSGVLYQSNALVLSQPAPGSNKDDTEFFIRDLFTRLQIGGTEFKKKALESLVQLLNQDEKSAGLVAKEGNVGYLVHLLDFNA

Query:  QPSVRELAASAVSVLSTASDESRKRVFEEGGLGPLLRILETGSMHLKEKAAAAVEAITIDPENAWAVSAYGGVAILIEACRSGTPPVQAPAVGAIRNVTA
         P +RE A +AVS+L+++S +SRK VFE+GGLGPLLR+LETGS   K +AA A+EAIT DP  AWA+SAYGGV +LIEACRSG+  VQ    GAI N+ A
Subjt:  QPSVRELAASAVSVLSTASDESRKRVFEEGGLGPLLRILETGSMHLKEKAAAAVEAITIDPENAWAVSAYGGVAILIEACRSGTPPVQAPAVGAIRNVTA

Query:  VEDIKTSLVEEGAVPVLLQLLVSSTMATQEKAAICIAVLASSGDYFRSLIIQER-GLPRLLHLIHDSPSSDTIENALRALSSLAVSDSVARILSSSTLFV
        VE+I+T+L EEGA+PVL+QLL+S + + QEK A  I++++SSG+Y+R LI++ER GL  L+HL+ +S + DTIE+ L ALS ++  ++V+R+LSSST F+
Subjt:  VEDIKTSLVEEGAVPVLLQLLVSSTMATQEKAAICIAVLASSGDYFRSLIIQER-GLPRLLHLIHDSPSSDTIENALRALSSLAVSDSVARILSSSTLFV

Query:  MKLGELVKHGNLMLQQIAASLLANLSISDGNKRAIWSCMASLVKLMEMPKPAGVQEVAVQALASLLTVRSNRKELMRDEKSVMRLMQMLDPRNE-VVVKT
        ++LGEL+KHGN++LQQI+ SLL+NL+ISDGNKRA+  C++SL++LME PKPAG+QE A +A  SLLTVRSNRKELMRDEKSV+RL+QMLDPRNE +  K 
Subjt:  MKLGELVKHGNLMLQQIAASLLANLSISDGNKRAIWSCMASLVKLMEMPKPAGVQEVAVQALASLLTVRSNRKELMRDEKSVMRLMQMLDPRNE-VVVKT

Query:  FPIAIVSAVLAGGSNGCRKRLIDAGAYQNLQKLSDMNVVGAKKALQRL-TGNRLRSI-FSRTWRE
         P+ +V+A+L+GGS   R +LI  GA + LQ L +M V GAKKA+QRL  GNRL+SI F+R W++
Subjt:  FPIAIVSAVLAGGSNGCRKRLIDAGAYQNLQKLSDMNVVGAKKALQRL-TGNRLRSI-FSRTWRE

AT2G05810.2 ARM repeat superfamily protein (e value 5.3e-195, 63.72% identity)
Query:  PSTSSSQALLDLIIHIVSLLLLSSLNVRSFVGRWQLLHSKLSTLHSALSEISDSPHWSENPLVHTILPSLLSTLQRLKSLSAQCSDSAFSGGKLHMQSDL
        P T+  Q L+DLI +++SLLLLSSL VRSF+GRWQ+L SKL TL+S+LS +S+SPHWS+NPL+HT+LPSLLS LQRL SLS QCS ++FSGGKL MQSDL
Subjt:  PSTSSSQALLDLIIHIVSLLLLSSLNVRSFVGRWQLLHSKLSTLHSALSEISDSPHWSENPLVHTILPSLLSTLQRLKSLSAQCSDSAFSGGKLHMQSDL

Query:  DMASASLSNQLNDLDLLLRSGVLYQSNALVLSQPAPGSNKDDTEFFIRDLFTRLQIGGTEFKKKALESLVQLLNQDEKSAGLVAKEGNVGYLVHLLDFNA
        D+AS+SLS  ++DLDLLLRSGVL+Q NA+VLS P P S+KDD  FFIRDLFTRLQIGG EFKKK+LESL+QLL  +EKSA ++AKEGNVGYLV LLD + 
Subjt:  DMASASLSNQLNDLDLLLRSGVLYQSNALVLSQPAPGSNKDDTEFFIRDLFTRLQIGGTEFKKKALESLVQLLNQDEKSAGLVAKEGNVGYLVHLLDFNA

Query:  QPSVRELAASAVSVLSTASDESRKRVFEEGGLGPLLRILETGSMHLKEKAAAAVEAITIDPENAWAVSAYGGVAILIEACRSGTPPVQAPAVGAIRNVTA
         P +RE A +AVS+L+++S +SRK VFE+GGLGPLLR+LETGS   K +AA A+EAIT DP  AWA+SAYGGV +LIEACRSG+  VQ    GAI N+ A
Subjt:  QPSVRELAASAVSVLSTASDESRKRVFEEGGLGPLLRILETGSMHLKEKAAAAVEAITIDPENAWAVSAYGGVAILIEACRSGTPPVQAPAVGAIRNVTA

Query:  VEDIKTSLVEEGAVPVLLQLLVSSTMATQEKAAICIAVLASSGDYFRSLIIQER-GLPRLLHLIHDSPSSDTIENALRALSSLAVSDSVARILSSSTLFV
        VE+I+T+L EEGA+PVL+QLL+S + + QEK A  I++++SSG+Y+R LI++ER GL  L+HL+ +S + DTIE+ L ALS ++  ++V+R+LSSST F+
Subjt:  VEDIKTSLVEEGAVPVLLQLLVSSTMATQEKAAICIAVLASSGDYFRSLIIQER-GLPRLLHLIHDSPSSDTIENALRALSSLAVSDSVARILSSSTLFV

Query:  MKLGELVKHGNLMLQQIAASLLANLSISDGNKRAIWSCMASLVKLMEMPKPAGVQEVAVQALASLLTVRSNRKELMRDEKSVMRLMQMLDPRNE-VVVKT
        ++LGEL+KHGN++LQQI+ SLL+NL+ISDGNKRA+  C++SL++LME PKPAG+QE A +A  SLLTVRSNRKELMRDEKSV+RL+QMLDPRNE +  K 
Subjt:  MKLGELVKHGNLMLQQIAASLLANLSISDGNKRAIWSCMASLVKLMEMPKPAGVQEVAVQALASLLTVRSNRKELMRDEKSVMRLMQMLDPRNE-VVVKT

Query:  FPIAIVSAVLAGGSNGCRKRLIDAGAYQNLQKLSDMNVVGAKKALQRL-TGNRLRSI-FSRTWRE
         P+ +V+A+L+GGS   R +LI  GA + LQ L +M V GAKKA+QRL  GNRL+SI F+R W++
Subjt:  FPIAIVSAVLAGGSNGCRKRLIDAGAYQNLQKLSDMNVVGAKKALQRL-TGNRLRSI-FSRTWRE

AT2G45720.1 ARM repeat superfamily protein (e value 2.8e-71, 34.4% identity)
Query:  TSSSQALLDLII---HIVSLLLLSSLNVRSFVGRWQLLHSKLSTLHSALSEISDSPHWSENPLVHTILPSLLSTLQRLKSLSAQCSDSAFSGGKLHMQSD
        T   Q + DL++    +V + L  +  V+ F  RW+++ S+L  + + LS++S  P +S++ L    L ++L TL+    L+  C  S    GKL MQSD
Subjt:  TSSSQALLDLII---HIVSLLLLSSLNVRSFVGRWQLLHSKLSTLHSALSEISDSPHWSENPLVHTILPSLLSTLQRLKSLSAQCSDSAFSGGKLHMQSD

Query:  LDMASASLSNQLNDLDLLLRSGVLYQSNALVLSQPAPGSNKDDTEFFIRDLFTRLQIGGTEFKKKALESLVQLLNQDEKSAGLVAKEGNVGYLVHLLDFN
        LD  SA +   L D  LL+++GVL +     +++P   S +D   F +R+L  RLQIG  E K+KALE LV+++ +DEK+        NV  LV LL   
Subjt:  LDMASASLSNQLNDLDLLLRSGVLYQSNALVLSQPAPGSNKDDTEFFIRDLFTRLQIGGTEFKKKALESLVQLLNQDEKSAGLVAKEGNVGYLVHLLDFN

Query:  AQPSVRELAASAVSVLSTASDESRKRVFEEGGLGPLLRILETGSMHLKEKAAAAVEAITIDPENAWAVSAYGGVAILIEACRSGTPPVQAPAVGAIRNVT
        + PSVRE A + +  L+  S      +  E  L  L+R+LE+GS+  KEKA  +++ ++I  E + ++  +GGV  LIE C++G    Q+ +   ++N++
Subjt:  AQPSVRELAASAVSVLSTASDESRKRVFEEGGLGPLLRILETGSMHLKEKAAAAVEAITIDPENAWAVSAYGGVAILIEACRSGTPPVQAPAVGAIRNVT

Query:  AVEDIKTSLVEEGAVPVLLQLL-VSSTMATQEKAAICIAVLASSGDYFRSLIIQERGLPRLLHLIHDSPSSDTIENALRALSSLAVSDSVARILSSSTLF
        AV +++ +L EEG V V++ +L     + ++E AA C+  L SS +  R  +I E G+  LL  + D P     E+ + A+ +L  S SV          
Subjt:  AVEDIKTSLVEEGAVPVLLQLL-VSSTMATQEKAAICIAVLASSGDYFRSLIIQERGLPRLLHLIHDSPSSDTIENALRALSSLAVSDSVARILSSSTLF

Query:  VMKLGELVKHGNLMLQQIAASLLANLSISDGNKRAIW--SCMASLVKLMEMPKPAGVQEVAVQALASLLTVRSNRKELMRDEKSVMRLMQMLDPRNEVVV
        +  L  ++K G++  QQ AAS +  ++ S+  KR I    C+  L++++E  K +G +EVA QA+ASL+TV  N +E+ RDEKSV  L+ +L+P      
Subjt:  VMKLGELVKHGNLMLQQIAASLLANLSISDGNKRAIW--SCMASLVKLMEMPKPAGVQEVAVQALASLLTVRSNRKELMRDEKSVMRLMQMLDPRNEVVV

Query:  KTFPIAIVSAVLAGGSNGCRKRLIDAGAYQNLQKLSDMNVVGAKKALQRLTGNRLRSIFSR
        K + ++ ++A+ +  S  C+K ++  GA   L+KLS++ V G+KK L+R+   +L+S FSR
Subjt:  KTFPIAIVSAVLAGGSNGCRKRLIDAGAYQNLQKLSDMNVVGAKKALQRLTGNRLRSIFSR

AT2G45720.2 ARM repeat superfamily protein (e value 2.8e-71, 34.4% identity)
Query:  TSSSQALLDLII---HIVSLLLLSSLNVRSFVGRWQLLHSKLSTLHSALSEISDSPHWSENPLVHTILPSLLSTLQRLKSLSAQCSDSAFSGGKLHMQSD
        T   Q + DL++    +V + L  +  V+ F  RW+++ S+L  + + LS++S  P +S++ L    L ++L TL+    L+  C  S    GKL MQSD
Subjt:  TSSSQALLDLII---HIVSLLLLSSLNVRSFVGRWQLLHSKLSTLHSALSEISDSPHWSENPLVHTILPSLLSTLQRLKSLSAQCSDSAFSGGKLHMQSD

Query:  LDMASASLSNQLNDLDLLLRSGVLYQSNALVLSQPAPGSNKDDTEFFIRDLFTRLQIGGTEFKKKALESLVQLLNQDEKSAGLVAKEGNVGYLVHLLDFN
        LD  SA +   L D  LL+++GVL +     +++P   S +D   F +R+L  RLQIG  E K+KALE LV+++ +DEK+        NV  LV LL   
Subjt:  LDMASASLSNQLNDLDLLLRSGVLYQSNALVLSQPAPGSNKDDTEFFIRDLFTRLQIGGTEFKKKALESLVQLLNQDEKSAGLVAKEGNVGYLVHLLDFN

Query:  AQPSVRELAASAVSVLSTASDESRKRVFEEGGLGPLLRILETGSMHLKEKAAAAVEAITIDPENAWAVSAYGGVAILIEACRSGTPPVQAPAVGAIRNVT
        + PSVRE A + +  L+  S      +  E  L  L+R+LE+GS+  KEKA  +++ ++I  E + ++  +GGV  LIE C++G    Q+ +   ++N++
Subjt:  AQPSVRELAASAVSVLSTASDESRKRVFEEGGLGPLLRILETGSMHLKEKAAAAVEAITIDPENAWAVSAYGGVAILIEACRSGTPPVQAPAVGAIRNVT

Query:  AVEDIKTSLVEEGAVPVLLQLL-VSSTMATQEKAAICIAVLASSGDYFRSLIIQERGLPRLLHLIHDSPSSDTIENALRALSSLAVSDSVARILSSSTLF
        AV +++ +L EEG V V++ +L     + ++E AA C+  L SS +  R  +I E G+  LL  + D P     E+ + A+ +L  S SV          
Subjt:  AVEDIKTSLVEEGAVPVLLQLL-VSSTMATQEKAAICIAVLASSGDYFRSLIIQERGLPRLLHLIHDSPSSDTIENALRALSSLAVSDSVARILSSSTLF

Query:  VMKLGELVKHGNLMLQQIAASLLANLSISDGNKRAIW--SCMASLVKLMEMPKPAGVQEVAVQALASLLTVRSNRKELMRDEKSVMRLMQMLDPRNEVVV
        +  L  ++K G++  QQ AAS +  ++ S+  KR I    C+  L++++E  K +G +EVA QA+ASL+TV  N +E+ RDEKSV  L+ +L+P      
Subjt:  VMKLGELVKHGNLMLQQIAASLLANLSISDGNKRAIW--SCMASLVKLMEMPKPAGVQEVAVQALASLLTVRSNRKELMRDEKSVMRLMQMLDPRNEVVV

Query:  KTFPIAIVSAVLAGGSNGCRKRLIDAGAYQNLQKLSDMNVVGAKKALQRLTGNRLRSIFSR
        K + ++ ++A+ +  S  C+K ++  GA   L+KLS++ V G+KK L+R+   +L+S FSR
Subjt:  KTFPIAIVSAVLAGGSNGCRKRLIDAGAYQNLQKLSDMNVVGAKKALQRLTGNRLRSIFSR

AT5G50900.1 ARM repeat superfamily protein (e value: 4.2e-67; %identity: 32.36)
Query:  IIHIVSLLLLSSLNVRSFVGRWQLLHSKLSTLHSALSEISDSPHWSENPLVHTILPSLLSTLQRLKSLSAQCSDSAFSGGKLHMQSDLDMASASLSNQLN
        +  +++ L+ S  N+ SF  +W  + +KL+ L + LS+ SD    S N L   +L S+  TL    +++A+C     + GKL  QS++D   A L   + 
Subjt:  IIHIVSLLLLSSLNVRSFVGRWQLLHSKLSTLHSALSEISDSPHWSENPLVHTILPSLLSTLQRLKSLSAQCSDSAFSGGKLHMQSDLDMASASLSNQLN

Query:  DLDLLLRSGVLYQSNALVLSQPAPGSNKDDTEFFIRDLFTRLQIGGTEFKKKALESLVQLLNQDEKSAGLVAKEGNVGYLVHLLDFNAQPSVRELAASAV
        D ++L++SG+L   N +V+S  +  S K+      R+L  RLQIGG E K  A++SL++LL +D+K+  +   +G V  LV LLD +    ++E   + +
Subjt:  DLDLLLRSGVLYQSNALVLSQPAPGSNKDDTEFFIRDLFTRLQIGGTEFKKKALESLVQLLNQDEKSAGLVAKEGNVGYLVHLLDFNAQPSVRELAASAV

Query:  SVLSTASDESRKRVFEEGG---LGPLLRILETGSMHLKEKAAAAVEAITIDPENAWAVSAYGGVAILIEACRSGTPPVQAPAVGAIRNVTAVEDIKTSLV
        S +S    ES K V    G   L  LLR+LE+GS   KEKA  A++A+++  ENA A+   GG++ L+E C+ G+P  QA A G +RN+    + K + V
Subjt:  SVLSTASDESRKRVFEEGG---LGPLLRILETGSMHLKEKAAAAVEAITIDPENAWAVSAYGGVAILIEACRSGTPPVQAPAVGAIRNVTAVEDIKTSLV

Query:  EEGAVPVLLQLLVSSTMATQEKAAICIAVLASSGDYFRSLIIQERGLPRLLHLIHDSPSSDTIENALRALSSLAVSDSVARILSSSTLFVMKLGELVKHG
        EE A+ VL+ ++ S T   QE A  C+A L S  +     +++E G+  L        S  ++E  +  L +LA+   V  ++ S   F+ +L  ++  G
Subjt:  EEGAVPVLLQLLVSSTMATQEKAAICIAVLASSGDYFRSLIIQERGLPRLLHLIHDSPSSDTIENALRALSSLAVSDSVARILSSSTLFVMKLGELVKHG

Query:  NLMLQQIAASLLANLSISDGNKRAIW--SCMASLVKLMEMPKPAGVQEVAVQALASLLTVRSNRKELMRDEKSVMRLMQMLDPRNEVVVKTFPIAIVSAV
         L ++  AA  +++L  S  +++ +    C+  L+ +++  K    +E A +AL++LL   SNRK   + +K V+ L+Q+LDP+ + + K + ++ +  +
Subjt:  NLMLQQIAASLLANLSISDGNKRAIW--SCMASLVKLMEMPKPAGVQEVAVQALASLLTVRSNRKELMRDEKSVMRLMQMLDPRNEVVVKTFPIAIVSAV

Query:  LAGGSNGCRKRLIDAGAYQNLQKLSDMNVVGAKKALQRLTGNRLRSIFSR
        +   S  CRK+++ AGA  +LQKL DM+  GAKK  + L+ +++  +F+R
Subjt:  LAGGSNGCRKRLIDAGAYQNLQKLSDMNVVGAKKALQRLTGNRLRSIFSR


Sequences
CDS sequence
ATGCCTCCGACCACCGCCGGACCGCCGCCCTCCACCTCCTCCTCCCAAGCCCTTCTCGACCTCATCATACACATTGTCTCCCTCCTCCTCCTCTCCTCCCTCAATGTCCG
CTCCTTCGTTGGCCGCTGGCAGCTTCTCCATTCCAAGCTCTCGACTCTCCACTCTGCTCTCTCTGAAATCTCCGATTCCCCCCACTGGTCCGAGAATCCCCTTGTTCATA
CCATTCTCCCCTCCCTCCTCTCCACGCTTCAGCGGCTCAAGTCTTTGTCCGCCCAATGCTCCGACTCTGCCTTCTCCGGCGGCAAGCTTCATATGCAGAGCGACTTGGAT
ATGGCCTCGGCTTCTCTCTCTAACCAACTCAACGACCTCGATTTGTTACTCAGATCTGGGGTTCTCTACCAATCGAACGCGCTAGTGCTGTCCCAGCCGGCTCCCGGTTC
GAATAAAGACGATACGGAGTTTTTCATCAGAGATCTTTTCACGCGGCTGCAAATCGGGGGGACGGAGTTCAAGAAGAAGGCTCTGGAGTCTCTGGTTCAGCTTCTGAACC
AGGATGAGAAGTCTGCGGGTTTAGTTGCCAAAGAGGGGAACGTTGGGTATTTGGTTCACTTGCTCGATTTCAACGCTCAGCCGTCTGTGCGAGAGCTAGCGGCTTCTGCG
GTTTCAGTTCTCTCCACGGCGAGCGATGAATCGAGGAAAAGAGTGTTCGAAGAAGGAGGATTAGGGCCACTGTTGAGGATTCTTGAAACAGGATCAATGCATCTGAAGGA
AAAAGCCGCCGCCGCCGTCGAAGCCATTACCATTGACCCGGAAAATGCTTGGGCGGTTTCCGCTTACGGAGGCGTAGCGATCTTGATCGAGGCCTGCCGATCAGGCACCC
CACCGGTGCAGGCTCCAGCGGTCGGTGCGATTAGGAATGTCACGGCGGTGGAAGATATCAAAACCTCTCTCGTCGAGGAAGGGGCGGTTCCAGTATTGTTACAGTTGCTA
GTTTCAAGCACGATGGCGACTCAAGAAAAGGCAGCAATCTGTATAGCAGTATTGGCCTCCTCCGGTGATTATTTCCGATCACTGATAATCCAAGAACGAGGATTACCTAG
ATTATTACATTTGATCCACGACTCACCGAGTTCAGATACGATCGAAAACGCGTTGCGAGCCCTAAGTTCTCTCGCGGTTTCCGATTCAGTTGCTCGGATTCTATCGTCTT
CAACATTATTCGTAATGAAACTCGGCGAATTGGTGAAGCACGGAAACCTAATGCTGCAGCAAATCGCCGCTTCTCTGCTCGCTAATTTATCAATCAGCGACGGGAATAAG
CGAGCTATTTGGAGTTGTATGGCTTCCTTGGTGAAGCTAATGGAGATGCCGAAGCCGGCCGGAGTTCAGGAGGTGGCGGTGCAAGCCCTTGCGTCACTCTTGACCGTCCG
ATCAAATCGGAAGGAGTTGATGAGGGACGAGAAGAGCGTGATGAGATTGATGCAGATGCTAGATCCGAGAAACGAGGTCGTCGTTAAGACGTTTCCGATAGCGATCGTCT
CGGCAGTACTCGCCGGAGGAAGCAACGGTTGCCGGAAGCGACTTATAGATGCCGGAGCTTACCAGAACCTACAGAAGCTCTCCGATATGAACGTCGTCGGAGCTAAAAAG
GCGTTGCAGCGGCTCACCGGCAATCGGCTGAGAAGCATCTTCAGCAGGACCTGGAGGGAA
mRNA sequence
ATGCCTCCGACCACCGCCGGACCGCCGCCCTCCACCTCCTCCTCCCAAGCCCTTCTCGACCTCATCATACACATTGTCTCCCTCCTCCTCCTCTCCTCCCTCAATGTCCG
CTCCTTCGTTGGCCGCTGGCAGCTTCTCCATTCCAAGCTCTCGACTCTCCACTCTGCTCTCTCTGAAATCTCCGATTCCCCCCACTGGTCCGAGAATCCCCTTGTTCATA
CCATTCTCCCCTCCCTCCTCTCCACGCTTCAGCGGCTCAAGTCTTTGTCCGCCCAATGCTCCGACTCTGCCTTCTCCGGCGGCAAGCTTCATATGCAGAGCGACTTGGAT
ATGGCCTCGGCTTCTCTCTCTAACCAACTCAACGACCTCGATTTGTTACTCAGATCTGGGGTTCTCTACCAATCGAACGCGCTAGTGCTGTCCCAGCCGGCTCCCGGTTC
GAATAAAGACGATACGGAGTTTTTCATCAGAGATCTTTTCACGCGGCTGCAAATCGGGGGGACGGAGTTCAAGAAGAAGGCTCTGGAGTCTCTGGTTCAGCTTCTGAACC
AGGATGAGAAGTCTGCGGGTTTAGTTGCCAAAGAGGGGAACGTTGGGTATTTGGTTCACTTGCTCGATTTCAACGCTCAGCCGTCTGTGCGAGAGCTAGCGGCTTCTGCG
GTTTCAGTTCTCTCCACGGCGAGCGATGAATCGAGGAAAAGAGTGTTCGAAGAAGGAGGATTAGGGCCACTGTTGAGGATTCTTGAAACAGGATCAATGCATCTGAAGGA
AAAAGCCGCCGCCGCCGTCGAAGCCATTACCATTGACCCGGAAAATGCTTGGGCGGTTTCCGCTTACGGAGGCGTAGCGATCTTGATCGAGGCCTGCCGATCAGGCACCC
CACCGGTGCAGGCTCCAGCGGTCGGTGCGATTAGGAATGTCACGGCGGTGGAAGATATCAAAACCTCTCTCGTCGAGGAAGGGGCGGTTCCAGTATTGTTACAGTTGCTA
GTTTCAAGCACGATGGCGACTCAAGAAAAGGCAGCAATCTGTATAGCAGTATTGGCCTCCTCCGGTGATTATTTCCGATCACTGATAATCCAAGAACGAGGATTACCTAG
ATTATTACATTTGATCCACGACTCACCGAGTTCAGATACGATCGAAAACGCGTTGCGAGCCCTAAGTTCTCTCGCGGTTTCCGATTCAGTTGCTCGGATTCTATCGTCTT
CAACATTATTCGTAATGAAACTCGGCGAATTGGTGAAGCACGGAAACCTAATGCTGCAGCAAATCGCCGCTTCTCTGCTCGCTAATTTATCAATCAGCGACGGGAATAAG
CGAGCTATTTGGAGTTGTATGGCTTCCTTGGTGAAGCTAATGGAGATGCCGAAGCCGGCCGGAGTTCAGGAGGTGGCGGTGCAAGCCCTTGCGTCACTCTTGACCGTCCG
ATCAAATCGGAAGGAGTTGATGAGGGACGAGAAGAGCGTGATGAGATTGATGCAGATGCTAGATCCGAGAAACGAGGTCGTCGTTAAGACGTTTCCGATAGCGATCGTCT
CGGCAGTACTCGCCGGAGGAAGCAACGGTTGCCGGAAGCGACTTATAGATGCCGGAGCTTACCAGAACCTACAGAAGCTCTCCGATATGAACGTCGTCGGAGCTAAAAAG
GCGTTGCAGCGGCTCACCGGCAATCGGCTGAGAAGCATCTTCAGCAGGACCTGGAGGGAA
Protein sequence
MPPTTAGPPPSTSSSQALLDLIIHIVSLLLLSSLNVRSFVGRWQLLHSKLSTLHSALSEISDSPHWSENPLVHTILPSLLSTLQRLKSLSAQCSDSAFSGGKLHMQSDLD
MASASLSNQLNDLDLLLRSGVLYQSNALVLSQPAPGSNKDDTEFFIRDLFTRLQIGGTEFKKKALESLVQLLNQDEKSAGLVAKEGNVGYLVHLLDFNAQPSVRELAASA
VSVLSTASDESRKRVFEEGGLGPLLRILETGSMHLKEKAAAAVEAITIDPENAWAVSAYGGVAILIEACRSGTPPVQAPAVGAIRNVTAVEDIKTSLVEEGAVPVLLQLL
VSSTMATQEKAAICIAVLASSGDYFRSLIIQERGLPRLLHLIHDSPSSDTIENALRALSSLAVSDSVARILSSSTLFVMKLGELVKHGNLMLQQIAASLLANLSISDGNK
RAIWSCMASLVKLMEMPKPAGVQEVAVQALASLLTVRSNRKELMRDEKSVMRLMQMLDPRNEVVVKTFPIAIVSAVLAGGSNGCRKRLIDAGAYQNLQKLSDMNVVGAKK
ALQRLTGNRLRSIFSRTWRE
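
The CDS and protein sequences listed above can be cross-checked against each other. The following is a minimal sketch, assuming the standard genetic code applies to this nuclear gene; the function name `translate` and the table construction are illustrative and not part of the database. It translates a CDS string codon by codon, so the first and last codons of the CDS can be compared with the ends of the protein sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (assumption: the standard table is appropriate for this gene).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS (DNA, 5'->3') into a one-letter protein string.

    Whitespace and line breaks are ignored so the wrapped sequence
    lines can be pasted in directly. '*' marks a stop codon.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First eight codons and last four codons of the CDS listed above.
print(translate("ATGCCTCCGACCACCGCCGGACCG"))  # MPPTTAGP
print(translate("ACCTGGAGGGAA"))              # TWRE
```

Both fragments agree with the start (MPPTTAGP) and end (TWRE) of the protein sequence. The listed CDS ends at the final sense codon, without a stop codon.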