CuGenDBv2

MS027537 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS027537
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: U-box domain-containing protein 4
Genome location: scaffold313:352448..355102
RNA-Seq Expression: MS027537
Synteny: MS027537
Gene Ontology terms:
GO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domains:
IPR000225 - Armadillo
IPR011989 - Armadillo-like helical
IPR016024 - Armadillo-type fold
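
The genome location field above follows a scaffold:start..end pattern. As a minimal sketch, it can be split into its parts as below. The function names are hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which this page does not state.

```python
import re


def parse_location(location):
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1


scaffold, start, end = parse_location("scaffold313:352448..355102")
print(scaffold, start, end, span_length(start, end))
```

Under that assumption the gene spans 2655 bp on scaffold313.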


Homology
GenBank top hits (columns: e-value, % identity, alignment)
XP_008460513.1 PREDICTED: U-box domain-containing protein 2 isoform X1 [Cucumis melo] | e-value: 1.5e-176 | % identity: 89.56
Query:  MDFDSTSPSGTHHHH-------DAAVVRALHLIQSDDPDSKLHAACEIRRLTKTSQRSRRHLSQSIPHLVSM---LHSPESHHHAALLALLNLAVKDEKN
        MD+  TSPS THHH        DAA+ +AL L+QSD  DSK   ACEIRRLTKTSQR RRHLSQSIPHLVSM   LHSPESH  AALLALLNLAVKDEKN
Subjt:  MDFDSTSPSGTHHHH-------DAAVVRALHLIQSDDPDSKLHAACEIRRLTKTSQRSRRHLSQSIPHLVSM---LHSPESHHHAALLALLNLAVKDEKN

Query:  KIRIVEAGALGPIVGFLQSESLNLQENAAASLLTLSASTVNKPLISAAGAIPLLVEILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPIPSIVSLLK
        KI+IVEAGALGPI+GFLQSESL LQENA ASLLTLSASTVNKPLISAAGAIPLLVEILRCGSPQAKADAVMALSNLSTLPHNLSIILDS P+P+IVSLLK
Subjt:  KIRIVEAGALGPIVGFLQSESLNLQENAAASLLTLSASTVNKPLISAAGAIPLLVEILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPIPSIVSLLK

Query:  TCKKSSKTAEKCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVSALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTL
        TCKKSSKTAEKCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAV ALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTL
Subjt:  TCKKSSKTAEKCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVSALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTL

Query:  LRLLRDSPYPRSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVSLK
        LRLLRDSPYPRSELQ DTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPT+MPIN CTSEVS K
Subjt:  LRLLRDSPYPRSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVSLK

XP_022141735.1 U-box domain-containing protein 4 [Momordica charantia] | e-value: 3.3e-200 | % identity: 99.73
Query:  MDFDSTSPSGTHHHHDAAVVRALHLIQSDDPDSKLHAACEIRRLTKTSQRSRRHLSQSIPHLVSMLHSPESHHHAALLALLNLAVKDEKNKIRIVEAGAL
        MDFDSTSPSGTHHHHDAAVVRALHLIQSDDPDSKLHAACEIRRLTKTSQRSRRHLSQSIPHLVSMLHSPESHHHAALLALLNLAVKDEKNKIRIVEAGAL
Subjt:  MDFDSTSPSGTHHHHDAAVVRALHLIQSDDPDSKLHAACEIRRLTKTSQRSRRHLSQSIPHLVSMLHSPESHHHAALLALLNLAVKDEKNKIRIVEAGAL

Query:  GPIVGFLQSESLNLQENAAASLLTLSASTVNKPLISAAGAIPLLVEILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPIPSIVSLLKTCKKSSKTAE
        GPIVGFLQSESLNLQEN+AASLLTLSASTVNKPLISAAGAIPLLVEILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPIPSIVSLLKTCKKSSKTAE
Subjt:  GPIVGFLQSESLNLQENAAASLLTLSASTVNKPLISAAGAIPLLVEILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPIPSIVSLLKTCKKSSKTAE

Query:  KCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVSALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTLLRLLRDSPYP
        KCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVSALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTLLRLLRDSPYP
Subjt:  KCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVSALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTLLRLLRDSPYP

Query:  RSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVSLK
        RSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVSLK
Subjt:  RSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVSLK

XP_022990963.1 U-box domain-containing protein 2-like [Cucurbita maxima] | e-value: 2.6e-176 | % identity: 87.98
Query:  MDFDSTSPSGT-----HHHH----------DAAVVRALHLIQSDDPDSKLHAACEIRRLTKTSQRSRRHLSQSIPHLVSML---HSPESHHHAALLALLN
        MD+  TSP+ +     HHHH          + AV +AL L+QSD PDSK   ACEIRRLTKTSQR RRHLS+SIPHLVSML   HSPESH  AALLALLN
Subjt:  MDFDSTSPSGT-----HHHH----------DAAVVRALHLIQSDDPDSKLHAACEIRRLTKTSQRSRRHLSQSIPHLVSML---HSPESHHHAALLALLN

Query:  LAVKDEKNKIRIVEAGALGPIVGFLQSESLNLQENAAASLLTLSASTVNKPLISAAGAIPLLVEILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPI
        LAVKDEKNKI+IVEAGALGPIVGF QSESL LQENA ASLLTLSASTVNKPLISAAGAIPLLVEILRCGSPQA+ADAVMALSNLSTLP NLSIILDSKPI
Subjt:  LAVKDEKNKIRIVEAGALGPIVGFLQSESLNLQENAAASLLTLSASTVNKPLISAAGAIPLLVEILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPI

Query:  PSIVSLLKTCKKSSKTAEKCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVSALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPK
        PSIVSLLKTCKKSSKTAEKCCSLIESLVGFDEGRIAL SEEGGVLAVVEVLENGSLQSRDHAV ALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPK
Subjt:  PSIVSLLKTCKKSSKTAEKCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVSALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPK

Query:  SQSKAKTLLRLLRDSPYPRSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVSLK
        SQSKAKTLLRLLRDSPYPRSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVS K
Subjt:  SQSKAKTLLRLLRDSPYPRSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVSLK

XP_023524665.1 U-box domain-containing protein 2-like [Cucurbita pepo subsp. pepo] | e-value: 4.4e-176 | % identity: 87.09
Query:  MDFDSTSPSGT------HHHH-------------DAAVVRALHLIQSDDPDSKLHAACEIRRLTKTSQRSRRHLSQSIPHLVSML---HSPESHHHAALL
        MD+  TSP+ +      HHHH             + AV +AL L+QSD PDSK   ACEIRRLTKTSQR RRHLS+SIPHLVSML   HSPESH  AALL
Subjt:  MDFDSTSPSGT------HHHH-------------DAAVVRALHLIQSDDPDSKLHAACEIRRLTKTSQRSRRHLSQSIPHLVSML---HSPESHHHAALL

Query:  ALLNLAVKDEKNKIRIVEAGALGPIVGFLQSESLNLQENAAASLLTLSASTVNKPLISAAGAIPLLVEILRCGSPQAKADAVMALSNLSTLPHNLSIILD
        ALLNLAVKDEKNKI+IVEAGALGPI+GF QSESL LQENA ASLLTLSASTVNKPLISAAGAIPLLVEILRCGSPQAKADAVMALSNLSTLP NLSIILD
Subjt:  ALLNLAVKDEKNKIRIVEAGALGPIVGFLQSESLNLQENAAASLLTLSASTVNKPLISAAGAIPLLVEILRCGSPQAKADAVMALSNLSTLPHNLSIILD

Query:  SKPIPSIVSLLKTCKKSSKTAEKCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVSALLTMCESDRCKYREPILGEGVIPGLLELTVQ
        SKPIPSIVSLLKTCKKSSKTAEKCCSLIESLVGFDEGRIAL SEEGGVLAVVEVLENGSLQSRDHAV ALLTMCESDRCKYREPILGEGVIPGLLELTVQ
Subjt:  SKPIPSIVSLLKTCKKSSKTAEKCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVSALLTMCESDRCKYREPILGEGVIPGLLELTVQ

Query:  GTPKSQSKAKTLLRLLRDSPYPRSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVSLK
        GTPKSQSKAKTLLRLLRDSPYPRSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVS K
Subjt:  GTPKSQSKAKTLLRLLRDSPYPRSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVSLK

XP_038876343.1 U-box domain-containing protein 2 [Benincasa hispida] | e-value: 1.8e-177 | % identity: 90.6
Query:  MDFDSTSPSGTHHH-------HDAAVVRALHLIQSDDPDSKLHAACEIRRLTKTSQRSRRHLSQSIPHLVSML---HSPESHHHAALLALLNLAVKDEKN
        MD+ S SPSG HH         DAAV +AL L+QSD  DSK   ACEIRRLTKTSQR RRHLSQSIPHLVSML   HSPESH  AALLALLNLAVKDEKN
Subjt:  MDFDSTSPSGTHHH-------HDAAVVRALHLIQSDDPDSKLHAACEIRRLTKTSQRSRRHLSQSIPHLVSML---HSPESHHHAALLALLNLAVKDEKN

Query:  KIRIVEAGALGPIVGFLQSESLNLQENAAASLLTLSASTVNKPLISAAGAIPLLVEILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPIPSIVSLLK
        KI+IV AGALGPI+GFLQSESL LQENA ASLLTLSASTVNKPLISAAGAIPLLVEILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKP+PSIVSLLK
Subjt:  KIRIVEAGALGPIVGFLQSESLNLQENAAASLLTLSASTVNKPLISAAGAIPLLVEILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPIPSIVSLLK

Query:  TCKKSSKTAEKCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVSALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTL
        TCKKSSKTAEKCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAV ALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTL
Subjt:  TCKKSSKTAEKCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVSALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTL

Query:  LRLLRDSPYPRSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVSLK
        LRLLRDSPYPRSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVS K
Subjt:  LRLLRDSPYPRSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVSLK

TrEMBL top hits (columns: e-value, % identity, alignment)
A0A1S3CC78 U-box domain-containing protein 2 isoform X1 | e-value: 7.3e-177 | % identity: 89.56
Query:  MDFDSTSPSGTHHHH-------DAAVVRALHLIQSDDPDSKLHAACEIRRLTKTSQRSRRHLSQSIPHLVSM---LHSPESHHHAALLALLNLAVKDEKN
        MD+  TSPS THHH        DAA+ +AL L+QSD  DSK   ACEIRRLTKTSQR RRHLSQSIPHLVSM   LHSPESH  AALLALLNLAVKDEKN
Subjt:  MDFDSTSPSGTHHHH-------DAAVVRALHLIQSDDPDSKLHAACEIRRLTKTSQRSRRHLSQSIPHLVSM---LHSPESHHHAALLALLNLAVKDEKN

Query:  KIRIVEAGALGPIVGFLQSESLNLQENAAASLLTLSASTVNKPLISAAGAIPLLVEILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPIPSIVSLLK
        KI+IVEAGALGPI+GFLQSESL LQENA ASLLTLSASTVNKPLISAAGAIPLLVEILRCGSPQAKADAVMALSNLSTLPHNLSIILDS P+P+IVSLLK
Subjt:  KIRIVEAGALGPIVGFLQSESLNLQENAAASLLTLSASTVNKPLISAAGAIPLLVEILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPIPSIVSLLK

Query:  TCKKSSKTAEKCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVSALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTL
        TCKKSSKTAEKCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAV ALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTL
Subjt:  TCKKSSKTAEKCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVSALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTL

Query:  LRLLRDSPYPRSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVSLK
        LRLLRDSPYPRSELQ DTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPT+MPIN CTSEVS K
Subjt:  LRLLRDSPYPRSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVSLK

A0A5A7V2N1 U-box domain-containing protein 2 isoform X1 | e-value: 7.3e-177 | % identity: 89.56
Query:  MDFDSTSPSGTHHHH-------DAAVVRALHLIQSDDPDSKLHAACEIRRLTKTSQRSRRHLSQSIPHLVSM---LHSPESHHHAALLALLNLAVKDEKN
        MD+  TSPS THHH        DAA+ +AL L+QSD  DSK   ACEIRRLTKTSQR RRHLSQSIPHLVSM   LHSPESH  AALLALLNLAVKDEKN
Subjt:  MDFDSTSPSGTHHHH-------DAAVVRALHLIQSDDPDSKLHAACEIRRLTKTSQRSRRHLSQSIPHLVSM---LHSPESHHHAALLALLNLAVKDEKN

Query:  KIRIVEAGALGPIVGFLQSESLNLQENAAASLLTLSASTVNKPLISAAGAIPLLVEILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPIPSIVSLLK
        KI+IVEAGALGPI+GFLQSESL LQENA ASLLTLSASTVNKPLISAAGAIPLLVEILRCGSPQAKADAVMALSNLSTLPHNLSIILDS P+P+IVSLLK
Subjt:  KIRIVEAGALGPIVGFLQSESLNLQENAAASLLTLSASTVNKPLISAAGAIPLLVEILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPIPSIVSLLK

Query:  TCKKSSKTAEKCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVSALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTL
        TCKKSSKTAEKCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAV ALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTL
Subjt:  TCKKSSKTAEKCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVSALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTL

Query:  LRLLRDSPYPRSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVSLK
        LRLLRDSPYPRSELQ DTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPT+MPIN CTSEVS K
Subjt:  LRLLRDSPYPRSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVSLK

A0A6J1CIY3 U-box domain-containing protein 4 | e-value: 1.6e-200 | % identity: 99.73
Query:  MDFDSTSPSGTHHHHDAAVVRALHLIQSDDPDSKLHAACEIRRLTKTSQRSRRHLSQSIPHLVSMLHSPESHHHAALLALLNLAVKDEKNKIRIVEAGAL
        MDFDSTSPSGTHHHHDAAVVRALHLIQSDDPDSKLHAACEIRRLTKTSQRSRRHLSQSIPHLVSMLHSPESHHHAALLALLNLAVKDEKNKIRIVEAGAL
Subjt:  MDFDSTSPSGTHHHHDAAVVRALHLIQSDDPDSKLHAACEIRRLTKTSQRSRRHLSQSIPHLVSMLHSPESHHHAALLALLNLAVKDEKNKIRIVEAGAL

Query:  GPIVGFLQSESLNLQENAAASLLTLSASTVNKPLISAAGAIPLLVEILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPIPSIVSLLKTCKKSSKTAE
        GPIVGFLQSESLNLQEN+AASLLTLSASTVNKPLISAAGAIPLLVEILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPIPSIVSLLKTCKKSSKTAE
Subjt:  GPIVGFLQSESLNLQENAAASLLTLSASTVNKPLISAAGAIPLLVEILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPIPSIVSLLKTCKKSSKTAE

Query:  KCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVSALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTLLRLLRDSPYP
        KCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVSALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTLLRLLRDSPYP
Subjt:  KCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVSALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTLLRLLRDSPYP

Query:  RSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVSLK
        RSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVSLK
Subjt:  RSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVSLK

A0A6J1GYA0 U-box domain-containing protein 2-like | e-value: 6.2e-176 | % identity: 87.47
Query:  MDFDSTSPSGTHHHH---------------DAAVVRALHLIQSDDPDSKLHAACEIRRLTKTSQRSRRHLSQSIPHLVSML---HSPESHHHAALLALLN
        MD+  TSP+ ++ HH               + AV +AL L+QSD PDSK   ACEIRRLTKTSQ  RRHLS+SIPHLVSML   HSPESH  AALLALLN
Subjt:  MDFDSTSPSGTHHHH---------------DAAVVRALHLIQSDDPDSKLHAACEIRRLTKTSQRSRRHLSQSIPHLVSML---HSPESHHHAALLALLN

Query:  LAVKDEKNKIRIVEAGALGPIVGFLQSESLNLQENAAASLLTLSASTVNKPLISAAGAIPLLVEILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPI
        LAVKDEKNKI+IVEAGALGPI+GF QSESL LQENA ASLLTLSASTVNKPLISAAGAIPLLVEILRCGSPQAKADAVMALSNLSTLP NLSIILDSKPI
Subjt:  LAVKDEKNKIRIVEAGALGPIVGFLQSESLNLQENAAASLLTLSASTVNKPLISAAGAIPLLVEILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPI

Query:  PSIVSLLKTCKKSSKTAEKCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVSALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPK
        PSIVSLLKTCKKSSKTAEKCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAV ALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPK
Subjt:  PSIVSLLKTCKKSSKTAEKCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVSALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPK

Query:  SQSKAKTLLRLLRDSPYPRSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVSLK
        SQSKAKTLLRLLRDSPYPRSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVS K
Subjt:  SQSKAKTLLRLLRDSPYPRSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVSLK

A0A6J1JUU0 U-box domain-containing protein 2-like | e-value: 1.2e-176 | % identity: 87.98
Query:  MDFDSTSPSGT-----HHHH----------DAAVVRALHLIQSDDPDSKLHAACEIRRLTKTSQRSRRHLSQSIPHLVSML---HSPESHHHAALLALLN
        MD+  TSP+ +     HHHH          + AV +AL L+QSD PDSK   ACEIRRLTKTSQR RRHLS+SIPHLVSML   HSPESH  AALLALLN
Subjt:  MDFDSTSPSGT-----HHHH----------DAAVVRALHLIQSDDPDSKLHAACEIRRLTKTSQRSRRHLSQSIPHLVSML---HSPESHHHAALLALLN

Query:  LAVKDEKNKIRIVEAGALGPIVGFLQSESLNLQENAAASLLTLSASTVNKPLISAAGAIPLLVEILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPI
        LAVKDEKNKI+IVEAGALGPIVGF QSESL LQENA ASLLTLSASTVNKPLISAAGAIPLLVEILRCGSPQA+ADAVMALSNLSTLP NLSIILDSKPI
Subjt:  LAVKDEKNKIRIVEAGALGPIVGFLQSESLNLQENAAASLLTLSASTVNKPLISAAGAIPLLVEILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPI

Query:  PSIVSLLKTCKKSSKTAEKCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVSALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPK
        PSIVSLLKTCKKSSKTAEKCCSLIESLVGFDEGRIAL SEEGGVLAVVEVLENGSLQSRDHAV ALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPK
Subjt:  PSIVSLLKTCKKSSKTAEKCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVSALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPK

Query:  SQSKAKTLLRLLRDSPYPRSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVSLK
        SQSKAKTLLRLLRDSPYPRSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVS K
Subjt:  SQSKAKTLLRLLRDSPYPRSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVSLK

SwissProt top hits (columns: e-value, % identity, alignment)
O22193 U-box domain-containing protein 4 | e-value: 2.9e-29 | % identity: 33.45
Query:  DAAVVRALHLIQSDDPDSKLHAACEIRRLTKTSQRSRRHLSQS--IPHLVSMLHSPES-HHHAALLALLNLAVKDEKNKIRIVEAGALGPIVGFLQSESL
        +  V + +  ++S   D++  A  E+R L K +  +R  +  S  I  LV +L+S +S     A+ ALLNL++ D  NK  I +AGA+ P++  L++ S 
Subjt:  DAAVVRALHLIQSDDPDSKLHAACEIRRLTKTSQRSRRHLSQS--IPHLVSMLHSPES-HHHAALLALLNLAVKDEKNKIRIVEAGALGPIVGFLQSESL

Query:  NLQENAAASLLTLSASTVNKPLISAAGAIPLLVEILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPIPSIVSLLKTCKKSSKTAEKCCSLIESLVGF
          +EN+AA+L +LS    NK  I  +GAI  LV++L  G+P+ K DA  AL NLS    N ++I+ S  +  ++ L+     ++   +K  +++ +L   
Subjt:  NLQENAAASLLTLSASTVNKPLISAAGAIPLLVEILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPIPSIVSLLKTCKKSSKTAEKCCSLIESLVGF

Query:  DEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVSALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTLLRLLRD
         EGR A+  +EGG+  +VEV+E GS + +++A +ALL +  ++  ++   +L EG +P L+ L+  GTP+++ KA+ LL   R+
Subjt:  DEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVSALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTLLRLLRD

Q5VRH9 U-box domain-containing protein 12 | e-value: 7.4e-25 | % identity: 32.55
Query:  DSTSPSGTHHHHDAAVVRALHLIQSDDPDSKLHAACEIRRLTKTSQRSRRHLSQ--SIPHLVSMLHS--PESHHHAALLALLNLAVKDEKNKIRIVEAGA
        D  +   + + H A +V  ++ ++S + D +  AA EIR L K +  +R  +++  +IP LV++L S  P +  H A+ ALLNL++  E NK  IV++ A
Subjt:  DSTSPSGTHHHHDAAVVRALHLIQSDDPDSKLHAACEIRRLTKTSQRSRRHLSQ--SIPHLVSMLHS--PESHHHAALLALLNLAVKDEKNKIRIVEAGA

Query:  LGPIVGFLQSESLNLQENAAASLLTLSASTVNKPLISAAGAIPLLVEILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPIPSIVSLLKTCKKSSKTA
        +  IV  L++ S+  +ENAAA+L +LS    NK  I AAGAIP L+ +L  GSP+ K DA  A+ NL     N    + +  +  +++ L     +    
Subjt:  LGPIVGFLQSESLNLQENAAASLLTLSASTVNKPLISAAGAIPLLVEILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPIPSIVSLLKTCKKSSKTA

Query:  EKCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVSALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTLLRLLRDS
        ++  SL+  L G  EG+I +   E  +  +VEV++ GS ++R++A + L  +C +D  +        GV   L EL+  GT +++ KA ++L L+  +
Subjt:  EKCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVSALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTLLRLLRDS

Q5XEZ8 U-box domain-containing protein 2 | e-value: 3.5e-27 | % identity: 33.68
Query:  SGTHHHHDAAVVRALHLIQSDDPDSKLHAACEIRRLTKTSQRSRRHLS--QSIPHLVSMLHSPESHHHA-ALLALLNLAVKDEKNKIRIVEAGALGPIVG
        +G+    +  V + +  ++S   D++  A   IR L + S  +R  ++  ++IP LVS+L+S +    A A+  LLNL++ D  NK  I E+GA+ P++ 
Subjt:  SGTHHHHDAAVVRALHLIQSDDPDSKLHAACEIRRLTKTSQRSRRHLS--QSIPHLVSMLHSPESHHHA-ALLALLNLAVKDEKNKIRIVEAGALGPIVG

Query:  FLQSESL-NLQENAAASLLTLSASTVNKPLISAAGAIPLLVEILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPIPSIVSLLKTCKKSSKTAEKCCS
         L++  L   + N+AA+L +LS     K  I  AGAI  LV++L  GS   K DA  AL NLS    N + ++++  +  +V L+     +    EK   
Subjt:  FLQSESL-NLQENAAASLLTLSASTVNKPLISAAGAIPLLVEILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPIPSIVSLLKTCKKSSKTAEKCCS

Query:  LIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVSALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTLLRLLR
        ++ +L    EG+IA+  EEGG+  +VEV+E GS + +++A +ALL +C +   K+   ++ EGVIP L+ LT  GT + + KA+ LL+  +
Subjt:  LIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVSALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTLLRLLR

Q8GWV5 U-box domain-containing protein 3 | e-value: 2.1e-27 | % identity: 34.78
Query:  DFDSTSPSGTHHHHDAAVVRALHLIQSDDPDSKLHAACEIRRLTKTSQRSRRHLSQ--SIPHLVSMLHSPES-HHHAALLALLNLAVKDEKNKIRIVEAG
        D D +    T H      ++ +  ++S     K  AA EIR LT  S  +R H+ +  +I  L+S+L+S E      A+ ALLNL++  E NK  IVE G
Subjt:  DFDSTSPSGTHHHHDAAVVRALHLIQSDDPDSKLHAACEIRRLTKTSQRSRRHLSQ--SIPHLVSMLHSPES-HHHAALLALLNLAVKDEKNKIRIVEAG

Query:  ALGPIVGFLQSESLNLQENAAASLLTLSASTVNKPLISAA-GAIPLLVEILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPIPSIVSLLKTCKKSSK
        A+ P+V  L + +   +EN+AASL +LS   VN+  I  +  AI  LV +L  G+ + K DA  AL NLS    N + I+ +K +  +V LL       +
Subjt:  ALGPIVGFLQSESLNLQENAAASLLTLSASTVNKPLISAA-GAIPLLVEILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPIPSIVSLLKTCKKSSK

Query:  TAEKCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVSALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTLLRLLRD
          +K  +L+ +L    EGR A+   EGG+  +VE ++ GS + +++A S LL +C +   K+   +L EG IP L+ L+  GT +++ KA+ LL   R+
Subjt:  TAEKCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVSALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTLLRLLRD

Q9C7R6 U-box domain-containing protein 17 | e-value: 5.3e-23 | % identity: 35.34
Query:  AACEIRRLTKTSQRSRRHLSQ--SIPHLVSMLHSPES-HHHAALLALLNLAVKDEKNKIRIVEAG-ALGPIVGFLQSE-SLNLQENAAASLLTLSASTVN
        AA EIR L KT + +R ++++  +IPHL  +L S  +     ++ A+LNL++  EKNK RI+E G  L  IV  L S  ++  QENAAA+L +LSA    
Subjt:  AACEIRRLTKTSQRSRRHLSQ--SIPHLVSMLHSPES-HHHAALLALLNLAVKDEKNKIRIVEAG-ALGPIVGFLQSE-SLNLQENAAASLLTLSASTVN

Query:  KPLISAAG-AIPLLVEILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPIPSIVSLLKTCKKSSKTAEKCCSLIESLVGFDEGRIALTSEEGGVLAVV
        K  I+     +  L  +L+ G+P+ K DAV AL NLST P N S +++   + S+V  L    K+   AE+    +  LV    G  A+  E+  V  ++
Subjt:  KPLISAAG-AIPLLVEILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPIPSIVSLLKTCKKSSKTAEKCCSLIESLVGFDEGRIALTSEEGGVLAVV

Query:  EVLENGSLQSRDHAVSALLTMCESDRCKYREPILGEGVIPGLLE-LTVQGTPKSQSKAKTLLRLLR
         ++  G+ + +++AV+ALL +C S      E +L    I GLL+ L   GT +++ KA +L R+ +
Subjt:  EVLENGSLQSRDHAVSALLTMCESDRCKYREPILGEGVIPGLLE-LTVQGTPKSQSKAKTLLRLLR

Arabidopsis top hits (columns: e-value, % identity, alignment)
AT2G23140.1 RING/U-box superfamily protein with ARM repeat domain | e-value: 2.0e-30 | % identity: 33.45
Query:  DAAVVRALHLIQSDDPDSKLHAACEIRRLTKTSQRSRRHLSQS--IPHLVSMLHSPES-HHHAALLALLNLAVKDEKNKIRIVEAGALGPIVGFLQSESL
        +  V + +  ++S   D++  A  E+R L K +  +R  +  S  I  LV +L+S +S     A+ ALLNL++ D  NK  I +AGA+ P++  L++ S 
Subjt:  DAAVVRALHLIQSDDPDSKLHAACEIRRLTKTSQRSRRHLSQS--IPHLVSMLHSPES-HHHAALLALLNLAVKDEKNKIRIVEAGALGPIVGFLQSESL

Query:  NLQENAAASLLTLSASTVNKPLISAAGAIPLLVEILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPIPSIVSLLKTCKKSSKTAEKCCSLIESLVGF
          +EN+AA+L +LS    NK  I  +GAI  LV++L  G+P+ K DA  AL NLS    N ++I+ S  +  ++ L+     ++   +K  +++ +L   
Subjt:  NLQENAAASLLTLSASTVNKPLISAAGAIPLLVEILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPIPSIVSLLKTCKKSSKTAEKCCSLIESLVGF

Query:  DEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVSALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTLLRLLRD
         EGR A+  +EGG+  +VEV+E GS + +++A +ALL +  ++  ++   +L EG +P L+ L+  GTP+++ KA+ LL   R+
Subjt:  DEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVSALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTLLRLLRD

AT2G23140.2 RING/U-box superfamily protein with ARM repeat domain | e-value: 2.0e-30 | % identity: 33.45
Query:  DAAVVRALHLIQSDDPDSKLHAACEIRRLTKTSQRSRRHLSQS--IPHLVSMLHSPES-HHHAALLALLNLAVKDEKNKIRIVEAGALGPIVGFLQSESL
        +  V + +  ++S   D++  A  E+R L K +  +R  +  S  I  LV +L+S +S     A+ ALLNL++ D  NK  I +AGA+ P++  L++ S 
Subjt:  DAAVVRALHLIQSDDPDSKLHAACEIRRLTKTSQRSRRHLSQS--IPHLVSMLHSPES-HHHAALLALLNLAVKDEKNKIRIVEAGALGPIVGFLQSESL

Query:  NLQENAAASLLTLSASTVNKPLISAAGAIPLLVEILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPIPSIVSLLKTCKKSSKTAEKCCSLIESLVGF
          +EN+AA+L +LS    NK  I  +GAI  LV++L  G+P+ K DA  AL NLS    N ++I+ S  +  ++ L+     ++   +K  +++ +L   
Subjt:  NLQENAAASLLTLSASTVNKPLISAAGAIPLLVEILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPIPSIVSLLKTCKKSSKTAEKCCSLIESLVGF

Query:  DEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVSALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTLLRLLRD
         EGR A+  +EGG+  +VEV+E GS + +++A +ALL +  ++  ++   +L EG +P L+ L+  GTP+++ KA+ LL   R+
Subjt:  DEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVSALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTLLRLLRD

AT3G03440.1 ARM repeat superfamily protein | e-value: 3.5e-123 | % identity: 72.65
Query:  AAVVRALHLIQSDDPDSKLHAACEIRRLTKTSQRSRRHLSQSIPHLVSMLH--SPESHHHAALLALLNLAVKDEKNKIRIVEAGALGPIVGFLQSESLNL
        A++ R L LI+S+D DS+L AA EIRRLTKTS R RRH SQ++  LVSML   SPESHH AALLALLNLAVKDEKNK+ I+EAGAL PI+ FLQS S  L
Subjt:  AAVVRALHLIQSDDPDSKLHAACEIRRLTKTSQRSRRHLSQSIPHLVSMLH--SPESHHHAALLALLNLAVKDEKNKIRIVEAGALGPIVGFLQSESLNL

Query:  QENAAASLLTLSASTVNKPLISAAGAIPLLVEILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPIPSIVSLLKTCKKSSKTAEKCCSLIESL-VGFD
        QE A+ASLLTLSAS  NKP+I A G +PLLV++++ GSPQAKADAVMALSNLSTLP NLS+IL +KP+  I++LLK+ KKSSKT+EKCCSLIE+L V  +
Subjt:  QENAAASLLTLSASTVNKPLISAAGAIPLLVEILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPIPSIVSLLKTCKKSSKTAEKCCSLIESL-VGFD

Query:  EGRIALTSEEGGVLAVVEVLENGSLQSRDHAVSALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTLLRLLRDSPYPRSELQPDTIENIV
        E R  L S+EGGVLAVVEVLENGSLQ+R+HAV  LLT+C+SDR KYREPIL EGVIPGLLELTVQGT KS+ KA+ LL LLR+S  PRSE+QPDTIENIV
Subjt:  EGRIALTSEEGGVLAVVEVLENGSLQSRDHAVSALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTLLRLLRDSPYPRSELQPDTIENIV

Query:  CNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRA
         ++IS IDG DDQS KAKKMLAEMVQVSME+SLRHLQ RA
Subjt:  CNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRA

AT4G12710.1 ARM repeat superfamily protein | e-value: 5.5e-68 | % identity: 47.6
Query:  DPDSKLHAACEIRRL-----TKTSQRSRRHLSQSIPHLVSMLHSPE-SHHHAALLALLNLAVKDEKNKIRIVEAGALGPIVGFLQSESLNLQENAAASLL
        D D ++ AA EIR+L      K+S RS+   +  IP LV ML S      HA+LLALLNLAV++E+NKI IV+AGA+ P++  L+  + +L+E A A++L
Subjt:  DPDSKLHAACEIRRL-----TKTSQRSRRHLSQSIPHLVSMLHSPE-SHHHAALLALLNLAVKDEKNKIRIVEAGALGPIVGFLQSESLNLQENAAASLL

Query:  TLSASTVNKPLISAAGAIPLLVEILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPIPSIVSLLKTCKKSSKTAEKCCSLIESLVGFDE-GRIALTSE
        TLSA+  NK +I ++G  PLL+++L  G+ Q K DAV AL NLS      + ILD+K +  ++ LLK CKK SK AEK  +L+E ++   E GR A+TS 
Subjt:  TLSASTVNKPLISAAGAIPLLVEILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPIPSIVSLLKTCKKSSKTAEKCCSLIESLVGFDE-GRIALTSE

Query:  EGGVLAVVEVLENGSLQSRDHAVSALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTLLRLLRDSPYPRSELQPDTIENIVCNIISQIDG
        E G+L +VE +E+GS  S +HAV ALL++C SDR KYR+ IL EG IPGLL  TV GT KS+ +A+ LL LLR++P  + E+ P T+E IV  I  Q+DG
Subjt:  EGGVLAVVEVLENGSLQSRDHAVSALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTLLRLLRDSPYPRSELQPDTIENIVCNIISQIDG

Query:  DDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVC
         +  +  AKK+L +MV  SME S++ +Q +A  C
Subjt:  DDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVC

AT5G14510.1 ARM repeat superfamily protein | e-value: 2.0e-33 | % identity: 34.81
Query:  SDDPDSKLHAACEIRRLTKTSQRSRRHLSQSIPHLVSMLHSPES-HHHAALLALLNLAVKDEKNKIRIVEAGALGPIVGFLQSES-LNLQENAAASLLTL
        S + +S++ AA E+  L++  QR +    + I  L+SML S +      AL ALL+LA   E+NK+RIV++GA+  ++  LQSE+ + + E A A LL L
Subjt:  SDDPDSKLHAACEIRRLTKTSQRSRRHLSQSIPHLVSMLHSPES-HHHAALLALLNLAVKDEKNKIRIVEAGALGPIVGFLQSES-LNLQENAAASLLTL

Query:  SASTVNKPLISAAGAIPLLVEILRCG--SPQAKADAVMALSNLSTLPHNLSIILDSKPIPSIVSLLKTCKKSSKTAEKCCSLIESLVGFDEGRIALTSEE
        S+   NK  +++   + LLV ++     + QAK D +  L NLSTL   + +++ S    +++ ++  C KSS+ A+K  +L+E+++       +++S  
Subjt:  SASTVNKPLISAAGAIPLLVEILRCG--SPQAKADAVMALSNLSTLPHNLSIILDSKPIPSIVSLLKTCKKSSKTAEKCCSLIESLVGFDEGRIALTSEE

Query:  GGVLAVVEVLENGSLQSRDHAVSALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTLLRLLRDSPYPRSELQPDTIENIVCNIISQIDGD
        G +  +VE +E GS Q ++HAV  LL +C +DR   R  IL EGV+PGLL+++V GT +++  A+ LL LLRD      + +   IE IV  I+ +ID +
Subjt:  GGVLAVVEVLENGSLQSRDHAVSALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTLLRLLRDSPYPRSELQPDTIENIVCNIISQIDGD

Query:  DDQ-SSKAKKMLAEMV
         ++      K++ EM+
Subjt:  DDQ-SSKAKKMLAEMV
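
In the alignments above, the unlabeled middle line marks each column: a letter for an identical residue, "+" for a similar (positive) one, and a blank for a mismatch or gap. The sketch below counts these marks for a single block. It is only an illustration: the helper name is hypothetical, the two strings are transcribed from the final AT5G14510.1 block, and the % identity reported for each hit is presumably computed over the whole alignment rather than one block.

```python
def midline_stats(midline):
    """Return (identities, positives, columns) for a BLAST-style midline."""
    identities = sum(1 for ch in midline if ch.isalpha())
    # Positives include identical residues plus '+' (similar) columns.
    positives = identities + midline.count("+")
    return identities, positives, len(midline)


# Final block of the AT5G14510.1 alignment (query line and its midline).
query = "DDQ-SSKAKKMLAEMV"
midline = " ++      K++ EM+"
assert len(query) == len(midline)

identities, positives, columns = midline_stats(midline)
percent_identity = 100 * identities / columns
print(f"{identities}/{columns} identical ({percent_identity:.2f}%), "
      f"{positives}/{columns} positive")
```

Summing identities and columns over every block of a hit, then dividing, gives a figure comparable to the reported % identity, assuming gap columns count toward the alignment length.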


Sequences
CDS sequence
ATGGATTTTGATTCCACCTCCCCCTCCGGCACCCACCACCACCACGACGCCGCCGTCGTCCGCGCCCTCCATCTCATCCAATCCGACGACCCCGATTCCAAGCTCCACGC
CGCCTGCGAAATCCGCCGCCTCACCAAAACCTCCCAACGCTCCCGCCGACACCTCTCCCAATCCATCCCCCACCTCGTCTCCATGCTCCACTCCCCGGAATCCCACCACC
ACGCCGCCCTCCTCGCCCTCCTCAACCTCGCCGTCAAAGATGAGAAGAATAAGATCAGGATTGTAGAAGCTGGTGCTTTGGGACCAATAGTTGGTTTTCTTCAATCAGAG
AGTTTGAACCTACAGGAGAATGCAGCTGCATCTTTACTCACTCTATCTGCTTCTACTGTCAACAAGCCATTAATAAGTGCTGCTGGTGCCATTCCCCTCCTTGTGGAGAT
TCTTAGATGTGGAAGCCCACAAGCCAAGGCTGATGCTGTGATGGCTCTTTCCAATCTTTCAACACTTCCACATAATCTTAGTATCATTCTAGATTCAAAGCCAATCCCTT
CAATAGTTAGTCTGCTGAAAACTTGTAAAAAATCTTCAAAAACAGCTGAAAAATGCTGCTCGCTAATTGAATCCTTAGTTGGTTTTGATGAAGGCAGAATCGCATTGACA
TCTGAAGAAGGTGGAGTTCTTGCAGTTGTAGAAGTGCTTGAGAATGGCTCTCTTCAAAGTCGTGATCACGCTGTCAGTGCACTACTGACAATGTGTGAGAGCGACCGGTG
TAAATACAGAGAACCCATCTTAGGAGAAGGGGTAATCCCTGGGCTTCTTGAACTCACTGTACAAGGAACACCGAAATCTCAGTCAAAGGCCAAAACCCTGCTGAGGTTAT
TAAGGGACTCTCCATATCCAAGATCCGAGCTTCAACCCGACACAATCGAGAATATCGTTTGTAACATCATCTCTCAGATCGACGGAGACGATGATCAATCTAGCAAAGCA
AAGAAGATGCTGGCAGAGATGGTGCAAGTGAGTATGGAGCAGAGTTTGAGGCATTTACAACGGCGAGCTCTGGTATGTACCCCCACCGATATGCCGATTAATACTTGCAC
CTCTGAAGTGTCTTTAAAGTAA
mRNA sequence
ATGGATTTTGATTCCACCTCCCCCTCCGGCACCCACCACCACCACGACGCCGCCGTCGTCCGCGCCCTCCATCTCATCCAATCCGACGACCCCGATTCCAAGCTCCACGC
CGCCTGCGAAATCCGCCGCCTCACCAAAACCTCCCAACGCTCCCGCCGACACCTCTCCCAATCCATCCCCCACCTCGTCTCCATGCTCCACTCCCCGGAATCCCACCACC
ACGCCGCCCTCCTCGCCCTCCTCAACCTCGCCGTCAAAGATGAGAAGAATAAGATCAGGATTGTAGAAGCTGGTGCTTTGGGACCAATAGTTGGTTTTCTTCAATCAGAG
AGTTTGAACCTACAGGAGAATGCAGCTGCATCTTTACTCACTCTATCTGCTTCTACTGTCAACAAGCCATTAATAAGTGCTGCTGGTGCCATTCCCCTCCTTGTGGAGAT
TCTTAGATGTGGAAGCCCACAAGCCAAGGCTGATGCTGTGATGGCTCTTTCCAATCTTTCAACACTTCCACATAATCTTAGTATCATTCTAGATTCAAAGCCAATCCCTT
CAATAGTTAGTCTGCTGAAAACTTGTAAAAAATCTTCAAAAACAGCTGAAAAATGCTGCTCGCTAATTGAATCCTTAGTTGGTTTTGATGAAGGCAGAATCGCATTGACA
TCTGAAGAAGGTGGAGTTCTTGCAGTTGTAGAAGTGCTTGAGAATGGCTCTCTTCAAAGTCGTGATCACGCTGTCAGTGCACTACTGACAATGTGTGAGAGCGACCGGTG
TAAATACAGAGAACCCATCTTAGGAGAAGGGGTAATCCCTGGGCTTCTTGAACTCACTGTACAAGGAACACCGAAATCTCAGTCAAAGGCCAAAACCCTGCTGAGGTTAT
TAAGGGACTCTCCATATCCAAGATCCGAGCTTCAACCCGACACAATCGAGAATATCGTTTGTAACATCATCTCTCAGATCGACGGAGACGATGATCAATCTAGCAAAGCA
AAGAAGATGCTGGCAGAGATGGTGCAAGTGAGTATGGAGCAGAGTTTGAGGCATTTACAACGGCGAGCTCTGGTATGTACCCCCACCGATATGCCGATTAATACTTGCAC
CTCTGAAGTGTCTTTAAAGTAA
Protein sequence
MDFDSTSPSGTHHHHDAAVVRALHLIQSDDPDSKLHAACEIRRLTKTSQRSRRHLSQSIPHLVSMLHSPESHHHAALLALLNLAVKDEKNKIRIVEAGALGPIVGFLQSE
SLNLQENAAASLLTLSASTVNKPLISAAGAIPLLVEILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPIPSIVSLLKTCKKSSKTAEKCCSLIESLVGFDEGRIALT
SEEGGVLAVVEVLENGSLQSRDHAVSALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTLLRLLRDSPYPRSELQPDTIENIVCNIISQIDGDDDQSSKA
KKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVSLK
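
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1); the function and constant names are hypothetical, and only short fragments of the CDS are used here for brevity.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame DNA sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 45 nucleotides of the CDS sequence listed above.
cds_start = "ATGGATTTTGATTCCACCTCCCCCTCCGGCACCCACCACCACCAC"
print(translate(cds_start))
```

The translated fragment matches the first 15 residues of the protein sequence (MDFDSTSPSGTHHHH). Passing the full CDS, with line breaks removed, should reproduce the whole protein followed by "*" for the terminal TAA stop codon.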