CuGenDBv2

Tan0005844 (gene) of Snake gourd v1 genome

Gene ID: Tan0005844
Organism: Trichosanthes anguina (Snake gourd v1)
Description: U-box domain-containing protein 2-like
Genome location: LG11:12377098..12380206
RNA-Seq Expression: Tan0005844
Synteny: Tan0005844
Gene Ontology terms:
  GO:0005634 - nucleus (cellular component)
  GO:0005737 - cytoplasm (cellular component)
  GO:0005515 - protein binding (molecular function)
InterPro domains:
  IPR000225 - Armadillo
  IPR011989 - Armadillo-like helical
  IPR016024 - Armadillo-type fold
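The genome location field above follows a `scaffold:start..end` pattern. The following is a minimal sketch of how such a string could be split into its parts; the helper name `parse_location` and the `Location` type are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this page does not state explicitly.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval as written in the record (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive (not stated on the page).
        return self.end - self.start + 1


# Matches strings such as "LG11:12377098..12380206".
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Split a 'seqid:start..end' string into a Location."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Location(match["seqid"], start, end)


loc = parse_location("LG11:12377098..12380206")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 3109 bp on LG11, which is longer than the mRNA listed further down, as would be expected if the locus contains introns.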


Homology
GenBank top hits | e-value | % identity
XP_008460513.1 PREDICTED: U-box domain-containing protein 2 isoform X1 [Cucumis melo] | 9.1e-193 | 94.79
Query:  MDYCTSPSSGSHHRTTATAAVSPEAAVHKALLLVQSDALDSKLQGACEIRRLTKTSQRCRRQLSESIPHLVSMLHRPHSPESHLEAALLALLNLAVKDEK
        MDYCTSPS+  HHRT  TAA SP+AA+HKALLLVQSDALDSK QGACEIRRLTKTSQRCRR LS+SIPHLVSMLHR HSPESHLEAALLALLNLAVKDEK
Subjt:  MDYCTSPSSGSHHRTTATAAVSPEAAVHKALLLVQSDALDSKLQGACEIRRLTKTSQRCRRQLSESIPHLVSMLHRPHSPESHLEAALLALLNLAVKDEK

Query:  NKIKIVEAGALGPIVGFLQSESLILQENAAASLLTLSASTVNKPLISAAGAIPLLVDILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPVPSIVSLL
        NKIKIVEAGALGPI+GFLQSESLILQENA ASLLTLSASTVNKPLISAAGAIPLLV+ILRCGSPQAKADAVMALSNLSTLPHNLSIILDS PVP+IVSLL
Subjt:  NKIKIVEAGALGPIVGFLQSESLILQENAAASLLTLSASTVNKPLISAAGAIPLLVDILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPVPSIVSLL

Query:  KTCKKSSKTAEKCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVGALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKT
        KTCKKSSKTAEKCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVGALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKT
Subjt:  KTCKKSSKTAEKCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVGALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKT

Query:  LLRLLRDSPYPRSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVSSK
        LLRLLRDSPYPRSELQ DTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPT+MPIN CTSEVSSK
Subjt:  LLRLLRDSPYPRSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVSSK

XP_022956620.1 U-box domain-containing protein 2-like [Cucurbita moschata] | 1.4e-190 | 93.56
Query:  YCTSPSSG--SHHR----TTATAAVSPEAAVHKALLLVQSDALDSKLQGACEIRRLTKTSQRCRRQLSESIPHLVSMLHRPHSPESHLEAALLALLNLAV
        YCTSP++    HHR     TATA +SPE AVHKALLLVQSD+ DSK QGACEIRRLTKTSQ CRR LSESIPHLVSMLHRPHSPESHLEAALLALLNLAV
Subjt:  YCTSPSSG--SHHR----TTATAAVSPEAAVHKALLLVQSDALDSKLQGACEIRRLTKTSQRCRRQLSESIPHLVSMLHRPHSPESHLEAALLALLNLAV

Query:  KDEKNKIKIVEAGALGPIVGFLQSESLILQENAAASLLTLSASTVNKPLISAAGAIPLLVDILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPVPSI
        KDEKNKIKIVEAGALGPI+GF QSESLILQENA ASLLTLSASTVNKPLISAAGAIPLLV+ILRCGSPQAKADAVMALSNLSTLP NLSIILDSKP+PSI
Subjt:  KDEKNKIKIVEAGALGPIVGFLQSESLILQENAAASLLTLSASTVNKPLISAAGAIPLLVDILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPVPSI

Query:  VSLLKTCKKSSKTAEKCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVGALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQS
        VSLLKTCKKSSKTAEKCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVGALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQS
Subjt:  VSLLKTCKKSSKTAEKCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVGALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQS

Query:  KAKTLLRLLRDSPYPRSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVSSK
        KAKTLLRLLRDSPYPRSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVSSK
Subjt:  KAKTLLRLLRDSPYPRSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVSSK

XP_022990963.1 U-box domain-containing protein 2-like [Cucurbita maxima] | 1.7e-191 | 93.3
Query:  YCTSPSSG------SHHRTTATAAVSPEAAVHKALLLVQSDALDSKLQGACEIRRLTKTSQRCRRQLSESIPHLVSMLHRPHSPESHLEAALLALLNLAV
        YCTSP++        HHR+T TA +SPE AVHKALLLVQSD+ DSK QGACEIRRLTKTSQRCRR LSESIPHLVSMLHRPHSPESHLEAALLALLNLAV
Subjt:  YCTSPSSG------SHHRTTATAAVSPEAAVHKALLLVQSDALDSKLQGACEIRRLTKTSQRCRRQLSESIPHLVSMLHRPHSPESHLEAALLALLNLAV

Query:  KDEKNKIKIVEAGALGPIVGFLQSESLILQENAAASLLTLSASTVNKPLISAAGAIPLLVDILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPVPSI
        KDEKNKIKIVEAGALGPIVGF QSESLILQENA ASLLTLSASTVNKPLISAAGAIPLLV+ILRCGSPQA+ADAVMALSNLSTLP NLSIILDSKP+PSI
Subjt:  KDEKNKIKIVEAGALGPIVGFLQSESLILQENAAASLLTLSASTVNKPLISAAGAIPLLVDILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPVPSI

Query:  VSLLKTCKKSSKTAEKCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVGALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQS
        VSLLKTCKKSSKTAEKCCSLIESLVGFDEGRIAL SEEGGVLAVVEVLENGSLQSRDHAVGALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQS
Subjt:  VSLLKTCKKSSKTAEKCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVGALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQS

Query:  KAKTLLRLLRDSPYPRSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVSSK
        KAKTLLRLLRDSPYPRSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVSSK
Subjt:  KAKTLLRLLRDSPYPRSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVSSK

XP_023524665.1 U-box domain-containing protein 2-like [Cucurbita pepo subsp. pepo] | 2.5e-190 | 92.09
Query:  YCTSPSSG----------SHHRTTATAAVSPEAAVHKALLLVQSDALDSKLQGACEIRRLTKTSQRCRRQLSESIPHLVSMLHRPHSPESHLEAALLALL
        YCTSP++             HR+T TA +SPE AVHKALLLVQSD+ DSK QGACEIRRLTKTSQRCRR LSESIPHLVSMLHRPHSPESHLEAALLALL
Subjt:  YCTSPSSG----------SHHRTTATAAVSPEAAVHKALLLVQSDALDSKLQGACEIRRLTKTSQRCRRQLSESIPHLVSMLHRPHSPESHLEAALLALL

Query:  NLAVKDEKNKIKIVEAGALGPIVGFLQSESLILQENAAASLLTLSASTVNKPLISAAGAIPLLVDILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKP
        NLAVKDEKNKIKIVEAGALGPI+GF QSESLILQENA ASLLTLSASTVNKPLISAAGAIPLLV+ILRCGSPQAKADAVMALSNLSTLP NLSIILDSKP
Subjt:  NLAVKDEKNKIKIVEAGALGPIVGFLQSESLILQENAAASLLTLSASTVNKPLISAAGAIPLLVDILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKP

Query:  VPSIVSLLKTCKKSSKTAEKCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVGALLTMCESDRCKYREPILGEGVIPGLLELTVQGTP
        +PSIVSLLKTCKKSSKTAEKCCSLIESLVGFDEGRIAL SEEGGVLAVVEVLENGSLQSRDHAVGALLTMCESDRCKYREPILGEGVIPGLLELTVQGTP
Subjt:  VPSIVSLLKTCKKSSKTAEKCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVGALLTMCESDRCKYREPILGEGVIPGLLELTVQGTP

Query:  KSQSKAKTLLRLLRDSPYPRSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVSSK
        KSQSKAKTLLRLLRDSPYPRSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVSSK
Subjt:  KSQSKAKTLLRLLRDSPYPRSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVSSK

XP_038876343.1 U-box domain-containing protein 2 [Benincasa hispida] | 7.7e-192 | 94.79
Query:  MDYCTSPSSGSHHRTTATAAVSPEAAVHKALLLVQSDALDSKLQGACEIRRLTKTSQRCRRQLSESIPHLVSMLHRPHSPESHLEAALLALLNLAVKDEK
        MDYC+SP SG+HHR  A+A+ SP+AAV KALLLVQSD+LDSK  GACEIRRLTKTSQRCRR LS+SIPHLVSMLHR HSPESHLEAALLALLNLAVKDEK
Subjt:  MDYCTSPSSGSHHRTTATAAVSPEAAVHKALLLVQSDALDSKLQGACEIRRLTKTSQRCRRQLSESIPHLVSMLHRPHSPESHLEAALLALLNLAVKDEK

Query:  NKIKIVEAGALGPIVGFLQSESLILQENAAASLLTLSASTVNKPLISAAGAIPLLVDILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPVPSIVSLL
        NKIKIV AGALGPI+GFLQSESLILQENA ASLLTLSASTVNKPLISAAGAIPLLV+ILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPVPSIVSLL
Subjt:  NKIKIVEAGALGPIVGFLQSESLILQENAAASLLTLSASTVNKPLISAAGAIPLLVDILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPVPSIVSLL

Query:  KTCKKSSKTAEKCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVGALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKT
        KTCKKSSKTAEKCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVGALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKT
Subjt:  KTCKKSSKTAEKCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVGALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKT

Query:  LLRLLRDSPYPRSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVSSK
        LLRLLRDSPYPRSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVSSK
Subjt:  LLRLLRDSPYPRSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVSSK

TrEMBL top hits | e-value | % identity
A0A0A0KNA6 Uncharacterized protein | 3.8e-189 | 94.01
Query:  MDYCTSPSSGSHHRTTATAAVSPEAAVHKALLLVQSDALDSKLQGACEIRRLTKTSQRCRRQLSESIPHLVSMLHRPHSPESHLEAALLALLNLAVKDEK
        MDYCTSPS+ +HHRT  ++A SP+AAV KALLLVQSDALDSK QGA EIRRLTKTSQRCRR LS+SIPHLVSMLHR HSPESHLEAALLALLNLAVKDEK
Subjt:  MDYCTSPSSGSHHRTTATAAVSPEAAVHKALLLVQSDALDSKLQGACEIRRLTKTSQRCRRQLSESIPHLVSMLHRPHSPESHLEAALLALLNLAVKDEK

Query:  NKIKIVEAGALGPIVGFLQSESLILQENAAASLLTLSASTVNKPLISAAGAIPLLVDILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPVPSIVSLL
        NKIKIVEAGALGPI+GFLQSESLILQENA ASLLTLSASTVNKPLISAAGAIPLLV+ILRCGSPQAKADAVMALSNLSTLPHNLSIILDS PVP+IVSLL
Subjt:  NKIKIVEAGALGPIVGFLQSESLILQENAAASLLTLSASTVNKPLISAAGAIPLLVDILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPVPSIVSLL

Query:  KTCKKSSKTAEKCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVGALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKT
        KTCKKSSKTAEKCCSLIE LVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVGALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKT
Subjt:  KTCKKSSKTAEKCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVGALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKT

Query:  LLRLLRDSPYPRSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVSSK
        LLRLLRDSPYPRSELQ DTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPT+MPINTCTSEVSSK
Subjt:  LLRLLRDSPYPRSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVSSK

A0A1S3CC78 U-box domain-containing protein 2 isoform X1 | 4.4e-193 | 94.79
Query:  MDYCTSPSSGSHHRTTATAAVSPEAAVHKALLLVQSDALDSKLQGACEIRRLTKTSQRCRRQLSESIPHLVSMLHRPHSPESHLEAALLALLNLAVKDEK
        MDYCTSPS+  HHRT  TAA SP+AA+HKALLLVQSDALDSK QGACEIRRLTKTSQRCRR LS+SIPHLVSMLHR HSPESHLEAALLALLNLAVKDEK
Subjt:  MDYCTSPSSGSHHRTTATAAVSPEAAVHKALLLVQSDALDSKLQGACEIRRLTKTSQRCRRQLSESIPHLVSMLHRPHSPESHLEAALLALLNLAVKDEK

Query:  NKIKIVEAGALGPIVGFLQSESLILQENAAASLLTLSASTVNKPLISAAGAIPLLVDILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPVPSIVSLL
        NKIKIVEAGALGPI+GFLQSESLILQENA ASLLTLSASTVNKPLISAAGAIPLLV+ILRCGSPQAKADAVMALSNLSTLPHNLSIILDS PVP+IVSLL
Subjt:  NKIKIVEAGALGPIVGFLQSESLILQENAAASLLTLSASTVNKPLISAAGAIPLLVDILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPVPSIVSLL

Query:  KTCKKSSKTAEKCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVGALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKT
        KTCKKSSKTAEKCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVGALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKT
Subjt:  KTCKKSSKTAEKCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVGALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKT

Query:  LLRLLRDSPYPRSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVSSK
        LLRLLRDSPYPRSELQ DTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPT+MPIN CTSEVSSK
Subjt:  LLRLLRDSPYPRSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVSSK

A0A5A7V2N1 U-box domain-containing protein 2 isoform X1 | 4.4e-193 | 94.79
Query:  MDYCTSPSSGSHHRTTATAAVSPEAAVHKALLLVQSDALDSKLQGACEIRRLTKTSQRCRRQLSESIPHLVSMLHRPHSPESHLEAALLALLNLAVKDEK
        MDYCTSPS+  HHRT  TAA SP+AA+HKALLLVQSDALDSK QGACEIRRLTKTSQRCRR LS+SIPHLVSMLHR HSPESHLEAALLALLNLAVKDEK
Subjt:  MDYCTSPSSGSHHRTTATAAVSPEAAVHKALLLVQSDALDSKLQGACEIRRLTKTSQRCRRQLSESIPHLVSMLHRPHSPESHLEAALLALLNLAVKDEK

Query:  NKIKIVEAGALGPIVGFLQSESLILQENAAASLLTLSASTVNKPLISAAGAIPLLVDILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPVPSIVSLL
        NKIKIVEAGALGPI+GFLQSESLILQENA ASLLTLSASTVNKPLISAAGAIPLLV+ILRCGSPQAKADAVMALSNLSTLPHNLSIILDS PVP+IVSLL
Subjt:  NKIKIVEAGALGPIVGFLQSESLILQENAAASLLTLSASTVNKPLISAAGAIPLLVDILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPVPSIVSLL

Query:  KTCKKSSKTAEKCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVGALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKT
        KTCKKSSKTAEKCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVGALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKT
Subjt:  KTCKKSSKTAEKCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVGALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKT

Query:  LLRLLRDSPYPRSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVSSK
        LLRLLRDSPYPRSELQ DTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPT+MPIN CTSEVSSK
Subjt:  LLRLLRDSPYPRSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVSSK

A0A6J1GYA0 U-box domain-containing protein 2-like | 7.0e-191 | 93.56
Query:  YCTSPSSG--SHHR----TTATAAVSPEAAVHKALLLVQSDALDSKLQGACEIRRLTKTSQRCRRQLSESIPHLVSMLHRPHSPESHLEAALLALLNLAV
        YCTSP++    HHR     TATA +SPE AVHKALLLVQSD+ DSK QGACEIRRLTKTSQ CRR LSESIPHLVSMLHRPHSPESHLEAALLALLNLAV
Subjt:  YCTSPSSG--SHHR----TTATAAVSPEAAVHKALLLVQSDALDSKLQGACEIRRLTKTSQRCRRQLSESIPHLVSMLHRPHSPESHLEAALLALLNLAV

Query:  KDEKNKIKIVEAGALGPIVGFLQSESLILQENAAASLLTLSASTVNKPLISAAGAIPLLVDILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPVPSI
        KDEKNKIKIVEAGALGPI+GF QSESLILQENA ASLLTLSASTVNKPLISAAGAIPLLV+ILRCGSPQAKADAVMALSNLSTLP NLSIILDSKP+PSI
Subjt:  KDEKNKIKIVEAGALGPIVGFLQSESLILQENAAASLLTLSASTVNKPLISAAGAIPLLVDILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPVPSI

Query:  VSLLKTCKKSSKTAEKCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVGALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQS
        VSLLKTCKKSSKTAEKCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVGALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQS
Subjt:  VSLLKTCKKSSKTAEKCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVGALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQS

Query:  KAKTLLRLLRDSPYPRSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVSSK
        KAKTLLRLLRDSPYPRSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVSSK
Subjt:  KAKTLLRLLRDSPYPRSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVSSK

A0A6J1JUU0 U-box domain-containing protein 2-like | 8.3e-192 | 93.3
Query:  YCTSPSSG------SHHRTTATAAVSPEAAVHKALLLVQSDALDSKLQGACEIRRLTKTSQRCRRQLSESIPHLVSMLHRPHSPESHLEAALLALLNLAV
        YCTSP++        HHR+T TA +SPE AVHKALLLVQSD+ DSK QGACEIRRLTKTSQRCRR LSESIPHLVSMLHRPHSPESHLEAALLALLNLAV
Subjt:  YCTSPSSG------SHHRTTATAAVSPEAAVHKALLLVQSDALDSKLQGACEIRRLTKTSQRCRRQLSESIPHLVSMLHRPHSPESHLEAALLALLNLAV

Query:  KDEKNKIKIVEAGALGPIVGFLQSESLILQENAAASLLTLSASTVNKPLISAAGAIPLLVDILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPVPSI
        KDEKNKIKIVEAGALGPIVGF QSESLILQENA ASLLTLSASTVNKPLISAAGAIPLLV+ILRCGSPQA+ADAVMALSNLSTLP NLSIILDSKP+PSI
Subjt:  KDEKNKIKIVEAGALGPIVGFLQSESLILQENAAASLLTLSASTVNKPLISAAGAIPLLVDILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPVPSI

Query:  VSLLKTCKKSSKTAEKCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVGALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQS
        VSLLKTCKKSSKTAEKCCSLIESLVGFDEGRIAL SEEGGVLAVVEVLENGSLQSRDHAVGALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQS
Subjt:  VSLLKTCKKSSKTAEKCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVGALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQS

Query:  KAKTLLRLLRDSPYPRSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVSSK
        KAKTLLRLLRDSPYPRSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVSSK
Subjt:  KAKTLLRLLRDSPYPRSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVSSK

SwissProt top hits | e-value | % identity
O22193 U-box domain-containing protein 4 | 6.0e-30 | 34.97
Query:  EAAVHKALLLVQSDALDSKLQGACEIRRLTKTSQRCRRQLSES--IPHLVSMLHRPHSPESHLEAALLALLNLAVKDEKNKIKIVEAGALGPIVGFLQSE
        E  V K +  ++S +LD++ Q   E+R L K +   R  +  S  I  LV +L+   S     E A+ ALLNL++ D  NK  I +AGA+ P++  L++ 
Subjt:  EAAVHKALLLVQSDALDSKLQGACEIRRLTKTSQRCRRQLSES--IPHLVSMLHRPHSPESHLEAALLALLNLAVKDEKNKIKIVEAGALGPIVGFLQSE

Query:  SLILQENAAASLLTLSASTVNKPLISAAGAIPLLVDILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPVPSIVSLLKTCKKSSKTAEKCCSLIESLV
        S   +EN+AA+L +LS    NK  I  +GAI  LVD+L  G+P+ K DA  AL NLS    N ++I+ S  V  ++ L+     ++   +K  +++ +L 
Subjt:  SLILQENAAASLLTLSASTVNKPLISAAGAIPLLVDILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPVPSIVSLLKTCKKSSKTAEKCCSLIESLV

Query:  GFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVGALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTLLRLLRD
           EGR A+  +EGG+  +VEV+E GS + +++A  ALL +  ++  ++   +L EG +P L+ L+  GTP+++ KA+ LL   R+
Subjt:  GFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVGALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTLLRLLRD

Q5VRH9 U-box domain-containing protein 12 | 1.7e-21 | 32
Query:  SHHRTTATAAVSPEAAVHKALLLVQSDALDSKLQGACEIRRLTKTSQRCRRQLSE--SIPHLVSMLHRPHSPESHLEAALLALLNLAVKDEKNKIKIVEA
        S  +  A ++    A +   +  ++S   D +   A EIR L K +   R  ++E  +IP LV++L    S     E A+ ALLNL++  E NK  IV++
Subjt:  SHHRTTATAAVSPEAAVHKALLLVQSDALDSKLQGACEIRRLTKTSQRCRRQLSE--SIPHLVSMLHRPHSPESHLEAALLALLNLAVKDEKNKIKIVEA

Query:  GALGPIVGFLQSESLILQENAAASLLTLSASTVNKPLISAAGAIPLLVDILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPVPSIVSLLKTCKKSSK
         A+  IV  L++ S+  +ENAAA+L +LS    NK  I AAGAIP L+++L  GSP+ K DA  A+ NL     N    + +  V  +++ L     +  
Subjt:  GALGPIVGFLQSESLILQENAAASLLTLSASTVNKPLISAAGAIPLLVDILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPVPSIVSLLKTCKKSSK

Query:  TAEKCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVGALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTLLRLLRDS
          ++  SL+  L G  EG+I +   E  +  +VEV++ GS ++R++A   L  +C +D  +        GV   L EL+  GT +++ KA ++L L+  +
Subjt:  TAEKCCSLIESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVGALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTLLRLLRDS

Q5XEZ8 U-box domain-containing protein 2 | 9.6e-28 | 35.64
Query:  SPEAAVHKALLLVQSDALDSKLQGACEIRRLTKTSQRCRRQLS--ESIPHLVSMLHRPHSPESHLEA-ALLALLNLAVKDEKNKIKIVEAGALGPIVGFL
        S E  V K +  ++S +LD++ +    IR L + S   R  ++  E+IP LVS+L   +S +  ++A A+  LLNL++ D  NK  I E+GA+ P++  L
Subjt:  SPEAAVHKALLLVQSDALDSKLQGACEIRRLTKTSQRCRRQLS--ESIPHLVSMLHRPHSPESHLEA-ALLALLNLAVKDEKNKIKIVEAGALGPIVGFL

Query:  QSESL-ILQENAAASLLTLSASTVNKPLISAAGAIPLLVDILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPVPSIVSLLKTCKKSSKTAEKCCSLI
        ++  L   + N+AA+L +LS     K  I  AGAI  LVD+L  GS   K DA  AL NLS    N + ++++  V  +V L+     +    EK   ++
Subjt:  QSESL-ILQENAAASLLTLSASTVNKPLISAAGAIPLLVDILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPVPSIVSLLKTCKKSSKTAEKCCSLI

Query:  ESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVGALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTLLRLLR
         +L    EG+IA+  EEGG+  +VEV+E GS + +++A  ALL +C +   K+   ++ EGVIP L+ LT  GT + + KA+ LL+  +
Subjt:  ESLVGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVGALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTLLRLLR

Q8GWV5 U-box domain-containing protein 3 | 1.1e-23 | 35.69
Query:  KLQGACEIRRLTKTSQRCRRQLSE--SIPHLVSMLHRPHSPESHLEAALLALLNLAVKDEKNKIKIVEAGALGPIVGFLQSESLILQENAAASLLTLSAS
        K   A EIR LT  S   R  +    +I  L+S+L+         E A+ ALLNL++  E NK  IVE GA+ P+V  L + +   +EN+AASL +LS  
Subjt:  KLQGACEIRRLTKTSQRCRRQLSE--SIPHLVSMLHRPHSPESHLEAALLALLNLAVKDEKNKIKIVEAGALGPIVGFLQSESLILQENAAASLLTLSAS

Query:  TVNKPLISAA-GAIPLLVDILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPVPSIVSLLKTCKKSSKTAEKCCSLIESLVGFDEGRIALTSEEGGVL
         VN+  I  +  AI  LV++L  G+ + K DA  AL NLS    N + I+ +K V  +V LL       +  +K  +L+ +L    EGR A+   EGG+ 
Subjt:  TVNKPLISAA-GAIPLLVDILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPVPSIVSLLKTCKKSSKTAEKCCSLIESLVGFDEGRIALTSEEGGVL

Query:  AVVEVLENGSLQSRDHAVGALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTLLRLLRD
         +VE ++ GS + +++A   LL +C +   K+   +L EG IP L+ L+  GT +++ KA+ LL   R+
Subjt:  AVVEVLENGSLQSRDHAVGALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTLLRLLRD

Q9C9A6 U-box domain-containing protein 10 | 7.8e-22 | 32.17
Query:  AAVHKALLLVQSDALDSKLQGACEIRRLTKTSQRCRRQLSE--SIPHLVSMLHRPHSPESHLEAALLALLNLAVKDEKNKIKIVEAGALGPIVGFLQSES
        +A+   +  + S +++ +     EIR L+K S   R  ++E  +IP LV +L      E+  E A+  +LNL++  E NK  I+ AGA+  IV  L++ S
Subjt:  AAVHKALLLVQSDALDSKLQGACEIRRLTKTSQRCRRQLSE--SIPHLVSMLHRPHSPESHLEAALLALLNLAVKDEKNKIKIVEAGALGPIVGFLQSES

Query:  LILQENAAASLLTLSASTVNKPLISAAGAIPLLVDILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPVPSIVSLLKTCKKSSKTAEKCCSLIESLVG
        +  +ENAAA+L +LS +  NK +I A+GAI  LVD+L+ GS + K DA  AL NL     N    + +  V  +V +L T   S + A++  +++  L  
Subjt:  LILQENAAASLLTLSASTVNKPLISAAGAIPLLVDILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPVPSIVSLLKTCKKSSKTAEKCCSLIESLVG

Query:  FDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVGALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTLLRLLRDS
            + A+      +  +++ L+    ++R++A   LL +C+ D  K    I   G +  L+EL+  GT +++ KA +LL LLR S
Subjt:  FDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVGALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTLLRLLRDS

Arabidopsis top hits | e-value | % identity
AT2G23140.1 RING/U-box superfamily protein with ARM repeat domain | 4.3e-31 | 34.97
Query:  EAAVHKALLLVQSDALDSKLQGACEIRRLTKTSQRCRRQLSES--IPHLVSMLHRPHSPESHLEAALLALLNLAVKDEKNKIKIVEAGALGPIVGFLQSE
        E  V K +  ++S +LD++ Q   E+R L K +   R  +  S  I  LV +L+   S     E A+ ALLNL++ D  NK  I +AGA+ P++  L++ 
Subjt:  EAAVHKALLLVQSDALDSKLQGACEIRRLTKTSQRCRRQLSES--IPHLVSMLHRPHSPESHLEAALLALLNLAVKDEKNKIKIVEAGALGPIVGFLQSE

Query:  SLILQENAAASLLTLSASTVNKPLISAAGAIPLLVDILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPVPSIVSLLKTCKKSSKTAEKCCSLIESLV
        S   +EN+AA+L +LS    NK  I  +GAI  LVD+L  G+P+ K DA  AL NLS    N ++I+ S  V  ++ L+     ++   +K  +++ +L 
Subjt:  SLILQENAAASLLTLSASTVNKPLISAAGAIPLLVDILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPVPSIVSLLKTCKKSSKTAEKCCSLIESLV

Query:  GFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVGALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTLLRLLRD
           EGR A+  +EGG+  +VEV+E GS + +++A  ALL +  ++  ++   +L EG +P L+ L+  GTP+++ KA+ LL   R+
Subjt:  GFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVGALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTLLRLLRD

AT2G23140.2 RING/U-box superfamily protein with ARM repeat domain | 4.3e-31 | 34.97
Query:  EAAVHKALLLVQSDALDSKLQGACEIRRLTKTSQRCRRQLSES--IPHLVSMLHRPHSPESHLEAALLALLNLAVKDEKNKIKIVEAGALGPIVGFLQSE
        E  V K +  ++S +LD++ Q   E+R L K +   R  +  S  I  LV +L+   S     E A+ ALLNL++ D  NK  I +AGA+ P++  L++ 
Subjt:  EAAVHKALLLVQSDALDSKLQGACEIRRLTKTSQRCRRQLSES--IPHLVSMLHRPHSPESHLEAALLALLNLAVKDEKNKIKIVEAGALGPIVGFLQSE

Query:  SLILQENAAASLLTLSASTVNKPLISAAGAIPLLVDILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPVPSIVSLLKTCKKSSKTAEKCCSLIESLV
        S   +EN+AA+L +LS    NK  I  +GAI  LVD+L  G+P+ K DA  AL NLS    N ++I+ S  V  ++ L+     ++   +K  +++ +L 
Subjt:  SLILQENAAASLLTLSASTVNKPLISAAGAIPLLVDILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPVPSIVSLLKTCKKSSKTAEKCCSLIESLV

Query:  GFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVGALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTLLRLLRD
           EGR A+  +EGG+  +VEV+E GS + +++A  ALL +  ++  ++   +L EG +P L+ L+  GTP+++ KA+ LL   R+
Subjt:  GFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVGALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTLLRLLRD

AT3G03440.1 ARM repeat superfamily protein | 4.4e-121 | 67.47
Query:  CTSPSSGS------HHRTTATAAV----SPEAAVHKALLLVQSDALDSKLQGACEIRRLTKTSQRCRRQLSESIPHLVSMLHRPHSPESHLEAALLALLN
        C SP  G       H   + +++V    S  A++ + L L++S+  DS+L  A EIRRLTKTS RCRR  S+++  LVSML R  SPESH EAALLALLN
Subjt:  CTSPSSGS------HHRTTATAAV----SPEAAVHKALLLVQSDALDSKLQGACEIRRLTKTSQRCRRQLSESIPHLVSMLHRPHSPESHLEAALLALLN

Query:  LAVKDEKNKIKIVEAGALGPIVGFLQSESLILQENAAASLLTLSASTVNKPLISAAGAIPLLVDILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPV
        LAVKDEKNK+ I+EAGAL PI+ FLQS S  LQE A+ASLLTLSAS  NKP+I A G +PLLV +++ GSPQAKADAVMALSNLSTLP NLS+IL +KP+
Subjt:  LAVKDEKNKIKIVEAGALGPIVGFLQSESLILQENAAASLLTLSASTVNKPLISAAGAIPLLVDILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPV

Query:  PSIVSLLKTCKKSSKTAEKCCSLIESL-VGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVGALLTMCESDRCKYREPILGEGVIPGLLELTVQGTP
          I++LLK+ KKSSKT+EKCCSLIE+L V  +E R  L S+EGGVLAVVEVLENGSLQ+R+HAVG LLT+C+SDR KYREPIL EGVIPGLLELTVQGT 
Subjt:  PSIVSLLKTCKKSSKTAEKCCSLIESL-VGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVGALLTMCESDRCKYREPILGEGVIPGLLELTVQGTP

Query:  KSQSKAKTLLRLLRDSPYPRSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRA
        KS+ KA+ LL LLR+S  PRSE+QPDTIENIV ++IS IDG DDQS KAKKMLAEMVQVSME+SLRHLQ RA
Subjt:  KSQSKAKTLLRLLRDSPYPRSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRA

AT4G12710.1 ARM repeat superfamily protein | 2.8e-67 | 45.83
Query:  TTATAAVSPEAAV-HKALLLVQSDALDSKLQGACEIRRL-----TKTSQRCRRQLSESIPHLVSMLHRPHSPESHLEAALLALLNLAVKDEKNKIKIVEA
        TT T   + E  + H +  L+  D LD +++ A EIR+L      K+S R +   +  IP LV ML   +    H  A+LLALLNLAV++E+NKI+IV+A
Subjt:  TTATAAVSPEAAV-HKALLLVQSDALDSKLQGACEIRRL-----TKTSQRCRRQLSESIPHLVSMLHRPHSPESHLEAALLALLNLAVKDEKNKIKIVEA

Query:  GALGPIVGFLQSESLILQENAAASLLTLSASTVNKPLISAAGAIPLLVDILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPVPSIVSLLKTCKKSSK
        GA+ P++  L+  +  L+E A A++LTLSA+  NK +I ++G  PLL+ +L  G+ Q K DAV AL NLS      + ILD+K V  ++ LLK CKK SK
Subjt:  GALGPIVGFLQSESLILQENAAASLLTLSASTVNKPLISAAGAIPLLVDILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPVPSIVSLLKTCKKSSK

Query:  TAEKCCSLIESLVGFDE-GRIALTSEEGGVLAVVEVLENGSLQSRDHAVGALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTLLRLLRD
         AEK  +L+E ++   E GR A+TS E G+L +VE +E+GS  S +HAVGALL++C SDR KYR+ IL EG IPGLL  TV GT KS+ +A+ LL LLR+
Subjt:  TAEKCCSLIESLVGFDE-GRIALTSEEGGVLAVVEVLENGSLQSRDHAVGALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTLLRLLRD

Query:  SPYPRSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVC
        +P  + E+ P T+E IV  I  Q+DG +  +  AKK+L +MV  SME S++ +Q +A  C
Subjt:  SPYPRSELQPDTIENIVCNIISQIDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVC

AT5G14510.1 ARM repeat superfamily protein | 1.4e-34 | 34.71
Query:  DSKLQGACEIRRLTKTSQRCRRQLSESIPHLVSMLHRPHSPESHLEAALLALLNLAVKDEKNKIKIVEAGALGPIVGFLQSES-LILQENAAASLLTLSA
        +S+++ A E+  L++  QR +    E I  L+SML       +  E AL ALL+LA   E+NK++IV++GA+  ++  LQSE+ +++ E A A LL LS+
Subjt:  DSKLQGACEIRRLTKTSQRCRRQLSESIPHLVSMLHRPHSPESHLEAALLALLNLAVKDEKNKIKIVEAGALGPIVGFLQSES-LILQENAAASLLTLSA

Query:  STVNKPLISAAGAIPLLVDILRCG--SPQAKADAVMALSNLSTLPHNLSIILDSKPVPSIVSLLKTCKKSSKTAEKCCSLIESLVGFDEGRIALTSEEGG
           NK  +++   + LLV ++     + QAK D +  L NLSTL   + +++ S    +++ ++  C KSS+ A+K  +L+E+++       +++S  G 
Subjt:  STVNKPLISAAGAIPLLVDILRCG--SPQAKADAVMALSNLSTLPHNLSIILDSKPVPSIVSLLKTCKKSSKTAEKCCSLIESLVGFDEGRIALTSEEGG

Query:  VLAVVEVLENGSLQSRDHAVGALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTLLRLLRDSPYPRSELQPDTIENIVCNIISQIDGDDD
        +  +VE +E GS Q ++HAVG LL +C +DR   R  IL EGV+PGLL+++V GT +++  A+ LL LLRD      + +   IE IV  I+ +ID + +
Subjt:  VLAVVEVLENGSLQSRDHAVGALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTLLRLLRDSPYPRSELQPDTIENIVCNIISQIDGDDD

Query:  Q-SSKAKKMLAEMV
        +      K++ EM+
Subjt:  Q-SSKAKKMLAEMV
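In the alignment blocks above, the unlabeled middle line of each triplet follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch, identities and positives for one block can be counted from that line alone. The function name `midline_stats` is hypothetical, and the counts are only reliable if the midline whitespace survived extraction and the string is padded to the full width of the block.

```python
def midline_stats(midline: str) -> dict:
    """Count identities and positives in one BLAST-style midline.

    The midline must not include the leading indent and must be as long
    as the aligned query segment (trailing spaces included).
    """
    identities = sum(ch.isalpha() for ch in midline)  # identical residues
    conservative = midline.count("+")                 # conservative substitutions
    return {
        "length": len(midline),
        "identities": identities,
        "positives": identities + conservative,
    }


# Midline of the final AT5G14510.1 block, aligned to the query segment "Q-SSKAKKMLAEMV".
stats = midline_stats("+      K++ EM+")
print(stats)
print(f"{100 * stats['identities'] / stats['length']:.1f}% identity in this block")
```

The % identity column reported for each hit is computed over the whole alignment, so a per-block figure like this one will generally differ from it.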


Sequences
CDS sequence
ATGGATTATTGTACCTCTCCCTCCTCCGGGAGCCACCACCGGACCACCGCCACCGCCGCCGTCTCTCCTGAGGCCGCCGTCCACAAGGCGCTCCTCCTCGTCCAATCCGA
CGCCCTCGATTCCAAGCTCCAAGGCGCCTGCGAAATTCGCCGACTCACCAAAACCTCCCAACGCTGCCGCCGCCAGCTCTCTGAATCAATCCCTCATCTCGTCTCCATGC
TCCACCGTCCCCACTCCCCTGAGTCCCACCTCGAGGCTGCTCTCCTCGCCCTTCTCAACCTCGCCGTCAAAGATGAAAAGAATAAGATCAAGATTGTAGAAGCTGGTGCC
TTGGGACCAATAGTTGGTTTTCTTCAATCAGAGAGTTTGATCCTACAGGAGAATGCAGCTGCATCCTTACTCACTCTATCTGCTTCTACTGTCAATAAGCCATTAATAAG
TGCTGCTGGTGCCATTCCCCTCCTAGTGGATATTCTTAGATGTGGAAGCCCACAAGCAAAGGCAGATGCTGTAATGGCTCTTTCCAATCTTTCAACACTTCCACATAATC
TTAGCATCATTCTAGATTCAAAGCCAGTCCCATCAATAGTTAGTCTGCTGAAAACTTGTAAAAAATCTTCAAAAACAGCAGAAAAATGCTGCTCACTGATTGAATCCTTA
GTTGGTTTTGATGAAGGCAGAATAGCATTGACATCTGAGGAAGGTGGAGTTCTTGCAGTTGTGGAAGTGCTTGAGAATGGCTCTCTTCAAAGTCGTGACCATGCTGTCGG
TGCACTACTGACAATGTGTGAGAGCGACCGGTGTAAATACAGAGAACCTATCTTAGGAGAAGGGGTAATCCCTGGCCTTCTTGAACTCACTGTCCAAGGAACACCTAAAT
CTCAGTCAAAGGCTAAAACCCTCTTGAGGTTACTAAGGGACTCTCCATATCCAAGATCTGAGCTTCAACCCGACACAATCGAGAACATCGTTTGTAACATCATCTCTCAG
ATCGATGGAGACGACGATCAATCTAGTAAAGCAAAGAAGATGCTGGCAGAGATGGTGCAAGTGAGTATGGAGCAGAGCTTGAGGCATTTACAACGAAGGGCTCTGGTATG
CACGCCCACTGATATGCCTATTAATACTTGCACCTCTGAAGTTTCTTCAAAGTAA
mRNA sequence
CTTCCTCTTAAATTTCACTCCCATTTCTCTCTCTCTCATCTTTCTCTCTCATCTCTCAACTGTCTCACAGAATTTTCAATCCTTCTTCTTCCTCATGGATTATTGTACCT
CTCCCTCCTCCGGGAGCCACCACCGGACCACCGCCACCGCCGCCGTCTCTCCTGAGGCCGCCGTCCACAAGGCGCTCCTCCTCGTCCAATCCGACGCCCTCGATTCCAAG
CTCCAAGGCGCCTGCGAAATTCGCCGACTCACCAAAACCTCCCAACGCTGCCGCCGCCAGCTCTCTGAATCAATCCCTCATCTCGTCTCCATGCTCCACCGTCCCCACTC
CCCTGAGTCCCACCTCGAGGCTGCTCTCCTCGCCCTTCTCAACCTCGCCGTCAAAGATGAAAAGAATAAGATCAAGATTGTAGAAGCTGGTGCCTTGGGACCAATAGTTG
GTTTTCTTCAATCAGAGAGTTTGATCCTACAGGAGAATGCAGCTGCATCCTTACTCACTCTATCTGCTTCTACTGTCAATAAGCCATTAATAAGTGCTGCTGGTGCCATT
CCCCTCCTAGTGGATATTCTTAGATGTGGAAGCCCACAAGCAAAGGCAGATGCTGTAATGGCTCTTTCCAATCTTTCAACACTTCCACATAATCTTAGCATCATTCTAGA
TTCAAAGCCAGTCCCATCAATAGTTAGTCTGCTGAAAACTTGTAAAAAATCTTCAAAAACAGCAGAAAAATGCTGCTCACTGATTGAATCCTTAGTTGGTTTTGATGAAG
GCAGAATAGCATTGACATCTGAGGAAGGTGGAGTTCTTGCAGTTGTGGAAGTGCTTGAGAATGGCTCTCTTCAAAGTCGTGACCATGCTGTCGGTGCACTACTGACAATG
TGTGAGAGCGACCGGTGTAAATACAGAGAACCTATCTTAGGAGAAGGGGTAATCCCTGGCCTTCTTGAACTCACTGTCCAAGGAACACCTAAATCTCAGTCAAAGGCTAA
AACCCTCTTGAGGTTACTAAGGGACTCTCCATATCCAAGATCTGAGCTTCAACCCGACACAATCGAGAACATCGTTTGTAACATCATCTCTCAGATCGATGGAGACGACG
ATCAATCTAGTAAAGCAAAGAAGATGCTGGCAGAGATGGTGCAAGTGAGTATGGAGCAGAGCTTGAGGCATTTACAACGAAGGGCTCTGGTATGCACGCCCACTGATATG
CCTATTAATACTTGCACCTCTGAAGTTTCTTCAAAGTAACAAAATTTGTACTCATGCTTTGTCCATGGAGTTTGTAAGATAAAAACTTTTATAGTTGTTACTGTTAGAGA
ACAGGAAATGAAAATAAGATATAGGAAAGCAAGCCAGTCCAGGCAGGTTGTGGGAATGTCGGGAGACTGGTAAATGTCAATAAAACTGGAAATGAGATCTGTATGAGGTT
GAGATCATAGTTCTTTTTTTCTTTTTGCTTGTTCTGTAAAGCAACTGTTGTATTACAGTATAGAAATAGTGAAGTTTAAAAGTGTTCATTCCCCTTCTGATCCTGAATTG
TCCCGTGTTATTACAGTCCAGAAATATCAAATAGTACACTTAACACA
Protein sequence
MDYCTSPSSGSHHRTTATAAVSPEAAVHKALLLVQSDALDSKLQGACEIRRLTKTSQRCRRQLSESIPHLVSMLHRPHSPESHLEAALLALLNLAVKDEKNKIKIVEAGA
LGPIVGFLQSESLILQENAAASLLTLSASTVNKPLISAAGAIPLLVDILRCGSPQAKADAVMALSNLSTLPHNLSIILDSKPVPSIVSLLKTCKKSSKTAEKCCSLIESL
VGFDEGRIALTSEEGGVLAVVEVLENGSLQSRDHAVGALLTMCESDRCKYREPILGEGVIPGLLELTVQGTPKSQSKAKTLLRLLRDSPYPRSELQPDTIENIVCNIISQ
IDGDDDQSSKAKKMLAEMVQVSMEQSLRHLQRRALVCTPTDMPINTCTSEVSSK
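The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The sketch below assumes the standard nuclear code applies and uses only the first 30 nucleotides of the CDS to stay short; the names `CODON_TABLE` and `translate` are illustrative and do not come from any particular library.

```python
from itertools import product

# Standard genetic code, listed in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS sequence listed above.
cds_start = "ATGGATTATTGTACCTCTCCCTCCTCCGGG"
print(translate(cds_start))  # MDYCTSPSSG, the first ten residues of the protein
```

Passing the full CDS, with line breaks removed, should reproduce the complete protein sequence up to the terminal TAA stop codon.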