; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0005634 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0005634
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionU-box domain-containing protein 4
Genome locationchr01:12998218..12999629
RNA-Seq ExpressionPI0005634
SyntenyPI0005634
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR000225 - Armadillo
IPR011989 - Armadillo-like helical
IPR016024 - Armadillo-type fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048717.1 vacuolar protein 8 [Cucumis melo var. makuwa]4.8e-22590Show/hide
Query:  MGDTKIHRASEKPNLDWEASLKQYENVMASESEALKVKATIKLAQLSKNAPENILKSAIPIIAKPLEDNPTNNAGQSMRGAAAYCLRCISCRGDGTLATA
        MGDTKIHRASE PNLDWEASLKQYENVMASESEALKVKATIKLAQLSKNAPENILKSAIPIIAKPLED PTNNA QSMR AAAYCLRCISC+GDGTLA A
Subjt:  MGDTKIHRASEKPNLDWEASLKQYENVMASESEALKVKATIKLAQLSKNAPENILKSAIPIIAKPLEDNPTNNAGQSMRGAAAYCLRCISCRGDGTLATA

Query:  VGHSGALESLLRSLLHSSGCFCKILVKCIWSIVTFDQSSRVIIARNGGLEVIICMLVSVIDGTRRYLLEILSAMALLREVRKALIRLRGLPFLVQAARFG
        VGHSGALESLLRSLLHSSGCFCKILVKCIWS VTFD+SSRVI+ RNGGLEVIICMLV V DGTRRYLLEILSAMALLREVRKAL+ LRGLPFLVQAARFG
Subjt:  VGHSGALESLLRSLLHSSGCFCKILVKCIWSIVTFDQSSRVIIARNGGLEVIICMLVSVIDGTRRYLLEILSAMALLREVRKALIRLRGLPFLVQAARFG

Query:  CMASRERACKAIGLIAITKRGRNKLVELGVVPVLIELLREGDYVTKLVAGNALGIVSAHLAYIRPAAQAGAIPLFADLLQWPDPICKEIAEDVFCLLAVA
         MASRERAC+AIGLIAITKRGRNKLVELGVV VLIEL+REGDYVTKLVAGNALGIVSAHLAYIRP AQAGAIPLFADLLQ PDPICKEIAEDVFCLLAVA
Subjt:  CMASRERACKAIGLIAITKRGRNKLVELGVVPVLIELLREGDYVTKLVAGNALGIVSAHLAYIRPAAQAGAIPLFADLLQWPDPICKEIAEDVFCLLAVA

Query:  ETNAVVISDHLVKVLKEGDDRGKAAAADVLRILSSYKYSISVVQNSGAIPVLVDLLHDGNH----------------ETDRVALADAGAIQRLIGLLQDE
        ETNAVVISDHLVKVLKEGDDRGKAAAADVLRILSSYKYSISVVQNSGAIPVLVDLLHDGNH                ETDRVALADAGAIQ LIGLLQDE
Subjt:  ETNAVVISDHLVKVLKEGDDRGKAAAADVLRILSSYKYSISVVQNSGAIPVLVDLLHDGNH----------------ETDRVALADAGAIQRLIGLLQDE

Query:  LEKVKLNVTETLTSFSKDPLYCTRVSEAVSSPAFQNLQEILTHSRATEWQSMVSVHQLGINQFTFDSDLL
        LEKVKLNVTE LTSFSKDPLY TR+SE VS+PAFQNLQE +TH RATEWQSMVSVHQLGINQFTFD DL+
Subjt:  LEKVKLNVTETLTSFSKDPLYCTRVSEAVSSPAFQNLQEILTHSRATEWQSMVSVHQLGINQFTFDSDLL

XP_008462971.1 PREDICTED: uncharacterized protein LOC103501229 [Cucumis melo]2.7e-21289.96Show/hide
Query:  MGDTKIHRASEKPNLDWEASLKQYENVMASESEALKVKATIKLAQLSKNAPENILKSAIPIIAKPLEDNPTNNAGQSMRGAAAYCLRCISCRGDGTLATA
        MGDTKIHRASE PNLDWEASLKQYENVMASESEALKVKATIKLAQLSKNAPENILKSAIPIIAKPLED PTNNA QSMR AAAYCLRCISC+GDGTLA A
Subjt:  MGDTKIHRASEKPNLDWEASLKQYENVMASESEALKVKATIKLAQLSKNAPENILKSAIPIIAKPLEDNPTNNAGQSMRGAAAYCLRCISCRGDGTLATA

Query:  VGHSGALESLLRSLLHSSGCFCKILVKCIWSIVTFDQSSRVIIARNGGLEVIICMLVSVIDGTRRYLLEILSAMALLREVRKALIRLRGLPFLVQAARFG
        VGHSGALESLLRSLLHSSGCFCKILVKCIWS VTFD+SSRVI+ARNGGLEVIICMLV V DGTRRYLLEILSAMALLREVRKAL+ LRGLPFLVQAARFG
Subjt:  VGHSGALESLLRSLLHSSGCFCKILVKCIWSIVTFDQSSRVIIARNGGLEVIICMLVSVIDGTRRYLLEILSAMALLREVRKALIRLRGLPFLVQAARFG

Query:  CMASRERACKAIGLIAITKRGRNKLVELGVVPVLIELLREGDYVTKLVAGNALGIVSAHLAYIRPAAQAGAIPLFADLLQWPDPICKEIAEDVFCLLAVA
         MASRERAC+AIGLIAITKRGRNKLVELGVV VLIEL+REGDYVTKLVAGNALGIVSAHLAYIRP AQAGAIPLFADLLQ P PICKEIAEDVFCLLAVA
Subjt:  CMASRERACKAIGLIAITKRGRNKLVELGVVPVLIELLREGDYVTKLVAGNALGIVSAHLAYIRPAAQAGAIPLFADLLQWPDPICKEIAEDVFCLLAVA

Query:  ETNAVVISDHLVKVLKEGDDRGKAAAADVLRILSSYKYSISVVQNSGAIPVLVDLLHDGNH----------------ETDRVALADAGAIQRLIGLLQDE
        ETNAVVISDHLVKVLKEGDDRGKAAAADVLRILSSYKYSISVVQNSGAIPVLVDLLHDGNH                ETDRVALADAGAIQ LIGLLQDE
Subjt:  ETNAVVISDHLVKVLKEGDDRGKAAAADVLRILSSYKYSISVVQNSGAIPVLVDLLHDGNH----------------ETDRVALADAGAIQRLIGLLQDE

Query:  LEKVKLNVTETLTSFSKDPLYCTRVSEAVSSPAFQNLQEILTHSRATE
        LEKVKLNVTE LTSFSKDPLY TR+SE VS+PAFQNLQE +TH RATE
Subjt:  LEKVKLNVTETLTSFSKDPLYCTRVSEAVSSPAFQNLQEILTHSRATE

XP_022936629.1 uncharacterized protein LOC111443172 [Cucurbita moschata]2.2e-18575.74Show/hide
Query:  MGDTKIHRASEKPNLDWEASLKQYENVMASESEALKVKATIKLAQLSKNAPENILKSAIPIIAKPLEDNPTNNAGQSMRGAAAYCLRCISCRGDGTLATA
        MG ++IHRASEKPNLDWEA+LKQYENVMA+ESEA+KVKATIKLA LS +AP NIL S IPIIAK LE NP +N+ QSMRGAAAYCL+ IS +GDGTLATA
Subjt:  MGDTKIHRASEKPNLDWEASLKQYENVMASESEALKVKATIKLAQLSKNAPENILKSAIPIIAKPLEDNPTNNAGQSMRGAAAYCLRCISCRGDGTLATA

Query:  VGHSGALESLLRSLLHSSGCFCKILVKCIWSIVTFDQSSRVIIARNGGLEVIICMLVSVIDGTRRYLLEILSAMALLREVRKALIRLRGLPFLVQAARFG
        V HSGALES+LRSL  +SG F KILVKCIWSIVT D+ SRVI+ARNGGLEV+I M  SV+DGTRRYLLEILSA+AL+REVRKALI LRGLPFLVQAAR+G
Subjt:  VGHSGALESLLRSLLHSSGCFCKILVKCIWSIVTFDQSSRVIIARNGGLEVIICMLVSVIDGTRRYLLEILSAMALLREVRKALIRLRGLPFLVQAARFG

Query:  CMASRERACKAIGLIAITKRGRNKLVELGVVPVLIELLREGDYVTKLVAGNALGIVSAHLAYIRPAAQAGAIPLFADLLQWPDPICKEIAEDVFCLLAVA
        CMASRERAC+AIGLIAIT+RGR+ LVELGV+PVLIEL  EGDYVTKLVAGN+LGIVSAH+AYIRP AQAGAIPLFADLLQWP+PI KEIAEDVFCLLAVA
Subjt:  CMASRERACKAIGLIAITKRGRNKLVELGVVPVLIELLREGDYVTKLVAGNALGIVSAHLAYIRPAAQAGAIPLFADLLQWPDPICKEIAEDVFCLLAVA

Query:  ETNAVVISDHLVKVLKEGDDRGKAAAADVLRILSSYKYSISVVQNSGAIPVLVDLLHDGN----------------HETDRVALADAGAIQRLIGLLQDE
        E NAV+I DHLV++LKEGDD GKAAAADVL  LSSYKYSI +VQ+SGAIPVLVDLLHDGN                 E DR+ALADAGAI  LIGLLQDE
Subjt:  ETNAVVISDHLVKVLKEGDDRGKAAAADVLRILSSYKYSISVVQNSGAIPVLVDLLHDGN----------------HETDRVALADAGAIQRLIGLLQDE

Query:  LEKVKLNVTETLTSFSKDPLYCTRVSEAVSSPAFQNLQEILTHSRATEWQSMVSVHQLGINQFTFDSDLL
        L++VK N  E L +FS+D LYC RVSEA+S+PAFQNL E +TH RATE   + S+ Q+GINQ T D DLL
Subjt:  LEKVKLNVTETLTSFSKDPLYCTRVSEAVSSPAFQNLQEILTHSRATEWQSMVSVHQLGINQFTFDSDLL

XP_031744763.1 uncharacterized protein LOC101212744 [Cucumis sativus]6.4e-23091.06Show/hide
Query:  MGDTKIHRASEKPNLDWEASLKQYENVMASESEALKVKATIKLAQLSKNAPENILKSAIPIIAKPLEDNPTNNAGQSMRGAAAYCLRCISCRGDGTLATA
        MGD KIHRASEKPNLDWEASLKQYENVMASESEALKVKATIKLAQLSK APENILKS IPIIAKPLEDNPTNNA QSMR AAAYCLRCISCRGDGTLATA
Subjt:  MGDTKIHRASEKPNLDWEASLKQYENVMASESEALKVKATIKLAQLSKNAPENILKSAIPIIAKPLEDNPTNNAGQSMRGAAAYCLRCISCRGDGTLATA

Query:  VGHSGALESLLRSLLHSSGCFCKILVKCIWSIVTFDQSSRVIIARNGGLEVIICMLVSVIDGTRRYLLEILSAMALLREVRKALIRLRGLPFLVQAARFG
        VGHSGALESL++SLLHSSGCFCKILVKCIWSIVTFDQSSRVIIARNGGLEVIICMLV VIDGTRRYLLEILSAMALLREVRKALIR RGLPFLVQAARFG
Subjt:  VGHSGALESLLRSLLHSSGCFCKILVKCIWSIVTFDQSSRVIIARNGGLEVIICMLVSVIDGTRRYLLEILSAMALLREVRKALIRLRGLPFLVQAARFG

Query:  CMASRERACKAIGLIAITKRGRNKLVELGVVPVLIELLREGDYVTKLVAGNALGIVSAHLAYIRPAAQAGAIPLFADLLQWPDPICKEIAEDVFCLLAVA
        CMASRERAC+AIGLIAITKRGRNKLVELGVVPVLIELLREGDYVTKLVAGN LGIVSAHLAYIRP AQAGAIPLFADLLQW DPICKEIAEDVFCLLAVA
Subjt:  CMASRERACKAIGLIAITKRGRNKLVELGVVPVLIELLREGDYVTKLVAGNALGIVSAHLAYIRPAAQAGAIPLFADLLQWPDPICKEIAEDVFCLLAVA

Query:  ETNAVVISDHLVKVLKEGDDRGKAAAADVLRILSSYKYSISVVQNSGAIPVLVDLLHDGNH----------------ETDRVALADAGAIQRLIGLLQDE
        ETNAVVI DHLVKVLKEGDDRGKAAAADVLRILSSYKYSISVVQNSGAIPVLVDLLHDGNH                ETDRVALADAGAIQ LIGLLQD+
Subjt:  ETNAVVISDHLVKVLKEGDDRGKAAAADVLRILSSYKYSISVVQNSGAIPVLVDLLHDGNH----------------ETDRVALADAGAIQRLIGLLQDE

Query:  LEKVKLNVTETLTSFSKDPLYCTRVSEAVSSPAFQNLQEILTHSRATEWQSMVSVHQLGINQFTFDSDLL
        LEKVKLNVTE +TSFSKDPLYCTRV+EA+S+PAFQNLQE +TH RA EWQSMVSVHQLGINQFTFD DLL
Subjt:  LEKVKLNVTETLTSFSKDPLYCTRVSEAVSSPAFQNLQEILTHSRATEWQSMVSVHQLGINQFTFDSDLL

XP_038897634.1 protein spotted leaf 11 [Benincasa hispida]1.0e-20683.4Show/hide
Query:  MGDTKIHRASEKPNLDWEASLKQYENVMASESEALKVKATIKLAQLSKNAPENILKSAIPIIAKPLEDNPTNNAGQSMRGAAAYCLRCISCRGDGTLATA
        MG ++IHRASEKPNLDWEA+L QYENVMASESEA+KVKATIKLA LSKNAPENIL SAIP IAK LEDNPT+N+ QSMRGAAAYCLRCISC+GDGTLA A
Subjt:  MGDTKIHRASEKPNLDWEASLKQYENVMASESEALKVKATIKLAQLSKNAPENILKSAIPIIAKPLEDNPTNNAGQSMRGAAAYCLRCISCRGDGTLATA

Query:  VGHSGALESLLRSLLHSSGCFCKILVKCIWSIVTFDQSSRVIIARNGGLEVIICMLVSVIDGTRRYLLEILSAMALLREVRKALIRLRGLPFLVQAARFG
        VGHSGALESLLRSL HSSGCF KILVKC+WSIVTFDQ+SRVIIARNGGLEVII M  SV DGTRRYLLEILSAMALLREVRKALI LRGLPFLVQAARFG
Subjt:  VGHSGALESLLRSLLHSSGCFCKILVKCIWSIVTFDQSSRVIIARNGGLEVIICMLVSVIDGTRRYLLEILSAMALLREVRKALIRLRGLPFLVQAARFG

Query:  CMASRERACKAIGLIAITKRGRNKLVELGVVPVLIELLREGDYVTKLVAGNALGIVSAHLAYIRPAAQAGAIPLFADLLQWPDPICKEIAEDVFCLLAVA
        CMASRERAC+AIGL+AITKRGR  LVELGVVPVLIELLREGDYVTKLVAGN+LG+VSAH+ YIRP AQAGAIPLFA+LLQWPDPI KEIAEDVFCLLAVA
Subjt:  CMASRERACKAIGLIAITKRGRNKLVELGVVPVLIELLREGDYVTKLVAGNALGIVSAHLAYIRPAAQAGAIPLFADLLQWPDPICKEIAEDVFCLLAVA

Query:  ETNAVVISDHLVKVLKEGDDRGKAAAADVLRILSSYKYSISVVQNSGAIPVLVDLLHDGN----------------HETDRVALADAGAIQRLIGLLQDE
        E NAVVISDHLV++LKEGD  GKAAAADVL  LSSYKYSISVVQNSGAIPVLVDLLHDGN                +ETDRVALADAGAIQRLIGLLQDE
Subjt:  ETNAVVISDHLVKVLKEGDDRGKAAAADVLRILSSYKYSISVVQNSGAIPVLVDLLHDGN----------------HETDRVALADAGAIQRLIGLLQDE

Query:  LEKVKLNVTETLTSFSKDPLYCTRVSEAVSSPAFQNLQEILTHSRATEWQSMVSVHQLGINQFTFDSDLL
        LE+VK NVTE L +FS+DPLYCTRVSEAVS+PAFQNLQE + H RA E QS+ S+HQ+GINQFT D DLL
Subjt:  LEKVKLNVTETLTSFSKDPLYCTRVSEAVSSPAFQNLQEILTHSRATEWQSMVSVHQLGINQFTFDSDLL

TrEMBL top hitse value%identityAlignment
A0A0A0LGL6 Uncharacterized protein3.1e-23091.06Show/hide
Query:  MGDTKIHRASEKPNLDWEASLKQYENVMASESEALKVKATIKLAQLSKNAPENILKSAIPIIAKPLEDNPTNNAGQSMRGAAAYCLRCISCRGDGTLATA
        MGD KIHRASEKPNLDWEASLKQYENVMASESEALKVKATIKLAQLSK APENILKS IPIIAKPLEDNPTNNA QSMR AAAYCLRCISCRGDGTLATA
Subjt:  MGDTKIHRASEKPNLDWEASLKQYENVMASESEALKVKATIKLAQLSKNAPENILKSAIPIIAKPLEDNPTNNAGQSMRGAAAYCLRCISCRGDGTLATA

Query:  VGHSGALESLLRSLLHSSGCFCKILVKCIWSIVTFDQSSRVIIARNGGLEVIICMLVSVIDGTRRYLLEILSAMALLREVRKALIRLRGLPFLVQAARFG
        VGHSGALESL++SLLHSSGCFCKILVKCIWSIVTFDQSSRVIIARNGGLEVIICMLV VIDGTRRYLLEILSAMALLREVRKALIR RGLPFLVQAARFG
Subjt:  VGHSGALESLLRSLLHSSGCFCKILVKCIWSIVTFDQSSRVIIARNGGLEVIICMLVSVIDGTRRYLLEILSAMALLREVRKALIRLRGLPFLVQAARFG

Query:  CMASRERACKAIGLIAITKRGRNKLVELGVVPVLIELLREGDYVTKLVAGNALGIVSAHLAYIRPAAQAGAIPLFADLLQWPDPICKEIAEDVFCLLAVA
        CMASRERAC+AIGLIAITKRGRNKLVELGVVPVLIELLREGDYVTKLVAGN LGIVSAHLAYIRP AQAGAIPLFADLLQW DPICKEIAEDVFCLLAVA
Subjt:  CMASRERACKAIGLIAITKRGRNKLVELGVVPVLIELLREGDYVTKLVAGNALGIVSAHLAYIRPAAQAGAIPLFADLLQWPDPICKEIAEDVFCLLAVA

Query:  ETNAVVISDHLVKVLKEGDDRGKAAAADVLRILSSYKYSISVVQNSGAIPVLVDLLHDGNH----------------ETDRVALADAGAIQRLIGLLQDE
        ETNAVVI DHLVKVLKEGDDRGKAAAADVLRILSSYKYSISVVQNSGAIPVLVDLLHDGNH                ETDRVALADAGAIQ LIGLLQD+
Subjt:  ETNAVVISDHLVKVLKEGDDRGKAAAADVLRILSSYKYSISVVQNSGAIPVLVDLLHDGNH----------------ETDRVALADAGAIQRLIGLLQDE

Query:  LEKVKLNVTETLTSFSKDPLYCTRVSEAVSSPAFQNLQEILTHSRATEWQSMVSVHQLGINQFTFDSDLL
        LEKVKLNVTE +TSFSKDPLYCTRV+EA+S+PAFQNLQE +TH RA EWQSMVSVHQLGINQFTFD DLL
Subjt:  LEKVKLNVTETLTSFSKDPLYCTRVSEAVSSPAFQNLQEILTHSRATEWQSMVSVHQLGINQFTFDSDLL

A0A1S3CJQ4 uncharacterized protein LOC1035012291.3e-21289.96Show/hide
Query:  MGDTKIHRASEKPNLDWEASLKQYENVMASESEALKVKATIKLAQLSKNAPENILKSAIPIIAKPLEDNPTNNAGQSMRGAAAYCLRCISCRGDGTLATA
        MGDTKIHRASE PNLDWEASLKQYENVMASESEALKVKATIKLAQLSKNAPENILKSAIPIIAKPLED PTNNA QSMR AAAYCLRCISC+GDGTLA A
Subjt:  MGDTKIHRASEKPNLDWEASLKQYENVMASESEALKVKATIKLAQLSKNAPENILKSAIPIIAKPLEDNPTNNAGQSMRGAAAYCLRCISCRGDGTLATA

Query:  VGHSGALESLLRSLLHSSGCFCKILVKCIWSIVTFDQSSRVIIARNGGLEVIICMLVSVIDGTRRYLLEILSAMALLREVRKALIRLRGLPFLVQAARFG
        VGHSGALESLLRSLLHSSGCFCKILVKCIWS VTFD+SSRVI+ARNGGLEVIICMLV V DGTRRYLLEILSAMALLREVRKAL+ LRGLPFLVQAARFG
Subjt:  VGHSGALESLLRSLLHSSGCFCKILVKCIWSIVTFDQSSRVIIARNGGLEVIICMLVSVIDGTRRYLLEILSAMALLREVRKALIRLRGLPFLVQAARFG

Query:  CMASRERACKAIGLIAITKRGRNKLVELGVVPVLIELLREGDYVTKLVAGNALGIVSAHLAYIRPAAQAGAIPLFADLLQWPDPICKEIAEDVFCLLAVA
         MASRERAC+AIGLIAITKRGRNKLVELGVV VLIEL+REGDYVTKLVAGNALGIVSAHLAYIRP AQAGAIPLFADLLQ P PICKEIAEDVFCLLAVA
Subjt:  CMASRERACKAIGLIAITKRGRNKLVELGVVPVLIELLREGDYVTKLVAGNALGIVSAHLAYIRPAAQAGAIPLFADLLQWPDPICKEIAEDVFCLLAVA

Query:  ETNAVVISDHLVKVLKEGDDRGKAAAADVLRILSSYKYSISVVQNSGAIPVLVDLLHDGNH----------------ETDRVALADAGAIQRLIGLLQDE
        ETNAVVISDHLVKVLKEGDDRGKAAAADVLRILSSYKYSISVVQNSGAIPVLVDLLHDGNH                ETDRVALADAGAIQ LIGLLQDE
Subjt:  ETNAVVISDHLVKVLKEGDDRGKAAAADVLRILSSYKYSISVVQNSGAIPVLVDLLHDGNH----------------ETDRVALADAGAIQRLIGLLQDE

Query:  LEKVKLNVTETLTSFSKDPLYCTRVSEAVSSPAFQNLQEILTHSRATE
        LEKVKLNVTE LTSFSKDPLY TR+SE VS+PAFQNLQE +TH RATE
Subjt:  LEKVKLNVTETLTSFSKDPLYCTRVSEAVSSPAFQNLQEILTHSRATE

A0A5A7U359 Vacuolar protein 82.3e-22590Show/hide
Query:  MGDTKIHRASEKPNLDWEASLKQYENVMASESEALKVKATIKLAQLSKNAPENILKSAIPIIAKPLEDNPTNNAGQSMRGAAAYCLRCISCRGDGTLATA
        MGDTKIHRASE PNLDWEASLKQYENVMASESEALKVKATIKLAQLSKNAPENILKSAIPIIAKPLED PTNNA QSMR AAAYCLRCISC+GDGTLA A
Subjt:  MGDTKIHRASEKPNLDWEASLKQYENVMASESEALKVKATIKLAQLSKNAPENILKSAIPIIAKPLEDNPTNNAGQSMRGAAAYCLRCISCRGDGTLATA

Query:  VGHSGALESLLRSLLHSSGCFCKILVKCIWSIVTFDQSSRVIIARNGGLEVIICMLVSVIDGTRRYLLEILSAMALLREVRKALIRLRGLPFLVQAARFG
        VGHSGALESLLRSLLHSSGCFCKILVKCIWS VTFD+SSRVI+ RNGGLEVIICMLV V DGTRRYLLEILSAMALLREVRKAL+ LRGLPFLVQAARFG
Subjt:  VGHSGALESLLRSLLHSSGCFCKILVKCIWSIVTFDQSSRVIIARNGGLEVIICMLVSVIDGTRRYLLEILSAMALLREVRKALIRLRGLPFLVQAARFG

Query:  CMASRERACKAIGLIAITKRGRNKLVELGVVPVLIELLREGDYVTKLVAGNALGIVSAHLAYIRPAAQAGAIPLFADLLQWPDPICKEIAEDVFCLLAVA
         MASRERAC+AIGLIAITKRGRNKLVELGVV VLIEL+REGDYVTKLVAGNALGIVSAHLAYIRP AQAGAIPLFADLLQ PDPICKEIAEDVFCLLAVA
Subjt:  CMASRERACKAIGLIAITKRGRNKLVELGVVPVLIELLREGDYVTKLVAGNALGIVSAHLAYIRPAAQAGAIPLFADLLQWPDPICKEIAEDVFCLLAVA

Query:  ETNAVVISDHLVKVLKEGDDRGKAAAADVLRILSSYKYSISVVQNSGAIPVLVDLLHDGNH----------------ETDRVALADAGAIQRLIGLLQDE
        ETNAVVISDHLVKVLKEGDDRGKAAAADVLRILSSYKYSISVVQNSGAIPVLVDLLHDGNH                ETDRVALADAGAIQ LIGLLQDE
Subjt:  ETNAVVISDHLVKVLKEGDDRGKAAAADVLRILSSYKYSISVVQNSGAIPVLVDLLHDGNH----------------ETDRVALADAGAIQRLIGLLQDE

Query:  LEKVKLNVTETLTSFSKDPLYCTRVSEAVSSPAFQNLQEILTHSRATEWQSMVSVHQLGINQFTFDSDLL
        LEKVKLNVTE LTSFSKDPLY TR+SE VS+PAFQNLQE +TH RATEWQSMVSVHQLGINQFTFD DL+
Subjt:  LEKVKLNVTETLTSFSKDPLYCTRVSEAVSSPAFQNLQEILTHSRATEWQSMVSVHQLGINQFTFDSDLL

A0A6J1F8U2 uncharacterized protein LOC1114431721.0e-18575.74Show/hide
Query:  MGDTKIHRASEKPNLDWEASLKQYENVMASESEALKVKATIKLAQLSKNAPENILKSAIPIIAKPLEDNPTNNAGQSMRGAAAYCLRCISCRGDGTLATA
        MG ++IHRASEKPNLDWEA+LKQYENVMA+ESEA+KVKATIKLA LS +AP NIL S IPIIAK LE NP +N+ QSMRGAAAYCL+ IS +GDGTLATA
Subjt:  MGDTKIHRASEKPNLDWEASLKQYENVMASESEALKVKATIKLAQLSKNAPENILKSAIPIIAKPLEDNPTNNAGQSMRGAAAYCLRCISCRGDGTLATA

Query:  VGHSGALESLLRSLLHSSGCFCKILVKCIWSIVTFDQSSRVIIARNGGLEVIICMLVSVIDGTRRYLLEILSAMALLREVRKALIRLRGLPFLVQAARFG
        V HSGALES+LRSL  +SG F KILVKCIWSIVT D+ SRVI+ARNGGLEV+I M  SV+DGTRRYLLEILSA+AL+REVRKALI LRGLPFLVQAAR+G
Subjt:  VGHSGALESLLRSLLHSSGCFCKILVKCIWSIVTFDQSSRVIIARNGGLEVIICMLVSVIDGTRRYLLEILSAMALLREVRKALIRLRGLPFLVQAARFG

Query:  CMASRERACKAIGLIAITKRGRNKLVELGVVPVLIELLREGDYVTKLVAGNALGIVSAHLAYIRPAAQAGAIPLFADLLQWPDPICKEIAEDVFCLLAVA
        CMASRERAC+AIGLIAIT+RGR+ LVELGV+PVLIEL  EGDYVTKLVAGN+LGIVSAH+AYIRP AQAGAIPLFADLLQWP+PI KEIAEDVFCLLAVA
Subjt:  CMASRERACKAIGLIAITKRGRNKLVELGVVPVLIELLREGDYVTKLVAGNALGIVSAHLAYIRPAAQAGAIPLFADLLQWPDPICKEIAEDVFCLLAVA

Query:  ETNAVVISDHLVKVLKEGDDRGKAAAADVLRILSSYKYSISVVQNSGAIPVLVDLLHDGN----------------HETDRVALADAGAIQRLIGLLQDE
        E NAV+I DHLV++LKEGDD GKAAAADVL  LSSYKYSI +VQ+SGAIPVLVDLLHDGN                 E DR+ALADAGAI  LIGLLQDE
Subjt:  ETNAVVISDHLVKVLKEGDDRGKAAAADVLRILSSYKYSISVVQNSGAIPVLVDLLHDGN----------------HETDRVALADAGAIQRLIGLLQDE

Query:  LEKVKLNVTETLTSFSKDPLYCTRVSEAVSSPAFQNLQEILTHSRATEWQSMVSVHQLGINQFTFDSDLL
        L++VK N  E L +FS+D LYC RVSEA+S+PAFQNL E +TH RATE   + S+ Q+GINQ T D DLL
Subjt:  LEKVKLNVTETLTSFSKDPLYCTRVSEAVSSPAFQNLQEILTHSRATEWQSMVSVHQLGINQFTFDSDLL

A0A6J1ILS8 uncharacterized protein LOC1114767611.3e-18375.11Show/hide
Query:  MGDTKIHRASEKPNLDWEASLKQYENVMASESEALKVKATIKLAQLSKNAPENILKSAIPIIAKPLEDNPTNNAGQSMRGAAAYCLRCISCRGDGTLATA
        M  ++I RASEKPNLDWEA+LKQYENVMA+ESEA+KVKATIKLA LS +AP NIL S IPIIAK LE NP +N+ QSMRGAAAYCL+ IS +GDGTLATA
Subjt:  MGDTKIHRASEKPNLDWEASLKQYENVMASESEALKVKATIKLAQLSKNAPENILKSAIPIIAKPLEDNPTNNAGQSMRGAAAYCLRCISCRGDGTLATA

Query:  VGHSGALESLLRSLLHSSGCFCKILVKCIWSIVTFDQSSRVIIARNGGLEVIICMLVSVIDGTRRYLLEILSAMALLREVRKALIRLRGLPFLVQAARFG
        + HSGALES+LRSL  +SG F KILVKCIWSIVT D+ SRVI+ARNGGLEV+I M  SV+DGTRRYLLEILSA+AL+REVRKALI LRGLPFLVQAAR+G
Subjt:  VGHSGALESLLRSLLHSSGCFCKILVKCIWSIVTFDQSSRVIIARNGGLEVIICMLVSVIDGTRRYLLEILSAMALLREVRKALIRLRGLPFLVQAARFG

Query:  CMASRERACKAIGLIAITKRGRNKLVELGVVPVLIELLREGDYVTKLVAGNALGIVSAHLAYIRPAAQAGAIPLFADLLQWPDPICKEIAEDVFCLLAVA
        CMASRERAC+AIGLIAIT+RGR+ LVELGV+PVLIEL  EGDYVTKLVAGN+LGIVSAH+AYIRP AQAGAIPLFADLLQWPDPI KEIAEDVFCLLAVA
Subjt:  CMASRERACKAIGLIAITKRGRNKLVELGVVPVLIELLREGDYVTKLVAGNALGIVSAHLAYIRPAAQAGAIPLFADLLQWPDPICKEIAEDVFCLLAVA

Query:  ETNAVVISDHLVKVLKEGDDRGKAAAADVLRILSSYKYSISVVQNSGAIPVLVDLLHDGN----------------HETDRVALADAGAIQRLIGLLQDE
        E NAV+I DHLV++LKEGDD GKAAAADVL  LSSYKYSI +VQ+SGAIPVLVDLLHDGN                 E DR+ALADAGAI  LI LLQDE
Subjt:  ETNAVVISDHLVKVLKEGDDRGKAAAADVLRILSSYKYSISVVQNSGAIPVLVDLLHDGN----------------HETDRVALADAGAIQRLIGLLQDE

Query:  LEKVKLNVTETLTSFSKDPLYCTRVSEAVSSPAFQNLQEILTHSRATEWQSMVSVHQLGINQFTFDSDLL
        L++VK N  E L +FS+D LYC RVSEA+S+PAFQNL E +TH RATE   + S+ Q+GINQ T D DLL
Subjt:  LEKVKLNVTETLTSFSKDPLYCTRVSEAVSSPAFQNLQEILTHSRATEWQSMVSVHQLGINQFTFDSDLL

SwissProt top hitse value%identityAlignment
Q5VRH9 U-box domain-containing protein 122.4e-0630.68Show/hide
Query:  LIELLREGDYVTKLVAGNALGIVSAHLAYIRPA-AQAGAIPLFADLLQWPDPICKEIAEDVFCLLAVAETN-AVVISDH----LVKVLKEGDDRGKAAAA
        L+  LR G+   +  A   + +++      R   A+AGAIPL  +LL   DP  +E A      L++ E N A ++  H    +V+VLK G    +  AA
Subjt:  LIELLREGDYVTKLVAGNALGIVSAHLAYIRPA-AQAGAIPLFADLLQWPDPICKEIAEDVFCLLAVAETN-AVVISDH----LVKVLKEGDDRGKAAAA

Query:  DVLRILSSYKYSISVVQNSGAIPVLVDLLHDGNHETDRVALADAGAIQRLIGLLQDELEKVKLNVTETLTSFSKDP
          L  LS    +   +  +GAIP L++LL DG+    + A   A AI  L     +++  VK  +   L +F  DP
Subjt:  DVLRILSSYKYSISVVQNSGAIPVLVDLLHDGNHETDRVALADAGAIQRLIGLLQDELEKVKLNVTETLTSFSKDP

Q681N2 U-box domain-containing protein 151.4e-0630.94Show/hide
Query:  AQAGAIPLFADLLQWPDPICKEIAEDVFCLLAVAETNAVVISD-----HLVKVLKEGDDRGKAAAADVLRILSSYKYSISVVQNSGAIPVLVDLLHDGNH
        A AGAIPL   LL +PD   +E A      L++ E N  +IS+     +++++L+ G+   +  +A  L  LS    +   +  S  IP LVDLL  G  
Subjt:  AQAGAIPLFADLLQWPDPICKEIAEDVFCLLAVAETNAVVISD-----HLVKVLKEGDDRGKAAAADVLRILSSYKYSISVVQNSGAIPVLVDLLHDGNH

Query:  ETDRVALA----------------DAGAIQRLIGLLQDE
           + AL                 DAG +Q L+ LL+D+
Subjt:  ETDRVALA----------------DAGAIQRLIGLLQDE

Q8GUG9 U-box domain-containing protein 117.9e-0524.02Show/hide
Query:  RFGCMASRERACKAIGLIAITKR---GRNKLVELGVVPVLIELLREGDYVTKLVAGNALGIVSAHLAYIRPAAQAGAIPLFADLLQWPDPICKEIAEDVF
        R    ++ +R      + +++KR    R  + E G +PVL+ LL   D  T+  A   +  +S +         AGA+     +L+      +E A    
Subjt:  RFGCMASRERACKAIGLIAITKR---GRNKLVELGVVPVLIELLREGDYVTKLVAGNALGIVSAHLAYIRPAAQAGAIPLFADLLQWPDPICKEIAEDVF

Query:  CLLAVAETNAVVISDH-----LVKVLKEGDDRGKAAAADVLRILSSYKYSISVVQNSGAIPVLVDLLHD-----------------GNHETDRVALADAG
          L++A+ N ++I        LV +L+ G  RGK  AA  L  L  Y  +      +G +  LV +L D                  N++  + A+  A 
Subjt:  CLLAVAETNAVVISDH-----LVKVLKEGDDRGKAAAADVLRILSSYKYSISVVQNSGAIPVLVDLLHD-----------------GNHETDRVALADAG

Query:  AIQRLIGLLQDELEKVKLNVTETLTSFSK
         +  LIG+LQ +  + + N    L S  K
Subjt:  AIQRLIGLLQDELEKVKLNVTETLTSFSK

Q8VZ40 U-box domain-containing protein 147.1e-0630.29Show/hide
Query:  LIELLREGDYVTKLVAGNALGIVSAHLAYIRPA-AQAGAIPLFADLLQWPDPICKEIAEDVFCLLAVAETNAVVISD-----HLVKVLKEGDDRGKAAAA
        L+E L  G    +  A   L +++      R   A+AGAIPL  +LL  PDP  +E +      L++ E N   I D      +V+VLK G    +  AA
Subjt:  LIELLREGDYVTKLVAGNALGIVSAHLAYIRPA-AQAGAIPLFADLLQWPDPICKEIAEDVFCLLAVAETNAVVISD-----HLVKVLKEGDDRGKAAAA

Query:  DVLRILSSYKYSISVVQNSGAIPVLVDLLHDGNHETDRVALADAGAIQRLIGLLQDELEKVKLNVTETLTSFSKD
          L  LS    +   +  +GAI  L+ LL +G     + A   A AI  L     ++   VK  + + LT   KD
Subjt:  DVLRILSSYKYSISVVQNSGAIPVLVDLLHDGNHETDRVALADAGAIQRLIGLLQDELEKVKLNVTETLTSFSKD

Q9ZV31 U-box domain-containing protein 121.8e-0423Show/hide
Query:  AQLSKNAPENILKSAIPIIAKPLEDNPTNNAGQSMRGAAAYCLRCISCRGDGTLATAVGHSGALESLLRSLLHSSGCFC-KILVKCIWSIVTFDQSSRVI
        A  S +AP++       ++ K     P +      R +AA  +R ++ + +     A+  SGA+  L+  L  S+     +  V  I ++    ++   I
Subjt:  AQLSKNAPENILKSAIPIIAKPLEDNPTNNAGQSMRGAAAYCLRCISCRGDGTLATAVGHSGALESLLRSLLHSSGCFC-KILVKCIWSIVTFDQSSRVI

Query:  IARNGGLEVIICMLVSVIDGTRRYLLEILSAMALLREVRKALIRLRGLPFLVQAARFGCMASRERACKAIGLIAITKRGRNKLVELGVVPVLIELLREGD
        +  +G +  I+ +L       R      L +++++ E +  +     +P LV     G    ++ A  A+  + I +  + K V  G+VPVL+ LL E +
Subjt:  IARNGGLEVIICMLVSVIDGTRRYLLEILSAMALLREVRKALIRLRGLPFLVQAARFGCMASRERACKAIGLIAITKRGRNKLVELGVVPVLIELLREGD

Query:  YVTKLVAGNALGIVSAHLAYIRPAAQAGAIPLFADLLQWPDPICKEIAEDVFCLLA------VAETNAVVISDHLVKVLKEGDDRGKAAAADVLRILSSY
              + + L I+S+H         A A+P+  D ++   P  KE +  V   L       + E   + I D L+++ + G DRGK  AA +L   S +
Subjt:  YVTKLVAGNALGIVSAHLAYIRPAAQAGAIPLFADLLQWPDPICKEIAEDVFCLLA------VAETNAVVISDHLVKVLKEGDDRGKAAAADVLRILSSY

Arabidopsis top hitse value%identityAlignment
AT1G23030.1 ARM repeat superfamily protein5.6e-0624.02Show/hide
Query:  RFGCMASRERACKAIGLIAITKR---GRNKLVELGVVPVLIELLREGDYVTKLVAGNALGIVSAHLAYIRPAAQAGAIPLFADLLQWPDPICKEIAEDVF
        R    ++ +R      + +++KR    R  + E G +PVL+ LL   D  T+  A   +  +S +         AGA+     +L+      +E A    
Subjt:  RFGCMASRERACKAIGLIAITKR---GRNKLVELGVVPVLIELLREGDYVTKLVAGNALGIVSAHLAYIRPAAQAGAIPLFADLLQWPDPICKEIAEDVF

Query:  CLLAVAETNAVVISDH-----LVKVLKEGDDRGKAAAADVLRILSSYKYSISVVQNSGAIPVLVDLLHD-----------------GNHETDRVALADAG
          L++A+ N ++I        LV +L+ G  RGK  AA  L  L  Y  +      +G +  LV +L D                  N++  + A+  A 
Subjt:  CLLAVAETNAVVISDH-----LVKVLKEGDDRGKAAAADVLRILSSYKYSISVVQNSGAIPVLVDLLHD-----------------GNHETDRVALADAG

Query:  AIQRLIGLLQDELEKVKLNVTETLTSFSK
         +  LIG+LQ +  + + N    L S  K
Subjt:  AIQRLIGLLQDELEKVKLNVTETLTSFSK

AT2G28830.1 PLANT U-BOX 121.2e-0523Show/hide
Query:  AQLSKNAPENILKSAIPIIAKPLEDNPTNNAGQSMRGAAAYCLRCISCRGDGTLATAVGHSGALESLLRSLLHSSGCFC-KILVKCIWSIVTFDQSSRVI
        A  S +AP++       ++ K     P +      R +AA  +R ++ + +     A+  SGA+  L+  L  S+     +  V  I ++    ++   I
Subjt:  AQLSKNAPENILKSAIPIIAKPLEDNPTNNAGQSMRGAAAYCLRCISCRGDGTLATAVGHSGALESLLRSLLHSSGCFC-KILVKCIWSIVTFDQSSRVI

Query:  IARNGGLEVIICMLVSVIDGTRRYLLEILSAMALLREVRKALIRLRGLPFLVQAARFGCMASRERACKAIGLIAITKRGRNKLVELGVVPVLIELLREGD
        +  +G +  I+ +L       R      L +++++ E +  +     +P LV     G    ++ A  A+  + I +  + K V  G+VPVL+ LL E +
Subjt:  IARNGGLEVIICMLVSVIDGTRRYLLEILSAMALLREVRKALIRLRGLPFLVQAARFGCMASRERACKAIGLIAITKRGRNKLVELGVVPVLIELLREGD

Query:  YVTKLVAGNALGIVSAHLAYIRPAAQAGAIPLFADLLQWPDPICKEIAEDVFCLLA------VAETNAVVISDHLVKVLKEGDDRGKAAAADVLRILSSY
              + + L I+S+H         A A+P+  D ++   P  KE +  V   L       + E   + I D L+++ + G DRGK  AA +L   S +
Subjt:  YVTKLVAGNALGIVSAHLAYIRPAAQAGAIPLFADLLQWPDPICKEIAEDVFCLLA------VAETNAVVISDHLVKVLKEGDDRGKAAAADVLRILSSY

AT3G20170.1 ARM repeat superfamily protein1.6e-10145.67Show/hide
Query:  NLDWEASLKQYENVMASESEALKVKATIKLAQLSKNAPENILKSAIPIIAKPLEDNPTNNAGQSMRGAAAYCLRCISCRG--DGTLATAVGHSGALESLL
        N DWE    ++EN ++S S +++V++ +KL+ L+   PE+ +  AIPI+A  L  +  +N  +S++ AAA+CL+CI+C G  +   A  +G  G + SLL
Subjt:  NLDWEASLKQYENVMASESEALKVKATIKLAQLSKNAPENILKSAIPIIAKPLEDNPTNNAGQSMRGAAAYCLRCISCRG--DGTLATAVGHSGALESLL

Query:  RSLLHSS---GCFCKILVKCIWSIVTFDQSSRVIIARNGGLEVIICMLVS-VIDGTRRYLLEILSAMALLREVRKALIRLRGLPFLVQAARFGCMASRER
          LL ++     F +I VKC+WS+VTF  S RV +AR GGLE++I  L +   DG+R YLLEILSA+  +RE R+ L+   GL FLV+AA+ G +ASRER
Subjt:  RSLLHSS---GCFCKILVKCIWSIVTFDQSSRVIIARNGGLEVIICMLVS-VIDGTRRYLLEILSAMALLREVRKALIRLRGLPFLVQAARFGCMASRER

Query:  ACKAIGLIAITKRGRNKLVELGVVPVLIELLREGDYVTKLVAGNALGIVSAHLAYIRPAAQAGAIPLFADLLQWPDPICKEIAEDVFCLLAVAETNAVVI
        AC AIGLI +T+R R  LVE GV+P L++L R+GD   KL+AGNALGI+SA   YIRP  +AG+IPL+ +LL   DP+ K+IAEDVFC+LAVAE NAV+I
Subjt:  ACKAIGLIAITKRGRNKLVELGVVPVLIELLREGDYVTKLVAGNALGIVSAHLAYIRPAAQAGAIPLFADLLQWPDPICKEIAEDVFCLLAVAETNAVVI

Query:  SDHLVKVLKEGDDRGKAAAADVLRILSSYKYSISVVQNSGAIPVLVDLLHDGN----------------HETDRVALADAGAIQRLIGLLQDELEKVKLN
        ++ LV++L+ GD+  K AA+DVL  L+ Y++S+SV++ SGAIP+L++LL DG+                +E DR A +D+G I  LI  L DE E+++ N
Subjt:  SDHLVKVLKEGDDRGKAAAADVLRILSSYKYSISVVQNSGAIPVLVDLLHDGN----------------HETDRVALADAGAIQRLIGLLQDELEKVKLN

Query:  VTETLTSFSKDPLYCTRVSEAVSSPAFQNLQEILTHSRATEWQSMVSVHQLGINQFTFDSDL
          E L +FS+D  +  RV EA+  P FQ++Q  L   RA+    + S+ ++ I     D DL
Subjt:  VTETLTSFSKDPLYCTRVSEAVSSPAFQNLQEILTHSRATEWQSMVSVHQLGINQFTFDSDL

AT3G54850.1 plant U-box 145.1e-0730.29Show/hide
Query:  LIELLREGDYVTKLVAGNALGIVSAHLAYIRPA-AQAGAIPLFADLLQWPDPICKEIAEDVFCLLAVAETNAVVISD-----HLVKVLKEGDDRGKAAAA
        L+E L  G    +  A   L +++      R   A+AGAIPL  +LL  PDP  +E +      L++ E N   I D      +V+VLK G    +  AA
Subjt:  LIELLREGDYVTKLVAGNALGIVSAHLAYIRPA-AQAGAIPLFADLLQWPDPICKEIAEDVFCLLAVAETNAVVISD-----HLVKVLKEGDDRGKAAAA

Query:  DVLRILSSYKYSISVVQNSGAIPVLVDLLHDGNHETDRVALADAGAIQRLIGLLQDELEKVKLNVTETLTSFSKD
          L  LS    +   +  +GAI  L+ LL +G     + A   A AI  L     ++   VK  + + LT   KD
Subjt:  DVLRILSSYKYSISVVQNSGAIPVLVDLLHDGNHETDRVALADAGAIQRLIGLLQDELEKVKLNVTETLTSFSKD

AT5G42340.1 Plant U-Box 151.0e-0730.94Show/hide
Query:  AQAGAIPLFADLLQWPDPICKEIAEDVFCLLAVAETNAVVISD-----HLVKVLKEGDDRGKAAAADVLRILSSYKYSISVVQNSGAIPVLVDLLHDGNH
        A AGAIPL   LL +PD   +E A      L++ E N  +IS+     +++++L+ G+   +  +A  L  LS    +   +  S  IP LVDLL  G  
Subjt:  AQAGAIPLFADLLQWPDPICKEIAEDVFCLLAVAETNAVVISD-----HLVKVLKEGDDRGKAAAADVLRILSSYKYSISVVQNSGAIPVLVDLLHDGNH

Query:  ETDRVALA----------------DAGAIQRLIGLLQDE
           + AL                 DAG +Q L+ LL+D+
Subjt:  ETDRVALA----------------DAGAIQRLIGLLQDE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGGACACAAAGATCCATAGAGCATCTGAAAAGCCCAATCTAGATTGGGAAGCATCACTGAAGCAGTACGAAAACGTTATGGCTTCAGAATCCGAAGCACTTAAAGT
GAAAGCCACAATCAAATTAGCCCAACTCTCCAAAAACGCCCCGGAAAATATTTTGAAGAGCGCCATACCCATTATTGCTAAGCCTCTTGAAGATAACCCCACCAATAATG
CAGGCCAATCCATGAGAGGCGCTGCTGCATATTGTCTAAGATGTATATCTTGTCGAGGTGATGGTACGTTGGCAACGGCGGTTGGTCATTCCGGCGCTTTAGAATCTTTG
CTCAGGTCATTGCTACATTCTAGTGGTTGTTTTTGTAAGATTCTAGTGAAATGTATTTGGAGTATTGTTACTTTTGATCAAAGTAGTCGTGTGATTATAGCAAGAAATGG
GGGTTTGGAAGTTATTATTTGTATGCTTGTTTCTGTAATTGATGGGACTAGGAGATACTTACTGGAGATTTTGAGTGCTATGGCACTACTGAGAGAGGTTAGAAAGGCAC
TTATTAGATTAAGAGGCCTCCCTTTTCTTGTGCAAGCTGCTAGATTTGGCTGCATGGCTTCCAGAGAAAGAGCTTGTAAAGCAATTGGGTTAATTGCAATTACAAAACGT
GGAAGAAACAAGCTTGTTGAATTGGGAGTGGTTCCAGTGCTTATTGAGCTACTTCGTGAAGGAGATTATGTGACAAAATTGGTGGCTGGTAATGCCCTTGGAATAGTTTC
AGCTCATCTAGCCTATATTAGGCCTGCTGCACAAGCTGGGGCTATTCCTTTATTTGCTGATCTTCTTCAGTGGCCTGATCCTATTTGTAAGGAGATTGCAGAGGATGTCT
TCTGTCTCTTAGCTGTTGCTGAAACGAATGCAGTTGTAATTTCTGATCATCTAGTGAAAGTTCTTAAAGAAGGTGATGATAGAGGAAAGGCAGCAGCTGCTGATGTATTG
AGGATTCTTTCTAGCTATAAGTACTCCATATCAGTTGTGCAGAACTCCGGTGCAATTCCAGTTTTGGTGGATCTATTACACGATGGGAATCATGAAACAGACAGGGTTGC
ACTTGCTGATGCAGGGGCAATCCAACGACTAATAGGGCTATTGCAAGATGAGTTAGAAAAAGTGAAACTTAATGTTACTGAGACCCTTACAAGTTTTTCTAAAGACCCCT
TATACTGTACTAGAGTATCTGAAGCTGTCAGCTCTCCTGCTTTCCAGAATTTGCAGGAAATATTAACTCATAGTCGTGCTACTGAATGGCAAAGCATGGTGTCCGTGCAT
CAATTGGGAATCAACCAATTTACATTTGATTCGGATCTTCTATAA
mRNA sequenceShow/hide mRNA sequence
ATGGGGGACACAAAGATCCATAGAGCATCTGAAAAGCCCAATCTAGATTGGGAAGCATCACTGAAGCAGTACGAAAACGTTATGGCTTCAGAATCCGAAGCACTTAAAGT
GAAAGCCACAATCAAATTAGCCCAACTCTCCAAAAACGCCCCGGAAAATATTTTGAAGAGCGCCATACCCATTATTGCTAAGCCTCTTGAAGATAACCCCACCAATAATG
CAGGCCAATCCATGAGAGGCGCTGCTGCATATTGTCTAAGATGTATATCTTGTCGAGGTGATGGTACGTTGGCAACGGCGGTTGGTCATTCCGGCGCTTTAGAATCTTTG
CTCAGGTCATTGCTACATTCTAGTGGTTGTTTTTGTAAGATTCTAGTGAAATGTATTTGGAGTATTGTTACTTTTGATCAAAGTAGTCGTGTGATTATAGCAAGAAATGG
GGGTTTGGAAGTTATTATTTGTATGCTTGTTTCTGTAATTGATGGGACTAGGAGATACTTACTGGAGATTTTGAGTGCTATGGCACTACTGAGAGAGGTTAGAAAGGCAC
TTATTAGATTAAGAGGCCTCCCTTTTCTTGTGCAAGCTGCTAGATTTGGCTGCATGGCTTCCAGAGAAAGAGCTTGTAAAGCAATTGGGTTAATTGCAATTACAAAACGT
GGAAGAAACAAGCTTGTTGAATTGGGAGTGGTTCCAGTGCTTATTGAGCTACTTCGTGAAGGAGATTATGTGACAAAATTGGTGGCTGGTAATGCCCTTGGAATAGTTTC
AGCTCATCTAGCCTATATTAGGCCTGCTGCACAAGCTGGGGCTATTCCTTTATTTGCTGATCTTCTTCAGTGGCCTGATCCTATTTGTAAGGAGATTGCAGAGGATGTCT
TCTGTCTCTTAGCTGTTGCTGAAACGAATGCAGTTGTAATTTCTGATCATCTAGTGAAAGTTCTTAAAGAAGGTGATGATAGAGGAAAGGCAGCAGCTGCTGATGTATTG
AGGATTCTTTCTAGCTATAAGTACTCCATATCAGTTGTGCAGAACTCCGGTGCAATTCCAGTTTTGGTGGATCTATTACACGATGGGAATCATGAAACAGACAGGGTTGC
ACTTGCTGATGCAGGGGCAATCCAACGACTAATAGGGCTATTGCAAGATGAGTTAGAAAAAGTGAAACTTAATGTTACTGAGACCCTTACAAGTTTTTCTAAAGACCCCT
TATACTGTACTAGAGTATCTGAAGCTGTCAGCTCTCCTGCTTTCCAGAATTTGCAGGAAATATTAACTCATAGTCGTGCTACTGAATGGCAAAGCATGGTGTCCGTGCAT
CAATTGGGAATCAACCAATTTACATTTGATTCGGATCTTCTATAA
Protein sequenceShow/hide protein sequence
MGDTKIHRASEKPNLDWEASLKQYENVMASESEALKVKATIKLAQLSKNAPENILKSAIPIIAKPLEDNPTNNAGQSMRGAAAYCLRCISCRGDGTLATAVGHSGALESL
LRSLLHSSGCFCKILVKCIWSIVTFDQSSRVIIARNGGLEVIICMLVSVIDGTRRYLLEILSAMALLREVRKALIRLRGLPFLVQAARFGCMASRERACKAIGLIAITKR
GRNKLVELGVVPVLIELLREGDYVTKLVAGNALGIVSAHLAYIRPAAQAGAIPLFADLLQWPDPICKEIAEDVFCLLAVAETNAVVISDHLVKVLKEGDDRGKAAAADVL
RILSSYKYSISVVQNSGAIPVLVDLLHDGNHETDRVALADAGAIQRLIGLLQDELEKVKLNVTETLTSFSKDPLYCTRVSEAVSSPAFQNLQEILTHSRATEWQSMVSVH
QLGINQFTFDSDLL