; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr017901 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr017901
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionU-box domain-containing protein 4
Genome locationtig00153057:333311..337381
RNA-Seq ExpressionSgr017901
SyntenySgr017901
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR000225 - Armadillo
IPR011989 - Armadillo-like helical
IPR016024 - Armadillo-type fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6592081.1 U-box domain-containing protein 12, partial [Cucurbita argyrosperma subsp. sororia]1.4e-19878.89Show/hide
Query:  MSCSQIHKDSEKLNLDWEEALKQYEKVMASEPEAVKVTATVKLAHLSKCAPENILNSTIPIIAKHLVDNPISNSSQSMQGAAAYCLRCISCQGEGTLATA
        M  SQIH+ SEK NLDWE ALKQYE VMA+E EAVKV AT+KLA LS  AP NILNSTIPIIAKHL  NPISNSSQSM+GAAAYCL+ IS QG+GTLATA
Subjt:  MSCSQIHKDSEKLNLDWEEALKQYEKVMASEPEAVKVTATVKLAHLSKCAPENILNSTIPIIAKHLVDNPISNSSQSMQGAAAYCLRCISCQGEGTLATA

Query:  VGHSGAIQSLLRLLPHSEGCFRKILVKCIWSIVTFGKTSRVVMTRNGGLEVIIGIFDSVIDGTRKYLLEILSALALLKEVRKALISSGGLPFLVEAARCG
        V HSGA++S+LR LP + G FRKILVKCIWSIVT  K SRV++ RNGGLEV+IG+FDSV+DGTR+YLLEILSALAL++EVRKALIS  GLPFLV+AAR G
Subjt:  VGHSGAIQSLLRLLPHSEGCFRKILVKCIWSIVTFGKTSRVVMTRNGGLEVIIGIFDSVIDGTRKYLLEILSALALLKEVRKALISSGGLPFLVEAARCG

Query:  CMASRERACQAIGLLAITKRERRILVELGVVPVLIELFREGDCVTKLVAGNSLGMVSAHVAYIRPVAQAGAIPLFADLLQWPDPIGKEIAEDVFCLLAIA
        CMASRERACQAIGL+AIT+R R +LVELGV+PVLIELF EGD VTKLV+GNSLG+VSAHVAYIRPVAQAGAIPLFADLLQWP+PI KEIAEDVFCLLA+A
Subjt:  CMASRERACQAIGLLAITKRERRILVELGVVPVLIELFREGDCVTKLVAGNSLGMVSAHVAYIRPVAQAGAIPLFADLLQWPDPIGKEIAEDVFCLLAIA

Query:  EVNADVISDHLVRILKEGDDGAKAAAADVLWDLSGYKYSISVAQNSGAIPILVDLLWDGNDEVRERVSGAIAQISYNATDRAALAGAGAIERLIGLLQDE
        E NA +I DHLVRILKEGDD  KAAAADVLWDLS YKYSI + Q+SGAIP+LVDLL DGN EVRE+VSGA+AQ+SY+  DR ALA AGAI  LIGLLQDE
Subjt:  EVNADVISDHLVRILKEGDDGAKAAAADVLWDLSGYKYSISVAQNSGAIPILVDLLWDGNDEVRERVSGAIAQISYNATDRAALAGAGAIERLIGLLQDE

Query:  LEELKDNAAEALINFSEDPLYRDRVSEAVSTPAFQTMQERMTHIRATERHSMRSLRQIGIDQLTRDPDL
        L+E+KDNAAEALINFSED LY +RVSEA+S PAFQ + ERMTHIRATERH +RSLRQ+GI+QLT DPDL
Subjt:  LEELKDNAAEALINFSEDPLYRDRVSEAVSTPAFQTMQERMTHIRATERHSMRSLRQIGIDQLTRDPDL

XP_022139563.1 uncharacterized protein LOC111010437 [Momordica charantia]3.4e-21384.5Show/hide
Query:  MSCSQIHKDSEKLNLDWEEALKQYEKVMASEPEAVKVTATVKLAHLSKCAPENILNSTIPIIAKHLVDNPISNSSQSMQGAAAYCLRCISCQG-EGTLAT
        MSCS+I K+SEK NL+WEEAL+QY+ VMASE EAVKV ATVKLA+LS+CAPENILNSTIPIIAKHL DNPI+NSSQSMQGAAAYCLR ISC+G +GTLA 
Subjt:  MSCSQIHKDSEKLNLDWEEALKQYEKVMASEPEAVKVTATVKLAHLSKCAPENILNSTIPIIAKHLVDNPISNSSQSMQGAAAYCLRCISCQG-EGTLAT

Query:  AVGHSGAIQSLLRLLPHSEGCFRKILVKCIWSIVTFGKTSRVVMTRNGGLEVIIGIFDSVIDGTRKYLLEILSALALLKEVRKALISSGGLPFLVEAARC
        AVG+SGA++SLLRLLP+S G FRKILVKC+WS+VTFGKTSRV++ RNGGLEVIIG+ DSVID +R+YLLEILSALALL+EVRKALIS  GLPFLVEAARC
Subjt:  AVGHSGAIQSLLRLLPHSEGCFRKILVKCIWSIVTFGKTSRVVMTRNGGLEVIIGIFDSVIDGTRKYLLEILSALALLKEVRKALISSGGLPFLVEAARC

Query:  GCMASRERACQAIGLLAITKRERRILVELGVVPVLIELFREGDCVTKLVAGNSLGMVSAHVAYIRPVAQAGAIPLFADLLQWPD-PIGKEIAEDVFCLLA
        GC+ASRERACQAIGLLAITKR RR+L ELGVVPVLIELFR GDC TKLVAGNSLG+VSAHVAYIRPVAQAGAIPLFADLLQWP+ PIGKEIAEDVFCLLA
Subjt:  GCMASRERACQAIGLLAITKRERRILVELGVVPVLIELFREGDCVTKLVAGNSLGMVSAHVAYIRPVAQAGAIPLFADLLQWPD-PIGKEIAEDVFCLLA

Query:  IAEVNADVISDHLVRILKEGDDGAKAAAADVLWDLSGYKYSISVAQNSGAIPILVDLLWDGNDEVRERVSGAIAQISYNATDRAALAGAGAIERLIGLLQ
        +AE NA VISDHLVRILKEGDD AKAAAADVLWDLSGYKYSIS A NSGAIP++VDLL D N+EVRE+VSGAIAQ+SYN  DRAALA AGAIERLI LLQ
Subjt:  IAEVNADVISDHLVRILKEGDDGAKAAAADVLWDLSGYKYSISVAQNSGAIPILVDLLWDGNDEVRERVSGAIAQISYNATDRAALAGAGAIERLIGLLQ

Query:  DELEELKDNAAEALINFSEDPLYRDRVSEAVSTPAFQTMQERMTHIRATERHSMRSLRQIGIDQLTRDPDL
        DELEELKDNAAEALINFSED  Y  RVSEAVSTPAF+ MQERMTHIR TE+HSMRSLRQ+GIDQLTRDPDL
Subjt:  DELEELKDNAAEALINFSEDPLYRDRVSEAVSTPAFQTMQERMTHIRATERHSMRSLRQIGIDQLTRDPDL

XP_022936629.1 uncharacterized protein LOC111443172 [Cucurbita moschata]6.3e-19979.1Show/hide
Query:  MSCSQIHKDSEKLNLDWEEALKQYEKVMASEPEAVKVTATVKLAHLSKCAPENILNSTIPIIAKHLVDNPISNSSQSMQGAAAYCLRCISCQGEGTLATA
        M  SQIH+ SEK NLDWE ALKQYE VMA+E EAVKV AT+KLA LS  AP NILNSTIPIIAKHL  NPISNSSQSM+GAAAYCL+ IS QG+GTLATA
Subjt:  MSCSQIHKDSEKLNLDWEEALKQYEKVMASEPEAVKVTATVKLAHLSKCAPENILNSTIPIIAKHLVDNPISNSSQSMQGAAAYCLRCISCQGEGTLATA

Query:  VGHSGAIQSLLRLLPHSEGCFRKILVKCIWSIVTFGKTSRVVMTRNGGLEVIIGIFDSVIDGTRKYLLEILSALALLKEVRKALISSGGLPFLVEAARCG
        V HSGA++S+LR LP + G FRKILVKCIWSIVT  K SRV++ RNGGLEV+IG+FDSV+DGTR+YLLEILSALAL++EVRKALIS  GLPFLV+AAR G
Subjt:  VGHSGAIQSLLRLLPHSEGCFRKILVKCIWSIVTFGKTSRVVMTRNGGLEVIIGIFDSVIDGTRKYLLEILSALALLKEVRKALISSGGLPFLVEAARCG

Query:  CMASRERACQAIGLLAITKRERRILVELGVVPVLIELFREGDCVTKLVAGNSLGMVSAHVAYIRPVAQAGAIPLFADLLQWPDPIGKEIAEDVFCLLAIA
        CMASRERACQAIGL+AIT+R R +LVELGV+PVLIELF EGD VTKLVAGNSLG+VSAHVAYIRPVAQAGAIPLFADLLQWP+PI KEIAEDVFCLLA+A
Subjt:  CMASRERACQAIGLLAITKRERRILVELGVVPVLIELFREGDCVTKLVAGNSLGMVSAHVAYIRPVAQAGAIPLFADLLQWPDPIGKEIAEDVFCLLAIA

Query:  EVNADVISDHLVRILKEGDDGAKAAAADVLWDLSGYKYSISVAQNSGAIPILVDLLWDGNDEVRERVSGAIAQISYNATDRAALAGAGAIERLIGLLQDE
        E NA +I DHLVRILKEGDD  KAAAADVLWDLS YKYSI + Q+SGAIP+LVDLL DGN EVRE+VSGA+AQ+SY+  DR ALA AGAI  LIGLLQDE
Subjt:  EVNADVISDHLVRILKEGDDGAKAAAADVLWDLSGYKYSISVAQNSGAIPILVDLLWDGNDEVRERVSGAIAQISYNATDRAALAGAGAIERLIGLLQDE

Query:  LEELKDNAAEALINFSEDPLYRDRVSEAVSTPAFQTMQERMTHIRATERHSMRSLRQIGIDQLTRDPDL
        L+E+KDNAAEALINFSED LY +RVSEA+S PAFQ + ERMTHIRATERH +RSLRQ+GI+QLT DPDL
Subjt:  LEELKDNAAEALINFSEDPLYRDRVSEAVSTPAFQTMQERMTHIRATERHSMRSLRQIGIDQLTRDPDL

XP_022976328.1 uncharacterized protein LOC111476761 [Cucurbita maxima]1.5e-19778.68Show/hide
Query:  MSCSQIHKDSEKLNLDWEEALKQYEKVMASEPEAVKVTATVKLAHLSKCAPENILNSTIPIIAKHLVDNPISNSSQSMQGAAAYCLRCISCQGEGTLATA
        MS S+I + SEK NLDWE ALKQYE VMA+E EAVKV AT+KLA LS  AP NILNSTIPIIAKHL  NPISNSSQSM+GAAAYCL+ IS QG+GTLATA
Subjt:  MSCSQIHKDSEKLNLDWEEALKQYEKVMASEPEAVKVTATVKLAHLSKCAPENILNSTIPIIAKHLVDNPISNSSQSMQGAAAYCLRCISCQGEGTLATA

Query:  VGHSGAIQSLLRLLPHSEGCFRKILVKCIWSIVTFGKTSRVVMTRNGGLEVIIGIFDSVIDGTRKYLLEILSALALLKEVRKALISSGGLPFLVEAARCG
        + HSGA++S+LR LP + G FRKILVKCIWSIVT  K SRV++ RNGGLEV+IG+FDSV+DGTR+YLLEILSALAL++EVRKALIS  GLPFLV+AAR G
Subjt:  VGHSGAIQSLLRLLPHSEGCFRKILVKCIWSIVTFGKTSRVVMTRNGGLEVIIGIFDSVIDGTRKYLLEILSALALLKEVRKALISSGGLPFLVEAARCG

Query:  CMASRERACQAIGLLAITKRERRILVELGVVPVLIELFREGDCVTKLVAGNSLGMVSAHVAYIRPVAQAGAIPLFADLLQWPDPIGKEIAEDVFCLLAIA
        CMASRERACQAIGL+AIT+R R +LVELGV+PVLIELF EGD VTKLVAGNSLG+VSAHVAYIRPVAQAGAIPLFADLLQWPDPI KEIAEDVFCLLA+A
Subjt:  CMASRERACQAIGLLAITKRERRILVELGVVPVLIELFREGDCVTKLVAGNSLGMVSAHVAYIRPVAQAGAIPLFADLLQWPDPIGKEIAEDVFCLLAIA

Query:  EVNADVISDHLVRILKEGDDGAKAAAADVLWDLSGYKYSISVAQNSGAIPILVDLLWDGNDEVRERVSGAIAQISYNATDRAALAGAGAIERLIGLLQDE
        E NA +I DHLVRILKEGDD  KAAAADVLWDLS YKYSI + Q+SGAIP+LVDLL DGN EVRE+VSGA+AQ+SY+  DR ALA AGAI  LI LLQDE
Subjt:  EVNADVISDHLVRILKEGDDGAKAAAADVLWDLSGYKYSISVAQNSGAIPILVDLLWDGNDEVRERVSGAIAQISYNATDRAALAGAGAIERLIGLLQDE

Query:  LEELKDNAAEALINFSEDPLYRDRVSEAVSTPAFQTMQERMTHIRATERHSMRSLRQIGIDQLTRDPDL
        L+E+KDNAAEALINFSED LY +RVSEA+S PAFQ + ERMTHIRATERH +RSLRQ+GI+QLT DPDL
Subjt:  LEELKDNAAEALINFSEDPLYRDRVSEAVSTPAFQTMQERMTHIRATERHSMRSLRQIGIDQLTRDPDL

XP_038897634.1 protein spotted leaf 11 [Benincasa hispida]2.6e-21383.37Show/hide
Query:  MSCSQIHKDSEKLNLDWEEALKQYEKVMASEPEAVKVTATVKLAHLSKCAPENILNSTIPIIAKHLVDNPISNSSQSMQGAAAYCLRCISCQGEGTLATA
        M  SQIH+ SEK NLDWE AL QYE VMASE EAVKV AT+KLAHLSK APENILNS IP IAKHL DNP SNSSQSM+GAAAYCLRCISCQG+GTLA A
Subjt:  MSCSQIHKDSEKLNLDWEEALKQYEKVMASEPEAVKVTATVKLAHLSKCAPENILNSTIPIIAKHLVDNPISNSSQSMQGAAAYCLRCISCQGEGTLATA

Query:  VGHSGAIQSLLRLLPHSEGCFRKILVKCIWSIVTFGKTSRVVMTRNGGLEVIIGIFDSVIDGTRKYLLEILSALALLKEVRKALISSGGLPFLVEAARCG
        VGHSGA++SLLR LPHS GCFRKILVKC+WSIVTF +TSRV++ RNGGLEVIIG+FDSV DGTR+YLLEILSA+ALL+EVRKALIS  GLPFLV+AAR G
Subjt:  VGHSGAIQSLLRLLPHSEGCFRKILVKCIWSIVTFGKTSRVVMTRNGGLEVIIGIFDSVIDGTRKYLLEILSALALLKEVRKALISSGGLPFLVEAARCG

Query:  CMASRERACQAIGLLAITKRERRILVELGVVPVLIELFREGDCVTKLVAGNSLGMVSAHVAYIRPVAQAGAIPLFADLLQWPDPIGKEIAEDVFCLLAIA
        CMASRERACQAIGLLAITKR RR+LVELGVVPVLIEL REGD VTKLVAGNSLG+VSAHV YIRP+AQAGAIPLFA+LLQWPDPIGKEIAEDVFCLLA+A
Subjt:  CMASRERACQAIGLLAITKRERRILVELGVVPVLIELFREGDCVTKLVAGNSLGMVSAHVAYIRPVAQAGAIPLFADLLQWPDPIGKEIAEDVFCLLAIA

Query:  EVNADVISDHLVRILKEGDDGAKAAAADVLWDLSGYKYSISVAQNSGAIPILVDLLWDGNDEVRERVSGAIAQISYNATDRAALAGAGAIERLIGLLQDE
        E NA VISDHLVRILKEGD G KAAAADVLWDLS YKYSISV QNSGAIP+LVDLL DGNDEVRE+VSGAIAQ+SYN TDR ALA AGAI+RLIGLLQDE
Subjt:  EVNADVISDHLVRILKEGDDGAKAAAADVLWDLSGYKYSISVAQNSGAIPILVDLLWDGNDEVRERVSGAIAQISYNATDRAALAGAGAIERLIGLLQDE

Query:  LEELKDNAAEALINFSEDPLYRDRVSEAVSTPAFQTMQERMTHIRATERHSMRSLRQIGIDQLTRDPDL
        LEE+KDN  EAL+NFSEDPLY  RVSEAVSTPAFQ +QERM HIRA ER S+RSL Q+GI+Q T DPDL
Subjt:  LEELKDNAAEALINFSEDPLYRDRVSEAVSTPAFQTMQERMTHIRATERHSMRSLRQIGIDQLTRDPDL

TrEMBL top hitse value%identityAlignment
A0A0A0LGL6 Uncharacterized protein4.3e-18575.27Show/hide
Query:  QIHKDSEKLNLDWEEALKQYEKVMASEPEAVKVTATVKLAHLSKCAPENILNSTIPIIAKHLVDNPISNSSQSMQGAAAYCLRCISCQGEGTLATAVGHS
        +IH+ SEK NLDWE +LKQYE VMASE EA+KV AT+KLA LSK APENIL S IPIIAK L DNP +N+SQSM+ AAAYCLRCISC+G+GTLATAVGHS
Subjt:  QIHKDSEKLNLDWEEALKQYEKVMASEPEAVKVTATVKLAHLSKCAPENILNSTIPIIAKHLVDNPISNSSQSMQGAAAYCLRCISCQGEGTLATAVGHS

Query:  GAIQSLLRLLPHSEGCFRKILVKCIWSIVTFGKTSRVVMTRNGGLEVIIGIFDSVIDGTRKYLLEILSALALLKEVRKALISSGGLPFLVEAARCGCMAS
        GA++SL++ L HS GCF KILVKCIWSIVTF ++SRV++ RNGGLEVII +   VIDGTR+YLLEILSA+ALL+EVRKALI S GLPFLV+AAR GCMAS
Subjt:  GAIQSLLRLLPHSEGCFRKILVKCIWSIVTFGKTSRVVMTRNGGLEVIIGIFDSVIDGTRKYLLEILSALALLKEVRKALISSGGLPFLVEAARCGCMAS

Query:  RERACQAIGLLAITKRERRILVELGVVPVLIELFREGDCVTKLVAGNSLGMVSAHVAYIRPVAQAGAIPLFADLLQWPDPIGKEIAEDVFCLLAIAEVNA
        RERAC+AIGL+AITKR R  LVELGVVPVLIEL REGD VTKLVAGN+LG+VSAH+AYIRPVAQAGAIPLFADLLQW DPI KEIAEDVFCLLA+AE NA
Subjt:  RERACQAIGLLAITKRERRILVELGVVPVLIELFREGDCVTKLVAGNSLGMVSAHVAYIRPVAQAGAIPLFADLLQWPDPIGKEIAEDVFCLLAIAEVNA

Query:  DVISDHLVRILKEGDDGAKAAAADVLWDLSGYKYSISVAQNSGAIPILVDLLWDGNDEVRERVSGAIAQISYNATDRAALAGAGAIERLIGLLQDELEEL
         VI DHLV++LKEGDD  KAAAADVL  LS YKYSISV QNSGAIP+LVDLL DGN EVRE+VSGAIA++S   TDR ALA AGAI+ LIGLLQD+LE++
Subjt:  DVISDHLVRILKEGDDGAKAAAADVLWDLSGYKYSISVAQNSGAIPILVDLLWDGNDEVRERVSGAIAQISYNATDRAALAGAGAIERLIGLLQDELEEL

Query:  KDNAAEALINFSEDPLYRDRVSEAVSTPAFQTMQERMTHIRATERHSMRSLRQIGIDQLTRDPDL
        K N  EA+ +FS+DPLY  RV+EA+STPAFQ +QER+THIRA E  SM S+ Q+GI+Q T D DL
Subjt:  KDNAAEALINFSEDPLYRDRVSEAVSTPAFQTMQERMTHIRATERHSMRSLRQIGIDQLTRDPDL

A0A5A7U359 Vacuolar protein 88.9e-18374.84Show/hide
Query:  MSCSQIHKDSEKLNLDWEEALKQYEKVMASEPEAVKVTATVKLAHLSKCAPENILNSTIPIIAKHLVDNPISNSSQSMQGAAAYCLRCISCQGEGTLATA
        M  ++IH+ SE  NLDWE +LKQYE VMASE EA+KV AT+KLA LSK APENIL S IPIIAK L D P +N+SQSM+ AAAYCLRCISCQG+GTLA A
Subjt:  MSCSQIHKDSEKLNLDWEEALKQYEKVMASEPEAVKVTATVKLAHLSKCAPENILNSTIPIIAKHLVDNPISNSSQSMQGAAAYCLRCISCQGEGTLATA

Query:  VGHSGAIQSLLRLLPHSEGCFRKILVKCIWSIVTFGKTSRVVMTRNGGLEVIIGIFDSVIDGTRKYLLEILSALALLKEVRKALISSGGLPFLVEAARCG
        VGHSGA++SLLR L HS GCF KILVKCIWS VTF K+SRV++ RNGGLEVII +   V DGTR+YLLEILSA+ALL+EVRKAL+   GLPFLV+AAR G
Subjt:  VGHSGAIQSLLRLLPHSEGCFRKILVKCIWSIVTFGKTSRVVMTRNGGLEVIIGIFDSVIDGTRKYLLEILSALALLKEVRKALISSGGLPFLVEAARCG

Query:  CMASRERACQAIGLLAITKRERRILVELGVVPVLIELFREGDCVTKLVAGNSLGMVSAHVAYIRPVAQAGAIPLFADLLQWPDPIGKEIAEDVFCLLAIA
         MASRERAC+AIGL+AITKR R  LVELGVV VLIEL REGD VTKLVAGN+LG+VSAH+AYIRPVAQAGAIPLFADLLQ PDPI KEIAEDVFCLLA+A
Subjt:  CMASRERACQAIGLLAITKRERRILVELGVVPVLIELFREGDCVTKLVAGNSLGMVSAHVAYIRPVAQAGAIPLFADLLQWPDPIGKEIAEDVFCLLAIA

Query:  EVNADVISDHLVRILKEGDDGAKAAAADVLWDLSGYKYSISVAQNSGAIPILVDLLWDGNDEVRERVSGAIAQISYNATDRAALAGAGAIERLIGLLQDE
        E NA VISDHLV++LKEGDD  KAAAADVL  LS YKYSISV QNSGAIP+LVDLL DGN EVRE+VSGAIA++SY  TDR ALA AGAI+ LIGLLQDE
Subjt:  EVNADVISDHLVRILKEGDDGAKAAAADVLWDLSGYKYSISVAQNSGAIPILVDLLWDGNDEVRERVSGAIAQISYNATDRAALAGAGAIERLIGLLQDE

Query:  LEELKDNAAEALINFSEDPLYRDRVSEAVSTPAFQTMQERMTHIRATERHSMRSLRQIGIDQLTRDPDL
        LE++K N  EAL +FS+DPLY  R+SE VSTPAFQ +QER+THIRATE  SM S+ Q+GI+Q T D DL
Subjt:  LEELKDNAAEALINFSEDPLYRDRVSEAVSTPAFQTMQERMTHIRATERHSMRSLRQIGIDQLTRDPDL

A0A6J1CFX3 uncharacterized protein LOC1110104371.7e-21384.5Show/hide
Query:  MSCSQIHKDSEKLNLDWEEALKQYEKVMASEPEAVKVTATVKLAHLSKCAPENILNSTIPIIAKHLVDNPISNSSQSMQGAAAYCLRCISCQG-EGTLAT
        MSCS+I K+SEK NL+WEEAL+QY+ VMASE EAVKV ATVKLA+LS+CAPENILNSTIPIIAKHL DNPI+NSSQSMQGAAAYCLR ISC+G +GTLA 
Subjt:  MSCSQIHKDSEKLNLDWEEALKQYEKVMASEPEAVKVTATVKLAHLSKCAPENILNSTIPIIAKHLVDNPISNSSQSMQGAAAYCLRCISCQG-EGTLAT

Query:  AVGHSGAIQSLLRLLPHSEGCFRKILVKCIWSIVTFGKTSRVVMTRNGGLEVIIGIFDSVIDGTRKYLLEILSALALLKEVRKALISSGGLPFLVEAARC
        AVG+SGA++SLLRLLP+S G FRKILVKC+WS+VTFGKTSRV++ RNGGLEVIIG+ DSVID +R+YLLEILSALALL+EVRKALIS  GLPFLVEAARC
Subjt:  AVGHSGAIQSLLRLLPHSEGCFRKILVKCIWSIVTFGKTSRVVMTRNGGLEVIIGIFDSVIDGTRKYLLEILSALALLKEVRKALISSGGLPFLVEAARC

Query:  GCMASRERACQAIGLLAITKRERRILVELGVVPVLIELFREGDCVTKLVAGNSLGMVSAHVAYIRPVAQAGAIPLFADLLQWPD-PIGKEIAEDVFCLLA
        GC+ASRERACQAIGLLAITKR RR+L ELGVVPVLIELFR GDC TKLVAGNSLG+VSAHVAYIRPVAQAGAIPLFADLLQWP+ PIGKEIAEDVFCLLA
Subjt:  GCMASRERACQAIGLLAITKRERRILVELGVVPVLIELFREGDCVTKLVAGNSLGMVSAHVAYIRPVAQAGAIPLFADLLQWPD-PIGKEIAEDVFCLLA

Query:  IAEVNADVISDHLVRILKEGDDGAKAAAADVLWDLSGYKYSISVAQNSGAIPILVDLLWDGNDEVRERVSGAIAQISYNATDRAALAGAGAIERLIGLLQ
        +AE NA VISDHLVRILKEGDD AKAAAADVLWDLSGYKYSIS A NSGAIP++VDLL D N+EVRE+VSGAIAQ+SYN  DRAALA AGAIERLI LLQ
Subjt:  IAEVNADVISDHLVRILKEGDDGAKAAAADVLWDLSGYKYSISVAQNSGAIPILVDLLWDGNDEVRERVSGAIAQISYNATDRAALAGAGAIERLIGLLQ

Query:  DELEELKDNAAEALINFSEDPLYRDRVSEAVSTPAFQTMQERMTHIRATERHSMRSLRQIGIDQLTRDPDL
        DELEELKDNAAEALINFSED  Y  RVSEAVSTPAF+ MQERMTHIR TE+HSMRSLRQ+GIDQLTRDPDL
Subjt:  DELEELKDNAAEALINFSEDPLYRDRVSEAVSTPAFQTMQERMTHIRATERHSMRSLRQIGIDQLTRDPDL

A0A6J1F8U2 uncharacterized protein LOC1114431723.0e-19979.1Show/hide
Query:  MSCSQIHKDSEKLNLDWEEALKQYEKVMASEPEAVKVTATVKLAHLSKCAPENILNSTIPIIAKHLVDNPISNSSQSMQGAAAYCLRCISCQGEGTLATA
        M  SQIH+ SEK NLDWE ALKQYE VMA+E EAVKV AT+KLA LS  AP NILNSTIPIIAKHL  NPISNSSQSM+GAAAYCL+ IS QG+GTLATA
Subjt:  MSCSQIHKDSEKLNLDWEEALKQYEKVMASEPEAVKVTATVKLAHLSKCAPENILNSTIPIIAKHLVDNPISNSSQSMQGAAAYCLRCISCQGEGTLATA

Query:  VGHSGAIQSLLRLLPHSEGCFRKILVKCIWSIVTFGKTSRVVMTRNGGLEVIIGIFDSVIDGTRKYLLEILSALALLKEVRKALISSGGLPFLVEAARCG
        V HSGA++S+LR LP + G FRKILVKCIWSIVT  K SRV++ RNGGLEV+IG+FDSV+DGTR+YLLEILSALAL++EVRKALIS  GLPFLV+AAR G
Subjt:  VGHSGAIQSLLRLLPHSEGCFRKILVKCIWSIVTFGKTSRVVMTRNGGLEVIIGIFDSVIDGTRKYLLEILSALALLKEVRKALISSGGLPFLVEAARCG

Query:  CMASRERACQAIGLLAITKRERRILVELGVVPVLIELFREGDCVTKLVAGNSLGMVSAHVAYIRPVAQAGAIPLFADLLQWPDPIGKEIAEDVFCLLAIA
        CMASRERACQAIGL+AIT+R R +LVELGV+PVLIELF EGD VTKLVAGNSLG+VSAHVAYIRPVAQAGAIPLFADLLQWP+PI KEIAEDVFCLLA+A
Subjt:  CMASRERACQAIGLLAITKRERRILVELGVVPVLIELFREGDCVTKLVAGNSLGMVSAHVAYIRPVAQAGAIPLFADLLQWPDPIGKEIAEDVFCLLAIA

Query:  EVNADVISDHLVRILKEGDDGAKAAAADVLWDLSGYKYSISVAQNSGAIPILVDLLWDGNDEVRERVSGAIAQISYNATDRAALAGAGAIERLIGLLQDE
        E NA +I DHLVRILKEGDD  KAAAADVLWDLS YKYSI + Q+SGAIP+LVDLL DGN EVRE+VSGA+AQ+SY+  DR ALA AGAI  LIGLLQDE
Subjt:  EVNADVISDHLVRILKEGDDGAKAAAADVLWDLSGYKYSISVAQNSGAIPILVDLLWDGNDEVRERVSGAIAQISYNATDRAALAGAGAIERLIGLLQDE

Query:  LEELKDNAAEALINFSEDPLYRDRVSEAVSTPAFQTMQERMTHIRATERHSMRSLRQIGIDQLTRDPDL
        L+E+KDNAAEALINFSED LY +RVSEA+S PAFQ + ERMTHIRATERH +RSLRQ+GI+QLT DPDL
Subjt:  LEELKDNAAEALINFSEDPLYRDRVSEAVSTPAFQTMQERMTHIRATERHSMRSLRQIGIDQLTRDPDL

A0A6J1ILS8 uncharacterized protein LOC1114767617.5e-19878.68Show/hide
Query:  MSCSQIHKDSEKLNLDWEEALKQYEKVMASEPEAVKVTATVKLAHLSKCAPENILNSTIPIIAKHLVDNPISNSSQSMQGAAAYCLRCISCQGEGTLATA
        MS S+I + SEK NLDWE ALKQYE VMA+E EAVKV AT+KLA LS  AP NILNSTIPIIAKHL  NPISNSSQSM+GAAAYCL+ IS QG+GTLATA
Subjt:  MSCSQIHKDSEKLNLDWEEALKQYEKVMASEPEAVKVTATVKLAHLSKCAPENILNSTIPIIAKHLVDNPISNSSQSMQGAAAYCLRCISCQGEGTLATA

Query:  VGHSGAIQSLLRLLPHSEGCFRKILVKCIWSIVTFGKTSRVVMTRNGGLEVIIGIFDSVIDGTRKYLLEILSALALLKEVRKALISSGGLPFLVEAARCG
        + HSGA++S+LR LP + G FRKILVKCIWSIVT  K SRV++ RNGGLEV+IG+FDSV+DGTR+YLLEILSALAL++EVRKALIS  GLPFLV+AAR G
Subjt:  VGHSGAIQSLLRLLPHSEGCFRKILVKCIWSIVTFGKTSRVVMTRNGGLEVIIGIFDSVIDGTRKYLLEILSALALLKEVRKALISSGGLPFLVEAARCG

Query:  CMASRERACQAIGLLAITKRERRILVELGVVPVLIELFREGDCVTKLVAGNSLGMVSAHVAYIRPVAQAGAIPLFADLLQWPDPIGKEIAEDVFCLLAIA
        CMASRERACQAIGL+AIT+R R +LVELGV+PVLIELF EGD VTKLVAGNSLG+VSAHVAYIRPVAQAGAIPLFADLLQWPDPI KEIAEDVFCLLA+A
Subjt:  CMASRERACQAIGLLAITKRERRILVELGVVPVLIELFREGDCVTKLVAGNSLGMVSAHVAYIRPVAQAGAIPLFADLLQWPDPIGKEIAEDVFCLLAIA

Query:  EVNADVISDHLVRILKEGDDGAKAAAADVLWDLSGYKYSISVAQNSGAIPILVDLLWDGNDEVRERVSGAIAQISYNATDRAALAGAGAIERLIGLLQDE
        E NA +I DHLVRILKEGDD  KAAAADVLWDLS YKYSI + Q+SGAIP+LVDLL DGN EVRE+VSGA+AQ+SY+  DR ALA AGAI  LI LLQDE
Subjt:  EVNADVISDHLVRILKEGDDGAKAAAADVLWDLSGYKYSISVAQNSGAIPILVDLLWDGNDEVRERVSGAIAQISYNATDRAALAGAGAIERLIGLLQDE

Query:  LEELKDNAAEALINFSEDPLYRDRVSEAVSTPAFQTMQERMTHIRATERHSMRSLRQIGIDQLTRDPDL
        L+E+KDNAAEALINFSED LY +RVSEA+S PAFQ + ERMTHIRATERH +RSLRQ+GI+QLT DPDL
Subjt:  LEELKDNAAEALINFSEDPLYRDRVSEAVSTPAFQTMQERMTHIRATERHSMRSLRQIGIDQLTRDPDL

SwissProt top hitse value%identityAlignment
A2ZLU6 Protein spotted leaf 111.1e-0729.78Show/hide
Query:  VAQAGAIPLFADLLQWPDPIGKEIAEDVFCLLAIAEVN-ADVISD----HLVRILKEGDDGAKAAAADVLWDLSGY-KYSISVAQNSGAIPILVDLLWDG
        +A+AGAIPL   LL   D   +E A      L+I E N A +IS      +V +LK G   A+  AA  L+ LS   +Y +++    GAIP LV LL +G
Subjt:  VAQAGAIPLFADLLQWPDPIGKEIAEDVFCLLAIAEVN-ADVISD----HLVRILKEGDDGAKAAAADVLWDLSGY-KYSISVAQNSGAIPILVDLLWDG

Query:  NDEVRERVSGAIAQISYNATDRAALAGAGAIERLIGLLQDELEELKDNAAEALINFSEDPLYRDRVSEAVSTPAFQTM
        +   ++  + A+  +     ++     AG +  ++GL+ +    L D A   L   S  P  +  +  A   P    M
Subjt:  NDEVRERVSGAIAQISYNATDRAALAGAGAIERLIGLLQDELEELKDNAAEALINFSEDPLYRDRVSEAVSTPAFQTM

Q5VRH9 U-box domain-containing protein 124.8e-0830.19Show/hide
Query:  VAQAGAIPLFADLLQWPDPIGKEIAEDVFCLLAIAEVN-ADVISDH----LVRILKEGDDGAKAAAADVLWDLSGYKYSISVAQNSGAIPILVDLLWDGN
        +A+AGAIPL  +LL   DP  +E A      L+I E N A ++  H    +V +LK G    +  AA  L+ LS    +      +GAIP L++LL DG+
Subjt:  VAQAGAIPLFADLLQWPDPIGKEIAEDVFCLLAIAEVN-ADVISDH----LVRILKEGDDGAKAAAADVLWDLSGYKYSISVAQNSGAIPILVDLLWDGN

Query:  DEVRERVSGAIAQISYNATDRAALAGAGAIERLIGLLQDELEELKDNAAEALINFSEDP
           ++  + AI  +     ++     AG +  L+  L D    + D A   L   + +P
Subjt:  DEVRERVSGAIAQISYNATDRAALAGAGAIERLIGLLQDELEELKDNAAEALINFSEDP

Q681N2 U-box domain-containing protein 157.9e-1130.54Show/hide
Query:  VAQAGAIPLFADLLQWPDPIGKEIAEDVFCLLAIAEVNADVISD-----HLVRILKEGDDGAKAAAADVLWDLSGYKYSISVAQNSGAIPILVDLLWDGN
        +A AGAIPL   LL +PD   +E A      L+I EVN  +IS+     +++ IL+ G+  A+  +A  L+ LS    +      S  IP LVDLL  G 
Subjt:  VAQAGAIPLFADLLQWPDPIGKEIAEDVFCLLAIAEVNADVISD-----HLVRILKEGDDGAKAAAADVLWDLSGYKYSISVAQNSGAIPILVDLLWDGN

Query:  DEVRERVSGAIAQISYNATDRAALAGAGAIERLIGLLQDELEELKDNAAEALINFSEDPLYRDRVSE
           ++    A+  +S N+ ++     AG ++ L+ LL+D+   + D A   L+  +  P  R  + +
Subjt:  DEVRERVSGAIAQISYNATDRAALAGAGAIERLIGLLQDELEELKDNAAEALINFSEDPLYRDRVSE

Q8VZ40 U-box domain-containing protein 149.6e-0929.65Show/hide
Query:  VAQAGAIPLFADLLQWPDPIGKEIAEDVFCLLAIAEVNADVISD-----HLVRILKEGDDGAKAAAADVLWDLSGYKYSISVAQNSGAIPILVDLLWDGN
        +A+AGAIPL  +LL  PDP  +E +      L+I E N   I D      +V +LK G   A+  AA  L+ LS    +      +GAI  L+ LL +G 
Subjt:  VAQAGAIPLFADLLQWPDPIGKEIAEDVFCLLAIAEVNADVISD-----HLVRILKEGDDGAKAAAADVLWDLSGYKYSISVAQNSGAIPILVDLLWDGN

Query:  DEVRERVSGAIAQISYNATDRAALAGAGAIERLIGLLQDELEELKDNAAEALINFSEDPLYRDRVSEAVSTP
           ++  + AI  +     +++     G ++ L  LL+D    + D A   L   S +   +  ++EA S P
Subjt:  DEVRERVSGAIAQISYNATDRAALAGAGAIERLIGLLQDELEELKDNAAEALINFSEDPLYRDRVSEAVSTP

Q9SNC6 U-box domain-containing protein 133.5e-1123.98Show/hide
Query:  SRVVMTRNGGLEVIIGIFDSVIDGTRKYLLEILSALALLKEVRKALISSGGLPFLVEAARCGCMASRERACQAIGLLAITKRERRILVELGVVPVLIELF
        +RV +   G + +++G+  +     +++ +  L  L++ +  + A++S+G +P +V+  + G M +RE A   +  L++    +  +  LG +P L+ L 
Subjt:  SRVVMTRNGGLEVIIGIFDSVIDGTRKYLLEILSALALLKEVRKALISSGGLPFLVEAARCGCMASRERACQAIGLLAITKRERRILVELGVVPVLIELF

Query:  REGDCVTKLVAGNSLGMVSAHVAYIRPVAQAGAIPLFADLLQWPDPIGKEIAEDVFCLLAI--------AEVNADVISDHLVRILKEGDDGAKAAAADVL
         EG    K  A  +L  +  +        +AG IP    LL  P   G  + ++   +LAI        A + +      LV  ++ G    +  AA VL
Subjt:  REGDCVTKLVAGNSLGMVSAHVAYIRPVAQAGAIPLFADLLQWPDPIGKEIAEDVFCLLAI--------AEVNADVISDHLVRILKEGDDGAKAAAADVL

Query:  WDL-SGYKYSISVAQNSGAIPILVDLLWDGNDEVRERVSGAIAQIS
          L SG    +  AQ  G +  L+DL  +G D  + + +  + +IS
Subjt:  WDL-SGYKYSISVAQNSGAIPILVDLLWDGNDEVRERVSGAIAQIS

Arabidopsis top hitse value%identityAlignment
AT2G45720.1 ARM repeat superfamily protein2.0e-0923.53Show/hide
Query:  RVVMTRNGGLEV--IIGIFDSVIDGTRKYLLEILSALALLKEVRKALISSGGLPFLVEAARCGCMASRERACQAIGLLAITKRERRILVELGVVPVLIEL
        + V+T  G   V  ++ +  +     R+  + ++ +LA        LIS   LP L+     G + ++E+A  ++  ++I+    R +V  G V  LIE+
Subjt:  RVVMTRNGGLEV--IIGIFDSVIDGTRKYLLEILSALALLKEVRKALISSGGLPFLVEAARCGCMASRERACQAIGLLAITKRERRILVELGVVPVLIEL

Query:  FREGDCVTKLVAGNSLGMVSAHVAYIRPVAQAGAIPLFADLLQWPDPIG-KEIAEDVFCLLAIAEVN----ADVISDHLVRILKEGDDG--AKAAAADVL
         + GD V++  +  +L  +SA     + +A+ G + +  ++L     +G KE A +  CL  +   N      VIS++ ++ L    DG   + +    +
Subjt:  FREGDCVTKLVAGNSLGMVSAHVAYIRPVAQAGAIPLFADLLQWPDPIG-KEIAEDVFCLLAIAEVN----ADVISDHLVRILKEGDDG--AKAAAADVL

Query:  WDLSGYKYSISVAQNSGAIPILVDLLWDGNDEVRERVSGAIAQISYNATDRAALAGAGAIERLIGLLQDELEELKDNAAEALINFSEDP
         +L G   S+SV      IP LV +L  G+   ++  +  I +I+ +   +  +  +G I  LI +L+ +    ++ AA+A+ +    P
Subjt:  WDLSGYKYSISVAQNSGAIPILVDLLWDGNDEVRERVSGAIAQISYNATDRAALAGAGAIERLIGLLQDELEELKDNAAEALINFSEDP

AT3G20170.1 ARM repeat superfamily protein1.7e-12551.47Show/hide
Query:  MSCSQIHKDSEKL-NLDWEEALKQYEKVMASEPEAVKVTATVKLAHLSKCAPENILNSTIPIIAKHLVDNPISNSSQSMQGAAAYCLRCISCQG--EGTL
        M  S+   +S    N DWE    ++E  ++S   +++V + +KL+ L+   PE+ ++  IPI+A  L      +S++S+Q AAA+CL+CI+C G  E   
Subjt:  MSCSQIHKDSEKL-NLDWEEALKQYEKVMASEPEAVKVTATVKLAHLSKCAPENILNSTIPIIAKHLVDNPISNSSQSMQGAAAYCLRCISCQG--EGTL

Query:  ATAVGHSGAIQSLLRLLPHSE---GCFRKILVKCIWSIVTFGKTSRVVMTRNGGLEVIIGIFDS-VIDGTRKYLLEILSALALLKEVRKALISSGGLPFL
        A  +G  G I SLL LL  +      FR+I VKC+WS+VTFG + RV + R GGLE++I   ++   DG+R YLLEILSAL  ++E R+ L+ SGGL FL
Subjt:  ATAVGHSGAIQSLLRLLPHSE---GCFRKILVKCIWSIVTFGKTSRVVMTRNGGLEVIIGIFDS-VIDGTRKYLLEILSALALLKEVRKALISSGGLPFL

Query:  VEAARCGCMASRERACQAIGLLAITKRERRILVELGVVPVLIELFREGDCVTKLVAGNSLGMVSAHVAYIRPVAQAGAIPLFADLLQWPDPIGKEIAEDV
        VEAA+ G +ASRERAC AIGL+ +T+R RRILVE GV+P L++L+R+GD   KL+AGN+LG++SA   YIRPV +AG+IPL+ +LL   DP+GK+IAEDV
Subjt:  VEAARCGCMASRERACQAIGLLAITKRERRILVELGVVPVLIELFREGDCVTKLVAGNSLGMVSAHVAYIRPVAQAGAIPLFADLLQWPDPIGKEIAEDV

Query:  FCLLAIAEVNADVISDHLVRILKEGDDGAKAAAADVLWDLSGYKYSISVAQNSGAIPILVDLLWDGNDEVRERVSGAIAQISYNATDRAALAGAGAIERL
        FC+LA+AE NA +I++ LVRIL+ GD+ AK AA+DVLWDL+GY++S+SV + SGAIP+L++LL DG+ E RER+SGAI+Q+SYN  DR A + +G I  L
Subjt:  FCLLAIAEVNADVISDHLVRILKEGDDGAKAAAADVLWDLSGYKYSISVAQNSGAIPILVDLLWDGNDEVRERVSGAIAQISYNATDRAALAGAGAIERL

Query:  IGLLQDELEELKDNAAEALINFSEDPLYRDRVSEAVSTPAFQTMQERMTHIRATERHSMRSLRQIGIDQLTRDPDL
        I  L DE EEL+DNAAEALINFSED  +  RV EA+  P FQ+MQ R+  IRA+    +RS+R++ I  L  D DL
Subjt:  IGLLQDELEELKDNAAEALINFSEDPLYRDRVSEAVSTPAFQTMQERMTHIRATERHSMRSLRQIGIDQLTRDPDL

AT3G46510.1 plant U-box 132.5e-1223.98Show/hide
Query:  SRVVMTRNGGLEVIIGIFDSVIDGTRKYLLEILSALALLKEVRKALISSGGLPFLVEAARCGCMASRERACQAIGLLAITKRERRILVELGVVPVLIELF
        +RV +   G + +++G+  +     +++ +  L  L++ +  + A++S+G +P +V+  + G M +RE A   +  L++    +  +  LG +P L+ L 
Subjt:  SRVVMTRNGGLEVIIGIFDSVIDGTRKYLLEILSALALLKEVRKALISSGGLPFLVEAARCGCMASRERACQAIGLLAITKRERRILVELGVVPVLIELF

Query:  REGDCVTKLVAGNSLGMVSAHVAYIRPVAQAGAIPLFADLLQWPDPIGKEIAEDVFCLLAI--------AEVNADVISDHLVRILKEGDDGAKAAAADVL
         EG    K  A  +L  +  +        +AG IP    LL  P   G  + ++   +LAI        A + +      LV  ++ G    +  AA VL
Subjt:  REGDCVTKLVAGNSLGMVSAHVAYIRPVAQAGAIPLFADLLQWPDPIGKEIAEDVFCLLAI--------AEVNADVISDHLVRILKEGDDGAKAAAADVL

Query:  WDL-SGYKYSISVAQNSGAIPILVDLLWDGNDEVRERVSGAIAQIS
          L SG    +  AQ  G +  L+DL  +G D  + + +  + +IS
Subjt:  WDL-SGYKYSISVAQNSGAIPILVDLLWDGNDEVRERVSGAIAQIS

AT3G54850.1 plant U-box 146.8e-1029.65Show/hide
Query:  VAQAGAIPLFADLLQWPDPIGKEIAEDVFCLLAIAEVNADVISD-----HLVRILKEGDDGAKAAAADVLWDLSGYKYSISVAQNSGAIPILVDLLWDGN
        +A+AGAIPL  +LL  PDP  +E +      L+I E N   I D      +V +LK G   A+  AA  L+ LS    +      +GAI  L+ LL +G 
Subjt:  VAQAGAIPLFADLLQWPDPIGKEIAEDVFCLLAIAEVNADVISD-----HLVRILKEGDDGAKAAAADVLWDLSGYKYSISVAQNSGAIPILVDLLWDGN

Query:  DEVRERVSGAIAQISYNATDRAALAGAGAIERLIGLLQDELEELKDNAAEALINFSEDPLYRDRVSEAVSTP
           ++  + AI  +     +++     G ++ L  LL+D    + D A   L   S +   +  ++EA S P
Subjt:  DEVRERVSGAIAQISYNATDRAALAGAGAIERLIGLLQDELEELKDNAAEALINFSEDPLYRDRVSEAVSTP

AT5G42340.1 Plant U-Box 155.6e-1230.54Show/hide
Query:  VAQAGAIPLFADLLQWPDPIGKEIAEDVFCLLAIAEVNADVISD-----HLVRILKEGDDGAKAAAADVLWDLSGYKYSISVAQNSGAIPILVDLLWDGN
        +A AGAIPL   LL +PD   +E A      L+I EVN  +IS+     +++ IL+ G+  A+  +A  L+ LS    +      S  IP LVDLL  G 
Subjt:  VAQAGAIPLFADLLQWPDPIGKEIAEDVFCLLAIAEVNADVISD-----HLVRILKEGDDGAKAAAADVLWDLSGYKYSISVAQNSGAIPILVDLLWDGN

Query:  DEVRERVSGAIAQISYNATDRAALAGAGAIERLIGLLQDELEELKDNAAEALINFSEDPLYRDRVSE
           ++    A+  +S N+ ++     AG ++ L+ LL+D+   + D A   L+  +  P  R  + +
Subjt:  DEVRERVSGAIAQISYNATDRAALAGAGAIERLIGLLQDELEELKDNAAEALINFSEDPLYRDRVSE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTACTGCTCTTGCTCATTCAGAAGGGGCAGGGGCTTTGAATTTCGTCATGAGTTGCTCACAGATTCATAAAGATTCTGAAAAGCTGAACCTAGATTGGGAAGAAGC
GCTGAAGCAGTATGAAAAAGTTATGGCTTCAGAACCCGAGGCTGTTAAAGTGACAGCCACAGTCAAATTGGCCCATCTCTCCAAATGTGCCCCGGAGAATATCTTGAACA
GCACTATACCCATTATTGCTAAGCATCTTGTAGATAATCCCATCTCTAATTCAAGCCAGTCCATGCAAGGGGCTGCTGCATACTGTCTAAGATGTATATCTTGTCAAGGT
GAAGGTACGTTGGCGACGGCTGTTGGTCATTCCGGCGCTATACAGTCATTGCTGAGGTTATTGCCTCATTCTGAAGGTTGTTTTCGTAAGATTCTAGTCAAATGTATTTG
GAGTATTGTTACTTTTGGTAAAACTAGTCGTGTGGTTATGACAAGGAATGGGGGTTTGGAAGTTATTATTGGTATATTCGATTCTGTTATTGATGGCACTAGAAAATATT
TATTGGAGATTTTGAGTGCATTGGCACTATTGAAAGAGGTTAGGAAGGCTCTTATCAGTTCAGGAGGCCTCCCCTTTCTTGTGGAAGCTGCTAGATGTGGCTGCATGGCC
TCCAGAGAAAGAGCTTGCCAAGCAATTGGGTTACTTGCAATTACAAAGCGTGAAAGACGCATACTTGTTGAGTTGGGAGTGGTGCCGGTGCTTATTGAGCTATTTCGTGA
AGGAGACTGTGTAACAAAACTTGTAGCTGGTAACTCGCTTGGAATGGTTTCAGCTCACGTAGCCTATATTAGGCCTGTTGCACAAGCTGGGGCCATCCCTTTGTTTGCTG
ATCTTCTTCAGTGGCCTGACCCTATTGGTAAGGAGATTGCTGAGGACGTCTTCTGTCTCTTAGCTATTGCTGAAGTGAATGCAGATGTAATTTCTGATCATCTAGTGAGA
ATTCTTAAAGAAGGTGATGATGGAGCAAAGGCTGCAGCTGCTGATGTTTTGTGGGATCTTTCAGGATATAAGTACTCCATATCAGTTGCACAAAATTCGGGTGCCATTCC
AATTTTGGTGGATCTATTATGGGATGGGAATGATGAAGTGAGGGAAAGAGTCTCTGGAGCTATAGCCCAAATAAGTTATAATGCGACAGACAGGGCTGCACTTGCTGGTG
CAGGGGCAATTGAAAGACTAATAGGGCTGTTGCAAGATGAGTTAGAAGAACTGAAAGATAATGCTGCTGAGGCCCTTATAAACTTTTCTGAAGACCCCTTATACCGTGAT
AGAGTATCCGAAGCGGTCAGCACTCCTGCTTTCCAGACTATGCAGGAAAGAATGACCCATATTCGTGCTACTGAAAGACACAGCATGAGGTCCCTGCGTCAAATTGGAAT
TGACCAACTTACAAGGGATCCAGATCTATAA
mRNA sequenceShow/hide mRNA sequence
ATGTCTACTGCTCTTGCTCATTCAGAAGGGGCAGGGGCTTTGAATTTCGTCATGAGTTGCTCACAGATTCATAAAGATTCTGAAAAGCTGAACCTAGATTGGGAAGAAGC
GCTGAAGCAGTATGAAAAAGTTATGGCTTCAGAACCCGAGGCTGTTAAAGTGACAGCCACAGTCAAATTGGCCCATCTCTCCAAATGTGCCCCGGAGAATATCTTGAACA
GCACTATACCCATTATTGCTAAGCATCTTGTAGATAATCCCATCTCTAATTCAAGCCAGTCCATGCAAGGGGCTGCTGCATACTGTCTAAGATGTATATCTTGTCAAGGT
GAAGGTACGTTGGCGACGGCTGTTGGTCATTCCGGCGCTATACAGTCATTGCTGAGGTTATTGCCTCATTCTGAAGGTTGTTTTCGTAAGATTCTAGTCAAATGTATTTG
GAGTATTGTTACTTTTGGTAAAACTAGTCGTGTGGTTATGACAAGGAATGGGGGTTTGGAAGTTATTATTGGTATATTCGATTCTGTTATTGATGGCACTAGAAAATATT
TATTGGAGATTTTGAGTGCATTGGCACTATTGAAAGAGGTTAGGAAGGCTCTTATCAGTTCAGGAGGCCTCCCCTTTCTTGTGGAAGCTGCTAGATGTGGCTGCATGGCC
TCCAGAGAAAGAGCTTGCCAAGCAATTGGGTTACTTGCAATTACAAAGCGTGAAAGACGCATACTTGTTGAGTTGGGAGTGGTGCCGGTGCTTATTGAGCTATTTCGTGA
AGGAGACTGTGTAACAAAACTTGTAGCTGGTAACTCGCTTGGAATGGTTTCAGCTCACGTAGCCTATATTAGGCCTGTTGCACAAGCTGGGGCCATCCCTTTGTTTGCTG
ATCTTCTTCAGTGGCCTGACCCTATTGGTAAGGAGATTGCTGAGGACGTCTTCTGTCTCTTAGCTATTGCTGAAGTGAATGCAGATGTAATTTCTGATCATCTAGTGAGA
ATTCTTAAAGAAGGTGATGATGGAGCAAAGGCTGCAGCTGCTGATGTTTTGTGGGATCTTTCAGGATATAAGTACTCCATATCAGTTGCACAAAATTCGGGTGCCATTCC
AATTTTGGTGGATCTATTATGGGATGGGAATGATGAAGTGAGGGAAAGAGTCTCTGGAGCTATAGCCCAAATAAGTTATAATGCGACAGACAGGGCTGCACTTGCTGGTG
CAGGGGCAATTGAAAGACTAATAGGGCTGTTGCAAGATGAGTTAGAAGAACTGAAAGATAATGCTGCTGAGGCCCTTATAAACTTTTCTGAAGACCCCTTATACCGTGAT
AGAGTATCCGAAGCGGTCAGCACTCCTGCTTTCCAGACTATGCAGGAAAGAATGACCCATATTCGTGCTACTGAAAGACACAGCATGAGGTCCCTGCGTCAAATTGGAAT
TGACCAACTTACAAGGGATCCAGATCTATAA
Protein sequenceShow/hide protein sequence
MSTALAHSEGAGALNFVMSCSQIHKDSEKLNLDWEEALKQYEKVMASEPEAVKVTATVKLAHLSKCAPENILNSTIPIIAKHLVDNPISNSSQSMQGAAAYCLRCISCQG
EGTLATAVGHSGAIQSLLRLLPHSEGCFRKILVKCIWSIVTFGKTSRVVMTRNGGLEVIIGIFDSVIDGTRKYLLEILSALALLKEVRKALISSGGLPFLVEAARCGCMA
SRERACQAIGLLAITKRERRILVELGVVPVLIELFREGDCVTKLVAGNSLGMVSAHVAYIRPVAQAGAIPLFADLLQWPDPIGKEIAEDVFCLLAIAEVNADVISDHLVR
ILKEGDDGAKAAAADVLWDLSGYKYSISVAQNSGAIPILVDLLWDGNDEVRERVSGAIAQISYNATDRAALAGAGAIERLIGLLQDELEELKDNAAEALINFSEDPLYRD
RVSEAVSTPAFQTMQERMTHIRATERHSMRSLRQIGIDQLTRDPDL