CuGenDBv2

Sed0002917 (gene) of Chayote v1 genome

Gene ID: Sed0002917
Organism: Sechium edule (Chayote v1)
Description: U-box domain-containing protein 4
Genome location: LG04:45724415..45725821
RNA-Seq Expression: Sed0002917
Synteny: Sed0002917
Gene Ontology terms:
GO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domains:
IPR000225 - Armadillo
IPR011989 - Armadillo-like helical
IPR016024 - Armadillo-type fold
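
The genome location in this record follows a `seqid:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this), the span length can be computed as below. `parse_location` and `span_length` are hypothetical helper names used for illustration only; they are not part of CuGenDB.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("LG04:45724415..45725821")
print(seqid, span_length(start, end))  # LG04 1407
```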


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6592081.1 U-box domain-containing protein 12, partial [Cucurbita argyrosperma subsp. sororia] | e-value: 2.8e-177 | identity: 72.55%
Query:  MNHSQVHEKSKRPDVDWESALKHYENVMASESEAVKVKATVKLAHLAKTVPEDVLNSAIAIIAEHLEVNPINNSSQSMKGAAAYCLRCISCQGDGALAAA
        M HSQ+H  S++P++DWE+ALK YENVMA+ESEAVKVKAT+KLA L+   P ++LNS I IIA+HLEVNPI+NSSQSM+GAAAYCL+ IS QGDG LA A
Subjt:  MNHSQVHEKSKRPDVDWESALKHYENVMASESEAVKVKATVKLAHLAKTVPEDVLNSAIAIIAEHLEVNPINNSSQSMKGAAAYCLRCISCQGDGALAAA

Query:  VGNSGALD-----------------------IVTFDKTSRVIVARNGGLEVIIGLFDSVTDGSRRYLLEILSAMALLREVRKALISLRGLPFLVEAARFG
        V +SGAL+                       IVT DK SRVIVARNGGLEV+IG+FDSV DG+RRYLLEILSA+AL+REVRKALISLRGLPFLV+AAR+G
Subjt:  VGNSGALD-----------------------IVTFDKTSRVIVARNGGLEVIIGLFDSVTDGSRRYLLEILSAMALLREVRKALISLRGLPFLVEAARFG

Query:  CMTSRERACQAIGLLATANRGREMLVELGVVPVLIELFREGDYATKLIAGNALGIVSARLAYIRPVARAGAIPLFADLLQLPDPVGKEIAEDAFCLLAVA
        CM SRERACQAIGL+A   RGR MLVELGV+PVLIELF EGDY TKL++GN+LGIVSA +AYIRPVA+AGAIPLFADLLQ P+P+ KEIAED FCLLAVA
Subjt:  CMTSRERACQAIGLLATANRGREMLVELGVVPVLIELFREGDYATKLIAGNALGIVSARLAYIRPVARAGAIPLFADLLQLPDPVGKEIAEDAFCLLAVA

Query:  EENAVEISDHLVRILKNGDDGAKAAAADVLWDFSSYKYSTSLVDSSGAIPVLVDLLQDGNDEVRAKVSGAIAQLSYNETDRAALADAGAIERLIRLLQDE
        E NAV I DHLVRILK GDD  KAAAADVLWD SSYKYS  +V SSGAIPVLVDLL DGN EVR KVSGA+AQLSY+E DR ALADAGAI  LI LLQDE
Subjt:  EENAVEISDHLVRILKNGDDGAKAAAADVLWDFSSYKYSTSLVDSSGAIPVLVDLLQDGNDEVRAKVSGAIAQLSYNETDRAALADAGAIERLIRLLQDE

Query:  LEELKDNAAEALISFSEDPVYCDRVSEAASTPAFQNMQERITRIRVNERHG--ATRHMGINILTSDSDLL
        L+E+KDNAAEALI+FSED +YC+RVSEA S PAFQN+ ER+T IR  ERHG  + R MGIN LT D DLL
Subjt:  LEELKDNAAEALISFSEDPVYCDRVSEAASTPAFQNMQERITRIRVNERHG--ATRHMGINILTSDSDLL

XP_022936629.1 uncharacterized protein LOC111443172 [Cucurbita moschata] | e-value: 1.2e-177 | identity: 72.77%
Query:  MNHSQVHEKSKRPDVDWESALKHYENVMASESEAVKVKATVKLAHLAKTVPEDVLNSAIAIIAEHLEVNPINNSSQSMKGAAAYCLRCISCQGDGALAAA
        M HSQ+H  S++P++DWE+ALK YENVMA+ESEAVKVKAT+KLA L+   P ++LNS I IIA+HLEVNPI+NSSQSM+GAAAYCL+ IS QGDG LA A
Subjt:  MNHSQVHEKSKRPDVDWESALKHYENVMASESEAVKVKATVKLAHLAKTVPEDVLNSAIAIIAEHLEVNPINNSSQSMKGAAAYCLRCISCQGDGALAAA

Query:  VGNSGALD-----------------------IVTFDKTSRVIVARNGGLEVIIGLFDSVTDGSRRYLLEILSAMALLREVRKALISLRGLPFLVEAARFG
        V +SGAL+                       IVT DK SRVIVARNGGLEV+IG+FDSV DG+RRYLLEILSA+AL+REVRKALISLRGLPFLV+AAR+G
Subjt:  VGNSGALD-----------------------IVTFDKTSRVIVARNGGLEVIIGLFDSVTDGSRRYLLEILSAMALLREVRKALISLRGLPFLVEAARFG

Query:  CMTSRERACQAIGLLATANRGREMLVELGVVPVLIELFREGDYATKLIAGNALGIVSARLAYIRPVARAGAIPLFADLLQLPDPVGKEIAEDAFCLLAVA
        CM SRERACQAIGL+A   RGR MLVELGV+PVLIELF EGDY TKL+AGN+LGIVSA +AYIRPVA+AGAIPLFADLLQ P+P+ KEIAED FCLLAVA
Subjt:  CMTSRERACQAIGLLATANRGREMLVELGVVPVLIELFREGDYATKLIAGNALGIVSARLAYIRPVARAGAIPLFADLLQLPDPVGKEIAEDAFCLLAVA

Query:  EENAVEISDHLVRILKNGDDGAKAAAADVLWDFSSYKYSTSLVDSSGAIPVLVDLLQDGNDEVRAKVSGAIAQLSYNETDRAALADAGAIERLIRLLQDE
        E NAV I DHLVRILK GDD  KAAAADVLWD SSYKYS  +V SSGAIPVLVDLL DGN EVR KVSGA+AQLSY+E DR ALADAGAI  LI LLQDE
Subjt:  EENAVEISDHLVRILKNGDDGAKAAAADVLWDFSSYKYSTSLVDSSGAIPVLVDLLQDGNDEVRAKVSGAIAQLSYNETDRAALADAGAIERLIRLLQDE

Query:  LEELKDNAAEALISFSEDPVYCDRVSEAASTPAFQNMQERITRIRVNERHG--ATRHMGINILTSDSDLL
        L+E+KDNAAEALI+FSED +YC+RVSEA S PAFQN+ ER+T IR  ERHG  + R MGIN LT D DLL
Subjt:  LEELKDNAAEALISFSEDPVYCDRVSEAASTPAFQNMQERITRIRVNERHG--ATRHMGINILTSDSDLL

XP_022976328.1 uncharacterized protein LOC111476761 [Cucurbita maxima] | e-value: 4.7e-177 | identity: 72.34%
Query:  MNHSQVHEKSKRPDVDWESALKHYENVMASESEAVKVKATVKLAHLAKTVPEDVLNSAIAIIAEHLEVNPINNSSQSMKGAAAYCLRCISCQGDGALAAA
        M+HS++   S++P++DWE+ALK YENVMA+ESEAVKVKAT+KLA L+   P ++LNS I IIA+HLEVNPI+NSSQSM+GAAAYCL+ IS QGDG LA A
Subjt:  MNHSQVHEKSKRPDVDWESALKHYENVMASESEAVKVKATVKLAHLAKTVPEDVLNSAIAIIAEHLEVNPINNSSQSMKGAAAYCLRCISCQGDGALAAA

Query:  VGNSGALD-----------------------IVTFDKTSRVIVARNGGLEVIIGLFDSVTDGSRRYLLEILSAMALLREVRKALISLRGLPFLVEAARFG
        + +SGAL+                       IVT DK SRVIVARNGGLEV+IG+FDSV DG+RRYLLEILSA+AL+REVRKALISLRGLPFLV+AAR+G
Subjt:  VGNSGALD-----------------------IVTFDKTSRVIVARNGGLEVIIGLFDSVTDGSRRYLLEILSAMALLREVRKALISLRGLPFLVEAARFG

Query:  CMTSRERACQAIGLLATANRGREMLVELGVVPVLIELFREGDYATKLIAGNALGIVSARLAYIRPVARAGAIPLFADLLQLPDPVGKEIAEDAFCLLAVA
        CM SRERACQAIGL+A   RGR MLVELGV+PVLIELF EGDY TKL+AGN+LGIVSA +AYIRPVA+AGAIPLFADLLQ PDP+ KEIAED FCLLAVA
Subjt:  CMTSRERACQAIGLLATANRGREMLVELGVVPVLIELFREGDYATKLIAGNALGIVSARLAYIRPVARAGAIPLFADLLQLPDPVGKEIAEDAFCLLAVA

Query:  EENAVEISDHLVRILKNGDDGAKAAAADVLWDFSSYKYSTSLVDSSGAIPVLVDLLQDGNDEVRAKVSGAIAQLSYNETDRAALADAGAIERLIRLLQDE
        E NAV I DHLVRILK GDD  KAAAADVLWD SSYKYS  +V SSGAIPVLVDLL DGN EVR KVSGA+AQLSY+E DR ALADAGAI  LI LLQDE
Subjt:  EENAVEISDHLVRILKNGDDGAKAAAADVLWDFSSYKYSTSLVDSSGAIPVLVDLLQDGNDEVRAKVSGAIAQLSYNETDRAALADAGAIERLIRLLQDE

Query:  LEELKDNAAEALISFSEDPVYCDRVSEAASTPAFQNMQERITRIRVNERHG--ATRHMGINILTSDSDLL
        L+E+KDNAAEALI+FSED +YC+RVSEA S PAFQN+ ER+T IR  ERHG  + R MGIN LT D DLL
Subjt:  LEELKDNAAEALISFSEDPVYCDRVSEAASTPAFQNMQERITRIRVNERHG--ATRHMGINILTSDSDLL

XP_023534876.1 uncharacterized protein LOC111796476 [Cucurbita pepo subsp. pepo] | e-value: 2.4e-176 | identity: 72.13%
Query:  MNHSQVHEKSKRPDVDWESALKHYENVMASESEAVKVKATVKLAHLAKTVPEDVLNSAIAIIAEHLEVNPINNSSQSMKGAAAYCLRCISCQGDGALAAA
        M+HS++   S++P++DWE+ALK YENVMA+ESEAVKVKAT+KLA L+   P ++LNS I IIA+HLEVNPI+NSSQSM+GAAAYCL+ IS QGDG LA A
Subjt:  MNHSQVHEKSKRPDVDWESALKHYENVMASESEAVKVKATVKLAHLAKTVPEDVLNSAIAIIAEHLEVNPINNSSQSMKGAAAYCLRCISCQGDGALAAA

Query:  VGNSGALD-----------------------IVTFDKTSRVIVARNGGLEVIIGLFDSVTDGSRRYLLEILSAMALLREVRKALISLRGLPFLVEAARFG
        V +SGAL+                       IVT DK SRVIVARNGGLEV+IG+FDSV DG+RRYLLEILSA+AL+REVRKALISLRGLPFLV+AAR+G
Subjt:  VGNSGALD-----------------------IVTFDKTSRVIVARNGGLEVIIGLFDSVTDGSRRYLLEILSAMALLREVRKALISLRGLPFLVEAARFG

Query:  CMTSRERACQAIGLLATANRGREMLVELGVVPVLIELFREGDYATKLIAGNALGIVSARLAYIRPVARAGAIPLFADLLQLPDPVGKEIAEDAFCLLAVA
        CM SRERACQAIGL+A   RGR MLVELGV+PVLIELF EGDY TKL++GN+LGIVSA +AYIRPVA+AGAIPLFADLLQ P+P+ KEIAED FCLLAVA
Subjt:  CMTSRERACQAIGLLATANRGREMLVELGVVPVLIELFREGDYATKLIAGNALGIVSARLAYIRPVARAGAIPLFADLLQLPDPVGKEIAEDAFCLLAVA

Query:  EENAVEISDHLVRILKNGDDGAKAAAADVLWDFSSYKYSTSLVDSSGAIPVLVDLLQDGNDEVRAKVSGAIAQLSYNETDRAALADAGAIERLIRLLQDE
        E NAV I DHLVRILK GDD  KAAAADVLWD SSYKYS  +V SSGAIPVLVDLL DGN EVR KVSGA+AQLSY+E DR ALADAGAI  LI LLQDE
Subjt:  EENAVEISDHLVRILKNGDDGAKAAAADVLWDFSSYKYSTSLVDSSGAIPVLVDLLQDGNDEVRAKVSGAIAQLSYNETDRAALADAGAIERLIRLLQDE

Query:  LEELKDNAAEALISFSEDPVYCDRVSEAASTPAFQNMQERITRIRVNERHG--ATRHMGINILTSDSDLL
        L+E+KDNAAEALI+FSED +YC+RVSEA S PAFQN+ ER+T IR  ERHG  + R MGIN LT D DLL
Subjt:  LEELKDNAAEALISFSEDPVYCDRVSEAASTPAFQNMQERITRIRVNERHG--ATRHMGINILTSDSDLL

XP_038897634.1 protein spotted leaf 11 [Benincasa hispida] | e-value: 3.9e-187 | identity: 75.32%
Query:  MNHSQVHEKSKRPDVDWESALKHYENVMASESEAVKVKATVKLAHLAKTVPEDVLNSAIAIIAEHLEVNPINNSSQSMKGAAAYCLRCISCQGDGALAAA
        M HSQ+H  S++P++DWE+AL  YENVMASESEAVKVKAT+KLAHL+K  PE++LNSAI  IA+HLE NP +NSSQSM+GAAAYCLRCISCQGDG LAAA
Subjt:  MNHSQVHEKSKRPDVDWESALKHYENVMASESEAVKVKATVKLAHLAKTVPEDVLNSAIAIIAEHLEVNPINNSSQSMKGAAAYCLRCISCQGDGALAAA

Query:  VGNSGALD-----------------------IVTFDKTSRVIVARNGGLEVIIGLFDSVTDGSRRYLLEILSAMALLREVRKALISLRGLPFLVEAARFG
        VG+SGAL+                       IVTFD+TSRVI+ARNGGLEVIIG+FDSV DG+RRYLLEILSAMALLREVRKALISLRGLPFLV+AARFG
Subjt:  VGNSGALD-----------------------IVTFDKTSRVIVARNGGLEVIIGLFDSVTDGSRRYLLEILSAMALLREVRKALISLRGLPFLVEAARFG

Query:  CMTSRERACQAIGLLATANRGREMLVELGVVPVLIELFREGDYATKLIAGNALGIVSARLAYIRPVARAGAIPLFADLLQLPDPVGKEIAEDAFCLLAVA
        CM SRERACQAIGLLA   RGR MLVELGVVPVLIEL REGDY TKL+AGN+LG+VSA + YIRP+A+AGAIPLFA+LLQ PDP+GKEIAED FCLLAVA
Subjt:  CMTSRERACQAIGLLATANRGREMLVELGVVPVLIELFREGDYATKLIAGNALGIVSARLAYIRPVARAGAIPLFADLLQLPDPVGKEIAEDAFCLLAVA

Query:  EENAVEISDHLVRILKNGDDGAKAAAADVLWDFSSYKYSTSLVDSSGAIPVLVDLLQDGNDEVRAKVSGAIAQLSYNETDRAALADAGAIERLIRLLQDE
        E NAV ISDHLVRILK GD G KAAAADVLWD SSYKYS S+V +SGAIPVLVDLL DGNDEVR KVSGAIAQLSYNETDR ALADAGAI+RLI LLQDE
Subjt:  EENAVEISDHLVRILKNGDDGAKAAAADVLWDFSSYKYSTSLVDSSGAIPVLVDLLQDGNDEVRAKVSGAIAQLSYNETDRAALADAGAIERLIRLLQDE

Query:  LEELKDNAAEALISFSEDPVYCDRVSEAASTPAFQNMQERITRIRVNERHG--ATRHMGINILTSDSDLL
        LEE+KDN  EAL++FSEDP+YC RVSEA STPAFQN+QER+  IR  ER    +   MGIN  T D DLL
Subjt:  LEELKDNAAEALISFSEDPVYCDRVSEAASTPAFQNMQERITRIRVNERHG--ATRHMGINILTSDSDLL

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0LGL6 Uncharacterized protein | e-value: 2.1e-162 | identity: 68.51%
Query:  MNHSQVHEKSKRPDVDWESALKHYENVMASESEAVKVKATVKLAHLAKTVPEDVLNSAIAIIAEHLEVNPINNSSQSMKGAAAYCLRCISCQGDGALAAA
        M   ++H  S++P++DWE++LK YENVMASESEA+KVKAT+KLA L+K  PE++L S I IIA+ LE NP NN+SQSM+ AAAYCLRCISC+GDG LA A
Subjt:  MNHSQVHEKSKRPDVDWESALKHYENVMASESEAVKVKATVKLAHLAKTVPEDVLNSAIAIIAEHLEVNPINNSSQSMKGAAAYCLRCISCQGDGALAAA

Query:  VGNSGALD-----------------------IVTFDKTSRVIVARNGGLEVIIGLFDSVTDGSRRYLLEILSAMALLREVRKALISLRGLPFLVEAARFG
        VG+SGAL+                       IVTFD++SRVI+ARNGGLEVII +   V DG+RRYLLEILSAMALLREVRKALI  RGLPFLV+AARFG
Subjt:  VGNSGALD-----------------------IVTFDKTSRVIVARNGGLEVIIGLFDSVTDGSRRYLLEILSAMALLREVRKALISLRGLPFLVEAARFG

Query:  CMTSRERACQAIGLLATANRGREMLVELGVVPVLIELFREGDYATKLIAGNALGIVSARLAYIRPVARAGAIPLFADLLQLPDPVGKEIAEDAFCLLAVA
        CM SRERAC+AIGL+A   RGR  LVELGVVPVLIEL REGDY TKL+AGN LGIVSA LAYIRPVA+AGAIPLFADLLQ  DP+ KEIAED FCLLAVA
Subjt:  CMTSRERACQAIGLLATANRGREMLVELGVVPVLIELFREGDYATKLIAGNALGIVSARLAYIRPVARAGAIPLFADLLQLPDPVGKEIAEDAFCLLAVA

Query:  EENAVEISDHLVRILKNGDDGAKAAAADVLWDFSSYKYSTSLVDSSGAIPVLVDLLQDGNDEVRAKVSGAIAQLSYNETDRAALADAGAIERLIRLLQDE
        E NAV I DHLV++LK GDD  KAAAADVL   SSYKYS S+V +SGAIPVLVDLL DGN EVR KVSGAIA+LS  ETDR ALADAGAI+ LI LLQD+
Subjt:  EENAVEISDHLVRILKNGDDGAKAAAADVLWDFSSYKYSTSLVDSSGAIPVLVDLLQDGNDEVRAKVSGAIAQLSYNETDRAALADAGAIERLIRLLQDE

Query:  LEELKDNAAEALISFSEDPVYCDRVSEAASTPAFQNMQERITRIRVNERHG--ATRHMGINILTSDSDLL
        LE++K N  EA+ SFS+DP+YC RV+EA STPAFQN+QERIT IR  E     +   +GIN  T D DLL
Subjt:  LEELKDNAAEALISFSEDPVYCDRVSEAASTPAFQNMQERITRIRVNERHG--ATRHMGINILTSDSDLL

A0A5A7U359 Vacuolar protein 8 | e-value: 4.2e-163 | identity: 68.94%
Query:  MNHSQVHEKSKRPDVDWESALKHYENVMASESEAVKVKATVKLAHLAKTVPEDVLNSAIAIIAEHLEVNPINNSSQSMKGAAAYCLRCISCQGDGALAAA
        M  +++H  S+ P++DWE++LK YENVMASESEA+KVKAT+KLA L+K  PE++L SAI IIA+ LE  P NN+SQSM+ AAAYCLRCISCQGDG LA A
Subjt:  MNHSQVHEKSKRPDVDWESALKHYENVMASESEAVKVKATVKLAHLAKTVPEDVLNSAIAIIAEHLEVNPINNSSQSMKGAAAYCLRCISCQGDGALAAA

Query:  VGNSGALD-----------------------IVTFDKTSRVIVARNGGLEVIIGLFDSVTDGSRRYLLEILSAMALLREVRKALISLRGLPFLVEAARFG
        VG+SGAL+                        VTFDK+SRVI+ RNGGLEVII +   VTDG+RRYLLEILSAMALLREVRKAL+ LRGLPFLV+AARFG
Subjt:  VGNSGALD-----------------------IVTFDKTSRVIVARNGGLEVIIGLFDSVTDGSRRYLLEILSAMALLREVRKALISLRGLPFLVEAARFG

Query:  CMTSRERACQAIGLLATANRGREMLVELGVVPVLIELFREGDYATKLIAGNALGIVSARLAYIRPVARAGAIPLFADLLQLPDPVGKEIAEDAFCLLAVA
         M SRERAC+AIGL+A   RGR  LVELGVV VLIEL REGDY TKL+AGNALGIVSA LAYIRPVA+AGAIPLFADLLQ PDP+ KEIAED FCLLAVA
Subjt:  CMTSRERACQAIGLLATANRGREMLVELGVVPVLIELFREGDYATKLIAGNALGIVSARLAYIRPVARAGAIPLFADLLQLPDPVGKEIAEDAFCLLAVA

Query:  EENAVEISDHLVRILKNGDDGAKAAAADVLWDFSSYKYSTSLVDSSGAIPVLVDLLQDGNDEVRAKVSGAIAQLSYNETDRAALADAGAIERLIRLLQDE
        E NAV ISDHLV++LK GDD  KAAAADVL   SSYKYS S+V +SGAIPVLVDLL DGN EVR KVSGAIA+LSY ETDR ALADAGAI+ LI LLQDE
Subjt:  EENAVEISDHLVRILKNGDDGAKAAAADVLWDFSSYKYSTSLVDSSGAIPVLVDLLQDGNDEVRAKVSGAIAQLSYNETDRAALADAGAIERLIRLLQDE

Query:  LEELKDNAAEALISFSEDPVYCDRVSEAASTPAFQNMQERITRIRVNERHG--ATRHMGINILTSDSDLL
        LE++K N  EAL SFS+DP+Y  R+SE  STPAFQN+QERIT IR  E     +   +GIN  T D DL+
Subjt:  LEELKDNAAEALISFSEDPVYCDRVSEAASTPAFQNMQERITRIRVNERHG--ATRHMGINILTSDSDLL

A0A6J1CFX3 uncharacterized protein LOC111010437 | e-value: 1.1e-174 | identity: 73.46%
Query:  MNHSQVHEKSKRPDVDWESALKHYENVMASESEAVKVKATVKLAHLAKTVPEDVLNSAIAIIAEHLEVNPINNSSQSMKGAAAYCLRCISCQG-DGALAA
        M+ S++ ++S++P+++WE AL+ Y+NVMASESEAVKVKATVKLA+L++  PE++LNS I IIA+HL  NPI NSSQSM+GAAAYCLR ISC+G DG LA 
Subjt:  MNHSQVHEKSKRPDVDWESALKHYENVMASESEAVKVKATVKLAHLAKTVPEDVLNSAIAIIAEHLEVNPINNSSQSMKGAAAYCLRCISCQG-DGALAA

Query:  AVGNSGALD-----------------------IVTFDKTSRVIVARNGGLEVIIGLFDSVTDGSRRYLLEILSAMALLREVRKALISLRGLPFLVEAARF
        AVGNSGAL+                       +VTF KTSRVI+ARNGGLEVIIG+ DSV D SRRYLLEILSA+ALLREVRKALISLRGLPFLVEAAR 
Subjt:  AVGNSGALD-----------------------IVTFDKTSRVIVARNGGLEVIIGLFDSVTDGSRRYLLEILSAMALLREVRKALISLRGLPFLVEAARF

Query:  GCMTSRERACQAIGLLATANRGREMLVELGVVPVLIELFREGDYATKLIAGNALGIVSARLAYIRPVARAGAIPLFADLLQLPD-PVGKEIAEDAFCLLA
        GC+ SRERACQAIGLLA   RGR ML ELGVVPVLIELFR GD  TKL+AGN+LGIVSA +AYIRPVA+AGAIPLFADLLQ P+ P+GKEIAED FCLLA
Subjt:  GCMTSRERACQAIGLLATANRGREMLVELGVVPVLIELFREGDYATKLIAGNALGIVSARLAYIRPVARAGAIPLFADLLQLPD-PVGKEIAEDAFCLLA

Query:  VAEENAVEISDHLVRILKNGDDGAKAAAADVLWDFSSYKYSTSLVDSSGAIPVLVDLLQDGNDEVRAKVSGAIAQLSYNETDRAALADAGAIERLIRLLQ
        VAEENAV ISDHLVRILK GDD AKAAAADVLWD S YKYS S   +SGAIPV+VDLLQD N+EVR KVSGAIAQLSYNE DRAALADAGAIERLI LLQ
Subjt:  VAEENAVEISDHLVRILKNGDDGAKAAAADVLWDFSSYKYSTSLVDSSGAIPVLVDLLQDGNDEVRAKVSGAIAQLSYNETDRAALADAGAIERLIRLLQ

Query:  DELEELKDNAAEALISFSEDPVYCDRVSEAASTPAFQNMQERITRIRVNERHG--ATRHMGINILTSDSDL
        DELEELKDNAAEALI+FSED  YC RVSEA STPAF+NMQER+T IR  E+H   + R MGI+ LT D DL
Subjt:  DELEELKDNAAEALISFSEDPVYCDRVSEAASTPAFQNMQERITRIRVNERHG--ATRHMGINILTSDSDL

A0A6J1F8U2 uncharacterized protein LOC111443172 | e-value: 6.0e-178 | identity: 72.77%
Query:  MNHSQVHEKSKRPDVDWESALKHYENVMASESEAVKVKATVKLAHLAKTVPEDVLNSAIAIIAEHLEVNPINNSSQSMKGAAAYCLRCISCQGDGALAAA
        M HSQ+H  S++P++DWE+ALK YENVMA+ESEAVKVKAT+KLA L+   P ++LNS I IIA+HLEVNPI+NSSQSM+GAAAYCL+ IS QGDG LA A
Subjt:  MNHSQVHEKSKRPDVDWESALKHYENVMASESEAVKVKATVKLAHLAKTVPEDVLNSAIAIIAEHLEVNPINNSSQSMKGAAAYCLRCISCQGDGALAAA

Query:  VGNSGALD-----------------------IVTFDKTSRVIVARNGGLEVIIGLFDSVTDGSRRYLLEILSAMALLREVRKALISLRGLPFLVEAARFG
        V +SGAL+                       IVT DK SRVIVARNGGLEV+IG+FDSV DG+RRYLLEILSA+AL+REVRKALISLRGLPFLV+AAR+G
Subjt:  VGNSGALD-----------------------IVTFDKTSRVIVARNGGLEVIIGLFDSVTDGSRRYLLEILSAMALLREVRKALISLRGLPFLVEAARFG

Query:  CMTSRERACQAIGLLATANRGREMLVELGVVPVLIELFREGDYATKLIAGNALGIVSARLAYIRPVARAGAIPLFADLLQLPDPVGKEIAEDAFCLLAVA
        CM SRERACQAIGL+A   RGR MLVELGV+PVLIELF EGDY TKL+AGN+LGIVSA +AYIRPVA+AGAIPLFADLLQ P+P+ KEIAED FCLLAVA
Subjt:  CMTSRERACQAIGLLATANRGREMLVELGVVPVLIELFREGDYATKLIAGNALGIVSARLAYIRPVARAGAIPLFADLLQLPDPVGKEIAEDAFCLLAVA

Query:  EENAVEISDHLVRILKNGDDGAKAAAADVLWDFSSYKYSTSLVDSSGAIPVLVDLLQDGNDEVRAKVSGAIAQLSYNETDRAALADAGAIERLIRLLQDE
        E NAV I DHLVRILK GDD  KAAAADVLWD SSYKYS  +V SSGAIPVLVDLL DGN EVR KVSGA+AQLSY+E DR ALADAGAI  LI LLQDE
Subjt:  EENAVEISDHLVRILKNGDDGAKAAAADVLWDFSSYKYSTSLVDSSGAIPVLVDLLQDGNDEVRAKVSGAIAQLSYNETDRAALADAGAIERLIRLLQDE

Query:  LEELKDNAAEALISFSEDPVYCDRVSEAASTPAFQNMQERITRIRVNERHG--ATRHMGINILTSDSDLL
        L+E+KDNAAEALI+FSED +YC+RVSEA S PAFQN+ ER+T IR  ERHG  + R MGIN LT D DLL
Subjt:  LEELKDNAAEALISFSEDPVYCDRVSEAASTPAFQNMQERITRIRVNERHG--ATRHMGINILTSDSDLL

A0A6J1ILS8 uncharacterized protein LOC111476761 | e-value: 2.3e-177 | identity: 72.34%
Query:  MNHSQVHEKSKRPDVDWESALKHYENVMASESEAVKVKATVKLAHLAKTVPEDVLNSAIAIIAEHLEVNPINNSSQSMKGAAAYCLRCISCQGDGALAAA
        M+HS++   S++P++DWE+ALK YENVMA+ESEAVKVKAT+KLA L+   P ++LNS I IIA+HLEVNPI+NSSQSM+GAAAYCL+ IS QGDG LA A
Subjt:  MNHSQVHEKSKRPDVDWESALKHYENVMASESEAVKVKATVKLAHLAKTVPEDVLNSAIAIIAEHLEVNPINNSSQSMKGAAAYCLRCISCQGDGALAAA

Query:  VGNSGALD-----------------------IVTFDKTSRVIVARNGGLEVIIGLFDSVTDGSRRYLLEILSAMALLREVRKALISLRGLPFLVEAARFG
        + +SGAL+                       IVT DK SRVIVARNGGLEV+IG+FDSV DG+RRYLLEILSA+AL+REVRKALISLRGLPFLV+AAR+G
Subjt:  VGNSGALD-----------------------IVTFDKTSRVIVARNGGLEVIIGLFDSVTDGSRRYLLEILSAMALLREVRKALISLRGLPFLVEAARFG

Query:  CMTSRERACQAIGLLATANRGREMLVELGVVPVLIELFREGDYATKLIAGNALGIVSARLAYIRPVARAGAIPLFADLLQLPDPVGKEIAEDAFCLLAVA
        CM SRERACQAIGL+A   RGR MLVELGV+PVLIELF EGDY TKL+AGN+LGIVSA +AYIRPVA+AGAIPLFADLLQ PDP+ KEIAED FCLLAVA
Subjt:  CMTSRERACQAIGLLATANRGREMLVELGVVPVLIELFREGDYATKLIAGNALGIVSARLAYIRPVARAGAIPLFADLLQLPDPVGKEIAEDAFCLLAVA

Query:  EENAVEISDHLVRILKNGDDGAKAAAADVLWDFSSYKYSTSLVDSSGAIPVLVDLLQDGNDEVRAKVSGAIAQLSYNETDRAALADAGAIERLIRLLQDE
        E NAV I DHLVRILK GDD  KAAAADVLWD SSYKYS  +V SSGAIPVLVDLL DGN EVR KVSGA+AQLSY+E DR ALADAGAI  LI LLQDE
Subjt:  EENAVEISDHLVRILKNGDDGAKAAAADVLWDFSSYKYSTSLVDSSGAIPVLVDLLQDGNDEVRAKVSGAIAQLSYNETDRAALADAGAIERLIRLLQDE

Query:  LEELKDNAAEALISFSEDPVYCDRVSEAASTPAFQNMQERITRIRVNERHG--ATRHMGINILTSDSDLL
        L+E+KDNAAEALI+FSED +YC+RVSEA S PAFQN+ ER+T IR  ERHG  + R MGIN LT D DLL
Subjt:  LEELKDNAAEALISFSEDPVYCDRVSEAASTPAFQNMQERITRIRVNERHG--ATRHMGINILTSDSDLL

SwissProt top hits (e-value, % identity, alignment)
O22161 Protein ARABIDILLO 1 | e-value: 1.2e-08 | identity: 22.51%
Query:  VARNGGLEVIIGLFDSVTDGSRRYLLEILSAMALLREVRKALISLRGLPFLVEAARFGCMTSRERACQAIGLLATANRGREMLVELGVVPVLIEL-FREG
        V ++GG+ +++ L  S  +G +    + ++ +++   + K++    G+  L   A+       E A   +  L+     +  + + G V  L++L FR  
Subjt:  VARNGGLEVIIGLFDSVTDGSRRYLLEILSAMALLREVRKALISLRGLPFLVEAARFGCMTSRERACQAIGLLATANRGREMLVELGVVPVLIEL-FREG

Query:  DYATKLI--AGNALGIVSARLAYIRPVARAGAIPLFADLLQLPDPVG-KEIAEDAFCLLAV---AEENAVEIS------DHLVRILKNGDDGAKAAAADV
        +    ++  A  AL  ++A       VA+AG +     L +     G +E A  A   LA    +  N   +       + LV++ K+  +G +  AA  
Subjt:  DYATKLI--AGNALGIVSARLAYIRPVARAGAIPLFADLLQLPDPVG-KEIAEDAFCLLAV---AEENAVEIS------DHLVRILKNGDDGAKAAAADV

Query:  LWDFSSYKYSTSLVDSSGAIPVLVDLLQ---DGNDEVRAKVSGAIAQLSYNETDRAALADAGAIERLIRLLQDELEELKDNAAEALISFSEDPVYCDRVS
        LW+ S    +   +  +G +  LV L Q   + +  ++ + +GA+  LS +E +  A+   G +  LI L + E E++ + AA AL + + +P    R+ 
Subjt:  LWDFSSYKYSTSLVDSSGAIPVLVDLLQ---DGNDEVRAKVSGAIAQLSYNETDRAALADAGAIERLIRLLQDELEELKDNAAEALISFSEDPVYCDRVS

Query:  EAASTPAFQNM
        E    PA  ++
Subjt:  EAASTPAFQNM

Q5VRH9 U-box domain-containing protein 12 | e-value: 4.0e-09 | identity: 28.12%
Query:  LIELFREGDYATKLIAGNALGIVSARLAYIR-PVARAGAIPLFADLLQLPDPVGKEIAEDAFCLLAVAEENAVEISD-----HLVRILKNGDDGAKAAAA
        L+   R G+   +  A   + +++ R    R  +A AGAIPL  +LL   DP  +E A  A   L++ E N   I D      +V +LK G    +  AA
Subjt:  LIELFREGDYATKLIAGNALGIVSARLAYIR-PVARAGAIPLFADLLQLPDPVGKEIAEDAFCLLAVAEENAVEISD-----HLVRILKNGDDGAKAAAA

Query:  DVLWDFSSYKYSTSLVDSSGAIPVLVDLLQDGNDEVRAKVSGAIAQLSYNETDRAALADAGAIERLIRLLQDELEELKDNAAEALISFSEDP
          L+  S    +   + ++GAIP L++LL DG+   +   + AI  L   + ++     AG +  L+  L D    + D A   L   + +P
Subjt:  DVLWDFSSYKYSTSLVDSSGAIPVLVDLLQDGNDEVRAKVSGAIAQLSYNETDRAALADAGAIERLIRLLQDELEELKDNAAEALISFSEDP

Q681N2 U-box domain-containing protein 15 | e-value: 5.5e-11 | identity: 32.08%
Query:  VARAGAIPLFADLLQLPDPVGKEIAEDAFCLLAVAEENAVEISD-----HLVRILKNGDDGAKAAAADVLWDFSSYKYSTSLVDSSGAIPVLVDLLQDGN
        +A AGAIPL   LL  PD   +E A      L++ E N   IS+     +++ IL+NG+  A+  +A  L+  S    +   +  S  IP LVDLLQ G 
Subjt:  VARAGAIPLFADLLQLPDPVGKEIAEDAFCLLAVAEENAVEISD-----HLVRILKNGDDGAKAAAADVLWDFSSYKYSTSLVDSSGAIPVLVDLLQDGN

Query:  DEVRAKVSGAIAQLSYNETDRAALADAGAIERLIRLLQDELEELKDNAAEALISFSEDP
           +     A+  LS N  ++    DAG ++ L+ LL+D+   + D A   L+  +  P
Subjt:  DEVRAKVSGAIAQLSYNETDRAALADAGAIERLIRLLQDELEELKDNAAEALISFSEDP

Q8VZ40 U-box domain-containing protein 14 | e-value: 1.9e-11 | identity: 27.5%
Query:  CQAIGLLATANRGREMLVELG----------VVPVLIELFREGDYATKLIAGNALGIVSARLAYIRP-VARAGAIPLFADLLQLPDPVGKEIAEDAFCLL
        C++ G+    N+G     ++G           V  L+E    G    +  A   L +++ R    R  +A AGAIPL  +LL  PDP  +E +  A   L
Subjt:  CQAIGLLATANRGREMLVELG----------VVPVLIELFREGDYATKLIAGNALGIVSARLAYIRP-VARAGAIPLFADLLQLPDPVGKEIAEDAFCLL

Query:  AVAEENAVEISD-----HLVRILKNGDDGAKAAAADVLWDFSSYKYSTSLVDSSGAIPVLVDLLQDGNDEVRAKVSGAIAQLSYNETDRAALADAGAIER
        ++ E N   I D      +V +LKNG   A+  AA  L+  S    +   + ++GAI  L+ LL++G    +   + AI  L   + +++     G ++ 
Subjt:  AVAEENAVEISD-----HLVRILKNGDDGAKAAAADVLWDFSSYKYSTSLVDSSGAIPVLVDLLQDGNDEVRAKVSGAIAQLSYNETDRAALADAGAIER

Query:  LIRLLQDELEELKDNAAEALISFSEDPVYCDRVSEAASTP
        L RLL+D    + D A   L   S +      ++EA S P
Subjt:  LIRLLQDELEELKDNAAEALISFSEDPVYCDRVSEAASTP

Q9SNC6 U-box domain-containing protein 13 | e-value: 9.7e-08 | identity: 30.82%
Query:  VARAGAIPLFADLLQLPDPVGKEIAEDAFCLLAVAEENAVEISD-----HLVRILKNGDDGAKAAAADVLWDFSSYKYSTSLVDSSGAIPVLVDLLQDGN
        +A AGAIPL   LL  PD   +E +  A   L++ E N   I        +V++LK G   A+  AA  L+  S    +   + + GAIP LV LL +G 
Subjt:  VARAGAIPLFADLLQLPDPVGKEIAEDAFCLLAVAEENAVEISD-----HLVRILKNGDDGAKAAAADVLWDFSSYKYSTSLVDSSGAIPVLVDLLQDGN

Query:  DEVRAKVSGAIAQLSYNETDRAALADAGAIERLIRLLQDELEELKDNAAEALISFSEDP
           +   + A+  L   + ++     AG I  L RLL +    + D A   L   S  P
Subjt:  DEVRAKVSGAIAQLSYNETDRAALADAGAIERLIRLLQDELEELKDNAAEALISFSEDP

Arabidopsis top hits (e-value, % identity, alignment)
AT2G44900.1 ARABIDILLO-1 | e-value: 8.2e-10 | identity: 22.51%
Query:  VARNGGLEVIIGLFDSVTDGSRRYLLEILSAMALLREVRKALISLRGLPFLVEAARFGCMTSRERACQAIGLLATANRGREMLVELGVVPVLIEL-FREG
        V ++GG+ +++ L  S  +G +    + ++ +++   + K++    G+  L   A+       E A   +  L+     +  + + G V  L++L FR  
Subjt:  VARNGGLEVIIGLFDSVTDGSRRYLLEILSAMALLREVRKALISLRGLPFLVEAARFGCMTSRERACQAIGLLATANRGREMLVELGVVPVLIEL-FREG

Query:  DYATKLI--AGNALGIVSARLAYIRPVARAGAIPLFADLLQLPDPVG-KEIAEDAFCLLAV---AEENAVEIS------DHLVRILKNGDDGAKAAAADV
        +    ++  A  AL  ++A       VA+AG +     L +     G +E A  A   LA    +  N   +       + LV++ K+  +G +  AA  
Subjt:  DYATKLI--AGNALGIVSARLAYIRPVARAGAIPLFADLLQLPDPVG-KEIAEDAFCLLAV---AEENAVEIS------DHLVRILKNGDDGAKAAAADV

Query:  LWDFSSYKYSTSLVDSSGAIPVLVDLLQ---DGNDEVRAKVSGAIAQLSYNETDRAALADAGAIERLIRLLQDELEELKDNAAEALISFSEDPVYCDRVS
        LW+ S    +   +  +G +  LV L Q   + +  ++ + +GA+  LS +E +  A+   G +  LI L + E E++ + AA AL + + +P    R+ 
Subjt:  LWDFSSYKYSTSLVDSSGAIPVLVDLLQ---DGNDEVRAKVSGAIAQLSYNETDRAALADAGAIERLIRLLQDELEELKDNAAEALISFSEDPVYCDRVS

Query:  EAASTPAFQNM
        E    PA  ++
Subjt:  EAASTPAFQNM

AT3G20170.1 ARM repeat superfamily protein | e-value: 1.1e-107 | identity: 47.48%
Query:  MNHSQVHEKSKR-PDVDWESALKHYENVMASESEAVKVKATVKLAHLAKTVPEDVLNSAIAIIAEHLEVNPINNSSQSMKGAAAYCLRCISCQG--DGAL
        M  S+   +S+   + DWE+    +EN ++S S +++V++ +KL+ L   VPE  ++ AI I+A  L V+  ++S++S++ AAA+CL+CI+C G  +   
Subjt:  MNHSQVHEKSKR-PDVDWESALKHYENVMASESEAVKVKATVKLAHLAKTVPEDVLNSAIAIIAEHLEVNPINNSSQSMKGAAAYCLRCISCQG--DGAL

Query:  AAAVGNSGAL--------------------------DIVTFDKTSRVIVARNGGLEVII-GLFDSVTDGSRRYLLEILSAMALLREVRKALISLRGLPFL
        A  +G  G +                           +VTF  + RV +AR GGLE++I  L +   DGSR YLLEILSA+  +RE R+ L+   GL FL
Subjt:  AAAVGNSGAL--------------------------DIVTFDKTSRVIVARNGGLEVII-GLFDSVTDGSRRYLLEILSAMALLREVRKALISLRGLPFL

Query:  VEAARFGCMTSRERACQAIGLLATANRGREMLVELGVVPVLIELFREGDYATKLIAGNALGIVSARLAYIRPVARAGAIPLFADLLQLPDPVGKEIAEDA
        VEAA+ G + SRERAC AIGL+    R R +LVE GV+P L++L+R+GD   KL+AGNALGI+SA+  YIRPV  AG+IPL+ +LL   DP+GK+IAED 
Subjt:  VEAARFGCMTSRERACQAIGLLATANRGREMLVELGVVPVLIELFREGDYATKLIAGNALGIVSARLAYIRPVARAGAIPLFADLLQLPDPVGKEIAEDA

Query:  FCLLAVAEENAVEISDHLVRILKNGDDGAKAAAADVLWDFSSYKYSTSLVDSSGAIPVLVDLLQDGNDEVRAKVSGAIAQLSYNETDRAALADAGAIERL
        FC+LAVAE NAV I++ LVRIL+ GD+ AK AA+DVLWD + Y++S S++  SGAIP+L++LL+DG+ E R ++SGAI+QLSYNE DR A +D+G I  L
Subjt:  FCLLAVAEENAVEISDHLVRILKNGDDGAKAAAADVLWDFSSYKYSTSLVDSSGAIPVLVDLLQDGNDEVRAKVSGAIAQLSYNETDRAALADAGAIERL

Query:  IRLLQDELEELKDNAAEALISFSEDPVYCDRVSEAASTPAFQNMQERITRIRVNER--HGATRHMGINILTSDSDL
        I  L DE EEL+DNAAEALI+FSED  +  RV EA   P FQ+MQ R+ RIR +      + R + I  L  D DL
Subjt:  IRLLQDELEELKDNAAEALISFSEDPVYCDRVSEAASTPAFQNMQERITRIRVNER--HGATRHMGINILTSDSDL

AT3G46510.1 plant U-box 13 | e-value: 6.9e-09 | identity: 30.82%
Query:  VARAGAIPLFADLLQLPDPVGKEIAEDAFCLLAVAEENAVEISD-----HLVRILKNGDDGAKAAAADVLWDFSSYKYSTSLVDSSGAIPVLVDLLQDGN
        +A AGAIPL   LL  PD   +E +  A   L++ E N   I        +V++LK G   A+  AA  L+  S    +   + + GAIP LV LL +G 
Subjt:  VARAGAIPLFADLLQLPDPVGKEIAEDAFCLLAVAEENAVEISD-----HLVRILKNGDDGAKAAAADVLWDFSSYKYSTSLVDSSGAIPVLVDLLQDGN

Query:  DEVRAKVSGAIAQLSYNETDRAALADAGAIERLIRLLQDELEELKDNAAEALISFSEDP
           +   + A+  L   + ++     AG I  L RLL +    + D A   L   S  P
Subjt:  DEVRAKVSGAIAQLSYNETDRAALADAGAIERLIRLLQDELEELKDNAAEALISFSEDP

AT3G54850.1 plant U-box 14 | e-value: 1.3e-12 | identity: 27.5%
Query:  CQAIGLLATANRGREMLVELG----------VVPVLIELFREGDYATKLIAGNALGIVSARLAYIRP-VARAGAIPLFADLLQLPDPVGKEIAEDAFCLL
        C++ G+    N+G     ++G           V  L+E    G    +  A   L +++ R    R  +A AGAIPL  +LL  PDP  +E +  A   L
Subjt:  CQAIGLLATANRGREMLVELG----------VVPVLIELFREGDYATKLIAGNALGIVSARLAYIRP-VARAGAIPLFADLLQLPDPVGKEIAEDAFCLL

Query:  AVAEENAVEISD-----HLVRILKNGDDGAKAAAADVLWDFSSYKYSTSLVDSSGAIPVLVDLLQDGNDEVRAKVSGAIAQLSYNETDRAALADAGAIER
        ++ E N   I D      +V +LKNG   A+  AA  L+  S    +   + ++GAI  L+ LL++G    +   + AI  L   + +++     G ++ 
Subjt:  AVAEENAVEISD-----HLVRILKNGDDGAKAAAADVLWDFSSYKYSTSLVDSSGAIPVLVDLLQDGNDEVRAKVSGAIAQLSYNETDRAALADAGAIER

Query:  LIRLLQDELEELKDNAAEALISFSEDPVYCDRVSEAASTP
        L RLL+D    + D A   L   S +      ++EA S P
Subjt:  LIRLLQDELEELKDNAAEALISFSEDPVYCDRVSEAASTP

AT5G42340.1 Plant U-Box 15 | e-value: 3.9e-12 | identity: 32.08%
Query:  VARAGAIPLFADLLQLPDPVGKEIAEDAFCLLAVAEENAVEISD-----HLVRILKNGDDGAKAAAADVLWDFSSYKYSTSLVDSSGAIPVLVDLLQDGN
        +A AGAIPL   LL  PD   +E A      L++ E N   IS+     +++ IL+NG+  A+  +A  L+  S    +   +  S  IP LVDLLQ G 
Subjt:  VARAGAIPLFADLLQLPDPVGKEIAEDAFCLLAVAEENAVEISD-----HLVRILKNGDDGAKAAAADVLWDFSSYKYSTSLVDSSGAIPVLVDLLQDGN

Query:  DEVRAKVSGAIAQLSYNETDRAALADAGAIERLIRLLQDELEELKDNAAEALISFSEDP
           +     A+  LS N  ++    DAG ++ L+ LL+D+   + D A   L+  +  P
Subjt:  DEVRAKVSGAIAQLSYNETDRAALADAGAIERLIRLLQDELEELKDNAAEALISFSEDP
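
In the BLAST-style alignment blocks above, the unlabeled middle line marks each column: a residue letter for an identical position, `+` for a conservative substitution, and a space for a mismatch or gap. The following is a minimal sketch of counting those marks for one block; `midline_counts` and `percent_identity` are hypothetical helper names, and the value for a single block will generally differ from the whole-hit identity listed next to each hit.

```python
def midline_counts(midline: str) -> tuple[int, int]:
    """Return (identities, positives) for one alignment midline.

    Letters mark identical residues; '+' marks conservative substitutions.
    Positives include identities, following the usual BLAST convention.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives


def percent_identity(midline: str, aligned_length: int) -> float:
    """Identities as a percentage of the aligned block length."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    identities, _ = midline_counts(midline)
    return 100.0 * identities / aligned_length


# Last block of the ARABIDILLO alignment: query 'EAASTPAFQNM' (11 columns).
midline = "E    PA  ++"
print(midline_counts(midline))  # (3, 5)
```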


Sequences
CDS sequence
ATGAATCACTCACAGGTTCATGAAAAATCTAAAAGGCCGGATGTAGATTGGGAATCAGCGCTGAAACATTATGAAAACGTTATGGCTTCAGAATCCGAGGCTGTTAAAGT
GAAAGCTACAGTCAAATTGGCCCATCTCGCCAAAACTGTACCCGAGGATGTTTTGAACAGCGCCATAGCCATTATTGCCGAGCATCTCGAAGTTAATCCCATCAATAATT
CAAGCCAGTCCATGAAAGGAGCTGCTGCATATTGCCTGAGATGTATATCTTGTCAAGGTGATGGTGCGTTGGCGGCGGCGGTTGGTAATTCCGGCGCTTTAGATATTGTT
ACTTTTGATAAAACTAGTCGTGTGATTGTAGCAAGAAATGGGGGTTTAGAAGTTATTATTGGTTTGTTTGATTCTGTAACTGATGGCAGTAGAAGATATTTATTGGAGAT
TTTGAGTGCAATGGCACTATTGAGAGAGGTTAGGAAGGCCCTTATCAGTTTAAGAGGCCTCCCTTTTCTTGTGGAAGCTGCAAGATTTGGCTGCATGACCTCTAGAGAAA
GAGCTTGCCAAGCAATTGGGTTGCTTGCAACTGCGAACCGTGGAAGAGAAATGCTTGTTGAGTTGGGAGTGGTTCCAGTGCTTATTGAGCTATTTCGTGAAGGAGATTAT
GCGACAAAACTTATAGCTGGTAATGCTCTTGGAATTGTTTCAGCTCGTCTGGCCTATATTAGGCCTGTTGCACGAGCTGGGGCGATCCCGTTATTTGCTGATCTTCTTCA
GTTGCCTGACCCTGTTGGTAAGGAGATTGCAGAGGATGCCTTCTGTCTCTTAGCTGTTGCCGAAGAGAATGCGGTTGAAATTTCCGATCATCTAGTGAGAATTCTTAAAA
ATGGTGATGATGGAGCAAAGGCTGCAGCTGCTGATGTTTTGTGGGATTTTTCAAGCTATAAGTATTCCACTTCACTTGTGGATAGTTCAGGTGCCATTCCAGTTTTGGTG
GATCTATTACAGGACGGGAATGATGAGGTAAGGGCAAAAGTCTCTGGAGCAATAGCCCAGTTAAGTTATAATGAGACAGACAGAGCAGCACTTGCTGATGCAGGGGCAAT
CGAACGACTAATACGGCTATTGCAAGATGAGTTAGAAGAATTGAAGGATAATGCTGCTGAGGCGCTTATAAGTTTTTCTGAAGACCCCGTATACTGTGATAGAGTATCCG
AAGCAGCAAGCACTCCTGCTTTCCAAAACATGCAGGAAAGAATAACCCGTATTCGCGTAAATGAAAGGCACGGAGCAACGCGTCATATGGGAATCAACATACTTACAAGT
GATTCTGATCTTCTTTAA
mRNA sequence
ATGAATCACTCACAGGTTCATGAAAAATCTAAAAGGCCGGATGTAGATTGGGAATCAGCGCTGAAACATTATGAAAACGTTATGGCTTCAGAATCCGAGGCTGTTAAAGT
GAAAGCTACAGTCAAATTGGCCCATCTCGCCAAAACTGTACCCGAGGATGTTTTGAACAGCGCCATAGCCATTATTGCCGAGCATCTCGAAGTTAATCCCATCAATAATT
CAAGCCAGTCCATGAAAGGAGCTGCTGCATATTGCCTGAGATGTATATCTTGTCAAGGTGATGGTGCGTTGGCGGCGGCGGTTGGTAATTCCGGCGCTTTAGATATTGTT
ACTTTTGATAAAACTAGTCGTGTGATTGTAGCAAGAAATGGGGGTTTAGAAGTTATTATTGGTTTGTTTGATTCTGTAACTGATGGCAGTAGAAGATATTTATTGGAGAT
TTTGAGTGCAATGGCACTATTGAGAGAGGTTAGGAAGGCCCTTATCAGTTTAAGAGGCCTCCCTTTTCTTGTGGAAGCTGCAAGATTTGGCTGCATGACCTCTAGAGAAA
GAGCTTGCCAAGCAATTGGGTTGCTTGCAACTGCGAACCGTGGAAGAGAAATGCTTGTTGAGTTGGGAGTGGTTCCAGTGCTTATTGAGCTATTTCGTGAAGGAGATTAT
GCGACAAAACTTATAGCTGGTAATGCTCTTGGAATTGTTTCAGCTCGTCTGGCCTATATTAGGCCTGTTGCACGAGCTGGGGCGATCCCGTTATTTGCTGATCTTCTTCA
GTTGCCTGACCCTGTTGGTAAGGAGATTGCAGAGGATGCCTTCTGTCTCTTAGCTGTTGCCGAAGAGAATGCGGTTGAAATTTCCGATCATCTAGTGAGAATTCTTAAAA
ATGGTGATGATGGAGCAAAGGCTGCAGCTGCTGATGTTTTGTGGGATTTTTCAAGCTATAAGTATTCCACTTCACTTGTGGATAGTTCAGGTGCCATTCCAGTTTTGGTG
GATCTATTACAGGACGGGAATGATGAGGTAAGGGCAAAAGTCTCTGGAGCAATAGCCCAGTTAAGTTATAATGAGACAGACAGAGCAGCACTTGCTGATGCAGGGGCAAT
CGAACGACTAATACGGCTATTGCAAGATGAGTTAGAAGAATTGAAGGATAATGCTGCTGAGGCGCTTATAAGTTTTTCTGAAGACCCCGTATACTGTGATAGAGTATCCG
AAGCAGCAAGCACTCCTGCTTTCCAAAACATGCAGGAAAGAATAACCCGTATTCGCGTAAATGAAAGGCACGGAGCAACGCGTCATATGGGAATCAACATACTTACAAGT
GATTCTGATCTTCTTTAA
Protein sequence
MNHSQVHEKSKRPDVDWESALKHYENVMASESEAVKVKATVKLAHLAKTVPEDVLNSAIAIIAEHLEVNPINNSSQSMKGAAAYCLRCISCQGDGALAAAVGNSGALDIV
TFDKTSRVIVARNGGLEVIIGLFDSVTDGSRRYLLEILSAMALLREVRKALISLRGLPFLVEAARFGCMTSRERACQAIGLLATANRGREMLVELGVVPVLIELFREGDY
ATKLIAGNALGIVSARLAYIRPVARAGAIPLFADLLQLPDPVGKEIAEDAFCLLAVAEENAVEISDHLVRILKNGDDGAKAAAADVLWDFSSYKYSTSLVDSSGAIPVLV
DLLQDGNDEVRAKVSGAIAQLSYNETDRAALADAGAIERLIRLLQDELEELKDNAAEALISFSEDPVYCDRVSEAASTPAFQNMQERITRIRVNERHGATRHMGINILTS
DSDLL
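
The protein sequence can be cross-checked against the CDS by translating it codon by codon. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1) and an in-frame CDS; the record does not state which table was used, and `translate` is a hypothetical helper name introduced here for illustration.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino = CODON_TABLE[cds[i:i + 3]]  # KeyError on non-ACGT bases
        if amino == "*":
            break
        protein.append(amino)
    return "".join(protein)


# First 30 and last 18 nucleotides of the CDS listed in this record.
print(translate("ATGAATCACTCACAGGTTCATGAAAAATCT"))  # MNHSQVHEKS
print(translate("GATTCTGATCTTCTTTAA"))              # DSDLL
```

Pasting the full CDS into `translate` and comparing the result with the protein sequence above is a quick consistency check on a downloaded record.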