; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg018311 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg018311
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionU-box domain-containing protein 4
Genome locationscaffold3:9464720..9466134
RNA-Seq ExpressionSpg018311
SyntenySpg018311
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR000225 - Armadillo
IPR011989 - Armadillo-like helical
IPR016024 - Armadillo-type fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6592081.1 U-box domain-containing protein 12, partial [Cucurbita argyrosperma subsp. sororia]1.4e-18175.11Show/hide
Query:  MGHSQIQRESAKSSLDWKAALKRYENVMASESEAIKVKATVKLAHLSKKAPENILNSTIPIIAKHLEGNLIDNSSHSVKEAAAYCLRCISCQGDGTLATA
        MGHSQI R S K +LDW+AALK+YENVMA+ESEA+KVKAT+KLA LS  AP NILNSTIPIIAKHLE N I NSS S++ AAAYCL+ IS QGDGTLATA
Subjt:  MGHSQIQRESAKSSLDWKAALKRYENVMASESEAIKVKATVKLAHLSKKAPENILNSTIPIIAKHLEGNLIDNSSHSVKEAAAYCLRCISCQGDGTLATA

Query:  VGYSGALQSLLRLLPHSRGCFCKILVKCVWSIVTFDKTSRVIIARNGGLEVIIGMFDSVIDGTRRYLLEILSAMALLREVRKALISLRGLHFLMQAARCG
        V +SGAL+S+LR LP + G F KILVKC+WSIVT DK SRVI+ARNGGLEV+IGMFDSV+DGTRRYLLEILSA+AL+REVRKALISLRGL FL+QAAR G
Subjt:  VGYSGALQSLLRLLPHSRGCFCKILVKCVWSIVTFDKTSRVIIARNGGLEVIIGMFDSVIDGTRRYLLEILSAMALLREVRKALISLRGLHFLMQAARCG

Query:  CMASRERACQAIGLLAISKRGRHMLVELGVVSVLIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQAGVIPLFVDLLQCPDLIGKEIAEDVFCLLAVA
        CMASRERACQAIGL+AI++RGRHMLVELGV+ VLIELF EGD VTKLV+GN+LGIVSAHVAYIRPVAQAG IPLF DLLQ P+ I KEIAEDVFCLLAVA
Subjt:  CMASRERACQAIGLLAISKRGRHMLVELGVVSVLIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQAGVIPLFVDLLQCPDLIGKEIAEDVFCLLAVA

Query:  EANAVVISDHLVRILK-GDDGAKVAAADVLWDLSGYKYSISVVRSSGAIPVFVDLLLDGNDEVREKVSRAIAQLSYNETDRVALADAGAVEILTG-----
        EANAV+I DHLVRILK GDD  K AAADVLWDLS YKYSI +V+SSGAIPV VDLL DGN EVREKVS A+AQLSY+E DR+ALADAGA+  L G     
Subjt:  EANAVVISDHLVRILK-GDDGAKVAAADVLWDLSGYKYSISVVRSSGAIPVFVDLLLDGNDEVREKVSRAIAQLSYNETDRVALADAGAVEILTG-----

Query:  ---------------QGYPLYCDRVSEAVSTPAFQNMQERMTHIHATERYTTRSLRQMGIDQLTRDPDLL
                           LYC+RVSEA+S PAFQN+ ERMTHI ATER+  RSLRQMGI+QLT DPDLL
Subjt:  ---------------QGYPLYCDRVSEAVSTPAFQNMQERMTHIHATERYTTRSLRQMGIDQLTRDPDLL

XP_022139563.1 uncharacterized protein LOC111010437 [Momordica charantia]1.2e-18076.22Show/hide
Query:  MGHSQIQRESAKSSLDWKAALKRYENVMASESEAIKVKATVKLAHLSKKAPENILNSTIPIIAKHLEGNLIDNSSHSVKEAAAYCLRCISCQG-DGTLAT
        M  S+I++ES K +L+W+ AL++Y+NVMASESEA+KVKATVKLA+LS+ APENILNSTIPIIAKHL  N I NSS S++ AAAYCLR ISC+G DGTLA 
Subjt:  MGHSQIQRESAKSSLDWKAALKRYENVMASESEAIKVKATVKLAHLSKKAPENILNSTIPIIAKHLEGNLIDNSSHSVKEAAAYCLRCISCQG-DGTLAT

Query:  AVGYSGALQSLLRLLPHSRGCFCKILVKCVWSIVTFDKTSRVIIARNGGLEVIIGMFDSVIDGTRRYLLEILSAMALLREVRKALISLRGLHFLMQAARC
        AVG SGAL+SLLRLLP+S G F KILVKCVWS+VTF KTSRVIIARNGGLEVIIGM DSVID +RRYLLEILSA+ALLREVRKALISLRGL FL++AARC
Subjt:  AVGYSGALQSLLRLLPHSRGCFCKILVKCVWSIVTFDKTSRVIIARNGGLEVIIGMFDSVIDGTRRYLLEILSAMALLREVRKALISLRGLHFLMQAARC

Query:  GCMASRERACQAIGLLAISKRGRHMLVELGVVSVLIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQAGVIPLFVDLLQCPD-LIGKEIAEDVFCLLA
        GC+ASRERACQAIGLLAI+KRGR ML ELGVV VLIELFR GDC TKLVAGN+LGIVSAHVAYIRPVAQAG IPLF DLLQ P+  IGKEIAEDVFCLLA
Subjt:  GCMASRERACQAIGLLAISKRGRHMLVELGVVSVLIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQAGVIPLFVDLLQCPD-LIGKEIAEDVFCLLA

Query:  VAEANAVVISDHLVRILK-GDDGAKVAAADVLWDLSGYKYSISVVRSSGAIPVFVDLLLDGNDEVREKVSRAIAQLSYNETDRVALADAGAVE-------
        VAE NAVVISDHLVRILK GDD AK AAADVLWDLSGYKYSIS   +SGAIPV VDLL D N+EVREKVS AIAQLSYNE DR ALADAGA+E       
Subjt:  VAEANAVVISDHLVRILK-GDDGAKVAAADVLWDLSGYKYSISVVRSSGAIPVFVDLLLDGNDEVREKVSRAIAQLSYNETDRVALADAGAVE-------

Query:  -------------ILTGQGYPLYCDRVSEAVSTPAFQNMQERMTHIHATERYTTRSLRQMGIDQLTRDPDL
                     ++       YC RVSEAVSTPAF+NMQERMTHI  TE+++ RSLRQMGIDQLTRDPDL
Subjt:  -------------ILTGQGYPLYCDRVSEAVSTPAFQNMQERMTHIHATERYTTRSLRQMGIDQLTRDPDL

XP_022936629.1 uncharacterized protein LOC111443172 [Cucurbita moschata]6.4e-18275.32Show/hide
Query:  MGHSQIQRESAKSSLDWKAALKRYENVMASESEAIKVKATVKLAHLSKKAPENILNSTIPIIAKHLEGNLIDNSSHSVKEAAAYCLRCISCQGDGTLATA
        MGHSQI R S K +LDW+AALK+YENVMA+ESEA+KVKAT+KLA LS  AP NILNSTIPIIAKHLE N I NSS S++ AAAYCL+ IS QGDGTLATA
Subjt:  MGHSQIQRESAKSSLDWKAALKRYENVMASESEAIKVKATVKLAHLSKKAPENILNSTIPIIAKHLEGNLIDNSSHSVKEAAAYCLRCISCQGDGTLATA

Query:  VGYSGALQSLLRLLPHSRGCFCKILVKCVWSIVTFDKTSRVIIARNGGLEVIIGMFDSVIDGTRRYLLEILSAMALLREVRKALISLRGLHFLMQAARCG
        V +SGAL+S+LR LP + G F KILVKC+WSIVT DK SRVI+ARNGGLEV+IGMFDSV+DGTRRYLLEILSA+AL+REVRKALISLRGL FL+QAAR G
Subjt:  VGYSGALQSLLRLLPHSRGCFCKILVKCVWSIVTFDKTSRVIIARNGGLEVIIGMFDSVIDGTRRYLLEILSAMALLREVRKALISLRGLHFLMQAARCG

Query:  CMASRERACQAIGLLAISKRGRHMLVELGVVSVLIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQAGVIPLFVDLLQCPDLIGKEIAEDVFCLLAVA
        CMASRERACQAIGL+AI++RGRHMLVELGV+ VLIELF EGD VTKLVAGN+LGIVSAHVAYIRPVAQAG IPLF DLLQ P+ I KEIAEDVFCLLAVA
Subjt:  CMASRERACQAIGLLAISKRGRHMLVELGVVSVLIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQAGVIPLFVDLLQCPDLIGKEIAEDVFCLLAVA

Query:  EANAVVISDHLVRILK-GDDGAKVAAADVLWDLSGYKYSISVVRSSGAIPVFVDLLLDGNDEVREKVSRAIAQLSYNETDRVALADAGAVEILTG-----
        EANAV+I DHLVRILK GDD  K AAADVLWDLS YKYSI +V+SSGAIPV VDLL DGN EVREKVS A+AQLSY+E DR+ALADAGA+  L G     
Subjt:  EANAVVISDHLVRILK-GDDGAKVAAADVLWDLSGYKYSISVVRSSGAIPVFVDLLLDGNDEVREKVSRAIAQLSYNETDRVALADAGAVEILTG-----

Query:  ---------------QGYPLYCDRVSEAVSTPAFQNMQERMTHIHATERYTTRSLRQMGIDQLTRDPDLL
                           LYC+RVSEA+S PAFQN+ ERMTHI ATER+  RSLRQMGI+QLT DPDLL
Subjt:  ---------------QGYPLYCDRVSEAVSTPAFQNMQERMTHIHATERYTTRSLRQMGIDQLTRDPDLL

XP_022976328.1 uncharacterized protein LOC111476761 [Cucurbita maxima]7.1e-18174.47Show/hide
Query:  MGHSQIQRESAKSSLDWKAALKRYENVMASESEAIKVKATVKLAHLSKKAPENILNSTIPIIAKHLEGNLIDNSSHSVKEAAAYCLRCISCQGDGTLATA
        M HS+I+R S K +LDW+AALK+YENVMA+ESEA+KVKAT+KLA LS  AP NILNSTIPIIAKHLE N I NSS S++ AAAYCL+ IS QGDGTLATA
Subjt:  MGHSQIQRESAKSSLDWKAALKRYENVMASESEAIKVKATVKLAHLSKKAPENILNSTIPIIAKHLEGNLIDNSSHSVKEAAAYCLRCISCQGDGTLATA

Query:  VGYSGALQSLLRLLPHSRGCFCKILVKCVWSIVTFDKTSRVIIARNGGLEVIIGMFDSVIDGTRRYLLEILSAMALLREVRKALISLRGLHFLMQAARCG
        + +SGAL+S+LR LP + G F KILVKC+WSIVT DK SRVI+ARNGGLEV+IGMFDSV+DGTRRYLLEILSA+AL+REVRKALISLRGL FL+QAAR G
Subjt:  VGYSGALQSLLRLLPHSRGCFCKILVKCVWSIVTFDKTSRVIIARNGGLEVIIGMFDSVIDGTRRYLLEILSAMALLREVRKALISLRGLHFLMQAARCG

Query:  CMASRERACQAIGLLAISKRGRHMLVELGVVSVLIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQAGVIPLFVDLLQCPDLIGKEIAEDVFCLLAVA
        CMASRERACQAIGL+AI++RGRHMLVELGV+ VLIELF EGD VTKLVAGN+LGIVSAHVAYIRPVAQAG IPLF DLLQ PD I KEIAEDVFCLLAVA
Subjt:  CMASRERACQAIGLLAISKRGRHMLVELGVVSVLIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQAGVIPLFVDLLQCPDLIGKEIAEDVFCLLAVA

Query:  EANAVVISDHLVRILK-GDDGAKVAAADVLWDLSGYKYSISVVRSSGAIPVFVDLLLDGNDEVREKVSRAIAQLSYNETDRVALADAGAVE---------
        EANAV+I DHLVRILK GDD  K AAADVLWDLS YKYSI +V+SSGAIPV VDLL DGN EVREKVS A+AQLSY+E DR+ALADAGA+          
Subjt:  EANAVVISDHLVRILK-GDDGAKVAAADVLWDLSGYKYSISVVRSSGAIPVFVDLLLDGNDEVREKVSRAIAQLSYNETDRVALADAGAVE---------

Query:  -----------ILTGQGYPLYCDRVSEAVSTPAFQNMQERMTHIHATERYTTRSLRQMGIDQLTRDPDLL
                   ++      LYC+RVSEA+S PAFQN+ ERMTHI ATER+  RSLRQMGI+QLT DPDLL
Subjt:  -----------ILTGQGYPLYCDRVSEAVSTPAFQNMQERMTHIHATERYTTRSLRQMGIDQLTRDPDLL

XP_038897634.1 protein spotted leaf 11 [Benincasa hispida]1.5e-19479.79Show/hide
Query:  MGHSQIQRESAKSSLDWKAALKRYENVMASESEAIKVKATVKLAHLSKKAPENILNSTIPIIAKHLEGNLIDNSSHSVKEAAAYCLRCISCQGDGTLATA
        MGHSQI R S K +LDW+AAL +YENVMASESEA+KVKAT+KLAHLSK APENILNS IP IAKHLE N   NSS S++ AAAYCLRCISCQGDGTLA A
Subjt:  MGHSQIQRESAKSSLDWKAALKRYENVMASESEAIKVKATVKLAHLSKKAPENILNSTIPIIAKHLEGNLIDNSSHSVKEAAAYCLRCISCQGDGTLATA

Query:  VGYSGALQSLLRLLPHSRGCFCKILVKCVWSIVTFDKTSRVIIARNGGLEVIIGMFDSVIDGTRRYLLEILSAMALLREVRKALISLRGLHFLMQAARCG
        VG+SGAL+SLLR LPHS GCF KILVKCVWSIVTFD+TSRVIIARNGGLEVIIGMFDSV DGTRRYLLEILSAMALLREVRKALISLRGL FL+QAAR G
Subjt:  VGYSGALQSLLRLLPHSRGCFCKILVKCVWSIVTFDKTSRVIIARNGGLEVIIGMFDSVIDGTRRYLLEILSAMALLREVRKALISLRGLHFLMQAARCG

Query:  CMASRERACQAIGLLAISKRGRHMLVELGVVSVLIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQAGVIPLFVDLLQCPDLIGKEIAEDVFCLLAVA
        CMASRERACQAIGLLAI+KRGR MLVELGVV VLIEL REGD VTKLVAGN+LG+VSAHV YIRP+AQAG IPLF +LLQ PD IGKEIAEDVFCLLAVA
Subjt:  CMASRERACQAIGLLAISKRGRHMLVELGVVSVLIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQAGVIPLFVDLLQCPDLIGKEIAEDVFCLLAVA

Query:  EANAVVISDHLVRILK-GDDGAKVAAADVLWDLSGYKYSISVVRSSGAIPVFVDLLLDGNDEVREKVSRAIAQLSYNETDRVALADAGAVEILTG-----
        EANAVVISDHLVRILK GD G K AAADVLWDLS YKYSISVV++SGAIPV VDLL DGNDEVREKVS AIAQLSYNETDRVALADAGA++ L G     
Subjt:  EANAVVISDHLVRILK-GDDGAKVAAADVLWDLSGYKYSISVVRSSGAIPVFVDLLLDGNDEVREKVSRAIAQLSYNETDRVALADAGAVEILTG-----

Query:  ---------------QGYPLYCDRVSEAVSTPAFQNMQERMTHIHATERYTTRSLRQMGIDQLTRDPDLL
                          PLYC RVSEAVSTPAFQN+QERM HI A ER + RSL QMGI+Q T DPDLL
Subjt:  ---------------QGYPLYCDRVSEAVSTPAFQNMQERMTHIHATERYTTRSLRQMGIDQLTRDPDLL

TrEMBL top hitse value%identityAlignment
A0A0A0LGL6 Uncharacterized protein1.2e-17072.55Show/hide
Query:  MGHSQIQRESAKSSLDWKAALKRYENVMASESEAIKVKATVKLAHLSKKAPENILNSTIPIIAKHLEGNLIDNSSHSVKEAAAYCLRCISCQGDGTLATA
        MG  +I R S K +LDW+A+LK+YENVMASESEA+KVKAT+KLA LSK APENIL S IPIIAK LE N  +N+S S++ AAAYCLRCISC+GDGTLATA
Subjt:  MGHSQIQRESAKSSLDWKAALKRYENVMASESEAIKVKATVKLAHLSKKAPENILNSTIPIIAKHLEGNLIDNSSHSVKEAAAYCLRCISCQGDGTLATA

Query:  VGYSGALQSLLRLLPHSRGCFCKILVKCVWSIVTFDKTSRVIIARNGGLEVIIGMFDSVIDGTRRYLLEILSAMALLREVRKALISLRGLHFLMQAARCG
        VG+SGAL+SL++ L HS GCFCKILVKC+WSIVTFD++SRVIIARNGGLEVII M   VIDGTRRYLLEILSAMALLREVRKALI  RGL FL+QAAR G
Subjt:  VGYSGALQSLLRLLPHSRGCFCKILVKCVWSIVTFDKTSRVIIARNGGLEVIIGMFDSVIDGTRRYLLEILSAMALLREVRKALISLRGLHFLMQAARCG

Query:  CMASRERACQAIGLLAISKRGRHMLVELGVVSVLIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQAGVIPLFVDLLQCPDLIGKEIAEDVFCLLAVA
        CMASRERAC+AIGL+AI+KRGR+ LVELGVV VLIEL REGD VTKLVAGN LGIVSAH+AYIRPVAQAG IPLF DLLQ  D I KEIAEDVFCLLAVA
Subjt:  CMASRERACQAIGLLAISKRGRHMLVELGVVSVLIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQAGVIPLFVDLLQCPDLIGKEIAEDVFCLLAVA

Query:  EANAVVISDHLVRILK-GDDGAKVAAADVLWDLSGYKYSISVVRSSGAIPVFVDLLLDGNDEVREKVSRAIAQLSYNETDRVALADAGAVEILTG-----
        E NAVVI DHLV++LK GDD  K AAADVL  LS YKYSISVV++SGAIPV VDLL DGN EVREKVS AIA+LS  ETDRVALADAGA++ L G     
Subjt:  EANAVVISDHLVRILK-GDDGAKVAAADVLWDLSGYKYSISVVRSSGAIPVFVDLLLDGNDEVREKVSRAIAQLSYNETDRVALADAGAVEILTG-----

Query:  ---------------QGYPLYCDRVSEAVSTPAFQNMQERMTHIHATERYTTRSLRQMGIDQLTRDPDLL
                          PLYC RV+EA+STPAFQN+QER+THI A E  +  S+ Q+GI+Q T D DLL
Subjt:  ---------------QGYPLYCDRVSEAVSTPAFQNMQERMTHIHATERYTTRSLRQMGIDQLTRDPDLL

A0A5A7U359 Vacuolar protein 81.6e-17072.34Show/hide
Query:  MGHSQIQRESAKSSLDWKAALKRYENVMASESEAIKVKATVKLAHLSKKAPENILNSTIPIIAKHLEGNLIDNSSHSVKEAAAYCLRCISCQGDGTLATA
        MG ++I R S   +LDW+A+LK+YENVMASESEA+KVKAT+KLA LSK APENIL S IPIIAK LE    +N+S S++ AAAYCLRCISCQGDGTLA A
Subjt:  MGHSQIQRESAKSSLDWKAALKRYENVMASESEAIKVKATVKLAHLSKKAPENILNSTIPIIAKHLEGNLIDNSSHSVKEAAAYCLRCISCQGDGTLATA

Query:  VGYSGALQSLLRLLPHSRGCFCKILVKCVWSIVTFDKTSRVIIARNGGLEVIIGMFDSVIDGTRRYLLEILSAMALLREVRKALISLRGLHFLMQAARCG
        VG+SGAL+SLLR L HS GCFCKILVKC+WS VTFDK+SRVI+ RNGGLEVII M   V DGTRRYLLEILSAMALLREVRKAL+ LRGL FL+QAAR G
Subjt:  VGYSGALQSLLRLLPHSRGCFCKILVKCVWSIVTFDKTSRVIIARNGGLEVIIGMFDSVIDGTRRYLLEILSAMALLREVRKALISLRGLHFLMQAARCG

Query:  CMASRERACQAIGLLAISKRGRHMLVELGVVSVLIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQAGVIPLFVDLLQCPDLIGKEIAEDVFCLLAVA
         MASRERAC+AIGL+AI+KRGR+ LVELGVV+VLIEL REGD VTKLVAGNALGIVSAH+AYIRPVAQAG IPLF DLLQ PD I KEIAEDVFCLLAVA
Subjt:  CMASRERACQAIGLLAISKRGRHMLVELGVVSVLIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQAGVIPLFVDLLQCPDLIGKEIAEDVFCLLAVA

Query:  EANAVVISDHLVRILK-GDDGAKVAAADVLWDLSGYKYSISVVRSSGAIPVFVDLLLDGNDEVREKVSRAIAQLSYNETDRVALADAGAVEILTG-----
        E NAVVISDHLV++LK GDD  K AAADVL  LS YKYSISVV++SGAIPV VDLL DGN EVREKVS AIA+LSY ETDRVALADAGA++ L G     
Subjt:  EANAVVISDHLVRILK-GDDGAKVAAADVLWDLSGYKYSISVVRSSGAIPVFVDLLLDGNDEVREKVSRAIAQLSYNETDRVALADAGAVEILTG-----

Query:  ---------------QGYPLYCDRVSEAVSTPAFQNMQERMTHIHATERYTTRSLRQMGIDQLTRDPDLL
                          PLY  R+SE VSTPAFQN+QER+THI ATE  +  S+ Q+GI+Q T D DL+
Subjt:  ---------------QGYPLYCDRVSEAVSTPAFQNMQERMTHIHATERYTTRSLRQMGIDQLTRDPDLL

A0A6J1CFX3 uncharacterized protein LOC1110104375.9e-18176.22Show/hide
Query:  MGHSQIQRESAKSSLDWKAALKRYENVMASESEAIKVKATVKLAHLSKKAPENILNSTIPIIAKHLEGNLIDNSSHSVKEAAAYCLRCISCQG-DGTLAT
        M  S+I++ES K +L+W+ AL++Y+NVMASESEA+KVKATVKLA+LS+ APENILNSTIPIIAKHL  N I NSS S++ AAAYCLR ISC+G DGTLA 
Subjt:  MGHSQIQRESAKSSLDWKAALKRYENVMASESEAIKVKATVKLAHLSKKAPENILNSTIPIIAKHLEGNLIDNSSHSVKEAAAYCLRCISCQG-DGTLAT

Query:  AVGYSGALQSLLRLLPHSRGCFCKILVKCVWSIVTFDKTSRVIIARNGGLEVIIGMFDSVIDGTRRYLLEILSAMALLREVRKALISLRGLHFLMQAARC
        AVG SGAL+SLLRLLP+S G F KILVKCVWS+VTF KTSRVIIARNGGLEVIIGM DSVID +RRYLLEILSA+ALLREVRKALISLRGL FL++AARC
Subjt:  AVGYSGALQSLLRLLPHSRGCFCKILVKCVWSIVTFDKTSRVIIARNGGLEVIIGMFDSVIDGTRRYLLEILSAMALLREVRKALISLRGLHFLMQAARC

Query:  GCMASRERACQAIGLLAISKRGRHMLVELGVVSVLIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQAGVIPLFVDLLQCPD-LIGKEIAEDVFCLLA
        GC+ASRERACQAIGLLAI+KRGR ML ELGVV VLIELFR GDC TKLVAGN+LGIVSAHVAYIRPVAQAG IPLF DLLQ P+  IGKEIAEDVFCLLA
Subjt:  GCMASRERACQAIGLLAISKRGRHMLVELGVVSVLIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQAGVIPLFVDLLQCPD-LIGKEIAEDVFCLLA

Query:  VAEANAVVISDHLVRILK-GDDGAKVAAADVLWDLSGYKYSISVVRSSGAIPVFVDLLLDGNDEVREKVSRAIAQLSYNETDRVALADAGAVE-------
        VAE NAVVISDHLVRILK GDD AK AAADVLWDLSGYKYSIS   +SGAIPV VDLL D N+EVREKVS AIAQLSYNE DR ALADAGA+E       
Subjt:  VAEANAVVISDHLVRILK-GDDGAKVAAADVLWDLSGYKYSISVVRSSGAIPVFVDLLLDGNDEVREKVSRAIAQLSYNETDRVALADAGAVE-------

Query:  -------------ILTGQGYPLYCDRVSEAVSTPAFQNMQERMTHIHATERYTTRSLRQMGIDQLTRDPDL
                     ++       YC RVSEAVSTPAF+NMQERMTHI  TE+++ RSLRQMGIDQLTRDPDL
Subjt:  -------------ILTGQGYPLYCDRVSEAVSTPAFQNMQERMTHIHATERYTTRSLRQMGIDQLTRDPDL

A0A6J1F8U2 uncharacterized protein LOC1114431723.1e-18275.32Show/hide
Query:  MGHSQIQRESAKSSLDWKAALKRYENVMASESEAIKVKATVKLAHLSKKAPENILNSTIPIIAKHLEGNLIDNSSHSVKEAAAYCLRCISCQGDGTLATA
        MGHSQI R S K +LDW+AALK+YENVMA+ESEA+KVKAT+KLA LS  AP NILNSTIPIIAKHLE N I NSS S++ AAAYCL+ IS QGDGTLATA
Subjt:  MGHSQIQRESAKSSLDWKAALKRYENVMASESEAIKVKATVKLAHLSKKAPENILNSTIPIIAKHLEGNLIDNSSHSVKEAAAYCLRCISCQGDGTLATA

Query:  VGYSGALQSLLRLLPHSRGCFCKILVKCVWSIVTFDKTSRVIIARNGGLEVIIGMFDSVIDGTRRYLLEILSAMALLREVRKALISLRGLHFLMQAARCG
        V +SGAL+S+LR LP + G F KILVKC+WSIVT DK SRVI+ARNGGLEV+IGMFDSV+DGTRRYLLEILSA+AL+REVRKALISLRGL FL+QAAR G
Subjt:  VGYSGALQSLLRLLPHSRGCFCKILVKCVWSIVTFDKTSRVIIARNGGLEVIIGMFDSVIDGTRRYLLEILSAMALLREVRKALISLRGLHFLMQAARCG

Query:  CMASRERACQAIGLLAISKRGRHMLVELGVVSVLIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQAGVIPLFVDLLQCPDLIGKEIAEDVFCLLAVA
        CMASRERACQAIGL+AI++RGRHMLVELGV+ VLIELF EGD VTKLVAGN+LGIVSAHVAYIRPVAQAG IPLF DLLQ P+ I KEIAEDVFCLLAVA
Subjt:  CMASRERACQAIGLLAISKRGRHMLVELGVVSVLIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQAGVIPLFVDLLQCPDLIGKEIAEDVFCLLAVA

Query:  EANAVVISDHLVRILK-GDDGAKVAAADVLWDLSGYKYSISVVRSSGAIPVFVDLLLDGNDEVREKVSRAIAQLSYNETDRVALADAGAVEILTG-----
        EANAV+I DHLVRILK GDD  K AAADVLWDLS YKYSI +V+SSGAIPV VDLL DGN EVREKVS A+AQLSY+E DR+ALADAGA+  L G     
Subjt:  EANAVVISDHLVRILK-GDDGAKVAAADVLWDLSGYKYSISVVRSSGAIPVFVDLLLDGNDEVREKVSRAIAQLSYNETDRVALADAGAVEILTG-----

Query:  ---------------QGYPLYCDRVSEAVSTPAFQNMQERMTHIHATERYTTRSLRQMGIDQLTRDPDLL
                           LYC+RVSEA+S PAFQN+ ERMTHI ATER+  RSLRQMGI+QLT DPDLL
Subjt:  ---------------QGYPLYCDRVSEAVSTPAFQNMQERMTHIHATERYTTRSLRQMGIDQLTRDPDLL

A0A6J1ILS8 uncharacterized protein LOC1114767613.4e-18174.47Show/hide
Query:  MGHSQIQRESAKSSLDWKAALKRYENVMASESEAIKVKATVKLAHLSKKAPENILNSTIPIIAKHLEGNLIDNSSHSVKEAAAYCLRCISCQGDGTLATA
        M HS+I+R S K +LDW+AALK+YENVMA+ESEA+KVKAT+KLA LS  AP NILNSTIPIIAKHLE N I NSS S++ AAAYCL+ IS QGDGTLATA
Subjt:  MGHSQIQRESAKSSLDWKAALKRYENVMASESEAIKVKATVKLAHLSKKAPENILNSTIPIIAKHLEGNLIDNSSHSVKEAAAYCLRCISCQGDGTLATA

Query:  VGYSGALQSLLRLLPHSRGCFCKILVKCVWSIVTFDKTSRVIIARNGGLEVIIGMFDSVIDGTRRYLLEILSAMALLREVRKALISLRGLHFLMQAARCG
        + +SGAL+S+LR LP + G F KILVKC+WSIVT DK SRVI+ARNGGLEV+IGMFDSV+DGTRRYLLEILSA+AL+REVRKALISLRGL FL+QAAR G
Subjt:  VGYSGALQSLLRLLPHSRGCFCKILVKCVWSIVTFDKTSRVIIARNGGLEVIIGMFDSVIDGTRRYLLEILSAMALLREVRKALISLRGLHFLMQAARCG

Query:  CMASRERACQAIGLLAISKRGRHMLVELGVVSVLIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQAGVIPLFVDLLQCPDLIGKEIAEDVFCLLAVA
        CMASRERACQAIGL+AI++RGRHMLVELGV+ VLIELF EGD VTKLVAGN+LGIVSAHVAYIRPVAQAG IPLF DLLQ PD I KEIAEDVFCLLAVA
Subjt:  CMASRERACQAIGLLAISKRGRHMLVELGVVSVLIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQAGVIPLFVDLLQCPDLIGKEIAEDVFCLLAVA

Query:  EANAVVISDHLVRILK-GDDGAKVAAADVLWDLSGYKYSISVVRSSGAIPVFVDLLLDGNDEVREKVSRAIAQLSYNETDRVALADAGAVE---------
        EANAV+I DHLVRILK GDD  K AAADVLWDLS YKYSI +V+SSGAIPV VDLL DGN EVREKVS A+AQLSY+E DR+ALADAGA+          
Subjt:  EANAVVISDHLVRILK-GDDGAKVAAADVLWDLSGYKYSISVVRSSGAIPVFVDLLLDGNDEVREKVSRAIAQLSYNETDRVALADAGAVE---------

Query:  -----------ILTGQGYPLYCDRVSEAVSTPAFQNMQERMTHIHATERYTTRSLRQMGIDQLTRDPDLL
                   ++      LYC+RVSEA+S PAFQN+ ERMTHI ATER+  RSLRQMGI+QLT DPDLL
Subjt:  -----------ILTGQGYPLYCDRVSEAVSTPAFQNMQERMTHIHATERYTTRSLRQMGIDQLTRDPDLL

SwissProt top hitse value%identityAlignment
A2ZLU6 Protein spotted leaf 115.0e-0431.11Show/hide
Query:  VAQAGVIPLFVDLLQCPDLIGKEIAEDVFCLLAVAEAN-AVVISD----HLVRILK-GDDGAKVAAADVLWDLSGYKYSISVVRSSGAIPVFVDLLLDGN
        +A+AG IPL + LL   DL  +E A      L++ E N A +IS      +V +LK G   A+  AA  L+ LS        +   GAIP  V LL +G+
Subjt:  VAQAGVIPLFVDLLQCPDLIGKEIAEDVFCLLAVAEAN-AVVISD----HLVRILK-GDDGAKVAAADVLWDLSGYKYSISVVRSSGAIPVFVDLLLDGN

Query:  DEVREKVSRAIAQLSYNETDRVALADAGAVEILTG
           ++  + A+  L   + ++     AG V ++ G
Subjt:  DEVREKVSRAIAQLSYNETDRVALADAGAVEILTG

Q5VRH9 U-box domain-containing protein 121.2e-0528.74Show/hide
Query:  GVVSVLIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQAGVIPLFVDLLQCPDLIGKEIAEDVFCLLAVAEAN-AVVISDH----LVRILK-GDDGAK
        G+VS++  L        +  AG    +   +V     +A+AG IPL V+LL   D   +E A      L++ E N A ++  H    +V +LK G    +
Subjt:  GVVSVLIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQAGVIPLFVDLLQCPDLIGKEIAEDVFCLLAVAEAN-AVVISDH----LVRILK-GDDGAK

Query:  VAAADVLWDLSGYKYSISVVRSSGAIPVFVDLLLDGNDEVREKVSRAIAQLSYNETDRVALADAGAV
          AA  L+ LS    +   + ++GAIP  ++LL DG+   ++  + AI  L   + ++V    AG V
Subjt:  VAAADVLWDLSGYKYSISVVRSSGAIPVFVDLLLDGNDEVREKVSRAIAQLSYNETDRVALADAGAV

Q681N2 U-box domain-containing protein 151.6e-0532.33Show/hide
Query:  VAQAGVIPLFVDLLQCPDLIGKEIAEDVFCLLAVAEANAVVISD-----HLVRILK-GDDGAKVAAADVLWDLSGYKYSISVVRSSGAIPVFVDLLLDGN
        +A AG IPL V LL  PD   +E A      L++ E N  +IS+     +++ IL+ G+  A+  +A  L+ LS    +   +  S  IP  VDLL  G 
Subjt:  VAQAGVIPLFVDLLQCPDLIGKEIAEDVFCLLAVAEANAVVISD-----HLVRILK-GDDGAKVAAADVLWDLSGYKYSISVVRSSGAIPVFVDLLLDGN

Query:  DEVREKVSRAIAQLSYNETDRVALADAGAVEIL
           ++    A+  LS N  ++    DAG V+ L
Subjt:  DEVREKVSRAIAQLSYNETDRVALADAGAVEIL

Q8GUG9 U-box domain-containing protein 115.0e-0430.08Show/hide
Query:  VAQAGVIPLFVDLLQCPDLIGKEIAEDVFCLLAVAEANAVVIS-----DHLVRILK-GDDGAKVAAADVLWDLSGYKYSISVVRSSGAIPVFVDLLLDGN
        +A+AG IP+ V+LL   D+  +E A      L++ E N  +I        +V++L+ G   A+  AA  L+ LS    +  ++  SGAIP  VDLL +G 
Subjt:  VAQAGVIPLFVDLLQCPDLIGKEIAEDVFCLLAVAEANAVVIS-----DHLVRILK-GDDGAKVAAADVLWDLSGYKYSISVVRSSGAIPVFVDLLLDGN

Query:  DEVREKVSRAIAQLSYNETDRVALADAGAVEIL
           ++  + A+  L     ++     AG V  L
Subjt:  DEVREKVSRAIAQLSYNETDRVALADAGAVEIL

Q8VZ40 U-box domain-containing protein 141.0e-0429.85Show/hide
Query:  VAQAGVIPLFVDLLQCPDLIGKEIAEDVFCLLAVAEANAVVISD-----HLVRILK-GDDGAKVAAADVLWDLSGYKYSISVVRSSGAIPVFVDLLLDGN
        +A+AG IPL V+LL  PD   +E +      L++ E N   I D      +V +LK G   A+  AA  L+ LS    +   + ++GAI   + LL +G 
Subjt:  VAQAGVIPLFVDLLQCPDLIGKEIAEDVFCLLAVAEANAVVISD-----HLVRILK-GDDGAKVAAADVLWDLSGYKYSISVVRSSGAIPVFVDLLLDGN

Query:  DEVREKVSRAIAQLSYNETDRVALADAGAVEILT
           ++  + AI  L   + ++      G V+ LT
Subjt:  DEVREKVSRAIAQLSYNETDRVALADAGAVEILT

Arabidopsis top hitse value%identityAlignment
AT2G45720.1 ARM repeat superfamily protein1.1e-0620.73Show/hide
Query:  IIGMFDSVIDGTRRYLLEILSAMALLREVRKALISLRGLHFLMQAARCGCMASRERACQAIGLLAISKRGRHMLVELGVVSVLIELFREGDCVTKLVAGN
        ++ +  +     R   + ++ ++A        LIS   L  L++    G + ++E+A  ++  ++IS      +V  G V  LIE+ + GD V++  +  
Subjt:  IIGMFDSVIDGTRRYLLEILSAMALLREVRKALISLRGLHFLMQAARCGCMASRERACQAIGLLAISKRGRHMLVELGVVSVLIELFREGDCVTKLVAGN

Query:  ALGIVSAHVAYIRPVAQAGVIPLFVDLLQCPDLIG-KEIAEDVFCLLAVAEANAV----VISDHLVRILKGDDGAKVAAADVLWDLSGYKYSISVVRSSG
         L  +SA     + +A+ G++ + +++L C  L+G KE A +  CL  +  +N      VIS++ ++ L       +     +  +     S+SV     
Subjt:  ALGIVSAHVAYIRPVAQAGVIPLFVDLLQCPDLIG-KEIAEDVFCLLAVAEANAV----VISDHLVRILKGDDGAKVAAADVLWDLSGYKYSISVVRSSG

Query:  AIPVFVDLLLDGNDEVREKVSRAIAQLSYNETDRVALADAGAVEIL
         IP  V +L  G+   ++  +  I +++ +   +  + ++G + +L
Subjt:  AIPVFVDLLLDGNDEVREKVSRAIAQLSYNETDRVALADAGAVEIL

AT2G45720.2 ARM repeat superfamily protein1.1e-0620.73Show/hide
Query:  IIGMFDSVIDGTRRYLLEILSAMALLREVRKALISLRGLHFLMQAARCGCMASRERACQAIGLLAISKRGRHMLVELGVVSVLIELFREGDCVTKLVAGN
        ++ +  +     R   + ++ ++A        LIS   L  L++    G + ++E+A  ++  ++IS      +V  G V  LIE+ + GD V++  +  
Subjt:  IIGMFDSVIDGTRRYLLEILSAMALLREVRKALISLRGLHFLMQAARCGCMASRERACQAIGLLAISKRGRHMLVELGVVSVLIELFREGDCVTKLVAGN

Query:  ALGIVSAHVAYIRPVAQAGVIPLFVDLLQCPDLIG-KEIAEDVFCLLAVAEANAV----VISDHLVRILKGDDGAKVAAADVLWDLSGYKYSISVVRSSG
         L  +SA     + +A+ G++ + +++L C  L+G KE A +  CL  +  +N      VIS++ ++ L       +     +  +     S+SV     
Subjt:  ALGIVSAHVAYIRPVAQAGVIPLFVDLLQCPDLIG-KEIAEDVFCLLAVAEANAV----VISDHLVRILKGDDGAKVAAADVLWDLSGYKYSISVVRSSG

Query:  AIPVFVDLLLDGNDEVREKVSRAIAQLSYNETDRVALADAGAVEIL
         IP  V +L  G+   ++  +  I +++ +   +  + ++G + +L
Subjt:  AIPVFVDLLLDGNDEVREKVSRAIAQLSYNETDRVALADAGAVEIL

AT3G20170.1 ARM repeat superfamily protein3.8e-10846.96Show/hide
Query:  MGHSQIQRES-AKSSLDWKAALKRYENVMASESEAIKVKATVKLAHLSKKAPENILNSTIPIIAKHLEGNLIDNSSHSVKEAAAYCLRCISCQG--DGTL
        MG S+ + ES    + DW+    R+EN ++S S +I+V++ +KL+ L+ + PE+ ++  IPI+A  L   + D+S+ SV+ AAA+CL+CI+C G  +   
Subjt:  MGHSQIQRES-AKSSLDWKAALKRYENVMASESEAIKVKATVKLAHLSKKAPENILNSTIPIIAKHLEGNLIDNSSHSVKEAAAYCLRCISCQG--DGTL

Query:  ATAVGYSGALQSLLRLLPHSR---GCFCKILVKCVWSIVTFDKTSRVIIARNGGLEVIIGMFDS-VIDGTRRYLLEILSAMALLREVRKALISLRGLHFL
        A  +G  G + SLL LL  +      F +I VKC+WS+VTF  + RV +AR GGLE++I   ++   DG+R YLLEILSA+  +RE R+ L+   GL FL
Subjt:  ATAVGYSGALQSLLRLLPHSR---GCFCKILVKCVWSIVTFDKTSRVIIARNGGLEVIIGMFDS-VIDGTRRYLLEILSAMALLREVRKALISLRGLHFL

Query:  MQAARCGCMASRERACQAIGLLAISKRGRHMLVELGVVSVLIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQAGVIPLFVDLLQCPDLIGKEIAEDV
        ++AA+ G +ASRERAC AIGL+ +++R R +LVE GV+  L++L+R+GD   KL+AGNALGI+SA   YIRPV +AG IPL+V+LL   D +GK+IAEDV
Subjt:  MQAARCGCMASRERACQAIGLLAISKRGRHMLVELGVVSVLIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQAGVIPLFVDLLQCPDLIGKEIAEDV

Query:  FCLLAVAEANAVVISDHLVRILK-GDDGAKVAAADVLWDLSGYKYSISVVRSSGAIPVFVDLLLDGNDEVREKVSRAIAQLSYNETDRVALADAGAVEIL
        FC+LAVAE NAV+I++ LVRIL+ GD+ AK+AA+DVLWDL+GY++S+SV+R SGAIP+ ++LL DG+ E RE++S AI+QLSYNE DR A +D+G + IL
Subjt:  FCLLAVAEANAVVISDHLVRILK-GDDGAKVAAADVLWDLSGYKYSISVVRSSGAIPVFVDLLLDGNDEVREKVSRAIAQLSYNETDRVALADAGAVEIL

Query:  T---------------------GQGYPLYCDRVSEAVSTPAFQNMQERMTHIHATERYTTRSLRQMGIDQLTRDPDL
                               +    Y  RV EA+  P FQ+MQ R+  I A+     RS+R++ I  L  D DL
Subjt:  T---------------------GQGYPLYCDRVSEAVSTPAFQNMQERMTHIHATERYTTRSLRQMGIDQLTRDPDL

AT3G54850.1 plant U-box 147.2e-0629.85Show/hide
Query:  VAQAGVIPLFVDLLQCPDLIGKEIAEDVFCLLAVAEANAVVISD-----HLVRILK-GDDGAKVAAADVLWDLSGYKYSISVVRSSGAIPVFVDLLLDGN
        +A+AG IPL V+LL  PD   +E +      L++ E N   I D      +V +LK G   A+  AA  L+ LS    +   + ++GAI   + LL +G 
Subjt:  VAQAGVIPLFVDLLQCPDLIGKEIAEDVFCLLAVAEANAVVISD-----HLVRILK-GDDGAKVAAADVLWDLSGYKYSISVVRSSGAIPVFVDLLLDGN

Query:  DEVREKVSRAIAQLSYNETDRVALADAGAVEILT
           ++  + AI  L   + ++      G V+ LT
Subjt:  DEVREKVSRAIAQLSYNETDRVALADAGAVEILT

AT5G42340.1 Plant U-Box 151.1e-0632.33Show/hide
Query:  VAQAGVIPLFVDLLQCPDLIGKEIAEDVFCLLAVAEANAVVISD-----HLVRILK-GDDGAKVAAADVLWDLSGYKYSISVVRSSGAIPVFVDLLLDGN
        +A AG IPL V LL  PD   +E A      L++ E N  +IS+     +++ IL+ G+  A+  +A  L+ LS    +   +  S  IP  VDLL  G 
Subjt:  VAQAGVIPLFVDLLQCPDLIGKEIAEDVFCLLAVAEANAVVISD-----HLVRILK-GDDGAKVAAADVLWDLSGYKYSISVVRSSGAIPVFVDLLLDGN

Query:  DEVREKVSRAIAQLSYNETDRVALADAGAVEIL
           ++    A+  LS N  ++    DAG V+ L
Subjt:  DEVREKVSRAIAQLSYNETDRVALADAGAVEIL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCCACTCACAGATTCAGAGAGAATCTGCAAAGTCGAGTCTAGATTGGAAAGCAGCGCTGAAGCGGTATGAAAACGTTATGGCTTCAGAATCCGAGGCTATTAAAGT
GAAAGCCACGGTCAAATTGGCCCATCTCTCCAAAAAGGCCCCCGAGAATATTTTGAACAGCACCATACCCATTATTGCTAAGCATCTTGAGGGTAACCTCATCGATAATT
CAAGCCATTCCGTGAAAGAAGCTGCTGCATATTGTCTAAGATGTATATCTTGTCAAGGTGATGGTACTTTGGCGACGGCGGTAGGTTATTCAGGCGCTCTACAGTCGTTG
TTGAGGTTATTGCCACATTCTAGAGGTTGTTTTTGTAAGATTTTAGTTAAATGTGTTTGGAGTATTGTCACTTTTGATAAAACTAGTCGTGTGATTATAGCAAGAAATGG
GGGTTTGGAAGTTATTATTGGCATGTTTGATTCTGTAATTGATGGCACTAGGAGATATTTATTGGAGATTTTGAGTGCAATGGCACTATTGAGAGAGGTTAGGAAGGCCC
TTATCAGTTTAAGAGGACTCCATTTTCTTATGCAAGCTGCTAGATGTGGCTGCATGGCCTCCAGAGAAAGAGCTTGCCAAGCAATTGGGTTGCTTGCAATTTCCAAGCGT
GGAAGACACATGCTTGTTGAGTTGGGAGTGGTTTCAGTGCTTATTGAACTATTTCGTGAAGGAGATTGTGTGACAAAACTAGTAGCCGGTAATGCTCTTGGAATAGTTTC
AGCTCATGTAGCCTATATTAGGCCTGTTGCACAAGCTGGGGTCATTCCTTTATTTGTTGATCTTCTTCAGTGCCCTGACCTTATTGGTAAGGAGATTGCAGAAGATGTCT
TCTGTCTCTTAGCTGTTGCTGAAGCGAATGCAGTAGTCATTTCTGATCATCTAGTGAGAATTCTTAAAGGTGATGATGGAGCAAAGGTTGCAGCTGCTGATGTTTTATGG
GATCTTTCAGGCTATAAGTACTCCATATCAGTTGTGCGTAGTTCGGGTGCCATTCCAGTTTTTGTGGATCTATTACTGGACGGGAATGATGAAGTGAGGGAAAAAGTCTC
TAGAGCTATAGCCCAATTAAGTTATAATGAGACAGACAGGGTTGCACTTGCTGATGCAGGAGCGGTTGAAATACTAACAGGCCAAGGCTACCCCTTATACTGTGATAGAG
TATCTGAAGCAGTTAGCACTCCTGCTTTCCAGAATATGCAGGAAAGAATGACCCATATTCATGCTACTGAAAGGTACACCACAAGGTCCCTACGTCAAATGGGAATCGAC
CAACTTACCAGGGATCCAGATCTTCTATAA
mRNA sequenceShow/hide mRNA sequence
ATGGGCCACTCACAGATTCAGAGAGAATCTGCAAAGTCGAGTCTAGATTGGAAAGCAGCGCTGAAGCGGTATGAAAACGTTATGGCTTCAGAATCCGAGGCTATTAAAGT
GAAAGCCACGGTCAAATTGGCCCATCTCTCCAAAAAGGCCCCCGAGAATATTTTGAACAGCACCATACCCATTATTGCTAAGCATCTTGAGGGTAACCTCATCGATAATT
CAAGCCATTCCGTGAAAGAAGCTGCTGCATATTGTCTAAGATGTATATCTTGTCAAGGTGATGGTACTTTGGCGACGGCGGTAGGTTATTCAGGCGCTCTACAGTCGTTG
TTGAGGTTATTGCCACATTCTAGAGGTTGTTTTTGTAAGATTTTAGTTAAATGTGTTTGGAGTATTGTCACTTTTGATAAAACTAGTCGTGTGATTATAGCAAGAAATGG
GGGTTTGGAAGTTATTATTGGCATGTTTGATTCTGTAATTGATGGCACTAGGAGATATTTATTGGAGATTTTGAGTGCAATGGCACTATTGAGAGAGGTTAGGAAGGCCC
TTATCAGTTTAAGAGGACTCCATTTTCTTATGCAAGCTGCTAGATGTGGCTGCATGGCCTCCAGAGAAAGAGCTTGCCAAGCAATTGGGTTGCTTGCAATTTCCAAGCGT
GGAAGACACATGCTTGTTGAGTTGGGAGTGGTTTCAGTGCTTATTGAACTATTTCGTGAAGGAGATTGTGTGACAAAACTAGTAGCCGGTAATGCTCTTGGAATAGTTTC
AGCTCATGTAGCCTATATTAGGCCTGTTGCACAAGCTGGGGTCATTCCTTTATTTGTTGATCTTCTTCAGTGCCCTGACCTTATTGGTAAGGAGATTGCAGAAGATGTCT
TCTGTCTCTTAGCTGTTGCTGAAGCGAATGCAGTAGTCATTTCTGATCATCTAGTGAGAATTCTTAAAGGTGATGATGGAGCAAAGGTTGCAGCTGCTGATGTTTTATGG
GATCTTTCAGGCTATAAGTACTCCATATCAGTTGTGCGTAGTTCGGGTGCCATTCCAGTTTTTGTGGATCTATTACTGGACGGGAATGATGAAGTGAGGGAAAAAGTCTC
TAGAGCTATAGCCCAATTAAGTTATAATGAGACAGACAGGGTTGCACTTGCTGATGCAGGAGCGGTTGAAATACTAACAGGCCAAGGCTACCCCTTATACTGTGATAGAG
TATCTGAAGCAGTTAGCACTCCTGCTTTCCAGAATATGCAGGAAAGAATGACCCATATTCATGCTACTGAAAGGTACACCACAAGGTCCCTACGTCAAATGGGAATCGAC
CAACTTACCAGGGATCCAGATCTTCTATAA
Protein sequenceShow/hide protein sequence
MGHSQIQRESAKSSLDWKAALKRYENVMASESEAIKVKATVKLAHLSKKAPENILNSTIPIIAKHLEGNLIDNSSHSVKEAAAYCLRCISCQGDGTLATAVGYSGALQSL
LRLLPHSRGCFCKILVKCVWSIVTFDKTSRVIIARNGGLEVIIGMFDSVIDGTRRYLLEILSAMALLREVRKALISLRGLHFLMQAARCGCMASRERACQAIGLLAISKR
GRHMLVELGVVSVLIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQAGVIPLFVDLLQCPDLIGKEIAEDVFCLLAVAEANAVVISDHLVRILKGDDGAKVAAADVLW
DLSGYKYSISVVRSSGAIPVFVDLLLDGNDEVREKVSRAIAQLSYNETDRVALADAGAVEILTGQGYPLYCDRVSEAVSTPAFQNMQERMTHIHATERYTTRSLRQMGID
QLTRDPDLL