; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg018432 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg018432
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionVacuolar protein 8
Genome locationscaffold3:9580794..9583159
RNA-Seq ExpressionSpg018432
SyntenySpg018432
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR000225 - Armadillo
IPR011989 - Armadillo-like helical
IPR016024 - Armadillo-type fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6592081.1 U-box domain-containing protein 12, partial [Cucurbita argyrosperma subsp. sororia]3.4e-18074.89Show/hide
Query:  MGHSQIQRESAKPSLDWEAALKRYENVIASESEAVKVKATVKLAHLSKKAPENILNSTILIIAKHLEGNLIDNSSHSMKGVAAYCLRCISCQGDGTLATA
        MGHSQI R S KP+LDWEAALK+YENV+A+ESEAVKVKAT+KLA LS  AP NILNSTI IIAKHLE N I NSS SM+G AAYCL+ IS QGDGTLATA
Subjt:  MGHSQIQRESAKPSLDWEAALKRYENVIASESEAVKVKATVKLAHLSKKAPENILNSTILIIAKHLEGNLIDNSSHSMKGVAAYCLRCISCQGDGTLATA

Query:  VDHSGALQSLLRLLSHSRGCFRKILVKCVWSIVTFDKTSCVIIARNGGLEVIIGMFDSVIDGTKRYLLEILSAMALLREVRKALISLRGLPFLVQAARCG
        V HSGAL+S+LR L  + G FRKILVKC+WSIVT DK S VI+ARNGGLEV+IGMFDSV+DGT+RYLLEILSA+AL+REVRKALISLRGLPFLVQAAR G
Subjt:  VDHSGALQSLLRLLSHSRGCFRKILVKCVWSIVTFDKTSCVIIARNGGLEVIIGMFDSVIDGTKRYLLEILSAMALLREVRKALISLRGLPFLVQAARCG

Query:  CMASRERACQAIGLLAISKRGRHMLVELGVVSVLIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQGGVIPLFADLLQCPDLTGKEIAEDVFCLLAVA
        CMASRERACQAIGL+AI++RGRHMLVELGV+ VLIELF EGD VTKLV+GN+LGIVSAHVAYIRPVAQ G IPLFADLLQ P+   KEIAEDVFCLLAVA
Subjt:  CMASRERACQAIGLLAISKRGRHMLVELGVVSVLIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQGGVIPLFADLLQCPDLTGKEIAEDVFCLLAVA

Query:  EANAVVISDHLVRILKEGDDGAKAAAADVLWDLSGYKSSISVVHSSGAIPVLVDLLLVGNDEVREKFSRAITQLSYNETDRAALADAGAVEILTG-----
        EANAV+I DHLVRILKEGDD  KAAAADVLWDLS YK SI +V SSGAIPVLVDLL  GN EVREK S A+ QLSY+E DR ALADAGA+  L G     
Subjt:  EANAVVISDHLVRILKEGDDGAKAAAADVLWDLSGYKSSISVVHSSGAIPVLVDLLLVGNDEVREKFSRAITQLSYNETDRAALADAGAVEILTG-----

Query:  ---------------QGYPLYYDRVSEAVNTPAFQNMQERMTHIHATERYTTRSLRQMGIDQLTRDPDLL
                           LY +RVSEA++ PAFQN+ ERMTHI ATER+  RSLRQMGI+QLT DPDLL
Subjt:  ---------------QGYPLYYDRVSEAVNTPAFQNMQERMTHIHATERYTTRSLRQMGIDQLTRDPDLL

XP_022139563.1 uncharacterized protein LOC111010437 [Momordica charantia]2.6e-18076.22Show/hide
Query:  MGHSQIQRESAKPSLDWEAALKRYENVIASESEAVKVKATVKLAHLSKKAPENILNSTILIIAKHLEGNLIDNSSHSMKGVAAYCLRCISCQG-DGTLAT
        M  S+I++ES KP+L+WE AL++Y+NV+ASESEAVKVKATVKLA+LS+ APENILNSTI IIAKHL  N I NSS SM+G AAYCLR ISC+G DGTLA 
Subjt:  MGHSQIQRESAKPSLDWEAALKRYENVIASESEAVKVKATVKLAHLSKKAPENILNSTILIIAKHLEGNLIDNSSHSMKGVAAYCLRCISCQG-DGTLAT

Query:  AVDHSGALQSLLRLLSHSRGCFRKILVKCVWSIVTFDKTSCVIIARNGGLEVIIGMFDSVIDGTKRYLLEILSAMALLREVRKALISLRGLPFLVQAARC
        AV +SGAL+SLLRLL +S G FRKILVKCVWS+VTF KTS VIIARNGGLEVIIGM DSVID ++RYLLEILSA+ALLREVRKALISLRGLPFLV+AARC
Subjt:  AVDHSGALQSLLRLLSHSRGCFRKILVKCVWSIVTFDKTSCVIIARNGGLEVIIGMFDSVIDGTKRYLLEILSAMALLREVRKALISLRGLPFLVQAARC

Query:  GCMASRERACQAIGLLAISKRGRHMLVELGVVSVLIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQGGVIPLFADLLQCPD-LTGKEIAEDVFCLLA
        GC+ASRERACQAIGLLAI+KRGR ML ELGVV VLIELFR GDC TKLVAGN+LGIVSAHVAYIRPVAQ G IPLFADLLQ P+   GKEIAEDVFCLLA
Subjt:  GCMASRERACQAIGLLAISKRGRHMLVELGVVSVLIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQGGVIPLFADLLQCPD-LTGKEIAEDVFCLLA

Query:  VAEANAVVISDHLVRILKEGDDGAKAAAADVLWDLSGYKSSISVVHSSGAIPVLVDLLLVGNDEVREKFSRAITQLSYNETDRAALADAGAV--------
        VAE NAVVISDHLVRILKEGDD AKAAAADVLWDLSGYK SIS  H+SGAIPV+VDLL   N+EVREK S AI QLSYNE DRAALADAGA+        
Subjt:  VAEANAVVISDHLVRILKEGDDGAKAAAADVLWDLSGYKSSISVVHSSGAIPVLVDLLLVGNDEVREKFSRAITQLSYNETDRAALADAGAV--------

Query:  -----------EILTGQGYPLYY-DRVSEAVNTPAFQNMQERMTHIHATERYTTRSLRQMGIDQLTRDPDL
                   E L      L+Y  RVSEAV+TPAF+NMQERMTHI  TE+++ RSLRQMGIDQLTRDPDL
Subjt:  -----------EILTGQGYPLYY-DRVSEAVNTPAFQNMQERMTHIHATERYTTRSLRQMGIDQLTRDPDL

XP_022936629.1 uncharacterized protein LOC111443172 [Cucurbita moschata]1.5e-18075.11Show/hide
Query:  MGHSQIQRESAKPSLDWEAALKRYENVIASESEAVKVKATVKLAHLSKKAPENILNSTILIIAKHLEGNLIDNSSHSMKGVAAYCLRCISCQGDGTLATA
        MGHSQI R S KP+LDWEAALK+YENV+A+ESEAVKVKAT+KLA LS  AP NILNSTI IIAKHLE N I NSS SM+G AAYCL+ IS QGDGTLATA
Subjt:  MGHSQIQRESAKPSLDWEAALKRYENVIASESEAVKVKATVKLAHLSKKAPENILNSTILIIAKHLEGNLIDNSSHSMKGVAAYCLRCISCQGDGTLATA

Query:  VDHSGALQSLLRLLSHSRGCFRKILVKCVWSIVTFDKTSCVIIARNGGLEVIIGMFDSVIDGTKRYLLEILSAMALLREVRKALISLRGLPFLVQAARCG
        V HSGAL+S+LR L  + G FRKILVKC+WSIVT DK S VI+ARNGGLEV+IGMFDSV+DGT+RYLLEILSA+AL+REVRKALISLRGLPFLVQAAR G
Subjt:  VDHSGALQSLLRLLSHSRGCFRKILVKCVWSIVTFDKTSCVIIARNGGLEVIIGMFDSVIDGTKRYLLEILSAMALLREVRKALISLRGLPFLVQAARCG

Query:  CMASRERACQAIGLLAISKRGRHMLVELGVVSVLIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQGGVIPLFADLLQCPDLTGKEIAEDVFCLLAVA
        CMASRERACQAIGL+AI++RGRHMLVELGV+ VLIELF EGD VTKLVAGN+LGIVSAHVAYIRPVAQ G IPLFADLLQ P+   KEIAEDVFCLLAVA
Subjt:  CMASRERACQAIGLLAISKRGRHMLVELGVVSVLIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQGGVIPLFADLLQCPDLTGKEIAEDVFCLLAVA

Query:  EANAVVISDHLVRILKEGDDGAKAAAADVLWDLSGYKSSISVVHSSGAIPVLVDLLLVGNDEVREKFSRAITQLSYNETDRAALADAGAVEILTG-----
        EANAV+I DHLVRILKEGDD  KAAAADVLWDLS YK SI +V SSGAIPVLVDLL  GN EVREK S A+ QLSY+E DR ALADAGA+  L G     
Subjt:  EANAVVISDHLVRILKEGDDGAKAAAADVLWDLSGYKSSISVVHSSGAIPVLVDLLLVGNDEVREKFSRAITQLSYNETDRAALADAGAVEILTG-----

Query:  ---------------QGYPLYYDRVSEAVNTPAFQNMQERMTHIHATERYTTRSLRQMGIDQLTRDPDLL
                           LY +RVSEA++ PAFQN+ ERMTHI ATER+  RSLRQMGI+QLT DPDLL
Subjt:  ---------------QGYPLYYDRVSEAVNTPAFQNMQERMTHIHATERYTTRSLRQMGIDQLTRDPDLL

XP_022976328.1 uncharacterized protein LOC111476761 [Cucurbita maxima]1.7e-17974.26Show/hide
Query:  MGHSQIQRESAKPSLDWEAALKRYENVIASESEAVKVKATVKLAHLSKKAPENILNSTILIIAKHLEGNLIDNSSHSMKGVAAYCLRCISCQGDGTLATA
        M HS+I+R S KP+LDWEAALK+YENV+A+ESEAVKVKAT+KLA LS  AP NILNSTI IIAKHLE N I NSS SM+G AAYCL+ IS QGDGTLATA
Subjt:  MGHSQIQRESAKPSLDWEAALKRYENVIASESEAVKVKATVKLAHLSKKAPENILNSTILIIAKHLEGNLIDNSSHSMKGVAAYCLRCISCQGDGTLATA

Query:  VDHSGALQSLLRLLSHSRGCFRKILVKCVWSIVTFDKTSCVIIARNGGLEVIIGMFDSVIDGTKRYLLEILSAMALLREVRKALISLRGLPFLVQAARCG
        + HSGAL+S+LR L  + G FRKILVKC+WSIVT DK S VI+ARNGGLEV+IGMFDSV+DGT+RYLLEILSA+AL+REVRKALISLRGLPFLVQAAR G
Subjt:  VDHSGALQSLLRLLSHSRGCFRKILVKCVWSIVTFDKTSCVIIARNGGLEVIIGMFDSVIDGTKRYLLEILSAMALLREVRKALISLRGLPFLVQAARCG

Query:  CMASRERACQAIGLLAISKRGRHMLVELGVVSVLIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQGGVIPLFADLLQCPDLTGKEIAEDVFCLLAVA
        CMASRERACQAIGL+AI++RGRHMLVELGV+ VLIELF EGD VTKLVAGN+LGIVSAHVAYIRPVAQ G IPLFADLLQ PD   KEIAEDVFCLLAVA
Subjt:  CMASRERACQAIGLLAISKRGRHMLVELGVVSVLIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQGGVIPLFADLLQCPDLTGKEIAEDVFCLLAVA

Query:  EANAVVISDHLVRILKEGDDGAKAAAADVLWDLSGYKSSISVVHSSGAIPVLVDLLLVGNDEVREKFSRAITQLSYNETDRAALADAGAVE---------
        EANAV+I DHLVRILKEGDD  KAAAADVLWDLS YK SI +V SSGAIPVLVDLL  GN EVREK S A+ QLSY+E DR ALADAGA+          
Subjt:  EANAVVISDHLVRILKEGDDGAKAAAADVLWDLSGYKSSISVVHSSGAIPVLVDLLLVGNDEVREKFSRAITQLSYNETDRAALADAGAVE---------

Query:  -----------ILTGQGYPLYYDRVSEAVNTPAFQNMQERMTHIHATERYTTRSLRQMGIDQLTRDPDLL
                   ++      LY +RVSEA++ PAFQN+ ERMTHI ATER+  RSLRQMGI+QLT DPDLL
Subjt:  -----------ILTGQGYPLYYDRVSEAVNTPAFQNMQERMTHIHATERYTTRSLRQMGIDQLTRDPDLL

XP_038897634.1 protein spotted leaf 11 [Benincasa hispida]1.3e-19279.15Show/hide
Query:  MGHSQIQRESAKPSLDWEAALKRYENVIASESEAVKVKATVKLAHLSKKAPENILNSTILIIAKHLEGNLIDNSSHSMKGVAAYCLRCISCQGDGTLATA
        MGHSQI R S KP+LDWEAAL +YENV+ASESEAVKVKAT+KLAHLSK APENILNS I  IAKHLE N   NSS SM+G AAYCLRCISCQGDGTLA A
Subjt:  MGHSQIQRESAKPSLDWEAALKRYENVIASESEAVKVKATVKLAHLSKKAPENILNSTILIIAKHLEGNLIDNSSHSMKGVAAYCLRCISCQGDGTLATA

Query:  VDHSGALQSLLRLLSHSRGCFRKILVKCVWSIVTFDKTSCVIIARNGGLEVIIGMFDSVIDGTKRYLLEILSAMALLREVRKALISLRGLPFLVQAARCG
        V HSGAL+SLLR L HS GCFRKILVKCVWSIVTFD+TS VIIARNGGLEVIIGMFDSV DGT+RYLLEILSAMALLREVRKALISLRGLPFLVQAAR G
Subjt:  VDHSGALQSLLRLLSHSRGCFRKILVKCVWSIVTFDKTSCVIIARNGGLEVIIGMFDSVIDGTKRYLLEILSAMALLREVRKALISLRGLPFLVQAARCG

Query:  CMASRERACQAIGLLAISKRGRHMLVELGVVSVLIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQGGVIPLFADLLQCPDLTGKEIAEDVFCLLAVA
        CMASRERACQAIGLLAI+KRGR MLVELGVV VLIEL REGD VTKLVAGN+LG+VSAHV YIRP+AQ G IPLFA+LLQ PD  GKEIAEDVFCLLAVA
Subjt:  CMASRERACQAIGLLAISKRGRHMLVELGVVSVLIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQGGVIPLFADLLQCPDLTGKEIAEDVFCLLAVA

Query:  EANAVVISDHLVRILKEGDDGAKAAAADVLWDLSGYKSSISVVHSSGAIPVLVDLLLVGNDEVREKFSRAITQLSYNETDRAALADAGAVEILTG-----
        EANAVVISDHLVRILKEGD G KAAAADVLWDLS YK SISVV +SGAIPVLVDLL  GNDEVREK S AI QLSYNETDR ALADAGA++ L G     
Subjt:  EANAVVISDHLVRILKEGDDGAKAAAADVLWDLSGYKSSISVVHSSGAIPVLVDLLLVGNDEVREKFSRAITQLSYNETDRAALADAGAVEILTG-----

Query:  ---------------QGYPLYYDRVSEAVNTPAFQNMQERMTHIHATERYTTRSLRQMGIDQLTRDPDLL
                          PLY  RVSEAV+TPAFQN+QERM HI A ER + RSL QMGI+Q T DPDLL
Subjt:  ---------------QGYPLYYDRVSEAVNTPAFQNMQERMTHIHATERYTTRSLRQMGIDQLTRDPDLL

TrEMBL top hitse value%identityAlignment
A0A0A0LGL6 Uncharacterized protein2.3e-16671.28Show/hide
Query:  MGHSQIQRESAKPSLDWEAALKRYENVIASESEAVKVKATVKLAHLSKKAPENILNSTILIIAKHLEGNLIDNSSHSMKGVAAYCLRCISCQGDGTLATA
        MG  +I R S KP+LDWEA+LK+YENV+ASESEA+KVKAT+KLA LSK APENIL S I IIAK LE N  +N+S SM+  AAYCLRCISC+GDGTLATA
Subjt:  MGHSQIQRESAKPSLDWEAALKRYENVIASESEAVKVKATVKLAHLSKKAPENILNSTILIIAKHLEGNLIDNSSHSMKGVAAYCLRCISCQGDGTLATA

Query:  VDHSGALQSLLRLLSHSRGCFRKILVKCVWSIVTFDKTSCVIIARNGGLEVIIGMFDSVIDGTKRYLLEILSAMALLREVRKALISLRGLPFLVQAARCG
        V HSGAL+SL++ L HS GCF KILVKC+WSIVTFD++S VIIARNGGLEVII M   VIDGT+RYLLEILSAMALLREVRKALI  RGLPFLVQAAR G
Subjt:  VDHSGALQSLLRLLSHSRGCFRKILVKCVWSIVTFDKTSCVIIARNGGLEVIIGMFDSVIDGTKRYLLEILSAMALLREVRKALISLRGLPFLVQAARCG

Query:  CMASRERACQAIGLLAISKRGRHMLVELGVVSVLIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQGGVIPLFADLLQCPDLTGKEIAEDVFCLLAVA
        CMASRERAC+AIGL+AI+KRGR+ LVELGVV VLIEL REGD VTKLVAGN LGIVSAH+AYIRPVAQ G IPLFADLLQ  D   KEIAEDVFCLLAVA
Subjt:  CMASRERACQAIGLLAISKRGRHMLVELGVVSVLIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQGGVIPLFADLLQCPDLTGKEIAEDVFCLLAVA

Query:  EANAVVISDHLVRILKEGDDGAKAAAADVLWDLSGYKSSISVVHSSGAIPVLVDLLLVGNDEVREKFSRAITQLSYNETDRAALADAGAVEILTG-----
        E NAVVI DHLV++LKEGDD  KAAAADVL  LS YK SISVV +SGAIPVLVDLL  GN EVREK S AI +LS  ETDR ALADAGA++ L G     
Subjt:  EANAVVISDHLVRILKEGDDGAKAAAADVLWDLSGYKSSISVVHSSGAIPVLVDLLLVGNDEVREKFSRAITQLSYNETDRAALADAGAVEILTG-----

Query:  ---------------QGYPLYYDRVSEAVNTPAFQNMQERMTHIHATERYTTRSLRQMGIDQLTRDPDLL
                          PLY  RV+EA++TPAFQN+QER+THI A E  +  S+ Q+GI+Q T D DLL
Subjt:  ---------------QGYPLYYDRVSEAVNTPAFQNMQERMTHIHATERYTTRSLRQMGIDQLTRDPDLL

A0A5A7U359 Vacuolar protein 81.5e-16871.49Show/hide
Query:  MGHSQIQRESAKPSLDWEAALKRYENVIASESEAVKVKATVKLAHLSKKAPENILNSTILIIAKHLEGNLIDNSSHSMKGVAAYCLRCISCQGDGTLATA
        MG ++I R S  P+LDWEA+LK+YENV+ASESEA+KVKAT+KLA LSK APENIL S I IIAK LE    +N+S SM+  AAYCLRCISCQGDGTLA A
Subjt:  MGHSQIQRESAKPSLDWEAALKRYENVIASESEAVKVKATVKLAHLSKKAPENILNSTILIIAKHLEGNLIDNSSHSMKGVAAYCLRCISCQGDGTLATA

Query:  VDHSGALQSLLRLLSHSRGCFRKILVKCVWSIVTFDKTSCVIIARNGGLEVIIGMFDSVIDGTKRYLLEILSAMALLREVRKALISLRGLPFLVQAARCG
        V HSGAL+SLLR L HS GCF KILVKC+WS VTFDK+S VI+ RNGGLEVII M   V DGT+RYLLEILSAMALLREVRKAL+ LRGLPFLVQAAR G
Subjt:  VDHSGALQSLLRLLSHSRGCFRKILVKCVWSIVTFDKTSCVIIARNGGLEVIIGMFDSVIDGTKRYLLEILSAMALLREVRKALISLRGLPFLVQAARCG

Query:  CMASRERACQAIGLLAISKRGRHMLVELGVVSVLIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQGGVIPLFADLLQCPDLTGKEIAEDVFCLLAVA
         MASRERAC+AIGL+AI+KRGR+ LVELGVV+VLIEL REGD VTKLVAGNALGIVSAH+AYIRPVAQ G IPLFADLLQ PD   KEIAEDVFCLLAVA
Subjt:  CMASRERACQAIGLLAISKRGRHMLVELGVVSVLIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQGGVIPLFADLLQCPDLTGKEIAEDVFCLLAVA

Query:  EANAVVISDHLVRILKEGDDGAKAAAADVLWDLSGYKSSISVVHSSGAIPVLVDLLLVGNDEVREKFSRAITQLSYNETDRAALADAGAVEILTG-----
        E NAVVISDHLV++LKEGDD  KAAAADVL  LS YK SISVV +SGAIPVLVDLL  GN EVREK S AI +LSY ETDR ALADAGA++ L G     
Subjt:  EANAVVISDHLVRILKEGDDGAKAAAADVLWDLSGYKSSISVVHSSGAIPVLVDLLLVGNDEVREKFSRAITQLSYNETDRAALADAGAVEILTG-----

Query:  ---------------QGYPLYYDRVSEAVNTPAFQNMQERMTHIHATERYTTRSLRQMGIDQLTRDPDLL
                          PLYY R+SE V+TPAFQN+QER+THI ATE  +  S+ Q+GI+Q T D DL+
Subjt:  ---------------QGYPLYYDRVSEAVNTPAFQNMQERMTHIHATERYTTRSLRQMGIDQLTRDPDLL

A0A6J1CFX3 uncharacterized protein LOC1110104371.3e-18076.22Show/hide
Query:  MGHSQIQRESAKPSLDWEAALKRYENVIASESEAVKVKATVKLAHLSKKAPENILNSTILIIAKHLEGNLIDNSSHSMKGVAAYCLRCISCQG-DGTLAT
        M  S+I++ES KP+L+WE AL++Y+NV+ASESEAVKVKATVKLA+LS+ APENILNSTI IIAKHL  N I NSS SM+G AAYCLR ISC+G DGTLA 
Subjt:  MGHSQIQRESAKPSLDWEAALKRYENVIASESEAVKVKATVKLAHLSKKAPENILNSTILIIAKHLEGNLIDNSSHSMKGVAAYCLRCISCQG-DGTLAT

Query:  AVDHSGALQSLLRLLSHSRGCFRKILVKCVWSIVTFDKTSCVIIARNGGLEVIIGMFDSVIDGTKRYLLEILSAMALLREVRKALISLRGLPFLVQAARC
        AV +SGAL+SLLRLL +S G FRKILVKCVWS+VTF KTS VIIARNGGLEVIIGM DSVID ++RYLLEILSA+ALLREVRKALISLRGLPFLV+AARC
Subjt:  AVDHSGALQSLLRLLSHSRGCFRKILVKCVWSIVTFDKTSCVIIARNGGLEVIIGMFDSVIDGTKRYLLEILSAMALLREVRKALISLRGLPFLVQAARC

Query:  GCMASRERACQAIGLLAISKRGRHMLVELGVVSVLIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQGGVIPLFADLLQCPD-LTGKEIAEDVFCLLA
        GC+ASRERACQAIGLLAI+KRGR ML ELGVV VLIELFR GDC TKLVAGN+LGIVSAHVAYIRPVAQ G IPLFADLLQ P+   GKEIAEDVFCLLA
Subjt:  GCMASRERACQAIGLLAISKRGRHMLVELGVVSVLIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQGGVIPLFADLLQCPD-LTGKEIAEDVFCLLA

Query:  VAEANAVVISDHLVRILKEGDDGAKAAAADVLWDLSGYKSSISVVHSSGAIPVLVDLLLVGNDEVREKFSRAITQLSYNETDRAALADAGAV--------
        VAE NAVVISDHLVRILKEGDD AKAAAADVLWDLSGYK SIS  H+SGAIPV+VDLL   N+EVREK S AI QLSYNE DRAALADAGA+        
Subjt:  VAEANAVVISDHLVRILKEGDDGAKAAAADVLWDLSGYKSSISVVHSSGAIPVLVDLLLVGNDEVREKFSRAITQLSYNETDRAALADAGAV--------

Query:  -----------EILTGQGYPLYY-DRVSEAVNTPAFQNMQERMTHIHATERYTTRSLRQMGIDQLTRDPDL
                   E L      L+Y  RVSEAV+TPAF+NMQERMTHI  TE+++ RSLRQMGIDQLTRDPDL
Subjt:  -----------EILTGQGYPLYY-DRVSEAVNTPAFQNMQERMTHIHATERYTTRSLRQMGIDQLTRDPDL

A0A6J1F8U2 uncharacterized protein LOC1114431727.4e-18175.11Show/hide
Query:  MGHSQIQRESAKPSLDWEAALKRYENVIASESEAVKVKATVKLAHLSKKAPENILNSTILIIAKHLEGNLIDNSSHSMKGVAAYCLRCISCQGDGTLATA
        MGHSQI R S KP+LDWEAALK+YENV+A+ESEAVKVKAT+KLA LS  AP NILNSTI IIAKHLE N I NSS SM+G AAYCL+ IS QGDGTLATA
Subjt:  MGHSQIQRESAKPSLDWEAALKRYENVIASESEAVKVKATVKLAHLSKKAPENILNSTILIIAKHLEGNLIDNSSHSMKGVAAYCLRCISCQGDGTLATA

Query:  VDHSGALQSLLRLLSHSRGCFRKILVKCVWSIVTFDKTSCVIIARNGGLEVIIGMFDSVIDGTKRYLLEILSAMALLREVRKALISLRGLPFLVQAARCG
        V HSGAL+S+LR L  + G FRKILVKC+WSIVT DK S VI+ARNGGLEV+IGMFDSV+DGT+RYLLEILSA+AL+REVRKALISLRGLPFLVQAAR G
Subjt:  VDHSGALQSLLRLLSHSRGCFRKILVKCVWSIVTFDKTSCVIIARNGGLEVIIGMFDSVIDGTKRYLLEILSAMALLREVRKALISLRGLPFLVQAARCG

Query:  CMASRERACQAIGLLAISKRGRHMLVELGVVSVLIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQGGVIPLFADLLQCPDLTGKEIAEDVFCLLAVA
        CMASRERACQAIGL+AI++RGRHMLVELGV+ VLIELF EGD VTKLVAGN+LGIVSAHVAYIRPVAQ G IPLFADLLQ P+   KEIAEDVFCLLAVA
Subjt:  CMASRERACQAIGLLAISKRGRHMLVELGVVSVLIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQGGVIPLFADLLQCPDLTGKEIAEDVFCLLAVA

Query:  EANAVVISDHLVRILKEGDDGAKAAAADVLWDLSGYKSSISVVHSSGAIPVLVDLLLVGNDEVREKFSRAITQLSYNETDRAALADAGAVEILTG-----
        EANAV+I DHLVRILKEGDD  KAAAADVLWDLS YK SI +V SSGAIPVLVDLL  GN EVREK S A+ QLSY+E DR ALADAGA+  L G     
Subjt:  EANAVVISDHLVRILKEGDDGAKAAAADVLWDLSGYKSSISVVHSSGAIPVLVDLLLVGNDEVREKFSRAITQLSYNETDRAALADAGAVEILTG-----

Query:  ---------------QGYPLYYDRVSEAVNTPAFQNMQERMTHIHATERYTTRSLRQMGIDQLTRDPDLL
                           LY +RVSEA++ PAFQN+ ERMTHI ATER+  RSLRQMGI+QLT DPDLL
Subjt:  ---------------QGYPLYYDRVSEAVNTPAFQNMQERMTHIHATERYTTRSLRQMGIDQLTRDPDLL

A0A6J1ILS8 uncharacterized protein LOC1114767618.2e-18074.26Show/hide
Query:  MGHSQIQRESAKPSLDWEAALKRYENVIASESEAVKVKATVKLAHLSKKAPENILNSTILIIAKHLEGNLIDNSSHSMKGVAAYCLRCISCQGDGTLATA
        M HS+I+R S KP+LDWEAALK+YENV+A+ESEAVKVKAT+KLA LS  AP NILNSTI IIAKHLE N I NSS SM+G AAYCL+ IS QGDGTLATA
Subjt:  MGHSQIQRESAKPSLDWEAALKRYENVIASESEAVKVKATVKLAHLSKKAPENILNSTILIIAKHLEGNLIDNSSHSMKGVAAYCLRCISCQGDGTLATA

Query:  VDHSGALQSLLRLLSHSRGCFRKILVKCVWSIVTFDKTSCVIIARNGGLEVIIGMFDSVIDGTKRYLLEILSAMALLREVRKALISLRGLPFLVQAARCG
        + HSGAL+S+LR L  + G FRKILVKC+WSIVT DK S VI+ARNGGLEV+IGMFDSV+DGT+RYLLEILSA+AL+REVRKALISLRGLPFLVQAAR G
Subjt:  VDHSGALQSLLRLLSHSRGCFRKILVKCVWSIVTFDKTSCVIIARNGGLEVIIGMFDSVIDGTKRYLLEILSAMALLREVRKALISLRGLPFLVQAARCG

Query:  CMASRERACQAIGLLAISKRGRHMLVELGVVSVLIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQGGVIPLFADLLQCPDLTGKEIAEDVFCLLAVA
        CMASRERACQAIGL+AI++RGRHMLVELGV+ VLIELF EGD VTKLVAGN+LGIVSAHVAYIRPVAQ G IPLFADLLQ PD   KEIAEDVFCLLAVA
Subjt:  CMASRERACQAIGLLAISKRGRHMLVELGVVSVLIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQGGVIPLFADLLQCPDLTGKEIAEDVFCLLAVA

Query:  EANAVVISDHLVRILKEGDDGAKAAAADVLWDLSGYKSSISVVHSSGAIPVLVDLLLVGNDEVREKFSRAITQLSYNETDRAALADAGAVE---------
        EANAV+I DHLVRILKEGDD  KAAAADVLWDLS YK SI +V SSGAIPVLVDLL  GN EVREK S A+ QLSY+E DR ALADAGA+          
Subjt:  EANAVVISDHLVRILKEGDDGAKAAAADVLWDLSGYKSSISVVHSSGAIPVLVDLLLVGNDEVREKFSRAITQLSYNETDRAALADAGAVE---------

Query:  -----------ILTGQGYPLYYDRVSEAVNTPAFQNMQERMTHIHATERYTTRSLRQMGIDQLTRDPDLL
                   ++      LY +RVSEA++ PAFQN+ ERMTHI ATER+  RSLRQMGI+QLT DPDLL
Subjt:  -----------ILTGQGYPLYYDRVSEAVNTPAFQNMQERMTHIHATERYTTRSLRQMGIDQLTRDPDLL

SwissProt top hitse value%identityAlignment
A2ZLU6 Protein spotted leaf 112.0e-0531.11Show/hide
Query:  VAQGGVIPLFADLLQCPDLTGKEIAEDVFCLLAVAEAN-AVVISD----HLVRILKEGDDGAKAAAADVLWDLSGYKSSISVVHSSGAIPVLVDLLLVGN
        +A+ G IPL   LL   DL  +E A      L++ E N A +IS      +V +LK G   A+  AA  L+ LS        +   GAIP LV LL  G+
Subjt:  VAQGGVIPLFADLLQCPDLTGKEIAEDVFCLLAVAEAN-AVVISD----HLVRILKEGDDGAKAAAADVLWDLSGYKSSISVVHSSGAIPVLVDLLLVGN

Query:  DEVREKFSRAITQLSYNETDRAALADAGAVEILTG
           ++  + A+  L   + ++     AG V ++ G
Subjt:  DEVREKFSRAITQLSYNETDRAALADAGAVEILTG

Q0IMG9 E3 ubiquitin-protein ligase SPL112.0e-0531.11Show/hide
Query:  VAQGGVIPLFADLLQCPDLTGKEIAEDVFCLLAVAEAN-AVVISD----HLVRILKEGDDGAKAAAADVLWDLSGYKSSISVVHSSGAIPVLVDLLLVGN
        +A+ G IPL   LL   DL  +E A      L++ E N A +IS      +V +LK G   A+  AA  L+ LS        +   GAIP LV LL  G+
Subjt:  VAQGGVIPLFADLLQCPDLTGKEIAEDVFCLLAVAEAN-AVVISD----HLVRILKEGDDGAKAAAADVLWDLSGYKSSISVVHSSGAIPVLVDLLLVGN

Query:  DEVREKFSRAITQLSYNETDRAALADAGAVEILTG
           ++  + A+  L   + ++     AG V ++ G
Subjt:  DEVREKFSRAITQLSYNETDRAALADAGAVEILTG

Q681N2 U-box domain-containing protein 156.2e-0731.58Show/hide
Query:  VAQGGVIPLFADLLQCPDLTGKEIAEDVFCLLAVAEANAVVISD-----HLVRILKEGDDGAKAAAADVLWDLSGYKSSISVVHSSGAIPVLVDLLLVGN
        +A  G IPL   LL  PD   +E A      L++ E N  +IS+     +++ IL+ G+  A+  +A  L+ LS    +   +  S  IP LVDLL  G 
Subjt:  VAQGGVIPLFADLLQCPDLTGKEIAEDVFCLLAVAEANAVVISD-----HLVRILKEGDDGAKAAAADVLWDLSGYKSSISVVHSSGAIPVLVDLLLVGN

Query:  DEVREKFSRAITQLSYNETDRAALADAGAVEIL
           ++    A+  LS N  ++    DAG V+ L
Subjt:  DEVREKFSRAITQLSYNETDRAALADAGAVEIL

Q8GUG9 U-box domain-containing protein 115.8e-0529.32Show/hide
Query:  VAQGGVIPLFADLLQCPDLTGKEIAEDVFCLLAVAEANAVVIS-----DHLVRILKEGDDGAKAAAADVLWDLSGYKSSISVVHSSGAIPVLVDLLLVGN
        +A+ G IP+  +LL   D+  +E A      L++ E N  +I        +V++L+ G   A+  AA  L+ LS    +  ++  SGAIP LVDLL  G 
Subjt:  VAQGGVIPLFADLLQCPDLTGKEIAEDVFCLLAVAEANAVVIS-----DHLVRILKEGDDGAKAAAADVLWDLSGYKSSISVVHSSGAIPVLVDLLLVGN

Query:  DEVREKFSRAITQLSYNETDRAALADAGAVEIL
           ++  + A+  L     ++     AG V  L
Subjt:  DEVREKFSRAITQLSYNETDRAALADAGAVEIL

Q9SNC6 U-box domain-containing protein 135.8e-0529.85Show/hide
Query:  VAQGGVIPLFADLLQCPDLTGKEIAEDVFCLLAVAEAN--AVVIS---DHLVRILKEGDDGAKAAAADVLWDLSGYKSSISVVHSSGAIPVLVDLLLVGN
        +A+ G IPL   LL  PD   +E +      L++ E N  A+V +     +V++LK+G   A+  AA  L+ LS    +   + + GAIP LV LL  G 
Subjt:  VAQGGVIPLFADLLQCPDLTGKEIAEDVFCLLAVAEAN--AVVIS---DHLVRILKEGDDGAKAAAADVLWDLSGYKSSISVVHSSGAIPVLVDLLLVGN

Query:  DEVREKFSRAITQLSYNETDRAALADAGAVEILT
           ++  + A+  L   + ++     AG +  LT
Subjt:  DEVREKFSRAITQLSYNETDRAALADAGAVEILT

Arabidopsis top hitse value%identityAlignment
AT1G23030.1 ARM repeat superfamily protein4.1e-0629.32Show/hide
Query:  VAQGGVIPLFADLLQCPDLTGKEIAEDVFCLLAVAEANAVVIS-----DHLVRILKEGDDGAKAAAADVLWDLSGYKSSISVVHSSGAIPVLVDLLLVGN
        +A+ G IP+  +LL   D+  +E A      L++ E N  +I        +V++L+ G   A+  AA  L+ LS    +  ++  SGAIP LVDLL  G 
Subjt:  VAQGGVIPLFADLLQCPDLTGKEIAEDVFCLLAVAEANAVVIS-----DHLVRILKEGDDGAKAAAADVLWDLSGYKSSISVVHSSGAIPVLVDLLLVGN

Query:  DEVREKFSRAITQLSYNETDRAALADAGAVEIL
           ++  + A+  L     ++     AG V  L
Subjt:  DEVREKFSRAITQLSYNETDRAALADAGAVEIL

AT2G45720.1 ARM repeat superfamily protein2.2e-0721.32Show/hide
Query:  NVIASESEAVKVKATVKLAHLSKKAPENILNSTILIIAKHLEGNLIDNSSHSMKGVAAYCLRCISCQGDGTLATAVDHSGALQSLLRLLSHSRGCFRKIL
        NV  SE +  K+K    L  LS K   ++ +  +L+    L G +    S S + +  + +R                    + L RL        RK L
Subjt:  NVIASESEAVKVKATVKLAHLSKKAPENILNSTILIIAKHLEGNLIDNSSHSMKGVAAYCLRCISCQGDGTLATAVDHSGALQSLLRLLSHSRGCFRKIL

Query:  VKCVWSIVTFDKTSCVIIARNGGLEVIIGMFDSVIDGTKRYLLEILSAMALLREVRKALISLRGLPFLVQAARCGCMASRERACQAIGLLAISKRGRHML
         + V  ++  D+ + +       +  ++ +  +     +   + ++ ++A        LIS   LP L++    G + ++E+A  ++  ++IS      +
Subjt:  VKCVWSIVTFDKTSCVIIARNGGLEVIIGMFDSVIDGTKRYLLEILSAMALLREVRKALISLRGLPFLVQAARCGCMASRERACQAIGLLAISKRGRHML

Query:  VELGVVSVLIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQGGVIPLFADLLQCPDLTG-KEIAEDVFCLLAVAEANAV----VISDH----------
        V  G V  LIE+ + GD V++  +   L  +SA     + +A+ G++ +  ++L C  L G KE A +  CL  +  +N      VIS++          
Subjt:  VELGVVSVLIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQGGVIPLFADLLQCPDLTG-KEIAEDVFCLLAVAEANAV----VISDH----------

Query:  ------------------------------LVRILKEGDDGAKAAAADVLWDLSGYKSSISVVHSSGAIPVLVDLLLVGNDEVREKFSRAITQL
                                      LV +LK G  GA+ AAA  +  ++    +  ++  SG IP+L+ +L       RE  ++AI  L
Subjt:  ------------------------------LVRILKEGDDGAKAAAADVLWDLSGYKSSISVVHSSGAIPVLVDLLLVGNDEVREKFSRAITQL

AT2G45720.2 ARM repeat superfamily protein2.2e-0721.32Show/hide
Query:  NVIASESEAVKVKATVKLAHLSKKAPENILNSTILIIAKHLEGNLIDNSSHSMKGVAAYCLRCISCQGDGTLATAVDHSGALQSLLRLLSHSRGCFRKIL
        NV  SE +  K+K    L  LS K   ++ +  +L+    L G +    S S + +  + +R                    + L RL        RK L
Subjt:  NVIASESEAVKVKATVKLAHLSKKAPENILNSTILIIAKHLEGNLIDNSSHSMKGVAAYCLRCISCQGDGTLATAVDHSGALQSLLRLLSHSRGCFRKIL

Query:  VKCVWSIVTFDKTSCVIIARNGGLEVIIGMFDSVIDGTKRYLLEILSAMALLREVRKALISLRGLPFLVQAARCGCMASRERACQAIGLLAISKRGRHML
         + V  ++  D+ + +       +  ++ +  +     +   + ++ ++A        LIS   LP L++    G + ++E+A  ++  ++IS      +
Subjt:  VKCVWSIVTFDKTSCVIIARNGGLEVIIGMFDSVIDGTKRYLLEILSAMALLREVRKALISLRGLPFLVQAARCGCMASRERACQAIGLLAISKRGRHML

Query:  VELGVVSVLIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQGGVIPLFADLLQCPDLTG-KEIAEDVFCLLAVAEANAV----VISDH----------
        V  G V  LIE+ + GD V++  +   L  +SA     + +A+ G++ +  ++L C  L G KE A +  CL  +  +N      VIS++          
Subjt:  VELGVVSVLIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQGGVIPLFADLLQCPDLTG-KEIAEDVFCLLAVAEANAV----VISDH----------

Query:  ------------------------------LVRILKEGDDGAKAAAADVLWDLSGYKSSISVVHSSGAIPVLVDLLLVGNDEVREKFSRAITQL
                                      LV +LK G  GA+ AAA  +  ++    +  ++  SG IP+L+ +L       RE  ++AI  L
Subjt:  ------------------------------LVRILKEGDDGAKAAAADVLWDLSGYKSSISVVHSSGAIPVLVDLLLVGNDEVREKFSRAITQL

AT3G20170.1 ARM repeat superfamily protein1.0e-10546.01Show/hide
Query:  MGHSQIQRES-AKPSLDWEAALKRYENVIASESEAVKVKATVKLAHLSKKAPENILNSTILIIAKHLEGNLIDNSSHSMKGVAAYCLRCISCQG--DGTL
        MG S+ + ES    + DWE    R+EN I+S S +++V++ +KL+ L+ + PE+ ++  I I+A  L   + D+S+ S++  AA+CL+CI+C G  +   
Subjt:  MGHSQIQRES-AKPSLDWEAALKRYENVIASESEAVKVKATVKLAHLSKKAPENILNSTILIIAKHLEGNLIDNSSHSMKGVAAYCLRCISCQG--DGTL

Query:  ATAVDHSGALQSLLRLLSHSR---GCFRKILVKCVWSIVTFDKTSCVIIARNGGLEVIIGMFDS-VIDGTKRYLLEILSAMALLREVRKALISLRGLPFL
        A  +   G + SLL LL  +      FR+I VKC+WS+VTF  +  V +AR GGLE++I   ++   DG++ YLLEILSA+  +RE R+ L+   GL FL
Subjt:  ATAVDHSGALQSLLRLLSHSR---GCFRKILVKCVWSIVTFDKTSCVIIARNGGLEVIIGMFDS-VIDGTKRYLLEILSAMALLREVRKALISLRGLPFL

Query:  VQAARCGCMASRERACQAIGLLAISKRGRHMLVELGVVSVLIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQGGVIPLFADLLQCPDLTGKEIAEDV
        V+AA+ G +ASRERAC AIGL+ +++R R +LVE GV+  L++L+R+GD   KL+AGNALGI+SA   YIRPV + G IPL+ +LL   D  GK+IAEDV
Subjt:  VQAARCGCMASRERACQAIGLLAISKRGRHMLVELGVVSVLIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQGGVIPLFADLLQCPDLTGKEIAEDV

Query:  FCLLAVAEANAVVISDHLVRILKEGDDGAKAAAADVLWDLSGYKSSISVVHSSGAIPVLVDLLLVGNDEVREKFSRAITQLSYNETDRAALADAGAVEIL
        FC+LAVAE NAV+I++ LVRIL+ GD+ AK AA+DVLWDL+GY+ S+SV+  SGAIP+L++LL  G+ E RE+ S AI+QLSYNE DR A +D+G + IL
Subjt:  FCLLAVAEANAVVISDHLVRILKEGDDGAKAAAADVLWDLSGYKSSISVVHSSGAIPVLVDLLLVGNDEVREKFSRAITQLSYNETDRAALADAGAVEIL

Query:  T----GQGYPL----------------YYDRVSEAVNTPAFQNMQERMTHIHATERYTTRSLRQMGIDQLTRDPDL
              +   L                +Y RV EA+  P FQ+MQ R+  I A+     RS+R++ I  L  D DL
Subjt:  T----GQGYPL----------------YYDRVSEAVNTPAFQNMQERMTHIHATERYTTRSLRQMGIDQLTRDPDL

AT5G42340.1 Plant U-Box 154.4e-0831.58Show/hide
Query:  VAQGGVIPLFADLLQCPDLTGKEIAEDVFCLLAVAEANAVVISD-----HLVRILKEGDDGAKAAAADVLWDLSGYKSSISVVHSSGAIPVLVDLLLVGN
        +A  G IPL   LL  PD   +E A      L++ E N  +IS+     +++ IL+ G+  A+  +A  L+ LS    +   +  S  IP LVDLL  G 
Subjt:  VAQGGVIPLFADLLQCPDLTGKEIAEDVFCLLAVAEANAVVISD-----HLVRILKEGDDGAKAAAADVLWDLSGYKSSISVVHSSGAIPVLVDLLLVGN

Query:  DEVREKFSRAITQLSYNETDRAALADAGAVEIL
           ++    A+  LS N  ++    DAG V+ L
Subjt:  DEVREKFSRAITQLSYNETDRAALADAGAVEIL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCTCGACGTGTGGACTTCTTCGGGTTGGACGTCTTCCACGTTTTGGACTTCGTCGGGTTGGACATCCACCACATGGACTTCTTCTGGGGGTTCACGTCGTCCTCAC
CCAGATGTTGAACAAGCAATGCGTCGATCGCTTAACGTCCCTCGTTGCATTTGTTGGACGTCTTCCACGTTGTGGACTTCGTCGAGTTGGACGTCCTCCACGTTTTGGAC
TTTGTCGGGTTGGGCGTCCACTATTCTCTCTAGCTAAGTTCATCAATTCCGTCAAATTTCAAGCTGCTTGCTCATTGTTCTCGTCTCTTCATCTAATTGGACGCTTTGAA
TTTCGTCGGTTTTCATTTTCTAACATGGGCCACTCACAGATTCAGAGAGAATCTGCAAAGCCGAGTTTAGATTGGGAAGCAGCGCTGAAGCGGTATGAAAACGTTATTGC
TTCAGAATCCGAGGCTGTTAAAGTGAAAGCCACAGTCAAATTGGCCCATCTCTCCAAAAAGGCCCCCGAGAATATTTTGAACAGCACCATACTCATTATTGCTAAGCATC
TTGAGGGTAACCTCATCGATAATTCAAGCCATTCCATGAAAGGAGTTGCTGCATATTGTCTAAGATGTATATCTTGTCAAGGTGATGGTACTTTGGCGACGGCAGTAGAT
CATTCTGGCGCTCTACAGTCGTTGCTGAGGTTATTGTCACATTCTAGAGGTTGTTTTCGTAAGATTTTAGTTAAATGTGTTTGGAGTATTGTCACTTTTGATAAAACTAG
TTGTGTAATTATAGCAAGAAATGGGGGTTTGGAAGTTATTATTGGCATGTTTGATTCTGTAATTGATGGCACTAAGAGATATTTATTGGAGATTTTGAGTGCAATGGCAC
TATTGAGAGAGGTTAGGAAGGCCCTTATTAGTTTAAGAGGACTCCCTTTTCTTGTGCAAGCTGCTAGATGTGGTTGCATGGCCTCCAGAGAAAGAGCTTGCCAAGCAATT
GGGTTGCTTGCAATTTCCAAGCGTGGAAGACACATGCTTGTTGAGTTGGGAGTGGTTTCAGTGCTTATTGAACTATTTCGTGAAGGAGATTGTGTGACAAAACTGGTAGC
TGGTAATGCTCTAGGAATAGTTTCAGCTCATGTAGCCTATATTAGGCCTGTTGCACAAGGTGGGGTCATTCCTTTATTTGCTGATCTTCTTCAGTGCCCTGACCTTACTG
GTAAGGAGATTGCAGAAGATGTCTTCTGTCTCTTAGCTGTTGCTGAAGCGAATGCAGTAGTCATTTCTGATCATCTAGTGAGAATTCTTAAAGAAGGTGATGATGGAGCA
AAGGCTGCAGCTGCTGATGTTTTATGGGATCTTTCAGGCTATAAGTCCTCCATATCAGTTGTGCATAGTTCGGGTGCCATTCCAGTTTTGGTGGATCTATTACTGGTCGG
GAATGATGAAGTGAGGGAAAAATTCTCTAGAGCTATAACCCAATTAAGTTATAATGAGACAGACAGGGCTGCACTTGCTGATGCAGGAGCGGTTGAAATACTAACAGGCC
AAGGCTACCCCTTATACTATGATAGAGTATCTGAAGCAGTTAACACTCCTGCTTTCCAGAATATGCAGGAAAGAATGACCCATATTCATGCTACTGAAAGGTACACCACA
AGGTCCCTACGTCAAATGGGAATCGACCAACTTACCAGGGATCCAGATCTTCTATAA
mRNA sequenceShow/hide mRNA sequence
ATGTCCTCGACGTGTGGACTTCTTCGGGTTGGACGTCTTCCACGTTTTGGACTTCGTCGGGTTGGACATCCACCACATGGACTTCTTCTGGGGGTTCACGTCGTCCTCAC
CCAGATGTTGAACAAGCAATGCGTCGATCGCTTAACGTCCCTCGTTGCATTTGTTGGACGTCTTCCACGTTGTGGACTTCGTCGAGTTGGACGTCCTCCACGTTTTGGAC
TTTGTCGGGTTGGGCGTCCACTATTCTCTCTAGCTAAGTTCATCAATTCCGTCAAATTTCAAGCTGCTTGCTCATTGTTCTCGTCTCTTCATCTAATTGGACGCTTTGAA
TTTCGTCGGTTTTCATTTTCTAACATGGGCCACTCACAGATTCAGAGAGAATCTGCAAAGCCGAGTTTAGATTGGGAAGCAGCGCTGAAGCGGTATGAAAACGTTATTGC
TTCAGAATCCGAGGCTGTTAAAGTGAAAGCCACAGTCAAATTGGCCCATCTCTCCAAAAAGGCCCCCGAGAATATTTTGAACAGCACCATACTCATTATTGCTAAGCATC
TTGAGGGTAACCTCATCGATAATTCAAGCCATTCCATGAAAGGAGTTGCTGCATATTGTCTAAGATGTATATCTTGTCAAGGTGATGGTACTTTGGCGACGGCAGTAGAT
CATTCTGGCGCTCTACAGTCGTTGCTGAGGTTATTGTCACATTCTAGAGGTTGTTTTCGTAAGATTTTAGTTAAATGTGTTTGGAGTATTGTCACTTTTGATAAAACTAG
TTGTGTAATTATAGCAAGAAATGGGGGTTTGGAAGTTATTATTGGCATGTTTGATTCTGTAATTGATGGCACTAAGAGATATTTATTGGAGATTTTGAGTGCAATGGCAC
TATTGAGAGAGGTTAGGAAGGCCCTTATTAGTTTAAGAGGACTCCCTTTTCTTGTGCAAGCTGCTAGATGTGGTTGCATGGCCTCCAGAGAAAGAGCTTGCCAAGCAATT
GGGTTGCTTGCAATTTCCAAGCGTGGAAGACACATGCTTGTTGAGTTGGGAGTGGTTTCAGTGCTTATTGAACTATTTCGTGAAGGAGATTGTGTGACAAAACTGGTAGC
TGGTAATGCTCTAGGAATAGTTTCAGCTCATGTAGCCTATATTAGGCCTGTTGCACAAGGTGGGGTCATTCCTTTATTTGCTGATCTTCTTCAGTGCCCTGACCTTACTG
GTAAGGAGATTGCAGAAGATGTCTTCTGTCTCTTAGCTGTTGCTGAAGCGAATGCAGTAGTCATTTCTGATCATCTAGTGAGAATTCTTAAAGAAGGTGATGATGGAGCA
AAGGCTGCAGCTGCTGATGTTTTATGGGATCTTTCAGGCTATAAGTCCTCCATATCAGTTGTGCATAGTTCGGGTGCCATTCCAGTTTTGGTGGATCTATTACTGGTCGG
GAATGATGAAGTGAGGGAAAAATTCTCTAGAGCTATAACCCAATTAAGTTATAATGAGACAGACAGGGCTGCACTTGCTGATGCAGGAGCGGTTGAAATACTAACAGGCC
AAGGCTACCCCTTATACTATGATAGAGTATCTGAAGCAGTTAACACTCCTGCTTTCCAGAATATGCAGGAAAGAATGACCCATATTCATGCTACTGAAAGGTACACCACA
AGGTCCCTACGTCAAATGGGAATCGACCAACTTACCAGGGATCCAGATCTTCTATAA
Protein sequenceShow/hide protein sequence
MSSTCGLLRVGRLPRFGLRRVGHPPHGLLLGVHVVLTQMLNKQCVDRLTSLVAFVGRLPRCGLRRVGRPPRFGLCRVGRPLFSLAKFINSVKFQAACSLFSSLHLIGRFE
FRRFSFSNMGHSQIQRESAKPSLDWEAALKRYENVIASESEAVKVKATVKLAHLSKKAPENILNSTILIIAKHLEGNLIDNSSHSMKGVAAYCLRCISCQGDGTLATAVD
HSGALQSLLRLLSHSRGCFRKILVKCVWSIVTFDKTSCVIIARNGGLEVIIGMFDSVIDGTKRYLLEILSAMALLREVRKALISLRGLPFLVQAARCGCMASRERACQAI
GLLAISKRGRHMLVELGVVSVLIELFREGDCVTKLVAGNALGIVSAHVAYIRPVAQGGVIPLFADLLQCPDLTGKEIAEDVFCLLAVAEANAVVISDHLVRILKEGDDGA
KAAAADVLWDLSGYKSSISVVHSSGAIPVLVDLLLVGNDEVREKFSRAITQLSYNETDRAALADAGAVEILTGQGYPLYYDRVSEAVNTPAFQNMQERMTHIHATERYTT
RSLRQMGIDQLTRDPDLL