; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg018079 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg018079
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionUPF0496 protein 4
Genome locationscaffold9:27897661..27898755
RNA-Seq ExpressionSpg018079
SyntenySpg018079
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR008511 - Protein BYPASS-related


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0059563.1 UPF0496 protein 4 [Cucumis melo var. makuwa]2.0e-18992.86Show/hide
Query:  MEGSSHDGISLDSDLDSFQKQVTQRFQDLSSASSDEILSLSWIRKLLDAFICCQEEFKIILFGHKTEICRPPMDRMVSDYLERSVKALDVCNAIRDGIEQ
        MEG SHDGI LDSDLDSFQKQVTQ FQDLSSASSD+ILSLSWIRKLLDAFICCQEEFKIILFGHK +ICRPPMDRMVSDYLERSVKALDVCN IRDGIEQ
Subjt:  MEGSSHDGISLDSDLDSFQKQVTQRFQDLSSASSDEILSLSWIRKLLDAFICCQEEFKIILFGHKTEICRPPMDRMVSDYLERSVKALDVCNAIRDGIEQ

Query:  LRQWQKLLEIVLSALDNCSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPALAQRNRSFGRNNGTRDRRSLGHFRSLSWSVSRSWSAARQLQAIGSNL
        LRQWQKLLEIVLSALDN S+KK LGEGQFRRAKKALIDLAIAMLDENDANSPA+AQRNRSFGRNNGTRDRRSLGHFRSLSWSVSRSWSAARQLQAIGSNL
Subjt:  LRQWQKLLEIVLSALDNCSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPALAQRNRSFGRNNGTRDRRSLGHFRSLSWSVSRSWSAARQLQAIGSNL

Query:  AAPRANETVATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLHVHFSLPRQFAWAAPMLSLHDRIWEESRRRERRNACGLLKEIHQIDKFAHIMNELAD
        AAPRANE V TNGLAVPVFTMNMVLLFVTW L+AAIPCQDRGLHVHFSLPRQF+WAAPMLSLHDRI EESRRRERRNACGLLKEIHQIDKFAHIMNELAD
Subjt:  AAPRANETVATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLHVHFSLPRQFAWAAPMLSLHDRIWEESRRRERRNACGLLKEIHQIDKFAHIMNELAD

Query:  SAHFPLTNDREEEVRQRVQELSQICGTMKIGLDPLERQIREVFHRIVRSRTEGLDFLGRGNNPE
        +A FPLTN+REEEVR+RVQELSQIC T+KIGLDPLERQIREVFHRIVRSRTEGLD LGRGN  E
Subjt:  SAHFPLTNDREEEVRQRVQELSQICGTMKIGLDPLERQIREVFHRIVRSRTEGLDFLGRGNNPE

XP_008461918.1 PREDICTED: uncharacterized protein LOC103500406 [Cucumis melo]1.5e-18992.86Show/hide
Query:  MEGSSHDGISLDSDLDSFQKQVTQRFQDLSSASSDEILSLSWIRKLLDAFICCQEEFKIILFGHKTEICRPPMDRMVSDYLERSVKALDVCNAIRDGIEQ
        MEG SHDGI LDSDLDSFQKQVTQ FQDLSSASSD+ILSLSWIRKLLDAFICCQEEFKIILFGHK +ICRPPMDRMVSDYLERSVKALDVCN IRDGIEQ
Subjt:  MEGSSHDGISLDSDLDSFQKQVTQRFQDLSSASSDEILSLSWIRKLLDAFICCQEEFKIILFGHKTEICRPPMDRMVSDYLERSVKALDVCNAIRDGIEQ

Query:  LRQWQKLLEIVLSALDNCSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPALAQRNRSFGRNNGTRDRRSLGHFRSLSWSVSRSWSAARQLQAIGSNL
        LRQWQKLLEIVLSALDN SHKK LGEGQFRRAKKALIDLAIAMLDENDANSPA+AQRNRSFGRNNGTRDRRSLGHFRSLSWSVSRSWSAARQLQAIGSNL
Subjt:  LRQWQKLLEIVLSALDNCSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPALAQRNRSFGRNNGTRDRRSLGHFRSLSWSVSRSWSAARQLQAIGSNL

Query:  AAPRANETVATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLHVHFSLPRQFAWAAPMLSLHDRIWEESRRRERRNACGLLKEIHQIDKFAHIMNELAD
        AAPRANE V TNGLAVPVFTMNMVLLFVTW L+AAIPCQDRGLHVHFSLPRQF+WAAPMLSLHDRI EESRRRERRNACGLLKEIHQIDKFAHIMNELAD
Subjt:  AAPRANETVATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLHVHFSLPRQFAWAAPMLSLHDRIWEESRRRERRNACGLLKEIHQIDKFAHIMNELAD

Query:  SAHFPLTNDREEEVRQRVQELSQICGTMKIGLDPLERQIREVFHRIVRSRTEGLDFLGRGNNPE
        +A FPLTN+REEEVR+RVQELSQIC T+KIGLDPLE QIREVFHRIVRSRTEGLD LGRGN  E
Subjt:  SAHFPLTNDREEEVRQRVQELSQICGTMKIGLDPLERQIREVFHRIVRSRTEGLDFLGRGNNPE

XP_022936002.1 uncharacterized protein LOC111442733 [Cucurbita moschata]8.6e-18590.66Show/hide
Query:  MEGSSHDGISLDSDLDSFQKQVTQRFQDLSSASSDEILSLSWIRKLLDAFICCQEEFKIILFGHKTEICRPPMDRMVSDYLERSVKALDVCNAIRDGIEQ
        ME S+HDG+S  SDLD FQKQVTQRFQDLSS SS+EILSLSWIRK+LDAFICCQEEFKIILF  K E+CR PMDRMVSDYLERSVKALDVCN IR+GI+Q
Subjt:  MEGSSHDGISLDSDLDSFQKQVTQRFQDLSSASSDEILSLSWIRKLLDAFICCQEEFKIILFGHKTEICRPPMDRMVSDYLERSVKALDVCNAIRDGIEQ

Query:  LRQWQKLLEIVLSALDNCSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPALAQRNRSFGRNNGTRDRRSLGHFRSLSWSVSRSWSAARQLQAIGSNL
        LRQWQKLLEIVLSALD+ SHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPA+AQRNRSFGR+NGTRD R LGHFRSLSWSVSRSWSAARQLQAIGSNL
Subjt:  LRQWQKLLEIVLSALDNCSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPALAQRNRSFGRNNGTRDRRSLGHFRSLSWSVSRSWSAARQLQAIGSNL

Query:  AAPRANETVATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLHVHFSLPRQFAWAAPMLSLHDRIWEESRRRERRNACGLLKEIHQIDKFAHIMNELAD
         APRANET+ATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGL+VHFSLPRQFAWAAPMLSLHDRI EESRRRERRNACGLLKEIHQIDKFAHIMNELAD
Subjt:  AAPRANETVATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLHVHFSLPRQFAWAAPMLSLHDRIWEESRRRERRNACGLLKEIHQIDKFAHIMNELAD

Query:  SAHFPLTNDREEEVRQRVQELSQICGTMKIGLDPLERQIREVFHRIVRSRTEGLDFLGRGNNPE
        +A FPLTND+EEEVRQR+QELSQIC T+KIGLDPLERQIREVFHRIVRSRTEGLDFLGRGNN E
Subjt:  SAHFPLTNDREEEVRQRVQELSQICGTMKIGLDPLERQIREVFHRIVRSRTEGLDFLGRGNNPE

XP_031744688.1 uncharacterized protein LOC101216228 [Cucumis sativus]3.4e-18993.13Show/hide
Query:  MEGSSHDGISLDSDLDSFQKQVTQRFQDLSSASSDEILSLSWIRKLLDAFICCQEEFKIILFGHKTEICRPPMDRMVSDYLERSVKALDVCNAIRDGIEQ
        MEGSSHDGI LDSDLDSFQKQVTQRFQDLSSASSD+ILSLSWIRKLLDAFICCQEEFKIIL GHK EICRPP+DRMVSDYLERSVKALDVCN IRDGIEQ
Subjt:  MEGSSHDGISLDSDLDSFQKQVTQRFQDLSSASSDEILSLSWIRKLLDAFICCQEEFKIILFGHKTEICRPPMDRMVSDYLERSVKALDVCNAIRDGIEQ

Query:  LRQWQKLLEIVLSALDNCSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPALAQRNRSFGRNNGTRDRRSLGHFRSLSWSVSRSWSAARQLQAIGSNL
        LRQWQKLLEIVLSALDN S+KKTLGEGQFRRAKKALIDLAIAMLDENDANSPA+AQRNRSFGRNNGTRDRRSLGHFRSLSWSVSRSWSAARQLQAIGSNL
Subjt:  LRQWQKLLEIVLSALDNCSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPALAQRNRSFGRNNGTRDRRSLGHFRSLSWSVSRSWSAARQLQAIGSNL

Query:  AAPRANETVATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLHVHFSLPRQFAWAAPMLSLHDRIWEESRRRERRNACGLLKEIHQIDKFAHIMNELAD
        AAPRANE V TNGLAVPVFTMNMVLLFVTW L+AAIPCQDRGLHVHFSLPRQF+WAAPMLSLHDRI EESRRRERRNACGLLKEIHQIDKFAHIMNEL D
Subjt:  AAPRANETVATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLHVHFSLPRQFAWAAPMLSLHDRIWEESRRRERRNACGLLKEIHQIDKFAHIMNELAD

Query:  SAHFPLTNDREEEVRQRVQELSQICGTMKIGLDPLERQIREVFHRIVRSRTEGLDFLGRGNNPE
        +A FPLTN+REEEVRQRVQELSQIC T+KIGLDPLERQIREVFHRIVRSRTEGLD LG GN  E
Subjt:  SAHFPLTNDREEEVRQRVQELSQICGTMKIGLDPLERQIREVFHRIVRSRTEGLDFLGRGNNPE

XP_038897608.1 uncharacterized protein LOC120085610 [Benincasa hispida]1.3e-19394.78Show/hide
Query:  MEGSSHDGISLDSDLDSFQKQVTQRFQDLSSASSDEILSLSWIRKLLDAFICCQEEFKIILFGHKTEICRPPMDRMVSDYLERSVKALDVCNAIRDGIEQ
        M+G SHDGISLDSDLDSFQKQVTQRFQDLSSASSD+ILSLSWIRKLLDAFICCQEEFKIILFGHK EIC+PPMDRMVSDYLERSVKALDVCN IRDGIEQ
Subjt:  MEGSSHDGISLDSDLDSFQKQVTQRFQDLSSASSDEILSLSWIRKLLDAFICCQEEFKIILFGHKTEICRPPMDRMVSDYLERSVKALDVCNAIRDGIEQ

Query:  LRQWQKLLEIVLSALDNCSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPALAQRNRSFGRNNGTRDRRSLGHFRSLSWSVSRSWSAARQLQAIGSNL
        LRQWQKLLEIVLSALDN S+KKTLGEGQFRRAKKALIDLAIAMLDENDANSPA+AQRNRSFGRNNGTRDRRSLGHFRSLSWSVSRSWSAARQLQAIGSNL
Subjt:  LRQWQKLLEIVLSALDNCSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPALAQRNRSFGRNNGTRDRRSLGHFRSLSWSVSRSWSAARQLQAIGSNL

Query:  AAPRANETVATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLHVHFSLPRQFAWAAPMLSLHDRIWEESRRRERRNACGLLKEIHQIDKFAHIMNELAD
        AAPRANETV TNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLHVHFSLPRQFAWAAPMLSLHDRI EESRRRERRNACGLLKEIHQIDKFAHIMNELAD
Subjt:  AAPRANETVATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLHVHFSLPRQFAWAAPMLSLHDRIWEESRRRERRNACGLLKEIHQIDKFAHIMNELAD

Query:  SAHFPLTNDREEEVRQRVQELSQICGTMKIGLDPLERQIREVFHRIVRSRTEGLDFLGRGNNPE
        +A FPLTND+EEEVRQRV+ELSQIC T+KIGLDPLERQIREVFHRIVRSRTEGLD LGRGNN E
Subjt:  SAHFPLTNDREEEVRQRVQELSQICGTMKIGLDPLERQIREVFHRIVRSRTEGLDFLGRGNNPE

TrEMBL top hitse value%identityAlignment
A0A0A0K9H9 Uncharacterized protein1.6e-18993.13Show/hide
Query:  MEGSSHDGISLDSDLDSFQKQVTQRFQDLSSASSDEILSLSWIRKLLDAFICCQEEFKIILFGHKTEICRPPMDRMVSDYLERSVKALDVCNAIRDGIEQ
        MEGSSHDGI LDSDLDSFQKQVTQRFQDLSSASSD+ILSLSWIRKLLDAFICCQEEFKIIL GHK EICRPP+DRMVSDYLERSVKALDVCN IRDGIEQ
Subjt:  MEGSSHDGISLDSDLDSFQKQVTQRFQDLSSASSDEILSLSWIRKLLDAFICCQEEFKIILFGHKTEICRPPMDRMVSDYLERSVKALDVCNAIRDGIEQ

Query:  LRQWQKLLEIVLSALDNCSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPALAQRNRSFGRNNGTRDRRSLGHFRSLSWSVSRSWSAARQLQAIGSNL
        LRQWQKLLEIVLSALDN S+KKTLGEGQFRRAKKALIDLAIAMLDENDANSPA+AQRNRSFGRNNGTRDRRSLGHFRSLSWSVSRSWSAARQLQAIGSNL
Subjt:  LRQWQKLLEIVLSALDNCSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPALAQRNRSFGRNNGTRDRRSLGHFRSLSWSVSRSWSAARQLQAIGSNL

Query:  AAPRANETVATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLHVHFSLPRQFAWAAPMLSLHDRIWEESRRRERRNACGLLKEIHQIDKFAHIMNELAD
        AAPRANE V TNGLAVPVFTMNMVLLFVTW L+AAIPCQDRGLHVHFSLPRQF+WAAPMLSLHDRI EESRRRERRNACGLLKEIHQIDKFAHIMNEL D
Subjt:  AAPRANETVATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLHVHFSLPRQFAWAAPMLSLHDRIWEESRRRERRNACGLLKEIHQIDKFAHIMNELAD

Query:  SAHFPLTNDREEEVRQRVQELSQICGTMKIGLDPLERQIREVFHRIVRSRTEGLDFLGRGNNPE
        +A FPLTN+REEEVRQRVQELSQIC T+KIGLDPLERQIREVFHRIVRSRTEGLD LG GN  E
Subjt:  SAHFPLTNDREEEVRQRVQELSQICGTMKIGLDPLERQIREVFHRIVRSRTEGLDFLGRGNNPE

A0A1S3CFU0 uncharacterized protein LOC1035004067.3e-19092.86Show/hide
Query:  MEGSSHDGISLDSDLDSFQKQVTQRFQDLSSASSDEILSLSWIRKLLDAFICCQEEFKIILFGHKTEICRPPMDRMVSDYLERSVKALDVCNAIRDGIEQ
        MEG SHDGI LDSDLDSFQKQVTQ FQDLSSASSD+ILSLSWIRKLLDAFICCQEEFKIILFGHK +ICRPPMDRMVSDYLERSVKALDVCN IRDGIEQ
Subjt:  MEGSSHDGISLDSDLDSFQKQVTQRFQDLSSASSDEILSLSWIRKLLDAFICCQEEFKIILFGHKTEICRPPMDRMVSDYLERSVKALDVCNAIRDGIEQ

Query:  LRQWQKLLEIVLSALDNCSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPALAQRNRSFGRNNGTRDRRSLGHFRSLSWSVSRSWSAARQLQAIGSNL
        LRQWQKLLEIVLSALDN SHKK LGEGQFRRAKKALIDLAIAMLDENDANSPA+AQRNRSFGRNNGTRDRRSLGHFRSLSWSVSRSWSAARQLQAIGSNL
Subjt:  LRQWQKLLEIVLSALDNCSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPALAQRNRSFGRNNGTRDRRSLGHFRSLSWSVSRSWSAARQLQAIGSNL

Query:  AAPRANETVATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLHVHFSLPRQFAWAAPMLSLHDRIWEESRRRERRNACGLLKEIHQIDKFAHIMNELAD
        AAPRANE V TNGLAVPVFTMNMVLLFVTW L+AAIPCQDRGLHVHFSLPRQF+WAAPMLSLHDRI EESRRRERRNACGLLKEIHQIDKFAHIMNELAD
Subjt:  AAPRANETVATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLHVHFSLPRQFAWAAPMLSLHDRIWEESRRRERRNACGLLKEIHQIDKFAHIMNELAD

Query:  SAHFPLTNDREEEVRQRVQELSQICGTMKIGLDPLERQIREVFHRIVRSRTEGLDFLGRGNNPE
        +A FPLTN+REEEVR+RVQELSQIC T+KIGLDPLE QIREVFHRIVRSRTEGLD LGRGN  E
Subjt:  SAHFPLTNDREEEVRQRVQELSQICGTMKIGLDPLERQIREVFHRIVRSRTEGLDFLGRGNNPE

A0A5A7UUI6 UPF0496 protein 49.6e-19092.86Show/hide
Query:  MEGSSHDGISLDSDLDSFQKQVTQRFQDLSSASSDEILSLSWIRKLLDAFICCQEEFKIILFGHKTEICRPPMDRMVSDYLERSVKALDVCNAIRDGIEQ
        MEG SHDGI LDSDLDSFQKQVTQ FQDLSSASSD+ILSLSWIRKLLDAFICCQEEFKIILFGHK +ICRPPMDRMVSDYLERSVKALDVCN IRDGIEQ
Subjt:  MEGSSHDGISLDSDLDSFQKQVTQRFQDLSSASSDEILSLSWIRKLLDAFICCQEEFKIILFGHKTEICRPPMDRMVSDYLERSVKALDVCNAIRDGIEQ

Query:  LRQWQKLLEIVLSALDNCSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPALAQRNRSFGRNNGTRDRRSLGHFRSLSWSVSRSWSAARQLQAIGSNL
        LRQWQKLLEIVLSALDN S+KK LGEGQFRRAKKALIDLAIAMLDENDANSPA+AQRNRSFGRNNGTRDRRSLGHFRSLSWSVSRSWSAARQLQAIGSNL
Subjt:  LRQWQKLLEIVLSALDNCSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPALAQRNRSFGRNNGTRDRRSLGHFRSLSWSVSRSWSAARQLQAIGSNL

Query:  AAPRANETVATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLHVHFSLPRQFAWAAPMLSLHDRIWEESRRRERRNACGLLKEIHQIDKFAHIMNELAD
        AAPRANE V TNGLAVPVFTMNMVLLFVTW L+AAIPCQDRGLHVHFSLPRQF+WAAPMLSLHDRI EESRRRERRNACGLLKEIHQIDKFAHIMNELAD
Subjt:  AAPRANETVATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLHVHFSLPRQFAWAAPMLSLHDRIWEESRRRERRNACGLLKEIHQIDKFAHIMNELAD

Query:  SAHFPLTNDREEEVRQRVQELSQICGTMKIGLDPLERQIREVFHRIVRSRTEGLDFLGRGNNPE
        +A FPLTN+REEEVR+RVQELSQIC T+KIGLDPLERQIREVFHRIVRSRTEGLD LGRGN  E
Subjt:  SAHFPLTNDREEEVRQRVQELSQICGTMKIGLDPLERQIREVFHRIVRSRTEGLDFLGRGNNPE

A0A6J1F765 uncharacterized protein LOC1114427334.2e-18590.66Show/hide
Query:  MEGSSHDGISLDSDLDSFQKQVTQRFQDLSSASSDEILSLSWIRKLLDAFICCQEEFKIILFGHKTEICRPPMDRMVSDYLERSVKALDVCNAIRDGIEQ
        ME S+HDG+S  SDLD FQKQVTQRFQDLSS SS+EILSLSWIRK+LDAFICCQEEFKIILF  K E+CR PMDRMVSDYLERSVKALDVCN IR+GI+Q
Subjt:  MEGSSHDGISLDSDLDSFQKQVTQRFQDLSSASSDEILSLSWIRKLLDAFICCQEEFKIILFGHKTEICRPPMDRMVSDYLERSVKALDVCNAIRDGIEQ

Query:  LRQWQKLLEIVLSALDNCSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPALAQRNRSFGRNNGTRDRRSLGHFRSLSWSVSRSWSAARQLQAIGSNL
        LRQWQKLLEIVLSALD+ SHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPA+AQRNRSFGR+NGTRD R LGHFRSLSWSVSRSWSAARQLQAIGSNL
Subjt:  LRQWQKLLEIVLSALDNCSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPALAQRNRSFGRNNGTRDRRSLGHFRSLSWSVSRSWSAARQLQAIGSNL

Query:  AAPRANETVATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLHVHFSLPRQFAWAAPMLSLHDRIWEESRRRERRNACGLLKEIHQIDKFAHIMNELAD
         APRANET+ATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGL+VHFSLPRQFAWAAPMLSLHDRI EESRRRERRNACGLLKEIHQIDKFAHIMNELAD
Subjt:  AAPRANETVATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLHVHFSLPRQFAWAAPMLSLHDRIWEESRRRERRNACGLLKEIHQIDKFAHIMNELAD

Query:  SAHFPLTNDREEEVRQRVQELSQICGTMKIGLDPLERQIREVFHRIVRSRTEGLDFLGRGNNPE
        +A FPLTND+EEEVRQR+QELSQIC T+KIGLDPLERQIREVFHRIVRSRTEGLDFLGRGNN E
Subjt:  SAHFPLTNDREEEVRQRVQELSQICGTMKIGLDPLERQIREVFHRIVRSRTEGLDFLGRGNNPE

A0A6J1IC10 uncharacterized protein LOC1114740471.0e-18389.84Show/hide
Query:  MEGSSHDGISLDSDLDSFQKQVTQRFQDLSSASSDEILSLSWIRKLLDAFICCQEEFKIILFGHKTEICRPPMDRMVSDYLERSVKALDVCNAIRDGIEQ
        +E S+HDG+S  SDLD FQKQVTQRFQDLSS SS+EILSLSWIRK+LDAFICCQEEFKIILF  K E+C+ PMDR+VSDYLERSVKALDVCN IR+GIEQ
Subjt:  MEGSSHDGISLDSDLDSFQKQVTQRFQDLSSASSDEILSLSWIRKLLDAFICCQEEFKIILFGHKTEICRPPMDRMVSDYLERSVKALDVCNAIRDGIEQ

Query:  LRQWQKLLEIVLSALDNCSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPALAQRNRSFGRNNGTRDRRSLGHFRSLSWSVSRSWSAARQLQAIGSNL
        LRQWQKLLEIVLSALDN SHK+TLGEGQFRRAKKALIDLAIAMLDENDANSPA+AQRNRSFG +NGTRD R LGHFRSLSWSVSRSWSAARQLQAIGSNL
Subjt:  LRQWQKLLEIVLSALDNCSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPALAQRNRSFGRNNGTRDRRSLGHFRSLSWSVSRSWSAARQLQAIGSNL

Query:  AAPRANETVATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLHVHFSLPRQFAWAAPMLSLHDRIWEESRRRERRNACGLLKEIHQIDKFAHIMNELAD
         APRANET+ATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGL+VHFSLPRQFAWAAPMLSLHDRI EESRRRERRNACGLLKEIHQIDKFAHIMNELAD
Subjt:  AAPRANETVATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLHVHFSLPRQFAWAAPMLSLHDRIWEESRRRERRNACGLLKEIHQIDKFAHIMNELAD

Query:  SAHFPLTNDREEEVRQRVQELSQICGTMKIGLDPLERQIREVFHRIVRSRTEGLDFLGRGNNPE
        +A FPLTNDREEEVRQRVQELSQIC T+ IGLDPLERQ+REVFHRIVRSRTEGLDFLGRGNN E
Subjt:  SAHFPLTNDREEEVRQRVQELSQICGTMKIGLDPLERQIREVFHRIVRSRTEGLDFLGRGNNPE

SwissProt top hitse value%identityAlignment
A2Z9A6 UPF0496 protein 46.5e-1023.44Show/hide
Query:  GSSHDGISLDSDLDSFQKQVTQRFQDLSSASSDEILSLSWIRKLLDAFICCQEEFKIILFGHKTEICRPPM---DRMVSDYLERSVKALDVCNAIRDGIE
        G +H    L   L S++  +    + L   ++ ++L+LSW+R  +D    C  E    +    T++  P     D+ V  YL  SVK LD+C A+   + 
Subjt:  GSSHDGISLDSDLDSFQKQVTQRFQDLSSASSDEILSLSWIRKLLDAFICCQEEFKIILFGHKTEICRPPM---DRMVSDYLERSVKALDVCNAIRDGIE

Query:  QLRQWQKLLEIVLSALDNCSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPALAQRNRSFGRNNGTRDRRSLGHFRSLSWSVSRSWSAARQLQAIGSN
        +L Q Q LL+  L  L   S      + Q +RA+ +L                      R +    G R  R +              S +  LQ +  N
Subjt:  QLRQWQKLLEIVLSALDNCSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPALAQRNRSFGRNNGTRDRRSLGHFRSLSWSVSRSWSAARQLQAIGSN

Query:  LAAPRANETVATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLHVHFSLPRQFAWAAPMLSLHDRIWEESRRRERRNACGLLKEIHQIDKFAHIMNELA
        L+  +   +V    L   ++ +  V +FV    VA +    + L V   +P +F W+     LH  + EE  R+    +   +KE+ +++  A  ++ LA
Subjt:  LAAPRANETVATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLHVHFSLPRQFAWAAPMLSLHDRIWEESRRRERRNACGLLKEIHQIDKFAHIMNELA

Query:  DSAHFP--------LTNDREEEVRQR--VQELSQICG
         ++             +  EEEV     VQE    CG
Subjt:  DSAHFP--------LTNDREEEVRQR--VQELSQICG

Q337C0 UPF0496 protein 42.5e-0922.91Show/hide
Query:  GSSHDGISLDSDLDSFQKQVTQRFQDLSSASSDEILSLSWIRKLLDAFICCQEEFKIILFGHKTEICRPPM---DRMVSDYLERSVKALDVCNAIRDGIE
        G +H    L   L S++  +    + L   ++ ++L+LSW+R  +D    C  E    +    T++  P     D+ V  YL  SVK LD+C A+   + 
Subjt:  GSSHDGISLDSDLDSFQKQVTQRFQDLSSASSDEILSLSWIRKLLDAFICCQEEFKIILFGHKTEICRPPM---DRMVSDYLERSVKALDVCNAIRDGIE

Query:  QLRQWQKLLEIVLSALDNCSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPALAQRNRSFGRNNGTRDRRSLGHFRSLSWSVSRSWSAARQLQAIGSN
        +L Q Q LL+  L  L   S      + Q +RA+ +L                      R +    G R               +R  S +  LQ +  N
Subjt:  QLRQWQKLLEIVLSALDNCSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPALAQRNRSFGRNNGTRDRRSLGHFRSLSWSVSRSWSAARQLQAIGSN

Query:  LAAPRANETVATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLHVHFSLPRQFAWAAPMLSLHDRIWEESRRRERRNACGLLKEIHQIDKFAHIMNELA
        L+  +   +     L   ++ +  V +FV    VA +    + L V   +P +F W+     LH  + EE  R+    +   +KE+ +++  A  ++ LA
Subjt:  LAAPRANETVATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLHVHFSLPRQFAWAAPMLSLHDRIWEESRRRERRNACGLLKEIHQIDKFAHIMNELA

Query:  DSAHFPLTNDREEEVRQRVQELS
               T+  EEE       +S
Subjt:  DSAHFPLTNDREEEVRQRVQELS

Q9CAK4 Protein ROH11.4e-7339.58Show/hide
Query:  DLDSFQKQVTQRFQDL----------------SSASSDEILSLSWIRKLLDAFICCQEEFK-IILFGH-KTEICRPPMDRMVSDYLERSVKALDVCNAIR
        DL+ FQK +  RF +L                S A++++I+S++W+RKL+D F+CC+ EFK I+L G   T+I +PP DR+V + L+RS+KALD+C A+ 
Subjt:  DLDSFQKQVTQRFQDL----------------SSASSDEILSLSWIRKLLDAFICCQEEFK-IILFGH-KTEICRPPMDRMVSDYLERSVKALDVCNAIR

Query:  DGIEQLRQWQKLLEIVLSALDNCSHKKTLGEGQFRRAKKALIDLAIAMLDENDAN----------SPALAQRNRSFGRNNG-----TRDRRSLGHFRSLS
        +GI+ +R +Q+L EI ++AL+    ++ LG+G  RRAK+AL +L +A+  E+  N               +R+ SFGR +G     ++   ++G  +S S
Subjt:  DGIEQLRQWQKLLEIVLSALDNCSHKKTLGEGQFRRAKKALIDLAIAMLDENDAN----------SPALAQRNRSFGRNNG-----TRDRRSLGHFRSLS

Query:  WSVSRSWSAARQLQAIGSNLAAPRANETVATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDR-GLHVHFSL-PRQFAWAAPMLSLHDRIWEESRRRERRNA
        W+V R+WSAA+Q+ A+ +NL  PR NE     GL  P+F M+ V++FV W L AA+PCQ+R GL  H  + P+   WA  ++ +H++I +E +++E++ +
Subjt:  WSVSRSWSAARQLQAIGSNLAAPRANETVATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDR-GLHVHFSL-PRQFAWAAPMLSLHDRIWEESRRRERRNA

Query:  CGLLKEIHQIDKFAHIMNELADSAHFPLTNDREEEVRQRVQELSQICGTMKIGLDPLERQIREVFHRIVRSRTEGLDFL
         GL++E+ +++K  H + E AD  H+P   D  E    +V E+++IC  M+  L PL++QIREVFHRIVRSR E L+ L
Subjt:  CGLLKEIHQIDKFAHIMNELADSAHFPLTNDREEEVRQRVQELSQICGTMKIGLDPLERQIREVFHRIVRSRTEGLDFL

Arabidopsis top hitse value%identityAlignment
AT1G18740.1 Protein of unknown function (DUF793)2.9e-13065.92Show/hide
Query:  SSHDGISLDSDLDSFQKQVTQRFQDLSSASSDEILSLSWIRKLLDAFICCQEEFKIILFGHKTEICRPPMDRMVSDYLERSVKALDVCNAIRDGIEQLRQ
        S H+  +++ +LDSFQ+QV ++F DL +ASS+++LSL WI KLLD+F+CCQEEF+ I+F H+++I + PMDR++SDY ERS+KALDVCNAIRDGIEQ+RQ
Subjt:  SSHDGISLDSDLDSFQKQVTQRFQDLSSASSDEILSLSWIRKLLDAFICCQEEFKIILFGHKTEICRPPMDRMVSDYLERSVKALDVCNAIRDGIEQLRQ

Query:  WQKLLEIVLSALDNCSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPA-LAQRNRSFGRNNGTRDRRSLGHFRSLSWSVSRSWSAARQLQAIGSNLAA
        W+KL +IV+SALD  SH + +GEGQ RRAKKALIDLAI MLDE D  S   LA RNRSFGR   +   RS+GHFRSLSWSVSRSWSA++QLQA+ SNLA 
Subjt:  WQKLLEIVLSALDNCSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPA-LAQRNRSFGRNNGTRDRRSLGHFRSLSWSVSRSWSAARQLQAIGSNLAA

Query:  PRANETVATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLHVHFSLPRQFAWAAPMLSLHDRIWEESRRRERRNACGLLKEIHQIDKFAHIMNELADSA
        PR N+ VA+NGLAVPV+TM  VLLFV W LVAAIPCQDRGL V+F +PR F WAAP++SLHD+I EES+RR+R+N CGLLKEI +I+K + +MNEL DS 
Subjt:  PRANETVATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLHVHFSLPRQFAWAAPMLSLHDRIWEESRRRERRNACGLLKEIHQIDKFAHIMNELADSA

Query:  HFPLTNDREEEVRQRVQELSQICGTMKIGLDPLERQIREVFHRIVRSRTEGLDFL
        HFPL +D+E EV+QRV EL Q+   ++ GLDP ER++REVFHRIVRSRTE LD L
Subjt:  HFPLTNDREEEVRQRVQELSQICGTMKIGLDPLERQIREVFHRIVRSRTEGLDFL

AT1G43630.1 Protein of unknown function (DUF793)3.6e-12058.52Show/hide
Query:  SSHDGISLDSDLDSFQKQVTQRFQDL-SSASSDEILSLSWIRKLLDAFICCQEEFKIILFGHKTEICRPPMDRMVSDYLERSVKALDVCNAIRDGIEQLR
        S  + ++++ +LDSFQ+QV ++F DL +SA   EILSL WI KLLD+F+CCQE+F++I+F HK ++ + PMDR++ +Y ERSVKALDVCNAIRDGIEQ+R
Subjt:  SSHDGISLDSDLDSFQKQVTQRFQDL-SSASSDEILSLSWIRKLLDAFICCQEEFKIILFGHKTEICRPPMDRMVSDYLERSVKALDVCNAIRDGIEQLR

Query:  QWQKLLEIVLSALDNCSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPALAQRNRSFGRNNGTRDRRSLGHFRSLSWSVSRSWSAARQLQAIGSNLAA
        QWQKL+EIV+SALD  ++++ LGEG+  RAKKALIDLAI MLDE D+++     RNRSF RN      + +G+ RSLSWSVSRSWSA+RQLQ IG+NLA 
Subjt:  QWQKLLEIVLSALDNCSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPALAQRNRSFGRNNGTRDRRSLGHFRSLSWSVSRSWSAARQLQAIGSNLAA

Query:  PRANETVATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLHVHFSLPRQFAWAAPMLSLHDRIWEESRRRE-RRNACGLLKEIHQIDKFAHIMNELADS
        PRA++ +ATNGLA+ V+TM  +LLFVTW LVAAIPCQDRGLHVHF  PR F WA P++SLHD+I +ES++R+ ++  CGLL+EI+QI++ + ++++L DS
Subjt:  PRANETVATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLHVHFSLPRQFAWAAPMLSLHDRIWEESRRRE-RRNACGLLKEIHQIDKFAHIMNELADS

Query:  AHFPLTNDR-EEEVRQRVQELSQICGTMKIGLDPLERQIREVFHRIVRSRTEGLDFLGRGNNPE
         +F LT+++   EV++RVQEL  +C  +K GLDP +R++R+VFH+IVR+RTE LD LG+  N E
Subjt:  AHFPLTNDR-EEEVRQRVQELSQICGTMKIGLDPLERQIREVFHRIVRSRTEGLDFLGRGNNPE

AT1G63930.1 from the Czech 'roh' meaning 'corner'1.0e-7439.58Show/hide
Query:  DLDSFQKQVTQRFQDL----------------SSASSDEILSLSWIRKLLDAFICCQEEFK-IILFGH-KTEICRPPMDRMVSDYLERSVKALDVCNAIR
        DL+ FQK +  RF +L                S A++++I+S++W+RKL+D F+CC+ EFK I+L G   T+I +PP DR+V + L+RS+KALD+C A+ 
Subjt:  DLDSFQKQVTQRFQDL----------------SSASSDEILSLSWIRKLLDAFICCQEEFK-IILFGH-KTEICRPPMDRMVSDYLERSVKALDVCNAIR

Query:  DGIEQLRQWQKLLEIVLSALDNCSHKKTLGEGQFRRAKKALIDLAIAMLDENDAN----------SPALAQRNRSFGRNNG-----TRDRRSLGHFRSLS
        +GI+ +R +Q+L EI ++AL+    ++ LG+G  RRAK+AL +L +A+  E+  N               +R+ SFGR +G     ++   ++G  +S S
Subjt:  DGIEQLRQWQKLLEIVLSALDNCSHKKTLGEGQFRRAKKALIDLAIAMLDENDAN----------SPALAQRNRSFGRNNG-----TRDRRSLGHFRSLS

Query:  WSVSRSWSAARQLQAIGSNLAAPRANETVATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDR-GLHVHFSL-PRQFAWAAPMLSLHDRIWEESRRRERRNA
        W+V R+WSAA+Q+ A+ +NL  PR NE     GL  P+F M+ V++FV W L AA+PCQ+R GL  H  + P+   WA  ++ +H++I +E +++E++ +
Subjt:  WSVSRSWSAARQLQAIGSNLAAPRANETVATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDR-GLHVHFSL-PRQFAWAAPMLSLHDRIWEESRRRERRNA

Query:  CGLLKEIHQIDKFAHIMNELADSAHFPLTNDREEEVRQRVQELSQICGTMKIGLDPLERQIREVFHRIVRSRTEGLDFL
         GL++E+ +++K  H + E AD  H+P   D  E    +V E+++IC  M+  L PL++QIREVFHRIVRSR E L+ L
Subjt:  CGLLKEIHQIDKFAHIMNELADSAHFPLTNDREEEVRQRVQELSQICGTMKIGLDPLERQIREVFHRIVRSRTEGLDFL

AT1G74450.1 Protein of unknown function (DUF793)1.7e-13062.22Show/hide
Query:  LDSDLDSFQKQVTQRFQDLSSASSDEILSLSWIRKLLDAFICCQEEFKIILFGHKTEICRPPMDRMVSDYLERSVKALDVCNAIRDGIEQLRQWQKLLEI
        ++++L SFQ++V +RF DL+++S +++LSL W+ KLLD+F+ CQEEF+ I+  H++ I +PPMDR+VSDY ERSVKALDVCNAIRDG+EQ+RQWQKL+EI
Subjt:  LDSDLDSFQKQVTQRFQDLSSASSDEILSLSWIRKLLDAFICCQEEFKIILFGHKTEICRPPMDRMVSDYLERSVKALDVCNAIRDGIEQLRQWQKLLEI

Query:  VLSALDN----CSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPALA--QRNRSFGRNNGTRDRRSLGHFRSLSWSVSRSWSAARQLQAIGSNLAAPR
        V+ A +N     S K+ LGEGQFRRA+K LI+LAI MLDE D++S +++   RNRSFGRN      R++GHFRSLSWSVSRSWSA++QLQAIG+NLA PR
Subjt:  VLSALDN----CSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPALA--QRNRSFGRNNGTRDRRSLGHFRSLSWSVSRSWSAARQLQAIGSNLAAPR

Query:  ANETVATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLHVHFSLPRQFAWAAPMLSLHDRIWEESRRRERRNACGLLKEIHQIDKFAHIMNELADSAHF
        A++  ATNGL VPV+TM  VLLFV W LVAAIPCQDRGL VHF++PR + W   ++SLHDRI EES++RER+N CGLLKEIHQ +K + +MNEL DS  F
Subjt:  ANETVATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLHVHFSLPRQFAWAAPMLSLHDRIWEESRRRERRNACGLLKEIHQIDKFAHIMNELADSAHF

Query:  PLTNDREEEVRQRVQELSQICGTMKIGLDPLERQIREVFHRIVRSRTEGLDFLGRGNNPE
        PL+ ++E EVR+RV+EL ++   +K GLDP ER++REVFHRIVRSRTEGLD +G+ +  E
Subjt:  PLTNDREEEVRQRVQELSQICGTMKIGLDPLERQIREVFHRIVRSRTEGLDFLGRGNNPE

AT4G11300.1 Protein of unknown function (DUF793)3.2e-6038.48Show/hide
Query:  DLDSFQKQVTQRFQDL----SSASSDEILSLSWIRKLLDAFICCQEEFKIILFGHKTEICRPPMDRMVSDYLERSVKALDVCNAIRDGIEQLRQWQKLLE
        +L+ FQK V +RF +L     S  S  ILS+ W+RKLLD F+  + EF  +L  + ++I +PP+D++V + L+R VKALD+C A+ +G++ +RQ Q+  E
Subjt:  DLDSFQKQVTQRFQDL----SSASSDEILSLSWIRKLLDAFICCQEEFKIILFGHKTEICRPPMDRMVSDYLERSVKALDVCNAIRDGIEQLRQWQKLLE

Query:  IVLSALDNCSHKKTLGEGQFRRAKKALIDLAIAM-LDENDANSPALAQRN------RSFGRNNGTRDRRSLGHFRSLSWSVSRSWSAARQLQAIGSNLAA
        I ++AL     +  L +G  RRAK+AL  L  A+  D+N  +S   + R        SFGR +G     S G     +  VS++WSAA+Q+QA+ +NL A
Subjt:  IVLSALDNCSHKKTLGEGQFRRAKKALIDLAIAM-LDENDANSPALAQRN------RSFGRNNGTRDRRSLGHFRSLSWSVSRSWSAARQLQAIGSNLAA

Query:  PRANETVATNGLAVPVFTMNMVLLFVTWTLVAAIPCQ-DRGLHVHFSLPRQFAWAAPMLSLHDRIWEESRRRERRNACGLLKEIHQIDKFAHIMNELADS
        PR  E       A P++ M+ V++ V WTLV A+PCQ   GL VH  LP+   WA   +S+ +R+ EE +R+E R   GL++E+ ++++    + E ++ 
Subjt:  PRANETVATNGLAVPVFTMNMVLLFVTWTLVAAIPCQ-DRGLHVHFSLPRQFAWAAPMLSLHDRIWEESRRRERRNACGLLKEIHQIDKFAHIMNELADS

Query:  AHFPLTNDREEEVRQRVQELSQICGTMKIGLDPLERQIREVFHRIVRSRTEGLDFL
          F    + EE+V   V E+ +IC  M+ GL+ L+R++REVFHR+V+SR+E L+ +
Subjt:  AHFPLTNDREEEVRQRVQELSQICGTMKIGLDPLERQIREVFHRIVRSRTEGLDFL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGGCAGCTCCCATGACGGTATCTCCTTGGATTCCGACCTCGATTCATTTCAGAAGCAGGTCACTCAACGGTTCCAGGACCTGTCATCGGCTTCTTCCGATGAGAT
TTTGTCGCTCTCGTGGATTAGGAAGCTTCTGGATGCTTTTATTTGCTGCCAGGAGGAGTTTAAGATCATTTTGTTTGGGCATAAGACGGAAATTTGTAGACCGCCGATGG
ATAGGATGGTTTCGGATTACTTGGAGAGGAGCGTGAAGGCGCTTGATGTTTGTAATGCGATTAGAGATGGGATTGAGCAGCTGAGGCAGTGGCAGAAGTTGTTGGAGATT
GTGCTTAGTGCTTTGGATAATTGTAGTCACAAGAAGACTCTTGGTGAGGGCCAATTCCGCCGCGCGAAGAAGGCTCTGATAGATTTGGCGATTGCAATGCTCGATGAAAA
TGATGCCAATTCCCCCGCTCTTGCACAGAGAAACCGTTCTTTCGGACGCAATAATGGTACCAGAGACCGCAGGTCTTTGGGGCATTTCCGTTCGCTTTCATGGAGTGTTT
CGCGTTCCTGGTCGGCGGCTAGGCAGCTCCAGGCAATTGGGAGCAATTTGGCTGCTCCGAGGGCGAATGAGACTGTGGCTACCAATGGTCTTGCGGTCCCTGTGTTTACG
ATGAACATGGTATTGCTGTTTGTAACATGGACTCTGGTGGCAGCGATTCCGTGCCAGGACCGCGGCTTACATGTTCACTTTTCGCTGCCCCGGCAGTTCGCTTGGGCGGC
GCCAATGCTTTCTCTTCACGATCGAATTTGGGAAGAATCGAGAAGGCGGGAAAGAAGAAATGCTTGTGGGTTGTTGAAGGAGATTCATCAGATCGATAAATTTGCGCATA
TTATGAATGAATTGGCTGATTCAGCACACTTTCCCTTGACGAATGATAGGGAAGAAGAGGTGAGGCAAAGAGTGCAGGAGCTATCTCAGATTTGTGGAACTATGAAGATT
GGCTTGGATCCTCTGGAACGGCAGATTAGAGAGGTCTTCCATCGAATTGTTCGCTCAAGAACTGAGGGGCTTGATTTTTTAGGACGGGGAAATAACCCTGAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGGGCAGCTCCCATGACGGTATCTCCTTGGATTCCGACCTCGATTCATTTCAGAAGCAGGTCACTCAACGGTTCCAGGACCTGTCATCGGCTTCTTCCGATGAGAT
TTTGTCGCTCTCGTGGATTAGGAAGCTTCTGGATGCTTTTATTTGCTGCCAGGAGGAGTTTAAGATCATTTTGTTTGGGCATAAGACGGAAATTTGTAGACCGCCGATGG
ATAGGATGGTTTCGGATTACTTGGAGAGGAGCGTGAAGGCGCTTGATGTTTGTAATGCGATTAGAGATGGGATTGAGCAGCTGAGGCAGTGGCAGAAGTTGTTGGAGATT
GTGCTTAGTGCTTTGGATAATTGTAGTCACAAGAAGACTCTTGGTGAGGGCCAATTCCGCCGCGCGAAGAAGGCTCTGATAGATTTGGCGATTGCAATGCTCGATGAAAA
TGATGCCAATTCCCCCGCTCTTGCACAGAGAAACCGTTCTTTCGGACGCAATAATGGTACCAGAGACCGCAGGTCTTTGGGGCATTTCCGTTCGCTTTCATGGAGTGTTT
CGCGTTCCTGGTCGGCGGCTAGGCAGCTCCAGGCAATTGGGAGCAATTTGGCTGCTCCGAGGGCGAATGAGACTGTGGCTACCAATGGTCTTGCGGTCCCTGTGTTTACG
ATGAACATGGTATTGCTGTTTGTAACATGGACTCTGGTGGCAGCGATTCCGTGCCAGGACCGCGGCTTACATGTTCACTTTTCGCTGCCCCGGCAGTTCGCTTGGGCGGC
GCCAATGCTTTCTCTTCACGATCGAATTTGGGAAGAATCGAGAAGGCGGGAAAGAAGAAATGCTTGTGGGTTGTTGAAGGAGATTCATCAGATCGATAAATTTGCGCATA
TTATGAATGAATTGGCTGATTCAGCACACTTTCCCTTGACGAATGATAGGGAAGAAGAGGTGAGGCAAAGAGTGCAGGAGCTATCTCAGATTTGTGGAACTATGAAGATT
GGCTTGGATCCTCTGGAACGGCAGATTAGAGAGGTCTTCCATCGAATTGTTCGCTCAAGAACTGAGGGGCTTGATTTTTTAGGACGGGGAAATAACCCTGAGTAG
Protein sequenceShow/hide protein sequence
MEGSSHDGISLDSDLDSFQKQVTQRFQDLSSASSDEILSLSWIRKLLDAFICCQEEFKIILFGHKTEICRPPMDRMVSDYLERSVKALDVCNAIRDGIEQLRQWQKLLEI
VLSALDNCSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPALAQRNRSFGRNNGTRDRRSLGHFRSLSWSVSRSWSAARQLQAIGSNLAAPRANETVATNGLAVPVFT
MNMVLLFVTWTLVAAIPCQDRGLHVHFSLPRQFAWAAPMLSLHDRIWEESRRRERRNACGLLKEIHQIDKFAHIMNELADSAHFPLTNDREEEVRQRVQELSQICGTMKI
GLDPLERQIREVFHRIVRSRTEGLDFLGRGNNPE