; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg23308 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg23308
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionUPF0496 protein 4
Genome locationCarg_Chr09:6699777..6701774
RNA-Seq ExpressionCarg23308
SyntenyCarg23308
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR008511 - Protein BYPASS-related


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6592201.1 Protein ROH1, partial [Cucurbita argyrosperma subsp. sororia]1.9e-223100Show/hide
Query:  MPATDFQSSSLPFTNIGRSIFSLRRDQVHSMESSAHDGVSFVSDLDFFQKQVTQRFQDLSSVSSEEILSLSWIRKVLDAFICCQEEFKIILFARKAEMCR
        MPATDFQSSSLPFTNIGRSIFSLRRDQVHSMESSAHDGVSFVSDLDFFQKQVTQRFQDLSSVSSEEILSLSWIRKVLDAFICCQEEFKIILFARKAEMCR
Subjt:  MPATDFQSSSLPFTNIGRSIFSLRRDQVHSMESSAHDGVSFVSDLDFFQKQVTQRFQDLSSVSSEEILSLSWIRKVLDAFICCQEEFKIILFARKAEMCR

Query:  LPMDRMVSDYLERSVKALDVCNVIREGIDQLRQWQKLLEIVLSALDDSSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPAIAQRNRSFGRSNGTRDN
        LPMDRMVSDYLERSVKALDVCNVIREGIDQLRQWQKLLEIVLSALDDSSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPAIAQRNRSFGRSNGTRDN
Subjt:  LPMDRMVSDYLERSVKALDVCNVIREGIDQLRQWQKLLEIVLSALDDSSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPAIAQRNRSFGRSNGTRDN

Query:  RFLGHFRSLSWSVSRSWSAARQLQAIGSNLTAPRANETMATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLNVHFSLPRQFAWAAPMLSLHDRILEES
        RFLGHFRSLSWSVSRSWSAARQLQAIGSNLTAPRANETMATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLNVHFSLPRQFAWAAPMLSLHDRILEES
Subjt:  RFLGHFRSLSWSVSRSWSAARQLQAIGSNLTAPRANETMATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLNVHFSLPRQFAWAAPMLSLHDRILEES

Query:  RRRERRNACGLLKEIHQIDKFAHIMNELADTAQFPLTNDKEEEVRQRMQELSQICETLKIGLDPLERQIREVFHRINIFIVGFQSCEKFEQWTSEINER
        RRRERRNACGLLKEIHQIDKFAHIMNELADTAQFPLTNDKEEEVRQRMQELSQICETLKIGLDPLERQIREVFHRINIFIVGFQSCEKFEQWTSEINER
Subjt:  RRRERRNACGLLKEIHQIDKFAHIMNELADTAQFPLTNDKEEEVRQRMQELSQICETLKIGLDPLERQIREVFHRINIFIVGFQSCEKFEQWTSEINER

KAG7025058.1 hypothetical protein SDJN02_13881, partial [Cucurbita argyrosperma subsp. argyrosperma]7.8e-238100Show/hide
Query:  MPATDFQSSSLPFTNIGRSIFSLRRDQVHSMESSAHDGVSFVSDLDFFQKQVTQRFQDLSSVSSEEILSLSWIRKVLDAFICCQEEFKIILFARKAEMCR
        MPATDFQSSSLPFTNIGRSIFSLRRDQVHSMESSAHDGVSFVSDLDFFQKQVTQRFQDLSSVSSEEILSLSWIRKVLDAFICCQEEFKIILFARKAEMCR
Subjt:  MPATDFQSSSLPFTNIGRSIFSLRRDQVHSMESSAHDGVSFVSDLDFFQKQVTQRFQDLSSVSSEEILSLSWIRKVLDAFICCQEEFKIILFARKAEMCR

Query:  LPMDRMVSDYLERSVKALDVCNVIREGIDQLRQWQKLLEIVLSALDDSSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPAIAQRNRSFGRSNGTRDN
        LPMDRMVSDYLERSVKALDVCNVIREGIDQLRQWQKLLEIVLSALDDSSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPAIAQRNRSFGRSNGTRDN
Subjt:  LPMDRMVSDYLERSVKALDVCNVIREGIDQLRQWQKLLEIVLSALDDSSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPAIAQRNRSFGRSNGTRDN

Query:  RFLGHFRSLSWSVSRSWSAARQLQAIGSNLTAPRANETMATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLNVHFSLPRQFAWAAPMLSLHDRILEES
        RFLGHFRSLSWSVSRSWSAARQLQAIGSNLTAPRANETMATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLNVHFSLPRQFAWAAPMLSLHDRILEES
Subjt:  RFLGHFRSLSWSVSRSWSAARQLQAIGSNLTAPRANETMATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLNVHFSLPRQFAWAAPMLSLHDRILEES

Query:  RRRERRNACGLLKEIHQIDKFAHIMNELADTAQFPLTNDKEEEVRQRMQELSQICETLKIGLDPLERQIREVFHRINIFIVGFQSCEKFEQWTSEINERL
        RRRERRNACGLLKEIHQIDKFAHIMNELADTAQFPLTNDKEEEVRQRMQELSQICETLKIGLDPLERQIREVFHRINIFIVGFQSCEKFEQWTSEINERL
Subjt:  RRRERRNACGLLKEIHQIDKFAHIMNELADTAQFPLTNDKEEEVRQRMQELSQICETLKIGLDPLERQIREVFHRINIFIVGFQSCEKFEQWTSEINERL

Query:  FSVIFIAWESVQNYTLNFPASAN
        FSVIFIAWESVQNYTLNFPASAN
Subjt:  FSVIFIAWESVQNYTLNFPASAN

XP_022936002.1 uncharacterized protein LOC111442733 [Cucurbita moschata]9.9e-209100Show/hide
Query:  MPATDFQSSSLPFTNIGRSIFSLRRDQVHSMESSAHDGVSFVSDLDFFQKQVTQRFQDLSSVSSEEILSLSWIRKVLDAFICCQEEFKIILFARKAEMCR
        MPATDFQSSSLPFTNIGRSIFSLRRDQVHSMESSAHDGVSFVSDLDFFQKQVTQRFQDLSSVSSEEILSLSWIRKVLDAFICCQEEFKIILFARKAEMCR
Subjt:  MPATDFQSSSLPFTNIGRSIFSLRRDQVHSMESSAHDGVSFVSDLDFFQKQVTQRFQDLSSVSSEEILSLSWIRKVLDAFICCQEEFKIILFARKAEMCR

Query:  LPMDRMVSDYLERSVKALDVCNVIREGIDQLRQWQKLLEIVLSALDDSSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPAIAQRNRSFGRSNGTRDN
        LPMDRMVSDYLERSVKALDVCNVIREGIDQLRQWQKLLEIVLSALDDSSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPAIAQRNRSFGRSNGTRDN
Subjt:  LPMDRMVSDYLERSVKALDVCNVIREGIDQLRQWQKLLEIVLSALDDSSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPAIAQRNRSFGRSNGTRDN

Query:  RFLGHFRSLSWSVSRSWSAARQLQAIGSNLTAPRANETMATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLNVHFSLPRQFAWAAPMLSLHDRILEES
        RFLGHFRSLSWSVSRSWSAARQLQAIGSNLTAPRANETMATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLNVHFSLPRQFAWAAPMLSLHDRILEES
Subjt:  RFLGHFRSLSWSVSRSWSAARQLQAIGSNLTAPRANETMATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLNVHFSLPRQFAWAAPMLSLHDRILEES

Query:  RRRERRNACGLLKEIHQIDKFAHIMNELADTAQFPLTNDKEEEVRQRMQELSQICETLKIGLDPLERQIREVFHRI
        RRRERRNACGLLKEIHQIDKFAHIMNELADTAQFPLTNDKEEEVRQRMQELSQICETLKIGLDPLERQIREVFHRI
Subjt:  RRRERRNACGLLKEIHQIDKFAHIMNELADTAQFPLTNDKEEEVRQRMQELSQICETLKIGLDPLERQIREVFHRI

XP_022975077.1 uncharacterized protein LOC111474047 [Cucurbita maxima]7.4e-20497.07Show/hide
Query:  MPATDFQSSSLPFTNIGRSIFSLRRDQVHSMESSAHDGVSFVSDLDFFQKQVTQRFQDLSSVSSEEILSLSWIRKVLDAFICCQEEFKIILFARKAEMCR
        MPATDFQSSSLPFTNIGRSIFSLRRDQVHS+ESSAHDGVSFVSDLDFFQKQVTQRFQDLSSVSSEEILSLSWIRKVLDAFICCQEEFKIILFARKAEMC+
Subjt:  MPATDFQSSSLPFTNIGRSIFSLRRDQVHSMESSAHDGVSFVSDLDFFQKQVTQRFQDLSSVSSEEILSLSWIRKVLDAFICCQEEFKIILFARKAEMCR

Query:  LPMDRMVSDYLERSVKALDVCNVIREGIDQLRQWQKLLEIVLSALDDSSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPAIAQRNRSFGRSNGTRDN
        LPMDR+VSDYLERSVKALDVCNVIREGI+QLRQWQKLLEIVLSALD+SSHK+TLGEGQFRRAKKALIDLAIAMLDENDANSPAIAQRNRSFG SNGTRDN
Subjt:  LPMDRMVSDYLERSVKALDVCNVIREGIDQLRQWQKLLEIVLSALDDSSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPAIAQRNRSFGRSNGTRDN

Query:  RFLGHFRSLSWSVSRSWSAARQLQAIGSNLTAPRANETMATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLNVHFSLPRQFAWAAPMLSLHDRILEES
        RFLGHFRSLSWSVSRSWSAARQLQAIGSNLTAPRANETMATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLNVHFSLPRQFAWAAPMLSLHDRILEES
Subjt:  RFLGHFRSLSWSVSRSWSAARQLQAIGSNLTAPRANETMATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLNVHFSLPRQFAWAAPMLSLHDRILEES

Query:  RRRERRNACGLLKEIHQIDKFAHIMNELADTAQFPLTNDKEEEVRQRMQELSQICETLKIGLDPLERQIREVFHRI
        RRRERRNACGLLKEIHQIDKFAHIMNELADTAQFPLTND+EEEVRQR+QELSQICETL IGLDPLERQ+REVFHRI
Subjt:  RRRERRNACGLLKEIHQIDKFAHIMNELADTAQFPLTNDKEEEVRQRMQELSQICETLKIGLDPLERQIREVFHRI

XP_023536236.1 uncharacterized protein LOC111797467 [Cucurbita pepo subsp. pepo]1.4e-20297.87Show/hide
Query:  MPATDFQSSSLPFTNIGRSIFSLRRDQVHSMESSAHDGVSFVSDLDFFQKQVTQRFQDLSSVSSEEILSLSWIRKVLDAFICCQEEFKIILFARKAEMCR
        MPATDFQSSSLPFTNIGRSIF+LRRDQVHSMESSAHDGVSFVSDLDFFQKQVTQRFQDLSSVSSEEILSLSWIRKVLDAFICCQEEFKIILFARKAEMCR
Subjt:  MPATDFQSSSLPFTNIGRSIFSLRRDQVHSMESSAHDGVSFVSDLDFFQKQVTQRFQDLSSVSSEEILSLSWIRKVLDAFICCQEEFKIILFARKAEMCR

Query:  LPMDRMVSDYLERSVKALDVCNVIREGIDQLRQWQKLLEIVLSALDDSSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPAIAQRNRSFGRSNGTRDN
         PMDRMVSDYLERSVKALDVCNVIREGI+QLRQWQKLLEIVLSALDDSSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPAIAQRNRSFG SNGTRDN
Subjt:  LPMDRMVSDYLERSVKALDVCNVIREGIDQLRQWQKLLEIVLSALDDSSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPAIAQRNRSFGRSNGTRDN

Query:  RFLGHFRSLSWSVSRSWSAARQLQAIGSNLTAPRANETMATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLNVHFSLPRQFAWAAPMLSLHDRILEES
        R   HFRSLSWSVSRSWSAARQLQAIGSNLTAPRANETMATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLNVHFSLPRQFAWAAPMLSLHDRILEES
Subjt:  RFLGHFRSLSWSVSRSWSAARQLQAIGSNLTAPRANETMATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLNVHFSLPRQFAWAAPMLSLHDRILEES

Query:  RRRERRNACGLLKEIHQIDKFAHIMNELADTAQFPLTNDKEEEVRQRMQELSQICETLKIGLDPLERQIREVFHRI
        RRRERRNACGLLKEIHQIDKFAHIMNELADTAQFPLTND+EEEVRQRMQELSQICETLKIGLDPLERQIREVFHRI
Subjt:  RRRERRNACGLLKEIHQIDKFAHIMNELADTAQFPLTNDKEEEVRQRMQELSQICETLKIGLDPLERQIREVFHRI

TrEMBL top hitse value%identityAlignment
A0A0A0K9H9 Uncharacterized protein4.7e-18889.63Show/hide
Query:  MPATDFQSSSLPFTNIGRSIFSLRRDQVHSMESSAHDGVSFVSDLDFFQKQVTQRFQDLSSVSSEEILSLSWIRKVLDAFICCQEEFKIILFARKAEMCR
        MPATDFQ SS P TNIGRSIFSLRRDQVHSME S+HDG+   SDLD FQKQVTQRFQDLSS SS++ILSLSWIRK+LDAFICCQEEFKIIL   KAE+CR
Subjt:  MPATDFQSSSLPFTNIGRSIFSLRRDQVHSMESSAHDGVSFVSDLDFFQKQVTQRFQDLSSVSSEEILSLSWIRKVLDAFICCQEEFKIILFARKAEMCR

Query:  LPMDRMVSDYLERSVKALDVCNVIREGIDQLRQWQKLLEIVLSALDDSSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPAIAQRNRSFGRSNGTRDN
         P+DRMVSDYLERSVKALDVCNVIR+GI+QLRQWQKLLEIVLSALD+SS+KKTLGEGQFRRAKKALIDLAIAMLDENDANSPAIAQRNRSFGR+NGTRD 
Subjt:  LPMDRMVSDYLERSVKALDVCNVIREGIDQLRQWQKLLEIVLSALDDSSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPAIAQRNRSFGRSNGTRDN

Query:  RFLGHFRSLSWSVSRSWSAARQLQAIGSNLTAPRANETMATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLNVHFSLPRQFAWAAPMLSLHDRILEES
        R LGHFRSLSWSVSRSWSAARQLQAIGSNL APRANE + TNGLAVPVFTMNMVLLFVTW L+AAIPCQDRGL+VHFSLPRQF+WAAPMLSLHDRILEES
Subjt:  RFLGHFRSLSWSVSRSWSAARQLQAIGSNLTAPRANETMATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLNVHFSLPRQFAWAAPMLSLHDRILEES

Query:  RRRERRNACGLLKEIHQIDKFAHIMNELADTAQFPLTNDKEEEVRQRMQELSQICETLKIGLDPLERQIREVFHRI
        RRRERRNACGLLKEIHQIDKFAHIMNEL DTAQFPLTN++EEEVRQR+QELSQICETLKIGLDPLERQIREVFHRI
Subjt:  RRRERRNACGLLKEIHQIDKFAHIMNELADTAQFPLTNDKEEEVRQRMQELSQICETLKIGLDPLERQIREVFHRI

A0A1S3CFU0 uncharacterized protein LOC1035004061.4e-18789.1Show/hide
Query:  MPATDFQSSSLPFTNIGRSIFSLRRDQVHSMESSAHDGVSFVSDLDFFQKQVTQRFQDLSSVSSEEILSLSWIRKVLDAFICCQEEFKIILFARKAEMCR
        MPATDFQ SS P TNIGRSIFSLRRDQVHSME  +HDG+   SDLD FQKQVTQ FQDLSS SS++ILSLSWIRK+LDAFICCQEEFKIILF  KA++CR
Subjt:  MPATDFQSSSLPFTNIGRSIFSLRRDQVHSMESSAHDGVSFVSDLDFFQKQVTQRFQDLSSVSSEEILSLSWIRKVLDAFICCQEEFKIILFARKAEMCR

Query:  LPMDRMVSDYLERSVKALDVCNVIREGIDQLRQWQKLLEIVLSALDDSSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPAIAQRNRSFGRSNGTRDN
         PMDRMVSDYLERSVKALDVCNVIR+GI+QLRQWQKLLEIVLSALD+SSHKK LGEGQFRRAKKALIDLAIAMLDENDANSPAIAQRNRSFGR+NGTRD 
Subjt:  LPMDRMVSDYLERSVKALDVCNVIREGIDQLRQWQKLLEIVLSALDDSSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPAIAQRNRSFGRSNGTRDN

Query:  RFLGHFRSLSWSVSRSWSAARQLQAIGSNLTAPRANETMATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLNVHFSLPRQFAWAAPMLSLHDRILEES
        R LGHFRSLSWSVSRSWSAARQLQAIGSNL APRANE + TNGLAVPVFTMNMVLLFVTW L+AAIPCQDRGL+VHFSLPRQF+WAAPMLSLHDRILEES
Subjt:  RFLGHFRSLSWSVSRSWSAARQLQAIGSNLTAPRANETMATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLNVHFSLPRQFAWAAPMLSLHDRILEES

Query:  RRRERRNACGLLKEIHQIDKFAHIMNELADTAQFPLTNDKEEEVRQRMQELSQICETLKIGLDPLERQIREVFHRI
        RRRERRNACGLLKEIHQIDKFAHIMNELADTAQFPLTN++EEEVR+R+QELSQICETLKIGLDPLE QIREVFHRI
Subjt:  RRRERRNACGLLKEIHQIDKFAHIMNELADTAQFPLTNDKEEEVRQRMQELSQICETLKIGLDPLERQIREVFHRI

A0A5A7UUI6 UPF0496 protein 41.8e-18789.1Show/hide
Query:  MPATDFQSSSLPFTNIGRSIFSLRRDQVHSMESSAHDGVSFVSDLDFFQKQVTQRFQDLSSVSSEEILSLSWIRKVLDAFICCQEEFKIILFARKAEMCR
        MPATDFQ SS P TNIGRSIFSLRRDQVHSME  +HDG+   SDLD FQKQVTQ FQDLSS SS++ILSLSWIRK+LDAFICCQEEFKIILF  KA++CR
Subjt:  MPATDFQSSSLPFTNIGRSIFSLRRDQVHSMESSAHDGVSFVSDLDFFQKQVTQRFQDLSSVSSEEILSLSWIRKVLDAFICCQEEFKIILFARKAEMCR

Query:  LPMDRMVSDYLERSVKALDVCNVIREGIDQLRQWQKLLEIVLSALDDSSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPAIAQRNRSFGRSNGTRDN
         PMDRMVSDYLERSVKALDVCNVIR+GI+QLRQWQKLLEIVLSALD+SS+KK LGEGQFRRAKKALIDLAIAMLDENDANSPAIAQRNRSFGR+NGTRD 
Subjt:  LPMDRMVSDYLERSVKALDVCNVIREGIDQLRQWQKLLEIVLSALDDSSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPAIAQRNRSFGRSNGTRDN

Query:  RFLGHFRSLSWSVSRSWSAARQLQAIGSNLTAPRANETMATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLNVHFSLPRQFAWAAPMLSLHDRILEES
        R LGHFRSLSWSVSRSWSAARQLQAIGSNL APRANE + TNGLAVPVFTMNMVLLFVTW L+AAIPCQDRGL+VHFSLPRQF+WAAPMLSLHDRILEES
Subjt:  RFLGHFRSLSWSVSRSWSAARQLQAIGSNLTAPRANETMATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLNVHFSLPRQFAWAAPMLSLHDRILEES

Query:  RRRERRNACGLLKEIHQIDKFAHIMNELADTAQFPLTNDKEEEVRQRMQELSQICETLKIGLDPLERQIREVFHRI
        RRRERRNACGLLKEIHQIDKFAHIMNELADTAQFPLTN++EEEVR+R+QELSQICETLKIGLDPLERQIREVFHRI
Subjt:  RRRERRNACGLLKEIHQIDKFAHIMNELADTAQFPLTNDKEEEVRQRMQELSQICETLKIGLDPLERQIREVFHRI

A0A6J1F765 uncharacterized protein LOC1114427334.8e-209100Show/hide
Query:  MPATDFQSSSLPFTNIGRSIFSLRRDQVHSMESSAHDGVSFVSDLDFFQKQVTQRFQDLSSVSSEEILSLSWIRKVLDAFICCQEEFKIILFARKAEMCR
        MPATDFQSSSLPFTNIGRSIFSLRRDQVHSMESSAHDGVSFVSDLDFFQKQVTQRFQDLSSVSSEEILSLSWIRKVLDAFICCQEEFKIILFARKAEMCR
Subjt:  MPATDFQSSSLPFTNIGRSIFSLRRDQVHSMESSAHDGVSFVSDLDFFQKQVTQRFQDLSSVSSEEILSLSWIRKVLDAFICCQEEFKIILFARKAEMCR

Query:  LPMDRMVSDYLERSVKALDVCNVIREGIDQLRQWQKLLEIVLSALDDSSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPAIAQRNRSFGRSNGTRDN
        LPMDRMVSDYLERSVKALDVCNVIREGIDQLRQWQKLLEIVLSALDDSSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPAIAQRNRSFGRSNGTRDN
Subjt:  LPMDRMVSDYLERSVKALDVCNVIREGIDQLRQWQKLLEIVLSALDDSSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPAIAQRNRSFGRSNGTRDN

Query:  RFLGHFRSLSWSVSRSWSAARQLQAIGSNLTAPRANETMATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLNVHFSLPRQFAWAAPMLSLHDRILEES
        RFLGHFRSLSWSVSRSWSAARQLQAIGSNLTAPRANETMATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLNVHFSLPRQFAWAAPMLSLHDRILEES
Subjt:  RFLGHFRSLSWSVSRSWSAARQLQAIGSNLTAPRANETMATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLNVHFSLPRQFAWAAPMLSLHDRILEES

Query:  RRRERRNACGLLKEIHQIDKFAHIMNELADTAQFPLTNDKEEEVRQRMQELSQICETLKIGLDPLERQIREVFHRI
        RRRERRNACGLLKEIHQIDKFAHIMNELADTAQFPLTNDKEEEVRQRMQELSQICETLKIGLDPLERQIREVFHRI
Subjt:  RRRERRNACGLLKEIHQIDKFAHIMNELADTAQFPLTNDKEEEVRQRMQELSQICETLKIGLDPLERQIREVFHRI

A0A6J1IC10 uncharacterized protein LOC1114740473.6e-20497.07Show/hide
Query:  MPATDFQSSSLPFTNIGRSIFSLRRDQVHSMESSAHDGVSFVSDLDFFQKQVTQRFQDLSSVSSEEILSLSWIRKVLDAFICCQEEFKIILFARKAEMCR
        MPATDFQSSSLPFTNIGRSIFSLRRDQVHS+ESSAHDGVSFVSDLDFFQKQVTQRFQDLSSVSSEEILSLSWIRKVLDAFICCQEEFKIILFARKAEMC+
Subjt:  MPATDFQSSSLPFTNIGRSIFSLRRDQVHSMESSAHDGVSFVSDLDFFQKQVTQRFQDLSSVSSEEILSLSWIRKVLDAFICCQEEFKIILFARKAEMCR

Query:  LPMDRMVSDYLERSVKALDVCNVIREGIDQLRQWQKLLEIVLSALDDSSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPAIAQRNRSFGRSNGTRDN
        LPMDR+VSDYLERSVKALDVCNVIREGI+QLRQWQKLLEIVLSALD+SSHK+TLGEGQFRRAKKALIDLAIAMLDENDANSPAIAQRNRSFG SNGTRDN
Subjt:  LPMDRMVSDYLERSVKALDVCNVIREGIDQLRQWQKLLEIVLSALDDSSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPAIAQRNRSFGRSNGTRDN

Query:  RFLGHFRSLSWSVSRSWSAARQLQAIGSNLTAPRANETMATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLNVHFSLPRQFAWAAPMLSLHDRILEES
        RFLGHFRSLSWSVSRSWSAARQLQAIGSNLTAPRANETMATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLNVHFSLPRQFAWAAPMLSLHDRILEES
Subjt:  RFLGHFRSLSWSVSRSWSAARQLQAIGSNLTAPRANETMATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLNVHFSLPRQFAWAAPMLSLHDRILEES

Query:  RRRERRNACGLLKEIHQIDKFAHIMNELADTAQFPLTNDKEEEVRQRMQELSQICETLKIGLDPLERQIREVFHRI
        RRRERRNACGLLKEIHQIDKFAHIMNELADTAQFPLTND+EEEVRQR+QELSQICETL IGLDPLERQ+REVFHRI
Subjt:  RRRERRNACGLLKEIHQIDKFAHIMNELADTAQFPLTNDKEEEVRQRMQELSQICETLKIGLDPLERQIREVFHRI

SwissProt top hitse value%identityAlignment
A2Z9A6 UPF0496 protein 43.2e-0822.76Show/hide
Query:  FQKQVTQRFQDLSSVSSEEILSLSWIRKVLDAFICCQEEFKIILFARKAEMCRLPM----DRMVSDYLERSVKALDVCNVIREGIDQLRQWQKLLEIVLS
        ++  +    + L   ++ ++L+LSW+R  +D    C  E    + A       LP+    D+ V  YL  SVK LD+C  +   + +L Q Q LL+  L 
Subjt:  FQKQVTQRFQDLSSVSSEEILSLSWIRKVLDAFICCQEEFKIILFARKAEMCRLPM----DRMVSDYLERSVKALDVCNVIREGIDQLRQWQKLLEIVLS

Query:  ALDDSSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPAIAQRNRSFGRSNGTRDNRFLGHFRSLSWSVSRSWSAARQLQAIGSNLTAPRANETMATNG
         L   S      + Q +RA+ +L                      R +    G R  R +              S +  LQ +  NL+  +   ++    
Subjt:  ALDDSSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPAIAQRNRSFGRSNGTRDNRFLGHFRSLSWSVSRSWSAARQLQAIGSNLTAPRANETMATNG

Query:  LAVPVFTMNMVLLFVTWTLVAAIPCQDRGLNVHFSLPRQFAWAAPMLSLHDRILEESRRRERRNACGLLKEIHQIDKFAHIMNELADTAQ
        L   ++ +  V +FV    VA +    + L V   +P +F W+     LH  + EE  R+    +   +KE+ +++  A  ++ LA T+Q
Subjt:  LAVPVFTMNMVLLFVTWTLVAAIPCQDRGLNVHFSLPRQFAWAAPMLSLHDRILEESRRRERRNACGLLKEIHQIDKFAHIMNELADTAQ

Q337C0 UPF0496 protein 44.2e-0822.76Show/hide
Query:  FQKQVTQRFQDLSSVSSEEILSLSWIRKVLDAFICCQEEFKIILFARKAEMCRLPM----DRMVSDYLERSVKALDVCNVIREGIDQLRQWQKLLEIVLS
        ++  +    + L   ++ ++L+LSW+R  +D    C  E    + A       LP+    D+ V  YL  SVK LD+C  +   + +L Q Q LL+  L 
Subjt:  FQKQVTQRFQDLSSVSSEEILSLSWIRKVLDAFICCQEEFKIILFARKAEMCRLPM----DRMVSDYLERSVKALDVCNVIREGIDQLRQWQKLLEIVLS

Query:  ALDDSSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPAIAQRNRSFGRSNGTRDNRFLGHFRSLSWSVSRSWSAARQLQAIGSNLTAPRANETMATNG
         L   S      + Q +RA+ +L                      R +    G R  R +              S +  LQ +  NL+  +   +     
Subjt:  ALDDSSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPAIAQRNRSFGRSNGTRDNRFLGHFRSLSWSVSRSWSAARQLQAIGSNLTAPRANETMATNG

Query:  LAVPVFTMNMVLLFVTWTLVAAIPCQDRGLNVHFSLPRQFAWAAPMLSLHDRILEESRRRERRNACGLLKEIHQIDKFAHIMNELADTAQ
        L   ++ +  V +FV    VA +    + L V   +P +F W+     LH  + EE  R+    +   +KE+ +++  A  ++ LA T+Q
Subjt:  LAVPVFTMNMVLLFVTWTLVAAIPCQDRGLNVHFSLPRQFAWAAPMLSLHDRILEESRRRERRNACGLLKEIHQIDKFAHIMNELADTAQ

Q9CAK4 Protein ROH13.3e-6636.1Show/hide
Query:  PATDFQSSSLPFTNIGRSIFSLRRDQVHSMESSAHDGVSFVSDLDFFQKQVTQRFQDL----------------SSVSSEEILSLSWIRKVLDAFICCQE
        PA D Q S L     GR   S+RR+Q   + +          DL+ FQK +  RF +L                S  ++E+I+S++W+RK++D F+CC+ 
Subjt:  PATDFQSSSLPFTNIGRSIFSLRRDQVHSMESSAHDGVSFVSDLDFFQKQVTQRFQDL----------------SSVSSEEILSLSWIRKVLDAFICCQE

Query:  EFKIILFARK--AEMCRLPMDRMVSDYLERSVKALDVCNVIREGIDQLRQWQKLLEIVLSALDDSSHKKTLGEGQFRRAKKALIDLAIAMLDENDAN---
        EFK IL   +   ++ + P DR+V + L+RS+KALD+C  +  GID +R +Q+L EI ++AL+    ++ LG+G  RRAK+AL +L +A+  E+  N   
Subjt:  EFKIILFARK--AEMCRLPMDRMVSDYLERSVKALDVCNVIREGIDQLRQWQKLLEIVLSALDDSSHKKTLGEGQFRRAKKALIDLAIAMLDENDAN---

Query:  -------SPAIAQRNRSFGRSNG-----TRDNRFLGHFRSLSWSVSRSWSAARQLQAIGSNLTAPRANETMATNGLAVPVFTMNMVLLFVTWTLVAAIPC
                    +R+ SFGR +G     ++    +G  +S SW+V R+WSAA+Q+ A+ +NLT PR NE     GL  P+F M+ V++FV W L AA+PC
Subjt:  -------SPAIAQRNRSFGRSNG-----TRDNRFLGHFRSLSWSVSRSWSAARQLQAIGSNLTAPRANETMATNGLAVPVFTMNMVLLFVTWTLVAAIPC

Query:  QDR-GLNVHFSL-PRQFAWAAPMLSLHDRILEESRRRERRNACGLLKEIHQIDKFAHIMNELADTAQFPLTNDKEEEVRQRMQELSQICETLKIGLDPLE
        Q+R GL  H  + P+   WA  ++ +H++I +E +++E++ + GL++E+ +++K  H + E AD   +P   D  E    ++ E+++IC  ++  L PL+
Subjt:  QDR-GLNVHFSL-PRQFAWAAPMLSLHDRILEESRRRERRNACGLLKEIHQIDKFAHIMNELADTAQFPLTNDKEEEVRQRMQELSQICETLKIGLDPLE

Query:  RQIREVFHRI
        +QIREVFHRI
Subjt:  RQIREVFHRI

Arabidopsis top hitse value%identityAlignment
AT1G18740.1 Protein of unknown function (DUF793)8.6e-12660.99Show/hide
Query:  MPATDFQSSSLPFTNIGRSIFSLRRDQVHSM-----ESSAHDGVSFVSDLDFFQKQVTQRFQDLSSVSSEEILSLSWIRKVLDAFICCQEEFKIILFARK
        MPATDFQ S       GRS+ SLRRDQV S       SS H+  +   +LD FQ+QV ++F DL++ SS ++LSL WI K+LD+F+CCQEEF+ I+F  +
Subjt:  MPATDFQSSSLPFTNIGRSIFSLRRDQVHSM-----ESSAHDGVSFVSDLDFFQKQVTQRFQDLSSVSSEEILSLSWIRKVLDAFICCQEEFKIILFARK

Query:  AEMCRLPMDRMVSDYLERSVKALDVCNVIREGIDQLRQWQKLLEIVLSALDDSSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPA-IAQRNRSFGRS
        +++ + PMDR++SDY ERS+KALDVCN IR+GI+Q+RQW+KL +IV+SALD  SH + +GEGQ RRAKKALIDLAI MLDE D  S   +A RNRSFGR 
Subjt:  AEMCRLPMDRMVSDYLERSVKALDVCNVIREGIDQLRQWQKLLEIVLSALDDSSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPA-IAQRNRSFGRS

Query:  NGTRDNRFLGHFRSLSWSVSRSWSAARQLQAIGSNLTAPRANETMATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLNVHFSLPRQFAWAAPMLSLHD
          +  +R +GHFRSLSWSVSRSWSA++QLQA+ SNL  PR N+ +A+NGLAVPV+TM  VLLFV W LVAAIPCQDRGL V+F +PR F WAAP++SLHD
Subjt:  NGTRDNRFLGHFRSLSWSVSRSWSAARQLQAIGSNLTAPRANETMATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLNVHFSLPRQFAWAAPMLSLHD

Query:  RILEESRRRERRNACGLLKEIHQIDKFAHIMNELADTAQFPLTNDKEEEVRQRMQELSQICETLKIGLDPLERQIREVFHRI
        +I+EES+RR+R+N CGLLKEI +I+K + +MNEL D+  FPL +DKE EV+QR+ EL Q+ E L+ GLDP ER++REVFHRI
Subjt:  RILEESRRRERRNACGLLKEIHQIDKFAHIMNELADTAQFPLTNDKEEEVRQRMQELSQICETLKIGLDPLERQIREVFHRI

AT1G43630.1 Protein of unknown function (DUF793)1.5e-11455.61Show/hide
Query:  SSLPFT--NIGRSIFSLRRDQVHSME-SSAHDGVSFVSDLDFFQKQVTQRFQDLSSVSSE-EILSLSWIRKVLDAFICCQEEFKIILFARKAEMCRLPMD
        S +P T  + GRS  SLRRDQ H M+ +S  + ++   +LD FQ+QV ++F DL++ + E EILSL WI K+LD+F+CCQE+F++I+F  K ++ + PMD
Subjt:  SSLPFT--NIGRSIFSLRRDQVHSME-SSAHDGVSFVSDLDFFQKQVTQRFQDLSSVSSE-EILSLSWIRKVLDAFICCQEEFKIILFARKAEMCRLPMD

Query:  RMVSDYLERSVKALDVCNVIREGIDQLRQWQKLLEIVLSALDDSSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPAIAQRNRSFGRSNGTRDNRFLG
        R++ +Y ERSVKALDVCN IR+GI+Q+RQWQKL+EIV+SALD  ++++ LGEG+  RAKKALIDLAI MLDE D+++     RNRSF R+     N+ +G
Subjt:  RMVSDYLERSVKALDVCNVIREGIDQLRQWQKLLEIVLSALDDSSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPAIAQRNRSFGRSNGTRDNRFLG

Query:  HFRSLSWSVSRSWSAARQLQAIGSNLTAPRANETMATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLNVHFSLPRQFAWAAPMLSLHDRILEESRRRE
        + RSLSWSVSRSWSA+RQLQ IG+NL  PRA++ MATNGLA+ V+TM  +LLFVTW LVAAIPCQDRGL+VHF  PR F WA P++SLHD+I++ES++R+
Subjt:  HFRSLSWSVSRSWSAARQLQAIGSNLTAPRANETMATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLNVHFSLPRQFAWAAPMLSLHDRILEESRRRE

Query:  -RRNACGLLKEIHQIDKFAHIMNELADTAQFPLTNDK-EEEVRQRMQELSQICETLKIGLDPLERQIREVFHRI
         ++  CGLL+EI+QI++ + ++++L D+  F LT++K   EV++R+QEL  +CE +K GLDP +R++R+VFH+I
Subjt:  -RRNACGLLKEIHQIDKFAHIMNELADTAQFPLTNDK-EEEVRQRMQELSQICETLKIGLDPLERQIREVFHRI

AT1G63930.1 from the Czech 'roh' meaning 'corner'2.4e-6736.1Show/hide
Query:  PATDFQSSSLPFTNIGRSIFSLRRDQVHSMESSAHDGVSFVSDLDFFQKQVTQRFQDL----------------SSVSSEEILSLSWIRKVLDAFICCQE
        PA D Q S L     GR   S+RR+Q   + +          DL+ FQK +  RF +L                S  ++E+I+S++W+RK++D F+CC+ 
Subjt:  PATDFQSSSLPFTNIGRSIFSLRRDQVHSMESSAHDGVSFVSDLDFFQKQVTQRFQDL----------------SSVSSEEILSLSWIRKVLDAFICCQE

Query:  EFKIILFARK--AEMCRLPMDRMVSDYLERSVKALDVCNVIREGIDQLRQWQKLLEIVLSALDDSSHKKTLGEGQFRRAKKALIDLAIAMLDENDAN---
        EFK IL   +   ++ + P DR+V + L+RS+KALD+C  +  GID +R +Q+L EI ++AL+    ++ LG+G  RRAK+AL +L +A+  E+  N   
Subjt:  EFKIILFARK--AEMCRLPMDRMVSDYLERSVKALDVCNVIREGIDQLRQWQKLLEIVLSALDDSSHKKTLGEGQFRRAKKALIDLAIAMLDENDAN---

Query:  -------SPAIAQRNRSFGRSNG-----TRDNRFLGHFRSLSWSVSRSWSAARQLQAIGSNLTAPRANETMATNGLAVPVFTMNMVLLFVTWTLVAAIPC
                    +R+ SFGR +G     ++    +G  +S SW+V R+WSAA+Q+ A+ +NLT PR NE     GL  P+F M+ V++FV W L AA+PC
Subjt:  -------SPAIAQRNRSFGRSNG-----TRDNRFLGHFRSLSWSVSRSWSAARQLQAIGSNLTAPRANETMATNGLAVPVFTMNMVLLFVTWTLVAAIPC

Query:  QDR-GLNVHFSL-PRQFAWAAPMLSLHDRILEESRRRERRNACGLLKEIHQIDKFAHIMNELADTAQFPLTNDKEEEVRQRMQELSQICETLKIGLDPLE
        Q+R GL  H  + P+   WA  ++ +H++I +E +++E++ + GL++E+ +++K  H + E AD   +P   D  E    ++ E+++IC  ++  L PL+
Subjt:  QDR-GLNVHFSL-PRQFAWAAPMLSLHDRILEESRRRERRNACGLLKEIHQIDKFAHIMNELADTAQFPLTNDKEEEVRQRMQELSQICETLKIGLDPLE

Query:  RQIREVFHRI
        +QIREVFHRI
Subjt:  RQIREVFHRI

AT1G74450.1 Protein of unknown function (DUF793)4.0e-12357.92Show/hide
Query:  MPATDFQSSSLPFTNIGRSIFSLRRD-QVHSMESS--AHDGVSFVSDLDFFQKQVTQRFQDLSSVSSEEILSLSWIRKVLDAFICCQEEFKIILFARKAE
        MPAT++Q S       GRS  +LRRD  V+S+ES+    +     ++L  FQ++V +RF DL++ S E++LSL W+ K+LD+F+ CQEEF+ I+   ++ 
Subjt:  MPATDFQSSSLPFTNIGRSIFSLRRD-QVHSMESS--AHDGVSFVSDLDFFQKQVTQRFQDLSSVSSEEILSLSWIRKVLDAFICCQEEFKIILFARKAE

Query:  MCRLPMDRMVSDYLERSVKALDVCNVIREGIDQLRQWQKLLEIVLSALDD----SSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPAIA--QRNRSF
        + + PMDR+VSDY ERSVKALDVCN IR+G++Q+RQWQKL+EIV+ A ++    SS K+ LGEGQFRRA+K LI+LAI MLDE D++S +++   RNRSF
Subjt:  MCRLPMDRMVSDYLERSVKALDVCNVIREGIDQLRQWQKLLEIVLSALDD----SSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPAIA--QRNRSF

Query:  GRSNGTRDNRFLGHFRSLSWSVSRSWSAARQLQAIGSNLTAPRANETMATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLNVHFSLPRQFAWAAPMLS
        GR+     +R +GHFRSLSWSVSRSWSA++QLQAIG+NL  PRA++  ATNGL VPV+TM  VLLFV W LVAAIPCQDRGL VHF++PR + W   ++S
Subjt:  GRSNGTRDNRFLGHFRSLSWSVSRSWSAARQLQAIGSNLTAPRANETMATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLNVHFSLPRQFAWAAPMLS

Query:  LHDRILEESRRRERRNACGLLKEIHQIDKFAHIMNELADTAQFPLTNDKEEEVRQRMQELSQICETLKIGLDPLERQIREVFHRI
        LHDRI+EES++RER+N CGLLKEIHQ +K + +MNEL D+ QFPL+ +KE EVR+R++EL ++ E LK GLDP ER++REVFHRI
Subjt:  LHDRILEESRRRERRNACGLLKEIHQIDKFAHIMNELADTAQFPLTNDKEEEVRQRMQELSQICETLKIGLDPLERQIREVFHRI

AT4G23530.1 Protein of unknown function (DUF793)1.4e-5636.52Show/hide
Query:  ATDFQSSSLPFTNIGRSIFSLRRDQVHSMESSAHDGVSFVSDLDFFQKQVTQRFQDL------------SSVS--SEEILSLSWIRKVLDAFICCQEEFK
        AT+FQ S L       S  S+RR+Q+ SM+ + H+    + +L++FQK V +RF DL            S+VS  S+ ILS+ W++ +LD F+ C+ EFK
Subjt:  ATDFQSSSLPFTNIGRSIFSLRRDQVHSMESSAHDGVSFVSDLDFFQKQVTQRFQDL------------SSVS--SEEILSLSWIRKVLDAFICCQEEFK

Query:  IILFARKAEMCRLP-MDRMVSDYLERSVKALDVCNVIREGIDQLRQWQKLLEIVLSALDDSSHKKTLGEGQFRRAKKALIDLAIAM-----LDENDANSP
         +L     ++ + P ++R++ + L+R +KALD+CN +  GID +RQ ++  EI ++AL     ++ L +G  RRAK+AL  L I +      D N   S 
Subjt:  IILFARKAEMCRLP-MDRMVSDYLERSVKALDVCNVIREGIDQLRQWQKLLEIVLSALDDSSHKKTLGEGQFRRAKKALIDLAIAM-----LDENDANSP

Query:  AIAQ-RNRSFGRSNGTRDNRFLGHFRSLSWSVSRSWSAARQLQAIGSNLTAPRANETMATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLNV-HFSLP
           Q R  S   S GTR N   G        VS++WSA++Q+QA+ +NL  PR  E    +G A+PV+ M+ V++ V W LVAA+PCQ   + V    LP
Subjt:  AIAQ-RNRSFGRSNGTRDNRFLGHFRSLSWSVSRSWSAARQLQAIGSNLTAPRANETMATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLNV-HFSLP

Query:  RQFAWAAPMLSLHDRILEESRRRERR-NACGLLKEIHQIDKFAHIMNELADTAQFPLTNDKEEEVRQRMQELSQICETLKIGLDPLERQIREVFHRI
        +   WA+  +S+ +RI EE +R+E+R    GL++E+ +++K    + E A+  +FP   ++E EV +++ E+ +IC  +++GL+ L+RQ+R+VFHR+
Subjt:  RQFAWAAPMLSLHDRILEESRRRERR-NACGLLKEIHQIDKFAHIMNELADTAQFPLTNDKEEEVRQRMQELSQICETLKIGLDPLERQIREVFHRI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTGCTACTGATTTTCAGAGTTCTTCGTTGCCGTTTACTAACATTGGCCGTTCGATCTTCAGTCTCCGACGTGATCAAGTTCATTCCATGGAGAGCAGCGCTCATGA
TGGCGTCTCCTTCGTTTCCGATCTCGATTTCTTCCAGAAGCAGGTCACTCAACGGTTTCAGGATTTGTCGTCGGTTTCTTCTGAGGAGATTTTGTCGCTCTCGTGGATTA
GGAAGGTTTTGGATGCTTTTATTTGTTGTCAGGAGGAATTTAAAATCATTTTGTTTGCTCGGAAGGCGGAAATGTGTAGACTGCCGATGGATAGGATGGTTTCGGATTAC
TTGGAGCGGAGCGTGAAGGCGCTTGATGTTTGTAATGTGATTAGAGAGGGGATTGACCAGCTGAGGCAGTGGCAGAAGTTGTTGGAGATTGTGCTTAGTGCTTTGGATGA
TTCTAGTCACAAGAAGACTCTTGGTGAAGGCCAATTTCGCCGTGCCAAGAAGGCTTTGATAGACTTGGCGATTGCAATGCTCGATGAAAATGATGCCAATTCTCCGGCTA
TTGCACAGAGAAACCGTTCTTTTGGACGCAGTAATGGTACCAGAGACAACAGATTCTTGGGGCATTTCCGTTCGCTTTCATGGAGCGTTTCACGTTCGTGGTCGGCCGCT
AGGCAGCTTCAGGCCATTGGGAGCAATTTGACTGCACCGAGGGCGAATGAGACTATGGCTACCAATGGCCTGGCAGTCCCTGTTTTTACAATGAACATGGTATTGCTGTT
CGTAACGTGGACTCTGGTAGCGGCGATTCCTTGCCAGGACCGTGGCTTAAATGTTCATTTTTCGTTGCCCCGGCAGTTCGCATGGGCGGCGCCAATGCTTTCACTTCACG
ATCGGATTTTGGAAGAATCGAGAAGGCGGGAAAGAAGAAATGCTTGTGGGTTGTTGAAGGAAATTCATCAGATCGATAAATTTGCGCATATTATGAATGAATTGGCTGAT
ACAGCACAGTTCCCCTTGACGAATGATAAGGAAGAAGAGGTGAGGCAAAGAATGCAGGAGCTATCTCAAATTTGTGAAACTTTGAAGATTGGTTTGGATCCTCTGGAACG
GCAGATCAGAGAGGTGTTCCATCGAATTAACATCTTCATAGTTGGCTTCCAGAGTTGTGAGAAGTTTGAACAATGGACGAGTGAAATAAACGAAAGGCTGTTCTCTGTGA
TATTTATAGCATGGGAATCCGTTCAAAATTATACGCTGAATTTCCCTGCTTCTGCGAACTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCTGCTACTGATTTTCAGAGTTCTTCGTTGCCGTTTACTAACATTGGCCGTTCGATCTTCAGTCTCCGACGTGATCAAGTTCATTCCATGGAGAGCAGCGCTCATGA
TGGCGTCTCCTTCGTTTCCGATCTCGATTTCTTCCAGAAGCAGGTCACTCAACGGTTTCAGGATTTGTCGTCGGTTTCTTCTGAGGAGATTTTGTCGCTCTCGTGGATTA
GGAAGGTTTTGGATGCTTTTATTTGTTGTCAGGAGGAATTTAAAATCATTTTGTTTGCTCGGAAGGCGGAAATGTGTAGACTGCCGATGGATAGGATGGTTTCGGATTAC
TTGGAGCGGAGCGTGAAGGCGCTTGATGTTTGTAATGTGATTAGAGAGGGGATTGACCAGCTGAGGCAGTGGCAGAAGTTGTTGGAGATTGTGCTTAGTGCTTTGGATGA
TTCTAGTCACAAGAAGACTCTTGGTGAAGGCCAATTTCGCCGTGCCAAGAAGGCTTTGATAGACTTGGCGATTGCAATGCTCGATGAAAATGATGCCAATTCTCCGGCTA
TTGCACAGAGAAACCGTTCTTTTGGACGCAGTAATGGTACCAGAGACAACAGATTCTTGGGGCATTTCCGTTCGCTTTCATGGAGCGTTTCACGTTCGTGGTCGGCCGCT
AGGCAGCTTCAGGCCATTGGGAGCAATTTGACTGCACCGAGGGCGAATGAGACTATGGCTACCAATGGCCTGGCAGTCCCTGTTTTTACAATGAACATGGTATTGCTGTT
CGTAACGTGGACTCTGGTAGCGGCGATTCCTTGCCAGGACCGTGGCTTAAATGTTCATTTTTCGTTGCCCCGGCAGTTCGCATGGGCGGCGCCAATGCTTTCACTTCACG
ATCGGATTTTGGAAGAATCGAGAAGGCGGGAAAGAAGAAATGCTTGTGGGTTGTTGAAGGAAATTCATCAGATCGATAAATTTGCGCATATTATGAATGAATTGGCTGAT
ACAGCACAGTTCCCCTTGACGAATGATAAGGAAGAAGAGGTGAGGCAAAGAATGCAGGAGCTATCTCAAATTTGTGAAACTTTGAAGATTGGTTTGGATCCTCTGGAACG
GCAGATCAGAGAGGTGTTCCATCGAATTAACATCTTCATAGTTGGCTTCCAGAGTTGTGAGAAGTTTGAACAATGGACGAGTGAAATAAACGAAAGGCTGTTCTCTGTGA
TATTTATAGCATGGGAATCCGTTCAAAATTATACGCTGAATTTCCCTGCTTCTGCGAACTGACAATTGGGAGGCTACACACACACTGTCTCTGTCTCCCCCTCGCAGTAC
ATTTCTTTCAAAAAGTATCATTTGTCTTACTTTTAGTTCTTCATTTCTTGGAGGAAGGATAAAAACATAGGAATTACATTTGGGGCCATGTATGTAGGGAATTGGTGAGT
TCATAACAAGAAATTTGTGTGAGATCCCACGTTGGTTGGGGAGGAGAACGAAACATTCTTTATAAGGATGTGGAAACCTCTCCTTAGTAG
Protein sequenceShow/hide protein sequence
MPATDFQSSSLPFTNIGRSIFSLRRDQVHSMESSAHDGVSFVSDLDFFQKQVTQRFQDLSSVSSEEILSLSWIRKVLDAFICCQEEFKIILFARKAEMCRLPMDRMVSDY
LERSVKALDVCNVIREGIDQLRQWQKLLEIVLSALDDSSHKKTLGEGQFRRAKKALIDLAIAMLDENDANSPAIAQRNRSFGRSNGTRDNRFLGHFRSLSWSVSRSWSAA
RQLQAIGSNLTAPRANETMATNGLAVPVFTMNMVLLFVTWTLVAAIPCQDRGLNVHFSLPRQFAWAAPMLSLHDRILEESRRRERRNACGLLKEIHQIDKFAHIMNELAD
TAQFPLTNDKEEEVRQRMQELSQICETLKIGLDPLERQIREVFHRINIFIVGFQSCEKFEQWTSEINERLFSVIFIAWESVQNYTLNFPASAN