CuGenDBv2

Tan0001776 (gene) of Snake gourd v1 genome

Gene ID: Tan0001776
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Protein of unknown function (DUF1639)
Genome location: LG09:62064886..62067615
RNA-Seq Expression: Tan0001776
Synteny: Tan0001776
Gene Ontology terms: NA
InterPro domains: NA
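
The genome location field packs a linkage group and a start..end range into one string. A minimal Python sketch for splitting it and computing the gene span is given here. The function name parse_location is hypothetical, and the sketch assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(location):
    """Split a string like 'LG09:62064886..62067615' into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

seqid, start, end = parse_location("LG09:62064886..62067615")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene spans 2730 bp on LG09; with 0-based half-open coordinates the span would instead be end - start.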


Homology
GenBank top hits (e value, % identity, alignment)
TYK09966.1 uncharacterized protein E5676_scaffold16G00950 [Cucumis melo var. makuwa] (e value 2.4e-94, 92.23% identity)
Query:  MENEGIAQKSNKSGETDFLLQWGNRKRMRYMKVKEPQRVTSKPDCLGKKKVSSNGDRRVVTAEKASSTQQTHRLSKNIVSPTSNQKTLATSPEKEDRYYT
        ME E I QKSNKSGETDFLLQWGNRKRMRYMKVKEPQR+TSKPDCLGKKKVSS GDRRVVTAEKASSTQQTHRL+K+I SPT NQ+T  TSPEKEDRYYT
Subjt:  MENEGIAQKSNKSGETDFLLQWGNRKRMRYMKVKEPQRVTSKPDCLGKKKVSSNGDRRVVTAEKASSTQQTHRLSKNIVSPTSNQKTLATSPEKEDRYYT

Query:  TRGSMGVDEKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKKV
        TRGSMGVDEK S+DHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKK+
Subjt:  TRGSMGVDEKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKKV

XP_004144072.1 uncharacterized protein LOC101218228 isoform X4 [Cucumis sativus] (e value 1.5e-93, 92.19% identity)
Query:  MENEGIAQKSNKSGETDFLLQWGNRKRMRYMKVKEPQRVTSKPDCLGKKKVSSNGDRRVVTAEKASSTQQTHRLSKNIVSPTSNQKTLATSPEKEDRYYT
        ME E I QKSNKSGETDFLLQWGNRKRMRYMKVKEPQR+TSKPDCLGKKKVSS GDRRVVTAEKASSTQQTHRL+K+I SPT NQ+T  TSPEKEDRYYT
Subjt:  MENEGIAQKSNKSGETDFLLQWGNRKRMRYMKVKEPQRVTSKPDCLGKKKVSSNGDRRVVTAEKASSTQQTHRLSKNIVSPTSNQKTLATSPEKEDRYYT

Query:  TRGSMGVDEKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKK
        TRGSMGVD+K SMDHP GNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKK
Subjt:  TRGSMGVDEKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKK

XP_008451016.1 PREDICTED: uncharacterized protein LOC103492424 isoform X1 [Cucumis melo] (e value 4.1e-94, 92.71% identity)
Query:  MENEGIAQKSNKSGETDFLLQWGNRKRMRYMKVKEPQRVTSKPDCLGKKKVSSNGDRRVVTAEKASSTQQTHRLSKNIVSPTSNQKTLATSPEKEDRYYT
        ME E I QKSNKSGETDFLLQWGNRKRMRYMKVKEPQR+TSKPDCLGKKKVSS GDRRVVTAEKASSTQQTHRL+K+I SPT NQ+T  TSPEKEDRYYT
Subjt:  MENEGIAQKSNKSGETDFLLQWGNRKRMRYMKVKEPQRVTSKPDCLGKKKVSSNGDRRVVTAEKASSTQQTHRLSKNIVSPTSNQKTLATSPEKEDRYYT

Query:  TRGSMGVDEKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKK
        TRGSMGVDEK S+DHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKK
Subjt:  TRGSMGVDEKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKK

XP_008451017.1 PREDICTED: uncharacterized protein LOC103492424 isoform X2 [Cucumis melo] (e value 4.1e-94, 92.71% identity)
Query:  MENEGIAQKSNKSGETDFLLQWGNRKRMRYMKVKEPQRVTSKPDCLGKKKVSSNGDRRVVTAEKASSTQQTHRLSKNIVSPTSNQKTLATSPEKEDRYYT
        ME E I QKSNKSGETDFLLQWGNRKRMRYMKVKEPQR+TSKPDCLGKKKVSS GDRRVVTAEKASSTQQTHRL+K+I SPT NQ+T  TSPEKEDRYYT
Subjt:  MENEGIAQKSNKSGETDFLLQWGNRKRMRYMKVKEPQRVTSKPDCLGKKKVSSNGDRRVVTAEKASSTQQTHRLSKNIVSPTSNQKTLATSPEKEDRYYT

Query:  TRGSMGVDEKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKK
        TRGSMGVDEK S+DHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKK
Subjt:  TRGSMGVDEKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKK

XP_038879793.1 uncharacterized protein LOC120071537 [Benincasa hispida] (e value 2.4e-94, 92.71% identity)
Query:  MENEGIAQKSNKSGETDFLLQWGNRKRMRYMKVKEPQRVTSKPDCLGKKKVSSNGDRRVVTAEKASSTQQTHRLSKNIVSPTSNQKTLATSPEKEDRYYT
        ME E I QKSNKSGETDFLLQWGNRKRMRYMKVKEPQR+T+KPDCLGKKKVSS GDRRVVTAEKASST QTHRL+K+IV+PT+NQ+T  TSPEKEDRYYT
Subjt:  MENEGIAQKSNKSGETDFLLQWGNRKRMRYMKVKEPQRVTSKPDCLGKKKVSSNGDRRVVTAEKASSTQQTHRLSKNIVSPTSNQKTLATSPEKEDRYYT

Query:  TRGSMGVDEKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKK
        TRGSMGVDEKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKK
Subjt:  TRGSMGVDEKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LZ81 Uncharacterized protein (e value 7.5e-94, 92.19% identity)
Query:  MENEGIAQKSNKSGETDFLLQWGNRKRMRYMKVKEPQRVTSKPDCLGKKKVSSNGDRRVVTAEKASSTQQTHRLSKNIVSPTSNQKTLATSPEKEDRYYT
        ME E I QKSNKSGETDFLLQWGNRKRMRYMKVKEPQR+TSKPDCLGKKKVSS GDRRVVTAEKASSTQQTHRL+K+I SPT NQ+T  TSPEKEDRYYT
Subjt:  MENEGIAQKSNKSGETDFLLQWGNRKRMRYMKVKEPQRVTSKPDCLGKKKVSSNGDRRVVTAEKASSTQQTHRLSKNIVSPTSNQKTLATSPEKEDRYYT

Query:  TRGSMGVDEKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKK
        TRGSMGVD+K SMDHP GNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKK
Subjt:  TRGSMGVDEKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKK

A0A1S3BPZ2 uncharacterized protein LOC103492424 isoform X1 (e value 2.0e-94, 92.71% identity)
Query:  MENEGIAQKSNKSGETDFLLQWGNRKRMRYMKVKEPQRVTSKPDCLGKKKVSSNGDRRVVTAEKASSTQQTHRLSKNIVSPTSNQKTLATSPEKEDRYYT
        ME E I QKSNKSGETDFLLQWGNRKRMRYMKVKEPQR+TSKPDCLGKKKVSS GDRRVVTAEKASSTQQTHRL+K+I SPT NQ+T  TSPEKEDRYYT
Subjt:  MENEGIAQKSNKSGETDFLLQWGNRKRMRYMKVKEPQRVTSKPDCLGKKKVSSNGDRRVVTAEKASSTQQTHRLSKNIVSPTSNQKTLATSPEKEDRYYT

Query:  TRGSMGVDEKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKK
        TRGSMGVDEK S+DHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKK
Subjt:  TRGSMGVDEKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKK

A0A1S3BQL5 uncharacterized protein LOC103492424 isoform X2 (e value 2.0e-94, 92.71% identity)
Query:  MENEGIAQKSNKSGETDFLLQWGNRKRMRYMKVKEPQRVTSKPDCLGKKKVSSNGDRRVVTAEKASSTQQTHRLSKNIVSPTSNQKTLATSPEKEDRYYT
        ME E I QKSNKSGETDFLLQWGNRKRMRYMKVKEPQR+TSKPDCLGKKKVSS GDRRVVTAEKASSTQQTHRL+K+I SPT NQ+T  TSPEKEDRYYT
Subjt:  MENEGIAQKSNKSGETDFLLQWGNRKRMRYMKVKEPQRVTSKPDCLGKKKVSSNGDRRVVTAEKASSTQQTHRLSKNIVSPTSNQKTLATSPEKEDRYYT

Query:  TRGSMGVDEKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKK
        TRGSMGVDEK S+DHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKK
Subjt:  TRGSMGVDEKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKK

A0A5D3CHU7 Uncharacterized protein (e value 1.2e-94, 92.23% identity)
Query:  MENEGIAQKSNKSGETDFLLQWGNRKRMRYMKVKEPQRVTSKPDCLGKKKVSSNGDRRVVTAEKASSTQQTHRLSKNIVSPTSNQKTLATSPEKEDRYYT
        ME E I QKSNKSGETDFLLQWGNRKRMRYMKVKEPQR+TSKPDCLGKKKVSS GDRRVVTAEKASSTQQTHRL+K+I SPT NQ+T  TSPEKEDRYYT
Subjt:  MENEGIAQKSNKSGETDFLLQWGNRKRMRYMKVKEPQRVTSKPDCLGKKKVSSNGDRRVVTAEKASSTQQTHRLSKNIVSPTSNQKTLATSPEKEDRYYT

Query:  TRGSMGVDEKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKKV
        TRGSMGVDEK S+DHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKK+
Subjt:  TRGSMGVDEKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKKV

A0A6J1HZ95 uncharacterized protein LOC111467565 (e value 9.7e-94, 93.23% identity)
Query:  MENEGIAQKSNKSGETDFLLQWGNRKRMRYMKVKEPQRVTSKPDCLGKKKVSSNGDRRVVTAEKASSTQQTHRLSKNIVSPTSNQKTLATSPEKEDRYYT
        MENEGIAQK+NKSGETDF+LQWGNRKRMRYMKVKEPQRVT+KPDCLGKKKVSSNGDRR+VTAEKASSTQ THRL+KNIVSPT NQKTLATSPEKEDRYYT
Subjt:  MENEGIAQKSNKSGETDFLLQWGNRKRMRYMKVKEPQRVTSKPDCLGKKKVSSNGDRRVVTAEKASSTQQTHRLSKNIVSPTSNQKTLATSPEKEDRYYT

Query:  TRGSMGVDEKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKK
        TR SMGVDEKASMD+PTGN  KGFVWPRL+ISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKT+KK
Subjt:  TRGSMGVDEKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKK

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G55340.1 Protein of unknown function (DUF1639) (e value 5.3e-52, 57.22% identity)
Query:  QKSNKSGETDFLLQWGNRKRMRYMKVKEPQRVTS--KPDCLGKKKVSSNGDRRVVTAEKASSTQQTHRLSKNIVSPTSNQKTLATSPEKEDRYYTTRGSM
        Q+ + + E DF+LQWG RKR+R MKVK+ Q + +    DCL K+K+ S    R V++E+ S ++  +R +K   S  + +++   SPEKEDRYYTTRGSM
Subjt:  QKSNKSGETDFLLQWGNRKRMRYMKVKEPQRVTS--KPDCLGKKKVSSNGDRRVVTAEKASSTQQTHRLSKNIVSPTSNQKTLATSPEKEDRYYTTRGSM

Query:  GVDEKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKK
        G+DE   +      + K  VWP+LYI+LS+KEKEEDF+AMKGCKLPQRPKKRAKL+QK+L+LV PG+WLSDLC+ERYEVREKKTSKK
Subjt:  GVDEKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKK

AT1G55340.2 Protein of unknown function (DUF1639) (e value 5.9e-51, 56.68% identity)
Query:  QKSNKSGETDFLLQWGNRKRMRYMKVKEPQRVTS--KPDCLGKKKVSSNGDRRVVTAEKASSTQQTHRLSKNIVSPTSNQKTLATSPEKEDRYYTTRGSM
        Q+ + + E DF+LQWG RKR+R MKVK+ Q + +    DCL K+K+ S    R V++E+ S ++  +R +K+ ++          SPEKEDRYYTTRGSM
Subjt:  QKSNKSGETDFLLQWGNRKRMRYMKVKEPQRVTS--KPDCLGKKKVSSNGDRRVVTAEKASSTQQTHRLSKNIVSPTSNQKTLATSPEKEDRYYTTRGSM

Query:  GVDEKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKK
        G+DE   +      + K  VWP+LYI+LS+KEKEEDF+AMKGCKLPQRPKKRAKL+QK+L+LV PG+WLSDLC+ERYEVREKKTSKK
Subjt:  GVDEKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKK

AT3G03880.1 Protein of unknown function (DUF1639) (e value 2.8e-32, 43.55% identity)
Query:  NKSGETDFLLQWGNRKRMRYMKVKEPQRVTSKPDCLGKKKVSSNGDRRVVTAEKASSTQQTHRLSKNIVSPTSNQKTLATSPEKEDRYYTTRG---SMGV
        + S E +  LQWGN+KR+R ++ K  +   SK           N  R +        + +  R S+  +  +        SPEKE+RYYTTRG   ++G 
Subjt:  NKSGETDFLLQWGNRKRMRYMKVKEPQRVTSKPDCLGKKKVSSNGDRRVVTAEKASSTQQTHRLSKNIVSPTSNQKTLATSPEKEDRYYTTRG---SMGV

Query:  D-EKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKK
        D    +++    N+++  +WP+L+I+LS+KEKEEDFMAMKGCK   RPKKRAKL+Q+SL+LV PG+WL+DLC +RY+VR KK+SKK
Subjt:  D-EKASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKK

AT4G20300.1 Protein of unknown function (DUF1639) (e value 1.4e-20, 67.65% identity)
Query:  WPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKKV
        WPR+YI+LS KEKEEDF+ MKG KLP RP+KRAK + K+L    PG WLSDL + RYEVREKK  KKV
Subjt:  WPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKKV

AT4G20300.2 Protein of unknown function (DUF1639) (e value 4.1e-20, 63.38% identity)
Query:  WPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKKVKEK
        WPR+YI+LS KEKEEDF+ MKG KLP RP+KRAK + K+L    PG WLSDL + RYEVREKK  KK +++
Subjt:  WPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKKVKEK
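
The % identity reported for each hit can be checked against the middle line of its alignment, where a letter marks an identical residue, "+" a conservative substitution, and a space a mismatch or gap. The Python sketch below does this for the AT4G20300.1 hit. The function name percent_identity is hypothetical, and dividing by the aligned length (the length of the Query line, gaps included) is an assumption about how the page computes the figure.

```python
def percent_identity(query, midline):
    """Percent of alignment columns whose midline character is a letter."""
    # Letters in the midline mark identical residues; '+' and ' ' do not count.
    identical = sum(ch.isalpha() for ch in midline)
    # Assumption: the denominator is the aligned length, taken from the Query line.
    return 100.0 * identical / len(query)

# Query and middle line copied from the AT4G20300.1 alignment.
query = "WPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKKV"
midline = "WPR+YI+LS KEKEEDF+ MKG KLP RP+KRAK + K+L    PG WLSDL + RYEVREKK  KKV"

identity = round(percent_identity(query, midline), 2)
print(identity)
```

For this hit, 46 of 68 columns are identical, which rounds to the listed 67.65.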


Sequences
CDS sequence
ATGGAGAACGAAGGAATAGCTCAGAAAAGCAATAAATCTGGTGAAACAGATTTTCTGTTGCAGTGGGGGAACAGGAAGAGGATGAGATATATGAAAGTCAAGGAACCACA
AAGAGTTACTAGTAAACCCGATTGTTTAGGGAAGAAGAAAGTTTCTTCTAATGGTGATCGCCGTGTTGTTACAGCTGAGAAAGCTTCATCTACTCAGCAAACCCATCGCC
TTAGCAAGAACATTGTTTCACCAACTAGCAATCAAAAAACATTAGCAACATCACCAGAAAAGGAAGATCGATACTATACGACAAGGGGATCGATGGGCGTGGATGAAAAA
GCCTCAATGGATCACCCCACAGGGAACGACCGAAAAGGGTTTGTCTGGCCAAGGCTCTATATTTCTCTATCTAGCAAGGAGAAGGAAGAAGATTTCATGGCCATGAAAGG
GTGCAAACTTCCTCAAAGACCTAAAAAGAGAGCCAAGTTGCTACAAAAAAGCTTAGTTCTGGTGATGCCAGGCTCGTGGCTATCAGACTTGTGCCAAGAGAGGTATGAAG
TAAGGGAGAAAAAGACTTCAAAGAAGGTAAAAGAAAAAACAAACCCCACTTCTCCAAGTAAACCATTAATGCAAAACATTCACTGA
mRNA sequence
AAACAAACAAAGAGAGAAAGAAAAGAAGAAGAGAGAGAGAGAGAGCTTCTCTGCAACTGAACTGAAGAAGAGCGGCCGATTTTTCTTCTCTGTTCCCTTTTGAATGGTAA
AAACTAACTGTGACCCATAAAGCTTCAATGGCTCTTCTTCCATTGTAGCCGCTCTTCTCTTCTTCTTCTTTTGAAGATTGCTTTCCCATTTCCATTTTTCTCTCTGCGGG
TTCAACGGTTCTTTTTACTCTACTGGGTCTTCCATTGCAACTCTGTTGGTGATGGTGTTGACGATGATGATGATTACAGAGATGGAGAAGCTTTTTTGATGGCTTTTCGA
CATGGGTTATTAGAGGGAAATCGTGTTTGATAATCCACTGAGTTTTGGCCTTGGATCTTGAGCGATATGGAGAACGAAGGAATAGCTCAGAAAAGCAATAAATCTGGTGA
AACAGATTTTCTGTTGCAGTGGGGGAACAGGAAGAGGATGAGATATATGAAAGTCAAGGAACCACAAAGAGTTACTAGTAAACCCGATTGTTTAGGGAAGAAGAAAGTTT
CTTCTAATGGTGATCGCCGTGTTGTTACAGCTGAGAAAGCTTCATCTACTCAGCAAACCCATCGCCTTAGCAAGAACATTGTTTCACCAACTAGCAATCAAAAAACATTA
GCAACATCACCAGAAAAGGAAGATCGATACTATACGACAAGGGGATCGATGGGCGTGGATGAAAAAGCCTCAATGGATCACCCCACAGGGAACGACCGAAAAGGGTTTGT
CTGGCCAAGGCTCTATATTTCTCTATCTAGCAAGGAGAAGGAAGAAGATTTCATGGCCATGAAAGGGTGCAAACTTCCTCAAAGACCTAAAAAGAGAGCCAAGTTGCTAC
AAAAAAGCTTAGTTCTGGTGATGCCAGGCTCGTGGCTATCAGACTTGTGCCAAGAGAGGTATGAAGTAAGGGAGAAAAAGACTTCAAAGAAGGTAAAAGAAAAAACAAAC
CCCACTTCTCCAAGTAAACCATTAATGCAAAACATTCACTGATGAAATGTTATACAAAAGAGAGGTATGAAGTAAGGGAGAAAAAGACTTCAAAGAAGGTAAAAGAAAAA
ACAAACCCCACTTCTCCAAGTAAACCATTAATGCAAAACATTCACTGATGAAATGTTATACAAAAGAGAGGTATGAAGTAAGGGAGAAAAAGACTTCAAAGAAGAGACCA
AGAGGATTGAAGGCCATGAGAAGTATGGATAGTGAATCAGAATGATGACTGAATCAATAAGAGACACAAATAGAGACATATGGATAGAATAACTTCTTGTTTGGTAGCCA
TATTTTGGAGAAAGAAAGAAAGAAAGAAAAAGAGAGAGGGACAGTAGACTTTTTTGGTGGGTTTTTTTTTTGTTTTTTTTTTTGGGTTGAAATTGATTGGGCATTTTTCA
GAATTTGAGATGGTTTGGTTGTAATGTAAGTCACCCAATTTCTTCTAGTGATATTTGGCTAATTGAGTAAAATTTAATTTGGTAGAAATGATTATACTTTTTCCTCTTCA
A
Protein sequence
MENEGIAQKSNKSGETDFLLQWGNRKRMRYMKVKEPQRVTSKPDCLGKKKVSSNGDRRVVTAEKASSTQQTHRLSKNIVSPTSNQKTLATSPEKEDRYYTTRGSMGVDEK
ASMDHPTGNDRKGFVWPRLYISLSSKEKEEDFMAMKGCKLPQRPKKRAKLLQKSLVLVMPGSWLSDLCQERYEVREKKTSKKVKEKTNPTSPSKPLMQNIH
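
The protein sequence is the translation of the CDS sequence. As a rough consistency check, the Python sketch below translates the first 30 nucleotides of the CDS with the standard genetic code and compares the result with the start of the protein. The function name translate is hypothetical, and the use of the standard code (NCBI translation table 1) is an assumption, since the record does not say which table was used.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends the translation
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nucleotides of the CDS sequence listed for this gene.
cds_prefix = "ATGGAGAACGAAGGAATAGCTCAGAAAAGC"
peptide = translate(cds_prefix)
print(peptide)
```

The result, MENEGIAQKS, matches the first ten residues of the protein sequence. The same function applied to the full CDS (with line breaks removed) should stop at the terminal TGA codon.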