; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC04g0213 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC04g0213
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionProtein of unknown function (DUF1068)
Genome locationMC04:1674661..1678109
RNA-Seq ExpressionMC04g0213
SyntenyMC04g0213
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR010471 - Protein of unknown function DUF1068


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6572284.1 hypothetical protein SDJN03_29012, partial [Cucurbita argyrosperma subsp. sororia]1.09e-11589.12Show/hide
Query:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLL
        MAIQA+SC P LIKFGLA IAL IAGYI+GPPLYWHF EGLAVV+RSSSSSSCPPCFCDCPSQP+I+IP+ELRNTTFGDC+K DPEVSQDTEKNFADLLL
Subjt:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLL

Query:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA
        EELKLKE EALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTA+WE RARQRGWKEG  +SR QKQ NIQTA
Subjt:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA

XP_022136333.1 uncharacterized protein LOC111008045 [Momordica charantia]3.57e-130100Show/hide
Query:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLL
        MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLL
Subjt:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLL

Query:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA
        EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA
Subjt:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA

XP_022952351.1 uncharacterized protein LOC111455059 isoform X1 [Cucurbita moschata]7.64e-11689.12Show/hide
Query:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLL
        MAIQA+SC P+LIKFGLA IAL IAGYI+GPPLYWHF EGLAVV+RSSSSSSCPPCFCDCPSQP+I+IP+ELRNTTFGDC+K DPEVSQDTEKNFADLLL
Subjt:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLL

Query:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA
        EELKLKE EALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTA+WE RARQRGWKEG  +SR QKQ NIQTA
Subjt:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA

XP_022969266.1 uncharacterized protein LOC111468321 [Cucurbita maxima]3.11e-11588.6Show/hide
Query:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLL
        MAIQA+SC P+LIKFGLA IAL IAGYI+ PPLYWHF EGLAVV+RSSSSSSCPPCFCDCPSQP+I+IP+ELRNTTFGDC+K DPEVSQDTEKNFADLLL
Subjt:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLL

Query:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA
        EELKLKE EALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTA+WE RARQRGWKEG  +SR QKQ NIQTA
Subjt:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA

XP_023554621.1 uncharacterized protein LOC111811814 isoform X1 [Cucurbita pepo subsp. pepo]1.80e-11488.08Show/hide
Query:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLL
        MAIQA+SC P+LIKFGLA IAL IAGYI+GPPLYWHF EGLAVV+RSSSSSSCPPCFCDCPSQP+I+IP+ELRNTTFGDC+K DPEVSQDTEKNFADLLL
Subjt:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLL

Query:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA
        EELKLKE EALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTA+WE RAR+RGWKEG  +SR QKQ NI TA
Subjt:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA

TrEMBL top hitse value%identityAlignment
A0A1S3C135 uncharacterized protein LOC103495614 isoform X12.05e-11387.56Show/hide
Query:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLL
        MAIQ TSC P+LIK GLA IA+ I GYI+GPPLYWHF EGLAVVS SS+SSSCPPCFCDCPSQPVISIPEELRN+TF DCVK DPEVS+DTEKNFADLLL
Subjt:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLL

Query:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA
        EELKLKEAEALENQRRAD+ALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVL AQKRLTA WE RARQRGWKEG  +SR QKQGNIQTA
Subjt:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA

A0A6J1C7C2 uncharacterized protein LOC1110080451.73e-130100Show/hide
Query:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLL
        MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLL
Subjt:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLL

Query:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA
        EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA
Subjt:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA

A0A6J1ERF9 uncharacterized protein LOC111437126 isoform X12.50e-11489.64Show/hide
Query:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLL
        MAIQA SC P+LIK GLA IAL IAGYI+GPPLYWH  EG AVVSRSSSSSSCPPCFCDCPSQPVISIPEELRN+TF DCVK DPEVSQDTEKNFADLLL
Subjt:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLL

Query:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA
        EELKLKEAEA E+ RRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTA WE RARQRGWKEGA +SRIQKQGNIQTA
Subjt:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA

A0A6J1GK01 uncharacterized protein LOC111455059 isoform X13.70e-11689.12Show/hide
Query:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLL
        MAIQA+SC P+LIKFGLA IAL IAGYI+GPPLYWHF EGLAVV+RSSSSSSCPPCFCDCPSQP+I+IP+ELRNTTFGDC+K DPEVSQDTEKNFADLLL
Subjt:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLL

Query:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA
        EELKLKE EALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTA+WE RARQRGWKEG  +SR QKQ NIQTA
Subjt:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA

A0A6J1HXB7 uncharacterized protein LOC1114683211.51e-11588.6Show/hide
Query:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLL
        MAIQA+SC P+LIKFGLA IAL IAGYI+ PPLYWHF EGLAVV+RSSSSSSCPPCFCDCPSQP+I+IP+ELRNTTFGDC+K DPEVSQDTEKNFADLLL
Subjt:  MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLL

Query:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA
        EELKLKE EALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTA+WE RARQRGWKEG  +SR QKQ NIQTA
Subjt:  EELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G05070.1 Protein of unknown function (DUF1068)4.4e-6162.3Show/hide
Query:  ALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLLEELKLKEAEA
        A +K GLA + L +AGYI+GPPLYWH  E LA V    S+SSCP C C+C +   ++IP+EL N +F DC K DPEV++DTEKN+A+LL EELKL+EAE+
Subjt:  ALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLLEELKLKEAEA

Query:  LENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA
        LE  +RADM LLEAKK+TS YQKEADKCNSGMETCEEAREKAE  LA QK+LT+ WE RARQ+GW+EG+ +  ++ + N+Q A
Subjt:  LENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA

AT2G32580.1 Protein of unknown function (DUF1068)4.3e-5659.02Show/hide
Query:  ALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLLEELKLKEAEA
        A +K GLA +AL + GYI+GPPLYWH  E LAV     S++SC  C CDC S P+++IP  L N +F DC KRDPEV++DTEKN+A+LL EELK +EA +
Subjt:  ALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLLEELKLKEAEA

Query:  LENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA
        +E  +R D  LLEAKK+TS YQKEADKCNSGMETCEEAREKAE  L  QK+LT++WE RARQ+G+K+GA +S ++ +   + A
Subjt:  LENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA

AT2G32580.2 Protein of unknown function (DUF1068)5.4e-3562.61Show/hide
Query:  DCVKRDPEVSQDTEKNFADLLLEELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEG
        +C KRDPEV++DTEKN+A+LL EELK +EA ++E  +R D  LLEAKK+TS YQKEADKCNSGMETCEEAREKAE  L  QK+LT++WE RARQ+G+K+G
Subjt:  DCVKRDPEVSQDTEKNFADLLLEELKLKEAEALENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEG

Query:  AGRSRIQKQGNIQTA
        A +S ++ +   + A
Subjt:  AGRSRIQKQGNIQTA

AT4G04360.1 Protein of unknown function (DUF1068)1.2e-4557.58Show/hide
Query:  IALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLLEELKLKEAEALENQRRADM
        + LCI  YI GP LYWH  E +A     S  SSCPPC CDC SQP++SIP+ L N +F DC++ + E S+++E +F +++ EELKL+EA+A E++ RAD 
Subjt:  IALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLLEELKLKEAEALENQRRADM

Query:  ALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRI
         LL+AKK  SQYQKEADKC+ GMETCE AREKAEA L  Q+RL+ +WELRARQ GWKEG   S +
Subjt:  ALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRI

AT4G30996.1 Protein of unknown function (DUF1068)1.9e-3548.45Show/hide
Query:  LATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQ-PVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLLEELKLKEAEALENQR
        L   A+  A  + GP LYW F +G   V  + ++S CPPC CDCP    ++ I   L N +  DC   DPE+ Q+ EK F DLL EELKL+EA A E+ R
Subjt:  LATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQ-PVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLLEELKLKEAEALENQR

Query:  RADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWK
          ++ L EAK++ SQYQKEA+KCN+  E CE ARE+AEA+L  ++++T++WE RARQ GW+
Subjt:  RADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAATTCAGGCCACTTCTTGCACTCCGGCACTGATTAAGTTCGGATTGGCTACCATTGCCCTCTGTATCGCCGGCTACATTATCGGCCCTCCTCTCTATTGGCACTT
CGTCGAAGGTTTGGCCGTCGTTAGCCGCTCCTCCTCTTCTTCCTCCTGCCCTCCTTGTTTCTGCGACTGCCCTTCTCAGCCCGTCATCTCAATTCCCGAAGAATTGAGGA
ACACTACCTTTGGAGATTGTGTTAAGCGTGACCCAGAAGTGAGTCAAGACACCGAGAAGAACTTTGCAGACCTATTGTTAGAGGAGCTGAAGCTAAAGGAAGCCGAAGCG
TTGGAAAATCAGCGACGTGCTGATATGGCTCTACTCGAGGCGAAGAAGATGACATCTCAGTATCAAAAAGAGGCAGACAAGTGCAATTCCGGGATGGAAACATGTGAAGA
AGCGAGAGAGAAAGCCGAGGCAGTATTAGCTGCACAGAAGAGACTAACAGCAGTGTGGGAGCTAAGGGCTCGCCAAAGGGGATGGAAGGAAGGGGCTGGCAGGTCTCGTA
TCCAAAAGCAGGGAAATATTCAGACTGCATAA
mRNA sequenceShow/hide mRNA sequence
ATATGAGTCATGTGACGCGGAATTCCAGCAAGCTAAAGCCATAAATTTTCATCTTATAATAAAATTTGAAAACAAAGAAAGAGGTGAGAGAAAGTGATAGAGAGATGATT
TGTTCCCAAAAATTTCAGAAGAGCGAATTAGAATAGATTGCAGTACCCAATAAAACTCCGATTCTCCTCTTTGTTCGTAAGAGATTCAGTTGCAATCCCCCACTTATTTA
TTCCAACTATTCACTCTCTGCTTACCAATTTCTCCGATTCATCGCCGGAGCCGATGGCAATTCAGGCCACTTCTTGCACTCCGGCACTGATTAAGTTCGGATTGGCTACC
ATTGCCCTCTGTATCGCCGGCTACATTATCGGCCCTCCTCTCTATTGGCACTTCGTCGAAGGTTTGGCCGTCGTTAGCCGCTCCTCCTCTTCTTCCTCCTGCCCTCCTTG
TTTCTGCGACTGCCCTTCTCAGCCCGTCATCTCAATTCCCGAAGAATTGAGGAACACTACCTTTGGAGATTGTGTTAAGCGTGACCCAGAAGTGAGTCAAGACACCGAGA
AGAACTTTGCAGACCTATTGTTAGAGGAGCTGAAGCTAAAGGAAGCCGAAGCGTTGGAAAATCAGCGACGTGCTGATATGGCTCTACTCGAGGCGAAGAAGATGACATCT
CAGTATCAAAAAGAGGCAGACAAGTGCAATTCCGGGATGGAAACATGTGAAGAAGCGAGAGAGAAAGCCGAGGCAGTATTAGCTGCACAGAAGAGACTAACAGCAGTGTG
GGAGCTAAGGGCTCGCCAAAGGGGATGGAAGGAAGGGGCTGGCAGGTCTCGTATCCAAAAGCAGGGAAATATTCAGACTGCATAAAAGGGCATTCCAAACAGTAACACTT
CCTCCCATTCTGAATTTTTGTTTTCCTATATATATATATTCTTCCGAGCAGTAACTTAAATCTTACTCTACGTCGCTACAAATAGATCGAAGGGATTATTACAGGTATCT
TTAGCAAATCGAAATGTCAATTACCATTTTAGTTAATCACAAAGAAGAAAACGTTCATTAGTTGAAAGCAACTAATGTATGCAAGAAGTTGTGTTACTCTACTCTCATTT
GTCCATCTCCATTGTGTATTTTGAAGTTCCATTCTCTGACTGTGAAAGCAAAATTCTTCTGGGTTTAAACAAGTTTGATAGTATACAAAATTCAAATACTCTCTTGCTTT
GCCAAAATCATGGTATACATTATACTATCAGAGAAAGAAAAAGGTAGCAAAAGAGGCAGCAAGTTCCCAAATATTTGTATAAGTTTTAGATAATTTTATTAGTGCATGTT
GAGCCGAAACCTTCTGATATGATCACAATCACAACAGCATTTAATTTACATCAAAAGGGCAGTTTCAGTTAGCCCTTTGTAGGCCTGAAATTGAAGCTCCCTCCCCTGTA
ATGGAATCATACCTTCGGAATATGTAGGCGACGAACACCACCGCCGTGAGCACCACCAAGTCCTTGGCCGCACACTGCTTATTGTGCTCGGTCGGCGTCCCGGTCTGCGG
GCCGTTCGACCAGAACAGATAATTCAGCAGCTCGGCCCCTTTCTCCGCCCTGAACCGGTCCGGCTTAAACACCTCCGGCTCCTCGAATACGTTCGGATCCCTCATCGCCA
GCGGCTGATACCCACACAGCAGCTCCCCCTTCTTCACCTTGAACACCGAGTCGTGTGAACTCAGCTCGAAATCCTTTCTGGCTCTCGCGAATTGGGAGGGAACCGGCGGG
TCCAGTCGGAGCGCCTCGTACACCACCGAGTAGACCAGCTCCAACTCCTTCACCGACTCGAATGTCAGACCCGACCCGGTTTTCTCCCTGACTTCCTTCACGATCCTCTC
CTGTAACCCGGTCTTGTCGCTCGCTATCCGACCGAGTAGAATCGGCAGGAACAGACTGAACCCACCGTACGCGTTGAATCCCAATGTGAAGATGAGATTGTGGATCGCTT
CGTCCTCCTTCAACCCGAACTCGGTCACGCCTCGGCGGATTGCTTCTTCCCCTGTAGAATACAAATCCCAGAAAATGGCGAATCAAAAAAAAAAAAAATGAGAGGAGATT
GAATCGGTGATTTTGAGTCGAGAGGAGAGGGTACCTTCTTTTTGGATGAAGTTGTAGAGCTTCCGGTAGCGGCCGGCGATGAGGAAGAATGGGTAGGAGAAGGAGTGGAG
GAAGATTTCTTCGAGGGGCTGGAGAATGCCGATGTTAATGGTGGGAAGGAGCTGGAGGCCGAGCCAGAGGTTGACGTCGAAATAGCCGGAATTGGCGATGTCGGGGGATT
TTGCAGTGTCGGCGCCGGCTAGGGTTTTGGCGAAGAAACTGAATAGGGCTTGCTGGAGATTGAATAGATAGTCGGATTTTCCGCTCTTGATAACGTCGGATTCAATGGCG
TCAAACGCCGTGGAGAACTTGGATTCCAGCTCCGGGATCCATATCTTGGAGCTCCGCCGGAGAACGTCTAGAACGAAGTTTTTTACCTGGCAGTAAATTTCTGGTGATGA
GAAATTAATTCAAAAAATAATAATTAGAAAGTTATTATTTTAGTATCTGGA
Protein sequenceShow/hide protein sequence
MAIQATSCTPALIKFGLATIALCIAGYIIGPPLYWHFVEGLAVVSRSSSSSSCPPCFCDCPSQPVISIPEELRNTTFGDCVKRDPEVSQDTEKNFADLLLEELKLKEAEA
LENQRRADMALLEAKKMTSQYQKEADKCNSGMETCEEAREKAEAVLAAQKRLTAVWELRARQRGWKEGAGRSRIQKQGNIQTA