; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cla97C10G192700 (gene) of Watermelon (97103) v2.5 genome

Gene IDCla97C10G192700
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionFUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane, vacuole; EXPRESSED IN: cultured cell;
Genome locationCla97Chr10:20533766..20538522
RNA-Seq ExpressionCla97C10G192700
SyntenyCla97C10G192700
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004147506.1 uncharacterized protein LOC101214901 [Cucumis sativus]5.5e-8683.11Show/hide
Query:  MERDGEGAPAAA------SSSSSQTTRPSRSVDPLLVTCRFFSVITALAAILCIVSNVISAIRSFKNQSDIFDGIFRCYAVVIAFFVVLAETEWEFIFKN
        MER+GEGAPA A      SSSSSQ TRP RSVDPLLVTCRFFSVITAL AILCIVSNVISAIRSFKNQSDIFDGIFRCYAVVIAFF VLAETEWEFIFKN
Subjt:  MERDGEGAPAAA------SSSSSQTTRPSRSVDPLLVTCRFFSVITALAAILCIVSNVISAIRSFKNQSDIFDGIFRCYAVVIAFFVVLAETEWEFIFKN

Query:  WKAITDMSEARNLPRSSIQFSGSVSKLVLEYWAGRGMLQIFVAVMTRAFPVYSVEQRELILLQDAASYLLLACGAVYVVSGILCIGFLKRAREKKETAKD
        WK                         VLEYWAGRGMLQIFVAVMTRAFPVYSVEQRELILLQDAASYLLLACGAVYVVSGILCIGFLKRAREKKETAKD
Subjt:  WKAITDMSEARNLPRSSIQFSGSVSKLVLEYWAGRGMLQIFVAVMTRAFPVYSVEQRELILLQDAASYLLLACGAVYVVSGILCIGFLKRAREKKETAKD

Query:  RVVKDLQELERQKQELEQLLISETV
        +VVKDLQELERQKQELEQLLISETV
Subjt:  RVVKDLQELERQKQELEQLLISETV

XP_008454462.1 PREDICTED: uncharacterized protein LOC103494861 [Cucumis melo]1.9e-8683.86Show/hide
Query:  MERDGEGAPAA----ASSSSSQTTRPSRSVDPLLVTCRFFSVITALAAILCIVSNVISAIRSFKNQSDIFDGIFRCYAVVIAFFVVLAETEWEFIFKNWK
        MER+GEGAPAA    +SSSSSQ TRP RSVDPLLVTCRFFSVITAL AILCIVSNVISAIRSFKNQSDIFDGIFRCYAVVI FFVVLAETEWEFIFKNWK
Subjt:  MERDGEGAPAA----ASSSSSQTTRPSRSVDPLLVTCRFFSVITALAAILCIVSNVISAIRSFKNQSDIFDGIFRCYAVVIAFFVVLAETEWEFIFKNWK

Query:  AITDMSEARNLPRSSIQFSGSVSKLVLEYWAGRGMLQIFVAVMTRAFPVYSVEQRELILLQDAASYLLLACGAVYVVSGILCIGFLKRAREKKETAKDRV
                                 VLEYWAGRGMLQIFVAVMTRAFPVYSVEQRELILLQDAASYLLLACGAVYVVSGILCIGFLKRAREKKETAKD+V
Subjt:  AITDMSEARNLPRSSIQFSGSVSKLVLEYWAGRGMLQIFVAVMTRAFPVYSVEQRELILLQDAASYLLLACGAVYVVSGILCIGFLKRAREKKETAKDRV

Query:  VKDLQELERQKQELEQLLISETV
        VKDLQELERQKQELEQLLISETV
Subjt:  VKDLQELERQKQELEQLLISETV

XP_022922529.1 uncharacterized protein LOC111430498 [Cucurbita moschata]2.6e-8077.68Show/hide
Query:  MERDGEGA-----PAAASSSSSQTTRPSRSVDPLLVTCRFFSVITALAAILCIVSNVISAIRSFKNQSDIFDGIFRCYAVVIAFFVVLAETEWEFIFKNW
        MER GEGA      AA ++SSSQT+RP R VDPLLVTCRFFSV+TAL AILCIVSNVI+AIRSFKN+SDIFDGIFRCYAVVIAFFVVLAETEWEFI KNW
Subjt:  MERDGEGA-----PAAASSSSSQTTRPSRSVDPLLVTCRFFSVITALAAILCIVSNVISAIRSFKNQSDIFDGIFRCYAVVIAFFVVLAETEWEFIFKNW

Query:  KAITDMSEARNLPRSSIQFSGSVSKLVLEYWAGRGMLQIFVAVMTRAFPVYSVEQRELILLQDAASYLLLACGAVYVVSGILCIGFLKRAREKKETAKDR
        K                         VLEYWAGRGMLQIFVAVMTRAFP YSVEQRE ILLQ+AASYLLLACGAVYVVSGILCIGFLKRARE+KET+KDR
Subjt:  KAITDMSEARNLPRSSIQFSGSVSKLVLEYWAGRGMLQIFVAVMTRAFPVYSVEQRELILLQDAASYLLLACGAVYVVSGILCIGFLKRAREKKETAKDR

Query:  VVKDLQELERQKQELEQLLISETV
        VVKDLQELERQKQELEQLLIS++V
Subjt:  VVKDLQELERQKQELEQLLISETV

XP_023552582.1 uncharacterized protein LOC111810200 isoform X2 [Cucurbita pepo subsp. pepo]3.5e-8077.23Show/hide
Query:  MERDGEGA-----PAAASSSSSQTTRPSRSVDPLLVTCRFFSVITALAAILCIVSNVISAIRSFKNQSDIFDGIFRCYAVVIAFFVVLAETEWEFIFKNW
        MER GEGA      AA ++SSSQT+RP R VDPLLVTCRFFSV+TAL AILCIVSNV++AIRSFKN+SDIFDGIFRCYAVVIAFFVVLAETEWEFI KNW
Subjt:  MERDGEGA-----PAAASSSSSQTTRPSRSVDPLLVTCRFFSVITALAAILCIVSNVISAIRSFKNQSDIFDGIFRCYAVVIAFFVVLAETEWEFIFKNW

Query:  KAITDMSEARNLPRSSIQFSGSVSKLVLEYWAGRGMLQIFVAVMTRAFPVYSVEQRELILLQDAASYLLLACGAVYVVSGILCIGFLKRAREKKETAKDR
        K                         VLEYWAGRGMLQIFVAVMTRAFP YSVEQRE ILLQ+AASYLLLACGAVYVVSGILCIGFLKRARE+KET+KDR
Subjt:  KAITDMSEARNLPRSSIQFSGSVSKLVLEYWAGRGMLQIFVAVMTRAFPVYSVEQRELILLQDAASYLLLACGAVYVVSGILCIGFLKRAREKKETAKDR

Query:  VVKDLQELERQKQELEQLLISETV
        VVKDLQELERQKQELEQLLIS++V
Subjt:  VVKDLQELERQKQELEQLLISETV

XP_038905258.1 uncharacterized protein LOC120091338 isoform X1 [Benincasa hispida]5.5e-8683.56Show/hide
Query:  MERDGEGAPAAASSSSSQTTRPSRSVDPLLVTCRFFSVITALAAILCIVSNVISAIRSFKNQSDIFDGIFRCYAVVIAFFVVLAETEWEFIFKNWKAITD
        MERDGEGAPAAA+SSSSQT RP R VDPLLVTCRFFSV+TAL AILCIVSNVISAIRSFKN+SD+FDGIFRCYAVVIA FVVLAETEWEFI KNWK    
Subjt:  MERDGEGAPAAASSSSSQTTRPSRSVDPLLVTCRFFSVITALAAILCIVSNVISAIRSFKNQSDIFDGIFRCYAVVIAFFVVLAETEWEFIFKNWKAITD

Query:  MSEARNLPRSSIQFSGSVSKLVLEYWAGRGMLQIFVAVMTRAFPVYSVEQRELILLQDAASYLLLACGAVYVVSGILCIGFLKRAREKKETAKDRVVKDL
                             VLEYWAGRGMLQIFVAVMTRAFPVYSVEQRELILLQDAASYLLLACGAVYVVSGILCIGFLKRARE+KETAKDRVVKDL
Subjt:  MSEARNLPRSSIQFSGSVSKLVLEYWAGRGMLQIFVAVMTRAFPVYSVEQRELILLQDAASYLLLACGAVYVVSGILCIGFLKRAREKKETAKDRVVKDL

Query:  QELERQKQELEQLLISETV
        QELERQKQELEQLLISETV
Subjt:  QELERQKQELEQLLISETV

TrEMBL top hitse value%identityAlignment
A0A0A0KYH6 Uncharacterized protein2.7e-8683.11Show/hide
Query:  MERDGEGAPAAA------SSSSSQTTRPSRSVDPLLVTCRFFSVITALAAILCIVSNVISAIRSFKNQSDIFDGIFRCYAVVIAFFVVLAETEWEFIFKN
        MER+GEGAPA A      SSSSSQ TRP RSVDPLLVTCRFFSVITAL AILCIVSNVISAIRSFKNQSDIFDGIFRCYAVVIAFF VLAETEWEFIFKN
Subjt:  MERDGEGAPAAA------SSSSSQTTRPSRSVDPLLVTCRFFSVITALAAILCIVSNVISAIRSFKNQSDIFDGIFRCYAVVIAFFVVLAETEWEFIFKN

Query:  WKAITDMSEARNLPRSSIQFSGSVSKLVLEYWAGRGMLQIFVAVMTRAFPVYSVEQRELILLQDAASYLLLACGAVYVVSGILCIGFLKRAREKKETAKD
        WK                         VLEYWAGRGMLQIFVAVMTRAFPVYSVEQRELILLQDAASYLLLACGAVYVVSGILCIGFLKRAREKKETAKD
Subjt:  WKAITDMSEARNLPRSSIQFSGSVSKLVLEYWAGRGMLQIFVAVMTRAFPVYSVEQRELILLQDAASYLLLACGAVYVVSGILCIGFLKRAREKKETAKD

Query:  RVVKDLQELERQKQELEQLLISETV
        +VVKDLQELERQKQELEQLLISETV
Subjt:  RVVKDLQELERQKQELEQLLISETV

A0A1S3BY74 uncharacterized protein LOC1034948619.2e-8783.86Show/hide
Query:  MERDGEGAPAA----ASSSSSQTTRPSRSVDPLLVTCRFFSVITALAAILCIVSNVISAIRSFKNQSDIFDGIFRCYAVVIAFFVVLAETEWEFIFKNWK
        MER+GEGAPAA    +SSSSSQ TRP RSVDPLLVTCRFFSVITAL AILCIVSNVISAIRSFKNQSDIFDGIFRCYAVVI FFVVLAETEWEFIFKNWK
Subjt:  MERDGEGAPAA----ASSSSSQTTRPSRSVDPLLVTCRFFSVITALAAILCIVSNVISAIRSFKNQSDIFDGIFRCYAVVIAFFVVLAETEWEFIFKNWK

Query:  AITDMSEARNLPRSSIQFSGSVSKLVLEYWAGRGMLQIFVAVMTRAFPVYSVEQRELILLQDAASYLLLACGAVYVVSGILCIGFLKRAREKKETAKDRV
                                 VLEYWAGRGMLQIFVAVMTRAFPVYSVEQRELILLQDAASYLLLACGAVYVVSGILCIGFLKRAREKKETAKD+V
Subjt:  AITDMSEARNLPRSSIQFSGSVSKLVLEYWAGRGMLQIFVAVMTRAFPVYSVEQRELILLQDAASYLLLACGAVYVVSGILCIGFLKRAREKKETAKDRV

Query:  VKDLQELERQKQELEQLLISETV
        VKDLQELERQKQELEQLLISETV
Subjt:  VKDLQELERQKQELEQLLISETV

A0A6J1D697 uncharacterized protein LOC1110177041.0e-6969Show/hide
Query:  MERDGE---------GAPAAASSSSSQTTRPSRSVDPLLVTCRFFSVITALAAILCIVSNVISAIRSFKNQSDIFDGIFRCYAVVIAFFVVLAETEWEFI
        M +DGE          A +++SSSSS TTR  R VDPLLVTCRFFSV+TAL AILCIV NVISA+RSFK+++DIFDGIFRCYAV+IA FVVLAETEWEFI
Subjt:  MERDGE---------GAPAAASSSSSQTTRPSRSVDPLLVTCRFFSVITALAAILCIVSNVISAIRSFKNQSDIFDGIFRCYAVVIAFFVVLAETEWEFI

Query:  FKNWKAITDMSEARNLPRSSIQFSGSVSKLVLEYWAGRGMLQIFVAVMTRAFPVYSVEQRELILLQDAASYLLLACGAVYVVSGILCIGFLKRAREKKET
         K WK                         VLEYWAGRGMLQIFVAVMTRAFP YS +QRELI+LQD ASYLLL CGAVYV SGILC+GFLKRARE+KET
Subjt:  FKNWKAITDMSEARNLPRSSIQFSGSVSKLVLEYWAGRGMLQIFVAVMTRAFPVYSVEQRELILLQDAASYLLLACGAVYVVSGILCIGFLKRAREKKET

Query:  AKDRVVKDLQELERQKQELE-QLLISETV
        AK+R VKDLQELERQKQELE +LLI+E+V
Subjt:  AKDRVVKDLQELERQKQELE-QLLISETV

A0A6J1E3M8 uncharacterized protein LOC1114304981.3e-8077.68Show/hide
Query:  MERDGEGA-----PAAASSSSSQTTRPSRSVDPLLVTCRFFSVITALAAILCIVSNVISAIRSFKNQSDIFDGIFRCYAVVIAFFVVLAETEWEFIFKNW
        MER GEGA      AA ++SSSQT+RP R VDPLLVTCRFFSV+TAL AILCIVSNVI+AIRSFKN+SDIFDGIFRCYAVVIAFFVVLAETEWEFI KNW
Subjt:  MERDGEGA-----PAAASSSSSQTTRPSRSVDPLLVTCRFFSVITALAAILCIVSNVISAIRSFKNQSDIFDGIFRCYAVVIAFFVVLAETEWEFIFKNW

Query:  KAITDMSEARNLPRSSIQFSGSVSKLVLEYWAGRGMLQIFVAVMTRAFPVYSVEQRELILLQDAASYLLLACGAVYVVSGILCIGFLKRAREKKETAKDR
        K                         VLEYWAGRGMLQIFVAVMTRAFP YSVEQRE ILLQ+AASYLLLACGAVYVVSGILCIGFLKRARE+KET+KDR
Subjt:  KAITDMSEARNLPRSSIQFSGSVSKLVLEYWAGRGMLQIFVAVMTRAFPVYSVEQRELILLQDAASYLLLACGAVYVVSGILCIGFLKRAREKKETAKDR

Query:  VVKDLQELERQKQELEQLLISETV
        VVKDLQELERQKQELEQLLIS++V
Subjt:  VVKDLQELERQKQELEQLLISETV

A0A6J1J849 uncharacterized protein LOC1114826122.2e-8077.68Show/hide
Query:  MERDGEGA-----PAAASSSSSQTTRPSRSVDPLLVTCRFFSVITALAAILCIVSNVISAIRSFKNQSDIFDGIFRCYAVVIAFFVVLAETEWEFIFKNW
        MER GEGA      AA ++SSSQT RP R VDPLLVTCRFFSV+TAL AILCIVSNVI+AIRSFKN+SDIFDGIFRCYAVVIAFFVVLAETEWEFI KNW
Subjt:  MERDGEGA-----PAAASSSSSQTTRPSRSVDPLLVTCRFFSVITALAAILCIVSNVISAIRSFKNQSDIFDGIFRCYAVVIAFFVVLAETEWEFIFKNW

Query:  KAITDMSEARNLPRSSIQFSGSVSKLVLEYWAGRGMLQIFVAVMTRAFPVYSVEQRELILLQDAASYLLLACGAVYVVSGILCIGFLKRAREKKETAKDR
        K                         VLEYWAGRGMLQIFVAVMTRAFP YSVEQRE ILLQ+AASYLLLACGAVYVVSGILCIGFLKRARE+KET+KDR
Subjt:  KAITDMSEARNLPRSSIQFSGSVSKLVLEYWAGRGMLQIFVAVMTRAFPVYSVEQRELILLQDAASYLLLACGAVYVVSGILCIGFLKRAREKKETAKDR

Query:  VVKDLQELERQKQELEQLLISETV
        VVKDLQELERQKQELEQLLIS++V
Subjt:  VVKDLQELERQKQELEQLLISETV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G33625.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane, vacuole; EXPRESSED IN: cultured cell; CONTAINS InterPro DOMAIN/s: Golgi apparatus membrane protein TVP15 (InterPro:IPR013714); Has 59 Blast hits to 59 proteins in 18 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 50; Viruses - 0; Other Eukaryotes - 9 (source: NCBI BLink).4.4e-5757.14Show/hide
Query:  EGAPAAASSSSSQTTRPSRSVDPLLVTCRFFSVITALAAILCIVSNVISAIRSFKNQSDIFDGIFRCYAVVIAFFVVLAETEWEFIFKNWKAITDMSEAR
        E +PA  SS S++    +R+ DP LV CR FS++T+L AILC+V NV++A+RSF++  D+FDGIFRCYAVVIA FVVL ETEW FI K  K         
Subjt:  EGAPAAASSSSSQTTRPSRSVDPLLVTCRFFSVITALAAILCIVSNVISAIRSFKNQSDIFDGIFRCYAVVIAFFVVLAETEWEFIFKNWKAITDMSEAR

Query:  NLPRSSIQFSGSVSKLVLEYWAGRGMLQIFVAVMTRAFPVYSVEQRELILLQDAASYLLLACGAVYVVSGILCIGFLKRAREKKETAKDRVVKDLQELER
                        VLEYWAGRGMLQIFVAVMTRAFP Y  ++++L+LLQ+ ASYLLLACG +YV+SG+LCIGFLKRAR++KE ++++ VKDL+E+ R
Subjt:  NLPRSSIQFSGSVSKLVLEYWAGRGMLQIFVAVMTRAFPVYSVEQRELILLQDAASYLLLACGAVYVVSGILCIGFLKRAREKKETAKDRVVKDLQELER

Query:  QKQELEQLLI
        +K+ELEQLL+
Subjt:  QKQELEQLLI

AT4G33625.2 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; EXPRESSED IN: cultured cell; CONTAINS InterPro DOMAIN/s: Golgi apparatus membrane protein TVP15 (InterPro:IPR013714); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink).1.4e-5557.14Show/hide
Query:  EGAPAAASSSSSQTTRPSRSVDPLLVTCRFFSVITALAAILCIVSNVISAIRSFKNQSDIFDGIFRCYAVVIAFFVVLAETEWEFIFKNWKAITDMSEAR
        E +PA  SS S++    +R+ DP LV CR FS++T+L AILC+V NV++A+RSF++  D+FDGIFRCYAVVIA FVVL ETEW FI K  K         
Subjt:  EGAPAAASSSSSQTTRPSRSVDPLLVTCRFFSVITALAAILCIVSNVISAIRSFKNQSDIFDGIFRCYAVVIAFFVVLAETEWEFIFKNWKAITDMSEAR

Query:  NLPRSSIQFSGSVSKLVLEYWAGRGMLQIFVAVMTRAFPVYSVEQRELILLQDAASYLLLACGAVYVVSGILCIGFLKRAREKKETAKDRVVKDLQELER
                        VLEYWAGRGMLQIFVAVMTRAFP Y  ++++L+LLQ+ ASYLLLACG +YV+SG+LCIGFLKRAR++KE ++++ VKDL E+ R
Subjt:  NLPRSSIQFSGSVSKLVLEYWAGRGMLQIFVAVMTRAFPVYSVEQRELILLQDAASYLLLACGAVYVVSGILCIGFLKRAREKKETAKDRVVKDLQELER

Query:  QKQELEQLLI
        +K+ELEQLL+
Subjt:  QKQELEQLLI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAGAGACGGAGAGGGAGCACCGGCCGCGGCGTCATCGTCTTCTTCTCAGACCACCAGACCTAGTAGGAGCGTGGACCCTTTGCTTGTAACTTGCAGGTTTTTCAG
TGTTATAACAGCTCTTGCTGCTATTCTCTGCATTGTTTCCAATGTTATCTCTGCGATTCGGTCCTTTAAGAACCAATCCGATATATTCGATGGTATATTTCGGTGTTATG
CAGTTGTGATCGCATTCTTCGTGGTTCTTGCCGAGACGGAATGGGAATTTATTTTCAAGAATTGGAAGGCAATAACTGATATGAGTGAAGCTCGAAATCTACCTCGTTCG
TCCATACAGTTCTCTGGCTCCGTGTCAAAACTGGTATTGGAATATTGGGCTGGCCGGGGCATGTTGCAAATCTTTGTTGCAGTCATGACAAGAGCTTTCCCGGTGTATTC
TGTAGAGCAGAGAGAGCTCATTCTTCTTCAAGATGCTGCAAGTTATCTCCTCCTTGCCTGTGGTGCAGTCTATGTCGTATCGGGAATACTGTGCATTGGGTTTCTCAAAC
GAGCTCGTGAAAAGAAAGAGACTGCGAAGGACAGGGTGGTCAAAGATCTCCAGGAGTTAGAAAGACAAAAGCAAGAACTTGAACAGTTGCTCATTTCAGAAACTGTTTGA
mRNA sequenceShow/hide mRNA sequence
CTTTGGCGAACTGATTAATGCTATGAATTTTTCTTGTGCAGTTGAACACTTGTGAACCCTCGAATGGAAAACGTTCATCGATTCAATCATCATACCCAATCAGTAAGCTG
AATCTCGAGAGACAATTTTCATCCGGGTAAGGTATAGATTCAGGGAAATTCATTGGAGCTGAGGAAATCTGAGGAAATGGAGAGAGACGGAGAGGGAGCACCGGCCGCGG
CGTCATCGTCTTCTTCTCAGACCACCAGACCTAGTAGGAGCGTGGACCCTTTGCTTGTAACTTGCAGGTTTTTCAGTGTTATAACAGCTCTTGCTGCTATTCTCTGCATT
GTTTCCAATGTTATCTCTGCGATTCGGTCCTTTAAGAACCAATCCGATATATTCGATGGTATATTTCGGTGTTATGCAGTTGTGATCGCATTCTTCGTGGTTCTTGCCGA
GACGGAATGGGAATTTATTTTCAAGAATTGGAAGGCAATAACTGATATGAGTGAAGCTCGAAATCTACCTCGTTCGTCCATACAGTTCTCTGGCTCCGTGTCAAAACTGG
TATTGGAATATTGGGCTGGCCGGGGCATGTTGCAAATCTTTGTTGCAGTCATGACAAGAGCTTTCCCGGTGTATTCTGTAGAGCAGAGAGAGCTCATTCTTCTTCAAGAT
GCTGCAAGTTATCTCCTCCTTGCCTGTGGTGCAGTCTATGTCGTATCGGGAATACTGTGCATTGGGTTTCTCAAACGAGCTCGTGAAAAGAAAGAGACTGCGAAGGACAG
GGTGGTCAAAGATCTCCAGGAGTTAGAAAGACAAAAGCAAGAACTTGAACAGTTGCTCATTTCAGAAACTGTTTGAACAATTTAAAGACTTCCCCATGCACCAGAATATA
ATTGCACCTGATTCTTGCTTCGCTGCTGGGATTGTATGAGCATTTGTGCTGCTTCCTTTTTTACTCTACTATAATTGTCCCCTTGATAGATTCAGCTCATGTAAATTATG
TCTAACTTGACAATTTGTCCATGCAGTGAGCGTGTAAATTCTGATTGTCTTGACAATTTGTCTCACTGTGTAAACAAAAGAAATTGTTTATTGAGACAAAAAATTGGTTC
TTTGCTCTCTAAATTGATGCTCTGCAAACTTCAGTGATCAAATGATACTAACTTATAGGACCAAAGTCCCTGCGACATCTCGATGAAAGGTCTCAATCACAGCCGGATCA
CATCTCATGTAAATATTTATGATGAACATGCTTCAAGATGCTACAAAAGCCATCAGAATATTGATGTTCAGTCTCTCTTAGGCACGCACACAAAGACACAATCAATGAAC
TCTGGCAATCATGGAAGAGGATAATAGAAGTTGAGTTCAACACATTGGGAGATTATGGATTCTGCTTGTATCTCTTTAGTGTGTGGGTAGCGTATGGGGTAAACCCTTAG
TACTCCCCATAGTACCTACTTTTTATGTTGAAATTTTAATATGGATGATCTTTTCATGGATGTCTTAATGTTTAATGATCCACCCCACAAACTAATGAAATAAAATCTTA
TGCTTACATCATAG
Protein sequenceShow/hide protein sequence
MERDGEGAPAAASSSSSQTTRPSRSVDPLLVTCRFFSVITALAAILCIVSNVISAIRSFKNQSDIFDGIFRCYAVVIAFFVVLAETEWEFIFKNWKAITDMSEARNLPRS
SIQFSGSVSKLVLEYWAGRGMLQIFVAVMTRAFPVYSVEQRELILLQDAASYLLLACGAVYVVSGILCIGFLKRAREKKETAKDRVVKDLQELERQKQELEQLLISETV