; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi01G014200 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi01G014200
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionUnknown protein
Genome locationchr01:12344778..12351591
RNA-Seq ExpressionLsi01G014200
SyntenyLsi01G014200
Gene Ontology termsNA
InterPro domainsIPR040344 - Uncharacterized protein At3g17950-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032638.1 uncharacterized protein E6C27_scaffold184G00230 [Cucumis melo var. makuwa]2.1e-9883.26Show/hide
Query:  GATPQMDVTDLLTWPPPRTRIDACPFLMKNHRDTRFAFDSPSSLILCQSTPPPLMFLSISNQPPQSQPVMAQQDDGWPLGLRLLNARVGLLENRDLPGSI
        GATPQ DVT+LLTWPPPRTRIDACPFLMKNHRDTRFAF SPSSLIL Q+T   L FLSISNQPPQSQPVMAQQDDGWPLGLR+LNARVGLLENRD PGSI
Subjt:  GATPQMDVTDLLTWPPPRTRIDACPFLMKNHRDTRFAFDSPSSLILCQSTPPPLMFLSISNQPPQSQPVMAQQDDGWPLGLRLLNARVGLLENRDLPGSI

Query:  SFNTLPTGSPISFTDSSDLDSESSGSFFHAKSITLGSLIGGSTSSIMELSRR----GTTEASLGGDRKINN-LGFKYKPWLFSLCCKLSTDAVSATRTHS
        SFNTLPTGSPISFTDSS LDSESSGSFFH+KS TLGSL+GGSTS+IMELSRR    G+TEASLG DRKINN   FK K WLFSLCCKLSTDAV ATRTHS
Subjt:  SFNTLPTGSPISFTDSSDLDSESSGSFFHAKSITLGSLIGGSTSSIMELSRR----GTTEASLGGDRKINN-LGFKYKPWLFSLCCKLSTDAVSATRTHS

Query:  LAHFLEVERRRAAAAAAARPPPIAGRTNNMLTS
        LAHFLE+ER+R A  AAA P PI GR+N+ LTS
Subjt:  LAHFLEVERRRAAAAAAARPPPIAGRTNNMLTS

KAG6595318.1 hypothetical protein SDJN03_11871, partial [Cucurbita argyrosperma subsp. sororia]1.9e-10758.47Show/hide
Query:  FSRGKDAQGRNSLQLEANYTGLYRHREAKKGETAIKKRKLEDYLDPVLLSAVSSKISRMEKIPKMTVKKDVRDFEWPVDELRMLTEDTAVGKGKIDAVNL
        FSRGKD QGRNS Q EAN  GLYR +EAK+GETA +KRKLEDYLDPVLLSAVSSKISR +K+PKM VK++VRDFEWPV ELRML +D+ VGKGKI+ VNL
Subjt:  FSRGKDAQGRNSLQLEANYTGLYRHREAKKGETAIKKRKLEDYLDPVLLSAVSSKISRMEKIPKMTVKKDVRDFEWPVDELRMLTEDTAVGKGKIDAVNL

Query:  GNDSDNLIENDEDGDVKFCTPFQKFEQTAFVLHTKGLYRCGAEFFIPISKSPGHLLHIIFYSKSGNTATHQAYFVAIKKFGCDKMTPARHLCLNVCLQGA
        GNDSDNL ENDE+GDVKF TPFQKFEQTA +                +++ P H +      K+GN                                  
Subjt:  GNDSDNLIENDEDGDVKFCTPFQKFEQTAFVLHTKGLYRCGAEFFIPISKSPGHLLHIIFYSKSGNTATHQAYFVAIKKFGCDKMTPARHLCLNVCLQGA

Query:  TPQMDVTDLLTWPPPRTRIDACPFLMKNHRDTRFAFDSPSSLILCQST-PPPLMFLSISNQPPQSQP-------VMAQQDDGWPLGLRLLNARVGLLENR
                             CP L+             SSL     T PP L   SISNQPP+SQP       +MAQQDDGWPLGLRLLNARVGLLENR
Subjt:  TPQMDVTDLLTWPPPRTRIDACPFLMKNHRDTRFAFDSPSSLILCQST-PPPLMFLSISNQPPQSQP-------VMAQQDDGWPLGLRLLNARVGLLENR

Query:  DLPGSISFNTLPTGSPISFTDSSDLDSESSGSFFHAKSITLGSLIGGSTSSIMELSRR----GTTEASLGGDRKINNLGFKYKPWLFSLCCKLSTDAVSA
        D  GSISFNTLPTGSPISFTDSSDLDS+SSGSF HAKSI+ GSLI G  S I+ELSRR    G+TE SLGG RK +   FK KPWLFSLCCKLSTDAVS 
Subjt:  DLPGSISFNTLPTGSPISFTDSSDLDSESSGSFFHAKSITLGSLIGGSTSSIMELSRR----GTTEASLGGDRKINNLGFKYKPWLFSLCCKLSTDAVSA

Query:  TRTHSLAHFLEVERRRAAA
        TRTHSLAHFLE ERRR AA
Subjt:  TRTHSLAHFLEVERRRAAA

KAG7027328.1 hypothetical protein SDJN02_11340, partial [Cucurbita argyrosperma subsp. argyrosperma]8.8e-12660.31Show/hide
Query:  MEARGGGDGGFSPRATVEDIQRRLLRPLSSLHSPLPTPFSRGKDAQGRNSLQLEANYTGLYRHREAKKGETAIKKRKLEDYLDPVLLSAVSSKISRMEKI
        M+  GGGDGGFSPRATVEDIQRRLLRP SS+HSP PTPFSRGKD QGRNS Q EAN  GLYR +EAK+GETA +KRKLEDYLDPVLLSAVSSKISR +K+
Subjt:  MEARGGGDGGFSPRATVEDIQRRLLRPLSSLHSPLPTPFSRGKDAQGRNSLQLEANYTGLYRHREAKKGETAIKKRKLEDYLDPVLLSAVSSKISRMEKI

Query:  PKMTVKKDVRDFEWPVDELRMLTEDTAVGKGKIDAVNLGNDSDNLIENDEDGDVKFCTPFQKFEQTAFVLHTKGLYRCGAEFFIPISKSPGHLLHIIFYS
        PKM VK++VRDFEWPV ELRML +D+ VGKGKI+ VNLGNDSDNL ENDE+GDVKF TPFQKFEQTA                                 
Subjt:  PKMTVKKDVRDFEWPVDELRMLTEDTAVGKGKIDAVNLGNDSDNLIENDEDGDVKFCTPFQKFEQTAFVLHTKGLYRCGAEFFIPISKSPGHLLHIIFYS

Query:  KSGNTATHQAYFVAIKKFGCDKMTPARHLCLNVCLQGATPQMDVTDLLTWPPPRTRIDACPFLMKNHRDTRFAFDSPSSLILCQSTPPPLMFLSISNQPP
                             K +  RH  L + +                       A PF     R+T        +  L    PP L   SISNQPP
Subjt:  KSGNTATHQAYFVAIKKFGCDKMTPARHLCLNVCLQGATPQMDVTDLLTWPPPRTRIDACPFLMKNHRDTRFAFDSPSSLILCQSTPPPLMFLSISNQPP

Query:  QSQP-------VMAQQDDGWPLGLRLLNARVGLLENRDLPGSISFNTLPTGSPISFTDSSDLDSESSGSFFHAKSITLGSLIGGSTSSIMELSRR----G
        +SQP       +MAQQDDGWPLGLRLLNARVGLLENRD  GSISFNTLPTGSPISFTDSSDLDS+SSGSF HAKSI+ GSLI G  S I+ELSRR    G
Subjt:  QSQP-------VMAQQDDGWPLGLRLLNARVGLLENRDLPGSISFNTLPTGSPISFTDSSDLDSESSGSFFHAKSITLGSLIGGSTSSIMELSRR----G

Query:  TTEASLGGDRKINNLGFKYKPWLFSLCCKLSTDAVSATRTHSLAHFLEVERRRAAA
        +TE SLGG RK +   FK KPWLFSLCCKLSTDAVS TRTHSLAHFLE ERRR AA
Subjt:  TTEASLGGDRKINNLGFKYKPWLFSLCCKLSTDAVSATRTHSLAHFLEVERRRAAA

XP_004142071.2 uncharacterized protein LOC101214483 [Cucumis sativus]9.6e-7279.47Show/hide
Query:  LILCQSTPPPLMFLSISNQPPQSQPVMAQQDDGWPLGLRLLNARVGLLENRDLPGSISFNTLPTGSPISFTDSSDLDSESSGSFFHAKSITLGSLIGGST
        ++   +T   L FLSISNQPPQSQPVMAQQDDGWPLGLR+LNARVGLLENRD PGSISFNTLPTGSPISFTDSS LDSESSGSFFH+KSITLGSLIGGST
Subjt:  LILCQSTPPPLMFLSISNQPPQSQPVMAQQDDGWPLGLRLLNARVGLLENRDLPGSISFNTLPTGSPISFTDSSDLDSESSGSFFHAKSITLGSLIGGST

Query:  SSIMELSRR----GTTEASLGGDRKINN-LGFKYKPWLFSLCCKLSTDAVSATRTHSLAHFLEVERRRAAAAAAARPPPIAGRTNNMLTS
        S+IMEL+RR    G+TEASLG DRKINN    K KPWLFSLCCKLSTDAV ATRTHSLAHFLE+ER+R A  AAA P PI GR++N+LTS
Subjt:  SSIMELSRR----GTTEASLGGDRKINN-LGFKYKPWLFSLCCKLSTDAVSATRTHSLAHFLEVERRRAAAAAAARPPPIAGRTNNMLTS

XP_038877635.1 uncharacterized protein LOC120069886 [Benincasa hispida]7.6e-7787.98Show/hide
Query:  PPPLMFLSISNQPPQSQPVMAQQDDGWPLGLRLLNARVGLLENRDLPGSISFNTLPTGSPISFTDSSDLDSESSGSFFHAKSITLGSLIGGSTSSIMELS
        PPPL FLSISNQPP+SQ VMAQQDDGWPLGLRLLNARVGLLENRD PGSISFNTLPTGSPISFTDSSDLDSESSGSFFHAKSITLGSLIGGST  IMELS
Subjt:  PPPLMFLSISNQPPQSQPVMAQQDDGWPLGLRLLNARVGLLENRDLPGSISFNTLPTGSPISFTDSSDLDSESSGSFFHAKSITLGSLIGGSTSSIMELS

Query:  RR----GTTEASLGGDRKINN-LGFKYKPWLFSLCCKLSTDAVSATRTHSLAHFLEVERRRAAAAAAARPPPIAGRTNNMLTS
        RR    G+TEASLG DRKINN   FK KPWLFSLCCKLSTDAV ATRT SLAHFLEVERRR AAAAA+ PPPIA RTNNMLTS
Subjt:  RR----GTTEASLGGDRKINN-LGFKYKPWLFSLCCKLSTDAVSATRTHSLAHFLEVERRRAAAAAAARPPPIAGRTNNMLTS

TrEMBL top hitse value%identityAlignment
A0A0A0L270 Uncharacterized protein4.5e-6782.04Show/hide
Query:  MEARGGGDGGFSPRATVEDIQRRLLRPLSSLHSPLPTPFSRGKDAQGRNSLQLEANYTGLYRHREAKKGETAIKKRKLEDYLDPVLLSAVSSKISRMEKI
        MEA GGG+GGFS RATVEDI+RRLLRP SSLHSP PTPFS GK+AQ RNS Q EANY  L   REAKKGET+ K+RKLEDYLDPVLLSA+SSKISR+EKI
Subjt:  MEARGGGDGGFSPRATVEDIQRRLLRPLSSLHSPLPTPFSRGKDAQGRNSLQLEANYTGLYRHREAKKGETAIKKRKLEDYLDPVLLSAVSSKISRMEKI

Query:  PKMTVKKDVRDFEWPVDELRMLTEDTAVGKGKIDAVNLGNDSDNLIENDEDGDVKFCTPFQKFEQTA
        PK+TVK+ VRDFEW VDELRM TEDTAVGK KIDAVNLGNDSDNLIEND DGDVKF TPFQKFEQ A
Subjt:  PKMTVKKDVRDFEWPVDELRMLTEDTAVGKGKIDAVNLGNDSDNLIENDEDGDVKFCTPFQKFEQTA

A0A1S4DWX0 uncharacterized protein LOC103490160 isoform X13.0e-7182.78Show/hide
Query:  LMFLSISNQPPQSQPVMAQQDDGWPLGLRLLNARVGLLENRDLPGSISFNTLPTGSPISFTDSSDLDSESSGSFFHAKSITLGSLIGGSTSSIMELSRR-
        L FLSISNQPPQSQPVMAQQDDGWPLGLR+LNARVGLLENRD PGSISFNTLPTGSPISFTDSS LDSESSGSFFH+KS TLGSL+GGSTS+IMELSRR 
Subjt:  LMFLSISNQPPQSQPVMAQQDDGWPLGLRLLNARVGLLENRDLPGSISFNTLPTGSPISFTDSSDLDSESSGSFFHAKSITLGSLIGGSTSSIMELSRR-

Query:  ---GTTEASLGGDRKINN-LGFKYKPWLFSLCCKLSTDAVSATRTHSLAHFLEVERRRAAAAAAARPPPIAGRTNNMLTS
           G+TEASLG DRKINN   FK K WLFSLCCKLSTDAV ATRTHSLAHFLE+ER+R A  AAA P PI GR+N+ LTS
Subjt:  ---GTTEASLGGDRKINN-LGFKYKPWLFSLCCKLSTDAVSATRTHSLAHFLEVERRRAAAAAAARPPPIAGRTNNMLTS

A0A5D3DJ71 Uncharacterized protein9.9e-9983.26Show/hide
Query:  GATPQMDVTDLLTWPPPRTRIDACPFLMKNHRDTRFAFDSPSSLILCQSTPPPLMFLSISNQPPQSQPVMAQQDDGWPLGLRLLNARVGLLENRDLPGSI
        GATPQ DVT+LLTWPPPRTRIDACPFLMKNHRDTRFAF SPSSLIL Q+T   L FLSISNQPPQSQPVMAQQDDGWPLGLR+LNARVGLLENRD PGSI
Subjt:  GATPQMDVTDLLTWPPPRTRIDACPFLMKNHRDTRFAFDSPSSLILCQSTPPPLMFLSISNQPPQSQPVMAQQDDGWPLGLRLLNARVGLLENRDLPGSI

Query:  SFNTLPTGSPISFTDSSDLDSESSGSFFHAKSITLGSLIGGSTSSIMELSRR----GTTEASLGGDRKINN-LGFKYKPWLFSLCCKLSTDAVSATRTHS
        SFNTLPTGSPISFTDSS LDSESSGSFFH+KS TLGSL+GGSTS+IMELSRR    G+TEASLG DRKINN   FK K WLFSLCCKLSTDAV ATRTHS
Subjt:  SFNTLPTGSPISFTDSSDLDSESSGSFFHAKSITLGSLIGGSTSSIMELSRR----GTTEASLGGDRKINN-LGFKYKPWLFSLCCKLSTDAVSATRTHS

Query:  LAHFLEVERRRAAAAAAARPPPIAGRTNNMLTS
        LAHFLE+ER+R A  AAA P PI GR+N+ LTS
Subjt:  LAHFLEVERRRAAAAAAARPPPIAGRTNNMLTS

A0A6J1HFI5 uncharacterized protein LOC1114631082.4e-6880Show/hide
Query:  MEARGGGDGGFSPRATVEDIQRRLLRPLSSLHSPLPTPFSRGKDAQGRNSLQLEANYTGLYRHREAKKGETAIKKRKLEDYLDPVLLSAVSSKISRMEKI
        ME  GGGDGGFSPRA+VEDIQRRLLRP SS+HSP PTPFSRGKD QGRNS Q EAN  GLYR +EAK+GETA +KRKLEDYLDPVLLSAVSSKISR +K+
Subjt:  MEARGGGDGGFSPRATVEDIQRRLLRPLSSLHSPLPTPFSRGKDAQGRNSLQLEANYTGLYRHREAKKGETAIKKRKLEDYLDPVLLSAVSSKISRMEKI

Query:  PKMTVKKDVRDFEWPVDELRMLTEDTAVGKGKIDAVNLGNDSDNLIENDEDGDVKFCTPFQKFEQTAFVL
        PKM VK++VRDFEWPV ELRML +D+ VGKGKI+ VNLGNDSDNL EN+E+GDVKF TPFQKFEQTA VL
Subjt:  PKMTVKKDVRDFEWPVDELRMLTEDTAVGKGKIDAVNLGNDSDNLIENDEDGDVKFCTPFQKFEQTAFVL

A0A6J1I9M7 uncharacterized protein LOC111471294 isoform X11.8e-6880.59Show/hide
Query:  MEARGGGDGGFSPRATVEDIQRRLLRPLSSLHSPLPTPFSRGKDAQGRNSLQLEANYTGLYRHREAKKGETAIKKRKLEDYLDPVLLSAVSSKISRMEKI
        ME  GGGDGGFSPRATVEDIQRRLLRP SS+HSP PTPFSRGKD QGRNS Q EAN  GLYR REAK+GETA +KRKLEDYLDPVLLSAVSSKISR +K+
Subjt:  MEARGGGDGGFSPRATVEDIQRRLLRPLSSLHSPLPTPFSRGKDAQGRNSLQLEANYTGLYRHREAKKGETAIKKRKLEDYLDPVLLSAVSSKISRMEKI

Query:  PKMTVKKDVRDFEWPVDELRMLTEDTAVGKGKIDAVNLGNDSDNLIENDEDGDVKFCTPFQKFEQTAFVL
        PKMTVK++VRDFEWPV ELRML +D+ VGKGK + VNLGNDSD L ENDE+GDVK  TPFQKFEQTA VL
Subjt:  PKMTVKKDVRDFEWPVDELRMLTEDTAVGKGKIDAVNLGNDSDNLIENDEDGDVKFCTPFQKFEQTAFVL

SwissProt top hitse value%identityAlignment
Q6DR24 Uncharacterized protein At3g179506.8e-0449.02Show/hide
Query:  LPTGSPISFTDSSDLDSESSGSFFHAKSITLGSLIGGSTSSIMELSRRGTT
        +P+   IS   SSDLD+ES+GSFFH +SITLG+L+G S ++ M +  R ++
Subjt:  LPTGSPISFTDSSDLDSESSGSFFHAKSITLGSLIGGSTSSIMELSRRGTT

Arabidopsis top hitse value%identityAlignment
AT3G17950.1 unknown protein4.8e-0549.02Show/hide
Query:  LPTGSPISFTDSSDLDSESSGSFFHAKSITLGSLIGGSTSSIMELSRRGTT
        +P+   IS   SSDLD+ES+GSFFH +SITLG+L+G S ++ M +  R ++
Subjt:  LPTGSPISFTDSSDLDSESSGSFFHAKSITLGSLIGGSTSSIMELSRRGTT

AT5G02440.1 unknown protein1.5e-2243.43Show/hide
Query:  MAQQDDGWPLGLRLLNARVGLL---------ENRDLPGSISFNTLPTGSPISFTDSSDLDSESSGSFFHAKSITLGSLIGGSTSSIMELSRRG--TTEAS
        MA Q++GWPLGLR +NAR+G L           +   GSISF++L + SP S   SSDLDS+S GSFF  +S TLG+LIG   SS +ELSRR   T    
Subjt:  MAQQDDGWPLGLRLLNARVGLL---------ENRDLPGSISFNTLPTGSPISFTDSSDLDSESSGSFFHAKSITLGSLIGGSTSSIMELSRRG--TTEAS

Query:  LGGDRK---INNLGFKYKPWLFSLCCKLSTDAV-------------SATRTHSLAHFLEVERRRAAAAAAARPPP
         G  R      NL   YKPW+FS+C KLST+A                    SL HFL +ERR   +   + P P
Subjt:  LGGDRK---INNLGFKYKPWLFSLCCKLSTDAV-------------SATRTHSLAHFLEVERRRAAAAAAARPPP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGCCAGAGGAGGCGGAGATGGAGGTTTTAGTCCTCGCGCCACCGTAGAAGATATCCAGAGACGGCTGCTTCGACCGTTGTCGTCACTTCACTCTCCTCTACCGAC
GCCGTTTTCTCGCGGCAAAGATGCTCAAGGCCGAAATTCATTGCAATTAGAGGCGAATTACACGGGGCTATATCGGCATCGAGAAGCGAAAAAAGGAGAAACCGCAATCA
AGAAGAGGAAGCTTGAAGATTACTTAGATCCGGTTCTTCTCTCTGCGGTTTCTTCGAAGATCAGTCGGATGGAAAAGATTCCTAAGATGACGGTGAAGAAAGACGTTAGG
GATTTCGAGTGGCCTGTCGATGAATTGAGGATGTTGACGGAGGATACGGCAGTTGGAAAGGGGAAAATTGATGCGGTTAATCTTGGTAATGACTCTGATAATCTGATAGA
AAATGATGAAGACGGAGATGTTAAGTTCTGTACTCCGTTTCAAAAGTTTGAGCAGACTGCATTTGTACTTCATACAAAGGGCCTGTACCGCTGCGGAGCTGAATTCTTCA
TCCCAATTTCAAAATCCCCTGGCCATTTACTTCACATAATTTTCTATTCAAAATCTGGAAACACAGCTACTCATCAGGCATACTTTGTTGCAATCAAGAAATTTGGCTGT
GATAAGATGACACCAGCGCGCCATTTGTGCTTAAATGTTTGCCTGCAGGGTGCCACCCCACAAATGGATGTTACTGACCTTTTAACTTGGCCACCACCACGGACGAGGAT
TGATGCCTGCCCATTCCTGATGAAGAATCATCGGGATACGAGATTCGCATTTGATTCTCCATCTTCCTTAATACTTTGCCAGAGTACTCCTCCTCCTTTGATGTTTCTTA
GTATATCAAATCAACCACCACAGTCTCAGCCAGTGATGGCTCAACAGGACGATGGGTGGCCTTTGGGATTAAGACTGCTAAATGCTAGAGTTGGGTTGCTGGAAAATCGA
GACCTTCCTGGATCAATCTCCTTCAACACTTTGCCTACTGGATCTCCCATCTCTTTCACGGACTCTTCAGATCTTGATTCTGAGTCAAGTGGGTCGTTCTTCCATGCTAA
AAGCATCACTCTGGGTAGTCTAATCGGTGGTTCTACTTCTAGCATCATGGAACTCTCGAGAAGGGGAACCACAGAAGCAAGCCTAGGAGGAGACAGAAAGATCAATAACT
TGGGGTTCAAGTACAAGCCATGGTTGTTTTCACTGTGTTGCAAACTGAGCACCGACGCCGTCAGCGCCACCAGAACTCACTCCCTGGCTCACTTTCTAGAAGTGGAGAGA
AGGAGAGCTGCCGCCGCCGCCGCTGCCCGCCCCCCGCCAATTGCCGGAAGAACCAATAATATGTTAACCAGTTGA
mRNA sequenceShow/hide mRNA sequence
AATAAACCGTGAGAGAAATTTGGCGGGAATGAGAAATGGAGGCCAGAGGAGGCGGAGATGGAGGTTTTAGTCCTCGCGCCACCGTAGAAGATATCCAGAGACGGCTGCTT
CGACCGTTGTCGTCACTTCACTCTCCTCTACCGACGCCGTTTTCTCGCGGCAAAGATGCTCAAGGCCGAAATTCATTGCAATTAGAGGCGAATTACACGGGGCTATATCG
GCATCGAGAAGCGAAAAAAGGAGAAACCGCAATCAAGAAGAGGAAGCTTGAAGATTACTTAGATCCGGTTCTTCTCTCTGCGGTTTCTTCGAAGATCAGTCGGATGGAAA
AGATTCCTAAGATGACGGTGAAGAAAGACGTTAGGGATTTCGAGTGGCCTGTCGATGAATTGAGGATGTTGACGGAGGATACGGCAGTTGGAAAGGGGAAAATTGATGCG
GTTAATCTTGGTAATGACTCTGATAATCTGATAGAAAATGATGAAGACGGAGATGTTAAGTTCTGTACTCCGTTTCAAAAGTTTGAGCAGACTGCATTTGTACTTCATAC
AAAGGGCCTGTACCGCTGCGGAGCTGAATTCTTCATCCCAATTTCAAAATCCCCTGGCCATTTACTTCACATAATTTTCTATTCAAAATCTGGAAACACAGCTACTCATC
AGGCATACTTTGTTGCAATCAAGAAATTTGGCTGTGATAAGATGACACCAGCGCGCCATTTGTGCTTAAATGTTTGCCTGCAGGGTGCCACCCCACAAATGGATGTTACT
GACCTTTTAACTTGGCCACCACCACGGACGAGGATTGATGCCTGCCCATTCCTGATGAAGAATCATCGGGATACGAGATTCGCATTTGATTCTCCATCTTCCTTAATACT
TTGCCAGAGTACTCCTCCTCCTTTGATGTTTCTTAGTATATCAAATCAACCACCACAGTCTCAGCCAGTGATGGCTCAACAGGACGATGGGTGGCCTTTGGGATTAAGAC
TGCTAAATGCTAGAGTTGGGTTGCTGGAAAATCGAGACCTTCCTGGATCAATCTCCTTCAACACTTTGCCTACTGGATCTCCCATCTCTTTCACGGACTCTTCAGATCTT
GATTCTGAGTCAAGTGGGTCGTTCTTCCATGCTAAAAGCATCACTCTGGGTAGTCTAATCGGTGGTTCTACTTCTAGCATCATGGAACTCTCGAGAAGGGGAACCACAGA
AGCAAGCCTAGGAGGAGACAGAAAGATCAATAACTTGGGGTTCAAGTACAAGCCATGGTTGTTTTCACTGTGTTGCAAACTGAGCACCGACGCCGTCAGCGCCACCAGAA
CTCACTCCCTGGCTCACTTTCTAGAAGTGGAGAGAAGGAGAGCTGCCGCCGCCGCCGCTGCCCGCCCCCCGCCAATTGCCGGAAGAACCAATAATATGTTAACCAGTTGA
AACACTGTTTTCATTAGCCAATGGCAAAGTTGGTCGTCAGCAAATAAGAAGAGCAGCGAAGAAGAGTGTGCTTAATTGTTTCATTTTGCCTTCAGACAAATTATTAGGCT
GCCAGCCACTCATAATTCCGTTAAATGTGGCAATAGGGAGATGCTTTTCTGGGTCTTCATCAGCTGTTTTGTGATCTGACTCTTATCTCTAAGTCAGGCCTATCATTGAT
ATATTATATCAATGGAATAAAAATGGTATGCAAATGATATTTGGTTTTGTGACCTACCAGCAATAGACATATATTACTAATAAATGTAAAAATGATCCATCCTTATTATT
ACTAATAGACATGAAAATTATCT
Protein sequenceShow/hide protein sequence
MEARGGGDGGFSPRATVEDIQRRLLRPLSSLHSPLPTPFSRGKDAQGRNSLQLEANYTGLYRHREAKKGETAIKKRKLEDYLDPVLLSAVSSKISRMEKIPKMTVKKDVR
DFEWPVDELRMLTEDTAVGKGKIDAVNLGNDSDNLIENDEDGDVKFCTPFQKFEQTAFVLHTKGLYRCGAEFFIPISKSPGHLLHIIFYSKSGNTATHQAYFVAIKKFGC
DKMTPARHLCLNVCLQGATPQMDVTDLLTWPPPRTRIDACPFLMKNHRDTRFAFDSPSSLILCQSTPPPLMFLSISNQPPQSQPVMAQQDDGWPLGLRLLNARVGLLENR
DLPGSISFNTLPTGSPISFTDSSDLDSESSGSFFHAKSITLGSLIGGSTSSIMELSRRGTTEASLGGDRKINNLGFKYKPWLFSLCCKLSTDAVSATRTHSLAHFLEVER
RRAAAAAAARPPPIAGRTNNMLTS