CuGenDBv2

Clc06G03220 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc06G03220
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Structural constituent of nuclear pore isoform 1
Genome location: ClcChr06:3347288..3350586
RNA-Seq Expression: Clc06G03220
Synteny: Clc06G03220
Gene Ontology terms: NA
InterPro domains: NA
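
The genome location field uses a `chromosome:start..end` layout. As a minimal sketch, the string can be split into its parts and the length of the locus computed as follows. The function names `parse_location` and `span_length` are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chromosome:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("ClcChr06:3347288..3350586")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the locus spans 3299 bp on ClcChr06.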


Homology
GenBank top hits | e value | % identity | Alignment
KAG6575228.1 hypothetical protein SDJN03_25867, partial [Cucurbita argyrosperma subsp. sororia] | 2.3e-82 | 59.62
Query:  MAFKSPSVVSSTC-NCGGDTCNSLRHGREIQAIDRKAGIPFPKIDLGPVPSPLEVDAAVTALQSLLQESFSLESMSKWLQPLMNSCDSSILHSRGYRLLC
        MAFKS  VVSSTC NCG DTC S   GREIQ++D KAG    K DLGPVPS  EVDAAVTALQSLLQE FS+ES+SKWLQPL+NS DSSILHSRGY+LLC
Subjt:  MAFKSPSVVSSTC-NCGGDTCNSLRHGREIQAIDRKAGIPFPKIDLGPVPSPLEVDAAVTALQSLLQESFSLESMSKWLQPLMNSCDSSILHSRGYRLLC

Query:  KGFQWLLTDPTFK------GLVISLCLDKDVWDAIKNNGIVEKLQELPSSGKLLTAPLNVANRPSMDLDGLGTSNANPIIYGWRNKVEFWEAIFECILLQ
        KGFQWLLTDPT K      GLV+SLCLDK+VWDAIKNNGIVEKLQELPSS                                                  
Subjt:  KGFQWLLTDPTFK------GLVISLCLDKDVWDAIKNNGIVEKLQELPSSGKLLTAPLNVANRPSMDLDGLGTSNANPIIYGWRNKVEFWEAIFECILLQ

Query:  ILSCLDNNQANVSDPSLSETLDLEGGKGNPRSSKQGPDFGNAILSWILQMSLTKIRELIENFVFLLNNAFRFPGKEKLKPEKRDEIDEKIQSSLVLSLVV
                               EGG GN  SSK G D G+ ILSWILQMSLTKI EL++NFV LLNNAF FPGKEKLK EKR+EIDEKIQSS VLSL++
Subjt:  ILSCLDNNQANVSDPSLSETLDLEGGKGNPRSSKQGPDFGNAILSWILQMSLTKIRELIENFVFLLNNAFRFPGKEKLKPEKRDEIDEKIQSSLVLSLVV

Query:  LLIVVVARVQIA
        +LIVVV+RVQIA
Subjt:  LLIVVVARVQIA

XP_008459730.1 PREDICTED: uncharacterized protein LOC103498772 [Cucumis melo] | 8.2e-88 | 63.28
Query:  MAFKSPSVVSSTCNCGGDTCNSLRHGREIQAIDRKAGIPFPKIDLGPVPSPLEVDAAVTALQSLLQESFSLESMSKWLQPLMNSCDSSILHSRGYRLLCK
        MAFKS S V ST +  G+T NSLRHGREI+A+      PFP  DLGPVPSPLEV+AAV ALQSLLQE FSLESMSKWLQPLMNS  SSILHSRGY+LLCK
Subjt:  MAFKSPSVVSSTCNCGGDTCNSLRHGREIQAIDRKAGIPFPKIDLGPVPSPLEVDAAVTALQSLLQESFSLESMSKWLQPLMNSCDSSILHSRGYRLLCK

Query:  GFQWLLTDPTFKGLVISLCLDKDVWDAIKNNGIVEKLQELPSSGKLLTAPLNVANRPSMDLDGLGTSNANPIIYGWRNKVEFWEAIFECILLQILSCLDN
        GFQW+LTDPTFKGLVISLCLDKDVW+AI+N+GIVEKLQELPSS                                                         
Subjt:  GFQWLLTDPTFKGLVISLCLDKDVWDAIKNNGIVEKLQELPSSGKLLTAPLNVANRPSMDLDGLGTSNANPIIYGWRNKVEFWEAIFECILLQILSCLDN

Query:  NQANVSDPSLSETLDLEGGKGNPRSSKQGPDFGNAILSWILQMSLTKIRELIENFVFLLNNAFRFPGKEKLKPEKRDEIDEKIQSSLVLSLVVLLIVVVA
                         GG GN  SSKQGPDFGN ILSWILQ+SLTKIRELIENFV LLNNAF FPGKEKLKPEK+DEIDEKIQSSLVLSLVV+LIVVVA
Subjt:  NQANVSDPSLSETLDLEGGKGNPRSSKQGPDFGNAILSWILQMSLTKIRELIENFVFLLNNAFRFPGKEKLKPEKRDEIDEKIQSSLVLSLVVLLIVVVA

Query:  RVQIA
        RVQIA
Subjt:  RVQIA

XP_022959467.1 uncharacterized protein LOC111460433 [Cucurbita moschata] | 2.5e-84 | 61.11
Query:  MAFKSPSVVSSTC-NCGGDTCNSLRHGREIQAIDRKAGIPFPKIDLGPVPSPLEVDAAVTALQSLLQESFSLESMSKWLQPLMNSCDSSILHSRGYRLLC
        MAFKS  VVSSTC NCG DTC S   GREIQ++D KAG    K DLGPVPS  EVDAAVTALQSLLQE FS ES+SKWLQPL+NS DSSILHSRGY+LLC
Subjt:  MAFKSPSVVSSTC-NCGGDTCNSLRHGREIQAIDRKAGIPFPKIDLGPVPSPLEVDAAVTALQSLLQESFSLESMSKWLQPLMNSCDSSILHSRGYRLLC

Query:  KGFQWLLTDPTFKGLVISLCLDKDVWDAIKNNGIVEKLQELPSSGKLLTAPLNVANRPSMDLDGLGTSNANPIIYGWRNKVEFWEAIFECILLQILSCLD
        KGFQWLLTDPT KGLV+SLCLDK+VWDAIKNNGIVEKLQELPSS                                                        
Subjt:  KGFQWLLTDPTFKGLVISLCLDKDVWDAIKNNGIVEKLQELPSSGKLLTAPLNVANRPSMDLDGLGTSNANPIIYGWRNKVEFWEAIFECILLQILSCLD

Query:  NNQANVSDPSLSETLDLEGGKGNPRSSKQGPDFGNAILSWILQMSLTKIRELIENFVFLLNNAFRFPGKEKLKPEKRDEIDEKIQSSLVLSLVVLLIVVV
                         EGG GN  SSK+G D G+ ILSWILQMSLTKI ELI+NFV LLNNAF FPGKEKLK EKR+EIDEKIQSS VLSL+++LIVVV
Subjt:  NNQANVSDPSLSETLDLEGGKGNPRSSKQGPDFGNAILSWILQMSLTKIRELIENFVFLLNNAFRFPGKEKLKPEKRDEIDEKIQSSLVLSLVVLLIVVV

Query:  ARVQIA
        +RVQIA
Subjt:  ARVQIA

XP_023549361.1 uncharacterized protein LOC111807735 [Cucurbita pepo subsp. pepo] | 7.9e-83 | 60.13
Query:  MAFKSPSVVSSTC-NCGGDTCNSLRHGREIQAIDRKAGIPFPKIDLGPVPSPLEVDAAVTALQSLLQESFSLESMSKWLQPLMNSCDSSILHSRGYRLLC
        MAFKSP VVSSTC NCG DTC S   G EIQ++D KA     K DLGPVPS  EVDAAVTALQSLLQE FS+ES+SKWLQPL+NS DSSILHSRGY+LLC
Subjt:  MAFKSPSVVSSTC-NCGGDTCNSLRHGREIQAIDRKAGIPFPKIDLGPVPSPLEVDAAVTALQSLLQESFSLESMSKWLQPLMNSCDSSILHSRGYRLLC

Query:  KGFQWLLTDPTFKGLVISLCLDKDVWDAIKNNGIVEKLQELPSSGKLLTAPLNVANRPSMDLDGLGTSNANPIIYGWRNKVEFWEAIFECILLQILSCLD
        KGFQWLLTDP+ KGLV+SLCLDK+VWDAIKNNGIVEKLQELPSS                                                        
Subjt:  KGFQWLLTDPTFKGLVISLCLDKDVWDAIKNNGIVEKLQELPSSGKLLTAPLNVANRPSMDLDGLGTSNANPIIYGWRNKVEFWEAIFECILLQILSCLD

Query:  NNQANVSDPSLSETLDLEGGKGNPRSSKQGPDFGNAILSWILQMSLTKIRELIENFVFLLNNAFRFPGKEKLKPEKRDEIDEKIQSSLVLSLVVLLIVVV
                         EGG GN  SSK G D G+ ILSWILQMSLTKI ELI+NFV LLNNAF FPGKEKLK EKR+EIDEKIQSS VLSL+++LIVVV
Subjt:  NNQANVSDPSLSETLDLEGGKGNPRSSKQGPDFGNAILSWILQMSLTKIRELIENFVFLLNNAFRFPGKEKLKPEKRDEIDEKIQSSLVLSLVVLLIVVV

Query:  ARVQIA
        +R QIA
Subjt:  ARVQIA

XP_038875839.1 uncharacterized protein LOC120068201 [Benincasa hispida] | 3.5e-99 | 67.54
Query:  MAFKSPSVVSSTCNCGGDTCNSLRHGREIQAIDRKAGIPFPKIDLGPVPSPLEVDAAVTALQSLLQESFSLESMSKWLQPLMNSCDSSILHSRGYRLLCK
        M FKSPSVVSST NC  DTCNSL +GREI+AIDRKAG   PKIDLGPVPSP+EVDAAV ALQSLLQESFSL S+SKWLQPLMNSCDSSILHSRGYRLLCK
Subjt:  MAFKSPSVVSSTCNCGGDTCNSLRHGREIQAIDRKAGIPFPKIDLGPVPSPLEVDAAVTALQSLLQESFSLESMSKWLQPLMNSCDSSILHSRGYRLLCK

Query:  GFQWLLTDPTFKGLVISLCLDKDVWDAIKNNGIVEKLQELPSSGKLLTAPLNVANRPSMDLDGLGTSNANPIIYGWRNKVEFWEAIFECILLQILSCLDN
        G QWLLTDPTFKGLVISLCLDKDVW+AI+NNGIVEKLQELPSS                                                         
Subjt:  GFQWLLTDPTFKGLVISLCLDKDVWDAIKNNGIVEKLQELPSSGKLLTAPLNVANRPSMDLDGLGTSNANPIIYGWRNKVEFWEAIFECILLQILSCLDN

Query:  NQANVSDPSLSETLDLEGGKGNPRSSKQGPDFGNAILSWILQMSLTKIRELIENFVFLLNNAFRFPGKEKLKPEKRDEIDEKIQSSLVLSLVVLLIVVVA
                        EGG GNP SSKQGPDFGNAILSWIL MSLTKIRELIENFVFLLNNAFRFPGKEKLK EKRDEIDEKIQSS  LSLV+LLIV+VA
Subjt:  NQANVSDPSLSETLDLEGGKGNPRSSKQGPDFGNAILSWILQMSLTKIRELIENFVFLLNNAFRFPGKEKLKPEKRDEIDEKIQSSLVLSLVVLLIVVVA

Query:  RVQIA
        RVQ+A
Subjt:  RVQIA

TrEMBL top hits | e value | % identity | Alignment
A0A0A0KFP6 Uncharacterized protein | 3.9e-75 | 56.86
Query:  MAFKSPSVVSSTCNCGGDTCNSLRH-GREIQAIDRKAGIPFPKIDLGPVPSPLEVDAAVTALQSLLQESFSLESMSKWLQPLMNSCDSSILHSRGYRLLC
        MAFKS S V ST +   +T NSLRH GREI+ +      PFP  DLGPVPS +EVDAAVTAL+SLLQE FSLES+SKWLQPLMNS  SSIL SRGYRLL 
Subjt:  MAFKSPSVVSSTCNCGGDTCNSLRH-GREIQAIDRKAGIPFPKIDLGPVPSPLEVDAAVTALQSLLQESFSLESMSKWLQPLMNSCDSSILHSRGYRLLC

Query:  KGFQWLLTDPTFKGLVISLCLDKDVWDAIKNNGIVEKLQELPSSGKLLTAPLNVANRPSMDLDGLGTSNANPIIYGWRNKVEFWEAIFECILLQILSCLD
        KGF+W+L DPTFKGLVISLCLDKDVW+AI N+GIVEKLQELPSS                                                        
Subjt:  KGFQWLLTDPTFKGLVISLCLDKDVWDAIKNNGIVEKLQELPSSGKLLTAPLNVANRPSMDLDGLGTSNANPIIYGWRNKVEFWEAIFECILLQILSCLD

Query:  NNQANVSDPSLSETLDLEGGKGNPRSSKQGPDFGNAILSWILQMSLTKIRELIENFVFLLNNAFRFPGKEKLKPEKRDEIDEKIQSSLVLSLVVLLIVVV
                          GG GN  SSKQG +FGN ILSWILQMS +KIREL+ENFV LLN AF FPGKE LKPEK+DE+DEKIQS+ +LSLV+++IVVV
Subjt:  NNQANVSDPSLSETLDLEGGKGNPRSSKQGPDFGNAILSWILQMSLTKIRELIENFVFLLNNAFRFPGKEKLKPEKRDEIDEKIQSSLVLSLVVLLIVVV

Query:  ARVQIA
        ARVQIA
Subjt:  ARVQIA

A0A1S3CC23 uncharacterized protein LOC103498772 | 4.0e-88 | 63.28
Query:  MAFKSPSVVSSTCNCGGDTCNSLRHGREIQAIDRKAGIPFPKIDLGPVPSPLEVDAAVTALQSLLQESFSLESMSKWLQPLMNSCDSSILHSRGYRLLCK
        MAFKS S V ST +  G+T NSLRHGREI+A+      PFP  DLGPVPSPLEV+AAV ALQSLLQE FSLESMSKWLQPLMNS  SSILHSRGY+LLCK
Subjt:  MAFKSPSVVSSTCNCGGDTCNSLRHGREIQAIDRKAGIPFPKIDLGPVPSPLEVDAAVTALQSLLQESFSLESMSKWLQPLMNSCDSSILHSRGYRLLCK

Query:  GFQWLLTDPTFKGLVISLCLDKDVWDAIKNNGIVEKLQELPSSGKLLTAPLNVANRPSMDLDGLGTSNANPIIYGWRNKVEFWEAIFECILLQILSCLDN
        GFQW+LTDPTFKGLVISLCLDKDVW+AI+N+GIVEKLQELPSS                                                         
Subjt:  GFQWLLTDPTFKGLVISLCLDKDVWDAIKNNGIVEKLQELPSSGKLLTAPLNVANRPSMDLDGLGTSNANPIIYGWRNKVEFWEAIFECILLQILSCLDN

Query:  NQANVSDPSLSETLDLEGGKGNPRSSKQGPDFGNAILSWILQMSLTKIRELIENFVFLLNNAFRFPGKEKLKPEKRDEIDEKIQSSLVLSLVVLLIVVVA
                         GG GN  SSKQGPDFGN ILSWILQ+SLTKIRELIENFV LLNNAF FPGKEKLKPEK+DEIDEKIQSSLVLSLVV+LIVVVA
Subjt:  NQANVSDPSLSETLDLEGGKGNPRSSKQGPDFGNAILSWILQMSLTKIRELIENFVFLLNNAFRFPGKEKLKPEKRDEIDEKIQSSLVLSLVVLLIVVVA

Query:  RVQIA
        RVQIA
Subjt:  RVQIA

A0A5D3DM87 Uncharacterized protein | 4.0e-88 | 63.28
Query:  MAFKSPSVVSSTCNCGGDTCNSLRHGREIQAIDRKAGIPFPKIDLGPVPSPLEVDAAVTALQSLLQESFSLESMSKWLQPLMNSCDSSILHSRGYRLLCK
        MAFKS S V ST +  G+T NSLRHGREI+A+      PFP  DLGPVPSPLEV+AAV ALQSLLQE FSLESMSKWLQPLMNS  SSILHSRGY+LLCK
Subjt:  MAFKSPSVVSSTCNCGGDTCNSLRHGREIQAIDRKAGIPFPKIDLGPVPSPLEVDAAVTALQSLLQESFSLESMSKWLQPLMNSCDSSILHSRGYRLLCK

Query:  GFQWLLTDPTFKGLVISLCLDKDVWDAIKNNGIVEKLQELPSSGKLLTAPLNVANRPSMDLDGLGTSNANPIIYGWRNKVEFWEAIFECILLQILSCLDN
        GFQW+LTDPTFKGLVISLCLDKDVW+AI+N+GIVEKLQELPSS                                                         
Subjt:  GFQWLLTDPTFKGLVISLCLDKDVWDAIKNNGIVEKLQELPSSGKLLTAPLNVANRPSMDLDGLGTSNANPIIYGWRNKVEFWEAIFECILLQILSCLDN

Query:  NQANVSDPSLSETLDLEGGKGNPRSSKQGPDFGNAILSWILQMSLTKIRELIENFVFLLNNAFRFPGKEKLKPEKRDEIDEKIQSSLVLSLVVLLIVVVA
                         GG GN  SSKQGPDFGN ILSWILQ+SLTKIRELIENFV LLNNAF FPGKEKLKPEK+DEIDEKIQSSLVLSLVV+LIVVVA
Subjt:  NQANVSDPSLSETLDLEGGKGNPRSSKQGPDFGNAILSWILQMSLTKIRELIENFVFLLNNAFRFPGKEKLKPEKRDEIDEKIQSSLVLSLVVLLIVVVA

Query:  RVQIA
        RVQIA
Subjt:  RVQIA

A0A6J1H4L7 uncharacterized protein LOC111460433 | 1.2e-84 | 61.11
Query:  MAFKSPSVVSSTC-NCGGDTCNSLRHGREIQAIDRKAGIPFPKIDLGPVPSPLEVDAAVTALQSLLQESFSLESMSKWLQPLMNSCDSSILHSRGYRLLC
        MAFKS  VVSSTC NCG DTC S   GREIQ++D KAG    K DLGPVPS  EVDAAVTALQSLLQE FS ES+SKWLQPL+NS DSSILHSRGY+LLC
Subjt:  MAFKSPSVVSSTC-NCGGDTCNSLRHGREIQAIDRKAGIPFPKIDLGPVPSPLEVDAAVTALQSLLQESFSLESMSKWLQPLMNSCDSSILHSRGYRLLC

Query:  KGFQWLLTDPTFKGLVISLCLDKDVWDAIKNNGIVEKLQELPSSGKLLTAPLNVANRPSMDLDGLGTSNANPIIYGWRNKVEFWEAIFECILLQILSCLD
        KGFQWLLTDPT KGLV+SLCLDK+VWDAIKNNGIVEKLQELPSS                                                        
Subjt:  KGFQWLLTDPTFKGLVISLCLDKDVWDAIKNNGIVEKLQELPSSGKLLTAPLNVANRPSMDLDGLGTSNANPIIYGWRNKVEFWEAIFECILLQILSCLD

Query:  NNQANVSDPSLSETLDLEGGKGNPRSSKQGPDFGNAILSWILQMSLTKIRELIENFVFLLNNAFRFPGKEKLKPEKRDEIDEKIQSSLVLSLVVLLIVVV
                         EGG GN  SSK+G D G+ ILSWILQMSLTKI ELI+NFV LLNNAF FPGKEKLK EKR+EIDEKIQSS VLSL+++LIVVV
Subjt:  NNQANVSDPSLSETLDLEGGKGNPRSSKQGPDFGNAILSWILQMSLTKIRELIENFVFLLNNAFRFPGKEKLKPEKRDEIDEKIQSSLVLSLVVLLIVVV

Query:  ARVQIA
        +RVQIA
Subjt:  ARVQIA

A0A6J1L5J5 uncharacterized protein LOC111499314 | 5.2e-80 | 59.8
Query:  MAFKSPSVV-SSTC-NCGGDTCNSLRHGREIQAIDRKAGIPFPKIDLGPVPSPLEVDAAVTALQSLLQESFSLESMSKWLQPLMNSCDSSILHSRGYRLL
        MAFKSP VV SSTC NCG DTC SL  GREIQ++D KAG    K DLGPVPS  EVDAAV ALQSLLQE FS+ES+SKWLQPL+NS DSSILHSRGY LL
Subjt:  MAFKSPSVV-SSTC-NCGGDTCNSLRHGREIQAIDRKAGIPFPKIDLGPVPSPLEVDAAVTALQSLLQESFSLESMSKWLQPLMNSCDSSILHSRGYRLL

Query:  CKGFQWLLTDPTFKGLVISLCLDKDVWDAIKNNGIVEKLQELPSSGKLLTAPLNVANRPSMDLDGLGTSNANPIIYGWRNKVEFWEAIFECILLQILSCL
        CKG QWLLTDPT KGLV+SLCLDK+V DAIKNNGIVEKLQELPSS                                                       
Subjt:  CKGFQWLLTDPTFKGLVISLCLDKDVWDAIKNNGIVEKLQELPSSGKLLTAPLNVANRPSMDLDGLGTSNANPIIYGWRNKVEFWEAIFECILLQILSCL

Query:  DNNQANVSDPSLSETLDLEGGKGNPRSSKQGPDFGNAILSWILQMSLTKIRELIENFVFLLNNAFRFPGKEKLKPEKRDEIDEKIQSSLVLSLVVLLIVV
                          EGG GN  SSK G D G+ ILSWILQMSLTKI ELI+NFV LLNNAF FPGKEKL+ EKR+EIDEKIQSS VLSL+++LIVV
Subjt:  DNNQANVSDPSLSETLDLEGGKGNPRSSKQGPDFGNAILSWILQMSLTKIRELIENFVFLLNNAFRFPGKEKLKPEKRDEIDEKIQSSLVLSLVVLLIVV

Query:  VARVQI
        V+R QI
Subjt:  VARVQI

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
AT4G25170.1 Uncharacterised conserved protein (UCP012943) | 3.6e-09 | 29.77
Query:  GPVPSPLEVDAAVTALQSLLQESF--------------------------------SLESMSKWLQPLMNSCDSSILHSRGYRLLCKGFQWLLTDPTFKG
        G VPS  EV  AV+ALQ +   S                                 S  S S W++P M+ C S  L    Y  +   F  L T+P+ + 
Subjt:  GPVPSPLEVDAAVTALQSLLQESF--------------------------------SLESMSKWLQPLMNSCDSSILHSRGYRLLCKGFQWLLTDPTFKG

Query:  LVISLCLDKDVWDAIKNNGIVEKLQELPSSG
        +V+SL  DK VW+A+ NN +V ++++L ++G
Subjt:  LVISLCLDKDVWDAIKNNGIVEKLQELPSSG

AT4G25170.2 Uncharacterised conserved protein (UCP012943) | 2.8e-09 | 36
Query:  SLESMSKWLQPLMNSCDSSILHSRGYRLLCKGFQWLLTDPTFKGLVISLCLDKDVWDAIKNNGIVEKLQELPSSG
        S  S S W++P M+ C S  L    Y  +   F  L T+P+ + +V+SL  DK VW+A+ NN +V ++++L ++G
Subjt:  SLESMSKWLQPLMNSCDSSILHSRGYRLLCKGFQWLLTDPTFKGLVISLCLDKDVWDAIKNNGIVEKLQELPSSG

AT5G61490.1 Uncharacterised conserved protein (UCP012943) | 3.1e-08 | 31.93
Query:  REIQAIDRKAGIPFPKIDLGPVPSPLEVDAAVTALQSLLQESFSLESMSK--------WLQPLMNSCDSSILHSRGYRLLCKGFQWLLTDPTFKGLVISL
        +EI  +  KA    P++D        EVD A +ALQ +  +    ES  +        W++P +  C++S+L       L   F    TDP+ + +V+SL
Subjt:  REIQAIDRKAGIPFPKIDLGPVPSPLEVDAAVTALQSLLQESFSLESMSK--------WLQPLMNSCDSSILHSRGYRLLCKGFQWLLTDPTFKGLVISL

Query:  CLDKDVWDAIKNNGIVEKL
          DK VWDA+ NN +V +L
Subjt:  CLDKDVWDAIKNNGIVEKL


Sequences
CDS sequence
ATGGCCTTCAAATCGCCTTCTGTAGTTTCATCCACTTGCAATTGCGGTGGCGATACTTGTAACAGCTTACGCCATGGAAGAGAAATTCAAGCTATTGATAGAAAAGCTGG
GATACCCTTTCCCAAAATTGATTTGGGACCTGTTCCATCGCCTCTCGAAGTTGATGCTGCAGTTACTGCACTTCAGAGTTTGCTGCAGGAGAGTTTTTCACTTGAATCAA
TGTCAAAATGGCTTCAGCCGCTGATGAACTCCTGTGATTCAAGCATTTTGCATTCTCGTGGTTATCGATTACTTTGTAAAGGTTTTCAATGGCTTCTAACAGATCCTACT
TTCAAGGGACTGGTAATTTCACTGTGTTTGGACAAAGATGTTTGGGATGCTATTAAAAACAATGGGATTGTAGAGAAGCTCCAGGAGTTACCTTCTTCAGGAAAGCTGTT
GACTGCACCCTTAAATGTTGCTAACCGACCATCCATGGACTTGGATGGCCTAGGTACCAGTAATGCAAATCCAATTATCTACGGGTGGAGGAATAAAGTTGAGTTCTGGG
AAGCGATATTTGAGTGCATTCTCTTGCAAATTTTATCTTGTTTGGATAATAACCAAGCTAATGTGAGCGATCCTTCTCTCTCTGAAACTTTAGACCTTGAAGGTGGAAAA
GGAAACCCTAGGAGCTCCAAACAGGGACCTGACTTTGGTAATGCCATTCTAAGCTGGATTTTGCAGATGTCACTCACAAAAATCAGGGAGCTAATCGAGAACTTTGTGTT
CCTGTTGAACAACGCATTTCGTTTTCCTGGGAAAGAGAAACTGAAGCCAGAGAAAAGAGATGAGATAGATGAAAAAATCCAATCTTCATTGGTTCTGTCCTTGGTCGTCC
TGTTGATTGTGGTCGTTGCTCGAGTTCAGATCGCATAA
mRNA sequence
ATGGCCTTCAAATCGCCTTCTGTAGTTTCATCCACTTGCAATTGCGGTGGCGATACTTGTAACAGCTTACGCCATGGAAGAGAAATTCAAGCTATTGATAGAAAAGCTGG
GATACCCTTTCCCAAAATTGATTTGGGACCTGTTCCATCGCCTCTCGAAGTTGATGCTGCAGTTACTGCACTTCAGAGTTTGCTGCAGGAGAGTTTTTCACTTGAATCAA
TGTCAAAATGGCTTCAGCCGCTGATGAACTCCTGTGATTCAAGCATTTTGCATTCTCGTGGTTATCGATTACTTTGTAAAGGTTTTCAATGGCTTCTAACAGATCCTACT
TTCAAGGGACTGGTAATTTCACTGTGTTTGGACAAAGATGTTTGGGATGCTATTAAAAACAATGGGATTGTAGAGAAGCTCCAGGAGTTACCTTCTTCAGGAAAGCTGTT
GACTGCACCCTTAAATGTTGCTAACCGACCATCCATGGACTTGGATGGCCTAGGTACCAGTAATGCAAATCCAATTATCTACGGGTGGAGGAATAAAGTTGAGTTCTGGG
AAGCGATATTTGAGTGCATTCTCTTGCAAATTTTATCTTGTTTGGATAATAACCAAGCTAATGTGAGCGATCCTTCTCTCTCTGAAACTTTAGACCTTGAAGGTGGAAAA
GGAAACCCTAGGAGCTCCAAACAGGGACCTGACTTTGGTAATGCCATTCTAAGCTGGATTTTGCAGATGTCACTCACAAAAATCAGGGAGCTAATCGAGAACTTTGTGTT
CCTGTTGAACAACGCATTTCGTTTTCCTGGGAAAGAGAAACTGAAGCCAGAGAAAAGAGATGAGATAGATGAAAAAATCCAATCTTCATTGGTTCTGTCCTTGGTCGTCC
TGTTGATTGTGGTCGTTGCTCGAGTTCAGATCGCATAA
Protein sequence
MAFKSPSVVSSTCNCGGDTCNSLRHGREIQAIDRKAGIPFPKIDLGPVPSPLEVDAAVTALQSLLQESFSLESMSKWLQPLMNSCDSSILHSRGYRLLCKGFQWLLTDPT
FKGLVISLCLDKDVWDAIKNNGIVEKLQELPSSGKLLTAPLNVANRPSMDLDGLGTSNANPIIYGWRNKVEFWEAIFECILLQILSCLDNNQANVSDPSLSETLDLEGGK
GNPRSSKQGPDFGNAILSWILQMSLTKIRELIENFVFLLNNAFRFPGKEKLKPEKRDEIDEKIQSSLVLSLVVLLIVVVARVQIA
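
The protein sequence can be cross-checked against the CDS by translating it codon by codon. The following is a minimal sketch, assuming the standard genetic code applies to this nuclear gene. The `translate` helper and its constants are hypothetical names, not part of the database.

```python
# Standard genetic code in compact form: codons are ordered with bases T, C, A, G
# at each position, so the amino acid for a codon sits at index 16*b1 + 4*b2 + b3.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(cds: str) -> str:
    """Translate a CDS into a protein string, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped sequences
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        b1, b2, b3 = (BASES.index(base) for base in cds[i:i + 3])
        amino_acid = AMINO_ACIDS[16 * b1 + 4 * b2 + b3]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 27 nucleotides of the CDS listed above
print(translate("ATGGCCTTCAAATCGCCTTCTGTAGTT"))
```

Passing the full CDS, with its line breaks, to `translate` and comparing the result to the joined protein lines is a quick consistency check. Bases outside A, C, G and T raise a `ValueError` in this sketch.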