; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi08G011920 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi08G011920
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionTail fiber
Genome locationchr08:20502778..20506557
RNA-Seq ExpressionLsi08G011920
SyntenyLsi08G011920
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008445191.1 PREDICTED: uncharacterized protein LOC103488297 isoform X1 [Cucumis melo]9.8e-9877.69Show/hide
Query:  MATSTPSSTSTSSKSWIRNLSSVASRIYFFLIILQIPLFRIPCRSGMCTTPLHVTSSQLIASEVFPAPVVKALLYPGAVVNGLVMNLTVPGWSDLFNIYN
        MATSTPSSTST+SKSW+RNLSS+ASR+YF LIILQIPLFRI CRSGMCTTPLHVTSSQLIASEVFPAPVVKALLYPGAVVNGLVMNLTVP WSDLF+IYN
Subjt:  MATSTPSSTSTSSKSWIRNLSSVASRIYFFLIILQIPLFRIPCRSGMCTTPLHVTSSQLIASEVFPAPVVKALLYPGAVVNGLVMNLTVPGWSDLFNIYN

Query:  LTNIKEASAVTDLQRLEVNNNLCLEETNDKPVSHQANMLIDLKLANESGSQRRIWMLNQRSLEVLAGSYFSVAGAFVGLLKPGRMSMFGTLLVIWGLVKE
        LTNIKEASAVTDLQR                                              LEVLAGSYFSVAGAFVGLLKPGRMSMFGTLLVIWGLVKE
Subjt:  LTNIKEASAVTDLQRLEVNNNLCLEETNDKPVSHQANMLIDLKLANESGSQRRIWMLNQRSLEVLAGSYFSVAGAFVGLLKPGRMSMFGTLLVIWGLVKE

Query:  GILGKPVNTDPAKSVYVYPTMILAVICAFSSVKYDVKKAVRSAPARPIAKPLQSSSKSKL
        GILGKPVNTDP K+VYVYPTMILAVICAFSSVKYDVKK VR APARPIAKPLQSSSKSKL
Subjt:  GILGKPVNTDPAKSVYVYPTMILAVICAFSSVKYDVKKAVRSAPARPIAKPLQSSSKSKL

XP_008445192.1 PREDICTED: uncharacterized protein LOC103488297 isoform X2 [Cucumis melo]3.4e-9877.78Show/hide
Query:  MATSTPSSTSTSSKSWIRNLSSVASRIYFFLIILQIPLFRIPCRSGMCTTPLHVTSSQLIASEVFPAPVVKALLYPGAVVNGLVMNLTVPGWSDLFNIYN
        MATSTPSSTST+SKSW+RNLSS+ASR+YF LIILQIPLFRI CRSGMCTTPLHVTSSQLIASEVFPAPVVKALLYPGAVVNGLVMNLTVP WSDLF+IYN
Subjt:  MATSTPSSTSTSSKSWIRNLSSVASRIYFFLIILQIPLFRIPCRSGMCTTPLHVTSSQLIASEVFPAPVVKALLYPGAVVNGLVMNLTVPGWSDLFNIYN

Query:  LTNIKEASAVTDLQRLEVNNNLCLEETNDKPVSHQANMLIDLKLANESGSQRRIWMLNQRSLEVLAGSYFSVAGAFVGLLKPGRMSMFGTLLVIWGLVKE
        LTNIKEASAVTDLQR                                              LEVLAGSYFSVAGAFVGLLKPGRMSMFGTLLVIWGLVKE
Subjt:  LTNIKEASAVTDLQRLEVNNNLCLEETNDKPVSHQANMLIDLKLANESGSQRRIWMLNQRSLEVLAGSYFSVAGAFVGLLKPGRMSMFGTLLVIWGLVKE

Query:  GILGKPVNTDPAKSVYVYPTMILAVICAFSSVKYDVKKAVRSAPARPIAKPLQSSSKSKLK
        GILGKPVNTDP K+VYVYPTMILAVICAFSSVKYDVKK VR APARPIAKPLQSSSKSKLK
Subjt:  GILGKPVNTDPAKSVYVYPTMILAVICAFSSVKYDVKKAVRSAPARPIAKPLQSSSKSKLK

XP_008445193.1 PREDICTED: uncharacterized protein LOC103488297 isoform X4 [Cucumis melo]3.4e-9877.78Show/hide
Query:  MATSTPSSTSTSSKSWIRNLSSVASRIYFFLIILQIPLFRIPCRSGMCTTPLHVTSSQLIASEVFPAPVVKALLYPGAVVNGLVMNLTVPGWSDLFNIYN
        MATSTPSSTST+SKSW+RNLSS+ASR+YF LIILQIPLFRI CRSGMCTTPLHVTSSQLIASEVFPAPVVKALLYPGAVVNGLVMNLTVP WSDLF+IYN
Subjt:  MATSTPSSTSTSSKSWIRNLSSVASRIYFFLIILQIPLFRIPCRSGMCTTPLHVTSSQLIASEVFPAPVVKALLYPGAVVNGLVMNLTVPGWSDLFNIYN

Query:  LTNIKEASAVTDLQRLEVNNNLCLEETNDKPVSHQANMLIDLKLANESGSQRRIWMLNQRSLEVLAGSYFSVAGAFVGLLKPGRMSMFGTLLVIWGLVKE
        LTNIKEASAVTDLQR                                              LEVLAGSYFSVAGAFVGLLKPGRMSMFGTLLVIWGLVKE
Subjt:  LTNIKEASAVTDLQRLEVNNNLCLEETNDKPVSHQANMLIDLKLANESGSQRRIWMLNQRSLEVLAGSYFSVAGAFVGLLKPGRMSMFGTLLVIWGLVKE

Query:  GILGKPVNTDPAKSVYVYPTMILAVICAFSSVKYDVKKAVRSAPARPIAKPLQSSSKSKLK
        GILGKPVNTDP K+VYVYPTMILAVICAFSSVKYDVKK VR APARPIAKPLQSSSKSKLK
Subjt:  GILGKPVNTDPAKSVYVYPTMILAVICAFSSVKYDVKKAVRSAPARPIAKPLQSSSKSKLK

XP_016900009.1 PREDICTED: uncharacterized protein LOC103488297 isoform X3 [Cucumis melo]3.4e-9877.78Show/hide
Query:  MATSTPSSTSTSSKSWIRNLSSVASRIYFFLIILQIPLFRIPCRSGMCTTPLHVTSSQLIASEVFPAPVVKALLYPGAVVNGLVMNLTVPGWSDLFNIYN
        MATSTPSSTST+SKSW+RNLSS+ASR+YF LIILQIPLFRI CRSGMCTTPLHVTSSQLIASEVFPAPVVKALLYPGAVVNGLVMNLTVP WSDLF+IYN
Subjt:  MATSTPSSTSTSSKSWIRNLSSVASRIYFFLIILQIPLFRIPCRSGMCTTPLHVTSSQLIASEVFPAPVVKALLYPGAVVNGLVMNLTVPGWSDLFNIYN

Query:  LTNIKEASAVTDLQRLEVNNNLCLEETNDKPVSHQANMLIDLKLANESGSQRRIWMLNQRSLEVLAGSYFSVAGAFVGLLKPGRMSMFGTLLVIWGLVKE
        LTNIKEASAVTDLQR                                              LEVLAGSYFSVAGAFVGLLKPGRMSMFGTLLVIWGLVKE
Subjt:  LTNIKEASAVTDLQRLEVNNNLCLEETNDKPVSHQANMLIDLKLANESGSQRRIWMLNQRSLEVLAGSYFSVAGAFVGLLKPGRMSMFGTLLVIWGLVKE

Query:  GILGKPVNTDPAKSVYVYPTMILAVICAFSSVKYDVKKAVRSAPARPIAKPLQSSSKSKLK
        GILGKPVNTDP K+VYVYPTMILAVICAFSSVKYDVKK VR APARPIAKPLQSSSKSKLK
Subjt:  GILGKPVNTDPAKSVYVYPTMILAVICAFSSVKYDVKKAVRSAPARPIAKPLQSSSKSKLK

XP_038883986.1 uncharacterized protein LOC120074948 [Benincasa hispida]9.8e-9877.78Show/hide
Query:  MATSTPSSTSTSSKSWIRNLSSVASRIYFFLIILQIPLFRIPCRSGMCTTPLHVTSSQLIASEVFPAPVVKALLYPGAVVNGLVMNLTVPGWSDLFNIYN
        MATST SSTS +SKSWIRNLSS+ASRIYFFLIILQIPLFRIPCRSGMCTTPLHVTSSQLIASEVFPAPVVKALLYPGA+VNGLVMNLTVP W++LF+IYN
Subjt:  MATSTPSSTSTSSKSWIRNLSSVASRIYFFLIILQIPLFRIPCRSGMCTTPLHVTSSQLIASEVFPAPVVKALLYPGAVVNGLVMNLTVPGWSDLFNIYN

Query:  LTNIKEASAVTDLQRLEVNNNLCLEETNDKPVSHQANMLIDLKLANESGSQRRIWMLNQRSLEVLAGSYFSVAGAFVGLLKPGRMSMFGTLLVIWGLVKE
        LTNIKEASAVTDLQR                                              LEVLAGSYFSVAGAFVGLLKPGRMSMFGTLLVIWGLVKE
Subjt:  LTNIKEASAVTDLQRLEVNNNLCLEETNDKPVSHQANMLIDLKLANESGSQRRIWMLNQRSLEVLAGSYFSVAGAFVGLLKPGRMSMFGTLLVIWGLVKE

Query:  GILGKPVNTDPAKSVYVYPTMILAVICAFSSVKYDVKKAVRSAPARPIAKPLQSSSKSKLK
        GILGKPVNTDPAK+VYVYPTMILAVICAFSSVKYDVKK VR APARPIAKPLQSSSKSKLK
Subjt:  GILGKPVNTDPAKSVYVYPTMILAVICAFSSVKYDVKKAVRSAPARPIAKPLQSSSKSKLK

TrEMBL top hitse value%identityAlignment
A0A1S3BC26 uncharacterized protein LOC103488297 isoform X14.7e-9877.69Show/hide
Query:  MATSTPSSTSTSSKSWIRNLSSVASRIYFFLIILQIPLFRIPCRSGMCTTPLHVTSSQLIASEVFPAPVVKALLYPGAVVNGLVMNLTVPGWSDLFNIYN
        MATSTPSSTST+SKSW+RNLSS+ASR+YF LIILQIPLFRI CRSGMCTTPLHVTSSQLIASEVFPAPVVKALLYPGAVVNGLVMNLTVP WSDLF+IYN
Subjt:  MATSTPSSTSTSSKSWIRNLSSVASRIYFFLIILQIPLFRIPCRSGMCTTPLHVTSSQLIASEVFPAPVVKALLYPGAVVNGLVMNLTVPGWSDLFNIYN

Query:  LTNIKEASAVTDLQRLEVNNNLCLEETNDKPVSHQANMLIDLKLANESGSQRRIWMLNQRSLEVLAGSYFSVAGAFVGLLKPGRMSMFGTLLVIWGLVKE
        LTNIKEASAVTDLQR                                              LEVLAGSYFSVAGAFVGLLKPGRMSMFGTLLVIWGLVKE
Subjt:  LTNIKEASAVTDLQRLEVNNNLCLEETNDKPVSHQANMLIDLKLANESGSQRRIWMLNQRSLEVLAGSYFSVAGAFVGLLKPGRMSMFGTLLVIWGLVKE

Query:  GILGKPVNTDPAKSVYVYPTMILAVICAFSSVKYDVKKAVRSAPARPIAKPLQSSSKSKL
        GILGKPVNTDP K+VYVYPTMILAVICAFSSVKYDVKK VR APARPIAKPLQSSSKSKL
Subjt:  GILGKPVNTDPAKSVYVYPTMILAVICAFSSVKYDVKKAVRSAPARPIAKPLQSSSKSKL

A0A1S3BC32 uncharacterized protein LOC103488297 isoform X21.6e-9877.78Show/hide
Query:  MATSTPSSTSTSSKSWIRNLSSVASRIYFFLIILQIPLFRIPCRSGMCTTPLHVTSSQLIASEVFPAPVVKALLYPGAVVNGLVMNLTVPGWSDLFNIYN
        MATSTPSSTST+SKSW+RNLSS+ASR+YF LIILQIPLFRI CRSGMCTTPLHVTSSQLIASEVFPAPVVKALLYPGAVVNGLVMNLTVP WSDLF+IYN
Subjt:  MATSTPSSTSTSSKSWIRNLSSVASRIYFFLIILQIPLFRIPCRSGMCTTPLHVTSSQLIASEVFPAPVVKALLYPGAVVNGLVMNLTVPGWSDLFNIYN

Query:  LTNIKEASAVTDLQRLEVNNNLCLEETNDKPVSHQANMLIDLKLANESGSQRRIWMLNQRSLEVLAGSYFSVAGAFVGLLKPGRMSMFGTLLVIWGLVKE
        LTNIKEASAVTDLQR                                              LEVLAGSYFSVAGAFVGLLKPGRMSMFGTLLVIWGLVKE
Subjt:  LTNIKEASAVTDLQRLEVNNNLCLEETNDKPVSHQANMLIDLKLANESGSQRRIWMLNQRSLEVLAGSYFSVAGAFVGLLKPGRMSMFGTLLVIWGLVKE

Query:  GILGKPVNTDPAKSVYVYPTMILAVICAFSSVKYDVKKAVRSAPARPIAKPLQSSSKSKLK
        GILGKPVNTDP K+VYVYPTMILAVICAFSSVKYDVKK VR APARPIAKPLQSSSKSKLK
Subjt:  GILGKPVNTDPAKSVYVYPTMILAVICAFSSVKYDVKKAVRSAPARPIAKPLQSSSKSKLK

A0A1S3BCZ0 uncharacterized protein LOC103488297 isoform X41.6e-9877.78Show/hide
Query:  MATSTPSSTSTSSKSWIRNLSSVASRIYFFLIILQIPLFRIPCRSGMCTTPLHVTSSQLIASEVFPAPVVKALLYPGAVVNGLVMNLTVPGWSDLFNIYN
        MATSTPSSTST+SKSW+RNLSS+ASR+YF LIILQIPLFRI CRSGMCTTPLHVTSSQLIASEVFPAPVVKALLYPGAVVNGLVMNLTVP WSDLF+IYN
Subjt:  MATSTPSSTSTSSKSWIRNLSSVASRIYFFLIILQIPLFRIPCRSGMCTTPLHVTSSQLIASEVFPAPVVKALLYPGAVVNGLVMNLTVPGWSDLFNIYN

Query:  LTNIKEASAVTDLQRLEVNNNLCLEETNDKPVSHQANMLIDLKLANESGSQRRIWMLNQRSLEVLAGSYFSVAGAFVGLLKPGRMSMFGTLLVIWGLVKE
        LTNIKEASAVTDLQR                                              LEVLAGSYFSVAGAFVGLLKPGRMSMFGTLLVIWGLVKE
Subjt:  LTNIKEASAVTDLQRLEVNNNLCLEETNDKPVSHQANMLIDLKLANESGSQRRIWMLNQRSLEVLAGSYFSVAGAFVGLLKPGRMSMFGTLLVIWGLVKE

Query:  GILGKPVNTDPAKSVYVYPTMILAVICAFSSVKYDVKKAVRSAPARPIAKPLQSSSKSKLK
        GILGKPVNTDP K+VYVYPTMILAVICAFSSVKYDVKK VR APARPIAKPLQSSSKSKLK
Subjt:  GILGKPVNTDPAKSVYVYPTMILAVICAFSSVKYDVKKAVRSAPARPIAKPLQSSSKSKLK

A0A1S4DVJ9 uncharacterized protein LOC103488297 isoform X31.6e-9877.78Show/hide
Query:  MATSTPSSTSTSSKSWIRNLSSVASRIYFFLIILQIPLFRIPCRSGMCTTPLHVTSSQLIASEVFPAPVVKALLYPGAVVNGLVMNLTVPGWSDLFNIYN
        MATSTPSSTST+SKSW+RNLSS+ASR+YF LIILQIPLFRI CRSGMCTTPLHVTSSQLIASEVFPAPVVKALLYPGAVVNGLVMNLTVP WSDLF+IYN
Subjt:  MATSTPSSTSTSSKSWIRNLSSVASRIYFFLIILQIPLFRIPCRSGMCTTPLHVTSSQLIASEVFPAPVVKALLYPGAVVNGLVMNLTVPGWSDLFNIYN

Query:  LTNIKEASAVTDLQRLEVNNNLCLEETNDKPVSHQANMLIDLKLANESGSQRRIWMLNQRSLEVLAGSYFSVAGAFVGLLKPGRMSMFGTLLVIWGLVKE
        LTNIKEASAVTDLQR                                              LEVLAGSYFSVAGAFVGLLKPGRMSMFGTLLVIWGLVKE
Subjt:  LTNIKEASAVTDLQRLEVNNNLCLEETNDKPVSHQANMLIDLKLANESGSQRRIWMLNQRSLEVLAGSYFSVAGAFVGLLKPGRMSMFGTLLVIWGLVKE

Query:  GILGKPVNTDPAKSVYVYPTMILAVICAFSSVKYDVKKAVRSAPARPIAKPLQSSSKSKLK
        GILGKPVNTDP K+VYVYPTMILAVICAFSSVKYDVKK VR APARPIAKPLQSSSKSKLK
Subjt:  GILGKPVNTDPAKSVYVYPTMILAVICAFSSVKYDVKKAVRSAPARPIAKPLQSSSKSKLK

A0A5A7VF29 Uncharacterized protein1.6e-9877.78Show/hide
Query:  MATSTPSSTSTSSKSWIRNLSSVASRIYFFLIILQIPLFRIPCRSGMCTTPLHVTSSQLIASEVFPAPVVKALLYPGAVVNGLVMNLTVPGWSDLFNIYN
        MATSTPSSTST+SKSW+RNLSS+ASR+YF LIILQIPLFRI CRSGMCTTPLHVTSSQLIASEVFPAPVVKALLYPGAVVNGLVMNLTVP WSDLF+IYN
Subjt:  MATSTPSSTSTSSKSWIRNLSSVASRIYFFLIILQIPLFRIPCRSGMCTTPLHVTSSQLIASEVFPAPVVKALLYPGAVVNGLVMNLTVPGWSDLFNIYN

Query:  LTNIKEASAVTDLQRLEVNNNLCLEETNDKPVSHQANMLIDLKLANESGSQRRIWMLNQRSLEVLAGSYFSVAGAFVGLLKPGRMSMFGTLLVIWGLVKE
        LTNIKEASAVTDLQR                                              LEVLAGSYFSVAGAFVGLLKPGRMSMFGTLLVIWGLVKE
Subjt:  LTNIKEASAVTDLQRLEVNNNLCLEETNDKPVSHQANMLIDLKLANESGSQRRIWMLNQRSLEVLAGSYFSVAGAFVGLLKPGRMSMFGTLLVIWGLVKE

Query:  GILGKPVNTDPAKSVYVYPTMILAVICAFSSVKYDVKKAVRSAPARPIAKPLQSSSKSKLK
        GILGKPVNTDP K+VYVYPTMILAVICAFSSVKYDVKK VR APARPIAKPLQSSSKSKLK
Subjt:  GILGKPVNTDPAKSVYVYPTMILAVICAFSSVKYDVKKAVRSAPARPIAKPLQSSSKSKLK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G80200.1 unknown protein2.0e-2429.04Show/hide
Query:  SSKSWIRNLSSVASRIYFFLIILQIPLFR-----------------------------IPCRSGMCTTPLHVTSSQLIASEVFPAPVVKALLYPGAVVNG
        S K W   +S +AS ++  LI+ QIPLFR                             + CR+  C TPL V SS+LIA+++ P+ +VK LLYPGA+   
Subjt:  SSKSWIRNLSSVASRIYFFLIILQIPLFR-----------------------------IPCRSGMCTTPLHVTSSQLIASEVFPAPVVKALLYPGAVVNG

Query:  LVMNLTVPGWSDLFNIYNLTNI-KEASAVTDLQRLEVNNNLCLEETNDKPVSHQANMLIDLKLANESGSQRRIWMLNQRSLEVLAGSYFSVAGAFVGLLK
        L     +P +  LF  Y+   + + +S  TD+                                                LEV AGS   + GA + L K
Subjt:  LVMNLTVPGWSDLFNIYNLTNI-KEASAVTDLQRLEVNNNLCLEETNDKPVSHQANMLIDLKLANESGSQRRIWMLNQRSLEVLAGSYFSVAGAFVGLLK

Query:  PGRMSMFGTLLVIWGLVKEGILGKPVN---TDPAKSVYVYPTMILAVICAFSSVKYDVKKAVRSAPARPIAK
        P R++  GTLL+ WGL+++ +L    +   +    SV VYPT+ LA + AF S++ DV+K +R   +  ++K
Subjt:  PGRMSMFGTLLVIWGLVKEGILGKPVN---TDPAKSVYVYPTMILAVICAFSSVKYDVKKAVRSAPARPIAK

AT5G11280.1 unknown protein1.7e-8463.86Show/hide
Query:  SKSWIRNLSSVASRIYFFLIILQIPLFRIPCRSGMCTTPLHVTSSQLIASEVFPAPVVKALLYPGAVVNGLVMNLTVPGWSDLFNIYNLTNIKEASAVTD
        SK WI+  +S+AS +YF LI+ QIPLFR+PCRSGMC++P+HVTSSQLI+SE+FP P++KALLYPGAVVNGL +N+T P W ++ +IYNLTN+KEASAVTD
Subjt:  SKSWIRNLSSVASRIYFFLIILQIPLFRIPCRSGMCTTPLHVTSSQLIASEVFPAPVVKALLYPGAVVNGLVMNLTVPGWSDLFNIYNLTNIKEASAVTD

Query:  LQRLEVNNNLCLEETNDKPVSHQANMLIDLKLANESGSQRRIWMLNQRSLEVLAGSYFSVAGAFVGLLKPGRMSMFGTLLVIWGLVKEGILGKPVNTDPA
        LQR                                              LEVLAGSYFSVAGAFVGLLKPGRMSMFG+LL++WGLVKEGILGKPVNTDPA
Subjt:  LQRLEVNNNLCLEETNDKPVSHQANMLIDLKLANESGSQRRIWMLNQRSLEVLAGSYFSVAGAFVGLLKPGRMSMFGTLLVIWGLVKEGILGKPVNTDPA

Query:  KSVYVYPTMILAVICAFSSVKYDVKKAVRSAPARPIAKPLQSSSKSKLK
        K+VYVYPTM+LA+ICAFS +KYD++KA R+APARPIAKPL SSSKSKLK
Subjt:  KSVYVYPTMILAVICAFSSVKYDVKKAVRSAPARPIAKPLQSSSKSKLK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAACCTCTACGCCATCGTCGACATCAACATCATCAAAGAGTTGGATCAGAAATCTTTCTTCAGTTGCATCTCGTATCTATTTCTTCCTTATCATACTTCAGATCCC
TCTCTTCAGGATCCCATGCAGATCTGGCATGTGTACAACCCCTTTGCACGTAACTTCATCCCAGTTAATTGCAAGTGAAGTCTTTCCAGCACCTGTAGTTAAGGCACTTC
TCTATCCTGGAGCAGTTGTGAATGGCCTTGTCATGAACTTGACTGTTCCTGGCTGGAGCGATCTGTTCAACATCTATAATTTGACCAACATTAAGGAAGCCTCTGCTGTG
ACTGATCTTCAACGCTTAGAGGTGAACAACAACCTGTGTCTGGAAGAAACAAATGATAAGCCAGTTAGCCACCAAGCCAATATGCTCATTGACTTAAAATTAGCCAATGA
ATCTGGGAGCCAACGTAGGATTTGGATGCTTAATCAGAGATCCCTGGAGGTTCTTGCGGGAAGCTATTTCTCAGTGGCTGGAGCATTTGTGGGTCTTTTGAAGCCCGGGA
GGATGAGCATGTTTGGAACTCTGTTGGTAATTTGGGGTCTTGTTAAGGAAGGAATTCTGGGAAAACCTGTGAACACAGATCCTGCGAAATCTGTTTATGTTTATCCTACA
ATGATTCTTGCGGTGATCTGTGCTTTCTCATCGGTTAAGTATGATGTGAAGAAGGCAGTTAGAAGTGCCCCTGCTCGACCAATTGCAAAGCCCCTTCAAAGCTCATCAAA
ATCTAAGCTTAAGTGA
mRNA sequenceShow/hide mRNA sequence
CAACATTCCCAATTTCAACGTCCTTTGGTTCGATACTTTGATCCTTCCCGATCGATTTCTAATTAGACCGCCATTACCAGACCATCTTTCTTCCATCTCCACCGTCTGGC
CTTTTTTTCCTCTCAAGTTGTCAACAATGGCAACCTCTACGCCATCGTCGACATCAACATCATCAAAGAGTTGGATCAGAAATCTTTCTTCAGTTGCATCTCGTATCTAT
TTCTTCCTTATCATACTTCAGATCCCTCTCTTCAGGATCCCATGCAGATCTGGCATGTGTACAACCCCTTTGCACGTAACTTCATCCCAGTTAATTGCAAGTGAAGTCTT
TCCAGCACCTGTAGTTAAGGCACTTCTCTATCCTGGAGCAGTTGTGAATGGCCTTGTCATGAACTTGACTGTTCCTGGCTGGAGCGATCTGTTCAACATCTATAATTTGA
CCAACATTAAGGAAGCCTCTGCTGTGACTGATCTTCAACGCTTAGAGGTGAACAACAACCTGTGTCTGGAAGAAACAAATGATAAGCCAGTTAGCCACCAAGCCAATATG
CTCATTGACTTAAAATTAGCCAATGAATCTGGGAGCCAACGTAGGATTTGGATGCTTAATCAGAGATCCCTGGAGGTTCTTGCGGGAAGCTATTTCTCAGTGGCTGGAGC
ATTTGTGGGTCTTTTGAAGCCCGGGAGGATGAGCATGTTTGGAACTCTGTTGGTAATTTGGGGTCTTGTTAAGGAAGGAATTCTGGGAAAACCTGTGAACACAGATCCTG
CGAAATCTGTTTATGTTTATCCTACAATGATTCTTGCGGTGATCTGTGCTTTCTCATCGGTTAAGTATGATGTGAAGAAGGCAGTTAGAAGTGCCCCTGCTCGACCAATT
GCAAAGCCCCTTCAAAGCTCATCAAAATCTAAGCTTAAGTGAGTCCAAATGTAATTTCCTTCTACGGGTTTTGTTTGATTCTTCACTCCTGACCACTTCAAAACATCATT
TATGCATATCCCATTATTCATTTTCTGGATCGATTAGGTGTGTCGTGAATGTGATATACAATTTTCTTCTCAGATTTTCATTTTCTGGATCTATTAGCTGTCTCGTGAAC
TGCATCTATTTGTTCAATTATAAATATATATATATAGATCAGAAAGAGAGGTCCCTCATTTCTTGTTACTTCTTCTCATAGAACATTGAAACTATTATTATTGACCAAAA
TTTGTAAAGTTACTACAAATGTG
Protein sequenceShow/hide protein sequence
MATSTPSSTSTSSKSWIRNLSSVASRIYFFLIILQIPLFRIPCRSGMCTTPLHVTSSQLIASEVFPAPVVKALLYPGAVVNGLVMNLTVPGWSDLFNIYNLTNIKEASAV
TDLQRLEVNNNLCLEETNDKPVSHQANMLIDLKLANESGSQRRIWMLNQRSLEVLAGSYFSVAGAFVGLLKPGRMSMFGTLLVIWGLVKEGILGKPVNTDPAKSVYVYPT
MILAVICAFSSVKYDVKKAVRSAPARPIAKPLQSSSKSKLK