; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc01G15940 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc01G15940
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionTransmembrane protein
Genome locationClcChr01:28732870..28735870
RNA-Seq ExpressionClc01G15940
SyntenyClc01G15940
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK12956.1 uncharacterized protein E5676_scaffold255G005520 [Cucumis melo var. makuwa]8.7e-11390.61Show/hide
Query:  MFRALELPPPCPAAKLNLVHALPSDVKICRLPYNLCLPNRRLSLLSIRAQSLSDPSTSSRYTETIGHSSPPFLQFSHCTLTQRHILVLNVVACATAISAT
        MF ALE+ PPCPAAKLN+V ALPS+ K  RLPYNL LPNRRLSLLSIRAQSLSDPSTSSRYT+TIG+SSP FLQF  CTLTQRHILVLNVVACATAISAT
Subjt:  MFRALELPPPCPAAKLNLVHALPSDVKICRLPYNLCLPNRRLSLLSIRAQSLSDPSTSSRYTETIGHSSPPFLQFSHCTLTQRHILVLNVVACATAISAT

Query:  WLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSDLGQDITQGVRSSTRAVRVAEERLRRLTTMSPTASVQEMTVTNLGVE
        WLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSDLGQDITQGVRSSTRAVRVAEERLRRLT MSPTASVQEMT+TNLGV+
Subjt:  WLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSDLGQDITQGVRSSTRAVRVAEERLRRLTTMSPTASVQEMTVTNLGVE

Query:  TAEPVLAKRARDIKEGIVKGRSVFQLFLSLTRFSRLALNYFSKRE
         A+PVLAKRARDIKEGIVKGRS+FQLFLS+TRFSRLALNYFSKR+
Subjt:  TAEPVLAKRARDIKEGIVKGRSVFQLFLSLTRFSRLALNYFSKRE

XP_008440148.1 PREDICTED: uncharacterized protein LOC103484701 isoform X1 [Cucumis melo]1.9e-11290.98Show/hide
Query:  MFRALELPPPCPAAKLNLVHALPSDVKICRLPYNLCLPNRRLSLLSIRAQSLSDPSTSSRYTETIGHSSPPFLQFSHCTLTQRHILVLNVVACATAISAT
        MF ALE+ PPCPAAKLN+V ALPS+ K  RLPYNL LPNRRLSLLSIRAQSLSDPSTSSRYT+TIG+SSP FLQF  CTLTQRHILVLNVVACATAISAT
Subjt:  MFRALELPPPCPAAKLNLVHALPSDVKICRLPYNLCLPNRRLSLLSIRAQSLSDPSTSSRYTETIGHSSPPFLQFSHCTLTQRHILVLNVVACATAISAT

Query:  WLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSDLGQDITQGVRSSTRAVRVAEERLRRLTTMSPTASVQEMTVTNLGVE
        WLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSDLGQDITQGVRSSTRAVRVAEERLRRLT MSPTASVQEMT+TNLGV+
Subjt:  WLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSDLGQDITQGVRSSTRAVRVAEERLRRLTTMSPTASVQEMTVTNLGVE

Query:  TAEPVLAKRARDIKEGIVKGRSVFQLFLSLTRFSRLALNYFSKR
         A+PVLAKRARDIKEGIVKGRS+FQLFLS+TRFSRLALNYFSKR
Subjt:  TAEPVLAKRARDIKEGIVKGRSVFQLFLSLTRFSRLALNYFSKR

XP_008440149.1 PREDICTED: uncharacterized protein LOC103484701 isoform X2 [Cucumis melo]4.1e-11090.16Show/hide
Query:  MFRALELPPPCPAAKLNLVHALPSDVKICRLPYNLCLPNRRLSLLSIRAQSLSDPSTSSRYTETIGHSSPPFLQFSHCTLTQRHILVLNVVACATAISAT
        MF ALE+ PPCPAAKLN+V ALPS+ K  RLPYNL LPNRRLSLLSIRAQSLSDPSTSSRYT+TIG+SSP FLQF  CTLTQRHILVLNVVACATAISAT
Subjt:  MFRALELPPPCPAAKLNLVHALPSDVKICRLPYNLCLPNRRLSLLSIRAQSLSDPSTSSRYTETIGHSSPPFLQFSHCTLTQRHILVLNVVACATAISAT

Query:  WLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSDLGQDITQGVRSSTRAVRVAEERLRRLTTMSPTASVQEMTVTNLGVE
        WLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSDLGQDITQGVRSSTRAVRVAEERLRRLT MSPT  VQEMT+TNLGV+
Subjt:  WLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSDLGQDITQGVRSSTRAVRVAEERLRRLTTMSPTASVQEMTVTNLGVE

Query:  TAEPVLAKRARDIKEGIVKGRSVFQLFLSLTRFSRLALNYFSKR
         A+PVLAKRARDIKEGIVKGRS+FQLFLS+TRFSRLALNYFSKR
Subjt:  TAEPVLAKRARDIKEGIVKGRSVFQLFLSLTRFSRLALNYFSKR

XP_022977945.1 uncharacterized protein LOC111478086 isoform X1 [Cucurbita maxima]4.5e-10988.76Show/hide
Query:  MFRALELPPPCPAAKLNLVHALPSDVKICR-LPYNLCLPNRRLSLLSIRAQSL----SDPSTSSRYTETIGHSSPPFLQFSHCTLTQRHILVLNVVACAT
        MFRALEL PPCPAAK  LVHA PSDVK+CR  P+NL LPNRRLSLLS+RAQSL    SDPSTS RYTETIGHSSP F+QFS CTLTQRHILVLNVVACAT
Subjt:  MFRALELPPPCPAAKLNLVHALPSDVKICR-LPYNLCLPNRRLSLLSIRAQSL----SDPSTSSRYTETIGHSSPPFLQFSHCTLTQRHILVLNVVACAT

Query:  AISATWLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSDLGQDITQGVRSSTRAVRVAEERLRRLTTMSPTASVQEMTVT
        AI+ATWLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSDLGQ+ITQGVRSSTRAVRVAEERLR LT M+PTA VQEMTV 
Subjt:  AISATWLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSDLGQDITQGVRSSTRAVRVAEERLRRLTTMSPTASVQEMTVT

Query:  NLGVETAEPVLAKRARDIKEGIVKGRSVFQLFLSLTRFSRLALNYFSKR
        NLGVE AEPVLAKRARDIKEGIVKGRS+FQLFLSLTRFSRLALN+FSKR
Subjt:  NLGVETAEPVLAKRARDIKEGIVKGRSVFQLFLSLTRFSRLALNYFSKR

XP_038880851.1 uncharacterized protein LOC120072535 [Benincasa hispida]9.0e-11894.67Show/hide
Query:  MFRALELPPPCPAAKLNLVHALPSDVKICRLPYNLCLPNRRLSLLSIRAQSLSDPSTSSRYTETIGHSSPPFLQFSHCTLTQRHILVLNVVACATAISAT
        MFRALELPPPCPAAKLNLVHALPSDVK CRLPY+L LPNRRLSLL IRAQSLSDPSTSSRYTETIGHSSP FLQFS CTLTQ HI VLNVVACATAISAT
Subjt:  MFRALELPPPCPAAKLNLVHALPSDVKICRLPYNLCLPNRRLSLLSIRAQSLSDPSTSSRYTETIGHSSPPFLQFSHCTLTQRHILVLNVVACATAISAT

Query:  WLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSDLGQDITQGVRSSTRAVRVAEERLRRLTTMSPTASVQEMTVTNLGVE
        WLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMEL+DLGQDITQGVRSSTRAVRVAE+RLRRLT M+PTASVQEMTVTNLGVE
Subjt:  WLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSDLGQDITQGVRSSTRAVRVAEERLRRLTTMSPTASVQEMTVTNLGVE

Query:  TAEPVLAKRARDIKEGIVKGRSVFQLFLSLTRFSRLALNYFSKR
        TAEPVLAKRARDIKEGIVKGRS+FQLFLSLTRFSRLALNYFSKR
Subjt:  TAEPVLAKRARDIKEGIVKGRSVFQLFLSLTRFSRLALNYFSKR

TrEMBL top hitse value%identityAlignment
A0A1S3B011 uncharacterized protein LOC103484701 isoform X22.0e-11090.16Show/hide
Query:  MFRALELPPPCPAAKLNLVHALPSDVKICRLPYNLCLPNRRLSLLSIRAQSLSDPSTSSRYTETIGHSSPPFLQFSHCTLTQRHILVLNVVACATAISAT
        MF ALE+ PPCPAAKLN+V ALPS+ K  RLPYNL LPNRRLSLLSIRAQSLSDPSTSSRYT+TIG+SSP FLQF  CTLTQRHILVLNVVACATAISAT
Subjt:  MFRALELPPPCPAAKLNLVHALPSDVKICRLPYNLCLPNRRLSLLSIRAQSLSDPSTSSRYTETIGHSSPPFLQFSHCTLTQRHILVLNVVACATAISAT

Query:  WLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSDLGQDITQGVRSSTRAVRVAEERLRRLTTMSPTASVQEMTVTNLGVE
        WLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSDLGQDITQGVRSSTRAVRVAEERLRRLT MSPT  VQEMT+TNLGV+
Subjt:  WLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSDLGQDITQGVRSSTRAVRVAEERLRRLTTMSPTASVQEMTVTNLGVE

Query:  TAEPVLAKRARDIKEGIVKGRSVFQLFLSLTRFSRLALNYFSKR
         A+PVLAKRARDIKEGIVKGRS+FQLFLS+TRFSRLALNYFSKR
Subjt:  TAEPVLAKRARDIKEGIVKGRSVFQLFLSLTRFSRLALNYFSKR

A0A1S3B164 uncharacterized protein LOC103484701 isoform X19.4e-11390.98Show/hide
Query:  MFRALELPPPCPAAKLNLVHALPSDVKICRLPYNLCLPNRRLSLLSIRAQSLSDPSTSSRYTETIGHSSPPFLQFSHCTLTQRHILVLNVVACATAISAT
        MF ALE+ PPCPAAKLN+V ALPS+ K  RLPYNL LPNRRLSLLSIRAQSLSDPSTSSRYT+TIG+SSP FLQF  CTLTQRHILVLNVVACATAISAT
Subjt:  MFRALELPPPCPAAKLNLVHALPSDVKICRLPYNLCLPNRRLSLLSIRAQSLSDPSTSSRYTETIGHSSPPFLQFSHCTLTQRHILVLNVVACATAISAT

Query:  WLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSDLGQDITQGVRSSTRAVRVAEERLRRLTTMSPTASVQEMTVTNLGVE
        WLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSDLGQDITQGVRSSTRAVRVAEERLRRLT MSPTASVQEMT+TNLGV+
Subjt:  WLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSDLGQDITQGVRSSTRAVRVAEERLRRLTTMSPTASVQEMTVTNLGVE

Query:  TAEPVLAKRARDIKEGIVKGRSVFQLFLSLTRFSRLALNYFSKR
         A+PVLAKRARDIKEGIVKGRS+FQLFLS+TRFSRLALNYFSKR
Subjt:  TAEPVLAKRARDIKEGIVKGRSVFQLFLSLTRFSRLALNYFSKR

A0A5D3CRC7 Uncharacterized protein4.2e-11390.61Show/hide
Query:  MFRALELPPPCPAAKLNLVHALPSDVKICRLPYNLCLPNRRLSLLSIRAQSLSDPSTSSRYTETIGHSSPPFLQFSHCTLTQRHILVLNVVACATAISAT
        MF ALE+ PPCPAAKLN+V ALPS+ K  RLPYNL LPNRRLSLLSIRAQSLSDPSTSSRYT+TIG+SSP FLQF  CTLTQRHILVLNVVACATAISAT
Subjt:  MFRALELPPPCPAAKLNLVHALPSDVKICRLPYNLCLPNRRLSLLSIRAQSLSDPSTSSRYTETIGHSSPPFLQFSHCTLTQRHILVLNVVACATAISAT

Query:  WLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSDLGQDITQGVRSSTRAVRVAEERLRRLTTMSPTASVQEMTVTNLGVE
        WLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSDLGQDITQGVRSSTRAVRVAEERLRRLT MSPTASVQEMT+TNLGV+
Subjt:  WLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSDLGQDITQGVRSSTRAVRVAEERLRRLTTMSPTASVQEMTVTNLGVE

Query:  TAEPVLAKRARDIKEGIVKGRSVFQLFLSLTRFSRLALNYFSKRE
         A+PVLAKRARDIKEGIVKGRS+FQLFLS+TRFSRLALNYFSKR+
Subjt:  TAEPVLAKRARDIKEGIVKGRSVFQLFLSLTRFSRLALNYFSKRE

A0A6J1GDG4 uncharacterized protein LOC111452981 isoform X11.7e-10687.6Show/hide
Query:  MFRALELPPPCPAAKLNLVHALPSDVKICR-LPYNLCLPNRRLSLLSIRAQSL----SDPSTSSRYTETIGHSSPPFLQFSHCTLTQRHILVLNVVACAT
        MFRALEL PPCPAAK +LVHA PSDVK+ R  PYNL LPNRRLSLLS+RAQSL    SDPSTS RYTETIGHSSP ++QFS CTLTQRH+LVLNVVACAT
Subjt:  MFRALELPPPCPAAKLNLVHALPSDVKICR-LPYNLCLPNRRLSLLSIRAQSL----SDPSTSSRYTETIGHSSPPFLQFSHCTLTQRHILVLNVVACAT

Query:  AISATWLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSDLGQDITQGVRSSTRAVRVAEERLRRLTTMSPTASVQEMTVT
        AI+ATWLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSDLGQDITQGVRSSTRAVRVAEERLR LT M+PTA VQEMTV 
Subjt:  AISATWLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSDLGQDITQGVRSSTRAVRVAEERLRRLTTMSPTASVQEMTVT

Query:  NLG-VETAEPVLAKRARDIKEGIVKGRSVFQLFLSLTRFSRLALNYFSKR
        NLG VE AEPVLAKRARDIK GIVKGRS+FQLFLSLTRFSRLALN+FSKR
Subjt:  NLG-VETAEPVLAKRARDIKEGIVKGRSVFQLFLSLTRFSRLALNYFSKR

A0A6J1INQ2 uncharacterized protein LOC111478086 isoform X12.2e-10988.76Show/hide
Query:  MFRALELPPPCPAAKLNLVHALPSDVKICR-LPYNLCLPNRRLSLLSIRAQSL----SDPSTSSRYTETIGHSSPPFLQFSHCTLTQRHILVLNVVACAT
        MFRALEL PPCPAAK  LVHA PSDVK+CR  P+NL LPNRRLSLLS+RAQSL    SDPSTS RYTETIGHSSP F+QFS CTLTQRHILVLNVVACAT
Subjt:  MFRALELPPPCPAAKLNLVHALPSDVKICR-LPYNLCLPNRRLSLLSIRAQSL----SDPSTSSRYTETIGHSSPPFLQFSHCTLTQRHILVLNVVACAT

Query:  AISATWLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSDLGQDITQGVRSSTRAVRVAEERLRRLTTMSPTASVQEMTVT
        AI+ATWLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSDLGQ+ITQGVRSSTRAVRVAEERLR LT M+PTA VQEMTV 
Subjt:  AISATWLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSDLGQDITQGVRSSTRAVRVAEERLRRLTTMSPTASVQEMTVT

Query:  NLGVETAEPVLAKRARDIKEGIVKGRSVFQLFLSLTRFSRLALNYFSKR
        NLGVE AEPVLAKRARDIKEGIVKGRS+FQLFLSLTRFSRLALN+FSKR
Subjt:  NLGVETAEPVLAKRARDIKEGIVKGRSVFQLFLSLTRFSRLALNYFSKR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G08530.1 unknown protein8.9e-2342.76Show/hide
Query:  HSSPPFLQFSHCTLTQRHILVLNVVACATAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSDLGQDITQGVRSS
        HS  P  + S   L+ +  L+L  + C T+++ T L  +AIPTL+A  RAA S  KL D  R+ELP T+AA+RLSGMEISDLT+ELSDL QDIT G+  S
Subjt:  HSSPPFLQFSHCTLTQRHILVLNVVACATAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSDLGQDITQGVRSS

Query:  TRAVRVAEERLRRLTTMSPTASVQEMTV-TNLGVETAEPVLAKRARDIKEGI
         +AV+ AE  ++++ T++   ++  +    NL   + +PV+A  A      I
Subjt:  TRAVRVAEERLRRLTTMSPTASVQEMTV-TNLGVETAEPVLAKRARDIKEGI

AT5G09995.1 unknown protein5.8e-4674.24Show/hide
Query:  DPSTSSRYTETIGHSSPPFLQFSHCTLTQRHILVLNVVACATAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELS
        +PS SS+ T ++G    P LQ S  T TQ+H ++LNVVAC TAISA+WLF +AIPTLLAFK+AAESLEKL+DVTREELP TMAA+RLSGMEISDLTMELS
Subjt:  DPSTSSRYTETIGHSSPPFLQFSHCTLTQRHILVLNVVACATAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELS

Query:  DLGQDITQGVRSSTRAVRVAEERLRRLTTMSP
        DLGQ ITQGV+SSTRA+RVAE+RLRRLT M+P
Subjt:  DLGQDITQGVRSSTRAVRVAEERLRRLTTMSP

AT5G09995.2 unknown protein1.3e-6165.97Show/hide
Query:  DPSTSSRYTETIGHSSPPFLQFSHCTLTQRHILVLNVVACATAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELS
        +PS SS+ T ++G    P LQ S  T TQ+H ++LNVVAC TAISA+WLF +AIPTLLAFK+AAESLEKL+DVTREELP TMAA+RLSGMEISDLTMELS
Subjt:  DPSTSSRYTETIGHSSPPFLQFSHCTLTQRHILVLNVVACATAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELS

Query:  DLGQDITQGVRSSTRAVRVAEERLRRLTTMSPTASVQEMTVTNLGVETAEPVLAKRARDIKEGIVKGRSVFQLFLSLTRFSRLALNYFSKR
        DLGQ ITQGV+SSTRA+RVAE+RLRRLT M+P AS+QE+ +     +  EP+LAK+AR  +EG+VKGRS++QLF ++TRFS++  +Y +KR
Subjt:  DLGQDITQGVRSSTRAVRVAEERLRRLTTMSPTASVQEMTVTNLGVETAEPVLAKRARDIKEGIVKGRSVFQLFLSLTRFSRLALNYFSKR

AT5G09995.3 unknown protein2.4e-6065.97Show/hide
Query:  DPSTSSRYTETIGHSSPPFLQFSHCTLTQRHILVLNVVACATAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELS
        +PS SS+ T ++G    P LQ S  T TQ+H ++LNVVAC TAISA+WLF +AIPTLLAFK+AAESLEKL+DVTREELP TMAA+RLSGMEISDLTMELS
Subjt:  DPSTSSRYTETIGHSSPPFLQFSHCTLTQRHILVLNVVACATAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELS

Query:  DLGQDITQGVRSSTRAVRVAEERLRRLTTMSPTASVQEMTVTNLGVETAEPVLAKRARDIKEGIVKGRSVFQLFLSLTRFSRLALNYFSKR
        DLGQ ITQGV+SSTRA+RVAE+RLRRLT M+P AS+QE+ +     +  EP+LAK+AR  +EG+VKGRS++QLF ++TRFS++  +Y +KR
Subjt:  DLGQDITQGVRSSTRAVRVAEERLRRLTTMSPTASVQEMTVTNLGVETAEPVLAKRARDIKEGIVKGRSVFQLFLSLTRFSRLALNYFSKR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCAGAGCTTTGGAGCTACCGCCACCGTGCCCGGCGGCGAAGCTTAATCTCGTTCACGCACTGCCAAGTGACGTCAAAATCTGCCGACTACCTTACAATCTCTGCCT
ACCGAATCGTCGACTTTCTTTGCTTTCGATACGAGCACAATCGCTATCTGATCCATCGACTTCATCTCGTTATACGGAAACTATTGGACATTCTTCTCCACCATTTCTTC
AATTCTCTCACTGCACGCTAACTCAACGCCACATCCTTGTTCTTAATGTCGTTGCCTGCGCGACGGCTATTTCTGCAACCTGGCTCTTTTGTTCTGCGATCCCCACTCTT
CTGGCATTCAAGAGAGCAGCCGAGTCATTAGAGAAGCTCATGGATGTCACAAGGGAGGAGCTTCCAGGCACTATGGCAGCCATTCGGTTATCTGGCATGGAAATTAGTGA
TCTGACCATGGAGCTCAGTGATCTTGGCCAGGATATCACCCAAGGTGTGAGAAGTTCCACCAGAGCTGTTCGAGTAGCCGAAGAGAGATTGCGTCGCTTGACAACCATGT
CTCCAACAGCCTCAGTGCAGGAAATGACAGTAACCAATCTGGGAGTGGAGACAGCAGAGCCGGTTCTGGCTAAAAGGGCAAGAGACATTAAGGAGGGGATTGTGAAAGGC
CGTTCCGTCTTCCAATTATTTCTCTCCCTTACAAGGTTCTCTCGGCTGGCCTTGAATTATTTTAGCAAACGAGAATTATTCTCAAGGTTTTACCCTTATGATGCATCGGG
TGTCTTAAATTTGCGTGACCAGCTTCAAAGTGGAGGGAATGCGAATCGATCTTTGGGAGGATAA
mRNA sequenceShow/hide mRNA sequence
CTCTGGGCCTTGAGGACAACAAGCCGGCCCAGAATCTCCAAATCCTCTCGGCTATGCCCCAACTTCCGAGCTCGGAGAGTGCTTTACCGAGCAACAACGGCAGAAGCAAT
GTTCAGAGCTTTGGAGCTACCGCCACCGTGCCCGGCGGCGAAGCTTAATCTCGTTCACGCACTGCCAAGTGACGTCAAAATCTGCCGACTACCTTACAATCTCTGCCTAC
CGAATCGTCGACTTTCTTTGCTTTCGATACGAGCACAATCGCTATCTGATCCATCGACTTCATCTCGTTATACGGAAACTATTGGACATTCTTCTCCACCATTTCTTCAA
TTCTCTCACTGCACGCTAACTCAACGCCACATCCTTGTTCTTAATGTCGTTGCCTGCGCGACGGCTATTTCTGCAACCTGGCTCTTTTGTTCTGCGATCCCCACTCTTCT
GGCATTCAAGAGAGCAGCCGAGTCATTAGAGAAGCTCATGGATGTCACAAGGGAGGAGCTTCCAGGCACTATGGCAGCCATTCGGTTATCTGGCATGGAAATTAGTGATC
TGACCATGGAGCTCAGTGATCTTGGCCAGGATATCACCCAAGGTGTGAGAAGTTCCACCAGAGCTGTTCGAGTAGCCGAAGAGAGATTGCGTCGCTTGACAACCATGTCT
CCAACAGCCTCAGTGCAGGAAATGACAGTAACCAATCTGGGAGTGGAGACAGCAGAGCCGGTTCTGGCTAAAAGGGCAAGAGACATTAAGGAGGGGATTGTGAAAGGCCG
TTCCGTCTTCCAATTATTTCTCTCCCTTACAAGGTTCTCTCGGCTGGCCTTGAATTATTTTAGCAAACGAGAATTATTCTCAAGGTTTTACCCTTATGATGCATCGGGTG
TCTTAAATTTGCGTGACCAGCTTCAAAGTGGAGGGAATGCGAATCGATCTTTGGGAGGATAA
Protein sequenceShow/hide protein sequence
MFRALELPPPCPAAKLNLVHALPSDVKICRLPYNLCLPNRRLSLLSIRAQSLSDPSTSSRYTETIGHSSPPFLQFSHCTLTQRHILVLNVVACATAISATWLFCSAIPTL
LAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSDLGQDITQGVRSSTRAVRVAEERLRRLTTMSPTASVQEMTVTNLGVETAEPVLAKRARDIKEGIVKG
RSVFQLFLSLTRFSRLALNYFSKRELFSRFYPYDASGVLNLRDQLQSGGNANRSLGG