CuGenDBv2

Sed0011212 (gene) of Chayote v1 genome

Gene ID: Sed0011212
Organism: Sechium edule (Chayote v1)
Description: Nodulin 22
Genome location: LG12:5818565..5821704
RNA-Seq Expression: Sed0011212
Synteny: Sed0011212
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
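
The genome location field packs the sequence name and both coordinates into one string. A minimal sketch of splitting it apart is shown here; the function name `parse_location` is hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which the record itself does not state.

```python
import re


def parse_location(loc):
    """Split a 'SEQID:START..END' string into (seqid, start, end, length)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognised location: {loc!r}")
    seqid, start, end = m.group(1), int(m.group(2)), int(m.group(3))
    # Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
    return seqid, start, end, end - start + 1


print(parse_location("LG12:5818565..5821704"))
```

Under that assumption the gene spans 3140 bases on LG12.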


Homology
GenBank top hits (e value, % identity, alignment)
KAG6582302.1 hypothetical protein SDJN03_22304, partial [Cucurbita argyrosperma subsp. sororia]; e value 3.5e-83; identity 76.62%
Query:  MIPLFCTLNSNPIKISIPCEIRPSFFIRASLISLLILVLTLVFVSSSKPIPIVKNNA--TVAAMKVHPLPRKRNIAVRNNPNSRISLEDHQSVLTHKKLR
        MI  F +LN+N I+IS           RASLISLLILVL L  V+SS+PI  V+     T  AMKVHPLPRKRNIAVRNNPNSR SLED QS+L HKKLR
Subjt:  MIPLFCTLNSNPIKISIPCEIRPSFFIRASLISLLILVLTLVFVSSSKPIPIVKNNA--TVAAMKVHPLPRKRNIAVRNNPNSRISLEDHQSVLTHKKLR

Query:  RLPHIFSRVLELPFRSDAEVSVEENPDCFRFIAETDGSISDGVRAHAVEIHPGVIKIVVRENGSLEMAIDELELDMWRFRLPETTLPELASAAFVDGELV
        RLPH+FSRVLELPFRSDA+V VEENPDCFRFIAETDG+ISDGVRAHAVEIHPGVIKIVVRE  SLEM +DELELDMWRFRLPETT PELASAAFVDGEL+
Subjt:  RLPHIFSRVLELPFRSDAEVSVEENPDCFRFIAETDGSISDGVRAHAVEIHPGVIKIVVRENGSLEMAIDELELDMWRFRLPETTLPELASAAFVDGELV

Query:  VTVPKGIEEENSEDGGDIWGDGMEGRLVLVQ
        VTVPKG EE + + GGDIWGD MEGRLVLVQ
Subjt:  VTVPKGIEEENSEDGGDIWGDGMEGRLVLVQ

XP_004134135.1 uncharacterized protein LOC101205778 [Cucumis sativus]; e value 6.8e-87; identity 78.11%
Query:  LNSNPIKISIPCEIRPSFFIRASLISLLILVLTLVFV-------SSSKPIPIVKNNATV---AAMKVHPLPRKRNIAVRNNPNSRISLEDHQSVLTHKKL
        LNSNPIKISIP EIRPSFF RAS ISL I VLTLVF+       + SKPI  +KN   +    AMKVHPLPRKRNIAVRNN   R SLED   +  HKKL
Subjt:  LNSNPIKISIPCEIRPSFFIRASLISLLILVLTLVFV-------SSSKPIPIVKNNATV---AAMKVHPLPRKRNIAVRNNPNSRISLEDHQSVLTHKKL

Query:  RRLPHIFSRVLELPFRSDAEVSVEENPDCFRFIAETDGSISDGVRAHAVEIHPGVIKIVVRENGSLEMAIDELELDMWRFRLPETTLPELASAAFVDGEL
        RRLPHIFSRVLELPFRSDA+V VEENPDCFRFIAETDG+ISDGVRAHAVEIHPGVIKIVVREN SLEM+IDELELDMWRFRLPETT PELASAAFVDGEL
Subjt:  RRLPHIFSRVLELPFRSDAEVSVEENPDCFRFIAETDGSISDGVRAHAVEIHPGVIKIVVRENGSLEMAIDELELDMWRFRLPETTLPELASAAFVDGEL

Query:  VVTVPKGIEEENSED-GGDIWGDGMEGRLVLVQ
        +VTVPKG +E NS+D GGDI+ D MEGRLVLVQ
Subjt:  VVTVPKGIEEENSED-GGDIWGDGMEGRLVLVQ

XP_008438664.1 PREDICTED: uncharacterized protein LOC103483704 [Cucumis melo]; e value 1.4e-87; identity 77.18%
Query:  MIPLFCTLNSNPIKISIPCEIRPSFFIRASLISLLILVLTLVFV--------SSSKPIPIVKNNATV---AAMKVHPLPRKRNIAVRNNPNSRISLEDHQ
        MI  F  LNSNPIKISIP EIRPSFF RAS  SLLI VLTLVFV        + SK I  +KN   +     MKVHPLPRKRNIAVRNNP SR SLED  
Subjt:  MIPLFCTLNSNPIKISIPCEIRPSFFIRASLISLLILVLTLVFV--------SSSKPIPIVKNNATV---AAMKVHPLPRKRNIAVRNNPNSRISLEDHQ

Query:  SVLTHKKLRRLPHIFSRVLELPFRSDAEVSVEENPDCFRFIAETDGSISDGVRAHAVEIHPGVIKIVVRENGSLEMAIDELELDMWRFRLPETTLPELAS
         +  HKKLRRLPHIFSRVLELPFRSDA+V VEEN DCFRFIA TDG+ISDGVRAHAVEIHPGVIKIVVREN SLEMAIDELELDMWRFRLPETT PELAS
Subjt:  SVLTHKKLRRLPHIFSRVLELPFRSDAEVSVEENPDCFRFIAETDGSISDGVRAHAVEIHPGVIKIVVRENGSLEMAIDELELDMWRFRLPETTLPELAS

Query:  AAFVDGELVVTVPKGIEEENSED-GGDIWGDGMEGRLVLVQ
        AAFVDGEL+VTVPKG +EENS+D GGDI+ D MEGRLVLVQ
Subjt:  AAFVDGELVVTVPKGIEEENSED-GGDIWGDGMEGRLVLVQ

XP_022138274.1 uncharacterized protein LOC111009490 [Momordica charantia]; e value 5.2e-87; identity 76.45%
Query:  MIPLFCTLNSNPIKISIPCEIRPSFFIRASLISLLILVLTLVF-VSSSKPIPIVKNNATV---AAMKVHPLPRKRNIAVRNNPNSRISLEDHQSVLTHKK
        MI  F  LN+NPIKISIP +IRP+ F RASLI LLI+ L LV   + +KPI  VKN   +    AMKVHPLPRKRNI VR NPNSR SLED QS L HKK
Subjt:  MIPLFCTLNSNPIKISIPCEIRPSFFIRASLISLLILVLTLVF-VSSSKPIPIVKNNATV---AAMKVHPLPRKRNIAVRNNPNSRISLEDHQSVLTHKK

Query:  LRRLPHIFSRVLELPFRSDAEVSVEENPDCFRFIAETDGSISDGVRAHAVEIHPGVIKIVVRENGSLEMAIDELELDMWRFRLPETTLPELASAAFVDGE
        LRRLPHIFSRVL+LPFRSDA+V +EENPDCFRFIAETDGSISDGVRAHAVEIHPGVIKIVVRE+GS+EMA+DELELDMWRFRLPETT PELASAAFVDGE
Subjt:  LRRLPHIFSRVLELPFRSDAEVSVEENPDCFRFIAETDGSISDGVRAHAVEIHPGVIKIVVRENGSLEMAIDELELDMWRFRLPETTLPELASAAFVDGE

Query:  LVVTVPKGIEEENSE--DGGDIWG-------DGMEGRLVLVQ
        L+VTVPKG EEE+SE  DGGDIWG       DGM GRLVLVQ
Subjt:  LVVTVPKGIEEENSE--DGGDIWG-------DGMEGRLVLVQ

XP_022979495.1 uncharacterized protein LOC111479182 [Cucurbita maxima]; e value 2.7e-83; identity 76.62%
Query:  MIPLFCTLNSNPIKISIPCEIRPSFFIRASLISLLILVLTLVFVSSSKPIPIVKNNA--TVAAMKVHPLPRKRNIAVRNNPNSRISLEDHQSVLTHKKLR
        MIP F  LN+N I+IS           RASLISLLILVL L  V+ S+PI  V+     T  AMKVHPLPRKRNIAVRNNPNSR SLED QS+L HKKLR
Subjt:  MIPLFCTLNSNPIKISIPCEIRPSFFIRASLISLLILVLTLVFVSSSKPIPIVKNNA--TVAAMKVHPLPRKRNIAVRNNPNSRISLEDHQSVLTHKKLR

Query:  RLPHIFSRVLELPFRSDAEVSVEENPDCFRFIAETDGSISDGVRAHAVEIHPGVIKIVVRENGSLEMAIDELELDMWRFRLPETTLPELASAAFVDGELV
        RLPH+FSRVLELPFRSDA+V VEENPDCFRFIAETDG+ISDGVRAHAVEIHPGVIKIVVRE  SLEM +DELELDMWRFRLPETT PELASAAFVDGEL+
Subjt:  RLPHIFSRVLELPFRSDAEVSVEENPDCFRFIAETDGSISDGVRAHAVEIHPGVIKIVVRENGSLEMAIDELELDMWRFRLPETTLPELASAAFVDGELV

Query:  VTVPKGIEEENSEDGGDIWGDGMEGRLVLVQ
        VTVPKG EE + + GGDIWGD MEGRLVLVQ
Subjt:  VTVPKGIEEENSEDGGDIWGDGMEGRLVLVQ

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LA89 Uncharacterized protein; e value 3.3e-87; identity 78.11%
Query:  LNSNPIKISIPCEIRPSFFIRASLISLLILVLTLVFV-------SSSKPIPIVKNNATV---AAMKVHPLPRKRNIAVRNNPNSRISLEDHQSVLTHKKL
        LNSNPIKISIP EIRPSFF RAS ISL I VLTLVF+       + SKPI  +KN   +    AMKVHPLPRKRNIAVRNN   R SLED   +  HKKL
Subjt:  LNSNPIKISIPCEIRPSFFIRASLISLLILVLTLVFV-------SSSKPIPIVKNNATV---AAMKVHPLPRKRNIAVRNNPNSRISLEDHQSVLTHKKL

Query:  RRLPHIFSRVLELPFRSDAEVSVEENPDCFRFIAETDGSISDGVRAHAVEIHPGVIKIVVRENGSLEMAIDELELDMWRFRLPETTLPELASAAFVDGEL
        RRLPHIFSRVLELPFRSDA+V VEENPDCFRFIAETDG+ISDGVRAHAVEIHPGVIKIVVREN SLEM+IDELELDMWRFRLPETT PELASAAFVDGEL
Subjt:  RRLPHIFSRVLELPFRSDAEVSVEENPDCFRFIAETDGSISDGVRAHAVEIHPGVIKIVVRENGSLEMAIDELELDMWRFRLPETTLPELASAAFVDGEL

Query:  VVTVPKGIEEENSED-GGDIWGDGMEGRLVLVQ
        +VTVPKG +E NS+D GGDI+ D MEGRLVLVQ
Subjt:  VVTVPKGIEEENSED-GGDIWGDGMEGRLVLVQ

A0A1S3AXL9 uncharacterized protein LOC103483704; e value 6.6e-88; identity 77.18%
Query:  MIPLFCTLNSNPIKISIPCEIRPSFFIRASLISLLILVLTLVFV--------SSSKPIPIVKNNATV---AAMKVHPLPRKRNIAVRNNPNSRISLEDHQ
        MI  F  LNSNPIKISIP EIRPSFF RAS  SLLI VLTLVFV        + SK I  +KN   +     MKVHPLPRKRNIAVRNNP SR SLED  
Subjt:  MIPLFCTLNSNPIKISIPCEIRPSFFIRASLISLLILVLTLVFV--------SSSKPIPIVKNNATV---AAMKVHPLPRKRNIAVRNNPNSRISLEDHQ

Query:  SVLTHKKLRRLPHIFSRVLELPFRSDAEVSVEENPDCFRFIAETDGSISDGVRAHAVEIHPGVIKIVVRENGSLEMAIDELELDMWRFRLPETTLPELAS
         +  HKKLRRLPHIFSRVLELPFRSDA+V VEEN DCFRFIA TDG+ISDGVRAHAVEIHPGVIKIVVREN SLEMAIDELELDMWRFRLPETT PELAS
Subjt:  SVLTHKKLRRLPHIFSRVLELPFRSDAEVSVEENPDCFRFIAETDGSISDGVRAHAVEIHPGVIKIVVRENGSLEMAIDELELDMWRFRLPETTLPELAS

Query:  AAFVDGELVVTVPKGIEEENSED-GGDIWGDGMEGRLVLVQ
        AAFVDGEL+VTVPKG +EENS+D GGDI+ D MEGRLVLVQ
Subjt:  AAFVDGELVVTVPKGIEEENSED-GGDIWGDGMEGRLVLVQ

A0A5A7U4T4 Nodulin 22; e value 6.6e-88; identity 77.18%
Query:  MIPLFCTLNSNPIKISIPCEIRPSFFIRASLISLLILVLTLVFV--------SSSKPIPIVKNNATV---AAMKVHPLPRKRNIAVRNNPNSRISLEDHQ
        MI  F  LNSNPIKISIP EIRPSFF RAS  SLLI VLTLVFV        + SK I  +KN   +     MKVHPLPRKRNIAVRNNP SR SLED  
Subjt:  MIPLFCTLNSNPIKISIPCEIRPSFFIRASLISLLILVLTLVFV--------SSSKPIPIVKNNATV---AAMKVHPLPRKRNIAVRNNPNSRISLEDHQ

Query:  SVLTHKKLRRLPHIFSRVLELPFRSDAEVSVEENPDCFRFIAETDGSISDGVRAHAVEIHPGVIKIVVRENGSLEMAIDELELDMWRFRLPETTLPELAS
         +  HKKLRRLPHIFSRVLELPFRSDA+V VEEN DCFRFIA TDG+ISDGVRAHAVEIHPGVIKIVVREN SLEMAIDELELDMWRFRLPETT PELAS
Subjt:  SVLTHKKLRRLPHIFSRVLELPFRSDAEVSVEENPDCFRFIAETDGSISDGVRAHAVEIHPGVIKIVVRENGSLEMAIDELELDMWRFRLPETTLPELAS

Query:  AAFVDGELVVTVPKGIEEENSED-GGDIWGDGMEGRLVLVQ
        AAFVDGEL+VTVPKG +EENS+D GGDI+ D MEGRLVLVQ
Subjt:  AAFVDGELVVTVPKGIEEENSED-GGDIWGDGMEGRLVLVQ

A0A6J1C9P7 uncharacterized protein LOC111009490; e value 1.9e-87; identity 76.86%
Query:  MIPLFCTLNSNPIKISIPCEIRPSFFIRASLISLLILVLTLVF-VSSSKPIPIVKNNATV---AAMKVHPLPRKRNIAVRNNPNSRISLEDHQSVLTHKK
        MI  F  LN+NPIKISIP +IRP+ F RASLI LLI+ L LV   + +KPI  VKN   +    AMKVHPLPRKRNI VR NPNSR SLED QS L HKK
Subjt:  MIPLFCTLNSNPIKISIPCEIRPSFFIRASLISLLILVLTLVF-VSSSKPIPIVKNNATV---AAMKVHPLPRKRNIAVRNNPNSRISLEDHQSVLTHKK

Query:  LRRLPHIFSRVLELPFRSDAEVSVEENPDCFRFIAETDGSISDGVRAHAVEIHPGVIKIVVRENGSLEMAIDELELDMWRFRLPETTLPELASAAFVDGE
        LRRLPHIFSRVLELPFRSDA+V +EENPDCFRFIAETDGSISDGVRAHAVEIHPGVIKIVVRE+GS+EMA+DELELDMWRFRLPETT PELASAAFVDGE
Subjt:  LRRLPHIFSRVLELPFRSDAEVSVEENPDCFRFIAETDGSISDGVRAHAVEIHPGVIKIVVRENGSLEMAIDELELDMWRFRLPETTLPELASAAFVDGE

Query:  LVVTVPKGIEEENSE--DGGDIWG-------DGMEGRLVLVQ
        L+VTVPKG EEE+SE  DGGDIWG       DGM GRLVLVQ
Subjt:  LVVTVPKGIEEENSE--DGGDIWG-------DGMEGRLVLVQ

A0A6J1ITF0 uncharacterized protein LOC111479182; e value 1.3e-83; identity 76.62%
Query:  MIPLFCTLNSNPIKISIPCEIRPSFFIRASLISLLILVLTLVFVSSSKPIPIVKNNA--TVAAMKVHPLPRKRNIAVRNNPNSRISLEDHQSVLTHKKLR
        MIP F  LN+N I+IS           RASLISLLILVL L  V+ S+PI  V+     T  AMKVHPLPRKRNIAVRNNPNSR SLED QS+L HKKLR
Subjt:  MIPLFCTLNSNPIKISIPCEIRPSFFIRASLISLLILVLTLVFVSSSKPIPIVKNNA--TVAAMKVHPLPRKRNIAVRNNPNSRISLEDHQSVLTHKKLR

Query:  RLPHIFSRVLELPFRSDAEVSVEENPDCFRFIAETDGSISDGVRAHAVEIHPGVIKIVVRENGSLEMAIDELELDMWRFRLPETTLPELASAAFVDGELV
        RLPH+FSRVLELPFRSDA+V VEENPDCFRFIAETDG+ISDGVRAHAVEIHPGVIKIVVRE  SLEM +DELELDMWRFRLPETT PELASAAFVDGEL+
Subjt:  RLPHIFSRVLELPFRSDAEVSVEENPDCFRFIAETDGSISDGVRAHAVEIHPGVIKIVVRENGSLEMAIDELELDMWRFRLPETTLPELASAAFVDGELV

Query:  VTVPKGIEEENSEDGGDIWGDGMEGRLVLVQ
        VTVPKG EE + + GGDIWGD MEGRLVLVQ
Subjt:  VTVPKGIEEENSEDGGDIWGDGMEGRLVLVQ

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT3G22530.1 unknown protein; e value 5.3e-45; identity 54.85%
Query:  SLISLLILVLTLVFVSSSKPIPIVKNNATVAAMKVHPLPRKRNIAVRNNPNSRISLEDHQSVL-THKKLRRLPHIFSRVLELPFRSDAEVSVEENPDCFR
        SL S L+L L L+  S  KP P         AM+VHP+PR  N       N+ I    H       K LRRLPHIF+RVLELP RS+A+V+VEE  DCFR
Subjt:  SLISLLILVLTLVFVSSSKPIPIVKNNATVAAMKVHPLPRKRNIAVRNNPNSRISLEDHQSVL-THKKLRRLPHIFSRVLELPFRSDAEVSVEENPDCFR

Query:  FIAETDGSIS-DG-VRAHAVEIHPGVIKIVVRENG--SLEMAIDELELDMWRFRLPETTLPELASAAFVDGELVVTVPKGIEEENSEDGGDIWGDGM-EG
        F+AET G  + DG +RA+ VEIHPG+ KIVVR NG  SL +++DELELD+WRFRLPE+T PEL + A VDG+L+VTVPK  EEE+ + GG  +G G+  G
Subjt:  FIAETDGSIS-DG-VRAHAVEIHPGVIKIVVRENG--SLEMAIDELELDMWRFRLPETTLPELASAAFVDGELVVTVPKGIEEENSEDGGDIWGDGM-EG

Query:  RLVLVQ
        RLVLVQ
Subjt:  RLVLVQ

AT4G14830.1 unknown protein; e value 7.9e-41; identity 59.74%
Query:  MKVHPLPRKRNIAVRNNPNSRISLEDHQSVLTHKKLRRLPHIFSRVLELPFRSDAEVSVEENPDCFRFIAETDGSISDGVRAHAVEIHPGVIKIVVRENG
        MK+HPLPR        N N+ I      +    KKLRRLPHIFSRVLELP +SDA+V+VEE+ DCFRF+AETDG    GVRA+ VEIHPGV KI+VR NG
Subjt:  MKVHPLPRKRNIAVRNNPNSRISLEDHQSVLTHKKLRRLPHIFSRVLELPFRSDAEVSVEENPDCFRFIAETDGSISDGVRAHAVEIHPGVIKIVVRENG

Query:  --SLEMAIDELELDMWRFRLPETTLPELASA-AFVDGELVVTVPKGIEEENSED
          SL +++DELELD+WRFRLPE+T PEL +     DGEL+VTVPK   E+N  D
Subjt:  --SLEMAIDELELDMWRFRLPETTLPELASA-AFVDGELVVTVPKGIEEENSED
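
In the alignment blocks above, the unlabelled middle line marks identical residues with a letter, conservative substitutions with "+", and everything else with a space. The sketch below shows one way the percent identity of a single block could be recomputed from that line. The function name `percent_identity` is hypothetical, and the result for one block will generally differ from the listed values, which cover each alignment as a whole.

```python
def percent_identity(query, midline):
    """Percent identity of one alignment block from its BLAST-style midline."""
    if not query:
        raise ValueError("empty alignment block")
    # Trailing spaces in the midline are often lost in plain text, so pad it
    # back to the length of the query line before counting.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("midline is longer than the query line")
    # Letters mark identical positions; '+' and ' ' are not identities.
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)


# Last block of the first GenBank hit, copied from the record.
query = "VTVPKGIEEENSEDGGDIWGDGMEGRLVLVQ"
midline = "VTVPKG EE + + GGDIWGD MEGRLVLVQ"
print(round(percent_identity(query, midline), 2))
```

Gap columns count toward the block length here, which is how the identity fraction is normally reported over an alignment.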


Sequences
CDS sequence
ATGATTCCCCTTTTTTGTACTCTGAACAGTAACCCCATCAAGATTTCGATTCCTTGCGAAATTCGTCCCTCGTTTTTCATCAGGGCATCTTTGATTTCCCTTTTGATTTT
GGTTCTGACCCTCGTTTTTGTTTCTTCTTCAAAACCTATTCCGATTGTGAAGAACAACGCCACCGTCGCCGCCATGAAGGTCCACCCATTGCCGAGGAAGCGAAATATCG
CCGTTCGCAATAACCCCAATTCGAGAATCTCTCTCGAAGATCATCAATCCGTTCTGACCCACAAGAAACTCAGGAGATTACCCCATATCTTCAGCCGGGTTCTCGAGCTT
CCGTTTCGATCCGATGCCGAGGTTTCGGTGGAAGAGAATCCTGATTGCTTTCGATTCATTGCTGAAACTGATGGTAGCATTAGCGATGGAGTTAGGGCTCATGCTGTGGA
AATCCATCCTGGGGTTATTAAGATCGTTGTGCGTGAGAATGGGTCGTTGGAAATGGCGATCGATGAGCTCGAATTGGATATGTGGAGGTTTCGGTTGCCCGAGACGACGC
TGCCCGAGCTTGCAAGTGCTGCGTTTGTTGATGGGGAGCTTGTTGTTACAGTTCCAAAGGGGATTGAGGAGGAGAATTCTGAAGATGGTGGAGATATCTGGGGAGATGGA
ATGGAAGGTCGGCTTGTTCTTGTACAGTAA
mRNA sequence
CATAAATGCCATAGTTGATGGGCATATTTAATAATTTAGCACAAAAAAAAAATCTCTTCCTATTTATTTGCTGATGTATTATATCTCTGCACTTAGCTTTCACGGCCCTC
TCTTCCATAATTGCATTCTCTTCCCTTCGTTCCGATTCAATTCCCCCATTCTCTCAAATCCTTTCATCACATAATTTCATTTTTCAGTTCCCATTTCTGAAAAATTAGGG
TTCTAATCCATTCTCTTTTTCGTTTGATTCCTTTATGATTCCCCTTTTTTGTACTCTGAACAGTAACCCCATCAAGATTTCGATTCCTTGCGAAATTCGTCCCTCGTTTT
TCATCAGGGCATCTTTGATTTCCCTTTTGATTTTGGTTCTGACCCTCGTTTTTGTTTCTTCTTCAAAACCTATTCCGATTGTGAAGAACAACGCCACCGTCGCCGCCATG
AAGGTCCACCCATTGCCGAGGAAGCGAAATATCGCCGTTCGCAATAACCCCAATTCGAGAATCTCTCTCGAAGATCATCAATCCGTTCTGACCCACAAGAAACTCAGGAG
ATTACCCCATATCTTCAGCCGGGTTCTCGAGCTTCCGTTTCGATCCGATGCCGAGGTTTCGGTGGAAGAGAATCCTGATTGCTTTCGATTCATTGCTGAAACTGATGGTA
GCATTAGCGATGGAGTTAGGGCTCATGCTGTGGAAATCCATCCTGGGGTTATTAAGATCGTTGTGCGTGAGAATGGGTCGTTGGAAATGGCGATCGATGAGCTCGAATTG
GATATGTGGAGGTTTCGGTTGCCCGAGACGACGCTGCCCGAGCTTGCAAGTGCTGCGTTTGTTGATGGGGAGCTTGTTGTTACAGTTCCAAAGGGGATTGAGGAGGAGAA
TTCTGAAGATGGTGGAGATATCTGGGGAGATGGAATGGAAGGTCGGCTTGTTCTTGTACAGTAAATTGAATCCTTTCCCTTTTTTCCTTTTGGTTTTTGTACTAACTGTT
GGAAATTTCCCTTTTCATCATTGTTAATGTTGTTCTGGATTTCCTGCTAGTTTTCCATTGCTGAATAAACTGACCCATTAGCTTAAAAGGAACTTAGAAAGAAAAGGCTA
ATTGTTTCTATCTGAATGATCACAAGAACCTGAAACCTAAGAAGCACGGACACGGACACGAGATACGGATAAGACACGACACGGACACGTAGATACGCCATTATTTAAAA
ATGCAGGACACGGATACGTCGAGGACACGCGATTTAATATTTTAGTTAGTTATAAGTCTACTGTCTACCCATAACAATGTCTAAATATTTTAGTTTTTATCTCATTTTCC
TTTCCTATATTTTATTTTCTCTTTTTCTTTCATTTTTTCATTTTAGTCCATCTATCTTTGATAGACATCATCATCGACGACCTCCACCAACACTATCTGTTGTCGTGTCA
TCCGCCATTGGAAACAAATAAGTCTTAATTGAGGTGTTCATGAAGTTTCCATGAAGTGTCCGAAATTGAAAATAATAATAATAAAAGAGGACACCGAATTTTGAGTGTTG
GACACGTGTCTGACGAGTGTCCAGAAGTATCGATGTTGGACACGAGTACGACACGGATACGTTGTCAAAATAGAAGTGTCCGTGCTTCCTAGCCTGAAACTTAACACTAA
ACCATGAAACTTTTTTTTTTTTTCTCTTGATTTCAAAACTACATGTGGGGGTGGAATAATTCGAAACTTGATTGGGAGATTTACTTTAACCAGTTCAAGTTGACACGCAA
CTCTGAAACATTGATAATTTTGCCTTAGCATTTGCTTATGGTAACTTGCGTGTGCAATTTGAACTCCTTTTCTTTTCTCCCCACAACATTCTAACCTGACCCTGCAAGCT
TTGAAACAACTTGTGTTGCAAAATATGGGTCTTCTTTGAGGATAATTGTTGCATCTGAATTGTTATCTGTTGGGTTGAGTGAAACTGAGTATAACAGCTAACAATTGACA
ACTCAAATGGATAGTTTTAGACAGATAATGGTTGAAATTACCAGCATGATTTTGGTCTGTTCTCTTAGATTAGATTCCCTTTTTGTAGTTACTCTTGAACCCTCGTTTGT
TCAGACTCGTGGAGGAATCTCGTTAGTTGAAGTTTGGAACTTCAAGACATGTTTTTTTAAGGTCTTAGGTTTGAGTCTAACAAGTGAGTTTAATTGTAGATTTTTTTTTG
TTGCCTCTCGAGTATGGGCCTGGGGACGGGCGCGGATGTCCTGAGCAGAGTAGAATGGAGCTCTACTTTTTCGATTTCACAAAAAAAAAAGTTTTTATGCTTCTTCCTTT
ATTGCCACTGAACTCTGATGCTAATTGGCAGTTTACCTTCATTTGTTAATTGGCAAGAAGAAGTTCTGAGACTTGATTGCCAATGCAATTTGTTAGTGGTCTCAGCCCTT
TTCTAGATATCATTTTGTCCTCATCTTTTGATTGTACCTCAAAAGACTTGTCCTTCTCCGTGGTTTGGAAAGTATTGCTTCTCTCTTGTCCATTTCATGGGTCCTATATG
TGGGCATAAACTCATTGCTTTTTTTTTTGCATTTAAACCATAATGGGGCACTTTTCATGCTAGCCAAAGATAACATAACAAACCTTCTTTTAAGCATTTCTTTCACATAT
GCCATTTTCTAGTTGGCAAATTCTTATTCCTAAATCCCTAAACTGCTTTCTGCTTGCCCATTCTTGTTCTGTAGATCTTCTTTTTCCTTTTTATGGTTGACCTGTTGATT
TGAGTCATAGCTGCATTCATGCTCTTTGATTTCATATCTAGAAATTAATTTTCGAATATTGTAGCCACATGTTTTGTGACAAGTGTTTTTAGATTCTATATGGAGCCATT
AAGTGGAATAAAATGAGAGCAACCTGTACATACTATGATGAACAAACAGAAAAGACTTATTATTGAAGTTCCATGTAGCTGTCCATGTGAATGGCATTAGTTTCAATTGT
CATCATTTTATGGTGGCCATTTAGCATTGCCTTTTATGTGTAATAATCCTTTTGGACTTGAAATGATTGCAGAGGAATGGCCTACAAACCCCAACCATAATAATGGGTAA
TTATTTTGATGTAAAAAAAGTGATTGTTTGGGGGCATTACAAAATGTTATTGCGTTGGGG
Protein sequence
MIPLFCTLNSNPIKISIPCEIRPSFFIRASLISLLILVLTLVFVSSSKPIPIVKNNATVAAMKVHPLPRKRNIAVRNNPNSRISLEDHQSVLTHKKLRRLPHIFSRVLEL
PFRSDAEVSVEENPDCFRFIAETDGSISDGVRAHAVEIHPGVIKIVVRENGSLEMAIDELELDMWRFRLPETTLPELASAAFVDGELVVTVPKGIEEENSEDGGDIWGDG
MEGRLVLVQ
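
The CDS and protein sequences listed above can be cross-checked by translating one into the other. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1) and a CDS that starts in frame. The names `CODON_TABLE` and `translate` are illustrative and not part of the database.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (third position varying fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 36 bases of the CDS sequence in this record.
print(translate("ATGATTCCCCTTTTTTGTACTCTGAACAGTAACCCC"))  # MIPLFCTLNSNP
```

Passing the full CDS, with its line breaks left in, should reproduce the protein sequence if the two entries are consistent.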