CuGenDBv2

Tan0001121 (gene) of Snake gourd v1 genome

Gene ID: Tan0001121
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Nodulin 22
Genome location: LG10:6938084..6940134
RNA-Seq Expression: Tan0001121
Synteny: Tan0001121
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
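
The genome location field above uses a `sequence:start..end` form. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not say so), the field can be split and the genomic span computed as follows. The helper names `parse_location` and `span_length` are illustrative only and are not part of CuGenDB.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("LG10:6938084..6940134")
print(seqid, start, end, span_length(start, end))  # LG10 6938084 6940134 2051
```

If the coordinates were instead 0-based and half-open, the span would be `end - start`, one base shorter.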


Homology
GenBank top hits (e value, % identity, alignment)
KAG6582302.1 hypothetical protein SDJN03_22304, partial [Cucurbita argyrosperma subsp. sororia] (e value: 2.4e-95; % identity: 83.75)
Query:  MILPFRPLNSNPIKISIPFEIRPVFFTRASLISLLILVLALVVSPAPSNPIASVKKELEINTTAAMKVHPLPRKRNITVRNNPNSRNSLEDQSLLNHKKL
        MILPF  LN+N I+IS           RASLISLLILVLALV S   S PIASV +ELEINTT AMKVHPLPRKRNI VRNNPNSRNSLEDQSLLNHKKL
Subjt:  MILPFRPLNSNPIKISIPFEIRPVFFTRASLISLLILVLALVVSPAPSNPIASVKKELEINTTAAMKVHPLPRKRNITVRNNPNSRNSLEDQSLLNHKKL

Query:  RRLPHIFSRVLELPFRSDADVLVEENPDCFRFIAETDGNISDGVRAHAVEIHPGVIKIVVRENESLEMTIDELELDMWRFRLPETTRPELASAAFVDGEL
        RRLPH+FSRVLELPFRSDADVLVEENPDCFRFIAETDGNISDGVRAHAVEIHPGVIKIVVRE  SLEMT+DELELDMWRFRLPETTRPELASAAFVDGEL
Subjt:  RRLPHIFSRVLELPFRSDADVLVEENPDCFRFIAETDGNISDGVRAHAVEIHPGVIKIVVRENESLEMTIDELELDMWRFRLPETTRPELASAAFVDGEL

Query:  IVTVPKGNEEENSEDGGGDIWGDGSGSFRDGMEGRLVLVQ
        IVTVPKGN EENS+DGGGDIWG       D MEGRLVLVQ
Subjt:  IVTVPKGNEEENSEDGGGDIWGDGSGSFRDGMEGRLVLVQ

XP_004134135.1 uncharacterized protein LOC101205778 [Cucumis sativus] (e value: 9.3e-103; % identity: 85.02)
Query:  MILPFRPLNSNPIKISIPFEIRPVFFTRASLISLLILVLALV------VSPAPSNPIASVKKELEINTTAAMKVHPLPRKRNITVRNNPNSRNSLEDQSL
        MI   RPLNSNPIKISIPFEIRP FFTRAS ISL I VL LV      VSPAPS PIAS+K E EIN+T AMKVHPLPRKRNI VRNN   RNSLEDQSL
Subjt:  MILPFRPLNSNPIKISIPFEIRPVFFTRASLISLLILVLALV------VSPAPSNPIASVKKELEINTTAAMKVHPLPRKRNITVRNNPNSRNSLEDQSL

Query:  L-NHKKLRRLPHIFSRVLELPFRSDADVLVEENPDCFRFIAETDGNISDGVRAHAVEIHPGVIKIVVRENESLEMTIDELELDMWRFRLPETTRPELASA
        L NHKKLRRLPHIFSRVLELPFRSDADVLVEENPDCFRFIAETDGNISDGVRAHAVEIHPGVIKIVVRENESLEM+IDELELDMWRFRLPETTRPELASA
Subjt:  L-NHKKLRRLPHIFSRVLELPFRSDADVLVEENPDCFRFIAETDGNISDGVRAHAVEIHPGVIKIVVRENESLEMTIDELELDMWRFRLPETTRPELASA

Query:  AFVDGELIVTVPKGNEEENSEDGGGDIWGDGSGSFRDGMEGRLVLVQ
        AFVDGELIVTVPKGN+E NS+DGGGDI       FRD MEGRLVLVQ
Subjt:  AFVDGELIVTVPKGNEEENSEDGGGDIWGDGSGSFRDGMEGRLVLVQ

XP_008438664.1 PREDICTED: uncharacterized protein LOC103483704 [Cucumis melo] (e value: 7.9e-102; % identity: 83.87)
Query:  MILPFRPLNSNPIKISIPFEIRPVFFTRASLISLLILVLALV-------VSPAPSNPIASVKKELEINTTAAMKVHPLPRKRNITVRNNPNSRNSLEDQS
        MI  FRPLNSNPIKISIPFEIRP FFTRAS  SLLI VL LV       VSPAPS  IA++K E EIN+T  MKVHPLPRKRNI VRNNP SRNSLEDQS
Subjt:  MILPFRPLNSNPIKISIPFEIRPVFFTRASLISLLILVLALV-------VSPAPSNPIASVKKELEINTTAAMKVHPLPRKRNITVRNNPNSRNSLEDQS

Query:  -LLNHKKLRRLPHIFSRVLELPFRSDADVLVEENPDCFRFIAETDGNISDGVRAHAVEIHPGVIKIVVRENESLEMTIDELELDMWRFRLPETTRPELAS
         L NHKKLRRLPHIFSRVLELPFRSDADVLVEEN DCFRFIA TDGNISDGVRAHAVEIHPGVIKIVVRENESLEM IDELELDMWRFRLPETTRPELAS
Subjt:  -LLNHKKLRRLPHIFSRVLELPFRSDADVLVEENPDCFRFIAETDGNISDGVRAHAVEIHPGVIKIVVRENESLEMTIDELELDMWRFRLPETTRPELAS

Query:  AAFVDGELIVTVPKGNEEENSEDGGGDIWGDGSGSFRDGMEGRLVLVQ
        AAFVDGELIVTVPKGN+EENS+DGGGDI       FRD MEGRLVLVQ
Subjt:  AAFVDGELIVTVPKGNEEENSEDGGGDIWGDGSGSFRDGMEGRLVLVQ

XP_022138274.1 uncharacterized protein LOC111009490 [Momordica charantia] (e value: 2.5e-108; % identity: 86.31)
Query:  MILPFRPLNSNPIKISIPFEIRPVFFTRASLISLLILVLALVVSPAPSNPIASVKKELEINTTAAMKVHPLPRKRNITVRNNPNSRNSLEDQSLLNHKKL
        MILPF PLN+NPIKISIPF+IRP  FTRASLI LLI+ L LVVSPAP+ PI SVK + EI  + AMKVHPLPRKRNITVR NPNSRNSLEDQS LNHKKL
Subjt:  MILPFRPLNSNPIKISIPFEIRPVFFTRASLISLLILVLALVVSPAPSNPIASVKKELEINTTAAMKVHPLPRKRNITVRNNPNSRNSLEDQSLLNHKKL

Query:  RRLPHIFSRVLELPFRSDADVLVEENPDCFRFIAETDGNISDGVRAHAVEIHPGVIKIVVRENESLEMTIDELELDMWRFRLPETTRPELASAAFVDGEL
        RRLPHIFSRVL+LPFRSDADVL+EENPDCFRFIAETDG+ISDGVRAHAVEIHPGVIKIVVRE+ S+EM +DELELDMWRFRLPETTRPELASAAFVDGEL
Subjt:  RRLPHIFSRVLELPFRSDADVLVEENPDCFRFIAETDGNISDGVRAHAVEIHPGVIKIVVRENESLEMTIDELELDMWRFRLPETTRPELASAAFVDGEL

Query:  IVTVPKGNEEENSE-DGGGDIWGDGSGSFRDGMEGRLVLVQ
        IVTVPKGNEEE+SE D GGDIWGDGSGSFRDGM GRLVLVQ
Subjt:  IVTVPKGNEEENSE-DGGGDIWGDGSGSFRDGMEGRLVLVQ

XP_023527185.1 uncharacterized protein LOC111790497 [Cucurbita pepo subsp. pepo] (e value: 7.1e-95; % identity: 83.33)
Query:  MILPFRPLNSNPIKISIPFEIRPVFFTRASLISLLILVLALVVSPAPSNPIASVKKELEINTTAAMKVHPLPRKRNITVRNNPNSRNSLEDQSLLNHKKL
        MILPF  LN++ IKIS           RASLISLLILVLALV S   S PIASV +ELEINTT AMKVHPLPRKRNI VRNNPNSRNSLEDQSLLNHKKL
Subjt:  MILPFRPLNSNPIKISIPFEIRPVFFTRASLISLLILVLALVVSPAPSNPIASVKKELEINTTAAMKVHPLPRKRNITVRNNPNSRNSLEDQSLLNHKKL

Query:  RRLPHIFSRVLELPFRSDADVLVEENPDCFRFIAETDGNISDGVRAHAVEIHPGVIKIVVRENESLEMTIDELELDMWRFRLPETTRPELASAAFVDGEL
        RRLPH+FSRVLELPFRSDADVLVEENPDCFRFIAETDGNISDGVRAHAVEIHPGVIKIVVRE  SLEMT+DELELDMWRFRLPETTRPELASA FVDGEL
Subjt:  RRLPHIFSRVLELPFRSDADVLVEENPDCFRFIAETDGNISDGVRAHAVEIHPGVIKIVVRENESLEMTIDELELDMWRFRLPETTRPELASAAFVDGEL

Query:  IVTVPKGNEEENSEDGGGDIWGDGSGSFRDGMEGRLVLVQ
        IVTVPKGN EENS+DGGGDIWG       D MEGRLVLVQ
Subjt:  IVTVPKGNEEENSEDGGGDIWGDGSGSFRDGMEGRLVLVQ

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LA89 Uncharacterized protein (e value: 4.5e-103; % identity: 85.02)
Query:  MILPFRPLNSNPIKISIPFEIRPVFFTRASLISLLILVLALV------VSPAPSNPIASVKKELEINTTAAMKVHPLPRKRNITVRNNPNSRNSLEDQSL
        MI   RPLNSNPIKISIPFEIRP FFTRAS ISL I VL LV      VSPAPS PIAS+K E EIN+T AMKVHPLPRKRNI VRNN   RNSLEDQSL
Subjt:  MILPFRPLNSNPIKISIPFEIRPVFFTRASLISLLILVLALV------VSPAPSNPIASVKKELEINTTAAMKVHPLPRKRNITVRNNPNSRNSLEDQSL

Query:  L-NHKKLRRLPHIFSRVLELPFRSDADVLVEENPDCFRFIAETDGNISDGVRAHAVEIHPGVIKIVVRENESLEMTIDELELDMWRFRLPETTRPELASA
        L NHKKLRRLPHIFSRVLELPFRSDADVLVEENPDCFRFIAETDGNISDGVRAHAVEIHPGVIKIVVRENESLEM+IDELELDMWRFRLPETTRPELASA
Subjt:  L-NHKKLRRLPHIFSRVLELPFRSDADVLVEENPDCFRFIAETDGNISDGVRAHAVEIHPGVIKIVVRENESLEMTIDELELDMWRFRLPETTRPELASA

Query:  AFVDGELIVTVPKGNEEENSEDGGGDIWGDGSGSFRDGMEGRLVLVQ
        AFVDGELIVTVPKGN+E NS+DGGGDI       FRD MEGRLVLVQ
Subjt:  AFVDGELIVTVPKGNEEENSEDGGGDIWGDGSGSFRDGMEGRLVLVQ

A0A1S3AXL9 uncharacterized protein LOC103483704 (e value: 3.8e-102; % identity: 83.87)
Query:  MILPFRPLNSNPIKISIPFEIRPVFFTRASLISLLILVLALV-------VSPAPSNPIASVKKELEINTTAAMKVHPLPRKRNITVRNNPNSRNSLEDQS
        MI  FRPLNSNPIKISIPFEIRP FFTRAS  SLLI VL LV       VSPAPS  IA++K E EIN+T  MKVHPLPRKRNI VRNNP SRNSLEDQS
Subjt:  MILPFRPLNSNPIKISIPFEIRPVFFTRASLISLLILVLALV-------VSPAPSNPIASVKKELEINTTAAMKVHPLPRKRNITVRNNPNSRNSLEDQS

Query:  -LLNHKKLRRLPHIFSRVLELPFRSDADVLVEENPDCFRFIAETDGNISDGVRAHAVEIHPGVIKIVVRENESLEMTIDELELDMWRFRLPETTRPELAS
         L NHKKLRRLPHIFSRVLELPFRSDADVLVEEN DCFRFIA TDGNISDGVRAHAVEIHPGVIKIVVRENESLEM IDELELDMWRFRLPETTRPELAS
Subjt:  -LLNHKKLRRLPHIFSRVLELPFRSDADVLVEENPDCFRFIAETDGNISDGVRAHAVEIHPGVIKIVVRENESLEMTIDELELDMWRFRLPETTRPELAS

Query:  AAFVDGELIVTVPKGNEEENSEDGGGDIWGDGSGSFRDGMEGRLVLVQ
        AAFVDGELIVTVPKGN+EENS+DGGGDI       FRD MEGRLVLVQ
Subjt:  AAFVDGELIVTVPKGNEEENSEDGGGDIWGDGSGSFRDGMEGRLVLVQ

A0A5A7U4T4 Nodulin 22 (e value: 3.8e-102; % identity: 83.87)
Query:  MILPFRPLNSNPIKISIPFEIRPVFFTRASLISLLILVLALV-------VSPAPSNPIASVKKELEINTTAAMKVHPLPRKRNITVRNNPNSRNSLEDQS
        MI  FRPLNSNPIKISIPFEIRP FFTRAS  SLLI VL LV       VSPAPS  IA++K E EIN+T  MKVHPLPRKRNI VRNNP SRNSLEDQS
Subjt:  MILPFRPLNSNPIKISIPFEIRPVFFTRASLISLLILVLALV-------VSPAPSNPIASVKKELEINTTAAMKVHPLPRKRNITVRNNPNSRNSLEDQS

Query:  -LLNHKKLRRLPHIFSRVLELPFRSDADVLVEENPDCFRFIAETDGNISDGVRAHAVEIHPGVIKIVVRENESLEMTIDELELDMWRFRLPETTRPELAS
         L NHKKLRRLPHIFSRVLELPFRSDADVLVEEN DCFRFIA TDGNISDGVRAHAVEIHPGVIKIVVRENESLEM IDELELDMWRFRLPETTRPELAS
Subjt:  -LLNHKKLRRLPHIFSRVLELPFRSDADVLVEENPDCFRFIAETDGNISDGVRAHAVEIHPGVIKIVVRENESLEMTIDELELDMWRFRLPETTRPELAS

Query:  AAFVDGELIVTVPKGNEEENSEDGGGDIWGDGSGSFRDGMEGRLVLVQ
        AAFVDGELIVTVPKGN+EENS+DGGGDI       FRD MEGRLVLVQ
Subjt:  AAFVDGELIVTVPKGNEEENSEDGGGDIWGDGSGSFRDGMEGRLVLVQ

A0A6J1C9P7 uncharacterized protein LOC111009490 (e value: 9.3e-109; % identity: 86.72)
Query:  MILPFRPLNSNPIKISIPFEIRPVFFTRASLISLLILVLALVVSPAPSNPIASVKKELEINTTAAMKVHPLPRKRNITVRNNPNSRNSLEDQSLLNHKKL
        MILPF PLN+NPIKISIPF+IRP  FTRASLI LLI+ L LVVSPAP+ PI SVK + EI  + AMKVHPLPRKRNITVR NPNSRNSLEDQS LNHKKL
Subjt:  MILPFRPLNSNPIKISIPFEIRPVFFTRASLISLLILVLALVVSPAPSNPIASVKKELEINTTAAMKVHPLPRKRNITVRNNPNSRNSLEDQSLLNHKKL

Query:  RRLPHIFSRVLELPFRSDADVLVEENPDCFRFIAETDGNISDGVRAHAVEIHPGVIKIVVRENESLEMTIDELELDMWRFRLPETTRPELASAAFVDGEL
        RRLPHIFSRVLELPFRSDADVL+EENPDCFRFIAETDG+ISDGVRAHAVEIHPGVIKIVVRE+ S+EM +DELELDMWRFRLPETTRPELASAAFVDGEL
Subjt:  RRLPHIFSRVLELPFRSDADVLVEENPDCFRFIAETDGNISDGVRAHAVEIHPGVIKIVVRENESLEMTIDELELDMWRFRLPETTRPELASAAFVDGEL

Query:  IVTVPKGNEEENSE-DGGGDIWGDGSGSFRDGMEGRLVLVQ
        IVTVPKGNEEE+SE D GGDIWGDGSGSFRDGM GRLVLVQ
Subjt:  IVTVPKGNEEENSE-DGGGDIWGDGSGSFRDGMEGRLVLVQ

A0A6J1GVS7 uncharacterized protein LOC111457880 (e value: 4.5e-95; % identity: 83.33)
Query:  MILPFRPLNSNPIKISIPFEIRPVFFTRASLISLLILVLALVVSPAPSNPIASVKKELEINTTAAMKVHPLPRKRNITVRNNPNSRNSLEDQSLLNHKKL
        MILPF  LN+N I+IS           RASLISLLILVLALV S   S PIASV +ELEINTT AMKVHPLPRKRNI VRNNPNSRNSLEDQSLLNHKKL
Subjt:  MILPFRPLNSNPIKISIPFEIRPVFFTRASLISLLILVLALVVSPAPSNPIASVKKELEINTTAAMKVHPLPRKRNITVRNNPNSRNSLEDQSLLNHKKL

Query:  RRLPHIFSRVLELPFRSDADVLVEENPDCFRFIAETDGNISDGVRAHAVEIHPGVIKIVVRENESLEMTIDELELDMWRFRLPETTRPELASAAFVDGEL
        RRLPH+FSRVLELPFRSDADVLVEENPDCFRFIAETDG+ISDGVRAHAVEIHPGVIKIVVRE  SLEMT+DELELDMWRFRLPETTRPELASAAFVDGEL
Subjt:  RRLPHIFSRVLELPFRSDADVLVEENPDCFRFIAETDGNISDGVRAHAVEIHPGVIKIVVRENESLEMTIDELELDMWRFRLPETTRPELASAAFVDGEL

Query:  IVTVPKGNEEENSEDGGGDIWGDGSGSFRDGMEGRLVLVQ
        IVTVPKGN EENS+DGGGDIWG       D MEGRLVLVQ
Subjt:  IVTVPKGNEEENSEDGGGDIWGDGSGSFRDGMEGRLVLVQ

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT3G22530.1 unknown protein (e value: 5.5e-45; % identity: 50.87)
Query:  ISIPFEIRPVFFTRASLISLLILVLALVVSPAPSNPIASVKKELEINTTAAMKVHPLPRKRNITVRNNPNSRNSLEDQSLLNHKKLRRLPHIFSRVLELP
        I IP +I+ +F          +L+L L+ S    NP              AM+VHP+PR  N T+ ++ +   + E       K LRRLPHIF+RVLELP
Subjt:  ISIPFEIRPVFFTRASLISLLILVLALVVSPAPSNPIASVKKELEINTTAAMKVHPLPRKRNITVRNNPNSRNSLEDQSLLNHKKLRRLPHIFSRVLELP

Query:  FRSDADVLVEENPDCFRFIAETDG--NISDGVRAHAVEIHPGVIKIVVREN--ESLEMTIDELELDMWRFRLPETTRPELASAAFVDGELIVTVPKGNEE
         RS+ADV VEE  DCFRF+AET G  N    +RA+ VEIHPG+ KIVVR N   SL +++DELELD+WRFRLPE+TRPEL + A VDG+LIVTVPK  EE
Subjt:  FRSDADVLVEENPDCFRFIAETDG--NISDGVRAHAVEIHPGVIKIVVREN--ESLEMTIDELELDMWRFRLPETTRPELASAAFVDGELIVTVPKGNEE

Query:  ENSEDGGGDIWGDGSGSFRDGMEGRLVLVQ
        E+ + GGGD +G G GS      GRLVLVQ
Subjt:  ENSEDGGGDIWGDGSGSFRDGMEGRLVLVQ

AT4G14830.1 unknown protein (e value: 5.7e-42; % identity: 60.76)
Query:  MKVHPLPRKRNITVRNNPNSRNSLEDQSLLNHKKLRRLPHIFSRVLELPFRSDADVLVEENPDCFRFIAETDGNISDGVRAHAVEIHPGVIKIVVREN--
        MK+HPLPR       NN N  +   D +    KKLRRLPHIFSRVLELP +SDADV VEE+ DCFRF+AETDG    GVRA+ VEIHPGV KI+VR N  
Subjt:  MKVHPLPRKRNITVRNNPNSRNSLEDQSLLNHKKLRRLPHIFSRVLELPFRSDADVLVEENPDCFRFIAETDGNISDGVRAHAVEIHPGVIKIVVREN--

Query:  ESLEMTIDELELDMWRFRLPETTRPELASA-AFVDGELIVTVPKGNEEENSEDGGGDI
         SL +++DELELD+WRFRLPE+TRPEL +     DGELIVTVPK       ED G D+
Subjt:  ESLEMTIDELELDMWRFRLPETTRPELASA-AFVDGELIVTVPKGNEEENSEDGGGDI
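
In BLAST-style pairwise output such as the alignments above, the unlabeled middle line conventionally shows a letter where query and subject residues are identical, `+` for a similar (positive-scoring) substitution, and a space for a mismatch or gap. The sketch below shows how a percent identity can be derived from that middle line. It is an illustration on a toy string, not the procedure CuGenDB used to produce the reported values, and `midline_identity` is a hypothetical helper name.

```python
def midline_identity(midline: str, aligned_length: int) -> float:
    """Percent identity from a BLAST-style midline.

    Letters mark identical positions; '+' and spaces do not count.
    aligned_length is passed separately because trailing spaces in the
    midline are often lost when the text is copied.
    """
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    identities = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identities / aligned_length


# Toy example: 4 identical positions out of 6 aligned columns.
print(round(midline_identity("MK+HP", 6), 2))  # 66.67
```

For a multi-block alignment, the identity counts and lengths of all blocks would need to be summed before dividing.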


Sequences
CDS sequence
ATGATTCTCCCTTTCCGTCCTCTGAACAGCAATCCCATCAAGATTTCGATTCCTTTCGAAATTCGTCCGGTGTTTTTCACTAGGGCATCTTTGATTTCCCTGTTAATTTT
GGTTTTAGCCCTCGTTGTTTCTCCCGCCCCTTCAAATCCTATTGCGAGTGTTAAGAAGGAGCTCGAAATCAACACCACCGCCGCCATGAAGGTCCACCCATTGCCGAGGA
AGCGCAATATCACCGTCCGAAACAACCCCAATTCGAGAAACTCTCTCGAAGATCAATCCCTTCTGAACCACAAGAAACTCAGGAGATTGCCCCATATCTTCAGTCGGGTC
CTCGAGCTTCCTTTTCGATCTGATGCGGATGTTTTGGTGGAAGAGAATCCTGATTGCTTTCGATTCATTGCTGAAACTGATGGTAACATTAGCGATGGAGTAAGGGCTCA
TGCTGTGGAAATTCATCCTGGGGTTATTAAGATCGTTGTGCGTGAAAATGAATCGTTGGAAATGACAATAGATGAGCTCGAATTGGACATGTGGAGGTTTCGGCTACCGG
AGACGACACGGCCGGAGCTTGCAAGTGCGGCGTTTGTTGATGGAGAACTTATAGTTACTGTTCCAAAGGGGAATGAGGAGGAGAATTCTGAAGATGGTGGAGGAGATATC
TGGGGAGATGGGAGCGGGAGCTTTAGAGATGGAATGGAAGGTCGGCTTGTTCTTGTACAGTAA
mRNA sequence
AAAAACTTTCCCTTCCTTTTAATCTGCTGAGCGATTCTCTCTGCTCTTAGCTCTCACGGCTCTCCCTTTGCATTCTCTTCCCCGAATGCCTTCTTCTTCTTCCTTCGTTT
CGATTCAATTTCCCAATTCTCTCAAATCCCTCGATCACATGATTCCCTTTTTCAGTCCCCATTTCTGAAAATTAGGTTCTAATCCAATATCTGTTTCTAGTTTCTTCATC
CATTTTGTTTTTACCCAAACCAATTCGTTTGATTCCTTTATGATTCTCCCTTTCCGTCCTCTGAACAGCAATCCCATCAAGATTTCGATTCCTTTCGAAATTCGTCCGGT
GTTTTTCACTAGGGCATCTTTGATTTCCCTGTTAATTTTGGTTTTAGCCCTCGTTGTTTCTCCCGCCCCTTCAAATCCTATTGCGAGTGTTAAGAAGGAGCTCGAAATCA
ACACCACCGCCGCCATGAAGGTCCACCCATTGCCGAGGAAGCGCAATATCACCGTCCGAAACAACCCCAATTCGAGAAACTCTCTCGAAGATCAATCCCTTCTGAACCAC
AAGAAACTCAGGAGATTGCCCCATATCTTCAGTCGGGTCCTCGAGCTTCCTTTTCGATCTGATGCGGATGTTTTGGTGGAAGAGAATCCTGATTGCTTTCGATTCATTGC
TGAAACTGATGGTAACATTAGCGATGGAGTAAGGGCTCATGCTGTGGAAATTCATCCTGGGGTTATTAAGATCGTTGTGCGTGAAAATGAATCGTTGGAAATGACAATAG
ATGAGCTCGAATTGGACATGTGGAGGTTTCGGCTACCGGAGACGACACGGCCGGAGCTTGCAAGTGCGGCGTTTGTTGATGGAGAACTTATAGTTACTGTTCCAAAGGGG
AATGAGGAGGAGAATTCTGAAGATGGTGGAGGAGATATCTGGGGAGATGGGAGCGGGAGCTTTAGAGATGGAATGGAAGGTCGGCTTGTTCTTGTACAGTAAATTGAATC
CTTTCCTCTTTTTTTTGGTTTTTGTACTCACTTTGGAAATTTCTTTCATCATTATTAACGTTGTTTCAGATTTCCTGTTAGTTTTCCATTGCTGAATCAGTTGACCCTTT
AGCTTAAAAGGAATTTAGAAAGAAAAACCAAATTGCTTCCTATCTGAATGGTTTATCTAATCCATCACAAGAACTTGAAAACGTAACACTAAACATTTTCTTTTCTTGAG
TTCGACAAACCTCTGATCTTTTGAAAGTACTTTCTTTAACCAGCTAAAGTTGGCATTCAACTCTGAAACATTGTGATCCTTTTGCATAATAACCTGATTTAGGAAGCTTC
TGGTTACTTGCGTATGCAATTTGAACTCCTTTTCTTTTCTCCCTACAACATTCTAACCCGACCCCTATAAGCTCTGAAACAACCTATGTTTCAAATATGGGCTTTCTTTG
AGTATAATCATTATATCAGAATTGTTATCTGTTAGCTTGAATGAAACTGATAACTCAACTGGATAGTTTTGCACAGAAGATGGTTATTTTAACATGCAGTTTTCATACTT
TCCCTTTGCTTTCATTTCTTAATCAGCAATAAAAAGTTCTGAAACTTGATTGCCAATGCATTTTCTGAGTGGTTTAGCCATTTTCTAGACATCATTTTCCCTTCTTTTTC
TTTTGCTTGTACCTCAAACACTTGTCCTTCTCTGTGGTTTGGAAAGTATTGCTTCTTTCTTGTCCATTTATGGGTCCTATATGTAGCCATATGTTCCTTACCTTTTGCAT
TTAAAACATAATAGGGGGCACTTTTAATGCTAGCCAAAGATAACATAAAACTTTCTTTTAAGCCTTTTTCACATCTTCCACTTTCAAGTTGGCAAATTCTTATATTCCTA
AATCCTAACTGTTTGCCTATTCTCATCCCTTAGATCTCCTTTTGCCTTTTTGATGGTTGACCTGTTCATTTGAGCTATAGCTGCCTTCATGCTCTTTGATATCTTTATCT
TTTAAATGTTTTTGTGGGTCATGTTTGCTATAGGTAGTATAGCCAATGGAAACCGTAGATTTTCCACTGGG
Protein sequence
MILPFRPLNSNPIKISIPFEIRPVFFTRASLISLLILVLALVVSPAPSNPIASVKKELEINTTAAMKVHPLPRKRNITVRNNPNSRNSLEDQSLLNHKKLRRLPHIFSRV
LELPFRSDADVLVEENPDCFRFIAETDGNISDGVRAHAVEIHPGVIKIVVRENESLEMTIDELELDMWRFRLPETTRPELASAAFVDGELIVTVPKGNEEENSEDGGGDI
WGDGSGSFRDGMEGRLVLVQ
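
The CDS and protein sequences listed above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code applies to this gene. The function name `translate` and the table construction are illustrative and not taken from CuGenDB.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First six codons and last three codons of the CDS listed above.
print(translate("ATGATTCTCCCTTTCCGT"))  # MILPFR
print(translate("GTACAGTAA"))           # VQ
```

Passing the full CDS (line breaks are ignored) and comparing the result with the listed protein sequence gives a quick consistency check for the record.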