; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0018651 (gene) of Snake gourd v1 genome

Gene IDTan0018651
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionUnknown protein
Genome locationLG06:31810129..31845773
RNA-Seq ExpressionTan0018651
SyntenyTan0018651
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_038897352.1 uncharacterized protein LOC120085457 isoform X1 [Benincasa hispida]8.6e-2934.22Show/hide
Query:  IVPLRKELKMLEDEYLYILQERYAKDW-NTTASSGIDIIHLQYVAEYMLIMLLRWRNDLVMLEERLLRQLNSRYMESSNLSLTSSHASTYPSPFGHGDRE
        +V LRKE  M+EDEYL++LQER+ K+  NT+ S  +  IH +YV EY+   +L W+NDL +LE+   R+L  +Y          S+  +Y       D E
Subjt:  IVPLRKELKMLEDEYLYILQERYAKDW-NTTASSGIDIIHLQYVAEYMLIMLLRWRNDLVMLEERLLRQLNSRYMESSNLSLTSSHASTYPSPFGHGDRE

Query:  KLASSSRRLKELERVYRILVRWRYLRIHSSISSIDVIGQIHSKAFNPFAQDELAHLQSQLKT------EHLFRAVILYDLEC-----DLSTWTQGLEHTE
        ++    + LKE E     L++ ++L I + I S+DV  QI++K  +    DE+  L+ ++K         L +   +Y+ +       + T  + L   E
Subjt:  KLASSSRRLKELERVYRILVRWRYLRIHSSISSIDVIGQIHSKAFNPFAQDELAHLQSQLKT------EHLFRAVILYDLEC-----DLSTWTQGLEHTE

Query:  YGFLSMLR-NRCSENWKRMSEALERFFGDKGDIGEEREYLTWLYNSIKSYNLYCSECNICINKVIQCALKLEDDGREIDSQISYEDDEEEEVPENCYQVL
          +   L+ N  ++N+ +  +     F    +  EE E+LT LY S+KS+N+YC EC   IN +I+ AL LEDD  E+ + IS+E    E V E C  V+
Subjt:  YGFLSMLR-NRCSENWKRMSEALERFFGDKGDIGEEREYLTWLYNSIKSYNLYCSECNICINKVIQCALKLEDDGREIDSQISYEDDEEEEVPENCYQVL

Query:  HKFFNSRSLISCPNCKIYIRRMKIQHAKVSENRTLTSTS
         KF  SRS I CP C   +R   IQ    S     TSTS
Subjt:  HKFFNSRSLISCPNCKIYIRRMKIQHAKVSENRTLTSTS

XP_038897355.1 uncharacterized protein LOC120085457 isoform X2 [Benincasa hispida]8.6e-2934.22Show/hide
Query:  IVPLRKELKMLEDEYLYILQERYAKDW-NTTASSGIDIIHLQYVAEYMLIMLLRWRNDLVMLEERLLRQLNSRYMESSNLSLTSSHASTYPSPFGHGDRE
        +V LRKE  M+EDEYL++LQER+ K+  NT+ S  +  IH +YV EY+   +L W+NDL +LE+   R+L  +Y          S+  +Y       D E
Subjt:  IVPLRKELKMLEDEYLYILQERYAKDW-NTTASSGIDIIHLQYVAEYMLIMLLRWRNDLVMLEERLLRQLNSRYMESSNLSLTSSHASTYPSPFGHGDRE

Query:  KLASSSRRLKELERVYRILVRWRYLRIHSSISSIDVIGQIHSKAFNPFAQDELAHLQSQLKT------EHLFRAVILYDLEC-----DLSTWTQGLEHTE
        ++    + LKE E     L++ ++L I + I S+DV  QI++K  +    DE+  L+ ++K         L +   +Y+ +       + T  + L   E
Subjt:  KLASSSRRLKELERVYRILVRWRYLRIHSSISSIDVIGQIHSKAFNPFAQDELAHLQSQLKT------EHLFRAVILYDLEC-----DLSTWTQGLEHTE

Query:  YGFLSMLR-NRCSENWKRMSEALERFFGDKGDIGEEREYLTWLYNSIKSYNLYCSECNICINKVIQCALKLEDDGREIDSQISYEDDEEEEVPENCYQVL
          +   L+ N  ++N+ +  +     F    +  EE E+LT LY S+KS+N+YC EC   IN +I+ AL LEDD  E+ + IS+E    E V E C  V+
Subjt:  YGFLSMLR-NRCSENWKRMSEALERFFGDKGDIGEEREYLTWLYNSIKSYNLYCSECNICINKVIQCALKLEDDGREIDSQISYEDDEEEEVPENCYQVL

Query:  HKFFNSRSLISCPNCKIYIRRMKIQHAKVSENRTLTSTS
         KF  SRS I CP C   +R   IQ    S     TSTS
Subjt:  HKFFNSRSLISCPNCKIYIRRMKIQHAKVSENRTLTSTS

XP_038897356.1 uncharacterized protein LOC120085457 isoform X3 [Benincasa hispida]8.6e-2934.22Show/hide
Query:  IVPLRKELKMLEDEYLYILQERYAKDW-NTTASSGIDIIHLQYVAEYMLIMLLRWRNDLVMLEERLLRQLNSRYMESSNLSLTSSHASTYPSPFGHGDRE
        +V LRKE  M+EDEYL++LQER+ K+  NT+ S  +  IH +YV EY+   +L W+NDL +LE+   R+L  +Y          S+  +Y       D E
Subjt:  IVPLRKELKMLEDEYLYILQERYAKDW-NTTASSGIDIIHLQYVAEYMLIMLLRWRNDLVMLEERLLRQLNSRYMESSNLSLTSSHASTYPSPFGHGDRE

Query:  KLASSSRRLKELERVYRILVRWRYLRIHSSISSIDVIGQIHSKAFNPFAQDELAHLQSQLKT------EHLFRAVILYDLEC-----DLSTWTQGLEHTE
        ++    + LKE E     L++ ++L I + I S+DV  QI++K  +    DE+  L+ ++K         L +   +Y+ +       + T  + L   E
Subjt:  KLASSSRRLKELERVYRILVRWRYLRIHSSISSIDVIGQIHSKAFNPFAQDELAHLQSQLKT------EHLFRAVILYDLEC-----DLSTWTQGLEHTE

Query:  YGFLSMLR-NRCSENWKRMSEALERFFGDKGDIGEEREYLTWLYNSIKSYNLYCSECNICINKVIQCALKLEDDGREIDSQISYEDDEEEEVPENCYQVL
          +   L+ N  ++N+ +  +     F    +  EE E+LT LY S+KS+N+YC EC   IN +I+ AL LEDD  E+ + IS+E    E V E C  V+
Subjt:  YGFLSMLR-NRCSENWKRMSEALERFFGDKGDIGEEREYLTWLYNSIKSYNLYCSECNICINKVIQCALKLEDDGREIDSQISYEDDEEEEVPENCYQVL

Query:  HKFFNSRSLISCPNCKIYIRRMKIQHAKVSENRTLTSTS
         KF  SRS I CP C   +R   IQ    S     TSTS
Subjt:  HKFFNSRSLISCPNCKIYIRRMKIQHAKVSENRTLTSTS

XP_038897357.1 uncharacterized protein LOC120085457 isoform X4 [Benincasa hispida]8.6e-2934.22Show/hide
Query:  IVPLRKELKMLEDEYLYILQERYAKDW-NTTASSGIDIIHLQYVAEYMLIMLLRWRNDLVMLEERLLRQLNSRYMESSNLSLTSSHASTYPSPFGHGDRE
        +V LRKE  M+EDEYL++LQER+ K+  NT+ S  +  IH +YV EY+   +L W+NDL +LE+   R+L  +Y          S+  +Y       D E
Subjt:  IVPLRKELKMLEDEYLYILQERYAKDW-NTTASSGIDIIHLQYVAEYMLIMLLRWRNDLVMLEERLLRQLNSRYMESSNLSLTSSHASTYPSPFGHGDRE

Query:  KLASSSRRLKELERVYRILVRWRYLRIHSSISSIDVIGQIHSKAFNPFAQDELAHLQSQLKT------EHLFRAVILYDLEC-----DLSTWTQGLEHTE
        ++    + LKE E     L++ ++L I + I S+DV  QI++K  +    DE+  L+ ++K         L +   +Y+ +       + T  + L   E
Subjt:  KLASSSRRLKELERVYRILVRWRYLRIHSSISSIDVIGQIHSKAFNPFAQDELAHLQSQLKT------EHLFRAVILYDLEC-----DLSTWTQGLEHTE

Query:  YGFLSMLR-NRCSENWKRMSEALERFFGDKGDIGEEREYLTWLYNSIKSYNLYCSECNICINKVIQCALKLEDDGREIDSQISYEDDEEEEVPENCYQVL
          +   L+ N  ++N+ +  +     F    +  EE E+LT LY S+KS+N+YC EC   IN +I+ AL LEDD  E+ + IS+E    E V E C  V+
Subjt:  YGFLSMLR-NRCSENWKRMSEALERFFGDKGDIGEEREYLTWLYNSIKSYNLYCSECNICINKVIQCALKLEDDGREIDSQISYEDDEEEEVPENCYQVL

Query:  HKFFNSRSLISCPNCKIYIRRMKIQHAKVSENRTLTSTS
         KF  SRS I CP C   +R   IQ    S     TSTS
Subjt:  HKFFNSRSLISCPNCKIYIRRMKIQHAKVSENRTLTSTS

XP_038897359.1 uncharacterized protein LOC120085457 isoform X5 [Benincasa hispida]8.6e-2934.22Show/hide
Query:  IVPLRKELKMLEDEYLYILQERYAKDW-NTTASSGIDIIHLQYVAEYMLIMLLRWRNDLVMLEERLLRQLNSRYMESSNLSLTSSHASTYPSPFGHGDRE
        +V LRKE  M+EDEYL++LQER+ K+  NT+ S  +  IH +YV EY+   +L W+NDL +LE+   R+L  +Y          S+  +Y       D E
Subjt:  IVPLRKELKMLEDEYLYILQERYAKDW-NTTASSGIDIIHLQYVAEYMLIMLLRWRNDLVMLEERLLRQLNSRYMESSNLSLTSSHASTYPSPFGHGDRE

Query:  KLASSSRRLKELERVYRILVRWRYLRIHSSISSIDVIGQIHSKAFNPFAQDELAHLQSQLKT------EHLFRAVILYDLEC-----DLSTWTQGLEHTE
        ++    + LKE E     L++ ++L I + I S+DV  QI++K  +    DE+  L+ ++K         L +   +Y+ +       + T  + L   E
Subjt:  KLASSSRRLKELERVYRILVRWRYLRIHSSISSIDVIGQIHSKAFNPFAQDELAHLQSQLKT------EHLFRAVILYDLEC-----DLSTWTQGLEHTE

Query:  YGFLSMLR-NRCSENWKRMSEALERFFGDKGDIGEEREYLTWLYNSIKSYNLYCSECNICINKVIQCALKLEDDGREIDSQISYEDDEEEEVPENCYQVL
          +   L+ N  ++N+ +  +     F    +  EE E+LT LY S+KS+N+YC EC   IN +I+ AL LEDD  E+ + IS+E    E V E C  V+
Subjt:  YGFLSMLR-NRCSENWKRMSEALERFFGDKGDIGEEREYLTWLYNSIKSYNLYCSECNICINKVIQCALKLEDDGREIDSQISYEDDEEEEVPENCYQVL

Query:  HKFFNSRSLISCPNCKIYIRRMKIQHAKVSENRTLTSTS
         KF  SRS I CP C   +R   IQ    S     TSTS
Subjt:  HKFFNSRSLISCPNCKIYIRRMKIQHAKVSENRTLTSTS

TrEMBL top hitse value%identityAlignment
A0A6J1CWM1 uncharacterized protein LOC111015471 isoform X11.3e-0931.02Show/hide
Query:  IVPLRKELKMLEDEYLYILQERYAKDWNTTASSGIDIIHLQYVAEYMLIMLLRWRNDLVMLEERLLRQLNSRYMESSNLSLTSSHASTYPSPFGHGDREK
        +V LR++   LED+YL +LQ+RY K+ + T+    ++I+ +Y+ EY+L ++L WRN L +LE+    +L   Y   S + + S       S   HG    
Subjt:  IVPLRKELKMLEDEYLYILQERYAKDWNTTASSGIDIIHLQYVAEYMLIMLLRWRNDLVMLEERLLRQLNSRYMESSNLSLTSSHASTYPSPFGHGDREK

Query:  LASSSRRLKELERVYRILVRWRYLRIHSSISSIDVIGQIHSKAFNPFAQDELAHLQSQLKTEHLFRAVILYDLECDLSTWTQ-GLEH
            +   K++E  Y  L+R   L  H++ISSIDV  QI+ K   P   D +   ++++    + R  +L +    L   TQ G++H
Subjt:  LASSSRRLKELERVYRILVRWRYLRIHSSISSIDVIGQIHSKAFNPFAQDELAHLQSQLKTEHLFRAVILYDLECDLSTWTQ-GLEH

A0A6J1CYL4 uncharacterized protein LOC111015471 isoform X21.2e-0733.55Show/hide
Query:  IVPLRKELKMLEDEYLYILQERYAKDWNTTASSGIDIIHLQYVAEYMLIMLLRWRNDLVMLEERLLRQLNSRYMESSNLSLTSSHASTYPSPFGHGDREK
        +V LR++   LED+YL +LQ+RY K+ + T+    ++I+ +Y+ EY+L ++L WRN L +LE+    +L   Y   S + + S       S   HG    
Subjt:  IVPLRKELKMLEDEYLYILQERYAKDWNTTASSGIDIIHLQYVAEYMLIMLLRWRNDLVMLEERLLRQLNSRYMESSNLSLTSSHASTYPSPFGHGDREK

Query:  LASSSRRLKELERVYRILVRWRYLRIHSSISSIDVIGQIHSKAFNPFAQDEL
            +   K++E  Y  L+R   L  H++ISSIDV  QI+ K   P   D +
Subjt:  LASSSRRLKELERVYRILVRWRYLRIHSSISSIDVIGQIHSKAFNPFAQDEL

A0A6J1GQA4 uncharacterized protein LOC111456533 isoform X15.8e-2330.45Show/hide
Query:  IVPLRKELKMLEDEYLYILQERYAKDW-NTTASSGIDIIHLQYVAEYMLIMLLRWRNDLVMLEERLLRQLNSRYMESSNLSLTSSHASTYPSPFGHGDRE
        +V LR++   +EDEYL++L+ER+ K   NT  S  +  IHL+Y+ E +L   L WRND ++  +   R+L   + E   L                 D E
Subjt:  IVPLRKELKMLEDEYLYILQERYAKDW-NTTASSGIDIIHLQYVAEYMLIMLLRWRNDLVMLEERLLRQLNSRYMESSNLSLTSSHASTYPSPFGHGDRE

Query:  KLASSSRRLKELERVYRILVRWRYLRIHSSISSIDVIGQIHSKAFNPFAQDELAHLQSQLKTEHLFRAVILYDLECD-----LSTWTQGLEHTEYGFLSM
         +      LKE ER Y   +  R++    +I  +DV  QI++K  +P    ++  L+ ++K           + +CD     ++   +  E  EY  ++ 
Subjt:  KLASSSRRLKELERVYRILVRWRYLRIHSSISSIDVIGQIHSKAFNPFAQDELAHLQSQLKTEHLFRAVILYDLECD-----LSTWTQGLEHTEYGFLSM

Query:  LRN-------RCSENWKRMSEALERFFG--------DKGDIGEEREYLTWLYNSIKSYNLYCSECNICINKVIQCALKLEDDGREIDSQISYEDDEEEEV
         R        R  + +K   E+ ++F                E   +   L +SIKSYN+YCS+C  CIN +IQ  LKLE D  E  ++ SYED  E+++
Subjt:  LRN-------RCSENWKRMSEALERFFG--------DKGDIGEEREYLTWLYNSIKSYNLYCSECNICINKVIQCALKLEDDGREIDSQISYEDDEEEEV

Query:  PENCYQVLHKFFNSRSLISCPNCKIYIRRMKIQHA
        P+ CY+V++KF N  +   CP C  Y+R + IQHA
Subjt:  PENCYQVLHKFFNSRSLISCPNCKIYIRRMKIQHA

A0A6J1GQF7 uncharacterized protein LOC111456533 isoform X35.8e-2330.45Show/hide
Query:  IVPLRKELKMLEDEYLYILQERYAKDW-NTTASSGIDIIHLQYVAEYMLIMLLRWRNDLVMLEERLLRQLNSRYMESSNLSLTSSHASTYPSPFGHGDRE
        +V LR++   +EDEYL++L+ER+ K   NT  S  +  IHL+Y+ E +L   L WRND ++  +   R+L   + E   L                 D E
Subjt:  IVPLRKELKMLEDEYLYILQERYAKDW-NTTASSGIDIIHLQYVAEYMLIMLLRWRNDLVMLEERLLRQLNSRYMESSNLSLTSSHASTYPSPFGHGDRE

Query:  KLASSSRRLKELERVYRILVRWRYLRIHSSISSIDVIGQIHSKAFNPFAQDELAHLQSQLKTEHLFRAVILYDLECD-----LSTWTQGLEHTEYGFLSM
         +      LKE ER Y   +  R++    +I  +DV  QI++K  +P    ++  L+ ++K           + +CD     ++   +  E  EY  ++ 
Subjt:  KLASSSRRLKELERVYRILVRWRYLRIHSSISSIDVIGQIHSKAFNPFAQDELAHLQSQLKTEHLFRAVILYDLECD-----LSTWTQGLEHTEYGFLSM

Query:  LRN-------RCSENWKRMSEALERFFG--------DKGDIGEEREYLTWLYNSIKSYNLYCSECNICINKVIQCALKLEDDGREIDSQISYEDDEEEEV
         R        R  + +K   E+ ++F                E   +   L +SIKSYN+YCS+C  CIN +IQ  LKLE D  E  ++ SYED  E+++
Subjt:  LRN-------RCSENWKRMSEALERFFG--------DKGDIGEEREYLTWLYNSIKSYNLYCSECNICINKVIQCALKLEDDGREIDSQISYEDDEEEEV

Query:  PENCYQVLHKFFNSRSLISCPNCKIYIRRMKIQHA
        P+ CY+V++KF N  +   CP C  Y+R + IQHA
Subjt:  PENCYQVLHKFFNSRSLISCPNCKIYIRRMKIQHA

A0A6J1GQF8 uncharacterized protein LOC111456533 isoform X25.8e-2330.45Show/hide
Query:  IVPLRKELKMLEDEYLYILQERYAKDW-NTTASSGIDIIHLQYVAEYMLIMLLRWRNDLVMLEERLLRQLNSRYMESSNLSLTSSHASTYPSPFGHGDRE
        +V LR++   +EDEYL++L+ER+ K   NT  S  +  IHL+Y+ E +L   L WRND ++  +   R+L   + E   L                 D E
Subjt:  IVPLRKELKMLEDEYLYILQERYAKDW-NTTASSGIDIIHLQYVAEYMLIMLLRWRNDLVMLEERLLRQLNSRYMESSNLSLTSSHASTYPSPFGHGDRE

Query:  KLASSSRRLKELERVYRILVRWRYLRIHSSISSIDVIGQIHSKAFNPFAQDELAHLQSQLKTEHLFRAVILYDLECD-----LSTWTQGLEHTEYGFLSM
         +      LKE ER Y   +  R++    +I  +DV  QI++K  +P    ++  L+ ++K           + +CD     ++   +  E  EY  ++ 
Subjt:  KLASSSRRLKELERVYRILVRWRYLRIHSSISSIDVIGQIHSKAFNPFAQDELAHLQSQLKTEHLFRAVILYDLECD-----LSTWTQGLEHTEYGFLSM

Query:  LRN-------RCSENWKRMSEALERFFG--------DKGDIGEEREYLTWLYNSIKSYNLYCSECNICINKVIQCALKLEDDGREIDSQISYEDDEEEEV
         R        R  + +K   E+ ++F                E   +   L +SIKSYN+YCS+C  CIN +IQ  LKLE D  E  ++ SYED  E+++
Subjt:  LRN-------RCSENWKRMSEALERFFG--------DKGDIGEEREYLTWLYNSIKSYNLYCSECNICINKVIQCALKLEDDGREIDSQISYEDDEEEEV

Query:  PENCYQVLHKFFNSRSLISCPNCKIYIRRMKIQHA
        P+ CY+V++KF N  +   CP C  Y+R + IQHA
Subjt:  PENCYQVLHKFFNSRSLISCPNCKIYIRRMKIQHA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCGTTCCTCTTAGAAAAGAACTTAAAATGCTGGAAGATGAATATTTATATATACTGCAAGAGAGATATGCAAAGGATTGGAACACCACAGCATCATCTGGCATTGA
TATAATACATTTACAATATGTCGCGGAATATATGCTCATAATGTTGTTACGTTGGAGAAATGATTTGGTAATGTTAGAAGAGAGACTTCTTCGCCAATTAAATAGCAGAT
ACATGGAAAGTAGCAACCTGAGCCTCACGTCGAGCCACGCAAGCACTTATCCATCCCCTTTTGGACATGGGGATCGAGAAAAGCTTGCAAGTTCAAGCAGGAGACTGAAG
GAGCTAGAACGAGTATACCGTATTCTCGTGCGCTGGAGATATTTAAGAATCCACAGCAGCATATCATCAATTGATGTCATTGGTCAAATACATTCAAAAGCGTTTAATCC
ATTTGCTCAAGATGAGTTAGCACATTTACAATCGCAATTGAAAACGGAACACCTATTCAGAGCAGTGATATTATACGACTTGGAATGCGACCTGAGCACTTGGACACAAG
GTTTAGAGCATACAGAATACGGTTTTCTTTCTATGCTGAGAAATAGATGTTCAGAGAATTGGAAACGCATGTCCGAAGCTTTAGAACGATTCTTTGGTGATAAAGGCGAT
ATTGGAGAAGAACGGGAATATCTAACGTGGTTATATAATTCAATAAAGTCATATAACTTGTACTGTTCCGAGTGCAACATTTGTATCAACAAGGTGATTCAATGTGCCTT
GAAATTGGAAGACGATGGAAGAGAAATAGATTCTCAGATATCATATGAAGATGATGAAGAAGAAGAAGTGCCAGAAAATTGTTACCAAGTGCTGCATAAGTTTTTTAATT
CACGTAGTCTCATCTCATGTCCAAATTGCAAGATTTATATTAGAAGGATGAAAATTCAACATGCTAAGGTATCCGAAAATAGAACTTTGACATCCACTTCAAGTGAAGAA
AATCAGAATTTATTATGTACCACAGCAATCTCTAAGGTCATCACTCTTGGATCAGGCTCTCCCGCTGGATGCTCAGAAACGCAAAAGGAAATAACAAGATCAAGAAGTCT
GAAACAAGGAGGAGGAGGAGAGCCATTTAAGGGGTTGGACAATTCTACTGTGCAAAGCAGTAAAGAAGTTTTTATTCATCAACGATTACAAGGTTAA
mRNA sequenceShow/hide mRNA sequence
ATAGATTCCAAAAAAAATATATTAGAAATGATCGTTCCTCTTAGAAAAGAACTTAAAATGCTGGAAGATGAATATTTATATATACTGCAAGAGAGATATGCAAAGGATTG
GAACACCACAGCATCATCTGGCATTGATATAATACATTTACAATATGTCGCGGAATATATGCTCATAATGTTGTTACGTTGGAGAAATGATTTGGTAATGTTAGAAGAGA
GACTTCTTCGCCAATTAAATAGCAGATACATGGAAAGTAGCAACCTGAGCCTCACGTCGAGCCACGCAAGCACTTATCCATCCCCTTTTGGACATGGGGATCGAGAAAAG
CTTGCAAGTTCAAGCAGGAGACTGAAGGAGCTAGAACGAGTATACCGTATTCTCGTGCGCTGGAGATATTTAAGAATCCACAGCAGCATATCATCAATTGATGTCATTGG
TCAAATACATTCAAAAGCGTTTAATCCATTTGCTCAAGATGAGTTAGCACATTTACAATCGCAATTGAAAACGGAACACCTATTCAGAGCAGTGATATTATACGACTTGG
AATGCGACCTGAGCACTTGGACACAAGGTTTAGAGCATACAGAATACGGTTTTCTTTCTATGCTGAGAAATAGATGTTCAGAGAATTGGAAACGCATGTCCGAAGCTTTA
GAACGATTCTTTGGTGATAAAGGCGATATTGGAGAAGAACGGGAATATCTAACGTGGTTATATAATTCAATAAAGTCATATAACTTGTACTGTTCCGAGTGCAACATTTG
TATCAACAAGGTGATTCAATGTGCCTTGAAATTGGAAGACGATGGAAGAGAAATAGATTCTCAGATATCATATGAAGATGATGAAGAAGAAGAAGTGCCAGAAAATTGTT
ACCAAGTGCTGCATAAGTTTTTTAATTCACGTAGTCTCATCTCATGTCCAAATTGCAAGATTTATATTAGAAGGATGAAAATTCAACATGCTAAGGTATCCGAAAATAGA
ACTTTGACATCCACTTCAAGTGAAGAAAATCAGAATTTATTATGTACCACAGCAATCTCTAAGGTCATCACTCTTGGATCAGGCTCTCCCGCTGGATGCTCAGAAACGCA
AAAGGAAATAACAAGATCAAGAAGTCTGAAACAAGGAGGAGGAGGAGAGCCATTTAAGGGGTTGGACAATTCTACTGTGCAAAGCAGTAAAGAAGTTTTTATTCATCAAC
GATTACAAGGTTAA
Protein sequenceShow/hide protein sequence
MIVPLRKELKMLEDEYLYILQERYAKDWNTTASSGIDIIHLQYVAEYMLIMLLRWRNDLVMLEERLLRQLNSRYMESSNLSLTSSHASTYPSPFGHGDREKLASSSRRLK
ELERVYRILVRWRYLRIHSSISSIDVIGQIHSKAFNPFAQDELAHLQSQLKTEHLFRAVILYDLECDLSTWTQGLEHTEYGFLSMLRNRCSENWKRMSEALERFFGDKGD
IGEEREYLTWLYNSIKSYNLYCSECNICINKVIQCALKLEDDGREIDSQISYEDDEEEEVPENCYQVLHKFFNSRSLISCPNCKIYIRRMKIQHAKVSENRTLTSTSSEE
NQNLLCTTAISKVITLGSGSPAGCSETQKEITRSRSLKQGGGGEPFKGLDNSTVQSSKEVFIHQRLQG