; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0032386 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0032386
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionankyrin repeat-containing protein NPR4-like
Genome locationchr11:31729427..31732160
RNA-Seq ExpressionLag0032386
SyntenyLag0032386
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TXG69253.1 hypothetical protein EZV62_004188 [Acer yangbiense]1.5e-4335.11Show/hide
Query:  ESRITAASVAVTTPT-------ANIISSSFGHPLSTVLTVKLDGKNYLLWRGMVLAILRGQKVDGYV-GTKARPSEFLES-SSEADKSELTP--------
        +S +  +S +  TPT       ++  SS FG+ L+    +KLD +N++LW+ MV  I++G ++DG++  T+  P EFL S ++    S  TP        
Subjt:  ESRITAASVAVTTPT-------ANIISSSFGHPLSTVLTVKLDGKNYLLWRGMVLAILRGQKVDGYV-GTKARPSEFLES-SSEADKSELTP--------

Query:  -NPKFEEWTTVDQALSGWLFGSMTPVVAADVVNFTTSREVWKALEQMYGATNKARINQLKGVLQNTKKGSTKMLDYLATMRQASENLKLVGAPV------
         NP++E+W   DQ L GWL+ SMT  VA  V+  TT+  +WKALE ++GA +K++ N ++  +Q T+KGS+ M +YL  M+  +++L + G P       
Subjt:  -NPKFEEWTTVDQALSGWLFGSMTPVVAADVVNFTTSREVWKALEQMYGATNKARINQLKGVLQNTKKGSTKMLDYLATMRQASENLKLVGAPV------

Query:  --SLSDLVS---PI----------------STEVSVDQTSNYAYN--RQGN-FSGGQQFQGQNNQRNKNRGNQGYQYSNYGPRNNNNGNRGRGRGTYGNQ
          SL+ L S   PI                 T +S D    +  N   +GN  S        N   N    N+     N     N   NRG  RG  G  
Subjt:  --SLSDLVS---PI----------------STEVSVDQTSNYAYN--RQGN-FSGGQQFQGQNNQRNKNRGNQGYQYSNYGPRNNNNGNRGRGRGTYGNQ

Query:  RG-----NNSQPTCQLYRKFGHAAPACYFRFEEDFNNPHGSSNKNNGENSAFIATPEVVCDPNWLADSGPTSHITANATNMNVKTDYNGNDSL
        RG     NNS+PTCQ+  KFGH+A  CYFR+++++     ++N N    S F+ATPE V D  W ADSG T+H+T +A N+++K+DY G++SL
Subjt:  RG-----NNSQPTCQLYRKFGHAAPACYFRFEEDFNNPHGSSNKNNGENSAFIATPEVVCDPNWLADSGPTSHITANATNMNVKTDYNGNDSL

XP_022143579.1 ankyrin repeat-containing protein NPR4-like [Momordica charantia]1.7e-5064.71Show/hide
Query:  ISSSFGHPLSTVLTVKLDGKNYLLWRGMVLAILRGQKVDGYV-GTKARPSEFLESSSEADKSELTPNPKFEEWTTVDQALSGWLFGSMTPVVAADVVNFT
        I+ SFGHPLST LTVKLD KNY LW+GMVLA+L GQKVDGYV  TK  PS++  ++S+    E T NP +EEW+ VDQA  GWLFGSMTP +AADVVN  
Subjt:  ISSSFGHPLSTVLTVKLDGKNYLLWRGMVLAILRGQKVDGYV-GTKARPSEFLESSSEADKSELTPNPKFEEWTTVDQALSGWLFGSMTPVVAADVVNFT

Query:  TSREVWKALEQMYGATNKARINQLKGVLQNTKKGSTKMLDYLATMRQASENLKLVGAPVSLSDLVSPIST
        TS EVW ALE ++G+T+KARINQL+  LQNTKKG+ KM  YLA M+Q SE+LKL G PV+LS L S I T
Subjt:  TSREVWKALEQMYGATNKARINQLKGVLQNTKKGSTKMLDYLATMRQASENLKLVGAPVSLSDLVSPIST

XP_022157748.1 uncharacterized protein LOC111024384 isoform X1 [Momordica charantia]3.4e-8048.68Show/hide
Query:  SVAVTTPTAN-IISSSFGHPLSTVLTVKLDGKNYLLWRGMVLAILRGQKVDGYV-GTKARPSEFLES-SSEADKSELTPNPKFEEWTTVDQALSGWLFGS
        +VAV TP  +   ++SFGHPL TVLTVKLD KNY LWRGMVLA+LRGQK DGYV GT A+P +FL S  +E     L  NP++ EW  VDQAL GWLFGS
Subjt:  SVAVTTPTAN-IISSSFGHPLSTVLTVKLDGKNYLLWRGMVLAILRGQKVDGYV-GTKARPSEFLES-SSEADKSELTPNPKFEEWTTVDQALSGWLFGS

Query:  MTPVVAADVVNFTTSREVWKALEQMYGATNKARINQLKGVLQNTKKGSTKMLDYLATMRQASENLKLVGAPVSLS-------------------------
        MTP +A DVV+F +SREVWKALE +YGAT+KARINQL+ VLQNTKK S KM +YL  M+QASE+LKL G PV+ +                         
Subjt:  MTPVVAADVVNFTTSREVWKALEQMYGATNKARINQLKGVLQNTKKGSTKMLDYLATMRQASENLKLVGAPVSLS-------------------------

Query:  ---------------------DLVSPISTEVSVDQTSNYAYNRQGNFSGGQQFQGQNNQRNKNRGNQGYQYSNYGPRNNNNGNRGRGRGTYGNQRGNNSQ
                             ++VS  + E   D + NY +++Q N  G +QF       ++++  QG    +Y   +  N  RGRGRG +   RGNNS+
Subjt:  ---------------------DLVSPISTEVSVDQTSNYAYNRQGNFSGGQQFQGQNNQRNKNRGNQGYQYSNYGPRNNNNGNRGRGRGTYGNQRGNNSQ

Query:  PTCQLYRKFGHAAPACYFRFEEDFNNPHGSSNKNNGENSAFIATPEVVCDPNWLADSGPTSHITANATNMNVKTDYNG
        P+CQL  K+GH A  CY RF+E+FNN    S+ NN  NSA++A PE+V +P+WLADSG T H+T++ +N+NVK+DYNG
Subjt:  PTCQLYRKFGHAAPACYFRFEEDFNNPHGSSNKNNGENSAFIATPEVVCDPNWLADSGPTSHITANATNMNVKTDYNG

XP_022157750.1 uncharacterized protein LOC111024384 isoform X2 [Momordica charantia]3.4e-8048.68Show/hide
Query:  SVAVTTPTAN-IISSSFGHPLSTVLTVKLDGKNYLLWRGMVLAILRGQKVDGYV-GTKARPSEFLES-SSEADKSELTPNPKFEEWTTVDQALSGWLFGS
        +VAV TP  +   ++SFGHPL TVLTVKLD KNY LWRGMVLA+LRGQK DGYV GT A+P +FL S  +E     L  NP++ EW  VDQAL GWLFGS
Subjt:  SVAVTTPTAN-IISSSFGHPLSTVLTVKLDGKNYLLWRGMVLAILRGQKVDGYV-GTKARPSEFLES-SSEADKSELTPNPKFEEWTTVDQALSGWLFGS

Query:  MTPVVAADVVNFTTSREVWKALEQMYGATNKARINQLKGVLQNTKKGSTKMLDYLATMRQASENLKLVGAPVSLS-------------------------
        MTP +A DVV+F +SREVWKALE +YGAT+KARINQL+ VLQNTKK S KM +YL  M+QASE+LKL G PV+ +                         
Subjt:  MTPVVAADVVNFTTSREVWKALEQMYGATNKARINQLKGVLQNTKKGSTKMLDYLATMRQASENLKLVGAPVSLS-------------------------

Query:  ---------------------DLVSPISTEVSVDQTSNYAYNRQGNFSGGQQFQGQNNQRNKNRGNQGYQYSNYGPRNNNNGNRGRGRGTYGNQRGNNSQ
                             ++VS  + E   D + NY +++Q N  G +QF       ++++  QG    +Y   +  N  RGRGRG +   RGNNS+
Subjt:  ---------------------DLVSPISTEVSVDQTSNYAYNRQGNFSGGQQFQGQNNQRNKNRGNQGYQYSNYGPRNNNNGNRGRGRGTYGNQRGNNSQ

Query:  PTCQLYRKFGHAAPACYFRFEEDFNNPHGSSNKNNGENSAFIATPEVVCDPNWLADSGPTSHITANATNMNVKTDYNG
        P+CQL  K+GH A  CY RF+E+FNN    S+ NN  NSA++A PE+V +P+WLADSG T H+T++ +N+NVK+DYNG
Subjt:  PTCQLYRKFGHAAPACYFRFEEDFNNPHGSSNKNNGENSAFIATPEVVCDPNWLADSGPTSHITANATNMNVKTDYNG

XP_030492910.1 uncharacterized protein LOC115709020 isoform X2 [Cannabis sativa]6.1e-4535.53Show/hide
Query:  AASVAVTTPTANIISS---SFGHPLSTV---LTVKLDGKNYLLWRGMVLAILRGQKVDGYV-GTKARPSEFLESSSEADKSELT----PNPKFEEWTTVD
        ++   VT+ TA   SS   +F  P ST+     +KLD  NY LW+ MV  I+RG ++DG+V GT+A P EF+   +EA + +++     NP +E W   D
Subjt:  AASVAVTTPTANIISS---SFGHPLSTV---LTVKLDGKNYLLWRGMVLAILRGQKVDGYV-GTKARPSEFLESSSEADKSELT----PNPKFEEWTTVD

Query:  QALSGWLFGSMTPVVAADVVNFTTSREVWKALEQMYGATNKARINQLKGVLQNTKKGSTKMLDYLATMRQASENLKLVGAPVSLSDLVS-----------
        Q L GWL+ SMT  +A +V+  TT+  +WKALE +YGA +K++++  +  +Q T+KG+T M+DYL   +  S+ L L G P   + L+S           
Subjt:  QALSGWLFGSMTPVVAADVVNFTTSREVWKALEQMYGATNKARINQLKGVLQNTKKGSTKMLDYLATMRQASENLKLVGAPVSLSDLVS-----------

Query:  PISTEVSVDQTSNYA------------------YNRQGNFSGGQQFQGQNNQRNKNRG-------NQGYQYSNYGPRNNNNGNRGRGRGTYGNQRGNNSQ
        PI  ++    ++ +                    N     +        N   N +RG       +     +NY   N  NG+RGRGRG   N   NN++
Subjt:  PISTEVSVDQTSNYA------------------YNRQGNFSGGQQFQGQNNQRNKNRG-------NQGYQYSNYGPRNNNNGNRGRGRGTYGNQRGNNSQ

Query:  PTCQLYRKFGHAAPACYFRFEEDF--NNPHGSSNKNNGENSAFIATPEVVCDPNWLADSGPTSHITANATNMNVKTDYNG
        PTCQ+  KFGH+A  CY R+ E F  ++P+ S N+     SAF A+PEVV    W ADSG +SH+T++ +NM+ K+DY G
Subjt:  PTCQLYRKFGHAAPACYFRFEEDF--NNPHGSSNKNNGENSAFIATPEVVCDPNWLADSGPTSHITANATNMNVKTDYNG

TrEMBL top hitse value%identityAlignment
A0A6J1CPQ7 ankyrin repeat-containing protein NPR4-like8.0e-5164.71Show/hide
Query:  ISSSFGHPLSTVLTVKLDGKNYLLWRGMVLAILRGQKVDGYV-GTKARPSEFLESSSEADKSELTPNPKFEEWTTVDQALSGWLFGSMTPVVAADVVNFT
        I+ SFGHPLST LTVKLD KNY LW+GMVLA+L GQKVDGYV  TK  PS++  ++S+    E T NP +EEW+ VDQA  GWLFGSMTP +AADVVN  
Subjt:  ISSSFGHPLSTVLTVKLDGKNYLLWRGMVLAILRGQKVDGYV-GTKARPSEFLESSSEADKSELTPNPKFEEWTTVDQALSGWLFGSMTPVVAADVVNFT

Query:  TSREVWKALEQMYGATNKARINQLKGVLQNTKKGSTKMLDYLATMRQASENLKLVGAPVSLSDLVSPIST
        TS EVW ALE ++G+T+KARINQL+  LQNTKKG+ KM  YLA M+Q SE+LKL G PV+LS L S I T
Subjt:  TSREVWKALEQMYGATNKARINQLKGVLQNTKKGSTKMLDYLATMRQASENLKLVGAPVSLSDLVSPIST

A0A6J1DTZ7 uncharacterized protein LOC111024384 isoform X21.7e-8048.68Show/hide
Query:  SVAVTTPTAN-IISSSFGHPLSTVLTVKLDGKNYLLWRGMVLAILRGQKVDGYV-GTKARPSEFLES-SSEADKSELTPNPKFEEWTTVDQALSGWLFGS
        +VAV TP  +   ++SFGHPL TVLTVKLD KNY LWRGMVLA+LRGQK DGYV GT A+P +FL S  +E     L  NP++ EW  VDQAL GWLFGS
Subjt:  SVAVTTPTAN-IISSSFGHPLSTVLTVKLDGKNYLLWRGMVLAILRGQKVDGYV-GTKARPSEFLES-SSEADKSELTPNPKFEEWTTVDQALSGWLFGS

Query:  MTPVVAADVVNFTTSREVWKALEQMYGATNKARINQLKGVLQNTKKGSTKMLDYLATMRQASENLKLVGAPVSLS-------------------------
        MTP +A DVV+F +SREVWKALE +YGAT+KARINQL+ VLQNTKK S KM +YL  M+QASE+LKL G PV+ +                         
Subjt:  MTPVVAADVVNFTTSREVWKALEQMYGATNKARINQLKGVLQNTKKGSTKMLDYLATMRQASENLKLVGAPVSLS-------------------------

Query:  ---------------------DLVSPISTEVSVDQTSNYAYNRQGNFSGGQQFQGQNNQRNKNRGNQGYQYSNYGPRNNNNGNRGRGRGTYGNQRGNNSQ
                             ++VS  + E   D + NY +++Q N  G +QF       ++++  QG    +Y   +  N  RGRGRG +   RGNNS+
Subjt:  ---------------------DLVSPISTEVSVDQTSNYAYNRQGNFSGGQQFQGQNNQRNKNRGNQGYQYSNYGPRNNNNGNRGRGRGTYGNQRGNNSQ

Query:  PTCQLYRKFGHAAPACYFRFEEDFNNPHGSSNKNNGENSAFIATPEVVCDPNWLADSGPTSHITANATNMNVKTDYNG
        P+CQL  K+GH A  CY RF+E+FNN    S+ NN  NSA++A PE+V +P+WLADSG T H+T++ +N+NVK+DYNG
Subjt:  PTCQLYRKFGHAAPACYFRFEEDFNNPHGSSNKNNGENSAFIATPEVVCDPNWLADSGPTSHITANATNMNVKTDYNG

A0A6J1DU77 uncharacterized protein LOC111024384 isoform X11.7e-8048.68Show/hide
Query:  SVAVTTPTAN-IISSSFGHPLSTVLTVKLDGKNYLLWRGMVLAILRGQKVDGYV-GTKARPSEFLES-SSEADKSELTPNPKFEEWTTVDQALSGWLFGS
        +VAV TP  +   ++SFGHPL TVLTVKLD KNY LWRGMVLA+LRGQK DGYV GT A+P +FL S  +E     L  NP++ EW  VDQAL GWLFGS
Subjt:  SVAVTTPTAN-IISSSFGHPLSTVLTVKLDGKNYLLWRGMVLAILRGQKVDGYV-GTKARPSEFLES-SSEADKSELTPNPKFEEWTTVDQALSGWLFGS

Query:  MTPVVAADVVNFTTSREVWKALEQMYGATNKARINQLKGVLQNTKKGSTKMLDYLATMRQASENLKLVGAPVSLS-------------------------
        MTP +A DVV+F +SREVWKALE +YGAT+KARINQL+ VLQNTKK S KM +YL  M+QASE+LKL G PV+ +                         
Subjt:  MTPVVAADVVNFTTSREVWKALEQMYGATNKARINQLKGVLQNTKKGSTKMLDYLATMRQASENLKLVGAPVSLS-------------------------

Query:  ---------------------DLVSPISTEVSVDQTSNYAYNRQGNFSGGQQFQGQNNQRNKNRGNQGYQYSNYGPRNNNNGNRGRGRGTYGNQRGNNSQ
                             ++VS  + E   D + NY +++Q N  G +QF       ++++  QG    +Y   +  N  RGRGRG +   RGNNS+
Subjt:  ---------------------DLVSPISTEVSVDQTSNYAYNRQGNFSGGQQFQGQNNQRNKNRGNQGYQYSNYGPRNNNNGNRGRGRGTYGNQRGNNSQ

Query:  PTCQLYRKFGHAAPACYFRFEEDFNNPHGSSNKNNGENSAFIATPEVVCDPNWLADSGPTSHITANATNMNVKTDYNG
        P+CQL  K+GH A  CY RF+E+FNN    S+ NN  NSA++A PE+V +P+WLADSG T H+T++ +N+NVK+DYNG
Subjt:  PTCQLYRKFGHAAPACYFRFEEDFNNPHGSSNKNNGENSAFIATPEVVCDPNWLADSGPTSHITANATNMNVKTDYNG

A0A803PAZ1 Uncharacterized protein1.7e-4532Show/hide
Query:  SDEIKTPESRITAASVAVTTPTANIISSSFGHPLSTVLTVKLDGKNYLLWRGMVLAILRGQKVDGYV-GTKARPSEFLESSSEADKS---ELTPNPKFEE
        ++ + +P+    A+   V   +    + SFG+ L+   ++KLD  NY LWR +V  I+RG +++GYV GTK  P+EF+ +    + +    L  NP++E 
Subjt:  SDEIKTPESRITAASVAVTTPTANIISSSFGHPLSTVLTVKLDGKNYLLWRGMVLAILRGQKVDGYV-GTKARPSEFLESSSEADKS---ELTPNPKFEE

Query:  WTTVDQALSGWLFGSMTPVVAADVVNFTTSREVWKALEQMYGATNKARINQLKGVLQNTKKGSTKMLDYLATMRQASENLKLVGAPVSLSDLVSPISTEV
        W   DQ L GWL+GSMT  +A  V+  T++R +W ALE +YGA ++A+++  +  +Q T+KG T M DYL   R  +++L L G P   + L++ + + +
Subjt:  WTTVDQALSGWLFGSMTPVVAADVVNFTTSREVWKALEQMYGATNKARINQLKGVLQNTKKGSTKMLDYLATMRQASENLKLVGAPVSLSDLVSPISTEV

Query:  SVDQTSNY----------------------------------AYNRQGNFSGGQQFQGQNNQRNKN------RGNQGYQYSNYGPRNNNNGNRG---RGR
          +  +                                    A  + GN SG        N+++ N      RGN  +  +N+G  +  N NRG    G 
Subjt:  SVDQTSNY----------------------------------AYNRQGNFSGGQQFQGQNNQRNKN------RGNQGYQYSNYGPRNNNNGNRG---RGR

Query:  GTYGNQRGNNSQPTCQLYRKFGHAAPACYFRFEEDFNNPHGSSNKNNGEN----SAFIATPEVVCDPNWLADSGPTSHITANATNMNVKTDYNGNDSLTL
           G  RG N++PTCQ+  ++GH+A  CY R++E F     ++N N+G+     +A+IATPE+V    W ADSG ++H+T++  NM  K  Y+G D LT+
Subjt:  GTYGNQRGNNSQPTCQLYRKFGHAAPACYFRFEEDFNNPHGSSNKNNGEN----SAFIATPEVVCDPNWLADSGPTSHITANATNMNVKTDYNGNDSLTL

A0A803QD97 Uncharacterized protein1.5e-4435.97Show/hide
Query:  TANIISSSFGHPLSTVLTVKLDGKNYLLWRGMVLAILRGQKVDGYV-GTKARPSEFLESSSEADKSELTP--NPKFEEWTTVDQALSGWLFGSMTPVVAA
        ++N+    FG  L+    +KLD  N+ LW+ MV AI RG ++DGY+ G +  P E+L +     +  + P  NP+FE W   DQ L GWL+GSMT  +A 
Subjt:  TANIISSSFGHPLSTVLTVKLDGKNYLLWRGMVLAILRGQKVDGYV-GTKARPSEFLESSSEADKSELTP--NPKFEEWTTVDQALSGWLFGSMTPVVAA

Query:  DVVNFTTSREVWKALEQMYGATNKARINQLKGVLQNTKKGSTKMLDYLATMRQASENLKLVGAPVSLSDLVS-----------PISTEVSVDQTSNY---
        +++  ++S E+W +LE ++GA +KA++++ +  +Q  +KGS  M+DYL   +Q S+ L L G P   S LVS           PI  ++   + + +   
Subjt:  DVVNFTTSREVWKALEQMYGATNKARINQLKGVLQNTKKGSTKMLDYLATMRQASENLKLVGAPVSLSDLVS-----------PISTEVSVDQTSNY---

Query:  ---------AYNRQGNFSGGQQFQGQNNQRNKNRGNQGYQYSNYGPRNNNNGNRG----RGRGTYGNQRGNNS----QPTCQLYRKFGHAAPACYFRFEE
                   +R  + S   +    N     N  N+     NY P NNNN  RG      RG + N RG +     +PTCQ+  ++GH+A  CY RF+E
Subjt:  ---------AYNRQGNFSGGQQFQGQNNQRNKNRGNQGYQYSNYGPRNNNNGNRG----RGRGTYGNQRGNNS----QPTCQLYRKFGHAAPACYFRFEE

Query:  DF--NNPHGSSNKNN--GEN-SAFIATPEVVCDPNWLADSGPTSHITANATNMNVKTDYNGNDSLTL
         F    P G++  NN  G+N +AF+ATPE++ D  W A+SG ++H+T+ A N+N KT YNG DSLT+
Subjt:  DF--NNPHGSSNKNN--GEN-SAFIATPEVVCDPNWLADSGPTSHITANATNMNVKTDYNGNDSLTL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTGATGAAATCAAAACACCTGAGAGCAGAATCACTGCTGCCTCTGTTGCCGTGACTACCCCTACTGCGAACATCATTAGTTCTTCATTCGGTCACCCCCTCAGCAC
AGTTCTCACGGTAAAGCTAGATGGGAAAAATTACTTGCTATGGAGGGGAATGGTGCTCGCCATACTCAGGGGCCAGAAGGTAGACGGATATGTTGGAACAAAGGCTCGAC
CATCCGAGTTTCTTGAATCGAGCAGTGAAGCTGACAAATCGGAGCTCACTCCAAATCCTAAGTTCGAAGAGTGGACTACAGTTGACCAAGCCCTCTCTGGTTGGCTTTTC
GGATCTATGACTCCAGTCGTGGCAGCTGATGTTGTTAATTTCACAACCTCAAGAGAGGTATGGAAGGCACTTGAACAAATGTATGGAGCGACCAACAAGGCAAGGATCAA
TCAACTTAAAGGTGTTTTACAGAACACCAAAAAAGGCTCAACAAAAATGCTGGATTACCTGGCGACGATGAGACAAGCATCAGAGAACCTCAAGCTAGTCGGAGCCCCTG
TTTCTTTATCTGATTTGGTGTCCCCTATTTCTACTGAAGTGAGTGTTGATCAGACATCCAACTATGCCTATAATCGGCAAGGGAACTTCTCTGGAGGACAGCAATTTCAG
GGGCAAAATAACCAGAGAAATAAGAACCGTGGAAATCAAGGCTATCAATATTCAAATTATGGACCACGAAACAACAACAATGGCAATCGAGGAAGAGGTCGAGGTACATA
TGGAAATCAAAGAGGTAACAACTCTCAACCAACCTGTCAGTTATACAGAAAATTTGGACATGCTGCTCCTGCGTGCTATTTTCGGTTTGAGGAGGATTTCAATAATCCAC
ATGGCTCGAGTAACAAAAACAATGGTGAGAACTCAGCCTTTATCGCAACACCTGAAGTCGTTTGTGATCCAAACTGGTTGGCAGACAGTGGTCCTACTAGCCACATTACT
GCAAATGCTACGAACATGAATGTGAAGACTGATTACAATGGTAATGATTCTCTTACTCTCAAAATTCCTCTTCAAAGGAAATCCATCAATCCGAAAGAGTCTCCCTTCAA
TCCTAAGAGTCCTCATCAAGCACAATGTGTGTCCTCATCAAGCACTAAGTCTGAGTCATGTTTTACTGCCTTTAAGAGTCAAAATAAATCTAAGCCTTGTAAGTCCACAT
CTTCGACCAATGAGACTGTTGTCAATTGTCTTCCATTTCCATCTTCTAATTCAAGCCCACTGCCAACACCTCCCAATCCTATACCTCTTGTCCAAGCCTCTCCACTGCCA
TCTCCTGGGTTTTGTTCAACACAACCCTCTCCTTTGAGTTATCCTTCAGGTGATGTTCCTTCCTCTTCTGGTGGTGCTCTTATATTACCGCTAGGGGTGTTCGTCGGTCG
GTCGGAGTCGGTTTTGGGCTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCTGATGAAATCAAAACACCTGAGAGCAGAATCACTGCTGCCTCTGTTGCCGTGACTACCCCTACTGCGAACATCATTAGTTCTTCATTCGGTCACCCCCTCAGCAC
AGTTCTCACGGTAAAGCTAGATGGGAAAAATTACTTGCTATGGAGGGGAATGGTGCTCGCCATACTCAGGGGCCAGAAGGTAGACGGATATGTTGGAACAAAGGCTCGAC
CATCCGAGTTTCTTGAATCGAGCAGTGAAGCTGACAAATCGGAGCTCACTCCAAATCCTAAGTTCGAAGAGTGGACTACAGTTGACCAAGCCCTCTCTGGTTGGCTTTTC
GGATCTATGACTCCAGTCGTGGCAGCTGATGTTGTTAATTTCACAACCTCAAGAGAGGTATGGAAGGCACTTGAACAAATGTATGGAGCGACCAACAAGGCAAGGATCAA
TCAACTTAAAGGTGTTTTACAGAACACCAAAAAAGGCTCAACAAAAATGCTGGATTACCTGGCGACGATGAGACAAGCATCAGAGAACCTCAAGCTAGTCGGAGCCCCTG
TTTCTTTATCTGATTTGGTGTCCCCTATTTCTACTGAAGTGAGTGTTGATCAGACATCCAACTATGCCTATAATCGGCAAGGGAACTTCTCTGGAGGACAGCAATTTCAG
GGGCAAAATAACCAGAGAAATAAGAACCGTGGAAATCAAGGCTATCAATATTCAAATTATGGACCACGAAACAACAACAATGGCAATCGAGGAAGAGGTCGAGGTACATA
TGGAAATCAAAGAGGTAACAACTCTCAACCAACCTGTCAGTTATACAGAAAATTTGGACATGCTGCTCCTGCGTGCTATTTTCGGTTTGAGGAGGATTTCAATAATCCAC
ATGGCTCGAGTAACAAAAACAATGGTGAGAACTCAGCCTTTATCGCAACACCTGAAGTCGTTTGTGATCCAAACTGGTTGGCAGACAGTGGTCCTACTAGCCACATTACT
GCAAATGCTACGAACATGAATGTGAAGACTGATTACAATGGTAATGATTCTCTTACTCTCAAAATTCCTCTTCAAAGGAAATCCATCAATCCGAAAGAGTCTCCCTTCAA
TCCTAAGAGTCCTCATCAAGCACAATGTGTGTCCTCATCAAGCACTAAGTCTGAGTCATGTTTTACTGCCTTTAAGAGTCAAAATAAATCTAAGCCTTGTAAGTCCACAT
CTTCGACCAATGAGACTGTTGTCAATTGTCTTCCATTTCCATCTTCTAATTCAAGCCCACTGCCAACACCTCCCAATCCTATACCTCTTGTCCAAGCCTCTCCACTGCCA
TCTCCTGGGTTTTGTTCAACACAACCCTCTCCTTTGAGTTATCCTTCAGGTGATGTTCCTTCCTCTTCTGGTGGTGCTCTTATATTACCGCTAGGGGTGTTCGTCGGTCG
GTCGGAGTCGGTTTTGGGCTAA
Protein sequenceShow/hide protein sequence
MSDEIKTPESRITAASVAVTTPTANIISSSFGHPLSTVLTVKLDGKNYLLWRGMVLAILRGQKVDGYVGTKARPSEFLESSSEADKSELTPNPKFEEWTTVDQALSGWLF
GSMTPVVAADVVNFTTSREVWKALEQMYGATNKARINQLKGVLQNTKKGSTKMLDYLATMRQASENLKLVGAPVSLSDLVSPISTEVSVDQTSNYAYNRQGNFSGGQQFQ
GQNNQRNKNRGNQGYQYSNYGPRNNNNGNRGRGRGTYGNQRGNNSQPTCQLYRKFGHAAPACYFRFEEDFNNPHGSSNKNNGENSAFIATPEVVCDPNWLADSGPTSHIT
ANATNMNVKTDYNGNDSLTLKIPLQRKSINPKESPFNPKSPHQAQCVSSSSTKSESCFTAFKSQNKSKPCKSTSSTNETVVNCLPFPSSNSSPLPTPPNPIPLVQASPLP
SPGFCSTQPSPLSYPSGDVPSSSGGALILPLGVFVGRSESVLG