; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0007094 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0007094
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionProtein Ycf2-like
Genome locationchr6:48637912..48639963
RNA-Seq ExpressionLag0007094
SyntenyLag0007094
Gene Ontology termsNA
InterPro domainsIPR015410 - Domain of unknown function DUF1985


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EXC30509.1 hypothetical protein L484_010758 [Morus notabilis]2.6e-4340.26Show/hide
Query:  EPSKKNRRVDRKDDDYFMPLSKRSKALKINLCTKADIMDHIRRTLGDKCHDAFKNTCFGHLLDFSFRKTSSQLLLHLIQHQCKPKRASELYFKIGGKILK
        +P      VD +     +P  + +   KINL +KA ++D +   L  +  + F+  CFGHLLDF  +K  SQL+ HLI  QC   + +EL+F I G I+K
Subjt:  EPSKKNRRVDRKDDDYFMPLSKRSKALKINLCTKADIMDHIRRTLGDKCHDAFKNTCFGHLLDFSFRKTSSQLLLHLIQHQCKPKRASELYFKIGGKILK

Query:  FGLREFALITGLNCAPFPQLDKDKLQECSRFKAKYFDEGKGVKRKTLDIIFKAIKHGVESDLVKMAQLYCLESFLLPRQEKVFIEDDHLLMVEDEEMFNA
        FG++EFALITGLNC+ +P + + +L E S  K K+F +GK V+R  L+ +F+A + G + D+VK+A+LYCLES L+P++ +  I+ +HL MV++ E+F+ 
Subjt:  FGLREFALITGLNCAPFPQLDKDKLQECSRFKAKYFDEGKGVKRKTLDIIFKAIKHGVESDLVKMAQLYCLESFLLPRQEKVFIEDDHLLMVEDEEMFNA

Query:  YPWGQVAFGLLTNYMHKAWLSRESCGIGMGG
        YPWG++++ +   Y+ ++  S+E+   G+GG
Subjt:  YPWGQVAFGLLTNYMHKAWLSRESCGIGMGG

KAA0047596.1 protein Ycf2-like [Cucumis melo var. makuwa]2.4e-4936.91Show/hide
Query:  VSKRSSQRLKAAGVTPGKKHPPQTSPITLDSAQESEGEE--AEMSTVDSVPKDVQPKGVKRE--REEGGSGKKKGLVKNQSPRRLPRRLTKTSRNRASNI
        +  R+S RL+AAG+T  +K  P T    L S+ E   E+  AE S           K V+ E  +EE    KK+ +    S +R+ R   K  +     I
Subjt:  VSKRSSQRLKAAGVTPGKKHPPQTSPITLDSAQESEGEE--AEMSTVDSVPKDVQPKGVKRE--REEGGSGKKKGLVKNQSPRRLPRRLTKTSRNRASNI

Query:  LYLRPWSCETQEETDTSDGTDEDHSENDDFTTSDSGDEEPSKKNRRVDRK---------DDDYFMPLSKRSKALKINLCTKADIMDHIRRTLGDKCHDAF
              +  T E     D      +            EE ++++RR   K         D  Y M   +R++ LKINL  K+ +++ I+  LGD+  + F
Subjt:  LYLRPWSCETQEETDTSDGTDEDHSENDDFTTSDSGDEEPSKKNRRVDRK---------DDDYFMPLSKRSKALKINLCTKADIMDHIRRTLGDKCHDAF

Query:  KNTCFGHLLDFSFRKTSSQLLLHLIQHQCKPKRASELYFKIGGKILKFGLREFALITGLNCAPFPQLDKDKLQECSRFKAKYFDEGKGVKRKTLDIIFKA
        +   FGH L+ S    SSQLLLHLIQ  CKPK  S+L F IGG++L+FGLREFALITGL C   P ++ + +    R K  YF+  K V R+ L+++F  
Subjt:  KNTCFGHLLDFSFRKTSSQLLLHLIQHQCKPKRASELYFKIGGKILKFGLREFALITGLNCAPFPQLDKDKLQECSRFKAKYFDEGKGVKRKTLDIIFKA

Query:  IKHGVESDLVKMAQLYCLESFLLPRQEKVFIEDDHLLMVEDEEMFNAYPWGQVAFGLLTNYMHKAWLSRESCGIGMGGLCMP
           G + D +KMA+LY LESFL+P+QE   ++ DH++MV+D+E+F+ YPWG+VAF LL ++M++   S+   GI MGG   P
Subjt:  IKHGVESDLVKMAQLYCLESFLLPRQEKVFIEDDHLLMVEDEEMFNAYPWGQVAFGLLTNYMHKAWLSRESCGIGMGGLCMP

TYK12922.1 uncharacterized protein E5676_scaffold255G005170 [Cucumis melo var. makuwa]4.9e-4239.39Show/hide
Query:  PSKKNRRVD-RKDDDYFMPLSKRSKALKINLCTKADIMDHIRRTLGDKCHDAFKNTCFGHLLDFSFRKTSSQLLLHLIQHQCKPKRASELYFKIGGKILK
        PS+ +  VD  K+    +P S  S   +INL +K D++  I+ TL ++    FK +CFG+ LD    K SSQL  HLI+ QC  K   EL+F + G+I K
Subjt:  PSKKNRRVD-RKDDDYFMPLSKRSKALKINLCTKADIMDHIRRTLGDKCHDAFKNTCFGHLLDFSFRKTSSQLLLHLIQHQCKPKRASELYFKIGGKILK

Query:  FGLREFALITGLNCAPFPQLDKDKLQECSRFKAKYFDEGKGVKRKTLDIIFKAIKHGVESDLVKMAQLYCLESFLLPRQEKVFIEDDHLLMVEDEEMFNA
        FG+++FALITGLNC   P +D  K+Q+  +F  +YF   K ++R  L  +F  +  G   D+VKMA+LY LE F+L +Q +  I  ++ L+++D+E F++
Subjt:  FGLREFALITGLNCAPFPQLDKDKLQECSRFKAKYFDEGKGVKRKTLDIIFKAIKHGVESDLVKMAQLYCLESFLLPRQEKVFIEDDHLLMVEDEEMFNA

Query:  YPWGQVAFGLLTNYMHKAWLSRESCGIGMGG
        YPWG++++ +  +++ KA  S ++  IG+GG
Subjt:  YPWGQVAFGLLTNYMHKAWLSRESCGIGMGG

XP_024031030.1 uncharacterized protein LOC21394043 [Morus notabilis]2.6e-4340.26Show/hide
Query:  EPSKKNRRVDRKDDDYFMPLSKRSKALKINLCTKADIMDHIRRTLGDKCHDAFKNTCFGHLLDFSFRKTSSQLLLHLIQHQCKPKRASELYFKIGGKILK
        +P      VD +     +P  + +   KINL +KA ++D +   L  +  + F+  CFGHLLDF  +K  SQL+ HLI  QC   + +EL+F I G I+K
Subjt:  EPSKKNRRVDRKDDDYFMPLSKRSKALKINLCTKADIMDHIRRTLGDKCHDAFKNTCFGHLLDFSFRKTSSQLLLHLIQHQCKPKRASELYFKIGGKILK

Query:  FGLREFALITGLNCAPFPQLDKDKLQECSRFKAKYFDEGKGVKRKTLDIIFKAIKHGVESDLVKMAQLYCLESFLLPRQEKVFIEDDHLLMVEDEEMFNA
        FG++EFALITGLNC+ +P + + +L E S  K K+F +GK V+R  L+ +F+A + G + D+VK+A+LYCLES L+P++ +  I+ +HL MV++ E+F+ 
Subjt:  FGLREFALITGLNCAPFPQLDKDKLQECSRFKAKYFDEGKGVKRKTLDIIFKAIKHGVESDLVKMAQLYCLESFLLPRQEKVFIEDDHLLMVEDEEMFNA

Query:  YPWGQVAFGLLTNYMHKAWLSRESCGIGMGG
        YPWG++++ +   Y+ ++  S+E+   G+GG
Subjt:  YPWGQVAFGLLTNYMHKAWLSRESCGIGMGG

XP_038883715.1 uncharacterized protein LOC120074618 isoform X1 [Benincasa hispida]4.9e-4239.39Show/hide
Query:  PSKKNRRVD-RKDDDYFMPLSKRSKALKINLCTKADIMDHIRRTLGDKCHDAFKNTCFGHLLDFSFRKTSSQLLLHLIQHQCKPKRASELYFKIGGKILK
        PS+ + +VD  K+    +P S  S   +INL +K D++  I+ TL ++    FK +CFG  LD    K SSQL  HL++ QC     +EL+F + G+I K
Subjt:  PSKKNRRVD-RKDDDYFMPLSKRSKALKINLCTKADIMDHIRRTLGDKCHDAFKNTCFGHLLDFSFRKTSSQLLLHLIQHQCKPKRASELYFKIGGKILK

Query:  FGLREFALITGLNCAPFPQLDKDKLQECSRFKAKYFDEGKGVKRKTLDIIFKAIKHGVESDLVKMAQLYCLESFLLPRQEKVFIEDDHLLMVEDEEMFNA
        FG++EF+LITGLNC   P++D  K+Q+  +F  +YF   K +KR  L  +F  +  G   D+VKMA+LY LE F+L +Q +  I  ++ L+V+D+E F+ 
Subjt:  FGLREFALITGLNCAPFPQLDKDKLQECSRFKAKYFDEGKGVKRKTLDIIFKAIKHGVESDLVKMAQLYCLESFLLPRQEKVFIEDDHLLMVEDEEMFNA

Query:  YPWGQVAFGLLTNYMHKAWLSRESCGIGMGG
        YPWG++++ +  +++ KA  S ++  IG+GG
Subjt:  YPWGQVAFGLLTNYMHKAWLSRESCGIGMGG

TrEMBL top hitse value%identityAlignment
A0A1S3B0L9 uncharacterized protein LOC103484737 isoform X52.4e-4239.39Show/hide
Query:  PSKKNRRVD-RKDDDYFMPLSKRSKALKINLCTKADIMDHIRRTLGDKCHDAFKNTCFGHLLDFSFRKTSSQLLLHLIQHQCKPKRASELYFKIGGKILK
        PS+ +  VD  K+    +P S  S   +INL +K D++  I+ TL ++    FK +CFG+ LD    K SSQL  HLI+ QC  K   EL+F + G+I K
Subjt:  PSKKNRRVD-RKDDDYFMPLSKRSKALKINLCTKADIMDHIRRTLGDKCHDAFKNTCFGHLLDFSFRKTSSQLLLHLIQHQCKPKRASELYFKIGGKILK

Query:  FGLREFALITGLNCAPFPQLDKDKLQECSRFKAKYFDEGKGVKRKTLDIIFKAIKHGVESDLVKMAQLYCLESFLLPRQEKVFIEDDHLLMVEDEEMFNA
        FG+++FALITGLNC   P +D  K+Q+  +F  +YF   K ++R  L  +F  +  G   D+VKMA+LY LE F+L +Q +  I  ++ L+++D+E F++
Subjt:  FGLREFALITGLNCAPFPQLDKDKLQECSRFKAKYFDEGKGVKRKTLDIIFKAIKHGVESDLVKMAQLYCLESFLLPRQEKVFIEDDHLLMVEDEEMFNA

Query:  YPWGQVAFGLLTNYMHKAWLSRESCGIGMGG
        YPWG++++ +  +++ KA  S ++  IG+GG
Subjt:  YPWGQVAFGLLTNYMHKAWLSRESCGIGMGG

A0A1S3B181 uncharacterized protein LOC103484737 isoform X72.4e-4239.39Show/hide
Query:  PSKKNRRVD-RKDDDYFMPLSKRSKALKINLCTKADIMDHIRRTLGDKCHDAFKNTCFGHLLDFSFRKTSSQLLLHLIQHQCKPKRASELYFKIGGKILK
        PS+ +  VD  K+    +P S  S   +INL +K D++  I+ TL ++    FK +CFG+ LD    K SSQL  HLI+ QC  K   EL+F + G+I K
Subjt:  PSKKNRRVD-RKDDDYFMPLSKRSKALKINLCTKADIMDHIRRTLGDKCHDAFKNTCFGHLLDFSFRKTSSQLLLHLIQHQCKPKRASELYFKIGGKILK

Query:  FGLREFALITGLNCAPFPQLDKDKLQECSRFKAKYFDEGKGVKRKTLDIIFKAIKHGVESDLVKMAQLYCLESFLLPRQEKVFIEDDHLLMVEDEEMFNA
        FG+++FALITGLNC   P +D  K+Q+  +F  +YF   K ++R  L  +F  +  G   D+VKMA+LY LE F+L +Q +  I  ++ L+++D+E F++
Subjt:  FGLREFALITGLNCAPFPQLDKDKLQECSRFKAKYFDEGKGVKRKTLDIIFKAIKHGVESDLVKMAQLYCLESFLLPRQEKVFIEDDHLLMVEDEEMFNA

Query:  YPWGQVAFGLLTNYMHKAWLSRESCGIGMGG
        YPWG++++ +  +++ KA  S ++  IG+GG
Subjt:  YPWGQVAFGLLTNYMHKAWLSRESCGIGMGG

A0A5A7U047 Protein Ycf2-like1.2e-4936.91Show/hide
Query:  VSKRSSQRLKAAGVTPGKKHPPQTSPITLDSAQESEGEE--AEMSTVDSVPKDVQPKGVKRE--REEGGSGKKKGLVKNQSPRRLPRRLTKTSRNRASNI
        +  R+S RL+AAG+T  +K  P T    L S+ E   E+  AE S           K V+ E  +EE    KK+ +    S +R+ R   K  +     I
Subjt:  VSKRSSQRLKAAGVTPGKKHPPQTSPITLDSAQESEGEE--AEMSTVDSVPKDVQPKGVKRE--REEGGSGKKKGLVKNQSPRRLPRRLTKTSRNRASNI

Query:  LYLRPWSCETQEETDTSDGTDEDHSENDDFTTSDSGDEEPSKKNRRVDRK---------DDDYFMPLSKRSKALKINLCTKADIMDHIRRTLGDKCHDAF
              +  T E     D      +            EE ++++RR   K         D  Y M   +R++ LKINL  K+ +++ I+  LGD+  + F
Subjt:  LYLRPWSCETQEETDTSDGTDEDHSENDDFTTSDSGDEEPSKKNRRVDRK---------DDDYFMPLSKRSKALKINLCTKADIMDHIRRTLGDKCHDAF

Query:  KNTCFGHLLDFSFRKTSSQLLLHLIQHQCKPKRASELYFKIGGKILKFGLREFALITGLNCAPFPQLDKDKLQECSRFKAKYFDEGKGVKRKTLDIIFKA
        +   FGH L+ S    SSQLLLHLIQ  CKPK  S+L F IGG++L+FGLREFALITGL C   P ++ + +    R K  YF+  K V R+ L+++F  
Subjt:  KNTCFGHLLDFSFRKTSSQLLLHLIQHQCKPKRASELYFKIGGKILKFGLREFALITGLNCAPFPQLDKDKLQECSRFKAKYFDEGKGVKRKTLDIIFKA

Query:  IKHGVESDLVKMAQLYCLESFLLPRQEKVFIEDDHLLMVEDEEMFNAYPWGQVAFGLLTNYMHKAWLSRESCGIGMGGLCMP
           G + D +KMA+LY LESFL+P+QE   ++ DH++MV+D+E+F+ YPWG+VAF LL ++M++   S+   GI MGG   P
Subjt:  IKHGVESDLVKMAQLYCLESFLLPRQEKVFIEDDHLLMVEDEEMFNAYPWGQVAFGLLTNYMHKAWLSRESCGIGMGGLCMP

A0A5D3CNI7 TF-B3 domain-containing protein2.4e-4239.39Show/hide
Query:  PSKKNRRVD-RKDDDYFMPLSKRSKALKINLCTKADIMDHIRRTLGDKCHDAFKNTCFGHLLDFSFRKTSSQLLLHLIQHQCKPKRASELYFKIGGKILK
        PS+ +  VD  K+    +P S  S   +INL +K D++  I+ TL ++    FK +CFG+ LD    K SSQL  HLI+ QC  K   EL+F + G+I K
Subjt:  PSKKNRRVD-RKDDDYFMPLSKRSKALKINLCTKADIMDHIRRTLGDKCHDAFKNTCFGHLLDFSFRKTSSQLLLHLIQHQCKPKRASELYFKIGGKILK

Query:  FGLREFALITGLNCAPFPQLDKDKLQECSRFKAKYFDEGKGVKRKTLDIIFKAIKHGVESDLVKMAQLYCLESFLLPRQEKVFIEDDHLLMVEDEEMFNA
        FG+++FALITGLNC   P +D  K+Q+  +F  +YF   K ++R  L  +F  +  G   D+VKMA+LY LE F+L +Q +  I  ++ L+++D+E F++
Subjt:  FGLREFALITGLNCAPFPQLDKDKLQECSRFKAKYFDEGKGVKRKTLDIIFKAIKHGVESDLVKMAQLYCLESFLLPRQEKVFIEDDHLLMVEDEEMFNA

Query:  YPWGQVAFGLLTNYMHKAWLSRESCGIGMGG
        YPWG++++ +  +++ KA  S ++  IG+GG
Subjt:  YPWGQVAFGLLTNYMHKAWLSRESCGIGMGG

W9SF50 DUF1985 domain-containing protein1.3e-4340.26Show/hide
Query:  EPSKKNRRVDRKDDDYFMPLSKRSKALKINLCTKADIMDHIRRTLGDKCHDAFKNTCFGHLLDFSFRKTSSQLLLHLIQHQCKPKRASELYFKIGGKILK
        +P      VD +     +P  + +   KINL +KA ++D +   L  +  + F+  CFGHLLDF  +K  SQL+ HLI  QC   + +EL+F I G I+K
Subjt:  EPSKKNRRVDRKDDDYFMPLSKRSKALKINLCTKADIMDHIRRTLGDKCHDAFKNTCFGHLLDFSFRKTSSQLLLHLIQHQCKPKRASELYFKIGGKILK

Query:  FGLREFALITGLNCAPFPQLDKDKLQECSRFKAKYFDEGKGVKRKTLDIIFKAIKHGVESDLVKMAQLYCLESFLLPRQEKVFIEDDHLLMVEDEEMFNA
        FG++EFALITGLNC+ +P + + +L E S  K K+F +GK V+R  L+ +F+A + G + D+VK+A+LYCLES L+P++ +  I+ +HL MV++ E+F+ 
Subjt:  FGLREFALITGLNCAPFPQLDKDKLQECSRFKAKYFDEGKGVKRKTLDIIFKAIKHGVESDLVKMAQLYCLESFLLPRQEKVFIEDDHLLMVEDEEMFNA

Query:  YPWGQVAFGLLTNYMHKAWLSRESCGIGMGG
        YPWG++++ +   Y+ ++  S+E+   G+GG
Subjt:  YPWGQVAFGLLTNYMHKAWLSRESCGIGMGG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G31150.1 Domain of unknown function (DUF1985)8.7e-1327.13Show/hide
Query:  KINLCTKADIMDHIRRTL-GDKCHDAFKNTCFGHLLDFSFRKTS-SQLLLH-LIQHQCKPKRASELYFKIGGKILKFGLREFALITGLNCAPFPQLDKDK
        ++N+ ++ + +  I   L G +  +  K++ FG L +F   + S S  L+H L+  Q   K+  EL+F  GG  ++F +REF ++TGL C   P  D+ K
Subjt:  KINLCTKADIMDHIRRTL-GDKCHDAFKNTCFGHLLDFSFRKTS-SQLLLH-LIQHQCKPKRASELYFKIGGKILKFGLREFALITGLNCAPFPQLDKDK

Query:  LQECSRFKA---KYFDEGKGVKRKTLDIIFKAIKHGVESDLVKMA-QLYCLESFLLPRQEKVFIEDDHLLMVEDEEMFNAYPWGQVAF
          + S++ +   + F E + V   T+  + + ++    S   K+   L  +   ++   ++ F+  D + M+ D + F  YPWG+ AF
Subjt:  LQECSRFKA---KYFDEGKGVKRKTLDIIFKAIKHGVESDLVKMA-QLYCLESFLLPRQEKVFIEDDHLLMVEDEEMFNAYPWGQVAF

AT2G06420.1 Domain of unknown function (DUF1985)5.7e-0426.06Show/hide
Query:  KRASELYFKIGGKILKFGLREFALITGLNCAPFPQLDKD-KLQECSRFKAKYFDEGKGVKRKTLDI---IFKAIKHGVES-DLVKMAQLYCLESFLL---
        KR  E +F + G  +++G+ E ALI+G NC  +  +    K++E   FK K+F   K +  K  D+   +   +  G  S + ++M  LY L + ++   
Subjt:  KRASELYFKIGGKILKFGLREFALITGLNCAPFPQLDKD-KLQECSRFKAKYFDEGKGVKRKTLDI---IFKAIKHGVES-DLVKMAQLYCLESFLL---

Query:  -PRQEKVFIEDDHLLMVEDEEMFNAYPWGQVAFG-LLTNYMH
            +   +++  L  V D      + WG+ +F  +L N  H
Subjt:  -PRQEKVFIEDDHLLMVEDEEMFNAYPWGQVAFG-LLTNYMH

AT5G45570.1 Ulp1 protease family protein1.3e-0525.51Show/hide
Query:  PLSKRSKALKINLCTKADIMDHIRRTLGDKCHDAFKNTCFGHLLDF--SFRKTSSQLLLHLIQHQCKPKRASELYFKIGGKILKFGLREFALITGLNCAP
        PL KRS A   + C  + I   I+  LG    D  K T  G  + F  S    ++Q +   + +Q +     E++  I  + ++F L EF  ITGLNC  
Subjt:  PLSKRSKALKINLCTKADIMDHIRRTLGDKCHDAFKNTCFGHLLDF--SFRKTSSQLLLHLIQHQCKPKRASELYFKIGGKILKFGLREFALITGLNCAP

Query:  FPQLDKDKLQECSRFKAKYFDEGKGVKRKTLDIIFKAIKHGVESDLVKMAQLYCLESFLLPRQEKVFIEDDHLLMVEDEEMFNAYPWGQVAFGLLT
        F + D  +      +         G     L+ +F+  K       + + +L  L   +        +       V D   F  YPWG+VAF  L+
Subjt:  FPQLDKDKLQECSRFKAKYFDEGKGVKRKTLDIIFKAIKHGVESDLVKMAQLYCLESFLLPRQEKVFIEDDHLLMVEDEEMFNAYPWGQVAFGLLT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACATCCGATAAAAATGTATCGAAGCGATCGAGCCAAAGGTTAAAAGCGGCTGGAGTGACGCCCGGAAAGAAACACCCCCCACAGACATCCCCCATCACGTTAGACAG
CGCCCAAGAGTCCGAGGGTGAGGAAGCTGAAATGAGCACTGTCGATTCGGTACCGAAGGATGTCCAGCCAAAGGGGGTCAAAAGGGAGAGAGAAGAAGGAGGATCAGGAA
AGAAGAAAGGTTTAGTAAAAAACCAAAGTCCAAGGAGATTGCCAAGAAGGCTGACGAAGACAAGTCGGAACAGGGCGAGTAACATACTGTATTTACGTCCGTGGTCGTGT
GAAACACAGGAGGAAACCGACACAAGCGATGGCACGGACGAAGACCATTCAGAAAACGACGATTTTACCACGAGCGATAGTGGAGACGAGGAACCGAGCAAGAAAAACCG
AAGAGTAGACAGAAAGGACGATGACTACTTCATGCCGCTCTCGAAACGCAGCAAAGCTCTGAAAATTAATCTCTGCACCAAAGCCGACATTATGGACCACATCCGCCGCA
CATTGGGTGATAAATGTCATGACGCATTCAAAAACACATGCTTCGGACACCTTCTCGACTTTTCATTCAGAAAAACGTCATCACAACTCCTCTTGCACTTGATTCAGCAT
CAATGCAAGCCAAAACGAGCATCGGAGCTATATTTCAAGATTGGAGGGAAAATACTCAAATTCGGACTCCGAGAGTTTGCGTTGATTACCGGACTAAATTGTGCTCCGTT
CCCACAACTCGACAAAGACAAGCTACAGGAGTGTTCAAGATTTAAGGCCAAGTATTTTGATGAAGGCAAGGGAGTAAAAAGGAAGACTCTCGACATCATATTCAAAGCAA
TAAAGCACGGTGTTGAGTCAGACCTAGTAAAGATGGCCCAACTATATTGTTTGGAGAGCTTCCTACTCCCCCGGCAAGAGAAGGTATTTATCGAAGATGACCACCTACTA
ATGGTCGAAGATGAGGAGATGTTTAATGCCTACCCATGGGGACAGGTCGCCTTCGGATTGCTAACAAATTACATGCACAAAGCATGGCTTAGCCGCGAAAGCTGTGGAAT
AGGAATGGGGGGTTTGTGTATGCCATCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGACATCCGATAAAAATGTATCGAAGCGATCGAGCCAAAGGTTAAAAGCGGCTGGAGTGACGCCCGGAAAGAAACACCCCCCACAGACATCCCCCATCACGTTAGACAG
CGCCCAAGAGTCCGAGGGTGAGGAAGCTGAAATGAGCACTGTCGATTCGGTACCGAAGGATGTCCAGCCAAAGGGGGTCAAAAGGGAGAGAGAAGAAGGAGGATCAGGAA
AGAAGAAAGGTTTAGTAAAAAACCAAAGTCCAAGGAGATTGCCAAGAAGGCTGACGAAGACAAGTCGGAACAGGGCGAGTAACATACTGTATTTACGTCCGTGGTCGTGT
GAAACACAGGAGGAAACCGACACAAGCGATGGCACGGACGAAGACCATTCAGAAAACGACGATTTTACCACGAGCGATAGTGGAGACGAGGAACCGAGCAAGAAAAACCG
AAGAGTAGACAGAAAGGACGATGACTACTTCATGCCGCTCTCGAAACGCAGCAAAGCTCTGAAAATTAATCTCTGCACCAAAGCCGACATTATGGACCACATCCGCCGCA
CATTGGGTGATAAATGTCATGACGCATTCAAAAACACATGCTTCGGACACCTTCTCGACTTTTCATTCAGAAAAACGTCATCACAACTCCTCTTGCACTTGATTCAGCAT
CAATGCAAGCCAAAACGAGCATCGGAGCTATATTTCAAGATTGGAGGGAAAATACTCAAATTCGGACTCCGAGAGTTTGCGTTGATTACCGGACTAAATTGTGCTCCGTT
CCCACAACTCGACAAAGACAAGCTACAGGAGTGTTCAAGATTTAAGGCCAAGTATTTTGATGAAGGCAAGGGAGTAAAAAGGAAGACTCTCGACATCATATTCAAAGCAA
TAAAGCACGGTGTTGAGTCAGACCTAGTAAAGATGGCCCAACTATATTGTTTGGAGAGCTTCCTACTCCCCCGGCAAGAGAAGGTATTTATCGAAGATGACCACCTACTA
ATGGTCGAAGATGAGGAGATGTTTAATGCCTACCCATGGGGACAGGTCGCCTTCGGATTGCTAACAAATTACATGCACAAAGCATGGCTTAGCCGCGAAAGCTGTGGAAT
AGGAATGGGGGGTTTGTGTATGCCATCCTAG
Protein sequenceShow/hide protein sequence
MTSDKNVSKRSSQRLKAAGVTPGKKHPPQTSPITLDSAQESEGEEAEMSTVDSVPKDVQPKGVKREREEGGSGKKKGLVKNQSPRRLPRRLTKTSRNRASNILYLRPWSC
ETQEETDTSDGTDEDHSENDDFTTSDSGDEEPSKKNRRVDRKDDDYFMPLSKRSKALKINLCTKADIMDHIRRTLGDKCHDAFKNTCFGHLLDFSFRKTSSQLLLHLIQH
QCKPKRASELYFKIGGKILKFGLREFALITGLNCAPFPQLDKDKLQECSRFKAKYFDEGKGVKRKTLDIIFKAIKHGVESDLVKMAQLYCLESFLLPRQEKVFIEDDHLL
MVEDEEMFNAYPWGQVAFGLLTNYMHKAWLSRESCGIGMGGLCMPS