; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0008485 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0008485
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr9:23233630..23241116
RNA-Seq ExpressionLag0008485
SyntenyLag0008485
Gene Ontology termsGO:0006561 - proline biosynthetic process (biological process)
GO:0005829 - cytosol (cellular component)
GO:0004349 - glutamate 5-kinase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_019178790.1 PREDICTED: uncharacterized protein LOC109173896 [Ipomoea nil]3.2e-4735.76Show/hide
Query:  GLLPMDRSMVDVASGGALFNKTPVEARQLISSMAENSQQFGTRGPPAVVQA---------------ATKARQM------QGKACGLCLMTTHTTDACPEI
        GLLP DRSMVD +SGG+L +KTP +ARQLIS+MAENSQQ+GTR   ++ +                 T  RQM         ACG+C    H TD CP +
Subjt:  GLLPMDRSMVDVASGGALFNKTPVEARQLISSMAENSQQFGTRGPPAVVQA---------------ATKARQM------QGKACGLCLMTTHTTDACPEI

Query:  QD-NGEVNAIGGFNGN-QRHYNPYNNIYNPGWRDHPNFSYENQQKPLMFKNTIQNLGNEITQLATQVSKMNNKSSGNLPTQPEVNPKANVNAVWSGVATP
        Q+ N E+NAIGGF G   R Y+P++N YNPGWRDHPN SY NQ     F+N                                                 
Subjt:  QD-NGEVNAIGGFNGN-QRHYNPYNNIYNPGWRDHPNFSYENQQKPLMFKNTIQNLGNEITQLATQVSKMNNKSSGNLPTQPEVNPKANVNAVWSGVATP

Query:  DCPPGFPVVSLGAFYRLRILTSPRNWLACPNPLLPACKSIIHQGNHLSSYHRANYALGSAPVLKETWECQLKFKEETRARMQQLGAQVSQLADTVRKLEA
                               +N+     P             H S    +N  +    ++K       +F+ ETRA +QQLG Q+SQLA +V KLEA
Subjt:  DCPPGFPVVSLGAFYRLRILTSPRNWLACPNPLLPACKSIIHQGNHLSSYHRANYALGSAPVLKETWECQLKFKEETRARMQQLGAQVSQLADTVRKLEA

Query:  KID-KLPIHPEIRRREEVCATTLRSGTVMSSSPQFPSPSAFEKNRETKKLEEKTSNLNFPKKRGVQFDPPIDFNSYVPKASFPSRLGLQPEPLKEKEEKD
        +   KL    ++  +E V A TLRSG  +    Q  SP + E+ +E K  EE  S  N   K   +      ++S +P   FPSRL    E   E+ E  
Subjt:  KID-KLPIHPEIRRREEVCATTLRSGTVMSSSPQFPSPSAFEKNRETKKLEEKTSNLNFPKKRGVQFDPPIDFNSYVPKASFPSRLGLQPEPLKEKEEKD

Query:  ILDPFKKVEVNIPLLEAVEQIPKVGKFLKKWCSRKGKPK
        IL+ F+KVE+NIPLLEA++QIPK  KFLK+ C+ K K K
Subjt:  ILDPFKKVEVNIPLLEAVEQIPKVGKFLKKWCSRKGKPK

XP_019180076.1 PREDICTED: uncharacterized protein LOC109175288 [Ipomoea nil]2.2e-4836.99Show/hide
Query:  GLLPMDRSMVDVASGGALFNKTPVEARQLISSMAENSQQFGTRGPPAVVQA---------------ATKARQM------QGKACGLCLMTTHTTDACPEI
        GLLP DRSMVD ASGG+L +KTP +ARQLIS+MAENSQQ+GTR   ++ +                 T  RQM         ACG+C    H TD CP +
Subjt:  GLLPMDRSMVDVASGGALFNKTPVEARQLISSMAENSQQFGTRGPPAVVQA---------------ATKARQM------QGKACGLCLMTTHTTDACPEI

Query:  QD-NGEVNAIGGFNGN-QRHYNPYNNIYNPGWRDHPNFSYENQQKPLMFKNTIQNLGNEITQLATQVSKMNNKSSGNLPTQPEVNPKANVNAVWSGVATP
        Q+ N E+NAIGGF G   R Y+P++N YNPGWRDHPN SY NQ     F+N                    N      P QP+                 
Subjt:  QD-NGEVNAIGGFNGN-QRHYNPYNNIYNPGWRDHPNFSYENQQKPLMFKNTIQNLGNEITQLATQVSKMNNKSSGNLPTQPEVNPKANVNAVWSGVATP

Query:  DCPPGFPVVSLGAFYRLRILTSPRNWLACPNPLLPACKSIIHQGNHLSSYHRANYALGSAPVLKETWECQLKFKEETRARMQQLGAQVSQLADTVRKLEA
                                                            +N  +    ++K       +F+ ETRA +QQLG QVSQLA +V KLEA
Subjt:  DCPPGFPVVSLGAFYRLRILTSPRNWLACPNPLLPACKSIIHQGNHLSSYHRANYALGSAPVLKETWECQLKFKEETRARMQQLGAQVSQLADTVRKLEA

Query:  KID-KLPIHPEIRRREEVCATTLRSGTVMSSSPQFPSPSAFEKNRETKKLEEKTSNLNFPKKRGVQFDPPI-DFNSYVPKASFPSRLGLQPEPLKEKEEK
        +   KLP   E+  +E V A TLRSG  +    Q  SP + E+ +E K  EE  S  N  +K  V  +     ++S +P   FPSRL       KE+ E 
Subjt:  KID-KLPIHPEIRRREEVCATTLRSGTVMSSSPQFPSPSAFEKNRETKKLEEKTSNLNFPKKRGVQFDPPI-DFNSYVPKASFPSRLGLQPEPLKEKEEK

Query:  DILDPFKKVEVNIPLLEAVEQIPKVGKFLKKWCSRKGK
         IL+ F+KVEVNIPLLEA++QIPK  KFLK+ C+ K K
Subjt:  DILDPFKKVEVNIPLLEAVEQIPKVGKFLKKWCSRKGK

XP_019188859.1 PREDICTED: uncharacterized protein LOC109183094 [Ipomoea nil]3.5e-4636.14Show/hide
Query:  GLLPMDRSMVDVASGGALFNKTPVEARQLISSMAENSQQFGTRGPPAVVQA-ATKARQMQGK--------------------ACGLCLMTTHTTDACPEI
        GLLP DRSMVD ASGG+L +KTP +ARQLIS+MAENSQQ+GTR   ++ +       Q++ K                    ACG+C    H TD CP +
Subjt:  GLLPMDRSMVDVASGGALFNKTPVEARQLISSMAENSQQFGTRGPPAVVQA-ATKARQMQGK--------------------ACGLCLMTTHTTDACPEI

Query:  QD-NGEVNAIGGFNGN-QRHYNPYNNIYNPGWRDHPNFSYENQQKPLMFKNTIQNLGNEITQLATQVSKMNNKSSGNLPTQPEVNPKANVNAVWSGVATP
        Q+ N E+NAIGGF G   R Y+P++N YNPGWRDHPN SY NQ     F++                    N      P Q +  P+A+           
Subjt:  QD-NGEVNAIGGFNGN-QRHYNPYNNIYNPGWRDHPNFSYENQQKPLMFKNTIQNLGNEITQLATQVSKMNNKSSGNLPTQPEVNPKANVNAVWSGVATP

Query:  DCPPGFPVVSLGAFYRLRILTSPRNWLACPNPLLPACKSIIHQGNHLSSYHRANYALGSAPVLKETWECQLKFKEETRARMQQLGAQVSQLADTVRKLEA
                                                 + G  L              ++K       +F+ ETRA +QQLG Q+SQLA +V KLEA
Subjt:  DCPPGFPVVSLGAFYRLRILTSPRNWLACPNPLLPACKSIIHQGNHLSSYHRANYALGSAPVLKETWECQLKFKEETRARMQQLGAQVSQLADTVRKLEA

Query:  KID-KLPIHPEIRRREEVCATTLRSGTVMSSSPQFPSPSAFEKNRETKKLEEKTSNLNFPKKRGVQFDPPI-DFNSYVPKASFPSRLGLQPEPLKEKEEK
        +   KLP   E+  +E V A TLRSG  +    Q  SP + E+ +  K  EE    +N  +K  V F+    ++NS +  + FPSRL    E   E+ E 
Subjt:  KID-KLPIHPEIRRREEVCATTLRSGTVMSSSPQFPSPSAFEKNRETKKLEEKTSNLNFPKKRGVQFDPPI-DFNSYVPKASFPSRLGLQPEPLKEKEEK

Query:  DILDPFKKVEVNIPLLEAVEQIPKVGKFLKKWCSRKGKPK
         IL+ F+KVE+NIPLLEA++QIPK  KFLK+ C+ K K K
Subjt:  DILDPFKKVEVNIPLLEAVEQIPKVGKFLKKWCSRKGKPK

XP_031091054.1 uncharacterized protein LOC115996048 [Ipomoea triloba]6.0e-4635.91Show/hide
Query:  GLLPMDRSMVDVASGGALFNKTPVEARQLISSMAENSQQFGTRGPPAVVQA---------------ATKARQM------QGKACGLCLMTTHTTDACPEI
        GLLPMDRSMVD ASGG+L +KTP +A+QLIS+MAENSQQ+GTR   ++ +                 T  RQM          CG+C    H TD CP++
Subjt:  GLLPMDRSMVDVASGGALFNKTPVEARQLISSMAENSQQFGTRGPPAVVQA---------------ATKARQM------QGKACGLCLMTTHTTDACPEI

Query:  QD-NGEVNAIGGFNGN-QRHYNPYNNIYNPGWRDHPNFSYENQQKPLMFKNTIQNLGNEITQLATQVSKMNNKSSGNLPTQPEVNPKANVNAVWSGVATP
        Q+ + E+NAIGGF+    R Y+ ++N YNPGWRDHPN SY NQ     F+N                    N      P Q +  P+A            
Subjt:  QD-NGEVNAIGGFNGN-QRHYNPYNNIYNPGWRDHPNFSYENQQKPLMFKNTIQNLGNEITQLATQVSKMNNKSSGNLPTQPEVNPKANVNAVWSGVATP

Query:  DCPPGFPVVSLGAFYRLRILTSPRNWLACPNPLLPACKSIIHQGNHLSSYHRANYALGSAPVLKETWECQLKFKEETRARMQQLGAQVSQLADTVRKLEA
                                                            +N  +    ++K       +F++ETRA +QQLG Q+SQLA +V KLEA
Subjt:  DCPPGFPVVSLGAFYRLRILTSPRNWLACPNPLLPACKSIIHQGNHLSSYHRANYALGSAPVLKETWECQLKFKEETRARMQQLGAQVSQLADTVRKLEA

Query:  KID-KLPIHPEIRRREEVCATTLRSGTVMSSSPQFPSPSAFEKNRETKKLEEKTSNLNFPKKRGVQFDPPI-DFNSYVPKASFPSRLGLQPEPLKEKEEK
        +   KLP+  E+  +E V A TLRSG  +    Q  S    E+ +E K  EE  S  N   K  V  +  +  +NS +P   FPSRL    E   E+ E 
Subjt:  KID-KLPIHPEIRRREEVCATTLRSGTVMSSSPQFPSPSAFEKNRETKKLEEKTSNLNFPKKRGVQFDPPI-DFNSYVPKASFPSRLGLQPEPLKEKEEK

Query:  DILDPFKKVEVNIPLLEAVEQIPKVGKFLKKWCSRKGKPK
         IL+ F+KVE+NIPLLEA++QIPK  KFLK+ C+ K K K
Subjt:  DILDPFKKVEVNIPLLEAVEQIPKVGKFLKKWCSRKGKPK

XP_031131881.1 uncharacterized protein LOC116033267 [Ipomoea triloba]2.4e-4736.59Show/hide
Query:  GLLPMDRSMVDVASGGALFNKTPVEARQLISSMAENSQQFGTRGPPAVVQA---------------ATKARQM------QGKACGLCLMTTHTTDACPEI
        GLLPMDRSMVD ASGG+L +KTP +ARQLIS+MA+NSQQ+GTR   ++ +                 T  RQM         ACG+C    H TD CP +
Subjt:  GLLPMDRSMVDVASGGALFNKTPVEARQLISSMAENSQQFGTRGPPAVVQA---------------ATKARQM------QGKACGLCLMTTHTTDACPEI

Query:  QD-NGEVNAIGGFNGN-QRHYNPYNNIYNPGWRDHPNFSYENQQKPLMFKNTIQNLGNEITQLATQVSKMNNKSSGNLPTQPEVNPKANVNAVWSGVATP
        Q+ + E+NAIGGF+G   R Y+P++N Y+PGWRDHPN SY NQ     F+N                    N      P Q +  P+A            
Subjt:  QD-NGEVNAIGGFNGN-QRHYNPYNNIYNPGWRDHPNFSYENQQKPLMFKNTIQNLGNEITQLATQVSKMNNKSSGNLPTQPEVNPKANVNAVWSGVATP

Query:  DCPPGFPVVSLGAFYRLRILTSPRNWLACPNPLLPACKSIIHQGNHLSSYHRANYALGSAPVLKETWECQLKFKEETRARMQQLGAQVSQLADTVRKLEA
                                                            +N  +    ++K       +F++ET A +QQLG Q+SQLA +V KLEA
Subjt:  DCPPGFPVVSLGAFYRLRILTSPRNWLACPNPLLPACKSIIHQGNHLSSYHRANYALGSAPVLKETWECQLKFKEETRARMQQLGAQVSQLADTVRKLEA

Query:  KID-KLPIHPEIRRREEVCATTLRSGTVMSSSPQFPSPSAFEKNRETKKLEEKTSNLNFPKKRGVQFDPPI-DFNSYVPKASFPSRLGLQPEPLKEKEEK
        +   KLP   E+  +E V A TLRSG  +    Q  SP   E+  E K  EE  S  N  K   V  +  +  +NS +P   FPSRL    E   E+ E 
Subjt:  KID-KLPIHPEIRRREEVCATTLRSGTVMSSSPQFPSPSAFEKNRETKKLEEKTSNLNFPKKRGVQFDPPI-DFNSYVPKASFPSRLGLQPEPLKEKEEK

Query:  DILDPFKKVEVNIPLLEAVEQIPKVGKFLKKWCSRKGKPK
         IL+ F+KVE+NIPLLEAV+QIPK  KFLK+ C+ K K K
Subjt:  DILDPFKKVEVNIPLLEAVEQIPKVGKFLKKWCSRKGKPK

TrEMBL top hitse value%identityAlignment
A0A0A0KER1 Uncharacterized protein1.4e-4358.39Show/hide
Query:  PEATVVPIVREFYANMTDRSITSFVRGKMIPFDSASINQFYDHPNIDRDGYNDYASNHFDAHQIIEHLCRSGAVWLIRRGEAINFKSSDLTVDKRAWHSF
        PE  V+ IVREFYANM + S  SFVRG+ + FD  +IN++Y  PN +RD Y+ YAS H D HQII  LC+ GA W+I  GE I FKSS+LTV  + WH F
Subjt:  PEATVVPIVREFYANMTDRSITSFVRGKMIPFDSASINQFYDHPNIDRDGYNDYASNHFDAHQIIEHLCRSGAVWLIRRGEAINFKSSDLTVDKRAWHSF

Query:  LCAKLMLVMHLSDVTKKRATLLFAIATSRSVDVGKVIYASMRRIGRGATTVALGHPSLITA
        +CAKL+ V H S VTK+RA LL+AIAT RSVDVGKVI  S+  I +   T  LGH SLITA
Subjt:  LCAKLMLVMHLSDVTKKRATLLFAIATSRSVDVGKVIYASMRRIGRGATTVALGHPSLITA

A0A1S3C7Y0 uncharacterized protein LOC1034979964.6e-4459.01Show/hide
Query:  PEATVVPIVREFYANMTDRSITSFVRGKMIPFDSASINQFYDHPNIDRDGYNDYASNHFDAHQIIEHLCRSGAVWLIRRGEAINFKSSDLTVDKRAWHSF
        PE  VV IVREFYANM + S  SFVRG+ + FD  +IN++Y  PN +RD Y  YAS H D HQII  LC+ GA W+I  GE I FKSS+LTV  + WH F
Subjt:  PEATVVPIVREFYANMTDRSITSFVRGKMIPFDSASINQFYDHPNIDRDGYNDYASNHFDAHQIIEHLCRSGAVWLIRRGEAINFKSSDLTVDKRAWHSF

Query:  LCAKLMLVMHLSDVTKKRATLLFAIATSRSVDVGKVIYASMRRIGRGATTVALGHPSLITA
        +CAKL+ V H S VTK+RA LL+AIAT RSVDVGKVI+ S+  I +   T  LGH SLITA
Subjt:  LCAKLMLVMHLSDVTKKRATLLFAIATSRSVDVGKVIYASMRRIGRGATTVALGHPSLITA

A0A5D3BBY3 Putative S-locus lectin protein kinase family protein4.6e-4459.01Show/hide
Query:  PEATVVPIVREFYANMTDRSITSFVRGKMIPFDSASINQFYDHPNIDRDGYNDYASNHFDAHQIIEHLCRSGAVWLIRRGEAINFKSSDLTVDKRAWHSF
        PE  VV IVREFYANM + S  SFVRG+ + FD  +IN++Y  PN +RD Y  YAS H D HQII  LC+ GA W+I  GE I FKSS+LTV  + WH F
Subjt:  PEATVVPIVREFYANMTDRSITSFVRGKMIPFDSASINQFYDHPNIDRDGYNDYASNHFDAHQIIEHLCRSGAVWLIRRGEAINFKSSDLTVDKRAWHSF

Query:  LCAKLMLVMHLSDVTKKRATLLFAIATSRSVDVGKVIYASMRRIGRGATTVALGHPSLITA
        +CAKL+ V H S VTK+RA LL+AIAT RSVDVGKVI+ S+  I +   T  LGH SLITA
Subjt:  LCAKLMLVMHLSDVTKKRATLLFAIATSRSVDVGKVIYASMRRIGRGATTVALGHPSLITA

A0A6A2ZNL1 Squalene monooxygenase1.0e-4330.7Show/hide
Query:  SQGNDLRFEPEISRAETRFRRQARRRRVNNLGEVE-ALAMVERTLRQLAVPDLNQKPLCITYPKT-----------------------------------
        S G +L+++PEI +              +N  EVE  +A  ERTLR+L VP++NQ+PLCI YP                                     
Subjt:  SQGNDLRFEPEISRAETRFRRQARRRRVNNLGEVE-ALAMVERTLRQLAVPDLNQKPLCITYPKT-----------------------------------

Query:  ------------TGLLPMDRSMVDVASGGALFNKTPVEARQLISSMAENSQQFGTRGPPAVVQAATKARQMQGKACGLCLMTTHTTDACPEIQDNGE--V
                     GLLPM+R M+D ASGGA+ NKTP +AR+LIS MA NSQQFG R   +  +     +  Q KACG+C    + TD CP +QD+     
Subjt:  ------------TGLLPMDRSMVDVASGGALFNKTPVEARQLISSMAENSQQFGTRGPPAVVQAATKARQMQGKACGLCLMTTHTTDACPEIQDNGE--V

Query:  NAIGGF-NGNQRHYNPYNNIYNPGWRDHPNFSYENQQKPLMFKNTIQNLGNEITQLATQVSKMNNKSSGNLPTQPEVNPKANVNAVWSGVATPDCPPGFP
        NA+G F +  QR Y+PY N YNPGWRDHPN SY                                   G  P+Q + +P             P+     P
Subjt:  NAIGGF-NGNQRHYNPYNNIYNPGWRDHPNFSYENQQKPLMFKNTIQNLGNEITQLATQVSKMNNKSSGNLPTQPEVNPKANVNAVWSGVATPDCPPGFP

Query:  VVSLGAFYRLRILTSPRNWLACPNPLLPACKSIIHQGNHLSSYHRANYALGSAPVLKETWECQLKFKEETRARMQQLGAQVSQLADTVRKLEAKIDKLPI
         +SL                                                  ++K       +F++ETR  +Q L  QVSQLA +V +LE++  KLP 
Subjt:  VVSLGAFYRLRILTSPRNWLACPNPLLPACKSIIHQGNHLSSYHRANYALGSAPVLKETWECQLKFKEETRARMQQLGAQVSQLADTVRKLEAKIDKLPI

Query:  HPEIRRREEVCATTLRSGTVMSSSPQFPSPSAFEKNRETKKLEEKTSNLNFPKKRGVQFDPPIDFNSYVPKASFPSRLGLQPEPLKEKEEKDILDPFKKV
           +  ++ V A TLRS   +          A E+  E + +  +T      + +G++ +   +F   V + SFPSR     +  +E EEK+IL+ F+K+
Subjt:  HPEIRRREEVCATTLRSGTVMSSSPQFPSPSAFEKNRETKKLEEKTSNLNFPKKRGVQFDPPIDFNSYVPKASFPSRLGLQPEPLKEKEEKDILDPFKKV

Query:  EVNIPLLEAVEQIPKVGKFLKKWCSRKGKPK
        EVNIPLL+A+EQ+P+  K LK+ C+ K K K
Subjt:  EVNIPLLEAVEQIPKVGKFLKKWCSRKGKPK

A0A6P6VIA6 uncharacterized protein LOC1137236122.3e-4334.09Show/hide
Query:  GLLPMDRSMVDVASGGALFNKTPVEARQLISSMAENSQQFGTR--GPP-------------------AVVQAATKARQMQGKACGLCLMTTHTTDACPEI
        GL   DRS++D ASGGAL NKTP EA +LI +MAENSQQFG R   PP                   +VV+        + K CG+C    H TD+CP +
Subjt:  GLLPMDRSMVDVASGGALFNKTPVEARQLISSMAENSQQFGTR--GPP-------------------AVVQAATKARQMQGKACGLCLMTTHTTDACPEI

Query:  QDNG--EVNAIGGFNGNQRHYNPYNNIYNPGWRDHPNFSYENQQKPLMFKNTIQNLGNEITQLATQVSKMNNKSSGNLPTQPEVNPKANVNAVWSGVATP
        Q++G  +VN  GG    +R Y+PY+N YNPGWRDHPN SY N+Q         QN                                          A P
Subjt:  QDNG--EVNAIGGFNGNQRHYNPYNNIYNPGWRDHPNFSYENQQKPLMFKNTIQNLGNEITQLATQVSKMNNKSSGNLPTQPEVNPKANVNAVWSGVATP

Query:  DCPPGFPVVSLGAFYRLRILTSPRNWLACPNPLLPACKSIIHQGNHLSSYHRANYALGSAPVLKETWECQL---KFKEETRARMQQLGAQVSQLADTVRK
        + PPGF                 + W     P      S +       +    N A  +A + +ET    +   +F+++T+A M+ + A++SQLA  + +
Subjt:  DCPPGFPVVSLGAFYRLRILTSPRNWLACPNPLLPACKSIIHQGNHLSSYHRANYALGSAPVLKETWECQL---KFKEETRARMQQLGAQVSQLADTVRK

Query:  LEA-KIDKLPIHPEIRRREEVCATTLRSGTVMSSSPQFPSPSAFEKNRETKKLEEKTSNLNFPKKRGVQFDPPIDFNSYVPKASFPSRLGLQPEPLKEKE
        LE+    KLP  PE+  R  V A TLRSG  +   P+  +P +  +    K++EE+      PK   V F P     S +P   FP RL    +  K ++
Subjt:  LEA-KIDKLPIHPEIRRREEVCATTLRSGTVMSSSPQFPSPSAFEKNRETKKLEEKTSNLNFPKKRGVQFDPPIDFNSYVPKASFPSRLGLQPEPLKEKE

Query:  EKDILDPFKKVEVNIPLLEAVEQIPKVGKFLKKWCSRKGK
        EK++LD F+KVE+NIPLL+A++QIPK  KFLK  C+ K K
Subjt:  EKDILDPFKKVEVNIPLLEAVEQIPKVGKFLKKWCSRKGK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCACACGTTCTTCGCAGGGAAACGATCTTAGATTTGAACCTGAAATTTCTAGAGCGGAGACGAGATTTAGAAGGCAAGCAAGAAGAAGAAGAGTTAATAATCTAGG
CGAGGTTGAAGCACTAGCCATGGTTGAGAGGACATTAAGGCAGCTGGCAGTGCCGGATCTAAACCAGAAGCCACTCTGTATTACCTACCCTAAGACGACAGGATTACTAC
CAATGGATCGAAGTATGGTGGATGTCGCCAGTGGAGGAGCATTATTCAACAAGACACCTGTTGAGGCTAGACAACTGATTTCAAGTATGGCTGAGAACTCTCAGCAATTC
GGGACGAGAGGACCACCTGCAGTTGTGCAAGCCGCCACAAAAGCGAGACAGATGCAAGGGAAAGCATGTGGACTTTGCTTGATGACTACCCATACTACCGATGCCTGCCC
TGAGATACAAGACAATGGAGAGGTGAATGCCATTGGTGGATTCAATGGGAATCAAAGGCACTACAACCCATACAACAACATTTACAATCCAGGATGGAGAGACCATCCAA
ATTTTAGTTATGAGAACCAACAAAAGCCACTGATGTTCAAGAACACCATTCAGAATTTGGGAAACGAAATCACTCAGCTGGCTACCCAAGTCAGTAAGATGAACAATAAA
AGTTCTGGTAACCTCCCTACCCAACCTGAGGTAAATCCTAAGGCAAACGTGAACGCAGTGTGGAGTGGAGTGGCCACACCTGACTGTCCCCCTGGATTCCCCGTTGTCTC
TCTGGGCGCTTTTTATAGGCTGAGGATACTGACTTCTCCAAGAAATTGGCTTGCGTGTCCCAACCCTCTCTTGCCAGCTTGCAAATCAATCATACATCAGGGCAACCATC
TTTCCAGCTATCATCGTGCGAATTATGCTCTGGGATCTGCGCCTGTGCTGAAGGAAACTTGGGAATGTCAACTCAAATTCAAAGAGGAGACCAGAGCTAGGATGCAACAA
TTGGGAGCCCAAGTCTCTCAATTGGCTGACACTGTCAGAAAATTGGAGGCCAAAATTGACAAACTACCTATTCATCCTGAGATCCGAAGAAGAGAAGAGGTATGTGCTAC
CACGTTAAGGAGTGGGACAGTCATGAGTTCCAGTCCACAATTTCCTTCTCCGTCTGCATTTGAAAAGAATAGAGAGACAAAGAAGCTAGAAGAGAAGACGAGCAATCTCA
ACTTTCCAAAGAAGCGAGGGGTACAGTTCGATCCTCCTATTGATTTTAATTCTTATGTTCCTAAAGCTTCTTTCCCTAGCAGGTTAGGACTTCAGCCTGAACCTCTCAAG
GAAAAGGAAGAAAAGGACATACTTGACCCATTCAAGAAGGTGGAGGTCAACATCCCACTTCTGGAAGCCGTAGAGCAAATTCCTAAGGTAGGAAAATTTTTAAAGAAATG
GTGCTCCAGGAAAGGTAAGCCTAAGGTTCACTGGGCACCCGAGGCCACTGTAGTCCCTATAGTTAGGGAGTTTTACGCTAACATGACGGATAGATCCATTACTTCCTTTG
TCAGGGGTAAAATGATCCCTTTCGACTCGGCCTCCATTAACCAATTCTATGACCATCCCAATATTGATCGCGATGGGTACAATGATTACGCAAGTAATCACTTTGACGCC
CATCAAATTATAGAACACTTATGTAGGTCGGGGGCAGTTTGGTTAATAAGAAGGGGTGAAGCAATAAACTTTAAATCCTCTGACCTGACAGTGGATAAAAGAGCATGGCA
TAGTTTTCTGTGTGCCAAACTCATGCTTGTGATGCATCTTAGTGATGTAACAAAGAAGAGAGCGACTTTGTTATTCGCTATAGCAACCAGCCGCAGTGTAGACGTAGGAA
AAGTTATATATGCTTCCATGCGCCGGATTGGCAGAGGAGCGACCACAGTAGCTCTAGGGCACCCTTCCCTCATCACAGCATCTGTAGAGCCGTTGGGGTCGTTTGGGACC
CCCGTGAGGAGATTAGCCATCCTGCAGCTGCGATTGATGGGAACTTCATCACGATTAGATTTAGGGAGCCTGAACCTAGGATAG
mRNA sequenceShow/hide mRNA sequence
ATGATCACACGTTCTTCGCAGGGAAACGATCTTAGATTTGAACCTGAAATTTCTAGAGCGGAGACGAGATTTAGAAGGCAAGCAAGAAGAAGAAGAGTTAATAATCTAGG
CGAGGTTGAAGCACTAGCCATGGTTGAGAGGACATTAAGGCAGCTGGCAGTGCCGGATCTAAACCAGAAGCCACTCTGTATTACCTACCCTAAGACGACAGGATTACTAC
CAATGGATCGAAGTATGGTGGATGTCGCCAGTGGAGGAGCATTATTCAACAAGACACCTGTTGAGGCTAGACAACTGATTTCAAGTATGGCTGAGAACTCTCAGCAATTC
GGGACGAGAGGACCACCTGCAGTTGTGCAAGCCGCCACAAAAGCGAGACAGATGCAAGGGAAAGCATGTGGACTTTGCTTGATGACTACCCATACTACCGATGCCTGCCC
TGAGATACAAGACAATGGAGAGGTGAATGCCATTGGTGGATTCAATGGGAATCAAAGGCACTACAACCCATACAACAACATTTACAATCCAGGATGGAGAGACCATCCAA
ATTTTAGTTATGAGAACCAACAAAAGCCACTGATGTTCAAGAACACCATTCAGAATTTGGGAAACGAAATCACTCAGCTGGCTACCCAAGTCAGTAAGATGAACAATAAA
AGTTCTGGTAACCTCCCTACCCAACCTGAGGTAAATCCTAAGGCAAACGTGAACGCAGTGTGGAGTGGAGTGGCCACACCTGACTGTCCCCCTGGATTCCCCGTTGTCTC
TCTGGGCGCTTTTTATAGGCTGAGGATACTGACTTCTCCAAGAAATTGGCTTGCGTGTCCCAACCCTCTCTTGCCAGCTTGCAAATCAATCATACATCAGGGCAACCATC
TTTCCAGCTATCATCGTGCGAATTATGCTCTGGGATCTGCGCCTGTGCTGAAGGAAACTTGGGAATGTCAACTCAAATTCAAAGAGGAGACCAGAGCTAGGATGCAACAA
TTGGGAGCCCAAGTCTCTCAATTGGCTGACACTGTCAGAAAATTGGAGGCCAAAATTGACAAACTACCTATTCATCCTGAGATCCGAAGAAGAGAAGAGGTATGTGCTAC
CACGTTAAGGAGTGGGACAGTCATGAGTTCCAGTCCACAATTTCCTTCTCCGTCTGCATTTGAAAAGAATAGAGAGACAAAGAAGCTAGAAGAGAAGACGAGCAATCTCA
ACTTTCCAAAGAAGCGAGGGGTACAGTTCGATCCTCCTATTGATTTTAATTCTTATGTTCCTAAAGCTTCTTTCCCTAGCAGGTTAGGACTTCAGCCTGAACCTCTCAAG
GAAAAGGAAGAAAAGGACATACTTGACCCATTCAAGAAGGTGGAGGTCAACATCCCACTTCTGGAAGCCGTAGAGCAAATTCCTAAGGTAGGAAAATTTTTAAAGAAATG
GTGCTCCAGGAAAGGTAAGCCTAAGGTTCACTGGGCACCCGAGGCCACTGTAGTCCCTATAGTTAGGGAGTTTTACGCTAACATGACGGATAGATCCATTACTTCCTTTG
TCAGGGGTAAAATGATCCCTTTCGACTCGGCCTCCATTAACCAATTCTATGACCATCCCAATATTGATCGCGATGGGTACAATGATTACGCAAGTAATCACTTTGACGCC
CATCAAATTATAGAACACTTATGTAGGTCGGGGGCAGTTTGGTTAATAAGAAGGGGTGAAGCAATAAACTTTAAATCCTCTGACCTGACAGTGGATAAAAGAGCATGGCA
TAGTTTTCTGTGTGCCAAACTCATGCTTGTGATGCATCTTAGTGATGTAACAAAGAAGAGAGCGACTTTGTTATTCGCTATAGCAACCAGCCGCAGTGTAGACGTAGGAA
AAGTTATATATGCTTCCATGCGCCGGATTGGCAGAGGAGCGACCACAGTAGCTCTAGGGCACCCTTCCCTCATCACAGCATCTGTAGAGCCGTTGGGGTCGTTTGGGACC
CCCGTGAGGAGATTAGCCATCCTGCAGCTGCGATTGATGGGAACTTCATCACGATTAGATTTAGGGAGCCTGAACCTAGGATAG
Protein sequenceShow/hide protein sequence
MITRSSQGNDLRFEPEISRAETRFRRQARRRRVNNLGEVEALAMVERTLRQLAVPDLNQKPLCITYPKTTGLLPMDRSMVDVASGGALFNKTPVEARQLISSMAENSQQF
GTRGPPAVVQAATKARQMQGKACGLCLMTTHTTDACPEIQDNGEVNAIGGFNGNQRHYNPYNNIYNPGWRDHPNFSYENQQKPLMFKNTIQNLGNEITQLATQVSKMNNK
SSGNLPTQPEVNPKANVNAVWSGVATPDCPPGFPVVSLGAFYRLRILTSPRNWLACPNPLLPACKSIIHQGNHLSSYHRANYALGSAPVLKETWECQLKFKEETRARMQQ
LGAQVSQLADTVRKLEAKIDKLPIHPEIRRREEVCATTLRSGTVMSSSPQFPSPSAFEKNRETKKLEEKTSNLNFPKKRGVQFDPPIDFNSYVPKASFPSRLGLQPEPLK
EKEEKDILDPFKKVEVNIPLLEAVEQIPKVGKFLKKWCSRKGKPKVHWAPEATVVPIVREFYANMTDRSITSFVRGKMIPFDSASINQFYDHPNIDRDGYNDYASNHFDA
HQIIEHLCRSGAVWLIRRGEAINFKSSDLTVDKRAWHSFLCAKLMLVMHLSDVTKKRATLLFAIATSRSVDVGKVIYASMRRIGRGATTVALGHPSLITASVEPLGSFGT
PVRRLAILQLRLMGTSSRLDLGSLNLG