; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg019040 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg019040
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionTy3-gypsy retrotransposon protein
Genome locationscaffold12:24474261..24481821
RNA-Seq ExpressionSpg019040
SyntenySpg019040
Gene Ontology termsGO:0090304 - nucleic acid metabolic process (biological process)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0043405.1 hypothetical protein E6C27_scaffold1639G00040 [Cucumis melo var. makuwa]1.0e-3846.33Show/hide
Query:  ARKEAVEDVNTSDLKKGETSTSLVKPKVVKDKKCSNSPILRYVPLSRRKKGESPFTECSESIK-------------KKLLKEGYSLPTTRKGLGYKSPEP
        A KE  + + T   +KGE STS  K  ++ D+K SN  ILRYVPLSRR+KGESPF +  + +K             KKLL+EG+ +P +RKG GYKSPEP
Subjt:  ARKEAVEDVNTSDLKKGETSTSLVKPKVVKDKKCSNSPILRYVPLSRRKKGESPFTECSESIK-------------KKLLKEGYSLPTTRKGLGYKSPEP

Query:  VRITRRGKAKVADTNHITIEEVDDSKEKKSVDQRTSVFRRIRPPVARALVFQRLSVNETEEESTQPTNSSTRPSVFRRLSMPSEEEESTFSTPNVTQPSA
        + I R+ K KV D+NHIT+ EVD  +EK+   QR S F +IRP V R  VF+RLSV +TE +  Q T+S  + S  +RL+M  ++E+ T      T+PSA
Subjt:  VRITRRGKAKVADTNHITIEEVDDSKEKKSVDQRTSVFRRIRPPVARALVFQRLSVNETEEESTQPTNSSTRPSVFRRLSMPSEEEESTFSTPNVTQPSA

Query:  FQRLNVPMGKEESTFSAP
        F+RL+V   K+  T   P
Subjt:  FQRLNVPMGKEESTFSAP

KAA0055957.1 uncharacterized protein E6C27_scaffold319G00830 [Cucumis melo var. makuwa]5.5e-4041.42Show/hide
Query:  DVPASSSGTVAESQADARKEAVEDVNTSDLKKGETSTSLVKPKVVKDKKCSNSPILRYVPLSRRKKGESPFTECSESIK---------------------
        +VP  +     + ++ A KE  + + T    K E ST+  K  ++ D+K SN PILRYVPLSRRKKGESPF E  + +K                     
Subjt:  DVPASSSGTVAESQADARKEAVEDVNTSDLKKGETSTSLVKPKVVKDKKCSNSPILRYVPLSRRKKGESPFTECSESIK---------------------

Query:  ----------------------KKLLKEGYSLPTTRKGLGYKSPEPVRITRRGKAKVADTNHITIEEVDDSKEKKSVDQRTSVFRRIRPPVARALVFQRL
                              KKLL+EG+ +P +RKGLGYKSPEP+RITR+GK KV D NHIT++EVD  +EK+  +QRTS F RI P VARA VF+RL
Subjt:  ----------------------KKLLKEGYSLPTTRKGLGYKSPEPVRITRRGKAKVADTNHITIEEVDDSKEKKSVDQRTSVFRRIRPPVARALVFQRL

Query:  SVNETEEESTQPTNSSTRPSVFRRLSMPSEEEESTFSTPNVTQPSAFQRLNVPMGKEESTFSAPEVAR
        S+ E + +  Q T++  R S F+RL++  +EE+    T   T+PSAF+RL++   K   T  AP + R
Subjt:  SVNETEEESTQPTNSSTRPSVFRRLSMPSEEEESTFSTPNVTQPSAFQRLNVPMGKEESTFSAPEVAR

KAA0061113.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]3.8e-4129.11Show/hide
Query:  IIEDHTPLAVASSISKLMEESSKDRVAVKDNPLFES---VIPTSKRSKDTLNLDVMSVLMADIDQDERMAEMERKLNLLMKALIQFGTLDPIVVRFQKEA
        ++ + TPL  +       E+     +A     L E     +P  KRS+   N+D  +           ++    K  +L + +++      I +  ++E 
Subjt:  IIEDHTPLAVASSISKLMEESSKDRVAVKDNPLFES---VIPTSKRSKDTLNLDVMSVLMADIDQDERMAEMERKLNLLMKALIQFGTLDPIVVRFQKEA

Query:  TMKGSQEKYVSIEDENEGWTLVVRRKKQKQSYARKESRLFRDNKRKAKSQKKKGKKKSRRSKPVMEESEDFFCPPQPITLAEYFPRRFLDDSQGEALETV
        T + SQEK   IE+++EGWT+V RRKK+K +  +KESRL+ + +R  K+QK K KKK+R+ K V E+ +DF    + +TLA++FP RFL D Q E    V
Subjt:  TMKGSQEKYVSIEDENEGWTLVVRRKKQKQSYARKESRLFRDNKRKAKSQKKKGKKKSRRSKPVMEESEDFFCPPQPITLAEYFPRRFLDDSQGEALETV

Query:  T-----------------CHIVDVVEDD--------DVPASSSGTVAESQAD------------------------------------------------
                          C  +D  ++D        + P   SG V E + +                                                
Subjt:  T-----------------CHIVDVVEDD--------DVPASSSGTVAESQAD------------------------------------------------

Query:  -------------------------------ARKEAVEDVNTSDLKKGETSTSLVKPKVVKDKKCSNSPILRYVPLSRRKKGESPFTECSESIK------
                                       A +E+ +   T    K E STS  K  +V D+K SN PILRYVPLSRRKKGESPF E  + +K      
Subjt:  -------------------------------ARKEAVEDVNTSDLKKGETSTSLVKPKVVKDKKCSNSPILRYVPLSRRKKGESPFTECSESIK------

Query:  -----------------------------------------------------------------------KKLLKEGYSLPTTRKGLGYKSPEPVRITR
                                                                               KKLL+EG+++P +RKGLGYK PEP+RITR
Subjt:  -----------------------------------------------------------------------KKLLKEGYSLPTTRKGLGYKSPEPVRITR

Query:  RGKAKVADTNHITIEEVDDSKEKKSVDQRTSVFRRIRPPVARALVFQRLSVNETEEESTQPTNSSTRPSVFRRLSMPSEEEEST
        +GK KV D+NHIT++EVD  +EK+   QRTS F R+ P VARA VF+RLS+ E E +  Q T+S  R S F+RL+M  + E+ +
Subjt:  RGKAKVADTNHITIEEVDDSKEKKSVDQRTSVFRRIRPPVARALVFQRLSVNETEEESTQPTNSSTRPSVFRRLSMPSEEEEST

TYK05005.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]8.5e-4127.3Show/hide
Query:  ERKLNLLMKALIQFGTLDPIVVRFQKEATMKGSQEKYVSIEDENEGWTLVVRRKKQKQSYARKESRLFRDNKRKAKSQKKKGKKKSRRSKPVMEESEDFF
        E+K+ L    L +FGT +P+VVRF +E   + SQEK   IE+++E WT+V RRKK+K +  +KE R +R+ +R  K+QK K KKK+R+ K + +E +DF 
Subjt:  ERKLNLLMKALIQFGTLDPIVVRFQKEATMKGSQEKYVSIEDENEGWTLVVRRKKQKQSYARKESRLFRDNKRKAKSQKKKGKKKSRRSKPVMEESEDFF

Query:  CPPQPITLAEYFPRRFLDDSQGEALETVTCHIVDVVEDDDVPASS-------------------------------------------------------
           + ITLA++FP RFL D Q E    V CH ++  E++ +P  S                                                       
Subjt:  CPPQPITLAEYFPRRFLDDSQGEALETVTCHIVDVVEDDDVPASS-------------------------------------------------------

Query:  ----------------------SGTVAESQAD--------------------------------------------------------------------
                              SG V E + D                                                                    
Subjt:  ----------------------SGTVAESQAD--------------------------------------------------------------------

Query:  --------------------------------ARKEAVEDVNTSDLKKGETSTSLVKPKVVKDKKCSNSPILRYVPLSRRKKGESPFTECSESIK-----
                                        A KE    + T   +K E STS  K  ++ D+K SN  ILRYVPLSRRKKGESPF E  + +K     
Subjt:  --------------------------------ARKEAVEDVNTSDLKKGETSTSLVKPKVVKDKKCSNSPILRYVPLSRRKKGESPFTECSESIK-----

Query:  ------------------------------------------------------------------------KKLLKEGYSLPTTRKGLGYKSPEPVRIT
                                                                                KKLL+EG+++P +RKGLGYK PEP+RIT
Subjt:  ------------------------------------------------------------------------KKLLKEGYSLPTTRKGLGYKSPEPVRIT

Query:  RRGKAKVADTNHITIEEVDDSKEKKSVDQRTSVFRRIRPPVARALVFQRLSVNETEEESTQPTNSSTRPSVFRRLSMPSEEEESTFSTPNVTQPS-----
        R+GK K+ D+NHIT++EVD  KEK+   QRTS F RI P VARA VF+RLSV E E +  Q T++  R S F RLS+  ++   T   P + +       
Subjt:  RRGKAKVADTNHITIEEVDDSKEKKSVDQRTSVFRRIRPPVARALVFQRLSVNETEEESTQPTNSSTRPSVFRRLSMPSEEEESTFSTPNVTQPS-----

Query:  AFQRLNVPMGKEESTFSAPEVARPSVFQR---LNVTTRRDK--------EEQSASSISHRLQTGKFKQQGAKRGHSAHQRRDATL
             ++ + K+EST      +R SV+ R   +NV +R  K        E +  S++  R++   F    A +G    +R D  L
Subjt:  AFQRLNVPMGKEESTFSAPEVARPSVFQR---LNVTTRRDK--------EEQSASSISHRLQTGKFKQQGAKRGHSAHQRRDATL

TYK28162.1 uncharacterized protein E5676_scaffold289G00760 [Cucumis melo var. makuwa]3.0e-3837.38Show/hide
Query:  DVPASSSGTVAESQADARKEAVEDVNTSDLKKGETSTSLVKPKVVKDKKCSNSPILRYVPLSRRKKGESPFTECSESIK---------------------
        +VP  +     + ++ A KE  + + T    K E ST+  K  ++ D+K SN PILRYVPLSRRKKGESPF E  + +K                     
Subjt:  DVPASSSGTVAESQADARKEAVEDVNTSDLKKGETSTSLVKPKVVKDKKCSNSPILRYVPLSRRKKGESPFTECSESIK---------------------

Query:  --------------------------------KKLLKEGYSLPTTRKGLGYKSPEPVRITRRGKAKVADTNHITIEEVDDSKEKKSVDQRTSVFRRIRPP
                                        KKLL+EG+ +P +RKGLGYKSPEP+RITR+GK KV D+NHIT++EVD  +EK+  +QRTS F RI P 
Subjt:  --------------------------------KKLLKEGYSLPTTRKGLGYKSPEPVRITRRGKAKVADTNHITIEEVDDSKEKKSVDQRTSVFRRIRPP

Query:  VARALVFQRLSVNETEEESTQPTNSSTRPSVFRRLSMPSEEEESTFSTPNVTQPSAFQRLNVPMGKEESTFSAPEVARPS-----VFQRLNVTTRRDKEE
        VARA VF+RLS+ E E +  Q T++  + S F+RL++  +EE+    T   T+PSAF+RL++   K   T  AP + R       V    ++ T++ KE 
Subjt:  VARALVFQRLSVNETEEESTQPTNSSTRPSVFRRLSMPSEEEESTFSTPNVTQPSAFQRLNVPMGKEESTFSAPEVARPS-----VFQRLNVTTRRDKEE

Query:  QSASSISHRLQTGKFKQQGAK
         S   + HR++  K + +  K
Subjt:  QSASSISHRLQTGKFKQQGAK

TrEMBL top hitse value%identityAlignment
A0A5A7TPR5 RNase H domain-containing protein5.0e-3946.33Show/hide
Query:  ARKEAVEDVNTSDLKKGETSTSLVKPKVVKDKKCSNSPILRYVPLSRRKKGESPFTECSESIK-------------KKLLKEGYSLPTTRKGLGYKSPEP
        A KE  + + T   +KGE STS  K  ++ D+K SN  ILRYVPLSRR+KGESPF +  + +K             KKLL+EG+ +P +RKG GYKSPEP
Subjt:  ARKEAVEDVNTSDLKKGETSTSLVKPKVVKDKKCSNSPILRYVPLSRRKKGESPFTECSESIK-------------KKLLKEGYSLPTTRKGLGYKSPEP

Query:  VRITRRGKAKVADTNHITIEEVDDSKEKKSVDQRTSVFRRIRPPVARALVFQRLSVNETEEESTQPTNSSTRPSVFRRLSMPSEEEESTFSTPNVTQPSA
        + I R+ K KV D+NHIT+ EVD  +EK+   QR S F +IRP V R  VF+RLSV +TE +  Q T+S  + S  +RL+M  ++E+ T      T+PSA
Subjt:  VRITRRGKAKVADTNHITIEEVDDSKEKKSVDQRTSVFRRIRPPVARALVFQRLSVNETEEESTQPTNSSTRPSVFRRLSMPSEEEESTFSTPNVTQPSA

Query:  FQRLNVPMGKEESTFSAP
        F+RL+V   K+  T   P
Subjt:  FQRLNVPMGKEESTFSAP

A0A5A7UMY2 Reverse transcriptase domain-containing protein2.7e-4041.42Show/hide
Query:  DVPASSSGTVAESQADARKEAVEDVNTSDLKKGETSTSLVKPKVVKDKKCSNSPILRYVPLSRRKKGESPFTECSESIK---------------------
        +VP  +     + ++ A KE  + + T    K E ST+  K  ++ D+K SN PILRYVPLSRRKKGESPF E  + +K                     
Subjt:  DVPASSSGTVAESQADARKEAVEDVNTSDLKKGETSTSLVKPKVVKDKKCSNSPILRYVPLSRRKKGESPFTECSESIK---------------------

Query:  ----------------------KKLLKEGYSLPTTRKGLGYKSPEPVRITRRGKAKVADTNHITIEEVDDSKEKKSVDQRTSVFRRIRPPVARALVFQRL
                              KKLL+EG+ +P +RKGLGYKSPEP+RITR+GK KV D NHIT++EVD  +EK+  +QRTS F RI P VARA VF+RL
Subjt:  ----------------------KKLLKEGYSLPTTRKGLGYKSPEPVRITRRGKAKVADTNHITIEEVDDSKEKKSVDQRTSVFRRIRPPVARALVFQRL

Query:  SVNETEEESTQPTNSSTRPSVFRRLSMPSEEEESTFSTPNVTQPSAFQRLNVPMGKEESTFSAPEVAR
        S+ E + +  Q T++  R S F+RL++  +EE+    T   T+PSAF+RL++   K   T  AP + R
Subjt:  SVNETEEESTQPTNSSTRPSVFRRLSMPSEEEESTFSTPNVTQPSAFQRLNVPMGKEESTFSAPEVAR

A0A5D3BY54 Ty3-gypsy retrotransposon protein1.9e-4129.11Show/hide
Query:  IIEDHTPLAVASSISKLMEESSKDRVAVKDNPLFES---VIPTSKRSKDTLNLDVMSVLMADIDQDERMAEMERKLNLLMKALIQFGTLDPIVVRFQKEA
        ++ + TPL  +       E+     +A     L E     +P  KRS+   N+D  +           ++    K  +L + +++      I +  ++E 
Subjt:  IIEDHTPLAVASSISKLMEESSKDRVAVKDNPLFES---VIPTSKRSKDTLNLDVMSVLMADIDQDERMAEMERKLNLLMKALIQFGTLDPIVVRFQKEA

Query:  TMKGSQEKYVSIEDENEGWTLVVRRKKQKQSYARKESRLFRDNKRKAKSQKKKGKKKSRRSKPVMEESEDFFCPPQPITLAEYFPRRFLDDSQGEALETV
        T + SQEK   IE+++EGWT+V RRKK+K +  +KESRL+ + +R  K+QK K KKK+R+ K V E+ +DF    + +TLA++FP RFL D Q E    V
Subjt:  TMKGSQEKYVSIEDENEGWTLVVRRKKQKQSYARKESRLFRDNKRKAKSQKKKGKKKSRRSKPVMEESEDFFCPPQPITLAEYFPRRFLDDSQGEALETV

Query:  T-----------------CHIVDVVEDD--------DVPASSSGTVAESQAD------------------------------------------------
                          C  +D  ++D        + P   SG V E + +                                                
Subjt:  T-----------------CHIVDVVEDD--------DVPASSSGTVAESQAD------------------------------------------------

Query:  -------------------------------ARKEAVEDVNTSDLKKGETSTSLVKPKVVKDKKCSNSPILRYVPLSRRKKGESPFTECSESIK------
                                       A +E+ +   T    K E STS  K  +V D+K SN PILRYVPLSRRKKGESPF E  + +K      
Subjt:  -------------------------------ARKEAVEDVNTSDLKKGETSTSLVKPKVVKDKKCSNSPILRYVPLSRRKKGESPFTECSESIK------

Query:  -----------------------------------------------------------------------KKLLKEGYSLPTTRKGLGYKSPEPVRITR
                                                                               KKLL+EG+++P +RKGLGYK PEP+RITR
Subjt:  -----------------------------------------------------------------------KKLLKEGYSLPTTRKGLGYKSPEPVRITR

Query:  RGKAKVADTNHITIEEVDDSKEKKSVDQRTSVFRRIRPPVARALVFQRLSVNETEEESTQPTNSSTRPSVFRRLSMPSEEEEST
        +GK KV D+NHIT++EVD  +EK+   QRTS F R+ P VARA VF+RLS+ E E +  Q T+S  R S F+RL+M  + E+ +
Subjt:  RGKAKVADTNHITIEEVDDSKEKKSVDQRTSVFRRIRPPVARALVFQRLSVNETEEESTQPTNSSTRPSVFRRLSMPSEEEEST

A0A5D3C0W6 Ty3-gypsy retrotransposon protein4.1e-4127.3Show/hide
Query:  ERKLNLLMKALIQFGTLDPIVVRFQKEATMKGSQEKYVSIEDENEGWTLVVRRKKQKQSYARKESRLFRDNKRKAKSQKKKGKKKSRRSKPVMEESEDFF
        E+K+ L    L +FGT +P+VVRF +E   + SQEK   IE+++E WT+V RRKK+K +  +KE R +R+ +R  K+QK K KKK+R+ K + +E +DF 
Subjt:  ERKLNLLMKALIQFGTLDPIVVRFQKEATMKGSQEKYVSIEDENEGWTLVVRRKKQKQSYARKESRLFRDNKRKAKSQKKKGKKKSRRSKPVMEESEDFF

Query:  CPPQPITLAEYFPRRFLDDSQGEALETVTCHIVDVVEDDDVPASS-------------------------------------------------------
           + ITLA++FP RFL D Q E    V CH ++  E++ +P  S                                                       
Subjt:  CPPQPITLAEYFPRRFLDDSQGEALETVTCHIVDVVEDDDVPASS-------------------------------------------------------

Query:  ----------------------SGTVAESQAD--------------------------------------------------------------------
                              SG V E + D                                                                    
Subjt:  ----------------------SGTVAESQAD--------------------------------------------------------------------

Query:  --------------------------------ARKEAVEDVNTSDLKKGETSTSLVKPKVVKDKKCSNSPILRYVPLSRRKKGESPFTECSESIK-----
                                        A KE    + T   +K E STS  K  ++ D+K SN  ILRYVPLSRRKKGESPF E  + +K     
Subjt:  --------------------------------ARKEAVEDVNTSDLKKGETSTSLVKPKVVKDKKCSNSPILRYVPLSRRKKGESPFTECSESIK-----

Query:  ------------------------------------------------------------------------KKLLKEGYSLPTTRKGLGYKSPEPVRIT
                                                                                KKLL+EG+++P +RKGLGYK PEP+RIT
Subjt:  ------------------------------------------------------------------------KKLLKEGYSLPTTRKGLGYKSPEPVRIT

Query:  RRGKAKVADTNHITIEEVDDSKEKKSVDQRTSVFRRIRPPVARALVFQRLSVNETEEESTQPTNSSTRPSVFRRLSMPSEEEESTFSTPNVTQPS-----
        R+GK K+ D+NHIT++EVD  KEK+   QRTS F RI P VARA VF+RLSV E E +  Q T++  R S F RLS+  ++   T   P + +       
Subjt:  RRGKAKVADTNHITIEEVDDSKEKKSVDQRTSVFRRIRPPVARALVFQRLSVNETEEESTQPTNSSTRPSVFRRLSMPSEEEESTFSTPNVTQPS-----

Query:  AFQRLNVPMGKEESTFSAPEVARPSVFQR---LNVTTRRDK--------EEQSASSISHRLQTGKFKQQGAKRGHSAHQRRDATL
             ++ + K+EST      +R SV+ R   +NV +R  K        E +  S++  R++   F    A +G    +R D  L
Subjt:  AFQRLNVPMGKEESTFSAPEVARPSVFQR---LNVTTRRDK--------EEQSASSISHRLQTGKFKQQGAKRGHSAHQRRDATL

A0A5D3DXC7 Reverse transcriptase domain-containing protein1.5e-3837.38Show/hide
Query:  DVPASSSGTVAESQADARKEAVEDVNTSDLKKGETSTSLVKPKVVKDKKCSNSPILRYVPLSRRKKGESPFTECSESIK---------------------
        +VP  +     + ++ A KE  + + T    K E ST+  K  ++ D+K SN PILRYVPLSRRKKGESPF E  + +K                     
Subjt:  DVPASSSGTVAESQADARKEAVEDVNTSDLKKGETSTSLVKPKVVKDKKCSNSPILRYVPLSRRKKGESPFTECSESIK---------------------

Query:  --------------------------------KKLLKEGYSLPTTRKGLGYKSPEPVRITRRGKAKVADTNHITIEEVDDSKEKKSVDQRTSVFRRIRPP
                                        KKLL+EG+ +P +RKGLGYKSPEP+RITR+GK KV D+NHIT++EVD  +EK+  +QRTS F RI P 
Subjt:  --------------------------------KKLLKEGYSLPTTRKGLGYKSPEPVRITRRGKAKVADTNHITIEEVDDSKEKKSVDQRTSVFRRIRPP

Query:  VARALVFQRLSVNETEEESTQPTNSSTRPSVFRRLSMPSEEEESTFSTPNVTQPSAFQRLNVPMGKEESTFSAPEVARPS-----VFQRLNVTTRRDKEE
        VARA VF+RLS+ E E +  Q T++  + S F+RL++  +EE+    T   T+PSAF+RL++   K   T  AP + R       V    ++ T++ KE 
Subjt:  VARALVFQRLSVNETEEESTQPTNSSTRPSVFRRLSMPSEEEESTFSTPNVTQPSAFQRLNVPMGKEESTFSAPEVARPS-----VFQRLNVTTRRDKEE

Query:  QSASSISHRLQTGKFKQQGAK
         S   + HR++  K + +  K
Subjt:  QSASSISHRLQTGKFKQQGAK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGAAGAACCGAAGAGAATCAAAGAAGAAAAAGTCAATTTGGGGTCAAATGCAGACCAGCGTCGAGACGCTGCTCCTTGAGCGTCTCGACGCTAGCTTTCCTTTTCT
GAATACGCGCGTTTTAGAGGCAGCGTCGAGACGCTGTCTTGATAGCGTCTCGACGCTACGGCAAAAATCTGCACTATATTTAATAGGGATATCGGGTTCAATTGCATATG
GGAGAATCATTGAGGATCATACTCCTCTTGCTGTTGCAAGCAGCATCTCAAAGCTGATGGAGGAATCCTCTAAGGATAGGGTTGCAGTCAAAGACAACCCGCTATTCGAA
TCTGTCATTCCAACATCTAAGCGGTCAAAGGATACACTAAATCTTGATGTGATGTCCGTCCTGATGGCTGATATAGACCAAGATGAAAGAATGGCAGAGATGGAAAGAAA
ACTCAATCTCTTAATGAAGGCATTGATTCAATTTGGGACCCTTGATCCCATAGTGGTTCGATTCCAAAAAGAAGCCACGATGAAGGGATCCCAAGAAAAATATGTTTCCA
TCGAAGATGAAAACGAAGGTTGGACCCTTGTCGTTCGTCGCAAAAAGCAAAAGCAAAGTTACGCACGGAAAGAGTCCCGCCTATTTCGAGACAATAAAAGAAAGGCTAAG
TCTCAAAAGAAGAAAGGAAAAAAGAAGTCAAGGAGGTCAAAGCCTGTCATGGAAGAAAGTGAAGATTTCTTTTGTCCTCCACAACCCATAACTTTGGCAGAATACTTCCC
AAGGCGCTTTCTCGATGATAGTCAAGGAGAGGCACTTGAAACTGTCACGTGTCACATTGTGGACGTGGTGGAAGATGATGATGTCCCTGCTAGCTCCTCGGGAACGGTGG
CAGAGTCACAAGCAGATGCAAGAAAGGAAGCTGTTGAAGATGTGAATACTTCCGACCTGAAAAAGGGTGAAACGTCTACAAGCCTTGTGAAACCTAAGGTTGTAAAGGAT
AAGAAGTGTTCAAATTCACCTATCCTACGATACGTCCCCTTATCTCGACGTAAAAAGGGTGAATCACCTTTCACTGAATGTTCAGAAAGCATAAAGAAGAAACTTCTAAA
GGAAGGTTATAGTCTGCCTACAACAAGAAAAGGGCTTGGATATAAGTCGCCAGAGCCGGTTCGTATAACAAGAAGAGGGAAGGCAAAAGTGGCAGACACAAATCATATAA
CAATAGAGGAGGTTGATGACTCAAAAGAAAAAAAGAGTGTCGACCAACGAACTTCTGTTTTTAGGCGCATCAGGCCACCGGTTGCTCGTGCTTTAGTCTTTCAGAGATTA
AGTGTCAATGAAACGGAAGAAGAGAGTACACAACCTACCAATAGCTCCACTCGACCTTCAGTTTTTCGAAGGTTAAGTATGCCCAGTGAGGAAGAAGAGAGTACATTTTC
AACTCCGAATGTCACTCAACCTTCAGCTTTTCAAAGGTTAAATGTGCCCATGGGGAAAGAAGAGAGTACATTTTCAGCTCCGGAGGTGGCTCGACCATCAGTTTTTCAAA
GGTTAAATGTTACCACGAGAAGAGACAAAGAAGAACAATCTGCTTCATCGATTTCTCACCGACTTCAAACCGGAAAGTTCAAGCAACAAGGAGCCAAGAGAGGGCATTCT
GCCCATCAGCGTCGAGACGCCACTCTTGGAGGGTCTCGACGCTGGTTTTCCTTGATTAGAAGAGGCGGCAGCGTCACAGCGTCGAGACGCTGTGACAATAGCGTCTCGAC
GCTACCGTAA
mRNA sequenceShow/hide mRNA sequence
ATGCAGAAGAACCGAAGAGAATCAAAGAAGAAAAAGTCAATTTGGGGTCAAATGCAGACCAGCGTCGAGACGCTGCTCCTTGAGCGTCTCGACGCTAGCTTTCCTTTTCT
GAATACGCGCGTTTTAGAGGCAGCGTCGAGACGCTGTCTTGATAGCGTCTCGACGCTACGGCAAAAATCTGCACTATATTTAATAGGGATATCGGGTTCAATTGCATATG
GGAGAATCATTGAGGATCATACTCCTCTTGCTGTTGCAAGCAGCATCTCAAAGCTGATGGAGGAATCCTCTAAGGATAGGGTTGCAGTCAAAGACAACCCGCTATTCGAA
TCTGTCATTCCAACATCTAAGCGGTCAAAGGATACACTAAATCTTGATGTGATGTCCGTCCTGATGGCTGATATAGACCAAGATGAAAGAATGGCAGAGATGGAAAGAAA
ACTCAATCTCTTAATGAAGGCATTGATTCAATTTGGGACCCTTGATCCCATAGTGGTTCGATTCCAAAAAGAAGCCACGATGAAGGGATCCCAAGAAAAATATGTTTCCA
TCGAAGATGAAAACGAAGGTTGGACCCTTGTCGTTCGTCGCAAAAAGCAAAAGCAAAGTTACGCACGGAAAGAGTCCCGCCTATTTCGAGACAATAAAAGAAAGGCTAAG
TCTCAAAAGAAGAAAGGAAAAAAGAAGTCAAGGAGGTCAAAGCCTGTCATGGAAGAAAGTGAAGATTTCTTTTGTCCTCCACAACCCATAACTTTGGCAGAATACTTCCC
AAGGCGCTTTCTCGATGATAGTCAAGGAGAGGCACTTGAAACTGTCACGTGTCACATTGTGGACGTGGTGGAAGATGATGATGTCCCTGCTAGCTCCTCGGGAACGGTGG
CAGAGTCACAAGCAGATGCAAGAAAGGAAGCTGTTGAAGATGTGAATACTTCCGACCTGAAAAAGGGTGAAACGTCTACAAGCCTTGTGAAACCTAAGGTTGTAAAGGAT
AAGAAGTGTTCAAATTCACCTATCCTACGATACGTCCCCTTATCTCGACGTAAAAAGGGTGAATCACCTTTCACTGAATGTTCAGAAAGCATAAAGAAGAAACTTCTAAA
GGAAGGTTATAGTCTGCCTACAACAAGAAAAGGGCTTGGATATAAGTCGCCAGAGCCGGTTCGTATAACAAGAAGAGGGAAGGCAAAAGTGGCAGACACAAATCATATAA
CAATAGAGGAGGTTGATGACTCAAAAGAAAAAAAGAGTGTCGACCAACGAACTTCTGTTTTTAGGCGCATCAGGCCACCGGTTGCTCGTGCTTTAGTCTTTCAGAGATTA
AGTGTCAATGAAACGGAAGAAGAGAGTACACAACCTACCAATAGCTCCACTCGACCTTCAGTTTTTCGAAGGTTAAGTATGCCCAGTGAGGAAGAAGAGAGTACATTTTC
AACTCCGAATGTCACTCAACCTTCAGCTTTTCAAAGGTTAAATGTGCCCATGGGGAAAGAAGAGAGTACATTTTCAGCTCCGGAGGTGGCTCGACCATCAGTTTTTCAAA
GGTTAAATGTTACCACGAGAAGAGACAAAGAAGAACAATCTGCTTCATCGATTTCTCACCGACTTCAAACCGGAAAGTTCAAGCAACAAGGAGCCAAGAGAGGGCATTCT
GCCCATCAGCGTCGAGACGCCACTCTTGGAGGGTCTCGACGCTGGTTTTCCTTGATTAGAAGAGGCGGCAGCGTCACAGCGTCGAGACGCTGTGACAATAGCGTCTCGAC
GCTACCGTAA
Protein sequenceShow/hide protein sequence
MQKNRRESKKKKSIWGQMQTSVETLLLERLDASFPFLNTRVLEAASRRCLDSVSTLRQKSALYLIGISGSIAYGRIIEDHTPLAVASSISKLMEESSKDRVAVKDNPLFE
SVIPTSKRSKDTLNLDVMSVLMADIDQDERMAEMERKLNLLMKALIQFGTLDPIVVRFQKEATMKGSQEKYVSIEDENEGWTLVVRRKKQKQSYARKESRLFRDNKRKAK
SQKKKGKKKSRRSKPVMEESEDFFCPPQPITLAEYFPRRFLDDSQGEALETVTCHIVDVVEDDDVPASSSGTVAESQADARKEAVEDVNTSDLKKGETSTSLVKPKVVKD
KKCSNSPILRYVPLSRRKKGESPFTECSESIKKKLLKEGYSLPTTRKGLGYKSPEPVRITRRGKAKVADTNHITIEEVDDSKEKKSVDQRTSVFRRIRPPVARALVFQRL
SVNETEEESTQPTNSSTRPSVFRRLSMPSEEEESTFSTPNVTQPSAFQRLNVPMGKEESTFSAPEVARPSVFQRLNVTTRRDKEEQSASSISHRLQTGKFKQQGAKRGHS
AHQRRDATLGGSRRWFSLIRRGGSVTASRRCDNSVSTLP