; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg026703 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg026703
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationscaffold6:18664736..18666677
RNA-Seq ExpressionSpg026703
SyntenySpg026703
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8718449.1 hypothetical protein F3Y22_tig00110013pilonHSYRG00240 [Hibiscus syriacus]3.4e-2327.09Show/hide
Query:  LSYDRFVNNLARAKYAELLKRDFLFERGF------SGDLPHFLRTGIADHGWEWFCSKPESVNAQVVHEFYANIDKEEGFLAIVRVREVG----------
        +++ +F N+ A+A++     R+  FE GF       G     +   +    W  F   P SVNA +V EFYANI K       VR +++           
Subjt:  LSYDRFVNNLARAKYAELLKRDFLFERGF------SGDLPHFLRTGIADHGWEWFCSKPESVNAQVVHEFYANIDKEEGFLAIVRVREVG----------

Query:  -----------------------------IEGAQWWLSKTEKRTFQSAYLKREANTWMGFIKQRLLPTTHDSTVSRERMLLAFAILRSLSIDVGKIIADE
                                      E  +W   +T + +     L+  A  W  F+K +L+PT+H++TVS  R+LL  +++ S  IDVG+II  +
Subjt:  -----------------------------IEGAQWWLSKTEKRTFQSAYLKREANTWMGFIKQRLLPTTHDSTVSRERMLLAFAILRSLSIDVGKIIADE

Query:  ISGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIDTPNLARLQRTQEARQGGLVY-------DINTILEQLAL-SASRQEFAERQAL-----
        +  C  KK   L FPN IT LC++  V EN  D IL     I    L  L   +  +    V+       + N  +  LAL  A  Q  A+  AL     
Subjt:  ISGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIDTPNLARLQRTQEARQGGLVY-------DINTILEQLAL-SASRQEFAERQAL-----

Query:  TFWNYVRTRDANLKKALQENFSKPFPALPAFPEDLLNPWIPPPPVEREGDGEEDPGQE
         F+ YV+ RD  ++   QE         P FP+++L  +      E E D  + P  +
Subjt:  TFWNYVRTRDANLKKALQENFSKPFPALPAFPEDLLNPWIPPPPVEREGDGEEDPGQE

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]6.4e-2232.02Show/hide
Query:  RFVNNLARAKYA-ELLKRDFLFERGF-------SGDLPHFLRTGIADHGWEWFCSKPESVNAQVVHEFYAN-----------------------------
        +F    A  +Y   +  R    E+GF        G LP F+   I  H W+ FC+ PE     +V EFYAN                             
Subjt:  RFVNNLARAKYA-ELLKRDFLFERGF-------SGDLPHFLRTGIADHGWEWFCSKPESVNAQVVHEFYAN-----------------------------

Query:  ---IDKEEGFL-------AIVRVREVGIEGAQWWLSKTEKRTFQSAYLKREANTWMGFIKQRLLPTTHDSTVSRERMLLAFAILRSLSIDVGKIIADEIS
           +D+   F+        I  +  V + GA+W +S     T   + L   A  W  F+K  LLPTTH  TVS++RMLL  ++L   SI+VG++I  EI 
Subjt:  ---IDKEEGFL-------AIVRVREVGIEGAQWWLSKTEKRTFQSAYLKREANTWMGFIKQRLLPTTHDSTVSRERMLLAFAILRSLSIDVGKIIADEIS

Query:  GCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIDTPNLARLQRTQE
         C  +K G LFFP+ IT LC+ A  P    +  L + G ID   +AR+  TQE
Subjt:  GCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIDTPNLARLQRTQE

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]6.8e-3229.22Show/hide
Query:  ESQLSYDRFVNNLARAKYAELLKRDFLFERGF-------SGDLPHFLRTGIADHGWEWFCSKPESVNAQVVHEFYANIDKEEGFLAIVR-----------
        E++ +  R+ NN+          R    E+GF        G LP F+   I  H W+ FC+ PE     +V EFYAN+   E     VR           
Subjt:  ESQLSYDRFVNNLARAKYAELLKRDFLFERGF-------SGDLPHFLRTGIADHGWEWFCSKPESVNAQVVHEFYANIDKEEGFLAIVR-----------

Query:  ----------------------------VREVGIEGAQWWLSKTEKRTFQSAYLKREANTWMGFIKQRLLPTTHDSTVSRERMLLAFAILRSLSIDVGKI
                                    +  V   GA+W +S     T   + L   A  W  F+K RLLPTTH  TVS++RMLL  ++L   SI+VG++
Subjt:  ----------------------------VREVGIEGAQWWLSKTEKRTFQSAYLKREANTWMGFIKQRLLPTTHDSTVSRERMLLAFAILRSLSIDVGKI

Query:  IADEISGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIDTPNLARLQR---TQEARQ---------------GGLVYDINTILEQLALSASR
        I  EI  C  +K G LFFP+ IT LC+ A  P    +  L + G ID   +AR+ +   T+  +Q               G ++  +  + ++L+    +
Subjt:  IADEISGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIDTPNLARLQR---TQEARQ---------------GGLVYDINTILEQLALSASR

Query:  Q-------EFAERQALTFWNYVRTRDANLKKALQENFSKPFPALPAFPEDLLNPWIPPPPVEREGDGEEDPGQ
        Q       +   +Q   FW Y + RD  LKKALQ NF++P P  PAFP+++L         E + DG  +  +
Subjt:  Q-------EFAERQALTFWNYVRTRDANLKKALQENFSKPFPALPAFPEDLLNPWIPPPPVEREGDGEEDPGQ

PON59596.1 hypothetical protein PanWU01x14_158080 [Parasponia andersonii]4.0e-2434.4Show/hide
Query:  ANTWMGFIKQRLLPTTHDSTVSRERMLLAFAILRSLSIDVGKIIADEISGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIDTPNLARLQRT
        A  W  F+K RLLPTTH  TVS++RMLL +++L   SI+VG++I  EI  C  +K G LFFP+ IT LC+ A  P    +  L   G ID   +AR+  T
Subjt:  ANTWMGFIKQRLLPTTHDSTVSRERMLLAFAILRSLSIDVGKIIADEISGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIDTPNLARLQRT

Query:  QEAR--------------------QGGLVYDINTILEQLALSASRQ-------EFAERQALTFWNYVRTRDANLKKALQENFSKPFPALPAFPEDLLNPW
        QE +                     G ++  +  + ++L+    +Q       +   +Q   FW Y + RD  LKKALQ NF++P P  P FP++LL   
Subjt:  QEAR--------------------QGGLVYDINTILEQLALSASRQ-------EFAERQALTFWNYVRTRDANLKKALQENFSKPFPALPAFPEDLLNPW

Query:  IPPPPVEREGDGEEDPGQ
              E + DG  +  +
Subjt:  IPPPPVEREGDGEEDPGQ

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]4.6e-2835.86Show/hide
Query:  EFYANIDKEEGFLAIVRVREVGIEGAQWWLSKTEKRTFQSAYLKREANTWMGFIKQRLLPTTHDSTVSRERMLLAFAILRSLSIDVGKIIADEISGCWKK
        EF  NI + E    I  +  V   GA+W +S     T   + L   A  W  F+K RLLPTTH   VS++RMLL  ++L   SI+VG++I  EI  C  +
Subjt:  EFYANIDKEEGFLAIVRVREVGIEGAQWWLSKTEKRTFQSAYLKREANTWMGFIKQRLLPTTHDSTVSRERMLLAFAILRSLSIDVGKIIADEISGCWKK

Query:  KVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIDTPNLARL--------------QRTQEARQGGLVYDINTILEQLALSASRQEFAERQALTFWNYV
        K G LFFP+ IT LC+ A    NE    L + G ID   +AR+               R   A       D+   L+ L    S+QE   +Q   FW Y 
Subjt:  KVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIDTPNLARL--------------QRTQEARQGGLVYDINTILEQLALSASRQEFAERQALTFWNYV

Query:  RTRDANLKKALQENFSKPFPALPAFPEDLLNPWIPPPPVEREGDGEEDPGQ
        + RD  LKKALQ NF++P P  PAFP+++L         E + DG  +  +
Subjt:  RTRDANLKKALQENFSKPFPALPAFPEDLLNPWIPPPPVEREGDGEEDPGQ

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)3.1e-2232.02Show/hide
Query:  RFVNNLARAKYA-ELLKRDFLFERGF-------SGDLPHFLRTGIADHGWEWFCSKPESVNAQVVHEFYAN-----------------------------
        +F    A  +Y   +  R    E+GF        G LP F+   I  H W+ FC+ PE     +V EFYAN                             
Subjt:  RFVNNLARAKYA-ELLKRDFLFERGF-------SGDLPHFLRTGIADHGWEWFCSKPESVNAQVVHEFYAN-----------------------------

Query:  ---IDKEEGFL-------AIVRVREVGIEGAQWWLSKTEKRTFQSAYLKREANTWMGFIKQRLLPTTHDSTVSRERMLLAFAILRSLSIDVGKIIADEIS
           +D+   F+        I  +  V + GA+W +S     T   + L   A  W  F+K  LLPTTH  TVS++RMLL  ++L   SI+VG++I  EI 
Subjt:  ---IDKEEGFL-------AIVRVREVGIEGAQWWLSKTEKRTFQSAYLKREANTWMGFIKQRLLPTTHDSTVSRERMLLAFAILRSLSIDVGKIIADEIS

Query:  GCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIDTPNLARLQRTQE
         C  +K G LFFP+ IT LC+ A  P    +  L + G ID   +AR+  TQE
Subjt:  GCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIDTPNLARLQRTQE

A0A2P5BCG4 Uncharacterized protein (Fragment)3.3e-3229.22Show/hide
Query:  ESQLSYDRFVNNLARAKYAELLKRDFLFERGF-------SGDLPHFLRTGIADHGWEWFCSKPESVNAQVVHEFYANIDKEEGFLAIVR-----------
        E++ +  R+ NN+          R    E+GF        G LP F+   I  H W+ FC+ PE     +V EFYAN+   E     VR           
Subjt:  ESQLSYDRFVNNLARAKYAELLKRDFLFERGF-------SGDLPHFLRTGIADHGWEWFCSKPESVNAQVVHEFYANIDKEEGFLAIVR-----------

Query:  ----------------------------VREVGIEGAQWWLSKTEKRTFQSAYLKREANTWMGFIKQRLLPTTHDSTVSRERMLLAFAILRSLSIDVGKI
                                    +  V   GA+W +S     T   + L   A  W  F+K RLLPTTH  TVS++RMLL  ++L   SI+VG++
Subjt:  ----------------------------VREVGIEGAQWWLSKTEKRTFQSAYLKREANTWMGFIKQRLLPTTHDSTVSRERMLLAFAILRSLSIDVGKI

Query:  IADEISGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIDTPNLARLQR---TQEARQ---------------GGLVYDINTILEQLALSASR
        I  EI  C  +K G LFFP+ IT LC+ A  P    +  L + G ID   +AR+ +   T+  +Q               G ++  +  + ++L+    +
Subjt:  IADEISGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIDTPNLARLQR---TQEARQ---------------GGLVYDINTILEQLALSASR

Query:  Q-------EFAERQALTFWNYVRTRDANLKKALQENFSKPFPALPAFPEDLLNPWIPPPPVEREGDGEEDPGQ
        Q       +   +Q   FW Y + RD  LKKALQ NF++P P  PAFP+++L         E + DG  +  +
Subjt:  Q-------EFAERQALTFWNYVRTRDANLKKALQENFSKPFPALPAFPEDLLNPWIPPPPVEREGDGEEDPGQ

A0A2P5CEY2 Uncharacterized protein1.9e-2434.4Show/hide
Query:  ANTWMGFIKQRLLPTTHDSTVSRERMLLAFAILRSLSIDVGKIIADEISGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIDTPNLARLQRT
        A  W  F+K RLLPTTH  TVS++RMLL +++L   SI+VG++I  EI  C  +K G LFFP+ IT LC+ A  P    +  L   G ID   +AR+  T
Subjt:  ANTWMGFIKQRLLPTTHDSTVSRERMLLAFAILRSLSIDVGKIIADEISGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIDTPNLARLQRT

Query:  QEAR--------------------QGGLVYDINTILEQLALSASRQ-------EFAERQALTFWNYVRTRDANLKKALQENFSKPFPALPAFPEDLLNPW
        QE +                     G ++  +  + ++L+    +Q       +   +Q   FW Y + RD  LKKALQ NF++P P  P FP++LL   
Subjt:  QEAR--------------------QGGLVYDINTILEQLALSASRQ-------EFAERQALTFWNYVRTRDANLKKALQENFSKPFPALPAFPEDLLNPW

Query:  IPPPPVEREGDGEEDPGQ
              E + DG  +  +
Subjt:  IPPPPVEREGDGEEDPGQ

A0A2P5DXM3 Uncharacterized protein2.2e-2835.86Show/hide
Query:  EFYANIDKEEGFLAIVRVREVGIEGAQWWLSKTEKRTFQSAYLKREANTWMGFIKQRLLPTTHDSTVSRERMLLAFAILRSLSIDVGKIIADEISGCWKK
        EF  NI + E    I  +  V   GA+W +S     T   + L   A  W  F+K RLLPTTH   VS++RMLL  ++L   SI+VG++I  EI  C  +
Subjt:  EFYANIDKEEGFLAIVRVREVGIEGAQWWLSKTEKRTFQSAYLKREANTWMGFIKQRLLPTTHDSTVSRERMLLAFAILRSLSIDVGKIIADEISGCWKK

Query:  KVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIDTPNLARL--------------QRTQEARQGGLVYDINTILEQLALSASRQEFAERQALTFWNYV
        K G LFFP+ IT LC+ A    NE    L + G ID   +AR+               R   A       D+   L+ L    S+QE   +Q   FW Y 
Subjt:  KVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIDTPNLARL--------------QRTQEARQGGLVYDINTILEQLALSASRQEFAERQALTFWNYV

Query:  RTRDANLKKALQENFSKPFPALPAFPEDLLNPWIPPPPVEREGDGEEDPGQ
        + RD  LKKALQ NF++P P  PAFP+++L         E + DG  +  +
Subjt:  RTRDANLKKALQENFSKPFPALPAFPEDLLNPWIPPPPVEREGDGEEDPGQ

A0A6A3BU96 Uncharacterized protein1.6e-2327.09Show/hide
Query:  LSYDRFVNNLARAKYAELLKRDFLFERGF------SGDLPHFLRTGIADHGWEWFCSKPESVNAQVVHEFYANIDKEEGFLAIVRVREVG----------
        +++ +F N+ A+A++     R+  FE GF       G     +   +    W  F   P SVNA +V EFYANI K       VR +++           
Subjt:  LSYDRFVNNLARAKYAELLKRDFLFERGF------SGDLPHFLRTGIADHGWEWFCSKPESVNAQVVHEFYANIDKEEGFLAIVRVREVG----------

Query:  -----------------------------IEGAQWWLSKTEKRTFQSAYLKREANTWMGFIKQRLLPTTHDSTVSRERMLLAFAILRSLSIDVGKIIADE
                                      E  +W   +T + +     L+  A  W  F+K +L+PT+H++TVS  R+LL  +++ S  IDVG+II  +
Subjt:  -----------------------------IEGAQWWLSKTEKRTFQSAYLKREANTWMGFIKQRLLPTTHDSTVSRERMLLAFAILRSLSIDVGKIIADE

Query:  ISGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIDTPNLARLQRTQEARQGGLVY-------DINTILEQLAL-SASRQEFAERQAL-----
        +  C  KK   L FPN IT LC++  V EN  D IL     I    L  L   +  +    V+       + N  +  LAL  A  Q  A+  AL     
Subjt:  ISGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIDTPNLARLQRTQEARQGGLVY-------DINTILEQLAL-SASRQEFAERQAL-----

Query:  TFWNYVRTRDANLKKALQENFSKPFPALPAFPEDLLNPWIPPPPVEREGDGEEDPGQE
         F+ YV+ RD  ++   QE         P FP+++L  +      E E D  + P  +
Subjt:  TFWNYVRTRDANLKKALQENFSKPFPALPAFPEDLLNPWIPPPPVEREGDGEEDPGQE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATATTGCCACGTGTCGCACAGGGATAATCCAACGGTTTAGCCGTTTGACAGCACATAATTATTTACCATGGGCGATTGGAATTCGAGATTTTACCGTTGATGGATT
TAATTTTCAGATTTGCATGAGATTTGAAAGAAAAGAGAGAGATAATGAGGAAGAGGAGGTACCCGTGACCCCCGAAGCACCGAAAGTAAAGGCAAAGAAGAAGAAGACAC
CAGAAGAAAAAGAAGCTAAAAGAAGAAGAAAACAGCAGAGGACTGAAGATCAAGAAGTTGCTCAGAAAGCGGCGGAGGATGTTATTGCGGAAGAAGATCCGAAAGAACCA
GAAGGACGGAATCAAGAGCAGTCTGAGCCAGGAGTTGCAGATACAGAGGAGGTTCGAGAAGAAAATACAGAGGAAGTTCAAGAAAAACAGGCAGAGGATGTGCAAGAAGA
ACAGGCAGAGGTTGAACCTGAAGAAGTTAATGAGCAAAAACAGGAGGCTCGTGTGGAGGTGATCATGCCGGAAGTGCCCAAACGCCGTCGTATAAAGCGAAAAGCGGGCC
GTGTTAAGGTAGTCCGAACTGATACCCCCTCACCTCCAACTACTGATTCTGAAAGAGAGAATGCTGAGAAAGAAGAGCGTGAGAAGAAGGAGGCCGAAGATAAAGCAAGA
GAGGAAGCAGAGAAAAAGGCTGAAGAAGAAAGATTGCGCAAGCAAAGGGCAGACAGGGGCAAGAGTGTTGCTGCGGCATCAGAGGAACCTGATGAAATAGAAGAGTCACA
ATTGTCGTATGATCGCTTTGTCAACAATCTTGCCAGAGCAAAATATGCAGAGTTGCTGAAAAGAGACTTCCTGTTTGAAAGGGGATTTAGTGGTGATCTTCCACATTTTC
TGAGGACCGGTATTGCAGATCACGGGTGGGAATGGTTTTGTTCAAAGCCTGAATCTGTGAATGCGCAGGTGGTGCACGAGTTTTATGCAAATATTGACAAAGAAGAAGGT
TTCCTAGCAATTGTTCGAGTGAGGGAAGTTGGTATTGAAGGGGCGCAGTGGTGGCTTTCGAAAACAGAGAAGAGGACGTTCCAGTCAGCCTATTTGAAGAGGGAAGCAAA
TACTTGGATGGGATTTATCAAACAAAGGTTGCTTCCAACGACTCATGACTCGACGGTTTCTAGGGAACGAATGCTTCTGGCTTTCGCTATTTTGAGGTCTCTCAGTATTG
ATGTGGGAAAAATTATTGCTGATGAAATATCTGGATGTTGGAAGAAGAAAGTGGGGAAGTTGTTTTTCCCGAATACCATTACCATGCTTTGCAAGCGAGCAGGGGTTCCA
GAGAATGAAGGAGATGTGATATTATTTGACAAGGGAATCATTGACACGCCTAACTTGGCGCGGCTTCAGCGTACGCAAGAGGCACGTCAGGGTGGGCTGGTCTACGACAT
CAATACGATTTTAGAACAACTCGCACTTTCGGCCAGCAGGCAGGAGTTTGCCGAGAGGCAAGCTTTAACCTTCTGGAACTATGTTAGAACTCGTGATGCCAATCTGAAGA
AGGCGCTACAGGAGAATTTTTCCAAACCATTTCCAGCCCTTCCAGCATTCCCTGAAGATTTATTGAACCCCTGGATTCCGCCACCGCCTGTCGAGAGAGAAGGAGATGGA
GAAGAAGATCCTGGTCAGGAGGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATATTGCCACGTGTCGCACAGGGATAATCCAACGGTTTAGCCGTTTGACAGCACATAATTATTTACCATGGGCGATTGGAATTCGAGATTTTACCGTTGATGGATT
TAATTTTCAGATTTGCATGAGATTTGAAAGAAAAGAGAGAGATAATGAGGAAGAGGAGGTACCCGTGACCCCCGAAGCACCGAAAGTAAAGGCAAAGAAGAAGAAGACAC
CAGAAGAAAAAGAAGCTAAAAGAAGAAGAAAACAGCAGAGGACTGAAGATCAAGAAGTTGCTCAGAAAGCGGCGGAGGATGTTATTGCGGAAGAAGATCCGAAAGAACCA
GAAGGACGGAATCAAGAGCAGTCTGAGCCAGGAGTTGCAGATACAGAGGAGGTTCGAGAAGAAAATACAGAGGAAGTTCAAGAAAAACAGGCAGAGGATGTGCAAGAAGA
ACAGGCAGAGGTTGAACCTGAAGAAGTTAATGAGCAAAAACAGGAGGCTCGTGTGGAGGTGATCATGCCGGAAGTGCCCAAACGCCGTCGTATAAAGCGAAAAGCGGGCC
GTGTTAAGGTAGTCCGAACTGATACCCCCTCACCTCCAACTACTGATTCTGAAAGAGAGAATGCTGAGAAAGAAGAGCGTGAGAAGAAGGAGGCCGAAGATAAAGCAAGA
GAGGAAGCAGAGAAAAAGGCTGAAGAAGAAAGATTGCGCAAGCAAAGGGCAGACAGGGGCAAGAGTGTTGCTGCGGCATCAGAGGAACCTGATGAAATAGAAGAGTCACA
ATTGTCGTATGATCGCTTTGTCAACAATCTTGCCAGAGCAAAATATGCAGAGTTGCTGAAAAGAGACTTCCTGTTTGAAAGGGGATTTAGTGGTGATCTTCCACATTTTC
TGAGGACCGGTATTGCAGATCACGGGTGGGAATGGTTTTGTTCAAAGCCTGAATCTGTGAATGCGCAGGTGGTGCACGAGTTTTATGCAAATATTGACAAAGAAGAAGGT
TTCCTAGCAATTGTTCGAGTGAGGGAAGTTGGTATTGAAGGGGCGCAGTGGTGGCTTTCGAAAACAGAGAAGAGGACGTTCCAGTCAGCCTATTTGAAGAGGGAAGCAAA
TACTTGGATGGGATTTATCAAACAAAGGTTGCTTCCAACGACTCATGACTCGACGGTTTCTAGGGAACGAATGCTTCTGGCTTTCGCTATTTTGAGGTCTCTCAGTATTG
ATGTGGGAAAAATTATTGCTGATGAAATATCTGGATGTTGGAAGAAGAAAGTGGGGAAGTTGTTTTTCCCGAATACCATTACCATGCTTTGCAAGCGAGCAGGGGTTCCA
GAGAATGAAGGAGATGTGATATTATTTGACAAGGGAATCATTGACACGCCTAACTTGGCGCGGCTTCAGCGTACGCAAGAGGCACGTCAGGGTGGGCTGGTCTACGACAT
CAATACGATTTTAGAACAACTCGCACTTTCGGCCAGCAGGCAGGAGTTTGCCGAGAGGCAAGCTTTAACCTTCTGGAACTATGTTAGAACTCGTGATGCCAATCTGAAGA
AGGCGCTACAGGAGAATTTTTCCAAACCATTTCCAGCCCTTCCAGCATTCCCTGAAGATTTATTGAACCCCTGGATTCCGCCACCGCCTGTCGAGAGAGAAGGAGATGGA
GAAGAAGATCCTGGTCAGGAGGATTGA
Protein sequenceShow/hide protein sequence
MDIATCRTGIIQRFSRLTAHNYLPWAIGIRDFTVDGFNFQICMRFERKERDNEEEEVPVTPEAPKVKAKKKKTPEEKEAKRRRKQQRTEDQEVAQKAAEDVIAEEDPKEP
EGRNQEQSEPGVADTEEVREENTEEVQEKQAEDVQEEQAEVEPEEVNEQKQEARVEVIMPEVPKRRRIKRKAGRVKVVRTDTPSPPTTDSERENAEKEEREKKEAEDKAR
EEAEKKAEEERLRKQRADRGKSVAAASEEPDEIEESQLSYDRFVNNLARAKYAELLKRDFLFERGFSGDLPHFLRTGIADHGWEWFCSKPESVNAQVVHEFYANIDKEEG
FLAIVRVREVGIEGAQWWLSKTEKRTFQSAYLKREANTWMGFIKQRLLPTTHDSTVSRERMLLAFAILRSLSIDVGKIIADEISGCWKKKVGKLFFPNTITMLCKRAGVP
ENEGDVILFDKGIIDTPNLARLQRTQEARQGGLVYDINTILEQLALSASRQEFAERQALTFWNYVRTRDANLKKALQENFSKPFPALPAFPEDLLNPWIPPPPVEREGDG
EEDPGQED