; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg031240 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg031240
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRT_RNaseH_2 domain-containing protein
Genome locationscaffold2:36390371..36394177
RNA-Seq ExpressionSpg031240
SyntenySpg031240
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8695166.1 hypothetical protein F3Y22_tig00110733pilonHSYRG00282 [Hibiscus syriacus]4.5e-2127.25Show/hide
Query:  FINHFARAKYINMLKRDFLFERGF------SGDLPRFLTAGITNHGWELFCSKPEPVNSQVVREFYANIDEEEGFQVIVRRIVLRFVWVLNKGAHPLLAR
        F++  A+  Y  +  R   FE GF      + +L   +   +T H W+ F   P PVN+ +V+EFY+NI E     V+VR I +RF              
Subjt:  FINHFARAKYINMLKRDFLFERGF------SGDLPRFLTAGITNHGWELFCSKPEPVNSQVVREFYANIDEEEGFQVIVRRIVLRFVWVLNKGAHPLLAR

Query:  EELLFIDWIINRLFIRGISGPLGPPASSLDPTIQASRIQSIAGDL----IRLVVSVRRSKNVDRSIVLYFAFYHATYLKKEANTWMGFIKQRLLPTTHDS
                 INR F   ++       +S    +     Q I  DL     R      + K VDR  +L                W  F+K +L+PT+H++
Subjt:  EELLFIDWIINRLFIRGISGPLGPPASSLDPTIQASRIQSIAGDL----IRLVVSVRRSKNVDRSIVLYFAFYHATYLKKEANTWMGFIKQRLLPTTHDS

Query:  TVSRERVLLAFAILRSLSIDVGKIIAGEIFGCWRKKVGKLFFPNTISMLCKRAGVPESAEDVILLDKGVIDTPNLARLQRTQEARQGGLVCGIHQILEQL
        TVS +R+LL  +IL   +ID+GKII      C +++   L FPN I+ LC++  V E   D IL     ++   +  L   +EA+         ++    
Subjt:  TVSRERVLLAFAILRSLSIDVGKIIAGEIFGCWRKKVGKLFFPNTISMLCKRAGVPESAEDVILLDKGVIDTPNLARLQRTQEARQGGLVCGIHQILEQL

Query:  TLSASRQEFVERQAHT-------------FWNYVKRRDAALRKAL
         + AS  +  +    T             ++ Y KRRDA L  AL
Subjt:  TLSASRQEFVERQAHT-------------FWNYVKRRDAALRKAL

KAF4375842.1 hypothetical protein G4B88_026421 [Cannabis sativa]5.4e-1925.82Show/hide
Query:  GKGIAGAEVE----AEVGRPEEGRLSYERFINHFARAKYINMLK-RDFLFERGF------SGDLPRFLTAGITNHGWELFCSKPEPVNSQVVREFYAN-I
        GK I G++ +      +  P +     +   +   + KY+N ++ ++F  +RG        G +P +L   I    W   C  P     QVV+EFYAN +
Subjt:  GKGIAGAEVE----AEVGRPEEGRLSYERFINHFARAKYINMLK-RDFLFERGF------SGDLPRFLTAGITNHGWELFCSKPEPVNSQVVREFYAN-I

Query:  DEEEGFQVIVRRIVLRFVWVLNKGAHPLLAREELLFIDWIINRLFIRGISGPLGPPASSLDPTIQASRIQSIAGDLIRLVVSVRRSKNVDRSIVLYFAFY
          E   ++ VR + ++                   F D  IN  ++              + T + S+      D+ +L+  +   + V R        +
Subjt:  DEEEGFQVIVRRIVLRFVWVLNKGAHPLLAREELLFIDWIINRLFIRGISGPLGPPASSLDPTIQASRIQSIAGDLIRLVVSVRRSKNVDRSIVLYFAFY

Query:  HATYLKKEANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIAGEIFGCWRKKVGKLFFPNTISMLCKRAGVPESAEDVILLDKGVID-T
          T LK++      F++  LLPT+HDSTVSRER+ + + I++   I+VGK+IA EIF C  +  GKLFF   I+  C+ A VP   ++  +  KGV+   
Subjt:  HATYLKKEANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIAGEIFGCWRKKVGKLFFPNTISMLCKRAGVPESAEDVILLDKGVID-T

Query:  PNLARLQRTQEARQGGLVCGIHQILEQLTLSASRQEFVERQAHTFWNYVKRRDAALRKALQANF
        P+ A  + T              + E+L+   + Q+ +  +  T WNY + RD  + + L+ N+
Subjt:  PNLARLQRTQEARQGGLVCGIHQILEQLTLSASRQEFVERQAHTFWNYVKRRDAALRKALQANF

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]4.9e-2030.53Show/hide
Query:  RFINHFARAKYINMLK-RDFLFERGF-------SGDLPRFLTAGITNHGWELFCSKPEPVNSQVVREFYANIDEEEGFQVIVRRIVLRFVWVLNKGAHPL
        +F    A  +Y N ++ R    E+GF        G LP F+   IT H W+ FC+ PE     +VREFYAN+ +     V VR +               
Subjt:  RFINHFARAKYINMLK-RDFLFERGF-------SGDLPRFLTAGITNHGWELFCSKPEPVNSQVVREFYANIDEEEGFQVIVRRIVLRFVWVLNKGAHPL

Query:  LAREELLFIDWIINRLFIRGISGPLGPPASSLDPTIQASRI---QSIAGDLIRLVVSVRRSKNVDRSIVLYFAFYHATYLKKEANTWMGFIKQRLLPTTH
            ++ + +  IN +F  G+  P+   +  ++   +   I   +++A       VS + +    RS            L   A  W  F+K  LLPTTH
Subjt:  LAREELLFIDWIINRLFIRGISGPLGPPASSLDPTIQASRI---QSIAGDLIRLVVSVRRSKNVDRSIVLYFAFYHATYLKKEANTWMGFIKQRLLPTTH

Query:  DSTVSRERVLLAFAILRSLSIDVGKIIAGEIFGCWRKKVGKLFFPNTISMLCKRAGVPESAEDVILLDKGVIDTPNLARLQRTQE
          TVS++R+LL  ++L   SI+VG++I  EI  C  +K G LFFP+ I+ LC+ A  P    +  L + G ID   +AR+  TQE
Subjt:  DSTVSRERVLLAFAILRSLSIDVGKIIAGEIFGCWRKKVGKLFFPNTISMLCKRAGVPESAEDVILLDKGVIDTPNLARLQRTQE

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]5.4e-2730.3Show/hide
Query:  RFINHFARAKYINMLK-RDFLFERGF-------SGDLPRFLTAGITNHGWELFCSKPEPVNSQVVREFYANIDEEEGFQVIVRRIVLRFVWVLNKGAHPL
        +F    A  +Y N ++ R    E+GF        G LP F+   IT H W+ FC+ PE     +VREFYAN+ + E   V VR +               
Subjt:  RFINHFARAKYINMLK-RDFLFERGF-------SGDLPRFLTAGITNHGWELFCSKPEPVNSQVVREFYANIDEEEGFQVIVRRIVLRFVWVLNKGAHPL

Query:  LAREELLFIDWIINRLFIRGISGPLGPPASSLDPTIQASRI---QSIAGDLIRLVVSVRRSKNVDRSIVLYFAFYHATYLKKEANTWMGFIKQRLLPTTH
            ++ + +  IN +F  G+  P+   +  +    Q   I   +++A       VS + +    RS            L   A  W  F+K RLLPTTH
Subjt:  LAREELLFIDWIINRLFIRGISGPLGPPASSLDPTIQASRI---QSIAGDLIRLVVSVRRSKNVDRSIVLYFAFYHATYLKKEANTWMGFIKQRLLPTTH

Query:  DSTVSRERVLLAFAILRSLSIDVGKIIAGEIFGCWRKKVGKLFFPNTISMLCKRAGVPESAEDVILLDKGVIDTPNLARL------QRTQEA--------
          TVS++R+LL  ++L   SI+VG++I  EI  C  +K G LFFP+ I+ LC+ A  P    +  L + G ID   +AR+      + TQ+         
Subjt:  DSTVSRERVLLAFAILRSLSIDVGKIIAGEIFGCWRKKVGKLFFPNTISMLCKRAGVPESAEDVILLDKGVIDTPNLARL------QRTQEA--------

Query:  ---RQGGLVCGIHQILEQLTLSASRQEF--------VERQAHTFWNYVKRRDAALRKALQANF
           R  G +    + LEQ       Q++          +Q   FW Y K RD AL+KALQ NF
Subjt:  ---RQGGLVCGIHQILEQLTLSASRQEF--------VERQAHTFWNYVKRRDAALRKALQANF

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]1.1e-1930.1Show/hide
Query:  VVREFYANIDEEEGFQVIVRRIVLRFVWVLNKGAHPLLAREELLFIDWIINRLFIRGISGPLGPPASSLDPTIQASRI---QSIAGDLIRLVVSVRRSKN
        +VREFYAN+ + E   + VR +                   ++ + +  IN +F  G+  P+   +  ++   +   I   +++A       VS + +  
Subjt:  VVREFYANIDEEEGFQVIVRRIVLRFVWVLNKGAHPLLAREELLFIDWIINRLFIRGISGPLGPPASSLDPTIQASRI---QSIAGDLIRLVVSVRRSKN

Query:  VDRSIVLYFAFYHATYLKKEANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIAGEIFGCWRKKVGKLFFPNTISMLCKRAGVPESAED
          RS            L   A  W  F+K RLLPTTH   VS++R+LL  ++L   SI+VG++I  EI  C  +K G LFFP+ I+ LC+ A  P    +
Subjt:  VDRSIVLYFAFYHATYLKKEANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIAGEIFGCWRKKVGKLFFPNTISMLCKRAGVPESAED

Query:  VILLDKGVIDTPNLARL--------------QRTQEARQGGLVCGIHQILEQLTLSASRQEFVERQAHTFWNYVKRRDAALRKALQANF
          L + G ID   +AR+               R   A        + Q L+ L    S+QE   +Q   FW Y K RD AL+KALQ NF
Subjt:  VILLDKGVIDTPNLARL--------------QRTQEARQGGLVCGIHQILEQLTLSASRQEFVERQAHTFWNYVKRRDAALRKALQANF

TrEMBL top hitse value%identityAlignment
A0A2P5BCG4 Uncharacterized protein (Fragment)2.6e-2730.3Show/hide
Query:  RFINHFARAKYINMLK-RDFLFERGF-------SGDLPRFLTAGITNHGWELFCSKPEPVNSQVVREFYANIDEEEGFQVIVRRIVLRFVWVLNKGAHPL
        +F    A  +Y N ++ R    E+GF        G LP F+   IT H W+ FC+ PE     +VREFYAN+ + E   V VR +               
Subjt:  RFINHFARAKYINMLK-RDFLFERGF-------SGDLPRFLTAGITNHGWELFCSKPEPVNSQVVREFYANIDEEEGFQVIVRRIVLRFVWVLNKGAHPL

Query:  LAREELLFIDWIINRLFIRGISGPLGPPASSLDPTIQASRI---QSIAGDLIRLVVSVRRSKNVDRSIVLYFAFYHATYLKKEANTWMGFIKQRLLPTTH
            ++ + +  IN +F  G+  P+   +  +    Q   I   +++A       VS + +    RS            L   A  W  F+K RLLPTTH
Subjt:  LAREELLFIDWIINRLFIRGISGPLGPPASSLDPTIQASRI---QSIAGDLIRLVVSVRRSKNVDRSIVLYFAFYHATYLKKEANTWMGFIKQRLLPTTH

Query:  DSTVSRERVLLAFAILRSLSIDVGKIIAGEIFGCWRKKVGKLFFPNTISMLCKRAGVPESAEDVILLDKGVIDTPNLARL------QRTQEA--------
          TVS++R+LL  ++L   SI+VG++I  EI  C  +K G LFFP+ I+ LC+ A  P    +  L + G ID   +AR+      + TQ+         
Subjt:  DSTVSRERVLLAFAILRSLSIDVGKIIAGEIFGCWRKKVGKLFFPNTISMLCKRAGVPESAEDVILLDKGVIDTPNLARL------QRTQEA--------

Query:  ---RQGGLVCGIHQILEQLTLSASRQEF--------VERQAHTFWNYVKRRDAALRKALQANF
           R  G +    + LEQ       Q++          +Q   FW Y K RD AL+KALQ NF
Subjt:  ---RQGGLVCGIHQILEQLTLSASRQEF--------VERQAHTFWNYVKRRDAALRKALQANF

A0A2P5CEY2 Uncharacterized protein7.7e-1936.41Show/hide
Query:  LKKEANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIAGEIFGCWRKKVGKLFFPNTISMLCKRAGVPESAEDVILLDKGVIDTPNLAR
        L   A  W  F+K RLLPTTH  TVS++R+LL +++L   SI+VG++I  EI  C  +K G LFFP+ I+ LC+ A  P    +  L   G ID   +AR
Subjt:  LKKEANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIAGEIFGCWRKKVGKLFFPNTISMLCKRAGVPESAEDVILLDKGVIDTPNLAR

Query:  L------QRTQE-----------ARQGGLVCGIHQILEQLTLSASRQEF--------VERQAHTFWNYVKRRDAALRKALQANF
        +      + TQ+           +R  G +    + LEQ       Q++          +Q   FW Y K RD AL+KALQ NF
Subjt:  L------QRTQE-----------ARQGGLVCGIHQILEQLTLSASRQEF--------VERQAHTFWNYVKRRDAALRKALQANF

A0A2P5DXM3 Uncharacterized protein5.3e-2030.1Show/hide
Query:  VVREFYANIDEEEGFQVIVRRIVLRFVWVLNKGAHPLLAREELLFIDWIINRLFIRGISGPLGPPASSLDPTIQASRI---QSIAGDLIRLVVSVRRSKN
        +VREFYAN+ + E   + VR +                   ++ + +  IN +F  G+  P+   +  ++   +   I   +++A       VS + +  
Subjt:  VVREFYANIDEEEGFQVIVRRIVLRFVWVLNKGAHPLLAREELLFIDWIINRLFIRGISGPLGPPASSLDPTIQASRI---QSIAGDLIRLVVSVRRSKN

Query:  VDRSIVLYFAFYHATYLKKEANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIAGEIFGCWRKKVGKLFFPNTISMLCKRAGVPESAED
          RS            L   A  W  F+K RLLPTTH   VS++R+LL  ++L   SI+VG++I  EI  C  +K G LFFP+ I+ LC+ A  P    +
Subjt:  VDRSIVLYFAFYHATYLKKEANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIAGEIFGCWRKKVGKLFFPNTISMLCKRAGVPESAED

Query:  VILLDKGVIDTPNLARL--------------QRTQEARQGGLVCGIHQILEQLTLSASRQEFVERQAHTFWNYVKRRDAALRKALQANF
          L + G ID   +AR+               R   A        + Q L+ L    S+QE   +Q   FW Y K RD AL+KALQ NF
Subjt:  VILLDKGVIDTPNLARL--------------QRTQEARQGGLVCGIHQILEQLTLSASRQEFVERQAHTFWNYVKRRDAALRKALQANF

A0A6A2ZUE4 Uncharacterized protein2.2e-2127.25Show/hide
Query:  FINHFARAKYINMLKRDFLFERGF------SGDLPRFLTAGITNHGWELFCSKPEPVNSQVVREFYANIDEEEGFQVIVRRIVLRFVWVLNKGAHPLLAR
        F++  A+  Y  +  R   FE GF      + +L   +   +T H W+ F   P PVN+ +V+EFY+NI E     V+VR I +RF              
Subjt:  FINHFARAKYINMLKRDFLFERGF------SGDLPRFLTAGITNHGWELFCSKPEPVNSQVVREFYANIDEEEGFQVIVRRIVLRFVWVLNKGAHPLLAR

Query:  EELLFIDWIINRLFIRGISGPLGPPASSLDPTIQASRIQSIAGDL----IRLVVSVRRSKNVDRSIVLYFAFYHATYLKKEANTWMGFIKQRLLPTTHDS
                 INR F   ++       +S    +     Q I  DL     R      + K VDR  +L                W  F+K +L+PT+H++
Subjt:  EELLFIDWIINRLFIRGISGPLGPPASSLDPTIQASRIQSIAGDL----IRLVVSVRRSKNVDRSIVLYFAFYHATYLKKEANTWMGFIKQRLLPTTHDS

Query:  TVSRERVLLAFAILRSLSIDVGKIIAGEIFGCWRKKVGKLFFPNTISMLCKRAGVPESAEDVILLDKGVIDTPNLARLQRTQEARQGGLVCGIHQILEQL
        TVS +R+LL  +IL   +ID+GKII      C +++   L FPN I+ LC++  V E   D IL     ++   +  L   +EA+         ++    
Subjt:  TVSRERVLLAFAILRSLSIDVGKIIAGEIFGCWRKKVGKLFFPNTISMLCKRAGVPESAEDVILLDKGVIDTPNLARLQRTQEARQGGLVCGIHQILEQL

Query:  TLSASRQEFVERQAHT-------------FWNYVKRRDAALRKAL
         + AS  +  +    T             ++ Y KRRDA L  AL
Subjt:  TLSASRQEFVERQAHT-------------FWNYVKRRDAALRKAL

A0A7J6FZ22 Uncharacterized protein2.6e-1925.82Show/hide
Query:  GKGIAGAEVE----AEVGRPEEGRLSYERFINHFARAKYINMLK-RDFLFERGF------SGDLPRFLTAGITNHGWELFCSKPEPVNSQVVREFYAN-I
        GK I G++ +      +  P +     +   +   + KY+N ++ ++F  +RG        G +P +L   I    W   C  P     QVV+EFYAN +
Subjt:  GKGIAGAEVE----AEVGRPEEGRLSYERFINHFARAKYINMLK-RDFLFERGF------SGDLPRFLTAGITNHGWELFCSKPEPVNSQVVREFYAN-I

Query:  DEEEGFQVIVRRIVLRFVWVLNKGAHPLLAREELLFIDWIINRLFIRGISGPLGPPASSLDPTIQASRIQSIAGDLIRLVVSVRRSKNVDRSIVLYFAFY
          E   ++ VR + ++                   F D  IN  ++              + T + S+      D+ +L+  +   + V R        +
Subjt:  DEEEGFQVIVRRIVLRFVWVLNKGAHPLLAREELLFIDWIINRLFIRGISGPLGPPASSLDPTIQASRIQSIAGDLIRLVVSVRRSKNVDRSIVLYFAFY

Query:  HATYLKKEANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIAGEIFGCWRKKVGKLFFPNTISMLCKRAGVPESAEDVILLDKGVID-T
          T LK++      F++  LLPT+HDSTVSRER+ + + I++   I+VGK+IA EIF C  +  GKLFF   I+  C+ A VP   ++  +  KGV+   
Subjt:  HATYLKKEANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIAGEIFGCWRKKVGKLFFPNTISMLCKRAGVPESAEDVILLDKGVID-T

Query:  PNLARLQRTQEARQGGLVCGIHQILEQLTLSASRQEFVERQAHTFWNYVKRRDAALRKALQANF
        P+ A  + T              + E+L+   + Q+ +  +  T WNY + RD  + + L+ N+
Subjt:  PNLARLQRTQEARQGGLVCGIHQILEQLTLSASRQEFVERQAHTFWNYVKRRDAALRKALQANF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTCTCTTTTCATTTCATCTTCCCATCGTGCTTTCTTTCTACTCTCTTCCTGTTCTTGCGAAAAGCGAAGAAGAAACAAACGCCGGCAGAAAGGGAAGCCAAGCGCT
TAAGAGGCTACAAAAGGTTGAAGTTTCTGAAGTATTAGAGAAGGTAATCGAGGACATTGCAGAGGAAGTAGTCGCTGAGGAAGAACCGAAAGATTCAGAAAAAGATTCTG
AGAAGATTGTTGAAACAAAGCAAGAGAGTCAAGAAAAGGCAGTAATAGATTTGCAAGGGCAAGAAGATGAAACCGAGAAGAAGAATAGTGAAGATAAGGGCAAAGGTATT
GCAGGAGCTGAGGTTGAAGCGGAAGTGGGAAGGCCAGAGGAAGGGAGGTTGTCGTATGAGCGCTTCATCAATCATTTTGCTAGGGCAAAATATATTAACATGTTGAAGAG
AGATTTCCTGTTCGAGAGAGGGTTCAGTGGTGATCTTCCGCGCTTTCTGACGGCTGGCATTACAAACCATGGCTGGGAGTTATTCTGTTCCAAGCCAGAGCCGGTGAATT
CGCAAGTGGTGCGAGAGTTCTATGCGAACATAGACGAGGAAGAAGGTTTTCAAGTGATTGTTCGAAGGATTGTCCTTAGATTTGTATGGGTCTTGAACAAGGGCGCTCAT
CCTCTCCTGGCCCGAGAGGAACTGCTATTTATTGATTGGATCATAAACAGATTGTTCATTAGAGGAATCAGTGGTCCATTAGGTCCACCTGCTAGCTCATTGGATCCCAC
AATCCAAGCCTCAAGGATTCAGAGCATAGCGGGTGATCTTATTCGATTAGTGGTGTCCGTTAGAAGATCAAAGAACGTGGATAGGAGCATTGTGTTGTATTTCGCCTTCT
ACCACGCTACTTACTTGAAGAAGGAAGCCAATACGTGGATGGGGTTTATCAAACAGAGGTTGCTTCCAACGACTCATGACTCCACAGTCTCTCGAGAAAGAGTTCTTTTG
GCTTTTGCAATACTAAGGTCTCTTAGCATCGACGTAGGTAAGATTATTGCTGGTGAAATTTTTGGGTGTTGGAGAAAGAAGGTTGGAAAATTGTTCTTTCCAAACACAAT
TTCAATGCTATGTAAGAGAGCAGGGGTTCCAGAGAGCGCAGAAGATGTGATTTTATTGGACAAGGGAGTAATCGACACGCCTAACTTGGCACGGCTTCAGCGAACGCAAG
AGGCACGTCAAGGAGGGTTAGTCTGTGGCATTCACCAGATTTTAGAACAACTCACACTTTCAGCCAGCAGGCAAGAGTTTGTTGAGAGGCAAGCTCATACCTTCTGGAAT
TATGTTAAAAGACGTGATGCCGCATTGAGGAAGGCACTTCAAGCAAACTTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGTTTCTCTTTTCATTTCATCTTCCCATCGTGCTTTCTTTCTACTCTCTTCCTGTTCTTGCGAAAAGCGAAGAAGAAACAAACGCCGGCAGAAAGGGAAGCCAAGCGCT
TAAGAGGCTACAAAAGGTTGAAGTTTCTGAAGTATTAGAGAAGGTAATCGAGGACATTGCAGAGGAAGTAGTCGCTGAGGAAGAACCGAAAGATTCAGAAAAAGATTCTG
AGAAGATTGTTGAAACAAAGCAAGAGAGTCAAGAAAAGGCAGTAATAGATTTGCAAGGGCAAGAAGATGAAACCGAGAAGAAGAATAGTGAAGATAAGGGCAAAGGTATT
GCAGGAGCTGAGGTTGAAGCGGAAGTGGGAAGGCCAGAGGAAGGGAGGTTGTCGTATGAGCGCTTCATCAATCATTTTGCTAGGGCAAAATATATTAACATGTTGAAGAG
AGATTTCCTGTTCGAGAGAGGGTTCAGTGGTGATCTTCCGCGCTTTCTGACGGCTGGCATTACAAACCATGGCTGGGAGTTATTCTGTTCCAAGCCAGAGCCGGTGAATT
CGCAAGTGGTGCGAGAGTTCTATGCGAACATAGACGAGGAAGAAGGTTTTCAAGTGATTGTTCGAAGGATTGTCCTTAGATTTGTATGGGTCTTGAACAAGGGCGCTCAT
CCTCTCCTGGCCCGAGAGGAACTGCTATTTATTGATTGGATCATAAACAGATTGTTCATTAGAGGAATCAGTGGTCCATTAGGTCCACCTGCTAGCTCATTGGATCCCAC
AATCCAAGCCTCAAGGATTCAGAGCATAGCGGGTGATCTTATTCGATTAGTGGTGTCCGTTAGAAGATCAAAGAACGTGGATAGGAGCATTGTGTTGTATTTCGCCTTCT
ACCACGCTACTTACTTGAAGAAGGAAGCCAATACGTGGATGGGGTTTATCAAACAGAGGTTGCTTCCAACGACTCATGACTCCACAGTCTCTCGAGAAAGAGTTCTTTTG
GCTTTTGCAATACTAAGGTCTCTTAGCATCGACGTAGGTAAGATTATTGCTGGTGAAATTTTTGGGTGTTGGAGAAAGAAGGTTGGAAAATTGTTCTTTCCAAACACAAT
TTCAATGCTATGTAAGAGAGCAGGGGTTCCAGAGAGCGCAGAAGATGTGATTTTATTGGACAAGGGAGTAATCGACACGCCTAACTTGGCACGGCTTCAGCGAACGCAAG
AGGCACGTCAAGGAGGGTTAGTCTGTGGCATTCACCAGATTTTAGAACAACTCACACTTTCAGCCAGCAGGCAAGAGTTTGTTGAGAGGCAAGCTCATACCTTCTGGAAT
TATGTTAAAAGACGTGATGCCGCATTGAGGAAGGCACTTCAAGCAAACTTTTAA
Protein sequenceShow/hide protein sequence
MFLFSFHLPIVLSFYSLPVLAKSEEETNAGRKGSQALKRLQKVEVSEVLEKVIEDIAEEVVAEEEPKDSEKDSEKIVETKQESQEKAVIDLQGQEDETEKKNSEDKGKGI
AGAEVEAEVGRPEEGRLSYERFINHFARAKYINMLKRDFLFERGFSGDLPRFLTAGITNHGWELFCSKPEPVNSQVVREFYANIDEEEGFQVIVRRIVLRFVWVLNKGAH
PLLAREELLFIDWIINRLFIRGISGPLGPPASSLDPTIQASRIQSIAGDLIRLVVSVRRSKNVDRSIVLYFAFYHATYLKKEANTWMGFIKQRLLPTTHDSTVSRERVLL
AFAILRSLSIDVGKIIAGEIFGCWRKKVGKLFFPNTISMLCKRAGVPESAEDVILLDKGVIDTPNLARLQRTQEARQGGLVCGIHQILEQLTLSASRQEFVERQAHTFWN
YVKRRDAALRKALQANF