CuGenDBv2

Spg026736 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg026736
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: scaffold6:20516852..20524912
RNA-Seq Expression: Spg026736
Synteny: Spg026736
Gene Ontology terms: NA
InterPro domains: NA
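
The genome location field packs a sequence name and a coordinate range into one string. As a minimal sketch, the snippet below splits that string and computes the genomic span. The names `GenomeLocation` and `parse_location` are hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A parsed 'seqid:start..end' location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a string such as 'scaffold6:20516852..20524912'."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.groups()
    return GenomeLocation(seqid, int(start), int(end))


loc = parse_location("scaffold6:20516852..20524912")
print(loc.seqid, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the span for Spg026736 comes out at 8061 bp.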


Homology
GenBank top hits (e value, % identity, alignment)
KAE8680640.1 hypothetical protein F3Y22_tig00111372pilonHSYRG00020 [Hibiscus syriacus] | e value: 4.6e-26 | % identity: 29.32
Query:  RFVNNLARAKYQEMLKRDFLFERGF-------GNELPRFLRTRIENLGWSQFCAKPEPVNSNFVREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFDLQ
        +F ++ A+A++Q   K+   FE GF       G   P  +   +  L W +F   P  VN++ V+EFYAN+    +  + VRG  + ++P A+   F LQ
Subjt:  RFVNNLARAKYQEMLKRDFLFERGF-------GNELPRFLRTRIENLGWSQFCAKPEPVNSNFVREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFDLQ

Query:  DF--PHAVFNEMVVAPSNDQLSAAVREANT-----------------------WMGFIKLRLLPTTHDSTVSRDRVFLAFAILHSMSIDVGKIISSEILD
        D    HA F E   + + D++   +   NT                       W  F+K +L+PT++++TVS  R+ L  +I  S  IDVG+II  ++ D
Subjt:  DF--PHAVFNEMVVAPSNDQLSAAVREANT-----------------------WMGFIKLRLLPTTHDSTVSRDRVFLAFAILHSMSIDVGKIISSEILD

Query:  CWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARL---------QRTHE----ARQGGLVCGIQQIQELLQLHSSRMEFAERQFQTFW
        C  KK   L FPN IT LCR+  V E+  D ILP    I    L  L         +  HE      Q      +  ++E++    + +     + + F+
Subjt:  CWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARL---------QRTHE----ARQGGLVCGIQQIQELLQLHSSRMEFAERQFQTFW

Query:  DYVKRRD
         YVK RD
Subjt:  DYVKRRD

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii] | e value: 2.6e-29 | % identity: 38.55
Query:  IRFVNNLARAKYQEMLK-RDFLFERGF-------GNELPRFLRTRIENLGWSQFCAKPEPVNSNFVREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFD
        ++F    A  +Y+  ++ R    E+GF         +LP F+   I    W QFCA PE      VREFYANL D  E  V VRGV V WS EA+N +F 
Subjt:  IRFVNNLARAKYQEMLK-RDFLFERGF-------GNELPRFLRTRIENLGWSQFCAKPEPVNSNFVREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFD

Query:  LQD--FPHAVFNE-------------MVVAPSNDQLSA-----AVREANT-----WMGFIKLRLLPTTHDSTVSRDRVFLAFAILHSMSIDVGKIISSEI
        L D    H+ F E             + VA +   +SA      +R A T     W  F+K  LLPTTH  TVS+DR+ L  ++L   SI+VG++I SEI
Subjt:  LQD--FPHAVFNE-------------MVVAPSNDQLSA-----AVREANT-----WMGFIKLRLLPTTHDSTVSRDRVFLAFAILHSMSIDVGKIISSEI

Query:  LDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARL
          C  +K G LFFP+ IT LCR A  P   ++  L + G ID   +AR+
Subjt:  LDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARL

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii] | e value: 5.1e-41 | % identity: 35.73
Query:  IRFVNNLARAKYQEMLK-RDFLFERGF-------GNELPRFLRTRIENLGWSQFCAKPEPVNSNFVREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFD
        ++F    A  +Y+  ++ R    E+GF         +LP F+   I    W QFCA PE      VREFYANL D EE  V VRGV V WS EA+N +F 
Subjt:  IRFVNNLARAKYQEMLK-RDFLFERGF-------GNELPRFLRTRIENLGWSQFCAKPEPVNSNFVREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFD

Query:  LQD--FPHAVFNEMV-----------VAPSNDQ------------LSAAVREANTWMGFIKLRLLPTTHDSTVSRDRVFLAFAILHSMSIDVGKIISSEI
        L D    H+ F + +           VA +  +             SA    A  W  F+K RLLPTTH  TVS+DR+ L  ++L   SI+VG++I SEI
Subjt:  LQD--FPHAVFNEMV-----------VAPSNDQ------------LSAAVREANTWMGFIKLRLLPTTHDSTVSRDRVFLAFAILHSMSIDVGKIISSEI

Query:  LDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARLQR---THEARQ---------------GGLVCGIQQI------QELLQLH-S
          C  +K G LFFP+ IT LCR A  P   ++  L + G ID   +AR+ +   T   +Q               G ++  ++ +      QE+ Q H  
Subjt:  LDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARLQR---THEARQ---------------GGLVCGIQQI------QELLQLH-S

Query:  SRMEFAERQFQTFWDYVKRRDAALRVALQSNFSEPYPALPVFPEDLL
        S ++   +Q Q FW Y K RD AL+ ALQ+NF+ P P  P FP+++L
Subjt:  SRMEFAERQFQTFWDYVKRRDAALRVALQSNFSEPYPALPVFPEDLL

PON59596.1 hypothetical protein PanWU01x14_158080 [Parasponia andersonii] | e value: 2.1e-26 | % identity: 38.28
Query:  PSNDQLSAAVREANTWMGFIKLRLLPTTHDSTVSRDRVFLAFAILHSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGI
        P N +  A    A  W  F+K RLLPTTH  TVS+DR+ L +++L   SI+VG++I SEI  C  +K G LFFP+ IT LCR A  P   ++  L   G 
Subjt:  PSNDQLSAAVREANTWMGFIKLRLLPTTHDSTVSRDRVFLAFAILHSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGI

Query:  IDTPNLARLQR-----------------THEARQGGLVCGIQQI---------QELLQLH-SSRMEFAERQFQTFWDYVKRRDAALRVALQSNFSEPYPA
        ID   +AR+ +                    +R  G +  +QQ+         QE+ Q H  S ++   +Q Q FW Y K RD AL+ ALQ+NF+ P P 
Subjt:  IDTPNLARLQR-----------------THEARQGGLVCGIQQI---------QELLQLH-SSRMEFAERQFQTFWDYVKRRDAALRVALQSNFSEPYPA

Query:  LPVFPEDLL
         P FP++LL
Subjt:  LPVFPEDLL

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii] | e value: 3.2e-35 | % identity: 38.32
Query:  VREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFDLQD--FPHAVFNEMVVAPSNDQL-----------------------SAAVREANTWMGFIKLRLL
        VREFYANL D EE  + VRGV V WS EA+N +F L D    H+ F E +  P    +                       SA    A  W  F+K RLL
Subjt:  VREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFDLQD--FPHAVFNEMVVAPSNDQL-----------------------SAAVREANTWMGFIKLRLL

Query:  PTTHDSTVSRDRVFLAFAILHSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDT-----------------PNLAR
        PTTH   VS+DR+ L  ++L+  SI+VG++I SEI  C  +K G LFFP+ IT LCR A    +E+   L + G ID                  P+ +R
Subjt:  PTTHDSTVSRDRVFLAFAILHSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDT-----------------PNLAR

Query:  LQRTHEARQGGLVCGIQQIQELLQLHSSRMEFAERQFQTFWDYVKRRDAALRVALQSNFSEPYPALPVFPEDLL
              +R  G V  +QQ++ L Q   S+ E   +Q Q FW Y K RD AL+ ALQ+NF+ P P  P FP+++L
Subjt:  LQRTHEARQGGLVCGIQQIQELLQLHSSRMEFAERQFQTFWDYVKRRDAALRVALQSNFSEPYPALPVFPEDLL

TrEMBL top hits (e value, % identity, alignment)
A0A2P5AGA5 Uncharacterized protein (Fragment) | e value: 1.3e-29 | % identity: 38.55
Query:  IRFVNNLARAKYQEMLK-RDFLFERGF-------GNELPRFLRTRIENLGWSQFCAKPEPVNSNFVREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFD
        ++F    A  +Y+  ++ R    E+GF         +LP F+   I    W QFCA PE      VREFYANL D  E  V VRGV V WS EA+N +F 
Subjt:  IRFVNNLARAKYQEMLK-RDFLFERGF-------GNELPRFLRTRIENLGWSQFCAKPEPVNSNFVREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFD

Query:  LQD--FPHAVFNE-------------MVVAPSNDQLSA-----AVREANT-----WMGFIKLRLLPTTHDSTVSRDRVFLAFAILHSMSIDVGKIISSEI
        L D    H+ F E             + VA +   +SA      +R A T     W  F+K  LLPTTH  TVS+DR+ L  ++L   SI+VG++I SEI
Subjt:  LQD--FPHAVFNE-------------MVVAPSNDQLSA-----AVREANT-----WMGFIKLRLLPTTHDSTVSRDRVFLAFAILHSMSIDVGKIISSEI

Query:  LDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARL
          C  +K G LFFP+ IT LCR A  P   ++  L + G ID   +AR+
Subjt:  LDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARL

A0A2P5BCG4 Uncharacterized protein (Fragment) | e value: 2.5e-41 | % identity: 35.73
Query:  IRFVNNLARAKYQEMLK-RDFLFERGF-------GNELPRFLRTRIENLGWSQFCAKPEPVNSNFVREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFD
        ++F    A  +Y+  ++ R    E+GF         +LP F+   I    W QFCA PE      VREFYANL D EE  V VRGV V WS EA+N +F 
Subjt:  IRFVNNLARAKYQEMLK-RDFLFERGF-------GNELPRFLRTRIENLGWSQFCAKPEPVNSNFVREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFD

Query:  LQD--FPHAVFNEMV-----------VAPSNDQ------------LSAAVREANTWMGFIKLRLLPTTHDSTVSRDRVFLAFAILHSMSIDVGKIISSEI
        L D    H+ F + +           VA +  +             SA    A  W  F+K RLLPTTH  TVS+DR+ L  ++L   SI+VG++I SEI
Subjt:  LQD--FPHAVFNEMV-----------VAPSNDQ------------LSAAVREANTWMGFIKLRLLPTTHDSTVSRDRVFLAFAILHSMSIDVGKIISSEI

Query:  LDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARLQR---THEARQ---------------GGLVCGIQQI------QELLQLH-S
          C  +K G LFFP+ IT LCR A  P   ++  L + G ID   +AR+ +   T   +Q               G ++  ++ +      QE+ Q H  
Subjt:  LDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARLQR---THEARQ---------------GGLVCGIQQI------QELLQLH-S

Query:  SRMEFAERQFQTFWDYVKRRDAALRVALQSNFSEPYPALPVFPEDLL
        S ++   +Q Q FW Y K RD AL+ ALQ+NF+ P P  P FP+++L
Subjt:  SRMEFAERQFQTFWDYVKRRDAALRVALQSNFSEPYPALPVFPEDLL

A0A2P5CEY2 Uncharacterized protein | e value: 1.0e-26 | % identity: 38.28
Query:  PSNDQLSAAVREANTWMGFIKLRLLPTTHDSTVSRDRVFLAFAILHSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGI
        P N +  A    A  W  F+K RLLPTTH  TVS+DR+ L +++L   SI+VG++I SEI  C  +K G LFFP+ IT LCR A  P   ++  L   G 
Subjt:  PSNDQLSAAVREANTWMGFIKLRLLPTTHDSTVSRDRVFLAFAILHSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGI

Query:  IDTPNLARLQR-----------------THEARQGGLVCGIQQI---------QELLQLH-SSRMEFAERQFQTFWDYVKRRDAALRVALQSNFSEPYPA
        ID   +AR+ +                    +R  G +  +QQ+         QE+ Q H  S ++   +Q Q FW Y K RD AL+ ALQ+NF+ P P 
Subjt:  IDTPNLARLQR-----------------THEARQGGLVCGIQQI---------QELLQLH-SSRMEFAERQFQTFWDYVKRRDAALRVALQSNFSEPYPA

Query:  LPVFPEDLL
         P FP++LL
Subjt:  LPVFPEDLL

A0A2P5DXM3 Uncharacterized protein | e value: 1.5e-35 | % identity: 38.32
Query:  VREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFDLQD--FPHAVFNEMVVAPSNDQL-----------------------SAAVREANTWMGFIKLRLL
        VREFYANL D EE  + VRGV V WS EA+N +F L D    H+ F E +  P    +                       SA    A  W  F+K RLL
Subjt:  VREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFDLQD--FPHAVFNEMVVAPSNDQL-----------------------SAAVREANTWMGFIKLRLL

Query:  PTTHDSTVSRDRVFLAFAILHSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDT-----------------PNLAR
        PTTH   VS+DR+ L  ++L+  SI+VG++I SEI  C  +K G LFFP+ IT LCR A    +E+   L + G ID                  P+ +R
Subjt:  PTTHDSTVSRDRVFLAFAILHSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDT-----------------PNLAR

Query:  LQRTHEARQGGLVCGIQQIQELLQLHSSRMEFAERQFQTFWDYVKRRDAALRVALQSNFSEPYPALPVFPEDLL
              +R  G V  +QQ++ L Q   S+ E   +Q Q FW Y K RD AL+ ALQ+NF+ P P  P FP+++L
Subjt:  LQRTHEARQGGLVCGIQQIQELLQLHSSRMEFAERQFQTFWDYVKRRDAALRVALQSNFSEPYPALPVFPEDLL

A0A6A2YMQ9 Uncharacterized protein | e value: 2.2e-26 | % identity: 29.32
Query:  RFVNNLARAKYQEMLKRDFLFERGF-------GNELPRFLRTRIENLGWSQFCAKPEPVNSNFVREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFDLQ
        +F ++ A+A++Q   K+   FE GF       G   P  +   +  L W +F   P  VN++ V+EFYAN+    +  + VRG  + ++P A+   F LQ
Subjt:  RFVNNLARAKYQEMLKRDFLFERGF-------GNELPRFLRTRIENLGWSQFCAKPEPVNSNFVREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFDLQ

Query:  DF--PHAVFNEMVVAPSNDQLSAAVREANT-----------------------WMGFIKLRLLPTTHDSTVSRDRVFLAFAILHSMSIDVGKIISSEILD
        D    HA F E   + + D++   +   NT                       W  F+K +L+PT++++TVS  R+ L  +I  S  IDVG+II  ++ D
Subjt:  DF--PHAVFNEMVVAPSNDQLSAAVREANT-----------------------WMGFIKLRLLPTTHDSTVSRDRVFLAFAILHSMSIDVGKIISSEILD

Query:  CWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARL---------QRTHE----ARQGGLVCGIQQIQELLQLHSSRMEFAERQFQTFW
        C  KK   L FPN IT LCR+  V E+  D ILP    I    L  L         +  HE      Q      +  ++E++    + +     + + F+
Subjt:  CWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARL---------QRTHE----ARQGGLVCGIQQIQELLQLHSSRMEFAERQFQTFW

Query:  DYVKRRD
         YVK RD
Subjt:  DYVKRRD

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGAGCGATCCGCCTGGGGTGAGGTTCGAGCTTGATCCAGAAATCGAGAGGACATTCAGGATAAGAAGAAGAGAGCAGCGTAGACAGCAGAATCAAATGGCTGACGTACC
GCATCTACCGCAAGGGTCAGAAGGTTGTTGCGGCAAGGTGATGGCTGGAGCAGAATCATCCGAACTACGAAGGGATTTCGTTGAGAATTTATTAGTTTTACCGTTGGGAT
TTTCTTTGAATTTTCGCAGGAAAGAAAGAGAAAGTGAGGAGGAGGAGGTGCCCGTTACCCCTGAAGTTCGGAAAGCTAAAACCAAGAAGAAGAAAACGCCAGAAGAGAAA
GAAGCCAAACGGAGAAGAAGGCAGCATAGGGCTGCGGAGCAAGAAGCTATCCAAGAAGGACCAGTGAATGACCCAGATACGGAAGGAATTCAGAATCCTGAGGTAGAACC
AATAGTCCAAGATTCGGTGCAAAAGGAGAATGTTGAGAAGAATCAAGAAACACAGGCGGAAGAAGTTCGAGACGAACAGACCGCGGTTGTGCCTGAGGAAGGGGATGAAC
AGGAAACGGTGCAGGAGGCTCATGTTGAGGTCATAATGCCTGAACCACCAAAGCGCCGCCGCATCAAGCGGAAGGCTGGGCGCGTTCAGGTGATTCGGACTGATACCCCA
TCACCACCATCGTCGGATTCTGAGAAAGAGAAGGCAGAGCGAGAGGAACGAGAGAAAAAAGAAGCTGAGGAAAGAGTGCGAGAAGAAGGAAAGAAGGCTGAGGAAGAGAT
TTTGCAGAAGCGAAGAGAAGACAAGGGCAAAGGTATTGCCGAGGCATCAGGTGCGGCTGACGAGGTTGAGGCACAAGGGTTACCTTTTATTCGCTTCGTCAACAACCTTG
CTCGAGCAAAATACCAGGAGATGCTGAAACGGGACTTTCTGTTCGAACGAGGATTTGGCAATGAGTTGCCACGGTTCTTGAGGACTAGAATAGAAAACCTCGGCTGGAGC
CAATTTTGTGCGAAACCAGAGCCTGTGAATTCCAACTTTGTTCGGGAATTTTACGCAAATCTTGACGATAAGGAAGAATTTCAGGTTATAGTTCGAGGAGTCCCAGTGGA
TTGGAGCCCAGAAGCTGTTAATGAATTGTTTGATCTCCAGGATTTTCCGCATGCAGTCTTCAATGAGATGGTGGTTGCCCCATCTAACGATCAGTTAAGTGCGGCTGTCC
GAGAGGCCAACACCTGGATGGGTTTTATTAAGTTGCGCTTACTACCGACTACGCATGACTCCACAGTATCTCGGGACAGGGTATTTCTTGCCTTTGCTATTCTTCACTCA
ATGAGTATTGATGTAGGAAAAATAATTTCGTCTGAGATTCTTGACTGCTGGCGGAAAAAGGTGGGGAAGCTGTTTTTCCCCAACACTATTACGATGCTATGCCGAAGGGC
AGGGGTGCCAGAGAGTGAGGATGATATGATATTACCAGATAAGGGAATAATTGATACGCCAAATTTGGCTAGACTTCAGAGAACACATGAAGCACGCCAAGGGGGTTTGG
TGTGCGGCATCCAACAAATTCAGGAGCTGTTGCAATTGCATTCCAGCAGAATGGAATTCGCTGAAAGGCAATTTCAGACTTTCTGGGACTATGTAAAGAGAAGGGATGCT
GCCTTAAGGGTGGCCTTGCAATCAAATTTTTCCGAACCATACCCGGCCTTACCCGTATTCCCTGAGGACCTACTGAACCCCTGGATTCCGCCCCCACCAATGGAAGGAGG
AGAAGAGGAAGATGGAAATGAACCGGGCCAAGAGGACTAA
mRNA sequence
ATGAGCGATCCGCCTGGGGTGAGGTTCGAGCTTGATCCAGAAATCGAGAGGACATTCAGGATAAGAAGAAGAGAGCAGCGTAGACAGCAGAATCAAATGGCTGACGTACC
GCATCTACCGCAAGGGTCAGAAGGTTGTTGCGGCAAGGTGATGGCTGGAGCAGAATCATCCGAACTACGAAGGGATTTCGTTGAGAATTTATTAGTTTTACCGTTGGGAT
TTTCTTTGAATTTTCGCAGGAAAGAAAGAGAAAGTGAGGAGGAGGAGGTGCCCGTTACCCCTGAAGTTCGGAAAGCTAAAACCAAGAAGAAGAAAACGCCAGAAGAGAAA
GAAGCCAAACGGAGAAGAAGGCAGCATAGGGCTGCGGAGCAAGAAGCTATCCAAGAAGGACCAGTGAATGACCCAGATACGGAAGGAATTCAGAATCCTGAGGTAGAACC
AATAGTCCAAGATTCGGTGCAAAAGGAGAATGTTGAGAAGAATCAAGAAACACAGGCGGAAGAAGTTCGAGACGAACAGACCGCGGTTGTGCCTGAGGAAGGGGATGAAC
AGGAAACGGTGCAGGAGGCTCATGTTGAGGTCATAATGCCTGAACCACCAAAGCGCCGCCGCATCAAGCGGAAGGCTGGGCGCGTTCAGGTGATTCGGACTGATACCCCA
TCACCACCATCGTCGGATTCTGAGAAAGAGAAGGCAGAGCGAGAGGAACGAGAGAAAAAAGAAGCTGAGGAAAGAGTGCGAGAAGAAGGAAAGAAGGCTGAGGAAGAGAT
TTTGCAGAAGCGAAGAGAAGACAAGGGCAAAGGTATTGCCGAGGCATCAGGTGCGGCTGACGAGGTTGAGGCACAAGGGTTACCTTTTATTCGCTTCGTCAACAACCTTG
CTCGAGCAAAATACCAGGAGATGCTGAAACGGGACTTTCTGTTCGAACGAGGATTTGGCAATGAGTTGCCACGGTTCTTGAGGACTAGAATAGAAAACCTCGGCTGGAGC
CAATTTTGTGCGAAACCAGAGCCTGTGAATTCCAACTTTGTTCGGGAATTTTACGCAAATCTTGACGATAAGGAAGAATTTCAGGTTATAGTTCGAGGAGTCCCAGTGGA
TTGGAGCCCAGAAGCTGTTAATGAATTGTTTGATCTCCAGGATTTTCCGCATGCAGTCTTCAATGAGATGGTGGTTGCCCCATCTAACGATCAGTTAAGTGCGGCTGTCC
GAGAGGCCAACACCTGGATGGGTTTTATTAAGTTGCGCTTACTACCGACTACGCATGACTCCACAGTATCTCGGGACAGGGTATTTCTTGCCTTTGCTATTCTTCACTCA
ATGAGTATTGATGTAGGAAAAATAATTTCGTCTGAGATTCTTGACTGCTGGCGGAAAAAGGTGGGGAAGCTGTTTTTCCCCAACACTATTACGATGCTATGCCGAAGGGC
AGGGGTGCCAGAGAGTGAGGATGATATGATATTACCAGATAAGGGAATAATTGATACGCCAAATTTGGCTAGACTTCAGAGAACACATGAAGCACGCCAAGGGGGTTTGG
TGTGCGGCATCCAACAAATTCAGGAGCTGTTGCAATTGCATTCCAGCAGAATGGAATTCGCTGAAAGGCAATTTCAGACTTTCTGGGACTATGTAAAGAGAAGGGATGCT
GCCTTAAGGGTGGCCTTGCAATCAAATTTTTCCGAACCATACCCGGCCTTACCCGTATTCCCTGAGGACCTACTGAACCCCTGGATTCCGCCCCCACCAATGGAAGGAGG
AGAAGAGGAAGATGGAAATGAACCGGGCCAAGAGGACTAA
Protein sequence
MSDPPGVRFELDPEIERTFRIRRREQRRQQNQMADVPHLPQGSEGCCGKVMAGAESSELRRDFVENLLVLPLGFSLNFRRKERESEEEEVPVTPEVRKAKTKKKKTPEEK
EAKRRRRQHRAAEQEAIQEGPVNDPDTEGIQNPEVEPIVQDSVQKENVEKNQETQAEEVRDEQTAVVPEEGDEQETVQEAHVEVIMPEPPKRRRIKRKAGRVQVIRTDTP
SPPSSDSEKEKAEREEREKKEAEERVREEGKKAEEEILQKRREDKGKGIAEASGAADEVEAQGLPFIRFVNNLARAKYQEMLKRDFLFERGFGNELPRFLRTRIENLGWS
QFCAKPEPVNSNFVREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFDLQDFPHAVFNEMVVAPSNDQLSAAVREANTWMGFIKLRLLPTTHDSTVSRDRVFLAFAILHS
MSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARLQRTHEARQGGLVCGIQQIQELLQLHSSRMEFAERQFQTFWDYVKRRDA
ALRVALQSNFSEPYPALPVFPEDLLNPWIPPPPMEGGEEEDGNEPGQED
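
The protein sequence above should correspond to a codon-by-codon translation of the CDS. The sketch below shows one way to check that, assuming the standard nuclear genetic code; the page does not say which translation table was used. The function name `translate` is hypothetical, and only the first and last few codons of the CDS are used here to keep the example short.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")  # X = unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First six codons of the CDS listed above.
print(translate("ATGAGCGATCCGCCTGGG"))
# Last four codons of the CDS, ending in the TAA stop codon.
print(translate("CAAGAGGACTAA"))
```

The two calls print MSDPPG and QED, which match the first six and last three residues of the protein sequence shown on this page. Passing the full CDS text, line breaks included, to `translate` would allow the same comparison over the whole sequence.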