; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg029537 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg029537
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionUnknown protein
Genome locationscaffold2:23096470..23103425
RNA-Seq ExpressionSpg029537
SyntenySpg029537
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]1.2e-3138.6Show/hide
Query:  IRFINELAREKYREMLK-RDFLFERGFGDD-------LPHFLRAGIANHGWSQFCAKPDPVNSNIVREFYANVDNAKEFQAIVRGVTVDWSPGAINSLFN
        ++F  E A  +Y   ++ R    E+GF  D       LP F+   I  H W QFCA P+     +VREFYAN+ +  E    VRGV V WS  AIN++F 
Subjt:  IRFINELAREKYREMLK-RDFLFERGFGDD-------LPHFLRAGIANHGWSQFCAKPDPVNSNIVREFYANVDNAKEFQAIVRGVTVDWSPGAINSLFN

Query:  LQNFPHACFNEMVVTPSSDQLNAAVRKVGIEGAQWRLSKMEKRTFQAAYLKSEANTWLGFIKLRLLPTTHDSTVSRDRVLLIFTILRSLSIDVGKIISNE
        L + P    +E +   +   L   +  V + GA+W +S     T   + L   A  W  F+K  LLPTTH  TVS+DR+LL+ ++L   SI+VG++I +E
Subjt:  LQNFPHACFNEMVVTPSSDQLNAAVRKVGIEGAQWRLSKMEKRTFQAAYLKSEANTWLGFIKLRLLPTTHDSTVSRDRVLLIFTILRSLSIDVGKIISNE

Query:  IFNCWRKKVGKLFFPNTITMLCSRAGVP
        I  C  +K G LFFP+ IT LC  A  P
Subjt:  IFNCWRKKVGKLFFPNTITMLCSRAGVP

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]3.0e-3831.7Show/hide
Query:  IRFINELAREKYREMLK-RDFLFERGFGDD-------LPHFLRAGIANHGWSQFCAKPDPVNSNIVREFYANVDNAKEFQAIVRGVTVDWSPGAINSLFN
        ++F  E A  +Y   ++ R    E+GF  D       LP F+   I  H W QFCA P+     +VREFYAN+ + +E    VRGV V WS  AIN++F 
Subjt:  IRFINELAREKYREMLK-RDFLFERGFGDD-------LPHFLRAGIANHGWSQFCAKPDPVNSNIVREFYANVDNAKEFQAIVRGVTVDWSPGAINSLFN

Query:  LQNFPHACFNEMVVTPSSDQLNAAVRKVGIEGAQWRLSKMEKRTFQAAYLKSEANTWLGFIKLRLLPTTHDSTVSRDRVLLIFTILRSLSIDVGKIISNE
        L + P    +E +   +   L   +  V   GA+W +S     T   + L   A  W  F+K RLLPTTH  TVS+DR+LL+ ++L   SI+VG++I +E
Subjt:  LQNFPHACFNEMVVTPSSDQLNAAVRKVGIEGAQWRLSKMEKRTFQAAYLKSEANTWLGFIKLRLLPTTHDSTVSRDRVLLIFTILRSLSIDVGKIISNE

Query:  IFNCWRKKVGKLFFPNTITMLCSRAGVP---------------------------------------------TVQGGLVCGIHQIQEQLALSTSRQ---
        I  C  +K G LFFP+ IT LC  A  P                                                G ++  +  ++++L+    +Q   
Subjt:  IFNCWRKKVGKLFFPNTITMLCSRAGVP---------------------------------------------TVQGGLVCGIHQIQEQLALSTSRQ---

Query:  ----EFAERQAQTYWTYAKRRDDTLRRALQSNFSKPYQAFPMFPDDL
            +   +Q Q +W Y+K RD  L++ALQ+NF++P   FP FP ++
Subjt:  ----EFAERQAQTYWTYAKRRDDTLRRALQSNFSKPYQAFPMFPDDL

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]7.5e-2934.36Show/hide
Query:  IRFINELAREKYRE-------MLKRDFLFERGFGDDLPHFLRAGIANHGWSQFCAKPDPVNSNIVREFYANVDNAKEFQAIVRGVTVDWSPGAINSLFNL
        ++F ++ A  +Y E        ++++F+++     + P F+   I  H W  FCA P+     +VREFY N+ N  +    +RGV V  S  AIN++F+L
Subjt:  IRFINELAREKYRE-------MLKRDFLFERGFGDDLPHFLRAGIANHGWSQFCAKPDPVNSNIVREFYANVDNAKEFQAIVRGVTVDWSPGAINSLFNL

Query:  QNFPHACFNEMVVTPSSDQLNAAVRKVGIEGAQWRLSKMEKRTFQAAYLKSEANTWLGFIKLRLLPTTHDSTVSRDRVLLIFTILRSLSIDVGKIISNEI
         + P    +E V   +  +L   +  V I GA+W +S     T   + L   A  W  F+K RLLPTTH  TVS++ V L++++L   SI+VG++I  EI
Subjt:  QNFPHACFNEMVVTPSSDQLNAAVRKVGIEGAQWRLSKMEKRTFQAAYLKSEANTWLGFIKLRLLPTTHDSTVSRDRVLLIFTILRSLSIDVGKIISNEI

Query:  FNCWRKKVGKLFFPNTITMLCSRAGVP
          C  +K G LFFP+ IT +C     P
Subjt:  FNCWRKKVGKLFFPNTITMLCSRAGVP

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]9.5e-3234.81Show/hide
Query:  IVREFYANVDNAKEFQAIVRGVTVDWSPGAINSLFNLQNFPHACFNEMVVTPSSDQLNAAVRKVGIEGAQWRLSKMEKRTFQAAYLKSEANTWLGFIKLR
        +VREFYAN+ + +E    VRGV V WS  AIN++F L + P    +E +   +  +L   +  V   GA+W +S     T   + L   A  W  F+K R
Subjt:  IVREFYANVDNAKEFQAIVRGVTVDWSPGAINSLFNLQNFPHACFNEMVVTPSSDQLNAAVRKVGIEGAQWRLSKMEKRTFQAAYLKSEANTWLGFIKLR

Query:  LLPTTHDSTVSRDRVLLIFTILRSLSIDVGKIISNEIFNCWRKKVGKLFFPNTITMLCSRA----------GVPTVQGGLVCGIHQ--------------
        LLPTTH   VS+DR+LL+ ++L   SI+VG++I +EI  C  +K G LFFP+ IT LC  A              +    V  I Q              
Subjt:  LLPTTHDSTVSRDRVLLIFTILRSLSIDVGKIISNEIFNCWRKKVGKLFFPNTITMLCSRA----------GVPTVQGGLVCGIHQ--------------

Query:  -------------IQEQLALS--TSRQEFAERQAQTYWTYAKRRDDTLRRALQSNFSKPYQAFPMFPDDL
                     +Q+  AL    S+QE   +Q Q +W Y+K RD  L++ALQ+NF++P   FP FP ++
Subjt:  -------------IQEQLALS--TSRQEFAERQAQTYWTYAKRRDDTLRRALQSNFSKPYQAFPMFPDDL

TYH88163.1 hypothetical protein ES332_D01G168900v1 [Gossypium tomentosum]4.6e-2634.67Show/hide
Query:  INELAREKYREMLK-RDFLFERGF---GDDL---PHFLRAGIANHGWSQFCAKPDPVNSNIVREFYANVDNAKEFQAIVRGVTVDWSPGAINSLFNLQNF
        I+E  +E++  + K +  + E+GF    +DL   P  +R  I    W +FC      +  +VREFYA++      + IVR   V  +  +IN LFNL + 
Subjt:  INELAREKYREMLK-RDFLFERGF---GDDL---PHFLRAGIANHGWSQFCAKPDPVNSNIVREFYANVDNAKEFQAIVRGVTVDWSPGAINSLFNLQNF

Query:  PHACFNEMVVTPSSDQLNAAVRKVGIEGAQWRLSKMEKRTFQAAYLKSEANTWLGFIKLRLLPTTHDSTVSRDRVLLIFTILRSLSIDVGKIISNEIFNC
            +  M+   + D L   +  V   G+QW + K    + +  YLK  AN W  F++   +P +H  T+S +R+LL++ IL   SI+VGKII  EI NC
Subjt:  PHACFNEMVVTPSSDQLNAAVRKVGIEGAQWRLSKMEKRTFQAAYLKSEANTWLGFIKLRLLPTTHDSTVSRDRVLLIFTILRSLSIDVGKIISNEIFNC

Query:  WRKKVGKLFFPNTITMLCSRAGVPT
         +KK G ++FP+ IT LC +A V T
Subjt:  WRKKVGKLFFPNTITMLCSRAGVPT

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)6.0e-3238.6Show/hide
Query:  IRFINELAREKYREMLK-RDFLFERGFGDD-------LPHFLRAGIANHGWSQFCAKPDPVNSNIVREFYANVDNAKEFQAIVRGVTVDWSPGAINSLFN
        ++F  E A  +Y   ++ R    E+GF  D       LP F+   I  H W QFCA P+     +VREFYAN+ +  E    VRGV V WS  AIN++F 
Subjt:  IRFINELAREKYREMLK-RDFLFERGFGDD-------LPHFLRAGIANHGWSQFCAKPDPVNSNIVREFYANVDNAKEFQAIVRGVTVDWSPGAINSLFN

Query:  LQNFPHACFNEMVVTPSSDQLNAAVRKVGIEGAQWRLSKMEKRTFQAAYLKSEANTWLGFIKLRLLPTTHDSTVSRDRVLLIFTILRSLSIDVGKIISNE
        L + P    +E +   +   L   +  V + GA+W +S     T   + L   A  W  F+K  LLPTTH  TVS+DR+LL+ ++L   SI+VG++I +E
Subjt:  LQNFPHACFNEMVVTPSSDQLNAAVRKVGIEGAQWRLSKMEKRTFQAAYLKSEANTWLGFIKLRLLPTTHDSTVSRDRVLLIFTILRSLSIDVGKIISNE

Query:  IFNCWRKKVGKLFFPNTITMLCSRAGVP
        I  C  +K G LFFP+ IT LC  A  P
Subjt:  IFNCWRKKVGKLFFPNTITMLCSRAGVP

A0A2P5BCG4 Uncharacterized protein (Fragment)1.5e-3831.7Show/hide
Query:  IRFINELAREKYREMLK-RDFLFERGFGDD-------LPHFLRAGIANHGWSQFCAKPDPVNSNIVREFYANVDNAKEFQAIVRGVTVDWSPGAINSLFN
        ++F  E A  +Y   ++ R    E+GF  D       LP F+   I  H W QFCA P+     +VREFYAN+ + +E    VRGV V WS  AIN++F 
Subjt:  IRFINELAREKYREMLK-RDFLFERGFGDD-------LPHFLRAGIANHGWSQFCAKPDPVNSNIVREFYANVDNAKEFQAIVRGVTVDWSPGAINSLFN

Query:  LQNFPHACFNEMVVTPSSDQLNAAVRKVGIEGAQWRLSKMEKRTFQAAYLKSEANTWLGFIKLRLLPTTHDSTVSRDRVLLIFTILRSLSIDVGKIISNE
        L + P    +E +   +   L   +  V   GA+W +S     T   + L   A  W  F+K RLLPTTH  TVS+DR+LL+ ++L   SI+VG++I +E
Subjt:  LQNFPHACFNEMVVTPSSDQLNAAVRKVGIEGAQWRLSKMEKRTFQAAYLKSEANTWLGFIKLRLLPTTHDSTVSRDRVLLIFTILRSLSIDVGKIISNE

Query:  IFNCWRKKVGKLFFPNTITMLCSRAGVP---------------------------------------------TVQGGLVCGIHQIQEQLALSTSRQ---
        I  C  +K G LFFP+ IT LC  A  P                                                G ++  +  ++++L+    +Q   
Subjt:  IFNCWRKKVGKLFFPNTITMLCSRAGVP---------------------------------------------TVQGGLVCGIHQIQEQLALSTSRQ---

Query:  ----EFAERQAQTYWTYAKRRDDTLRRALQSNFSKPYQAFPMFPDDL
            +   +Q Q +W Y+K RD  L++ALQ+NF++P   FP FP ++
Subjt:  ----EFAERQAQTYWTYAKRRDDTLRRALQSNFSKPYQAFPMFPDDL

A0A2P5DAQ2 Uncharacterized protein3.6e-2934.36Show/hide
Query:  IRFINELAREKYRE-------MLKRDFLFERGFGDDLPHFLRAGIANHGWSQFCAKPDPVNSNIVREFYANVDNAKEFQAIVRGVTVDWSPGAINSLFNL
        ++F ++ A  +Y E        ++++F+++     + P F+   I  H W  FCA P+     +VREFY N+ N  +    +RGV V  S  AIN++F+L
Subjt:  IRFINELAREKYRE-------MLKRDFLFERGFGDDLPHFLRAGIANHGWSQFCAKPDPVNSNIVREFYANVDNAKEFQAIVRGVTVDWSPGAINSLFNL

Query:  QNFPHACFNEMVVTPSSDQLNAAVRKVGIEGAQWRLSKMEKRTFQAAYLKSEANTWLGFIKLRLLPTTHDSTVSRDRVLLIFTILRSLSIDVGKIISNEI
         + P    +E V   +  +L   +  V I GA+W +S     T   + L   A  W  F+K RLLPTTH  TVS++ V L++++L   SI+VG++I  EI
Subjt:  QNFPHACFNEMVVTPSSDQLNAAVRKVGIEGAQWRLSKMEKRTFQAAYLKSEANTWLGFIKLRLLPTTHDSTVSRDRVLLIFTILRSLSIDVGKIISNEI

Query:  FNCWRKKVGKLFFPNTITMLCSRAGVP
          C  +K G LFFP+ IT +C     P
Subjt:  FNCWRKKVGKLFFPNTITMLCSRAGVP

A0A2P5DXM3 Uncharacterized protein4.6e-3234.81Show/hide
Query:  IVREFYANVDNAKEFQAIVRGVTVDWSPGAINSLFNLQNFPHACFNEMVVTPSSDQLNAAVRKVGIEGAQWRLSKMEKRTFQAAYLKSEANTWLGFIKLR
        +VREFYAN+ + +E    VRGV V WS  AIN++F L + P    +E +   +  +L   +  V   GA+W +S     T   + L   A  W  F+K R
Subjt:  IVREFYANVDNAKEFQAIVRGVTVDWSPGAINSLFNLQNFPHACFNEMVVTPSSDQLNAAVRKVGIEGAQWRLSKMEKRTFQAAYLKSEANTWLGFIKLR

Query:  LLPTTHDSTVSRDRVLLIFTILRSLSIDVGKIISNEIFNCWRKKVGKLFFPNTITMLCSRA----------GVPTVQGGLVCGIHQ--------------
        LLPTTH   VS+DR+LL+ ++L   SI+VG++I +EI  C  +K G LFFP+ IT LC  A              +    V  I Q              
Subjt:  LLPTTHDSTVSRDRVLLIFTILRSLSIDVGKIISNEIFNCWRKKVGKLFFPNTITMLCSRA----------GVPTVQGGLVCGIHQ--------------

Query:  -------------IQEQLALS--TSRQEFAERQAQTYWTYAKRRDDTLRRALQSNFSKPYQAFPMFPDDL
                     +Q+  AL    S+QE   +Q Q +W Y+K RD  L++ALQ+NF++P   FP FP ++
Subjt:  -------------IQEQLALS--TSRQEFAERQAQTYWTYAKRRDDTLRRALQSNFSKPYQAFPMFPDDL

A0A5D2MA47 Uncharacterized protein2.2e-2634.67Show/hide
Query:  INELAREKYREMLK-RDFLFERGF---GDDL---PHFLRAGIANHGWSQFCAKPDPVNSNIVREFYANVDNAKEFQAIVRGVTVDWSPGAINSLFNLQNF
        I+E  +E++  + K +  + E+GF    +DL   P  +R  I    W +FC      +  +VREFYA++      + IVR   V  +  +IN LFNL + 
Subjt:  INELAREKYREMLK-RDFLFERGF---GDDL---PHFLRAGIANHGWSQFCAKPDPVNSNIVREFYANVDNAKEFQAIVRGVTVDWSPGAINSLFNLQNF

Query:  PHACFNEMVVTPSSDQLNAAVRKVGIEGAQWRLSKMEKRTFQAAYLKSEANTWLGFIKLRLLPTTHDSTVSRDRVLLIFTILRSLSIDVGKIISNEIFNC
            +  M+   + D L   +  V   G+QW + K    + +  YLK  AN W  F++   +P +H  T+S +R+LL++ IL   SI+VGKII  EI NC
Subjt:  PHACFNEMVVTPSSDQLNAAVRKVGIEGAQWRLSKMEKRTFQAAYLKSEANTWLGFIKLRLLPTTHDSTVSRDRVLLIFTILRSLSIDVGKIISNEIFNC

Query:  WRKKVGKLFFPNTITMLCSRAGVPT
         +KK G ++FP+ IT LC +A V T
Subjt:  WRKKVGKLFFPNTITMLCSRAGVPT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTTTCACCTTTGGGAAGGGGGCTATTTATAGGAGCAGAAACAGACCTTTCTTGCTGCCCAAGACGCACAGCGTCGAGACAGACGCTATGACGTCAGGTTTTCTTGG
AGCAGAAGTTGTCGCAGCGTCGCGACGCTACCGCGATTCTGGAATTGTCTTCGCTGCTTGCTTAGCGTCGAGACGCTCTGGTCCTAGCGTCTCGACGCTAGGCCTTCTAG
AAGCCTTAAACACGGATTTTTGCCTCCTTTCTTCATTCTTTTGGGCTTTTGGGCTTCCACTTCAATTGTTCTATTTTCTTTTCTTCTTTTATGGCCCAAAGTCTTCTTTT
TATGCCCAAAATTGTTTGAATTCCGTATCCAAGATTCTCCAGATTGCATGGGTTTTGGACGACAACGAGGAAGAAGAGGTACCTGTTACCCCCGAAGTACAGAAAGTGAA
AGCAAAGAAAAAGAAAACACCGGAGGAAAAAGAAGCCAAAAGAAGGAGAAGGCAACAGAGGGCTGAGGATCAAGAAAGTGTACAGAAGGTGGTAGAAGATGTGGCTGCCA
CAGTGGTTGAAGACCCGAAGGAACCAGAGGGACAGAACACTGAGCTGAGTAACCCAGTAGTTGCGGATACGGAGGGAGTTCAAGAAGAACAAACAGAGGAAGTTCAAGAA
AAACAGGCCGAAGATACGCAAGAAGGTAGGACAGAGGATGTTCAGGAAACAGGTAATGAGCAAGTGGAGCAAGAGCAAGAGGCTCATGTTGAGGTTATCATGTCGGAAGT
ACCAAAACGTCGTCGTGTGAAGCAAAAACCTGGACGCGTCAAGGTAGTCCGAACTGATACCCCATCACCGCCATCGACGGATTCTGAGAAAGAGGATGCAGAGAGAGAGG
AACGGGAGAAAAAGGAGGCTGAGGACAGAGAGAGAGAAGAAGCAGGAAAGAAAGCAGCGGAGGAAACTTTGACAAAGCATCAAGAAGACAGGGGCAAAGGAATTGCTGAA
GCATCGGATGAACCTATAGAAGAAGCAGAAGAAGGACCATTCATCCGCTTCATCAATGAACTTGCTCGAGAAAAATACCGGGAGATGCTAAAAAGGGATTTCTTATTCGA
AAGAGGGTTTGGTGATGATCTGCCACATTTCTTAAGGGCAGGGATCGCGAATCACGGCTGGAGTCAGTTCTGTGCGAAACCAGACCCAGTGAATTCGAACATTGTTCGAG
AGTTTTATGCGAATGTTGATAATGCAAAGGAATTTCAGGCCATAGTCCGAGGAGTGACTGTTGACTGGAGCCCAGGAGCTATCAATTCATTATTTAACCTCCAGAACTTT
CCACACGCATGCTTCAATGAGATGGTGGTGACACCATCGAGTGATCAGTTAAATGCGGCGGTCCGAAAGGTTGGCATTGAGGGGGCTCAATGGAGGCTATCAAAGATGGA
GAAGCGAACATTTCAAGCTGCCTATCTAAAGAGTGAAGCCAATACTTGGTTGGGCTTCATCAAGCTACGTTTGCTTCCGACTACACATGATTCAACAGTGTCTCGCGACC
GAGTGCTACTGATATTCACAATTCTTCGATCCTTAAGTATTGATGTTGGAAAAATCATTTCGAATGAAATCTTTAATTGCTGGCGCAAAAAGGTGGGGAAGCTGTTTTTC
CCGAACACGATCACTATGTTATGCAGCAGGGCAGGAGTGCCCACGGTTCAGGGTGGGCTAGTGTGTGGGATTCATCAAATCCAAGAGCAATTGGCACTTTCGACCAGTAG
GCAAGAGTTTGCTGAGAGGCAAGCTCAAACCTATTGGACCTATGCTAAAAGGAGAGATGACACACTCAGGAGGGCCTTGCAATCCAATTTCTCAAAACCATATCAAGCCT
TCCCTATGTTTCCCGATGATTTATTTAACCTTTGGATACCGCCCCCACCTGTCGAAAGAGAAGAAGAGGATGATGAAAATGACTTGGACTGGTTAAGCTTAATTAGATCA
AGTCTAATTGGTGATGAGTTTGAGGCAAGGGTATACTGCACCATAAAGTGGGTCATCCCATGCTTAAGAGCTTATGACTGTAGGGCTGCTTTAAGTCTGAAGAACAAAAA
TATAAACCCCTTAAAAATGTATTTTAATATGTCTGATAATAGAGCTAGGCTGTGGCAAGTTCTTAGAATTGAGTTAAAAGTGGTGATTATTTGTCCATGCCGGAAGAACT
ATTTTGCTGCAGCAGTCCTTGGTTTTGCAGAATGCTCAGAATATATTGTTGAGCGACTTAAGGGAGCAAAATCTGTGCTGGAGCAAAGCTGGGAGCAAAACTGCCACGTC
ACAGCTCGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGATTTTCACCTTTGGGAAGGGGGCTATTTATAGGAGCAGAAACAGACCTTTCTTGCTGCCCAAGACGCACAGCGTCGAGACAGACGCTATGACGTCAGGTTTTCTTGG
AGCAGAAGTTGTCGCAGCGTCGCGACGCTACCGCGATTCTGGAATTGTCTTCGCTGCTTGCTTAGCGTCGAGACGCTCTGGTCCTAGCGTCTCGACGCTAGGCCTTCTAG
AAGCCTTAAACACGGATTTTTGCCTCCTTTCTTCATTCTTTTGGGCTTTTGGGCTTCCACTTCAATTGTTCTATTTTCTTTTCTTCTTTTATGGCCCAAAGTCTTCTTTT
TATGCCCAAAATTGTTTGAATTCCGTATCCAAGATTCTCCAGATTGCATGGGTTTTGGACGACAACGAGGAAGAAGAGGTACCTGTTACCCCCGAAGTACAGAAAGTGAA
AGCAAAGAAAAAGAAAACACCGGAGGAAAAAGAAGCCAAAAGAAGGAGAAGGCAACAGAGGGCTGAGGATCAAGAAAGTGTACAGAAGGTGGTAGAAGATGTGGCTGCCA
CAGTGGTTGAAGACCCGAAGGAACCAGAGGGACAGAACACTGAGCTGAGTAACCCAGTAGTTGCGGATACGGAGGGAGTTCAAGAAGAACAAACAGAGGAAGTTCAAGAA
AAACAGGCCGAAGATACGCAAGAAGGTAGGACAGAGGATGTTCAGGAAACAGGTAATGAGCAAGTGGAGCAAGAGCAAGAGGCTCATGTTGAGGTTATCATGTCGGAAGT
ACCAAAACGTCGTCGTGTGAAGCAAAAACCTGGACGCGTCAAGGTAGTCCGAACTGATACCCCATCACCGCCATCGACGGATTCTGAGAAAGAGGATGCAGAGAGAGAGG
AACGGGAGAAAAAGGAGGCTGAGGACAGAGAGAGAGAAGAAGCAGGAAAGAAAGCAGCGGAGGAAACTTTGACAAAGCATCAAGAAGACAGGGGCAAAGGAATTGCTGAA
GCATCGGATGAACCTATAGAAGAAGCAGAAGAAGGACCATTCATCCGCTTCATCAATGAACTTGCTCGAGAAAAATACCGGGAGATGCTAAAAAGGGATTTCTTATTCGA
AAGAGGGTTTGGTGATGATCTGCCACATTTCTTAAGGGCAGGGATCGCGAATCACGGCTGGAGTCAGTTCTGTGCGAAACCAGACCCAGTGAATTCGAACATTGTTCGAG
AGTTTTATGCGAATGTTGATAATGCAAAGGAATTTCAGGCCATAGTCCGAGGAGTGACTGTTGACTGGAGCCCAGGAGCTATCAATTCATTATTTAACCTCCAGAACTTT
CCACACGCATGCTTCAATGAGATGGTGGTGACACCATCGAGTGATCAGTTAAATGCGGCGGTCCGAAAGGTTGGCATTGAGGGGGCTCAATGGAGGCTATCAAAGATGGA
GAAGCGAACATTTCAAGCTGCCTATCTAAAGAGTGAAGCCAATACTTGGTTGGGCTTCATCAAGCTACGTTTGCTTCCGACTACACATGATTCAACAGTGTCTCGCGACC
GAGTGCTACTGATATTCACAATTCTTCGATCCTTAAGTATTGATGTTGGAAAAATCATTTCGAATGAAATCTTTAATTGCTGGCGCAAAAAGGTGGGGAAGCTGTTTTTC
CCGAACACGATCACTATGTTATGCAGCAGGGCAGGAGTGCCCACGGTTCAGGGTGGGCTAGTGTGTGGGATTCATCAAATCCAAGAGCAATTGGCACTTTCGACCAGTAG
GCAAGAGTTTGCTGAGAGGCAAGCTCAAACCTATTGGACCTATGCTAAAAGGAGAGATGACACACTCAGGAGGGCCTTGCAATCCAATTTCTCAAAACCATATCAAGCCT
TCCCTATGTTTCCCGATGATTTATTTAACCTTTGGATACCGCCCCCACCTGTCGAAAGAGAAGAAGAGGATGATGAAAATGACTTGGACTGGTTAAGCTTAATTAGATCA
AGTCTAATTGGTGATGAGTTTGAGGCAAGGGTATACTGCACCATAAAGTGGGTCATCCCATGCTTAAGAGCTTATGACTGTAGGGCTGCTTTAAGTCTGAAGAACAAAAA
TATAAACCCCTTAAAAATGTATTTTAATATGTCTGATAATAGAGCTAGGCTGTGGCAAGTTCTTAGAATTGAGTTAAAAGTGGTGATTATTTGTCCATGCCGGAAGAACT
ATTTTGCTGCAGCAGTCCTTGGTTTTGCAGAATGCTCAGAATATATTGTTGAGCGACTTAAGGGAGCAAAATCTGTGCTGGAGCAAAGCTGGGAGCAAAACTGCCACGTC
ACAGCTCGTTAG
Protein sequenceShow/hide protein sequence
MIFTFGKGAIYRSRNRPFLLPKTHSVETDAMTSGFLGAEVVAASRRYRDSGIVFAACLASRRSGPSVSTLGLLEALNTDFCLLSSFFWAFGLPLQLFYFLFFFYGPKSSF
YAQNCLNSVSKILQIAWVLDDNEEEEVPVTPEVQKVKAKKKKTPEEKEAKRRRRQQRAEDQESVQKVVEDVAATVVEDPKEPEGQNTELSNPVVADTEGVQEEQTEEVQE
KQAEDTQEGRTEDVQETGNEQVEQEQEAHVEVIMSEVPKRRRVKQKPGRVKVVRTDTPSPPSTDSEKEDAEREEREKKEAEDREREEAGKKAAEETLTKHQEDRGKGIAE
ASDEPIEEAEEGPFIRFINELAREKYREMLKRDFLFERGFGDDLPHFLRAGIANHGWSQFCAKPDPVNSNIVREFYANVDNAKEFQAIVRGVTVDWSPGAINSLFNLQNF
PHACFNEMVVTPSSDQLNAAVRKVGIEGAQWRLSKMEKRTFQAAYLKSEANTWLGFIKLRLLPTTHDSTVSRDRVLLIFTILRSLSIDVGKIISNEIFNCWRKKVGKLFF
PNTITMLCSRAGVPTVQGGLVCGIHQIQEQLALSTSRQEFAERQAQTYWTYAKRRDDTLRRALQSNFSKPYQAFPMFPDDLFNLWIPPPPVEREEEDDENDLDWLSLIRS
SLIGDEFEARVYCTIKWVIPCLRAYDCRAALSLKNKNINPLKMYFNMSDNRARLWQVLRIELKVVIICPCRKNYFAAAVLGFAECSEYIVERLKGAKSVLEQSWEQNCHV
TAR