; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0020425 (gene) of Snake gourd v1 genome

Gene IDTan0020425
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionCACTA en-spm transposon protein
Genome locationLG02:80959828..80961228
RNA-Seq ExpressionTan0020425
SyntenyTan0020425
Gene Ontology termsNA
InterPro domainsIPR004252 - Probable transposase, Ptta/En/Spm, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032748.1 hypothetical protein E6C27_scaffold853G00910 [Cucumis melo var. makuwa]3.4e-5845.99Show/hide
Query:  MSSRSFGPGNRSRPISNEQPTEESNEDTTREQEDDSTNSSAQ----------ITRGVALERYVRAHGRIHIKIELEDRKPVCKNSSKLNSTIGQLVRGTI
        MSSR+FG  NRSR  SN      SN+++  EQE  S +S  Q           TR V LE+YV+ +G   I+I+ EDRKP+  +       +  L     
Subjt:  MSSRSFGPGNRSRPISNEQPTEESNEDTTREQEDDSTNSSAQ----------ITRGVALERYVRAHGRIHIKIELEDRKPVCKNSSKLNSTIGQLVRGTI

Query:  PLQCEHWSDVSQENYFIMDINEPIVRDYLDHEMSVLHRDFRCSLHSTYRKYNSPTEARKHRDKRVARDLDWNRLCDRWETEEFKSLSEANSKARASLPFT
                     NYFI+D+NEP+V+DYL+HEMSVL+RDFRCSLH +Y+KY+SP +ARKHR KRVA           W       + +AN K       T
Subjt:  PLQCEHWSDVSQENYFIMDINEPIVRDYLDHEMSVLHRDFRCSLHSTYRKYNSPTEARKHRDKRVARDLDWNRLCDRWETEEFKSLSEANSKARASLPFT

Query:  HRGGTMPFLRHRKKRY----KEESKKLTEIELFELTHHNEKNGWVKEAKEKFDKMVELKTASSQEGHEPLQDSEICQRVLGTRPGHVKGLGWGPRARHGK
         R      LRH    +    K+E K+++EI LF+LTH NEKNGWV EAKEK+D+MVELKT SSQEG EPL + EIC++VLG R  HVKG G GPR ++  
Subjt:  HRGGTMPFLRHRKKRY----KEESKKLTEIELFELTHHNEKNGWVKEAKEKFDKMVELKTASSQEGHEPLQDSEICQRVLGTRPGHVKGLGWGPRARHGK

Query:  NEAIYQKSKETEALISQLQSIVQS
        NE I   +KE  A I+  QSIV+S
Subjt:  NEAIYQKSKETEALISQLQSIVQS

KAA0046978.1 formin-like protein 4 isoform X2 [Cucumis melo var. makuwa]2.4e-5644.55Show/hide
Query:  SNEDTTREQEDDSTNSSAQ----------ITRGVALERYVRAHGRIHIKIELEDRKPVCKNSSKLNSTIGQLVRGTIPLQCEHWSDVSQENYFIMDINEP
        SN+++  EQE  S +S  Q           TRGV LE+YV+A+G I I+I+ EDRKP+CK+SSKLNS IG+ VR  +PL+CEHWSDVS+E        E 
Subjt:  SNEDTTREQEDDSTNSSAQ----------ITRGVALERYVRAHGRIHIKIELEDRKPVCKNSSKLNSTIGQLVRGTIPLQCEHWSDVSQENYFIMDINEP

Query:  IVRDYLDHEMSVLHRDFRCSLHSTYRKYNSPTEARKHRDKRVARDLDWNRLCDRWETEEFKSLSEANSKARASLPFTHRGGTMPFLRHRKKRYKEESKKL
        + R  L HE +V +      +   Y    +    R    + +  DL+++   +                                        KEE K++
Subjt:  IVRDYLDHEMSVLHRDFRCSLHSTYRKYNSPTEARKHRDKRVARDLDWNRLCDRWETEEFKSLSEANSKARASLPFTHRGGTMPFLRHRKKRYKEESKKL

Query:  TEIELFELTHHNEKNGWVKEAKEKFDKMVELKTASSQEGHEPLQDSEICQRVLGTRPGHVKGLGWGPRARHGKNEAIYQKSKETEALISQLQSIVQSQQA
        +EIELF+LTH NEKNGWV EAKEK+D+MVELKT SSQEG EPL + +IC RVLG R GHVKG GWGPR ++ +NE I   +K+  A I+ LQSIV+SQQA
Subjt:  TEIELFELTHHNEKNGWVKEAKEKFDKMVELKTASSQEGHEPLQDSEICQRVLGTRPGHVKGLGWGPRARHGKNEAIYQKSKETEALISQLQSIVQSQQA

Query:  TIEEILRRLPPSGESSSSNLEMNPEQSNDN
        TIE ILRRL  S + ++SN+E + EQSND+
Subjt:  TIEEILRRLPPSGESSSSNLEMNPEQSNDN

KAE8651942.1 hypothetical protein Csa_006405 [Cucumis sativus]2.1e-6363Show/hide
Query:  IMDINEPIVRDYLDHEMSVLHRDFRCSLHSTYRKYNSPTEARKHRDKRVARDLDWNRLCDRWETEEFKSLSEANSKARASLPFTHRGGTMPFLRHRKKRY
        I ++++P+V+DYLDHEMSVL+RDF CSLH +Y+K +SPTEARKH DKRVA+D DW RLCDRWE E FKS SEAN+KAR+ LPFTHRGGT+ FLRH K++ 
Subjt:  IMDINEPIVRDYLDHEMSVLHRDFRCSLHSTYRKYNSPTEARKHRDKRVARDLDWNRLCDRWETEEFKSLSEANSKARASLPFTHRGGTMPFLRHRKKRY

Query:  KEESKKLTEIELFELTHHNEKNGWVKEAKEKFDKMVELKTASSQEGHEPLQDSEICQRVLGTRPGHVKGLGWGPRARHGKNEAIYQKSKETEALISQLQS
        KEE K+++ IELF+ TH +EK  WV EAK K+D+MVELKT  SQEG + L + EIC+RVL  R G VK  GWGPR ++ +NEAI Q +K+  A I+ LQS
Subjt:  KEESKKLTEIELFELTHHNEKNGWVKEAKEKFDKMVELKTASSQEGHEPLQDSEICQRVLGTRPGHVKGLGWGPRARHGKNEAIYQKSKETEALISQLQS

TYJ98779.1 hypothetical protein E5676_scaffold156G00880 [Cucumis melo var. makuwa]3.4e-5845.99Show/hide
Query:  MSSRSFGPGNRSRPISNEQPTEESNEDTTREQEDDSTNSSAQ----------ITRGVALERYVRAHGRIHIKIELEDRKPVCKNSSKLNSTIGQLVRGTI
        MSSR+FG  NRSR  SN      SN+++  EQE  S +S  Q           TR V LE+YV+ +G   I+I+ EDRKP+  +       +  L     
Subjt:  MSSRSFGPGNRSRPISNEQPTEESNEDTTREQEDDSTNSSAQ----------ITRGVALERYVRAHGRIHIKIELEDRKPVCKNSSKLNSTIGQLVRGTI

Query:  PLQCEHWSDVSQENYFIMDINEPIVRDYLDHEMSVLHRDFRCSLHSTYRKYNSPTEARKHRDKRVARDLDWNRLCDRWETEEFKSLSEANSKARASLPFT
                     NYFI+D+NEP+V+DYL+HEMSVL+RDFRCSLH +Y+KY+SP +ARKHR KRVA           W       + +AN K       T
Subjt:  PLQCEHWSDVSQENYFIMDINEPIVRDYLDHEMSVLHRDFRCSLHSTYRKYNSPTEARKHRDKRVARDLDWNRLCDRWETEEFKSLSEANSKARASLPFT

Query:  HRGGTMPFLRHRKKRY----KEESKKLTEIELFELTHHNEKNGWVKEAKEKFDKMVELKTASSQEGHEPLQDSEICQRVLGTRPGHVKGLGWGPRARHGK
         R      LRH    +    K+E K+++EI LF+LTH NEKNGWV EAKEK+D+MVELKT SSQEG EPL + EIC++VLG R  HVKG G GPR ++  
Subjt:  HRGGTMPFLRHRKKRY----KEESKKLTEIELFELTHHNEKNGWVKEAKEKFDKMVELKTASSQEGHEPLQDSEICQRVLGTRPGHVKGLGWGPRARHGK

Query:  NEAIYQKSKETEALISQLQSIVQS
        NE I   +KE  A I+  QSIV+S
Subjt:  NEAIYQKSKETEALISQLQSIVQS

TYK04806.1 formin-like protein 4 isoform X2 [Cucumis melo var. makuwa]4.6e-6345.61Show/hide
Query:  MSSRSFGPGNRSRPISNEQPTEESNEDTTREQEDDSTNSSAQ----------ITRGVALERYVRAHGRIHIKIELEDRKPVCKNSSKLNSTIGQLVRGTI
        MSSR+FGPGNRSR  SN      SN+++  EQE  S +S  Q           TRGV LE+YV+A+G I I+I+ EDRKP+CK+SSKLNS IG+ VR  +
Subjt:  MSSRSFGPGNRSRPISNEQPTEESNEDTTREQEDDSTNSSAQ----------ITRGVALERYVRAHGRIHIKIELEDRKPVCKNSSKLNSTIGQLVRGTI

Query:  PLQCEHWSDVSQENYFIMDINEPIVRDYLDHEMSVLHRDFRCSLHSTYRKYNSPTEARKHRDKRVARDLDWNRLCDRWETEEFKSLSEANSKARASLPFT
        PL+CEHWSDVS+E        E + R  L HE +V +      +   Y    +    R    + +  DL+++   +                        
Subjt:  PLQCEHWSDVSQENYFIMDINEPIVRDYLDHEMSVLHRDFRCSLHSTYRKYNSPTEARKHRDKRVARDLDWNRLCDRWETEEFKSLSEANSKARASLPFT

Query:  HRGGTMPFLRHRKKRYKEESKKLTEIELFELTHHNEKNGWVKEAKEKFDKMVELKTASSQEGHEPLQDSEICQRVLGTRPGHVKGLGWGPRARHGKNEAI
                        KEE K+++EIELF+LTH NEKNGWV EAKEK+D+MVELKT SSQEG EPL + +IC RVLG R GHVKG GWGPR ++ +NE I
Subjt:  HRGGTMPFLRHRKKRYKEESKKLTEIELFELTHHNEKNGWVKEAKEKFDKMVELKTASSQEGHEPLQDSEICQRVLGTRPGHVKGLGWGPRARHGKNEAI

Query:  YQKSKETEALISQLQSIVQSQQATIEEILRRLPPSGESSSSNLEMNPEQSNDN
           +K+  A I+ LQSIV+SQQATIE ILRRL  S + ++SN+E + EQSND+
Subjt:  YQKSKETEALISQLQSIVQSQQATIEEILRRLPPSGESSSSNLEMNPEQSNDN

TrEMBL top hitse value%identityAlignment
A0A0A0LM17 Uncharacterized protein4.1e-4958.92Show/hide
Query:  TEESNEDTTREQEDDSTNSSAQITRGV-ALERYVRAHGRIHIKIELEDRKPVCKNSSKLNSTIGQLVRGTIPLQCEHWSDVSQENYFIMDINEPIVRDYL
        +  SN+++  EQE    +S  Q  +G+ ALE+YV+A+G I IKIEL+DRKP+CK+SSK NS IG                  +ENYFI+D+++P+V+DYL
Subjt:  TEESNEDTTREQEDDSTNSSAQITRGV-ALERYVRAHGRIHIKIELEDRKPVCKNSSKLNSTIGQLVRGTIPLQCEHWSDVSQENYFIMDINEPIVRDYL

Query:  DHEMSVLHRDFRCSLHSTYRKYNSPTEARKHRDKRVARDLDWNRLCDRWETEEFKSLSEANSKARASLPFTHRGGTMPFLRHRKK
        DHEMSVL+RDF CSLH +Y+K +SPTEARKH DKRVA+D DW RLCDRWE E FKS SEAN+KAR+ LPFTHRGGT+ FLRH++K
Subjt:  DHEMSVLHRDFRCSLHSTYRKYNSPTEARKHRDKRVARDLDWNRLCDRWETEEFKSLSEANSKARASLPFTHRGGTMPFLRHRKK

A0A5A7SQC8 DUF4218 domain-containing protein1.7e-5845.99Show/hide
Query:  MSSRSFGPGNRSRPISNEQPTEESNEDTTREQEDDSTNSSAQ----------ITRGVALERYVRAHGRIHIKIELEDRKPVCKNSSKLNSTIGQLVRGTI
        MSSR+FG  NRSR  SN      SN+++  EQE  S +S  Q           TR V LE+YV+ +G   I+I+ EDRKP+  +       +  L     
Subjt:  MSSRSFGPGNRSRPISNEQPTEESNEDTTREQEDDSTNSSAQ----------ITRGVALERYVRAHGRIHIKIELEDRKPVCKNSSKLNSTIGQLVRGTI

Query:  PLQCEHWSDVSQENYFIMDINEPIVRDYLDHEMSVLHRDFRCSLHSTYRKYNSPTEARKHRDKRVARDLDWNRLCDRWETEEFKSLSEANSKARASLPFT
                     NYFI+D+NEP+V+DYL+HEMSVL+RDFRCSLH +Y+KY+SP +ARKHR KRVA           W       + +AN K       T
Subjt:  PLQCEHWSDVSQENYFIMDINEPIVRDYLDHEMSVLHRDFRCSLHSTYRKYNSPTEARKHRDKRVARDLDWNRLCDRWETEEFKSLSEANSKARASLPFT

Query:  HRGGTMPFLRHRKKRY----KEESKKLTEIELFELTHHNEKNGWVKEAKEKFDKMVELKTASSQEGHEPLQDSEICQRVLGTRPGHVKGLGWGPRARHGK
         R      LRH    +    K+E K+++EI LF+LTH NEKNGWV EAKEK+D+MVELKT SSQEG EPL + EIC++VLG R  HVKG G GPR ++  
Subjt:  HRGGTMPFLRHRKKRY----KEESKKLTEIELFELTHHNEKNGWVKEAKEKFDKMVELKTASSQEGHEPLQDSEICQRVLGTRPGHVKGLGWGPRARHGK

Query:  NEAIYQKSKETEALISQLQSIVQS
        NE I   +KE  A I+  QSIV+S
Subjt:  NEAIYQKSKETEALISQLQSIVQS

A0A5A7TVR8 Formin-like protein 4 isoform X21.2e-5644.55Show/hide
Query:  SNEDTTREQEDDSTNSSAQ----------ITRGVALERYVRAHGRIHIKIELEDRKPVCKNSSKLNSTIGQLVRGTIPLQCEHWSDVSQENYFIMDINEP
        SN+++  EQE  S +S  Q           TRGV LE+YV+A+G I I+I+ EDRKP+CK+SSKLNS IG+ VR  +PL+CEHWSDVS+E        E 
Subjt:  SNEDTTREQEDDSTNSSAQ----------ITRGVALERYVRAHGRIHIKIELEDRKPVCKNSSKLNSTIGQLVRGTIPLQCEHWSDVSQENYFIMDINEP

Query:  IVRDYLDHEMSVLHRDFRCSLHSTYRKYNSPTEARKHRDKRVARDLDWNRLCDRWETEEFKSLSEANSKARASLPFTHRGGTMPFLRHRKKRYKEESKKL
        + R  L HE +V +      +   Y    +    R    + +  DL+++   +                                        KEE K++
Subjt:  IVRDYLDHEMSVLHRDFRCSLHSTYRKYNSPTEARKHRDKRVARDLDWNRLCDRWETEEFKSLSEANSKARASLPFTHRGGTMPFLRHRKKRYKEESKKL

Query:  TEIELFELTHHNEKNGWVKEAKEKFDKMVELKTASSQEGHEPLQDSEICQRVLGTRPGHVKGLGWGPRARHGKNEAIYQKSKETEALISQLQSIVQSQQA
        +EIELF+LTH NEKNGWV EAKEK+D+MVELKT SSQEG EPL + +IC RVLG R GHVKG GWGPR ++ +NE I   +K+  A I+ LQSIV+SQQA
Subjt:  TEIELFELTHHNEKNGWVKEAKEKFDKMVELKTASSQEGHEPLQDSEICQRVLGTRPGHVKGLGWGPRARHGKNEAIYQKSKETEALISQLQSIVQSQQA

Query:  TIEEILRRLPPSGESSSSNLEMNPEQSNDN
        TIE ILRRL  S + ++SN+E + EQSND+
Subjt:  TIEEILRRLPPSGESSSSNLEMNPEQSNDN

A0A5D3BIG6 DUF4218 domain-containing protein1.7e-5845.99Show/hide
Query:  MSSRSFGPGNRSRPISNEQPTEESNEDTTREQEDDSTNSSAQ----------ITRGVALERYVRAHGRIHIKIELEDRKPVCKNSSKLNSTIGQLVRGTI
        MSSR+FG  NRSR  SN      SN+++  EQE  S +S  Q           TR V LE+YV+ +G   I+I+ EDRKP+  +       +  L     
Subjt:  MSSRSFGPGNRSRPISNEQPTEESNEDTTREQEDDSTNSSAQ----------ITRGVALERYVRAHGRIHIKIELEDRKPVCKNSSKLNSTIGQLVRGTI

Query:  PLQCEHWSDVSQENYFIMDINEPIVRDYLDHEMSVLHRDFRCSLHSTYRKYNSPTEARKHRDKRVARDLDWNRLCDRWETEEFKSLSEANSKARASLPFT
                     NYFI+D+NEP+V+DYL+HEMSVL+RDFRCSLH +Y+KY+SP +ARKHR KRVA           W       + +AN K       T
Subjt:  PLQCEHWSDVSQENYFIMDINEPIVRDYLDHEMSVLHRDFRCSLHSTYRKYNSPTEARKHRDKRVARDLDWNRLCDRWETEEFKSLSEANSKARASLPFT

Query:  HRGGTMPFLRHRKKRY----KEESKKLTEIELFELTHHNEKNGWVKEAKEKFDKMVELKTASSQEGHEPLQDSEICQRVLGTRPGHVKGLGWGPRARHGK
         R      LRH    +    K+E K+++EI LF+LTH NEKNGWV EAKEK+D+MVELKT SSQEG EPL + EIC++VLG R  HVKG G GPR ++  
Subjt:  HRGGTMPFLRHRKKRY----KEESKKLTEIELFELTHHNEKNGWVKEAKEKFDKMVELKTASSQEGHEPLQDSEICQRVLGTRPGHVKGLGWGPRARHGK

Query:  NEAIYQKSKETEALISQLQSIVQS
        NE I   +KE  A I+  QSIV+S
Subjt:  NEAIYQKSKETEALISQLQSIVQS

A0A5D3C2Z5 Formin-like protein 4 isoform X22.2e-6345.61Show/hide
Query:  MSSRSFGPGNRSRPISNEQPTEESNEDTTREQEDDSTNSSAQ----------ITRGVALERYVRAHGRIHIKIELEDRKPVCKNSSKLNSTIGQLVRGTI
        MSSR+FGPGNRSR  SN      SN+++  EQE  S +S  Q           TRGV LE+YV+A+G I I+I+ EDRKP+CK+SSKLNS IG+ VR  +
Subjt:  MSSRSFGPGNRSRPISNEQPTEESNEDTTREQEDDSTNSSAQ----------ITRGVALERYVRAHGRIHIKIELEDRKPVCKNSSKLNSTIGQLVRGTI

Query:  PLQCEHWSDVSQENYFIMDINEPIVRDYLDHEMSVLHRDFRCSLHSTYRKYNSPTEARKHRDKRVARDLDWNRLCDRWETEEFKSLSEANSKARASLPFT
        PL+CEHWSDVS+E        E + R  L HE +V +      +   Y    +    R    + +  DL+++   +                        
Subjt:  PLQCEHWSDVSQENYFIMDINEPIVRDYLDHEMSVLHRDFRCSLHSTYRKYNSPTEARKHRDKRVARDLDWNRLCDRWETEEFKSLSEANSKARASLPFT

Query:  HRGGTMPFLRHRKKRYKEESKKLTEIELFELTHHNEKNGWVKEAKEKFDKMVELKTASSQEGHEPLQDSEICQRVLGTRPGHVKGLGWGPRARHGKNEAI
                        KEE K+++EIELF+LTH NEKNGWV EAKEK+D+MVELKT SSQEG EPL + +IC RVLG R GHVKG GWGPR ++ +NE I
Subjt:  HRGGTMPFLRHRKKRYKEESKKLTEIELFELTHHNEKNGWVKEAKEKFDKMVELKTASSQEGHEPLQDSEICQRVLGTRPGHVKGLGWGPRARHGKNEAI

Query:  YQKSKETEALISQLQSIVQSQQATIEEILRRLPPSGESSSSNLEMNPEQSNDN
           +K+  A I+ LQSIV+SQQATIE ILRRL  S + ++SN+E + EQSND+
Subjt:  YQKSKETEALISQLQSIVQSQQATIEEILRRLPPSGESSSSNLEMNPEQSNDN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTTCGAGAAGTTTCGGACCAGGAAATCGTAGTAGGCCCATATCAAACGAACAACCAACGGAGGAATCTAATGAAGATACTACTAGAGAGCAAGAAGATGATTCTAC
AAATTCAAGTGCTCAGATAACTAGAGGGGTTGCACTTGAGAGATATGTTCGAGCTCATGGTCGGATTCATATTAAAATTGAGCTTGAAGATAGAAAACCAGTTTGCAAGA
ACTCATCCAAATTGAACTCCACCATCGGACAACTGGTACGGGGAACAATACCTCTTCAGTGCGAACATTGGTCAGATGTCTCTCAAGAGAATTACTTCATTATGGATATC
AATGAACCCATAGTACGAGACTACCTAGATCATGAGATGAGTGTATTACACAGGGATTTTCGTTGTTCGTTACACAGTACTTATAGAAAATACAATTCTCCAACTGAAGC
ACGAAAACATCGAGACAAACGAGTGGCACGAGATTTAGATTGGAATCGTTTGTGTGATCGGTGGGAAACAGAAGAATTTAAGAGTCTTTCTGAAGCAAATTCTAAGGCTC
GAGCCTCTCTTCCATTCACTCATAGAGGTGGTACTATGCCATTTTTACGCCATAGGAAAAAAAGGTATAAAGAAGAGAGTAAAAAGTTAACTGAAATTGAGTTATTCGAG
CTCACTCACCATAATGAAAAGAATGGATGGGTGAAAGAGGCAAAAGAGAAATTTGACAAAATGGTGGAATTAAAAACTGCTTCATCACAAGAAGGCCATGAACCTCTTCA
GGATAGTGAAATATGTCAACGTGTACTAGGTACACGACCTGGGCATGTGAAGGGTCTCGGTTGGGGACCAAGAGCAAGGCATGGTAAAAATGAAGCAATCTACCAAAAAT
CTAAAGAAACGGAGGCTTTGATTTCTCAATTGCAAAGTATAGTTCAATCACAACAAGCCACAATTGAAGAAATTTTAAGAAGGTTGCCACCAAGTGGAGAAAGTTCATCA
TCTAATCTAGAAATGAATCCAGAACAATCAAATGACAATTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCTTCGAGAAGTTTCGGACCAGGAAATCGTAGTAGGCCCATATCAAACGAACAACCAACGGAGGAATCTAATGAAGATACTACTAGAGAGCAAGAAGATGATTCTAC
AAATTCAAGTGCTCAGATAACTAGAGGGGTTGCACTTGAGAGATATGTTCGAGCTCATGGTCGGATTCATATTAAAATTGAGCTTGAAGATAGAAAACCAGTTTGCAAGA
ACTCATCCAAATTGAACTCCACCATCGGACAACTGGTACGGGGAACAATACCTCTTCAGTGCGAACATTGGTCAGATGTCTCTCAAGAGAATTACTTCATTATGGATATC
AATGAACCCATAGTACGAGACTACCTAGATCATGAGATGAGTGTATTACACAGGGATTTTCGTTGTTCGTTACACAGTACTTATAGAAAATACAATTCTCCAACTGAAGC
ACGAAAACATCGAGACAAACGAGTGGCACGAGATTTAGATTGGAATCGTTTGTGTGATCGGTGGGAAACAGAAGAATTTAAGAGTCTTTCTGAAGCAAATTCTAAGGCTC
GAGCCTCTCTTCCATTCACTCATAGAGGTGGTACTATGCCATTTTTACGCCATAGGAAAAAAAGGTATAAAGAAGAGAGTAAAAAGTTAACTGAAATTGAGTTATTCGAG
CTCACTCACCATAATGAAAAGAATGGATGGGTGAAAGAGGCAAAAGAGAAATTTGACAAAATGGTGGAATTAAAAACTGCTTCATCACAAGAAGGCCATGAACCTCTTCA
GGATAGTGAAATATGTCAACGTGTACTAGGTACACGACCTGGGCATGTGAAGGGTCTCGGTTGGGGACCAAGAGCAAGGCATGGTAAAAATGAAGCAATCTACCAAAAAT
CTAAAGAAACGGAGGCTTTGATTTCTCAATTGCAAAGTATAGTTCAATCACAACAAGCCACAATTGAAGAAATTTTAAGAAGGTTGCCACCAAGTGGAGAAAGTTCATCA
TCTAATCTAGAAATGAATCCAGAACAATCAAATGACAATTAA
Protein sequenceShow/hide protein sequence
MSSRSFGPGNRSRPISNEQPTEESNEDTTREQEDDSTNSSAQITRGVALERYVRAHGRIHIKIELEDRKPVCKNSSKLNSTIGQLVRGTIPLQCEHWSDVSQENYFIMDI
NEPIVRDYLDHEMSVLHRDFRCSLHSTYRKYNSPTEARKHRDKRVARDLDWNRLCDRWETEEFKSLSEANSKARASLPFTHRGGTMPFLRHRKKRYKEESKKLTEIELFE
LTHHNEKNGWVKEAKEKFDKMVELKTASSQEGHEPLQDSEICQRVLGTRPGHVKGLGWGPRARHGKNEAIYQKSKETEALISQLQSIVQSQQATIEEILRRLPPSGESSS
SNLEMNPEQSNDN