; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc04G03110 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc04G03110
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationClcChr04:9461903..9465123
RNA-Seq ExpressionClc04G03110
SyntenyClc04G03110
Gene Ontology termsGO:0006725 - cellular aromatic compound metabolic process (biological process)
GO:0008198 - ferrous iron binding (molecular function)
GO:0016491 - oxidoreductase activity (molecular function)
InterPro domainsIPR004183 - Extradiol ring-cleavage dioxygenase, class III enzyme, subunit B
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAB75469.1 copia-type reverse transcriptase-like protein [Arabidopsis thaliana]9.4e-5948.96Show/hide
Query:  MDVKSAFLNGVLEEKVYIKQPQGYQVEGKEEKVLKLKKALYELKQS-KSLECSNRQVSTQRNFTKCPYEHALCIKVQNNDVLIVCLYVDDLIFTGSNPSM
        MDVKSAFLNG LEE+VYI+QPQGY V+G+E+KVL+LKK LY LKQ+ ++      +   +++F KCPYEHAL IK+Q  D+LI CLYVDDLIFTG+NPSM
Subjt:  MDVKSAFLNGVLEEKVYIKQPQGYQVEGKEEKVLKLKKALYELKQS-KSLECSNRQVSTQRNFTKCPYEHALCIKVQNNDVLIVCLYVDDLIFTGSNPSM

Query:  FNKFKEEMTNEFEMTDIGLMSYYL----------------------------------------------------VGVISRFMEKPTTTHLKAAKRILR
        F +FK+EMT EFEMTDIGLMSYYL                                                    VGV+SR+ME PTTTH KAAKRILR
Subjt:  FNKFKEEMTNEFEMTDIGLMSYYL----------------------------------------------------VGVISRFMEKPTTTHLKAAKRILR

Query:  HIK--------------------------------------------DNKSAIALAKNPVFHNRSKHINTRYHYIRECIKRKNVQLKY
        +IK                                            DNKSAIALAKNPVFH+RSKHI+TRYHYIREC+ +K+VQL+Y
Subjt:  HIK--------------------------------------------DNKSAIALAKNPVFHNRSKHINTRYHYIRECIKRKNVQLKY

KAE8673808.1 hypothetical protein F3Y22_tig00111772pilonHSYRG00252 [Hibiscus syriacus]4.7e-5041.56Show/hide
Query:  MDVKSAFLNGVLEEKVYIKQPQGYQVEGKEEKVLKLKKALYELKQSKSLECSNRQVSTQRN-FTKCPYEHALCIKVQNNDVLIVCLYVDDLIFTGSNPSM
        MDVKSAFLNGVLEE+VYI+QP GY+V+G E+KVLKLKK LY LKQ+     S      Q N F KCPYEHAL IK+++ D+LIVCLYVDDLIFTGSNP+M
Subjt:  MDVKSAFLNGVLEEKVYIKQPQGYQVEGKEEKVLKLKKALYELKQSKSLECSNRQVSTQRN-FTKCPYEHALCIKVQNNDVLIVCLYVDDLIFTGSNPSM

Query:  FNKFKEEMTNEFEMTDIGLMSYYL------------------------------VGVISRFMEKPTTTHLKAAKRILRHIK-------------------
        F +FK+ M   FEMT++GLM+YYL                               G++SR+ME PTTTH KAAKRILR++K                   
Subjt:  FNKFKEEMTNEFEMTDIGLMSYYL------------------------------VGVISRFMEKPTTTHLKAAKRILRHIK-------------------

Query:  -------------------------------------------------------------------------------DNKSAIALAKNPVFHNRSKHI
                                                                                       DNKSAIALAKNPVFH+RSKHI
Subjt:  -------------------------------------------------------------------------------DNKSAIALAKNPVFHNRSKHI

Query:  NTRYHYIRECIKRKNVQLKY
        + RYHYIREC+ R +V+++Y
Subjt:  NTRYHYIRECIKRKNVQLKY

KAE8683276.1 TMV resistance protein N-like [Hibiscus syriacus]1.7e-5242.81Show/hide
Query:  MDVKSAFLNGVLEEKVYIKQPQGYQVEGKEEKVLKLKKALYELKQSKSLECSNRQVSTQRN-FTKCPYEHALCIKVQNNDVLIVCLYVDDLIFTGSNPSM
        MDVKSAFLNGVLEE++YI+QP GY+V+G E+KVLKLKKALY LKQ+     S      Q N F KCPYEHAL IK+++ D+LIVCLYVDDLIFTGSNPSM
Subjt:  MDVKSAFLNGVLEEKVYIKQPQGYQVEGKEEKVLKLKKALYELKQSKSLECSNRQVSTQRN-FTKCPYEHALCIKVQNNDVLIVCLYVDDLIFTGSNPSM

Query:  FNKFKEEMTNEFEMTDIGLMSYYL------------------------------VGVISRFMEKPTTTHLKAAKRILRHIK-------------------
        FN+FK+ M  EFEMTD+GLM+YYL                               G++SR+ME PTT H KAAKRILR++K                   
Subjt:  FNKFKEEMTNEFEMTDIGLMSYYL------------------------------VGVISRFMEKPTTTHLKAAKRILRHIK-------------------

Query:  -------------------------------------------------------------------------------DNKSAIALAKNPVFHNRSKHI
                                                                                       DNKSAIALAKNPVFH+RSKHI
Subjt:  -------------------------------------------------------------------------------DNKSAIALAKNPVFHNRSKHI

Query:  NTRYHYIRECIKRKNVQLKY
        + RYHYIREC+ RK+V+++Y
Subjt:  NTRYHYIRECIKRKNVQLKY

KYP69041.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]2.7e-5038.59Show/hide
Query:  MDVKSAFLNGVLEEKVYIKQPQGYQVEGKEEKVLKLKKALYELKQS-KSLECSNRQVSTQRNFTKCPYEHALCIKVQNNDVLIVCLYVDDLIFTGSNPSM
        MDVKSAFLNGVLEE+VYI+QPQGY+V+G+E+KVL+LKKALY LKQ+ ++      +   + NF KCPYEHAL IK Q  D+LIVCLYVDDLIFTG+NPSM
Subjt:  MDVKSAFLNGVLEEKVYIKQPQGYQVEGKEEKVLKLKKALYELKQS-KSLECSNRQVSTQRNFTKCPYEHALCIKVQNNDVLIVCLYVDDLIFTGSNPSM

Query:  FNKFKEEMTNEFEMTDIGLMSYYL----------------------------------------------------------------------------
        F +FK++MT EFEMTD+GLM+YYL                                                                            
Subjt:  FNKFKEEMTNEFEMTDIGLMSYYL----------------------------------------------------------------------------

Query:  --VGVISRFMEKPTTTHLKAAKRILRHIK-----------------------------------------------------------------------
          VGV+SR+ME PTTTHLK AKRILR+IK                                                                       
Subjt:  --VGVISRFMEKPTTTHLKAAKRILRHIK-----------------------------------------------------------------------

Query:  ---------------------------DNKSAIALAKNPVFHNRSKHINTRYHYIRECIKRKNVQLKY
                                   DNKSAIALAKNPVFH+RSKHI+TRYHYIRECI  K+VQ++Y
Subjt:  ---------------------------DNKSAIALAKNPVFHNRSKHINTRYHYIRECIKRKNVQLKY

XP_013583262.1 PREDICTED: LOW QUALITY PROTEIN: copia protein [Brassica oleracea var. oleracea]1.4e-5444.72Show/hide
Query:  MDVKSAFLNGVLEEKVYIKQPQGYQVEGKEEKVLKLKKALYELKQSKSLECSNRQVS---TQRNFTKCPYEHALCIKVQNNDVLIVCLYVDDLIFTGSNP
        MDVKSAFLNG LEE+VYI+QPQGY V+G+E+KVL+LKKALY LKQ+      N Q+     ++ F KCPYEHAL IK QNND+LI CLYVDDLIFTG+NP
Subjt:  MDVKSAFLNGVLEEKVYIKQPQGYQVEGKEEKVLKLKKALYELKQSKSLECSNRQVS---TQRNFTKCPYEHALCIKVQNNDVLIVCLYVDDLIFTGSNP

Query:  SMFNKFKEEMTNEFEMTDIGLMSYYL-------------------------VGVISRFMEKPTTTHLKAAKRILRHIK----------------------
         MF  FK EMT EF MTDIGLMSYYL                         VGV+SR+ME PTTTH KAAKRILR+IK                      
Subjt:  SMFNKFKEEMTNEFEMTDIGLMSYYL-------------------------VGVISRFMEKPTTTHLKAAKRILRHIK----------------------

Query:  ----------------------------------------------------------------------------DNKSAIALAKNPVFHNRSKHINTR
                                                                                    DNKSAIALAKNPVFH+RSKHI+TR
Subjt:  ----------------------------------------------------------------------------DNKSAIALAKNPVFHNRSKHINTR

Query:  YHYIRECIKRKNVQLKYNDVNE
        YHYIREC+ + +VQL+Y   N+
Subjt:  YHYIRECIKRKNVQLKYNDVNE

TrEMBL top hitse value%identityAlignment
A0A151RPT4 Retrovirus-related Pol polyprotein from transposon TNT 1-942.3e-5038.59Show/hide
Query:  MDVKSAFLNGVLEEKVYIKQPQGYQVEGKEEKVLKLKKALYELKQS-KSLECSNRQVSTQRNFTKCPYEHALCIKVQNNDVLIVCLYVDDLIFTGSNPSM
        MDVKSAFLNGVLEE+VYI+QPQGY+V+G+E+KVL+LKKALY LKQ+ ++      +   + NF KCPYEHAL IK Q  D+LIVCLYVDDLIFTG+NPSM
Subjt:  MDVKSAFLNGVLEEKVYIKQPQGYQVEGKEEKVLKLKKALYELKQS-KSLECSNRQVSTQRNFTKCPYEHALCIKVQNNDVLIVCLYVDDLIFTGSNPSM

Query:  FNKFKEEMTNEFEMTDIGLMSYYL----------------------------------------------------------------------------
        F +FK++MT EFEMTD+GLM+YYL                                                                            
Subjt:  FNKFKEEMTNEFEMTDIGLMSYYL----------------------------------------------------------------------------

Query:  --VGVISRFMEKPTTTHLKAAKRILRHIK-----------------------------------------------------------------------
          VGV+SR+ME PTTTHLK AKRILR+IK                                                                       
Subjt:  --VGVISRFMEKPTTTHLKAAKRILRHIK-----------------------------------------------------------------------

Query:  ---------------------------DNKSAIALAKNPVFHNRSKHINTRYHYIRECIKRKNVQLKY
                                   DNKSAIALAKNPVFH+RSKHI+TRYHYIRECI  K+VQ++Y
Subjt:  ---------------------------DNKSAIALAKNPVFHNRSKHINTRYHYIRECIKRKNVQLKY

A0A151TPU3 Retrovirus-related Pol polyprotein from transposon TNT 1-941.3e-5038.59Show/hide
Query:  MDVKSAFLNGVLEEKVYIKQPQGYQVEGKEEKVLKLKKALYELKQS-KSLECSNRQVSTQRNFTKCPYEHALCIKVQNNDVLIVCLYVDDLIFTGSNPSM
        MDVKSAFLNGVLEE+VYI+QPQGY+V+G+E+KVL+LKKALY LKQ+ ++      +   + NF KCPYEHAL IK Q  D+LIVCLYVDDLIFTG+NPSM
Subjt:  MDVKSAFLNGVLEEKVYIKQPQGYQVEGKEEKVLKLKKALYELKQS-KSLECSNRQVSTQRNFTKCPYEHALCIKVQNNDVLIVCLYVDDLIFTGSNPSM

Query:  FNKFKEEMTNEFEMTDIGLMSYYL----------------------------------------------------------------------------
        F +FK++MT EFEMTD+GLM+YYL                                                                            
Subjt:  FNKFKEEMTNEFEMTDIGLMSYYL----------------------------------------------------------------------------

Query:  --VGVISRFMEKPTTTHLKAAKRILRHIK-----------------------------------------------------------------------
          VGV+SR+ME PTTTHLK AKRILR+IK                                                                       
Subjt:  --VGVISRFMEKPTTTHLKAAKRILRHIK-----------------------------------------------------------------------

Query:  ---------------------------DNKSAIALAKNPVFHNRSKHINTRYHYIRECIKRKNVQLKY
                                   DNKSAIALAKNPVFH+RSKHI+TRYHYIRECI  K+VQ++Y
Subjt:  ---------------------------DNKSAIALAKNPVFHNRSKHINTRYHYIRECIKRKNVQLKY

A0A6A2XEN6 Uncharacterized protein2.3e-5041.56Show/hide
Query:  MDVKSAFLNGVLEEKVYIKQPQGYQVEGKEEKVLKLKKALYELKQSKSLECSNRQVSTQRN-FTKCPYEHALCIKVQNNDVLIVCLYVDDLIFTGSNPSM
        MDVKSAFLNGVLEE+VYI+QP GY+V+G E+KVLKLKK LY LKQ+     S      Q N F KCPYEHAL IK+++ D+LIVCLYVDDLIFTGSNP+M
Subjt:  MDVKSAFLNGVLEEKVYIKQPQGYQVEGKEEKVLKLKKALYELKQSKSLECSNRQVSTQRN-FTKCPYEHALCIKVQNNDVLIVCLYVDDLIFTGSNPSM

Query:  FNKFKEEMTNEFEMTDIGLMSYYL------------------------------VGVISRFMEKPTTTHLKAAKRILRHIK-------------------
        F +FK+ M   FEMT++GLM+YYL                               G++SR+ME PTTTH KAAKRILR++K                   
Subjt:  FNKFKEEMTNEFEMTDIGLMSYYL------------------------------VGVISRFMEKPTTTHLKAAKRILRHIK-------------------

Query:  -------------------------------------------------------------------------------DNKSAIALAKNPVFHNRSKHI
                                                                                       DNKSAIALAKNPVFH+RSKHI
Subjt:  -------------------------------------------------------------------------------DNKSAIALAKNPVFHNRSKHI

Query:  NTRYHYIRECIKRKNVQLKY
        + RYHYIREC+ R +V+++Y
Subjt:  NTRYHYIRECIKRKNVQLKY

A0A6A2YV41 TMV resistance protein N-like8.3e-5342.81Show/hide
Query:  MDVKSAFLNGVLEEKVYIKQPQGYQVEGKEEKVLKLKKALYELKQSKSLECSNRQVSTQRN-FTKCPYEHALCIKVQNNDVLIVCLYVDDLIFTGSNPSM
        MDVKSAFLNGVLEE++YI+QP GY+V+G E+KVLKLKKALY LKQ+     S      Q N F KCPYEHAL IK+++ D+LIVCLYVDDLIFTGSNPSM
Subjt:  MDVKSAFLNGVLEEKVYIKQPQGYQVEGKEEKVLKLKKALYELKQSKSLECSNRQVSTQRN-FTKCPYEHALCIKVQNNDVLIVCLYVDDLIFTGSNPSM

Query:  FNKFKEEMTNEFEMTDIGLMSYYL------------------------------VGVISRFMEKPTTTHLKAAKRILRHIK-------------------
        FN+FK+ M  EFEMTD+GLM+YYL                               G++SR+ME PTT H KAAKRILR++K                   
Subjt:  FNKFKEEMTNEFEMTDIGLMSYYL------------------------------VGVISRFMEKPTTTHLKAAKRILRHIK-------------------

Query:  -------------------------------------------------------------------------------DNKSAIALAKNPVFHNRSKHI
                                                                                       DNKSAIALAKNPVFH+RSKHI
Subjt:  -------------------------------------------------------------------------------DNKSAIALAKNPVFHNRSKHI

Query:  NTRYHYIRECIKRKNVQLKY
        + RYHYIREC+ RK+V+++Y
Subjt:  NTRYHYIRECIKRKNVQLKY

Q9M197 Copia-type reverse transcriptase-like protein4.6e-5948.96Show/hide
Query:  MDVKSAFLNGVLEEKVYIKQPQGYQVEGKEEKVLKLKKALYELKQS-KSLECSNRQVSTQRNFTKCPYEHALCIKVQNNDVLIVCLYVDDLIFTGSNPSM
        MDVKSAFLNG LEE+VYI+QPQGY V+G+E+KVL+LKK LY LKQ+ ++      +   +++F KCPYEHAL IK+Q  D+LI CLYVDDLIFTG+NPSM
Subjt:  MDVKSAFLNGVLEEKVYIKQPQGYQVEGKEEKVLKLKKALYELKQS-KSLECSNRQVSTQRNFTKCPYEHALCIKVQNNDVLIVCLYVDDLIFTGSNPSM

Query:  FNKFKEEMTNEFEMTDIGLMSYYL----------------------------------------------------VGVISRFMEKPTTTHLKAAKRILR
        F +FK+EMT EFEMTDIGLMSYYL                                                    VGV+SR+ME PTTTH KAAKRILR
Subjt:  FNKFKEEMTNEFEMTDIGLMSYYL----------------------------------------------------VGVISRFMEKPTTTHLKAAKRILR

Query:  HIK--------------------------------------------DNKSAIALAKNPVFHNRSKHINTRYHYIRECIKRKNVQLKY
        +IK                                            DNKSAIALAKNPVFH+RSKHI+TRYHYIREC+ +K+VQL+Y
Subjt:  HIK--------------------------------------------DNKSAIALAKNPVFHNRSKHINTRYHYIRECIKRKNVQLKY

SwissProt top hitse value%identityAlignment
I3PFJ3 4,5-DOPA dioxygenase extradiol 11.1e-1450.7Show/hide
Query:  IKRKNVQLKYNDVNEYEKKAPHAKMAHPSPDHFYPLHVAIGAAGGDPKAKLIHHSWGQGTMSYASYQFTAS
        +++  +  ++ +VN YE KAP+ K+AHP P+HFYPLHV +GAAG   KA+LIH SW  GT+ + SY+FT++
Subjt:  IKRKNVQLKYNDVNEYEKKAPHAKMAHPSPDHFYPLHVAIGAAGGDPKAKLIHHSWGQGTMSYASYQFTAS

I3PFJ9 4,5-DOPA dioxygenase extradiol 18.7e-1550.7Show/hide
Query:  IKRKNVQLKYNDVNEYEKKAPHAKMAHPSPDHFYPLHVAIGAAGGDPKAKLIHHSWGQGTMSYASYQFTAS
        +++  +  ++ +VN YE KAP+ K+AHP P+HFYPLHV +GAAG   KA+LIH SW  GT+ + SY+FT++
Subjt:  IKRKNVQLKYNDVNEYEKKAPHAKMAHPSPDHFYPLHVAIGAAGGDPKAKLIHHSWGQGTMSYASYQFTAS

Q70FG7 4,5-DOPA dioxygenase extradiol3.2e-1764.52Show/hide
Query:  YNDVNEYEKKAPHAKMAHPSPDHFYPLHVAIGAAGGDPKAKLIHHSWGQGTMSYASYQFTAS
        Y +VN+YE KAP+ K+AHP P+HFYPLHVA+GAAG + KA+LIH+SW  G MSY SY+FT++
Subjt:  YNDVNEYEKKAPHAKMAHPSPDHFYPLHVAIGAAGGDPKAKLIHHSWGQGTMSYASYQFTAS

Q7XA48 4,5-DOPA dioxygenase extradiol5.6e-1457.14Show/hide
Query:  KYNDVNEYEKKAPHA-KMAHPSPDHFYPLHVAIGAAGGDPKAKLIHHSWGQGTMSYASYQFTA
        +Y DVN Y+ KAP   K+AHP P+HF PLHVA+GA G   KA+LI+ +W  GT+ YASY+FT+
Subjt:  KYNDVNEYEKKAPHA-KMAHPSPDHFYPLHVAIGAAGGDPKAKLIHHSWGQGTMSYASYQFTA

Q949R4 Extradiol ring-cleavage dioxygenase9.0e-2059.74Show/hide
Query:  HYIRECIKRKNVQLKYNDVNEYEKKAPHAKMAHPSPDHFYPLHVAIGAAGGDPKAKLIHHSWGQGTMSYASYQFTAS
        H++R+ +    +Q +Y DVNE+E+KAP+AKMAHP P+H YPLHV +GAAGGD KA+ IH SW  GT+SY+SY FT+S
Subjt:  HYIRECIKRKNVQLKYNDVNEYEKKAPHAKMAHPSPDHFYPLHVAIGAAGGDPKAKLIHHSWGQGTMSYASYQFTAS

Arabidopsis top hitse value%identityAlignment
AT4G15093.1 catalytic LigB subunit of aromatic ring-opening dioxygenase family6.4e-2159.74Show/hide
Query:  HYIRECIKRKNVQLKYNDVNEYEKKAPHAKMAHPSPDHFYPLHVAIGAAGGDPKAKLIHHSWGQGTMSYASYQFTAS
        H++R+ +    +Q +Y DVNE+E+KAP+AKMAHP P+H YPLHV +GAAGGD KA+ IH SW  GT+SY+SY FT+S
Subjt:  HYIRECIKRKNVQLKYNDVNEYEKKAPHAKMAHPSPDHFYPLHVAIGAAGGDPKAKLIHHSWGQGTMSYASYQFTAS

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.3e-1030.6Show/hide
Query:  MDVKSAFLNGVLEEKVYIKQPQGYQVEGKE----EKVLKLKKALYELKQSKSLECSNRQVS-TQRNFTKCPYEHALCIKVQNNDVLIVCLYVDDLIFTGS
        +D+ +AFLNG L+E++Y+K P GY     +      V  LKK++Y LKQ+         V+     F +   +H   +K+     L V +YVDD+I   +
Subjt:  MDVKSAFLNGVLEEKVYIKQPQGYQVEGKE----EKVLKLKKALYELKQSKSLECSNRQVS-TQRNFTKCPYEHALCIKVQNNDVLIVCLYVDDLIFTGS

Query:  NPSMFNKFKEEMTNEFEMTDIGLMSYYLVGVISR
        N +  ++ K ++ + F++ D+G + Y+L   I+R
Subjt:  NPSMFNKFKEEMTNEFEMTDIGLMSYYLVGVISR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGTGAAATCTGCTTTTCTCAACGGAGTCCTTGAGGAAAAAGTCTACATTAAACAACCCCAAGGTTATCAAGTTGAGGGAAAAGAAGAAAAGGTTTTGAAGCTGAA
AAAGGCACTCTATGAGTTGAAGCAATCCAAGAGCTTGGAATGCTCGAATAGACAAGTATCTACACAAAGAAATTTCACCAAGTGTCCATATGAGCATGCACTATGCATCA
AAGTCCAAAATAATGATGTATTGATTGTGTGCTTGTATGTAGATGACTTGATCTTTACAGGTAGTAATCCAAGCATGTTCAACAAGTTTAAAGAAGAGATGACAAACGAG
TTTGAGATGACTGACATCGGCCTCATGTCTTACTACCTTGTTGGAGTTATCAGTCGCTTCATGGAGAAACCAACGACCACACACTTGAAGGCAGCAAAGAGAATTCTTCG
GCACATCAAAGACAACAAGTCAGCAATTGCTTTGGCCAAGAATCCAGTCTTCCACAACCGGAGCAAGCATATCAATACACGTTATCATTATATCAGAGAGTGCATCAAAA
GAAAGAATGTGCAACTAAAATACAATGATGTGAACGAGTATGAGAAGAAGGCTCCACATGCAAAAATGGCACATCCAAGTCCAGACCACTTTTACCCACTGCATGTTGCG
ATCGGAGCAGCAGGAGGCGACCCGAAAGCCAAGCTTATCCACCATAGCTGGGGCCAAGGAACCATGTCCTACGCTTCCTATCAATTCACAGCCTCTTCTCCCATGAACGA
AAGTAACCTATGGATGTGTTATCAATCTGAAACGCAAAGCATCGGGAAATCAATCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATGTGAAATCTGCTTTTCTCAACGGAGTCCTTGAGGAAAAAGTCTACATTAAACAACCCCAAGGTTATCAAGTTGAGGGAAAAGAAGAAAAGGTTTTGAAGCTGAA
AAAGGCACTCTATGAGTTGAAGCAATCCAAGAGCTTGGAATGCTCGAATAGACAAGTATCTACACAAAGAAATTTCACCAAGTGTCCATATGAGCATGCACTATGCATCA
AAGTCCAAAATAATGATGTATTGATTGTGTGCTTGTATGTAGATGACTTGATCTTTACAGGTAGTAATCCAAGCATGTTCAACAAGTTTAAAGAAGAGATGACAAACGAG
TTTGAGATGACTGACATCGGCCTCATGTCTTACTACCTTGTTGGAGTTATCAGTCGCTTCATGGAGAAACCAACGACCACACACTTGAAGGCAGCAAAGAGAATTCTTCG
GCACATCAAAGACAACAAGTCAGCAATTGCTTTGGCCAAGAATCCAGTCTTCCACAACCGGAGCAAGCATATCAATACACGTTATCATTATATCAGAGAGTGCATCAAAA
GAAAGAATGTGCAACTAAAATACAATGATGTGAACGAGTATGAGAAGAAGGCTCCACATGCAAAAATGGCACATCCAAGTCCAGACCACTTTTACCCACTGCATGTTGCG
ATCGGAGCAGCAGGAGGCGACCCGAAAGCCAAGCTTATCCACCATAGCTGGGGCCAAGGAACCATGTCCTACGCTTCCTATCAATTCACAGCCTCTTCTCCCATGAACGA
AAGTAACCTATGGATGTGTTATCAATCTGAAACGCAAAGCATCGGGAAATCAATCTGAAACGCAAGGTAGCGTAAAATAAGACAAGAATCTTGAGAGATTGTCGGGAGAT
TGATGTGCTTGATCTGCGGCTAAAGTGGTGGATCTAAATTAAACCGGAAATCACGCGATTAGGCAAGATAGACGCAAGGCAGCAATTCAGCCTCAAATCATGCAGATTTG
CTCAACGATTGCAATGCAGCATCTCTAAAAGGGCAGACCTGATGGCTGAAAATGTTGTTAGATCTCTCGGAAGAATTTCTAAGTGACAGCTGTCACACCCTACTTTTTAA
GAAACCGATGCGTTGAAGAGTCTATGAAGAAAAGAGGTAAAAGACCATGTTGGATGCATCGTCAGGACTAGGTAACGCATGGTAAGGGTGGATGCATGGAAGTAAACCGA
GAAGGGAAAGAAGTGAAACGCATAACCAAGAAAGGAATAAAGTTATGTTGTGAAGTGTTGAAAGAGATTTAAGTCTTTGAAAGATAGGATACTATAGCAAATTGAGTAAC
GTAT
Protein sequenceShow/hide protein sequence
MDVKSAFLNGVLEEKVYIKQPQGYQVEGKEEKVLKLKKALYELKQSKSLECSNRQVSTQRNFTKCPYEHALCIKVQNNDVLIVCLYVDDLIFTGSNPSMFNKFKEEMTNE
FEMTDIGLMSYYLVGVISRFMEKPTTTHLKAAKRILRHIKDNKSAIALAKNPVFHNRSKHINTRYHYIRECIKRKNVQLKYNDVNEYEKKAPHAKMAHPSPDHFYPLHVA
IGAAGGDPKAKLIHHSWGQGTMSYASYQFTASSPMNESNLWMCYQSETQSIGKSI