; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0011251 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0011251
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionReverse transcriptase
Genome locationchr11:11754929..11763406
RNA-Seq ExpressionIVF0011251
SyntenyIVF0011251
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0016020 - membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0004518 - nuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR001969 - Aspartic peptidase, active site
IPR005162 - Retrotransposon gag domain
IPR019380 - Casein kinase substrate, phosphoprotein PP28
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031931.1 pol protein [Cucumis melo var. makuwa]6.81e-24973.58Show/hide
Query:  VRVRRGTDWQGATRMREGHMDASGFLYAFADESLVVVREMPPRRGVRRGG---RGRGAELVQPEVQPVAQATDLAAPVTHANLAAMEQRFRDLIMQIRKK
        VR +RG D + A R REGHMDASGFL A AD SLVVVREM PRRG RRGG   RGRGA  VQPEVQPVAQATD AAPVTHA+LAAMEQRFRDLIMQ+R++
Subjt:  VRVRRGTDWQGATRMREGHMDASGFLYAFADESLVVVREMPPRRGVRRGG---RGRGAELVQPEVQPVAQATDLAAPVTHANLAAMEQRFRDLIMQIRKK

Query:  QQPVPPAPTLA----------------LVVPDQLSAEAKHLRDFRKYNPTTFDGSLEDPTNAQMWLSSLETIFRYMKCLEDQKVQ--------QMSAWWE
        QQP PPAP  A                 VVPDQLSAEAKHLRDFRKYNPTTFDGSLEDPT AQ+WLSSLETIFRYMKC EDQKVQ        + +AWWE
Subjt:  QQPVPPAPTLA----------------LVVPDQLSAEAKHLRDFRKYNPTTFDGSLEDPTNAQMWLSSLETIFRYMKCLEDQKVQ--------QMSAWWE

Query:  MAERMLGGDVGQITWEQFKESFYEKFFSASLRDAKRQKFLNLEQDDRTVEQYDAEFDMLSCFAPEMITTEAARADKFVKGLRLDIQGLVWAFRPATHADA
          ERMLGGDV QITW+QFKESFY KFFSASLRDAKRQ+FLNLEQ D TVEQYDAEFDMLS FAPEMI TEAARADKFV+GLRLDIQGLV AFRPATHADA
Subjt:  MAERMLGGDVGQITWEQFKESFYEKFFSASLRDAKRQKFLNLEQDDRTVEQYDAEFDMLSCFAPEMITTEAARADKFVKGLRLDIQGLVWAFRPATHADA

Query:  LRLAMDLSLQERANSSKVAGRGSTLGQKRKAEQQPILVPQRNFKSGEE-------------AVRGKPLCTTCGKQHLGRCLFGTMTCFKCRQEGHTTDRC
        LRLA+DLSLQERANSSK AGRGST GQKRKAEQQP+ VPQRNF+SG E             A RGKPLCTTCGK HLGRCLFGT TCFKCRQEGHT DRC
Subjt:  LRLAMDLSLQERANSSKVAGRGSTLGQKRKAEQQPILVPQRNFKSGEE-------------AVRGKPLCTTCGKQHLGRCLFGTMTCFKCRQEGHTTDRC

Query:  PMRLTGGTQNQGAGAPHQ------------------------------VLFDSGSSYSFISSAFVLHARLE---------------ENMLSKEKVKACQI
        P+RLTG  QNQGAGAPHQ                              VLFDSGSS+SFISSAFVLHARLE               E MLSKEKVKACQI
Subjt:  PMRLTGGTQNQGAGAPHQ------------------------------VLFDSGSSYSFISSAFVLHARLE---------------ENMLSKEKVKACQI

Query:  EIAGHVIEVTLLVLDMLDFDVILGMDWLATNYASIDCSRKEVAFNSPSMASFKFKGEGSRSLPK
        EIAGHVIEVTLLVLDMLDFDVILGMDWLA N+ASIDCSRKEV FN PSMASFKFKG GSRSLP+
Subjt:  EIAGHVIEVTLLVLDMLDFDVILGMDWLATNYASIDCSRKEVAFNSPSMASFKFKGEGSRSLPK

KAA0032617.1 gag protease polyprotein [Cucumis melo var. makuwa]1.83e-25067.56Show/hide
Query:  RSSYVRVRRGTDWQGATRMREGHMDASGFLYAFA--------------------------DESLVVVREMPPRRGVRRGGRG---RGAELVQPEVQPVAQ
        R   V  +RG D + A RMR+GHMDASGFL  FA                            SL +VREMPPRRG RRGGRG   RGA  VQPEVQPVAQ
Subjt:  RSSYVRVRRGTDWQGATRMREGHMDASGFLYAFA--------------------------DESLVVVREMPPRRGVRRGGRG---RGAELVQPEVQPVAQ

Query:  ATDLAAPVTHANLAAMEQRFRDLIMQIRKKQQPVPPAPTLALV--------------------VPDQLSAEAKHLRDFRKYNPTTFDGSLEDPTNAQMWL
        A D  APVTHA+LAAMEQRFRDLIMQ+R++Q+P  P P LA                      VPDQLSAEAKHLRDFRKYNPTTFDGSLEDPT AQMWL
Subjt:  ATDLAAPVTHANLAAMEQRFRDLIMQIRKKQQPVPPAPTLALV--------------------VPDQLSAEAKHLRDFRKYNPTTFDGSLEDPTNAQMWL

Query:  SSLETIFRYMKCLEDQKVQ--------QMSAWWEMAERMLGGDVGQITWEQFKESFYEKFFSASLRDAKRQKFLNLEQDDRTVEQYDAEFDMLSCFAPEM
        SSLETIFRYMKC EDQKVQ        + +AWWE  ERMLGGDV QITW+QFKESFY KFFSASLRDAKRQ+FLNLEQ D TVE YDAEFDMLS FAPEM
Subjt:  SSLETIFRYMKCLEDQKVQ--------QMSAWWEMAERMLGGDVGQITWEQFKESFYEKFFSASLRDAKRQKFLNLEQDDRTVEQYDAEFDMLSCFAPEM

Query:  ITTEAARADKFVKGLRLDIQGLVWAFRPATHADALRLAMDLSLQERANSSKVAGRGSTLGQKRKAEQQPILVPQRNFKSGEE-------------AVRGK
        I TEAARADKFV+GLRLDIQGLV AFRPATHADALRLA+DLSLQERANSSK AGRGST GQKRKAEQQP+ VPQRNF+SG E             A RGK
Subjt:  ITTEAARADKFVKGLRLDIQGLVWAFRPATHADALRLAMDLSLQERANSSKVAGRGSTLGQKRKAEQQPILVPQRNFKSGEE-------------AVRGK

Query:  PLCTTCGKQHLGRCLFGTMTCFKCRQEGHTTDRCPMRLTGGTQNQGAGAPHQ------------------------------VLFDSGSSYSFISSAFVL
        PLCTTCGK HLGRCL GT TCFKCRQEGHT DRCP+RLTG  QNQGAGAPHQ                              VLFDSGSS+SFISSAFVL
Subjt:  PLCTTCGKQHLGRCLFGTMTCFKCRQEGHTTDRCPMRLTGGTQNQGAGAPHQ------------------------------VLFDSGSSYSFISSAFVL

Query:  HARLE---------------ENMLSKEKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLATNYASIDCSRKEVAFNSPSMASFKFKGEGSRSLPK
        HARLE               E MLSKEKVKACQI IAGHVIEVTL+VLDMLDFDVILGMDWLA N+ASIDCSRKEV FN PSMASFKFKG GS+SLP+
Subjt:  HARLE---------------ENMLSKEKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLATNYASIDCSRKEVAFNSPSMASFKFKGEGSRSLPK

KAA0047200.1 gag protease polyprotein [Cucumis melo var. makuwa]8.08e-24971.2Show/hide
Query:  VRVRRGTDWQGATRMREGHMDASGFLYAFA--DESLVVVREMPPRRGVRRGGRG---RGAELVQPEVQPVAQATDLAAPVTHANLAAMEQRFRDLIMQIR
        VR +RG D + A R REGHMDAS FL A A    SL +VREMPPRRG RRGGRG   RGA  VQPEVQPVAQA D AAPVTHA+LAAMEQRFRDLIMQ+R
Subjt:  VRVRRGTDWQGATRMREGHMDASGFLYAFA--DESLVVVREMPPRRGVRRGGRG---RGAELVQPEVQPVAQATDLAAPVTHANLAAMEQRFRDLIMQIR

Query:  KKQQ---------PVP-PAPTLALV------VPDQLSAEAKHLRDFRKYNPTTFDGSLEDPTNAQMWLSSLETIFRYMKCLEDQKVQ--------QMSAW
        ++Q+         P P PAP  A V      VPDQLSAEAKHLRDFRKYNPTTFDGSLEDPT AQ+WLSSLETIF YMKC EDQKVQ        + +AW
Subjt:  KKQQ---------PVP-PAPTLALV------VPDQLSAEAKHLRDFRKYNPTTFDGSLEDPTNAQMWLSSLETIFRYMKCLEDQKVQ--------QMSAW

Query:  WEMAERMLGGDVGQITWEQFKESFYEKFFSASLRDAKRQKFLNLEQDDRTVEQYDAEFDMLSCFAPEMITTEAARADKFVKGLRLDIQGLVWAFRPATHA
        WE  ERMLGGDV QITW+QFKESFY KFFSASLRDAKRQ+FLN EQ D TVEQYDAEFDMLS FAPEMI TEAA+ADKFV+GLRLDIQGLV AFRPATHA
Subjt:  WEMAERMLGGDVGQITWEQFKESFYEKFFSASLRDAKRQKFLNLEQDDRTVEQYDAEFDMLSCFAPEMITTEAARADKFVKGLRLDIQGLVWAFRPATHA

Query:  DALRLAMDLSLQERANSSKVAGRGSTLGQKRKAEQQPILVPQRNFKSGEE-------------AVRGKPLCTTCGKQHLGRCLFGTMTCFKCRQEGHTTD
        DALRLA+DLSLQERANSSK AGRGST GQKRKAEQQP+ VPQRNF+SG E             A RGKPLCTTCGK HLGRCLFGT  CFKCRQEGHT D
Subjt:  DALRLAMDLSLQERANSSKVAGRGSTLGQKRKAEQQPILVPQRNFKSGEE-------------AVRGKPLCTTCGKQHLGRCLFGTMTCFKCRQEGHTTD

Query:  RCPMRLTGGTQNQGAGAPHQ------------------------------VLFDSGSSYSFISSAFVLHARLE---------------ENMLSKEKVKAC
        RC +RLTG  QNQGAGAPHQ                              VLFDSGSS+SFISSAFVLHARLE               E MLSKE+VKAC
Subjt:  RCPMRLTGGTQNQGAGAPHQ------------------------------VLFDSGSSYSFISSAFVLHARLE---------------ENMLSKEKVKAC

Query:  QIEIAGHVIEVTLLVLDMLDFDVILGMDWLATNYASIDCSRKEVAFNSPSMASFKFKGEGSRSLPK
        QIEIAGHVIEVTLLVLDMLDFDVILGMDWLA N+ASIDCSRKEV FN PSMASFKFKG GS+SLP+
Subjt:  QIEIAGHVIEVTLLVLDMLDFDVILGMDWLATNYASIDCSRKEVAFNSPSMASFKFKGEGSRSLPK

KAA0065602.1 pol protein [Cucumis melo var. makuwa]2.22e-25174.59Show/hide
Query:  DWQGATRMREGHMDASGFLYAFADESLVVVREMPPRRGVRRGG---RGRGAELVQPEVQPVAQATDLAAPVTHANLAAMEQRFRDLIMQIRKKQQPVPPA
        D + A RMREGHMDAS FL A AD SLVVVREMPPRRG RRGG   RGRGA  VQPEVQPVAQATD  APVTHA+LAAMEQRFRDLIMQ+R++QQP PPA
Subjt:  DWQGATRMREGHMDASGFLYAFADESLVVVREMPPRRGVRRGG---RGRGAELVQPEVQPVAQATDLAAPVTHANLAAMEQRFRDLIMQIRKKQQPVPPA

Query:  PTLA----------LVVPDQLSAEAKHLRDFRKYNPTTFDGSLEDPTNAQMWLSSLETIFRYMKCLEDQKVQ--------QMSAWWEMAERMLGGDVGQI
        P  A           VVPDQLSAEAKHLRDFRKYNPTTFDGSLEDPT AQ+WLSSLETIFRYMKC EDQKVQ        + +AWWE  ERMLGGDV QI
Subjt:  PTLA----------LVVPDQLSAEAKHLRDFRKYNPTTFDGSLEDPTNAQMWLSSLETIFRYMKCLEDQKVQ--------QMSAWWEMAERMLGGDVGQI

Query:  TWEQFKESFYEKFFSASLRDAKRQKFLNLEQDDRTVEQYDAEFDMLSCFAPEMITTEAARADKFVKGLRLDIQGLVWAFRPATHADALRLAMDLSLQERA
        TW+QFKESFY KFFSASLRDAKRQKFLNLEQ D TVEQYDAEFDMLS FAPEMI TE ARADKFV+GLRLDIQGLV AFRPATHADALRLA+DLSLQERA
Subjt:  TWEQFKESFYEKFFSASLRDAKRQKFLNLEQDDRTVEQYDAEFDMLSCFAPEMITTEAARADKFVKGLRLDIQGLVWAFRPATHADALRLAMDLSLQERA

Query:  NSSKVAGRGSTLGQKRKAEQQPILVPQRNFKSGEE-------------AVRGKPLCTTCGKQHLGRCLFGTMTCFKCRQEGHTTDRCPMRLTGGTQNQGA
        NSSK AGRGST GQKRKAEQQP+ VPQRNF+SG E             A RGKPLCT CGK HLGRCLFGT TCFKCRQEGHT DRCP RLTG  QNQGA
Subjt:  NSSKVAGRGSTLGQKRKAEQQPILVPQRNFKSGEE-------------AVRGKPLCTTCGKQHLGRCLFGTMTCFKCRQEGHTTDRCPMRLTGGTQNQGA

Query:  GAPHQ------------------------------VLFDSGSSYSFISSAFVLHARLE---------------ENMLSKEKVKACQIEIAGHVIEVTLLV
        GAPHQ                              VLFDSGSS+SFISSAFVLHARLE               E MLSKEKVKACQIEI GHVIEVTLLV
Subjt:  GAPHQ------------------------------VLFDSGSSYSFISSAFVLHARLE---------------ENMLSKEKVKACQIEIAGHVIEVTLLV

Query:  LDMLDFDVILGMDWLATNYASIDCSRKEVAFNSPSMASFKFKGEGSRSLPK
        LDMLDFDVILGMDWLA N+ASIDCSRKEVAFN PSMASFKFKGEGSRSLP+
Subjt:  LDMLDFDVILGMDWLATNYASIDCSRKEVAFNSPSMASFKFKGEGSRSLPK

TYK02909.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]2.42e-26762.79Show/hide
Query:  MKPQPCAPLFPPHLLLILLPSSPDRRPDTAPPSSPSAPASHLHLGRPCLSRTSSAMQRQR--------TSASFSPA---------------PFSTS----
        MKPQPCAPLFPPHLLLILLPSSPDRRPDTAPPSSPSAPASHLHLGRPCLSRTSSAMQRQ           ASF                   F  +    
Subjt:  MKPQPCAPLFPPHLLLILLPSSPDRRPDTAPPSSPSAPASHLHLGRPCLSRTSSAMQRQR--------TSASFSPA---------------PFSTS----

Query:  AGLQICQASTATETRARSSYVRVRRGTDWQGATRMREGHMDASGFLYAFAD--------------------------------------------ESLVV
        A   I +   A +   R   VRVRRGTDWQGATRMREGHMDASGFLYAFAD                                            ESLVV
Subjt:  AGLQICQASTATETRARSSYVRVRRGTDWQGATRMREGHMDASGFLYAFAD--------------------------------------------ESLVV

Query:  VREMPPRRGVRRGGRGRGAELVQPEVQPVAQATDLAAPVTHANLAAMEQRFRDLIMQIRKKQQPVPPAPTLALVVPDQLSAEAKHLRDFRKYNPTTFDGS
        VREMPPRRGVRRGGRGRGAELVQPEVQPVAQATDLAAPVTHANLAAMEQRFRDLIMQIRKKQQPVPPAPTLALV                          
Subjt:  VREMPPRRGVRRGGRGRGAELVQPEVQPVAQATDLAAPVTHANLAAMEQRFRDLIMQIRKKQQPVPPAPTLALVVPDQLSAEAKHLRDFRKYNPTTFDGS

Query:  LEDPTNAQMWLSSLETIFRYMKCLEDQKVQQMSAWWEMAERMLGGDVGQITWEQFKESFYEKFFSASLRDAKRQKFLNLEQDDRTVEQYDAEFDMLSCFA
                                       MSAWWEMAERMLGGDVGQITWEQFKESFYEKFFSASLRDAKRQKFLNLEQDDRTVEQYDAEFDMLSCFA
Subjt:  LEDPTNAQMWLSSLETIFRYMKCLEDQKVQQMSAWWEMAERMLGGDVGQITWEQFKESFYEKFFSASLRDAKRQKFLNLEQDDRTVEQYDAEFDMLSCFA

Query:  PEMITTEAARADKFVKGLRLDIQGLVWAFRPATHADALRLAMDLSLQERANSSKVAGRGSTLGQKRKAEQQPILVPQRNFKSGEEAVRGKPLCTTCGKQH
        PEMITTEAARADKFVKGLRLDIQGLVWAFRPATHADALRLAMDLSLQERANSSKVAGRGSTLGQKRKAEQQPILVPQRNFKSG                 
Subjt:  PEMITTEAARADKFVKGLRLDIQGLVWAFRPATHADALRLAMDLSLQERANSSKVAGRGSTLGQKRKAEQQPILVPQRNFKSGEEAVRGKPLCTTCGKQH

Query:  LGRCLFGTMTCFKCRQEGHTTDRCPMRLTGGTQNQGAGAPHQVLFDSGSSYSFISSAFVLHARLEENMLSKEKVKACQIEIAGHVIEVTLLVLDMLDFDV
         GRCLFGTMTCFKCRQEGHTTDRCPMRLTGGTQNQGAGAPHQ                             EKVKACQIEIAGHVIEVTLLVLDMLDFDV
Subjt:  LGRCLFGTMTCFKCRQEGHTTDRCPMRLTGGTQNQGAGAPHQVLFDSGSSYSFISSAFVLHARLEENMLSKEKVKACQIEIAGHVIEVTLLVLDMLDFDV

Query:  ILGMDWLATNYASIDCSRKEVAFNSPSMASFKFKGEGSRS------------------------LP----------------------------------
        ILGMDWLATNYASIDCSRKEVAFNSPSMASFKFKGEGSR                         LP                                  
Subjt:  ILGMDWLATNYASIDCSRKEVAFNSPSMASFKFKGEGSRS------------------------LP----------------------------------

Query:  ---------------KGATMFSEIDLRSGYHQLRIKDSH
                       KGATMFSEIDLRSGYHQLRIKDSH
Subjt:  ---------------KGATMFSEIDLRSGYHQLRIKDSH

TrEMBL top hitse value%identityAlignment
A0A5A7T3J6 Reverse transcriptase1.7e-21173.13Show/hide
Query:  DWQGATRMREGHMDASGFLYAFAD--ESLVVVR-------EMPPRRGVR---RGGRGRGAELVQPEVQPVAQATDLAAPVTHANLAAMEQRFRDLIMQIR
        D + A RMREGHMDASGFLYA AD   SL +VR       EMPPRRG R   RGGRGRGA  VQPEVQPVAQATD AA VTHA+LAAMEQRFRDLIMQ+R
Subjt:  DWQGATRMREGHMDASGFLYAFAD--ESLVVVR-------EMPPRRGVR---RGGRGRGAELVQPEVQPVAQATDLAAPVTHANLAAMEQRFRDLIMQIR

Query:  KKQQPVPPAPTLA------------LVVPDQLSAEAKHLRDFRKYNPTTFDGSLEDPTNAQMWLSSLETIFRYMKCLEDQKVQ--------QMSAWWEMA
        ++QQP PPAP  A             VVPDQLS EAKHLRDFRKYNPTTFDGSLEDPT AQ+WLSSLETIFRYMKC EDQKVQ        + +AWWE  
Subjt:  KKQQPVPPAPTLA------------LVVPDQLSAEAKHLRDFRKYNPTTFDGSLEDPTNAQMWLSSLETIFRYMKCLEDQKVQ--------QMSAWWEMA

Query:  ERMLGGDVGQITWEQFKESFYEKFFSASLRDAKRQKFLNLEQDDRTVEQYDAEFDMLSCFAPEMITTEAARADKFVKGLRLDIQGLVWAFRPATHADALR
        ERMLGGDV QITW+QFKESFY KFFS SLRDAKRQ+FLNLEQ D TVEQYDAEFDMLSCFAPEMI TEAARADKFV+GLRLDIQGLV AFRPATHADALR
Subjt:  ERMLGGDVGQITWEQFKESFYEKFFSASLRDAKRQKFLNLEQDDRTVEQYDAEFDMLSCFAPEMITTEAARADKFVKGLRLDIQGLVWAFRPATHADALR

Query:  LAMDLSLQERANSSKVAGRGSTLGQKRKAEQQPILVPQRNFKSG-------------EEAVRGKPLCTTCGKQHLGRCLFGTMTCFKCRQEGHTTDRCPM
        LA+DLSLQERANSSKVAGRGST GQKRKAEQQPI VPQRNF+ G              EA RGKPLC TCGK HLGRCLFGT TCFKCRQEGHT DRCPM
Subjt:  LAMDLSLQERANSSKVAGRGSTLGQKRKAEQQPILVPQRNFKSG-------------EEAVRGKPLCTTCGKQHLGRCLFGTMTCFKCRQEGHTTDRCPM

Query:  RLTGGTQNQGAGAPHQ------------------------------VLFDSGSSYSFISSAFVLHARLE---------------ENMLSKEKVKACQIEI
        RLTG  QNQGAGAPHQ                              VLFDSGSS+SFISSAFVLHARLE               E MLSKEKVKACQIEI
Subjt:  RLTGGTQNQGAGAPHQ------------------------------VLFDSGSSYSFISSAFVLHARLE---------------ENMLSKEKVKACQIEI

Query:  AGHVIEVTLLVLDMLDFDVILGMDWLATNYASIDCSRKEVAFNSPSMASFKFKGEGSRSLPK
        AGHVIEVTLLVLDMLDFDV+LGMDWLA N+ASIDCS KEVAFN PSMASFKFKGEGSRSLP+
Subjt:  AGHVIEVTLLVLDMLDFDVILGMDWLATNYASIDCSRKEVAFNSPSMASFKFKGEGSRSLPK

A0A5A7TXM6 Reverse transcriptase6.3e-21171.75Show/hide
Query:  SYVRVRRGTDWQGATRMREGHMDASGFLYAFADESLVVVREMPPRRGVR---RGGRGRGAELVQPEVQPVAQATDLAAPVTHANLAAMEQRFRDLIMQIR
        S VR +RG D + A R REGHMDASGFL A A        EMPPRRG R   RGGRGRGA  VQPEVQPVA+ATD AAPVTHA+LAAMEQRFRDLIMQ+R
Subjt:  SYVRVRRGTDWQGATRMREGHMDASGFLYAFADESLVVVREMPPRRGVR---RGGRGRGAELVQPEVQPVAQATDLAAPVTHANLAAMEQRFRDLIMQIR

Query:  KKQQPVPPAPTLA--------------------LVVPDQLSAEAKHLRDFRKYNPTTFDGSLEDPTNAQMWLSSLETIFRYMKCLEDQKVQ--------Q
        ++QQP PPAP LA                     VVPDQLSAE+KHLRDFRKYNPTTFDGSLEDPT AQ+WLSSLETIFRYMKC EDQKVQ        +
Subjt:  KKQQPVPPAPTLA--------------------LVVPDQLSAEAKHLRDFRKYNPTTFDGSLEDPTNAQMWLSSLETIFRYMKCLEDQKVQ--------Q

Query:  MSAWWEMAERMLGGDVGQITWEQFKESFYEKFFSASLRDAKRQKFLNLEQDDRTVEQYDAEFDMLSCFAPEMITTEAARADKFVKGLRLDIQGLVWAFRP
         +AWWE  ERMLGGDV QITW+QFKESFY KFFSASLRDAKRQ+FLNLEQ D TVEQYDAEFDMLS FAPEMI TEAARADKFV+GLRLDIQGLV AFRP
Subjt:  MSAWWEMAERMLGGDVGQITWEQFKESFYEKFFSASLRDAKRQKFLNLEQDDRTVEQYDAEFDMLSCFAPEMITTEAARADKFVKGLRLDIQGLVWAFRP

Query:  ATHADALRLAMDLSLQERANSSKVAGRGSTLGQKRKAEQQPILVPQRNFKSG-------------EEAVRGKPLCTTCGKQHLGRCLFGTMTCFKCRQEG
        ATHADALRLA+DLSLQERANSSK AGRGST GQKRKAEQQP+ VPQRNF+SG             EEA RGKPLCTTCGK HLGRCLFGT TCFKCRQEG
Subjt:  ATHADALRLAMDLSLQERANSSKVAGRGSTLGQKRKAEQQPILVPQRNFKSG-------------EEAVRGKPLCTTCGKQHLGRCLFGTMTCFKCRQEG

Query:  HTTDRCPMRLTGGTQNQGAGAPHQ------------------------------VLFDSGSSYSFISSAFVLHARLE---------------ENMLSKEK
        HT DRCP+RLTG TQNQGAGAPHQ                              VLFDSGSS+SFISSAFVLHARLE               E MLSKEK
Subjt:  HTTDRCPMRLTGGTQNQGAGAPHQ------------------------------VLFDSGSSYSFISSAFVLHARLE---------------ENMLSKEK

Query:  VKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLATNYASIDCSRKEVAFNSPSMASFKFKGEGSRSLPK
        VKACQIEIAGHVIEVTLLVLDMLDFDVILGM+WLA N+ASIDCSRKEV FN PSMASFKFKG GS+SLP+
Subjt:  VKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLATNYASIDCSRKEVAFNSPSMASFKFKGEGSRSLPK

A0A5A7VBD0 Reverse transcriptase2.1e-21474.59Show/hide
Query:  DWQGATRMREGHMDASGFLYAFADESLVVVREMPPRRGVRR---GGRGRGAELVQPEVQPVAQATDLAAPVTHANLAAMEQRFRDLIMQIRKKQQPVPPA
        D + A RMREGHMDAS FL A AD SLVVVREMPPRRG RR   GGRGRGA  VQPEVQPVAQATD  APVTHA+LAAMEQRFRDLIMQ+R++QQP PPA
Subjt:  DWQGATRMREGHMDASGFLYAFADESLVVVREMPPRRGVRR---GGRGRGAELVQPEVQPVAQATDLAAPVTHANLAAMEQRFRDLIMQIRKKQQPVPPA

Query:  PTLA----------LVVPDQLSAEAKHLRDFRKYNPTTFDGSLEDPTNAQMWLSSLETIFRYMKCLEDQKVQ--------QMSAWWEMAERMLGGDVGQI
        P  A           VVPDQLSAEAKHLRDFRKYNPTTFDGSLEDPT AQ+WLSSLETIFRYMKC EDQKVQ        + +AWWE  ERMLGGDV QI
Subjt:  PTLA----------LVVPDQLSAEAKHLRDFRKYNPTTFDGSLEDPTNAQMWLSSLETIFRYMKCLEDQKVQ--------QMSAWWEMAERMLGGDVGQI

Query:  TWEQFKESFYEKFFSASLRDAKRQKFLNLEQDDRTVEQYDAEFDMLSCFAPEMITTEAARADKFVKGLRLDIQGLVWAFRPATHADALRLAMDLSLQERA
        TW+QFKESFY KFFSASLRDAKRQKFLNLEQ D TVEQYDAEFDMLS FAPEMI TE ARADKFV+GLRLDIQGLV AFRPATHADALRLA+DLSLQERA
Subjt:  TWEQFKESFYEKFFSASLRDAKRQKFLNLEQDDRTVEQYDAEFDMLSCFAPEMITTEAARADKFVKGLRLDIQGLVWAFRPATHADALRLAMDLSLQERA

Query:  NSSKVAGRGSTLGQKRKAEQQPILVPQRNFKSG-------------EEAVRGKPLCTTCGKQHLGRCLFGTMTCFKCRQEGHTTDRCPMRLTGGTQNQGA
        NSSK AGRGST GQKRKAEQQP+ VPQRNF+SG              EA RGKPLCT CGK HLGRCLFGT TCFKCRQEGHT DRCP RLTG  QNQGA
Subjt:  NSSKVAGRGSTLGQKRKAEQQPILVPQRNFKSG-------------EEAVRGKPLCTTCGKQHLGRCLFGTMTCFKCRQEGHTTDRCPMRLTGGTQNQGA

Query:  GAPHQ------------------------------VLFDSGSSYSFISSAFVLHARL---------------EENMLSKEKVKACQIEIAGHVIEVTLLV
        GAPHQ                              VLFDSGSS+SFISSAFVLHARL               EE MLSKEKVKACQIEI GHVIEVTLLV
Subjt:  GAPHQ------------------------------VLFDSGSSYSFISSAFVLHARL---------------EENMLSKEKVKACQIEIAGHVIEVTLLV

Query:  LDMLDFDVILGMDWLATNYASIDCSRKEVAFNSPSMASFKFKGEGSRSLPK
        LDMLDFDVILGMDWLA N+ASIDCSRKEVAFN PSMASFKFKGEGSRSLP+
Subjt:  LDMLDFDVILGMDWLATNYASIDCSRKEVAFNSPSMASFKFKGEGSRSLPK

A0A5D3BXT1 Ty3-gypsy retrotransposon protein5.7e-22062.65Show/hide
Query:  MKPQPCAPLFPPHLLLILLPSSPDRRPDTAPPSSPSAPASHLHLGRPCLSRTSSAMQRQR--------TSASFSPAPF-------------------STS
        MKPQPCAPLFPPHLLLILLPSSPDRRPDTAPPSSPSAPASHLHLGRPCLSRTSSAMQRQ           ASF                           
Subjt:  MKPQPCAPLFPPHLLLILLPSSPDRRPDTAPPSSPSAPASHLHLGRPCLSRTSSAMQRQR--------TSASFSPAPF-------------------STS

Query:  AGLQICQASTATETRARSSYVRVRRGTDWQGATRMREGHMDASGFLYAFAD--------------------------------------------ESLVV
        A   I +   A +   R   VRVRRGTDWQGATRMREGHMDASGFLYAFAD                                            ESLVV
Subjt:  AGLQICQASTATETRARSSYVRVRRGTDWQGATRMREGHMDASGFLYAFAD--------------------------------------------ESLVV

Query:  VREMPPRRGVRRGGRGRGAELVQPEVQPVAQATDLAAPVTHANLAAMEQRFRDLIMQIRKKQQPVPPAPTLALVVPDQLSAEAKHLRDFRKYNPTTFDGS
        VREMPPRRGVRRGGRGRGAELVQPEVQPVAQATDLAAPVTHANLAAMEQRFRDLIMQIRKKQQPVPPAPTLALV                          
Subjt:  VREMPPRRGVRRGGRGRGAELVQPEVQPVAQATDLAAPVTHANLAAMEQRFRDLIMQIRKKQQPVPPAPTLALVVPDQLSAEAKHLRDFRKYNPTTFDGS

Query:  LEDPTNAQMWLSSLETIFRYMKCLEDQKVQQMSAWWEMAERMLGGDVGQITWEQFKESFYEKFFSASLRDAKRQKFLNLEQDDRTVEQYDAEFDMLSCFA
                                       MSAWWEMAERMLGGDVGQITWEQFKESFYEKFFSASLRDAKRQKFLNLEQDDRTVEQYDAEFDMLSCFA
Subjt:  LEDPTNAQMWLSSLETIFRYMKCLEDQKVQQMSAWWEMAERMLGGDVGQITWEQFKESFYEKFFSASLRDAKRQKFLNLEQDDRTVEQYDAEFDMLSCFA

Query:  PEMITTEAARADKFVKGLRLDIQGLVWAFRPATHADALRLAMDLSLQERANSSKVAGRGSTLGQKRKAEQQPILVPQRNFKSGEEAVRGKPLCTTCGKQH
        PEMITTEAARADKFVKGLRLDIQGLVWAFRPATHADALRLAMDLSLQERANSSKVAGRGSTLGQKRKAEQQPILVPQRNFKSG                 
Subjt:  PEMITTEAARADKFVKGLRLDIQGLVWAFRPATHADALRLAMDLSLQERANSSKVAGRGSTLGQKRKAEQQPILVPQRNFKSGEEAVRGKPLCTTCGKQH

Query:  LGRCLFGTMTCFKCRQEGHTTDRCPMRLTGGTQNQGAGAPHQVLFDSGSSYSFISSAFVLHARLEENMLSKEKVKACQIEIAGHVIEVTLLVLDMLDFDV
         GRCLFGTMTCFKCRQEGHTTDRCPMRLTGGTQNQGAGAPHQ                             EKVKACQIEIAGHVIEVTLLVLDMLDFDV
Subjt:  LGRCLFGTMTCFKCRQEGHTTDRCPMRLTGGTQNQGAGAPHQVLFDSGSSYSFISSAFVLHARLEENMLSKEKVKACQIEIAGHVIEVTLLVLDMLDFDV

Query:  ILGMDWLATNYASIDCSRKEVAFNSPSMASFKFKGEGSR------------------------SLP----------------------------------
        ILGMDWLATNYASIDCSRKEVAFNSPSMASFKFKGEGSR                         LP                                  
Subjt:  ILGMDWLATNYASIDCSRKEVAFNSPSMASFKFKGEGSR------------------------SLP----------------------------------

Query:  ---------------KGATMFSEIDLRSGYHQLRIKDSH
                       KGATMFSEIDLRSGYHQLRIKDSH
Subjt:  ---------------KGATMFSEIDLRSGYHQLRIKDSH

A0A5D3BXT1 Ty3-gypsy retrotransposon protein1.3e-0375.61Show/hide
Query:  ILKQKAHKQFMRLQEQGKTEQARKDLDTMLWFLRIRLVKTL
        ILKQKAHKQFMRLQEQGKTEQARKDLD+   FL I  ++ L
Subjt:  ILKQKAHKQFMRLQEQGKTEQARKDLDTMLWFLRIRLVKTL

A0A5D3BXT1 Ty3-gypsy retrotransposon protein1.9e-21573.58Show/hide
Query:  VRVRRGTDWQGATRMREGHMDASGFLYAFADESLVVVREMPPRRGVRR---GGRGRGAELVQPEVQPVAQATDLAAPVTHANLAAMEQRFRDLIMQIRKK
        VR +RG D + A R REGHMDASGFL A AD SLVVVREM PRRG RR   GGRGRGA  VQPEVQPVAQATD AAPVTHA+LAAMEQRFRDLIMQ+R++
Subjt:  VRVRRGTDWQGATRMREGHMDASGFLYAFADESLVVVREMPPRRGVRR---GGRGRGAELVQPEVQPVAQATDLAAPVTHANLAAMEQRFRDLIMQIRKK

Query:  QQPVPPAPTLA----------------LVVPDQLSAEAKHLRDFRKYNPTTFDGSLEDPTNAQMWLSSLETIFRYMKCLEDQKVQ--------QMSAWWE
        QQP PPAP  A                 VVPDQLSAEAKHLRDFRKYNPTTFDGSLEDPT AQ+WLSSLETIFRYMKC EDQKVQ        + +AWWE
Subjt:  QQPVPPAPTLA----------------LVVPDQLSAEAKHLRDFRKYNPTTFDGSLEDPTNAQMWLSSLETIFRYMKCLEDQKVQ--------QMSAWWE

Query:  MAERMLGGDVGQITWEQFKESFYEKFFSASLRDAKRQKFLNLEQDDRTVEQYDAEFDMLSCFAPEMITTEAARADKFVKGLRLDIQGLVWAFRPATHADA
          ERMLGGDV QITW+QFKESFY KFFSASLRDAKRQ+FLNLEQ D TVEQYDAEFDMLS FAPEMI TEAARADKFV+GLRLDIQGLV AFRPATHADA
Subjt:  MAERMLGGDVGQITWEQFKESFYEKFFSASLRDAKRQKFLNLEQDDRTVEQYDAEFDMLSCFAPEMITTEAARADKFVKGLRLDIQGLVWAFRPATHADA

Query:  LRLAMDLSLQERANSSKVAGRGSTLGQKRKAEQQPILVPQRNFKSG-------------EEAVRGKPLCTTCGKQHLGRCLFGTMTCFKCRQEGHTTDRC
        LRLA+DLSLQERANSSK AGRGST GQKRKAEQQP+ VPQRNF+SG              EA RGKPLCTTCGK HLGRCLFGT TCFKCRQEGHT DRC
Subjt:  LRLAMDLSLQERANSSKVAGRGSTLGQKRKAEQQPILVPQRNFKSG-------------EEAVRGKPLCTTCGKQHLGRCLFGTMTCFKCRQEGHTTDRC

Query:  PMRLTGGTQNQGAGAPHQ------------------------------VLFDSGSSYSFISSAFVLHARLE---------------ENMLSKEKVKACQI
        P+RLTG  QNQGAGAPHQ                              VLFDSGSS+SFISSAFVLHARLE               E MLSKEKVKACQI
Subjt:  PMRLTGGTQNQGAGAPHQ------------------------------VLFDSGSSYSFISSAFVLHARLE---------------ENMLSKEKVKACQI

Query:  EIAGHVIEVTLLVLDMLDFDVILGMDWLATNYASIDCSRKEVAFNSPSMASFKFKGEGSRSLPK
        EIAGHVIEVTLLVLDMLDFDVILGMDWLA N+ASIDCSRKEV FN PSMASFKFKG GSRSLP+
Subjt:  EIAGHVIEVTLLVLDMLDFDVILGMDWLATNYASIDCSRKEVAFNSPSMASFKFKGEGSRSLPK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G46020.1 CONTAINS InterPro DOMAIN/s: Casein kinase substrate, phosphoprotein PP28 (InterPro:IPR019380); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink).1.8e-0841.67Show/hide
Query:  KGATMFSEIDLRSGYHQ--LRIKDSHNVSRIATPLTQLTRKGAPFILKQKAHKQFMRLQEQGKTEQARKDLDTMLWFLRIRLVKTLRDEFYLKKSE
        KGA    E+D  +   Q  L+ KD       A+  T+L+R+    + KQ+AH+++MRLQEQGKTEQARKDLD      R+ L++  R+E   K+ E
Subjt:  KGATMFSEIDLRSGYHQ--LRIKDSHNVSRIATPLTQLTRKGAPFILKQKAHKQFMRLQEQGKTEQARKDLDTMLWFLRIRLVKTLRDEFYLKKSE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAACCTCAACCGTGCGCGCCGCTCTTCCCCCCTCATCTTCTTCTTATCCTTCTTCCTTCCTCGCCCGACCGTCGCCCAGACACAGCTCCTCCTTCTTCTCCGTCAGC
GCCAGCAAGTCATCTCCATCTCGGTCGTCCCTGCCTCAGCCGAACATCGTCAGCCATGCAACGCCAACGTACGTCAGCTTCCTTCTCCCCGGCCCCCTTTTCGACTTCGG
CCGGCCTCCAAATCTGCCAAGCCTCGACAGCCACGGAGACGCGGGCTAGATCTTCCTATGTAAGGGTACGGCGAGGGACAGACTGGCAAGGGGCAACAAGGATGCGTGAA
GGCCATATGGACGCGTCTGGTTTTCTTTATGCTTTCGCTGATGAATCATTGGTTGTTGTTAGGGAAATGCCGCCAAGGAGAGGTGTACGTAGGGGTGGCCGAGGAAGGGG
AGCAGAACTTGTCCAACCTGAAGTACAGCCTGTAGCTCAAGCCACCGACCTTGCTGCACCAGTTACTCACGCGAACCTAGCTGCTATGGAGCAGAGGTTCAGGGATTTGA
TTATGCAGATACGGAAGAAGCAGCAGCCTGTCCCGCCAGCTCCTACTCTGGCTCTAGTCGTGCCGGATCAGTTGTCGGCAGAGGCCAAGCACTTGAGGGATTTCAGGAAG
TATAACCCCACGACATTCGATGGGTCTTTGGAGGACCCCACCAACGCTCAGATGTGGTTATCTTCTTTGGAAACCATATTTCGGTATATGAAGTGCCTTGAGGATCAGAA
AGTTCAACAGATGTCTGCCTGGTGGGAGATGGCAGAGAGGATGCTAGGGGGTGACGTGGGTCAGATCACTTGGGAACAGTTCAAGGAGAGTTTCTATGAGAAATTCTTCT
CTGCCAGTTTGCGAGATGCCAAGCGGCAGAAGTTTCTGAACCTGGAGCAGGACGACAGGACAGTGGAGCAGTATGATGCGGAGTTTGACATGTTATCCTGCTTCGCTCCC
GAGATGATAACGACCGAGGCGGCCAGAGCTGATAAGTTTGTTAAAGGCCTCAGGCTAGACATCCAGGGTCTGGTTTGGGCCTTCCGACCCGCCACTCACGCTGATGCACT
GCGCCTGGCAATGGATCTCAGTTTACAGGAGAGGGCTAACTCGTCCAAGGTTGCAGGTAGAGGTTCGACCTTAGGACAGAAAAGGAAGGCTGAGCAGCAGCCTATTTTAG
TGCCACAGCGGAACTTCAAATCAGGTGAGGAAGCTGTCAGAGGGAAGCCTTTGTGTACCACTTGTGGGAAGCAACATCTAGGCCGTTGTTTATTTGGGACCATGACTTGC
TTCAAGTGTAGGCAAGAGGGGCATACAACTGATAGATGCCCGATGAGACTTACCGGGGGTACACAGAATCAGGGGGCAGGTGCTCCACATCAGGTTTTGTTTGATTCTGG
TTCGTCATATTCTTTTATCTCTTCTGCATTTGTGTTGCATGCTCGTTTAGAGGAGAATATGTTGTCGAAAGAAAAGGTGAAAGCATGCCAGATTGAGATAGCAGGTCATG
TGATTGAAGTAACATTGTTAGTCCTTGACATGCTCGACTTTGATGTAATTCTGGGTATGGATTGGCTAGCTACTAACTATGCTAGCATAGATTGTTCCCGTAAGGAGGTA
GCGTTTAACTCTCCCTCGATGGCCAGTTTTAAGTTTAAGGGAGAAGGATCAAGGTCGTTACCTAAGGGAGCTACAATGTTCTCTGAGATCGACCTTCGGTCGGGATATCA
TCAGCTGAGGATTAAGGATAGCCATAATGTTTCTCGTATAGCCACTCCTCTTACTCAGTTGACCAGGAAGGGAGCTCCTTTTATACTGAAACAAAAGGCTCACAAGCAAT
TCATGAGGCTTCAAGAACAAGGGAAAACTGAACAAGCTAGGAAAGATTTAGATACTATGCTTTGGTTCTTAAGAATTCGTTTAGTTAAGACTTTGAGGGATGAGTTTTAT
CTTAAGAAATCTGAGGTAATGAGTAAATTATCTGTAACGCCCCAAAAATTTAGATAA
mRNA sequenceShow/hide mRNA sequence
ATGAAACCTCAACCGTGCGCGCCGCTCTTCCCCCCTCATCTTCTTCTTATCCTTCTTCCTTCCTCGCCCGACCGTCGCCCAGACACAGCTCCTCCTTCTTCTCCGTCAGC
GCCAGCAAGTCATCTCCATCTCGGTCGTCCCTGCCTCAGCCGAACATCGTCAGCCATGCAACGCCAACGTACGTCAGCTTCCTTCTCCCCGGCCCCCTTTTCGACTTCGG
CCGGCCTCCAAATCTGCCAAGCCTCGACAGCCACGGAGACGCGGGCTAGATCTTCCTATGTAAGGGTACGGCGAGGGACAGACTGGCAAGGGGCAACAAGGATGCGTGAA
GGCCATATGGACGCGTCTGGTTTTCTTTATGCTTTCGCTGATGAATCATTGGTTGTTGTTAGGGAAATGCCGCCAAGGAGAGGTGTACGTAGGGGTGGCCGAGGAAGGGG
AGCAGAACTTGTCCAACCTGAAGTACAGCCTGTAGCTCAAGCCACCGACCTTGCTGCACCAGTTACTCACGCGAACCTAGCTGCTATGGAGCAGAGGTTCAGGGATTTGA
TTATGCAGATACGGAAGAAGCAGCAGCCTGTCCCGCCAGCTCCTACTCTGGCTCTAGTCGTGCCGGATCAGTTGTCGGCAGAGGCCAAGCACTTGAGGGATTTCAGGAAG
TATAACCCCACGACATTCGATGGGTCTTTGGAGGACCCCACCAACGCTCAGATGTGGTTATCTTCTTTGGAAACCATATTTCGGTATATGAAGTGCCTTGAGGATCAGAA
AGTTCAACAGATGTCTGCCTGGTGGGAGATGGCAGAGAGGATGCTAGGGGGTGACGTGGGTCAGATCACTTGGGAACAGTTCAAGGAGAGTTTCTATGAGAAATTCTTCT
CTGCCAGTTTGCGAGATGCCAAGCGGCAGAAGTTTCTGAACCTGGAGCAGGACGACAGGACAGTGGAGCAGTATGATGCGGAGTTTGACATGTTATCCTGCTTCGCTCCC
GAGATGATAACGACCGAGGCGGCCAGAGCTGATAAGTTTGTTAAAGGCCTCAGGCTAGACATCCAGGGTCTGGTTTGGGCCTTCCGACCCGCCACTCACGCTGATGCACT
GCGCCTGGCAATGGATCTCAGTTTACAGGAGAGGGCTAACTCGTCCAAGGTTGCAGGTAGAGGTTCGACCTTAGGACAGAAAAGGAAGGCTGAGCAGCAGCCTATTTTAG
TGCCACAGCGGAACTTCAAATCAGGTGAGGAAGCTGTCAGAGGGAAGCCTTTGTGTACCACTTGTGGGAAGCAACATCTAGGCCGTTGTTTATTTGGGACCATGACTTGC
TTCAAGTGTAGGCAAGAGGGGCATACAACTGATAGATGCCCGATGAGACTTACCGGGGGTACACAGAATCAGGGGGCAGGTGCTCCACATCAGGTTTTGTTTGATTCTGG
TTCGTCATATTCTTTTATCTCTTCTGCATTTGTGTTGCATGCTCGTTTAGAGGAGAATATGTTGTCGAAAGAAAAGGTGAAAGCATGCCAGATTGAGATAGCAGGTCATG
TGATTGAAGTAACATTGTTAGTCCTTGACATGCTCGACTTTGATGTAATTCTGGGTATGGATTGGCTAGCTACTAACTATGCTAGCATAGATTGTTCCCGTAAGGAGGTA
GCGTTTAACTCTCCCTCGATGGCCAGTTTTAAGTTTAAGGGAGAAGGATCAAGGTCGTTACCTAAGGGAGCTACAATGTTCTCTGAGATCGACCTTCGGTCGGGATATCA
TCAGCTGAGGATTAAGGATAGCCATAATGTTTCTCGTATAGCCACTCCTCTTACTCAGTTGACCAGGAAGGGAGCTCCTTTTATACTGAAACAAAAGGCTCACAAGCAAT
TCATGAGGCTTCAAGAACAAGGGAAAACTGAACAAGCTAGGAAAGATTTAGATACTATGCTTTGGTTCTTAAGAATTCGTTTAGTTAAGACTTTGAGGGATGAGTTTTAT
CTTAAGAAATCTGAGGTAATGAGTAAATTATCTGTAACGCCCCAAAAATTTAGATAA
Protein sequenceShow/hide protein sequence
MKPQPCAPLFPPHLLLILLPSSPDRRPDTAPPSSPSAPASHLHLGRPCLSRTSSAMQRQRTSASFSPAPFSTSAGLQICQASTATETRARSSYVRVRRGTDWQGATRMRE
GHMDASGFLYAFADESLVVVREMPPRRGVRRGGRGRGAELVQPEVQPVAQATDLAAPVTHANLAAMEQRFRDLIMQIRKKQQPVPPAPTLALVVPDQLSAEAKHLRDFRK
YNPTTFDGSLEDPTNAQMWLSSLETIFRYMKCLEDQKVQQMSAWWEMAERMLGGDVGQITWEQFKESFYEKFFSASLRDAKRQKFLNLEQDDRTVEQYDAEFDMLSCFAP
EMITTEAARADKFVKGLRLDIQGLVWAFRPATHADALRLAMDLSLQERANSSKVAGRGSTLGQKRKAEQQPILVPQRNFKSGEEAVRGKPLCTTCGKQHLGRCLFGTMTC
FKCRQEGHTTDRCPMRLTGGTQNQGAGAPHQVLFDSGSSYSFISSAFVLHARLEENMLSKEKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLATNYASIDCSRKEV
AFNSPSMASFKFKGEGSRSLPKGATMFSEIDLRSGYHQLRIKDSHNVSRIATPLTQLTRKGAPFILKQKAHKQFMRLQEQGKTEQARKDLDTMLWFLRIRLVKTLRDEFY
LKKSEVMSKLSVTPQKFR