; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0000664 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0000664
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionRetrotransposon protein
Genome locationchr04:25434012..25435241
RNA-Seq ExpressionPI0000664
SyntenyPI0000664
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsIPR024752 - Myb/SANT-like domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADN33754.1 retrotransposon protein [Cucumis melo subsp. melo]1.9e-8154.3Show/hide
Query:  MTNADVMEDYDEGDSTHA-TTAGDDIHYIETFNEWSHWRDDLAQEMAPKHIWTKEEEAYLVEALVELVSADGWKSNNGTFRPGYLAQLQRMMAEKMPGCN
        MT  + +ED DEGDST+A TTA +DI YIET NEWS WRDDLA  M     +   +   +     ELVS  GWKS+NGTFRPGYLAQL RMMAEK+ GC 
Subjt:  MTNADVMEDYDEGDSTHA-TTAGDDIHYIETFNEWSHWRDDLAQEMAPKHIWTKEEEAYLVEALVELVSADGWKSNNGTFRPGYLAQLQRMMAEKMPGCN

Query:  IQGLPTIDCRIKNMKKTFQTIAEMCGPSCSKFGWNDEVKCIIAERELFDSWVKSHSVAKGLLNKPFPHYDDLCYAFGKDRATGARAETFTNVGSNVP-GR
        ++    IDCRIK +K+TFQ IAEM GP+CS FGWNDE KCI+AE+ELFD+WV+S   AKGLLN PFP+YD+L Y FG+DRATG  AETF +VGSN P G 
Subjt:  IQGLPTIDCRIKNMKKTFQTIAEMCGPSCSKFGWNDEVKCIIAERELFDSWVKSHSVAKGLLNKPFPHYDDLCYAFGKDRATGARAETFTNVGSNVP-GR

Query:  FEGFFADYGNDTKILMIYGQGLDMSLDDMASAGLGRSMKGRTGSSESKRKRGDQATQTMEIIQNAMDFANDQLKAIAEWPNLQRQEEGVVRDEVVNQLRA
        ++ F    GN+     +Y +G+D+  DD+ ++   R+ +G+TGSS SKRKRG Q    +E I  A+D  N+QL+ IAEWP      +  VR E    LR 
Subjt:  FEGFFADYGNDTKILMIYGQGLDMSLDDMASAGLGRSMKGRTGSSESKRKRGDQATQTMEIIQNAMDFANDQLKAIAEWPNLQRQEEGVVRDEVVNQLRA

Query:  SP
         P
Subjt:  SP

ADN34114.1 retrotransposon protein [Cucumis melo subsp. melo]2.1e-9158.17Show/hide
Query:  MTNADVMEDYDEGDSTHATTAGDDIHYIETFNEWSHWRDDLAQEM------APKHIWTKEEEAYLVEALVELVSADGWKSNNGTFRPGYLAQLQRMMAEK
        MTN D+ ++ DE DSTHATTA DDIHYIET NEWS WRD+LA+E+       PKH WTKEEEA LVE LVELV+A GW+S+NGTFRPGYL QL RMMA K
Subjt:  MTNADVMEDYDEGDSTHATTAGDDIHYIETFNEWSHWRDDLAQEM------APKHIWTKEEEAYLVEALVELVSADGWKSNNGTFRPGYLAQLQRMMAEK

Query:  MPGCNIQGLPTIDCRIKNMKKTFQTIAEMCGPSCSKFGWNDEVKCIIAERELFDSWVKSHSVAKGLLNKPFPHYDDLCYAFGKDRATGARAETFTNVGSN
        +PG NI    TID RIK MK+ F  +AEM GP+CS FGWNDE KCI+AE+E+FD W  SH  AKGLLNK F HYD+L Y FGKDRATG RAE+F ++GSN
Subjt:  MPGCNIQGLPTIDCRIKNMKKTFQTIAEMCGPSCSKFGWNDEVKCIIAERELFDSWVKSHSVAKGLLNKPFPHYDDLCYAFGKDRATGARAETFTNVGSN

Query:  VPGRFEGFFADYGNDTKILMIYGQGLDMSLDDMASAGLGRSMKGRTGSSESKRKRGDQATQTMEIIQNAMDFANDQLKAIAEWPNLQRQEEGVVRDEVVN
         P  ++   AD   DT    +Y  GL+MS DD+      R  + R  SS SKRKR   AT + +I++ A+++ N+QL  IAEWP LQRQ+    R E+V 
Subjt:  VPGRFEGFFADYGNDTKILMIYGQGLDMSLDDMASAGLGRSMKGRTGSSESKRKRGDQATQTMEIIQNAMDFANDQLKAIAEWPNLQRQEEGVVRDEVVN

Query:  QLRASP
         L A P
Subjt:  QLRASP

KAA0035413.1 retrotransposon protein [Cucumis melo var. makuwa]6.6e-8252.27Show/hide
Query:  MTNADVMEDYDEGDSTHATTAGDDIHYIETFNEWSHWRDDLAQEMAP---------------KHIWTKEEEAYLVEALVELVSADGWKSNNGTFRPGYLA
        M   D+++D DE DSTHAT+A DDIHYIET NEW+ WRDDL +EM                 KH WTKEEEA LVE LVELV+A GW+S+NGTFRPG   
Subjt:  MTNADVMEDYDEGDSTHATTAGDDIHYIETFNEWSHWRDDLAQEMAP---------------KHIWTKEEEAYLVEALVELVSADGWKSNNGTFRPGYLA

Query:  QLQRMMAEKMPGCNIQGLPTIDCRIKNMKKTFQTIAEMCGPSCSKFGWNDEVKCIIAERELFDSWVKSHSVAKGLLNKPFPHYDDLCYAFGKDRATGARA
                               RIK +K+ F  IAEMCGP+CS+FGWNDE KCI+AE+E+FD+WVKSH  AKGLLNK FPHYD+L Y FGKDRATG RA
Subjt:  QLQRMMAEKMPGCNIQGLPTIDCRIKNMKKTFQTIAEMCGPSCSKFGWNDEVKCIIAERELFDSWVKSHSVAKGLLNKPFPHYDDLCYAFGKDRATGARA

Query:  ETFTNVGSNVPGRFEGFFADYGNDTKILMIYGQGLDMSLDDMASAGLGRSMKGRTGSSESKRKRGDQATQTMEIIQNAMDFANDQLKAIAEWPNLQRQEE
        E+F ++G N P  +E F  D   DT    +Y QGL+MS D++      +  +GR  SS SKRKR  QAT + ++++ A+++ N+QL  IAEWP LQRQ+ 
Subjt:  ETFTNVGSNVPGRFEGFFADYGNDTKILMIYGQGLDMSLDDMASAGLGRSMKGRTGSSESKRKRGDQATQTMEIIQNAMDFANDQLKAIAEWPNLQRQEE

Query:  GVVRDEVV
           R EVV
Subjt:  GVVRDEVV

KAA0036924.1 retrotransposon protein [Cucumis melo var. makuwa]2.2e-9357.51Show/hide
Query:  MTNADVMEDYDEGDSTHA-TTAGDDIHYIETFNEWSHWRDDLAQEM-----------APKHIWTKEEEAYLVEALVELVSADGWKSNNGTFRPGYLAQLQ
        MT  D +ED DEGDST+A TTA +DI YIET NEWS WRD+LA  M           AP+H+WT+EEE  LVE L+ELVS  GWKS+NGTFRPGYLAQL 
Subjt:  MTNADVMEDYDEGDSTHA-TTAGDDIHYIETFNEWSHWRDDLAQEM-----------APKHIWTKEEEAYLVEALVELVSADGWKSNNGTFRPGYLAQLQ

Query:  RMMAEKMPGCNIQGLPTIDCRIKNMKKTFQTIAEMCGPSCSKFGWNDEVKCIIAERELFDSWVKSHSVAKGLLNKPFPHYDDLCYAFGKDRATGARAETF
        RMMAEK+PGC ++    IDCRIK +K+TFQ IAEM GP+CS FGWNDE KCI+AE+ELFD+WV+SH  AKGLLNKPFP+YD+L Y F +DRATG  AETF
Subjt:  RMMAEKMPGCNIQGLPTIDCRIKNMKKTFQTIAEMCGPSCSKFGWNDEVKCIIAERELFDSWVKSHSVAKGLLNKPFPHYDDLCYAFGKDRATGARAETF

Query:  TNVGSNVP-GRFEGFFADYGNDTKILMIYGQGLDMSLDDMASAGLGRSMKGRTGSSESKRKRGDQATQTMEIIQNAMDFANDQLKAIAEWPNLQRQEEGV
         +VGSN P G ++ F    GN+     +Y QG+D+S DD+ ++   R+ +GRTGSS SKRKRG Q    +E I  A+D  N+QL+ IAEWP      +  
Subjt:  TNVGSNVP-GRFEGFFADYGNDTKILMIYGQGLDMSLDDMASAGLGRSMKGRTGSSESKRKRGDQATQTMEIIQNAMDFANDQLKAIAEWPNLQRQEEGV

Query:  VRDEVVNQLRASP
        +R E    LR  P
Subjt:  VRDEVVNQLRASP

XP_008441954.1 PREDICTED: uncharacterized protein LOC103485953 [Cucumis melo]6.6e-8260.16Show/hide
Query:  APKHIWTKEEEAYLVEALVELVSADGWKSNNGTFRPGYLAQLQRMMAEKMPGCNIQGLPTIDCRIKNMKKTFQTIAEMCGPSCSKFGWNDEVKCIIAERE
        APKH WTKEEE   VE LVELVS+ GW+S+NGTF+PGYLAQLQRMMAEK+PG NIQ   TIDC +K++KKT+  IAEM GPSCS FGWN+E +CIIAER+
Subjt:  APKHIWTKEEEAYLVEALVELVSADGWKSNNGTFRPGYLAQLQRMMAEKMPGCNIQGLPTIDCRIKNMKKTFQTIAEMCGPSCSKFGWNDEVKCIIAERE

Query:  LFDSWVKSHSVAKGLLNKPFPHYDDLCYAFGKDRATGARAETFTNVGSNVPGRFEGFF-ADYGNDTKILMIYGQGLDMSLDDMASAGLGRSMKGRTGSSE
        LFDSW+KSH  AKGLL+K FP+YDDL Y FGKDRATGAR+ETF NVGSNV   F         +D  I  +Y QG+ MS D+M     G++ + R  SS 
Subjt:  LFDSWVKSHSVAKGLLNKPFPHYDDLCYAFGKDRATGARAETFTNVGSNVPGRFEGFF-ADYGNDTKILMIYGQGLDMSLDDMASAGLGRSMKGRTGSSE

Query:  SKRKRGDQATQTMEIIQNAMDFANDQLKAIAEWPNLQRQEEGVVRDEVVNQLRASP
        SKRKRG +  +T+E+I++ M+F N+QLKAIA+WP  +R  E  +R +VV QL+  P
Subjt:  SKRKRGDQATQTMEIIQNAMDFANDQLKAIAEWPNLQRQEEGVVRDEVVNQLRASP

TrEMBL top hitse value%identityAlignment
A0A1S3B4L3 uncharacterized protein LOC1034859533.2e-8260.16Show/hide
Query:  APKHIWTKEEEAYLVEALVELVSADGWKSNNGTFRPGYLAQLQRMMAEKMPGCNIQGLPTIDCRIKNMKKTFQTIAEMCGPSCSKFGWNDEVKCIIAERE
        APKH WTKEEE   VE LVELVS+ GW+S+NGTF+PGYLAQLQRMMAEK+PG NIQ   TIDC +K++KKT+  IAEM GPSCS FGWN+E +CIIAER+
Subjt:  APKHIWTKEEEAYLVEALVELVSADGWKSNNGTFRPGYLAQLQRMMAEKMPGCNIQGLPTIDCRIKNMKKTFQTIAEMCGPSCSKFGWNDEVKCIIAERE

Query:  LFDSWVKSHSVAKGLLNKPFPHYDDLCYAFGKDRATGARAETFTNVGSNVPGRFEGFF-ADYGNDTKILMIYGQGLDMSLDDMASAGLGRSMKGRTGSSE
        LFDSW+KSH  AKGLL+K FP+YDDL Y FGKDRATGAR+ETF NVGSNV   F         +D  I  +Y QG+ MS D+M     G++ + R  SS 
Subjt:  LFDSWVKSHSVAKGLLNKPFPHYDDLCYAFGKDRATGARAETFTNVGSNVPGRFEGFF-ADYGNDTKILMIYGQGLDMSLDDMASAGLGRSMKGRTGSSE

Query:  SKRKRGDQATQTMEIIQNAMDFANDQLKAIAEWPNLQRQEEGVVRDEVVNQLRASP
        SKRKRG +  +T+E+I++ M+F N+QLKAIA+WP  +R  E  +R +VV QL+  P
Subjt:  SKRKRGDQATQTMEIIQNAMDFANDQLKAIAEWPNLQRQEEGVVRDEVVNQLRASP

A0A5A7SXX8 Retrotransposon protein3.2e-8252.27Show/hide
Query:  MTNADVMEDYDEGDSTHATTAGDDIHYIETFNEWSHWRDDLAQEMAP---------------KHIWTKEEEAYLVEALVELVSADGWKSNNGTFRPGYLA
        M   D+++D DE DSTHAT+A DDIHYIET NEW+ WRDDL +EM                 KH WTKEEEA LVE LVELV+A GW+S+NGTFRPG   
Subjt:  MTNADVMEDYDEGDSTHATTAGDDIHYIETFNEWSHWRDDLAQEMAP---------------KHIWTKEEEAYLVEALVELVSADGWKSNNGTFRPGYLA

Query:  QLQRMMAEKMPGCNIQGLPTIDCRIKNMKKTFQTIAEMCGPSCSKFGWNDEVKCIIAERELFDSWVKSHSVAKGLLNKPFPHYDDLCYAFGKDRATGARA
                               RIK +K+ F  IAEMCGP+CS+FGWNDE KCI+AE+E+FD+WVKSH  AKGLLNK FPHYD+L Y FGKDRATG RA
Subjt:  QLQRMMAEKMPGCNIQGLPTIDCRIKNMKKTFQTIAEMCGPSCSKFGWNDEVKCIIAERELFDSWVKSHSVAKGLLNKPFPHYDDLCYAFGKDRATGARA

Query:  ETFTNVGSNVPGRFEGFFADYGNDTKILMIYGQGLDMSLDDMASAGLGRSMKGRTGSSESKRKRGDQATQTMEIIQNAMDFANDQLKAIAEWPNLQRQEE
        E+F ++G N P  +E F  D   DT    +Y QGL+MS D++      +  +GR  SS SKRKR  QAT + ++++ A+++ N+QL  IAEWP LQRQ+ 
Subjt:  ETFTNVGSNVPGRFEGFFADYGNDTKILMIYGQGLDMSLDDMASAGLGRSMKGRTGSSESKRKRGDQATQTMEIIQNAMDFANDQLKAIAEWPNLQRQEE

Query:  GVVRDEVV
           R EVV
Subjt:  GVVRDEVV

A0A5A7T091 Retrotransposon protein1.1e-9357.51Show/hide
Query:  MTNADVMEDYDEGDSTHA-TTAGDDIHYIETFNEWSHWRDDLAQEM-----------APKHIWTKEEEAYLVEALVELVSADGWKSNNGTFRPGYLAQLQ
        MT  D +ED DEGDST+A TTA +DI YIET NEWS WRD+LA  M           AP+H+WT+EEE  LVE L+ELVS  GWKS+NGTFRPGYLAQL 
Subjt:  MTNADVMEDYDEGDSTHA-TTAGDDIHYIETFNEWSHWRDDLAQEM-----------APKHIWTKEEEAYLVEALVELVSADGWKSNNGTFRPGYLAQLQ

Query:  RMMAEKMPGCNIQGLPTIDCRIKNMKKTFQTIAEMCGPSCSKFGWNDEVKCIIAERELFDSWVKSHSVAKGLLNKPFPHYDDLCYAFGKDRATGARAETF
        RMMAEK+PGC ++    IDCRIK +K+TFQ IAEM GP+CS FGWNDE KCI+AE+ELFD+WV+SH  AKGLLNKPFP+YD+L Y F +DRATG  AETF
Subjt:  RMMAEKMPGCNIQGLPTIDCRIKNMKKTFQTIAEMCGPSCSKFGWNDEVKCIIAERELFDSWVKSHSVAKGLLNKPFPHYDDLCYAFGKDRATGARAETF

Query:  TNVGSNVP-GRFEGFFADYGNDTKILMIYGQGLDMSLDDMASAGLGRSMKGRTGSSESKRKRGDQATQTMEIIQNAMDFANDQLKAIAEWPNLQRQEEGV
         +VGSN P G ++ F    GN+     +Y QG+D+S DD+ ++   R+ +GRTGSS SKRKRG Q    +E I  A+D  N+QL+ IAEWP      +  
Subjt:  TNVGSNVP-GRFEGFFADYGNDTKILMIYGQGLDMSLDDMASAGLGRSMKGRTGSSESKRKRGDQATQTMEIIQNAMDFANDQLKAIAEWPNLQRQEEGV

Query:  VRDEVVNQLRASP
        +R E    LR  P
Subjt:  VRDEVVNQLRASP

A0A5A7U0H7 Retrotransposon protein3.2e-8260.16Show/hide
Query:  APKHIWTKEEEAYLVEALVELVSADGWKSNNGTFRPGYLAQLQRMMAEKMPGCNIQGLPTIDCRIKNMKKTFQTIAEMCGPSCSKFGWNDEVKCIIAERE
        APKH WTKEEE   VE LVELVS+ GW+S+NGTF+PGYLAQLQRMMAEK+PG NIQ   TIDC +K++KKT+  IAEM GPSCS FGWN+E +CIIAER+
Subjt:  APKHIWTKEEEAYLVEALVELVSADGWKSNNGTFRPGYLAQLQRMMAEKMPGCNIQGLPTIDCRIKNMKKTFQTIAEMCGPSCSKFGWNDEVKCIIAERE

Query:  LFDSWVKSHSVAKGLLNKPFPHYDDLCYAFGKDRATGARAETFTNVGSNVPGRFEGFF-ADYGNDTKILMIYGQGLDMSLDDMASAGLGRSMKGRTGSSE
        LFDSW+KSH  AKGLL+K FP+YDDL Y FGKDRATGAR+ETF NVGSNV   F         +D  I  +Y QG+ MS D+M     G++ + R  SS 
Subjt:  LFDSWVKSHSVAKGLLNKPFPHYDDLCYAFGKDRATGARAETFTNVGSNVPGRFEGFF-ADYGNDTKILMIYGQGLDMSLDDMASAGLGRSMKGRTGSSE

Query:  SKRKRGDQATQTMEIIQNAMDFANDQLKAIAEWPNLQRQEEGVVRDEVVNQLRASP
        SKRKRG +  +T+E+I++ M+F N+QLKAIA+WP  +R  E  +R +VV QL+  P
Subjt:  SKRKRGDQATQTMEIIQNAMDFANDQLKAIAEWPNLQRQEEGVVRDEVVNQLRASP

E5GCB5 Retrotransposon protein1.0e-9158.17Show/hide
Query:  MTNADVMEDYDEGDSTHATTAGDDIHYIETFNEWSHWRDDLAQEM------APKHIWTKEEEAYLVEALVELVSADGWKSNNGTFRPGYLAQLQRMMAEK
        MTN D+ ++ DE DSTHATTA DDIHYIET NEWS WRD+LA+E+       PKH WTKEEEA LVE LVELV+A GW+S+NGTFRPGYL QL RMMA K
Subjt:  MTNADVMEDYDEGDSTHATTAGDDIHYIETFNEWSHWRDDLAQEM------APKHIWTKEEEAYLVEALVELVSADGWKSNNGTFRPGYLAQLQRMMAEK

Query:  MPGCNIQGLPTIDCRIKNMKKTFQTIAEMCGPSCSKFGWNDEVKCIIAERELFDSWVKSHSVAKGLLNKPFPHYDDLCYAFGKDRATGARAETFTNVGSN
        +PG NI    TID RIK MK+ F  +AEM GP+CS FGWNDE KCI+AE+E+FD W  SH  AKGLLNK F HYD+L Y FGKDRATG RAE+F ++GSN
Subjt:  MPGCNIQGLPTIDCRIKNMKKTFQTIAEMCGPSCSKFGWNDEVKCIIAERELFDSWVKSHSVAKGLLNKPFPHYDDLCYAFGKDRATGARAETFTNVGSN

Query:  VPGRFEGFFADYGNDTKILMIYGQGLDMSLDDMASAGLGRSMKGRTGSSESKRKRGDQATQTMEIIQNAMDFANDQLKAIAEWPNLQRQEEGVVRDEVVN
         P  ++   AD   DT    +Y  GL+MS DD+      R  + R  SS SKRKR   AT + +I++ A+++ N+QL  IAEWP LQRQ+    R E+V 
Subjt:  VPGRFEGFFADYGNDTKILMIYGQGLDMSLDDMASAGLGRSMKGRTGSSESKRKRGDQATQTMEIIQNAMDFANDQLKAIAEWPNLQRQEEGVVRDEVVN

Query:  QLRASP
         L A P
Subjt:  QLRASP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G30140.1 unknown protein4.7e-0928.28Show/hide
Query:  QEMAPKHIWTKEEEAYLVEALVELVSADGWKSNNGTFRPGYLAQLQRMM--AEKMPGCNIQGLPTIDCRIKNMKKTFQTIAEMCGPSCSKFGWNDEVKCI
        +E  P + WT +E     + L+EL+    W+ ++G    G L    +++    K  GCN +       R+K +K  +Q+  ++   S S FGW+ E K  
Subjt:  QEMAPKHIWTKEEEAYLVEALVELVSADGWKSNNGTFRPGYLAQLQRMM--AEKMPGCNIQGLPTIDCRIKNMKKTFQTIAEMCGPSCSKFGWNDEVKCI

Query:  IAERELFDSWVKSHSVAKGLLNKPFPHYDDLCYAFGKDRATGARA
         A  E++  ++K+H   K +  +   H++DL   FG   ATG+ A
Subjt:  IAERELFDSWVKSHSVAKGLLNKPFPHYDDLCYAFGKDRATGARA

AT2G24960.2 unknown protein1.1e-0523.53Show/hide
Query:  WTKEEEAYLVEALVELVSADGWKSNNGTFRPGYL----AQLQRMMAEKMPGCNIQGLPTIDCRIKNMKKTFQTIAEMCGPSCSKFGWNDEVKCIIAEREL
        WT   + +L++ LVE V       NNG  R G      A  + + A      +      +  R K++++ +  I  +     + F W+     +IA+ ++
Subjt:  WTKEEEAYLVEALVELVSADGWKSNNGTFRPGYL----AQLQRMMAEKMPGCNIQGLPTIDCRIKNMKKTFQTIAEMCGPSCSKFGWNDEVKCIIAEREL

Query:  FDSWVKSHSVAKGLLNKPFPHYDDLCYAFGKDRATG
        +++++++H  A+    K  P Y +LC+ FGK+ + G
Subjt:  FDSWVKSHSVAKGLLNKPFPHYDDLCYAFGKDRATG

AT4G02210.1 unknown protein1.8e-0526.47Show/hide
Query:  RIKNMKKTFQTIAEMCGPSCSKFGWNDEVKCIIAERELFDSWVKSHSVAKGLLNKPFPHYDDLCYAFG
        R K++++ F  I  +       F W++E + + A+  ++  ++K+H  A+  + +P P+Y DLC   G
Subjt:  RIKNMKKTFQTIAEMCGPSCSKFGWNDEVKCIIAERELFDSWVKSHSVAKGLLNKPFPHYDDLCYAFG

AT4G02210.2 unknown protein1.8e-0526.47Show/hide
Query:  RIKNMKKTFQTIAEMCGPSCSKFGWNDEVKCIIAERELFDSWVKSHSVAKGLLNKPFPHYDDLCYAFG
        R K++++ F  I  +       F W++E + + A+  ++  ++K+H  A+  + +P P+Y DLC   G
Subjt:  RIKNMKKTFQTIAEMCGPSCSKFGWNDEVKCIIAERELFDSWVKSHSVAKGLLNKPFPHYDDLCYAFG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACGAACGCTGATGTGATGGAGGACTACGATGAGGGTGATTCGACACACGCAACAACTGCAGGTGACGACATTCACTACATCGAAACGTTCAATGAATGGAGTCATTG
GAGGGATGATCTGGCCCAAGAGATGGCCCCCAAACACATCTGGACGAAGGAGGAAGAGGCATACCTGGTCGAGGCCTTGGTAGAGTTGGTATCCGCCGACGGTTGGAAGT
CAAACAACGGTACCTTTCGTCCTGGGTACTTAGCGCAGCTACAGAGAATGATGGCAGAGAAGATGCCTGGATGTAATATCCAAGGTTTGCCGACAATTGACTGTCGGATA
AAAAACATGAAGAAAACATTCCAGACCATTGCAGAGATGTGCGGGCCGTCGTGTAGTAAGTTTGGTTGGAACGATGAAGTGAAGTGCATCATCGCTGAGAGAGAGTTGTT
CGACAGTTGGGTTAAGAGTCATTCTGTTGCGAAGGGCCTCCTAAACAAGCCATTTCCCCATTACGACGACCTGTGCTACGCCTTTGGGAAGGATCGTGCGACGGGAGCTC
GTGCAGAGACATTCACCAACGTCGGGTCGAATGTGCCTGGAAGGTTCGAAGGATTTTTCGCTGATTATGGAAACGATACGAAGATCCTTATGATATACGGTCAGGGACTA
GACATGTCGCTTGATGACATGGCGAGTGCAGGTCTCGGTCGCTCGATGAAGGGTAGGACTGGTTCAAGCGAATCGAAGAGGAAGCGAGGAGACCAGGCAACACAGACTAT
GGAGATCATTCAGAATGCAATGGACTTCGCCAACGACCAGTTGAAGGCTATTGCAGAGTGGCCGAATTTACAACGTCAAGAAGAAGGCGTCGTTCGTGACGAAGTCGTTA
ACCAACTCAGGGCATCCCCGAACTAA
mRNA sequenceShow/hide mRNA sequence
ATGACGAACGCTGATGTGATGGAGGACTACGATGAGGGTGATTCGACACACGCAACAACTGCAGGTGACGACATTCACTACATCGAAACGTTCAATGAATGGAGTCATTG
GAGGGATGATCTGGCCCAAGAGATGGCCCCCAAACACATCTGGACGAAGGAGGAAGAGGCATACCTGGTCGAGGCCTTGGTAGAGTTGGTATCCGCCGACGGTTGGAAGT
CAAACAACGGTACCTTTCGTCCTGGGTACTTAGCGCAGCTACAGAGAATGATGGCAGAGAAGATGCCTGGATGTAATATCCAAGGTTTGCCGACAATTGACTGTCGGATA
AAAAACATGAAGAAAACATTCCAGACCATTGCAGAGATGTGCGGGCCGTCGTGTAGTAAGTTTGGTTGGAACGATGAAGTGAAGTGCATCATCGCTGAGAGAGAGTTGTT
CGACAGTTGGGTTAAGAGTCATTCTGTTGCGAAGGGCCTCCTAAACAAGCCATTTCCCCATTACGACGACCTGTGCTACGCCTTTGGGAAGGATCGTGCGACGGGAGCTC
GTGCAGAGACATTCACCAACGTCGGGTCGAATGTGCCTGGAAGGTTCGAAGGATTTTTCGCTGATTATGGAAACGATACGAAGATCCTTATGATATACGGTCAGGGACTA
GACATGTCGCTTGATGACATGGCGAGTGCAGGTCTCGGTCGCTCGATGAAGGGTAGGACTGGTTCAAGCGAATCGAAGAGGAAGCGAGGAGACCAGGCAACACAGACTAT
GGAGATCATTCAGAATGCAATGGACTTCGCCAACGACCAGTTGAAGGCTATTGCAGAGTGGCCGAATTTACAACGTCAAGAAGAAGGCGTCGTTCGTGACGAAGTCGTTA
ACCAACTCAGGGCATCCCCGAACTAA
Protein sequenceShow/hide protein sequence
MTNADVMEDYDEGDSTHATTAGDDIHYIETFNEWSHWRDDLAQEMAPKHIWTKEEEAYLVEALVELVSADGWKSNNGTFRPGYLAQLQRMMAEKMPGCNIQGLPTIDCRI
KNMKKTFQTIAEMCGPSCSKFGWNDEVKCIIAERELFDSWVKSHSVAKGLLNKPFPHYDDLCYAFGKDRATGARAETFTNVGSNVPGRFEGFFADYGNDTKILMIYGQGL
DMSLDDMASAGLGRSMKGRTGSSESKRKRGDQATQTMEIIQNAMDFANDQLKAIAEWPNLQRQEEGVVRDEVVNQLRASPN