; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0017352 (gene) of Chayote v1 genome

Gene IDSed0017352
OrganismSechium edule (Chayote v1)
DescriptionRetrotransposon protein
Genome locationLG02:33514441..33518425
RNA-Seq ExpressionSed0017352
SyntenySed0017352
Gene Ontology termsNA
InterPro domainsIPR024752 - Myb/SANT-like domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYJ96933.1 retrotransposon protein [Cucumis melo var. makuwa]6.8e-3436.29Show/hide
Query:  MAAAGNRGARHQWTSTEDAILVDALLGL-EAQHLTAPNGTFRPGYLGALLLDIQTKMPDTTLNTT-SIECKVRTLKKQYAAIKEMVGPGGSGFGWNEARQ
        MA+  ++  +H+WT+ +D  LV+ LL L E     A N TF+P YL  +   ++ K+P + +  T ++E +V+ LKKQY AI +M+GP  S FGWNE R+
Subjt:  MAAAGNRGARHQWTSTEDAILVDALLGL-EAQHLTAPNGTFRPGYLGALLLDIQTKMPDTTLNTT-SIECKVRTLKKQYAAIKEMVGPGGSGFGWNEARQ

Query:  CVDIDKDTFDLWVKSHPNAKGLWSKPFSHYHALSLIFGNDRASGSLNINPTTHVQSITADREEIRVEEDDMDLNFPEYYVPDTQHLNPTAGQTSMVEDNL
        C++ +K  FD WVK HPNA+GL +KPF++++ L ++FG D+A+G     P   + S TA       EEDDMD+N  ++ +P+   L P +G+        
Subjt:  CVDIDKDTFDLWVKSHPNAKGLWSKPFSHYHALSLIFGNDRASGSLNINPTTHVQSITADREEIRVEEDDMDLNFPEYYVPDTQHLNPTAGQTSMVEDNL

Query:  QASQTVNITSGNHSGSQSRMSGSKRKKPSQDLELLEA
               + S  H    SR S  +R  P   ++   A
Subjt:  QASQTVNITSGNHSGSQSRMSGSKRKKPSQDLELLEA

XP_038887234.1 uncharacterized protein LOC120077425 [Benincasa hispida]1.8e-3934.68Show/hide
Query:  NRGARHQWTSTEDAILVDALLGLEAQHLTAPNGTFRPGYLGALLLDIQTKMPDTTLNTTSIECKVRTLKKQYAAIKEMVGPGGSGFGWNEARQCVDIDKD
        ++ ++H W+  EDA LV+ALL L      + NGTFRPGYL  L   +  K+P   LN  +IECKVR+LKKQY A+ EM+    SGF WNE  +CV ++++
Subjt:  NRGARHQWTSTEDAILVDALLGLEAQHLTAPNGTFRPGYLGALLLDIQTKMPDTTLNTTSIECKVRTLKKQYAAIKEMVGPGGSGFGWNEARQCVDIDKD

Query:  TFDLWVKSHPNAKGLWSKPFSHYHALSLIFGNDRASGSLNINPTTHVQSITADREEIRVEEDDMDLNFPEYYVPDTQHLNPTAGQTSMVEDNLQASQTVN
         FDLWV+SHPNAKG+W KPF HY  LS +FG DRA                                  + + P+ +       Q  + E+  + S    
Subjt:  TFDLWVKSHPNAKGLWSKPFSHYHALSLIFGNDRASGSLNINPTTHVQSITADREEIRVEEDDMDLNFPEYYVPDTQHLNPTAGQTSMVEDNLQASQTVN

Query:  ITSGNHSGSQSRMSGSKRKKPSQDLELLEALKNLIEKQPDHFQMYMKEWHTQKNEMLNNAMGDALEKIESLTYMNDKDKATLVYLVASDHGKVQTFL
          +G  S       GSKRK+ S  +E+++ +K+ +E Q  H    +  W  +K E+    + +A+  I+ L    + D+ TL+ L+ +D  K   FL
Subjt:  ITSGNHSGSQSRMSGSKRKKPSQDLELLEALKNLIEKQPDHFQMYMKEWHTQKNEMLNNAMGDALEKIESLTYMNDKDKATLVYLVASDHGKVQTFL

XP_038892629.1 uncharacterized protein At2g29880-like [Benincasa hispida]4.0e-3444.26Show/hide
Query:  AAGNRGARHQWTSTEDAILVDALLGLEAQHLTAPNGTFRPGYLGALLLDIQTKMPDTTLNTTSIECKVRTLKKQYAAIKEMVGPGGSGFGWNEARQCVDI
        A   + ++H W+  EDA LV+ALL L        NGTFRPGYL  L   +  K+P   LN  +IECKVR+LKKQY A+ EM+    SG GWNE  +CV +
Subjt:  AAGNRGARHQWTSTEDAILVDALLGLEAQHLTAPNGTFRPGYLGALLLDIQTKMPDTTLNTTSIECKVRTLKKQYAAIKEMVGPGGSGFGWNEARQCVDI

Query:  DKDTFDLWVKSHPNAKGLWSKPFSHYHALSLIFGNDRASGSLNINP------TTHVQSITADREEIRVEEDDMDLNFPEYYVP
        +++ FDLWV SHPNAK +W+KPF HY  LS IFG DRA G  + NP      T  V+   +   +  ++E+  + +     VP
Subjt:  DKDTFDLWVKSHPNAKGLWSKPFSHYHALSLIFGNDRASGSLNINP------TTHVQSITADREEIRVEEDDMDLNFPEYYVP

XP_038896380.1 uncharacterized protein LOC120084641 [Benincasa hispida]2.0e-3834.11Show/hide
Query:  MAAAGNRGARHQWTSTEDAILVDALLGLEAQHLTAPNGTFRPGYLGALLLDIQTKMPDTTLNTTSIECKVRTLKKQYAAIKEMVGPGGSGFGWNEARQCV
        MA +G R ++H W+  ED  LV+ALL L      + NGTFR GYL  L   +  K+P   LN  +IECKVR+LKKQY A+ EM+    SGFGWNE  +CV
Subjt:  MAAAGNRGARHQWTSTEDAILVDALLGLEAQHLTAPNGTFRPGYLGALLLDIQTKMPDTTLNTTSIECKVRTLKKQYAAIKEMVGPGGSGFGWNEARQCV

Query:  DIDKDTFDLWVKSHPNAKGLWSKPFSHYHALSLIFGNDRASGSLNINPTTHVQSITADREEIRVEEDDMDLNFPEYYVPDTQHLNPTAGQTSMVEDNLQA
         ++K+ FDLWV+SH NAKG+W+K F HY  LS +FG DRA+         H   +   + E  + +D++          D +    + G+ S++ ++ + 
Subjt:  DIDKDTFDLWVKSHPNAKGLWSKPFSHYHALSLIFGNDRASGSLNINPTTHVQSITADREEIRVEEDDMDLNFPEYYVPDTQHLNPTAGQTSMVEDNLQA

Query:  SQTVNITSGNHSGSQSRMSGSKRKKPSQDLELLEALKNLIEKQPDHFQMYMKEWHTQKNEMLNNAMGDALEKIESLTYMNDKDKATLVYLVASDHGKVQT
                           GSKRK+PS   E+++ +++ +E Q  H    +  W  +K E+      + +  I S+  +++ D+ T + L+ +D  K   
Subjt:  SQTVNITSGNHSGSQSRMSGSKRKKPSQDLELLEALKNLIEKQPDHFQMYMKEWHTQKNEMLNNAMGDALEKIESLTYMNDKDKATLVYLVASDHGKVQT

Query:  FL
        FL
Subjt:  FL

XP_038902479.1 uncharacterized protein At2g29880-like [Benincasa hispida]1.6e-3541.41Show/hide
Query:  MAAAGNRGARHQWTSTEDAILVDALLGLEAQHLTAPNGTFRPGYLGALLLDIQTKMPDTTLNTTSIECKVRTLKKQYAAIKEMVGPGGSGFGWNEARQCV
        M + G R ++H W+  EDA LV+ALL L      + NGTFRPGYL  L   +  K+P  TLN  +IECKVR+LKKQY  + EM+    SGF WNE  +CV
Subjt:  MAAAGNRGARHQWTSTEDAILVDALLGLEAQHLTAPNGTFRPGYLGALLLDIQTKMPDTTLNTTSIECKVRTLKKQYAAIKEMVGPGGSGFGWNEARQCV

Query:  DIDKDTFDLWVKSHPNAKGLWSKPFSHYHALSLIFGNDRASGSLNINPTTHVQSITADREEIRVEEDDMDLNFPEYYVPDTQHLNPTAGQTSMVEDNLQA
         ++++ FDLWV SHPNAK +W+KPF HY   S +FG DR  G  + +P  +V +  A RE     ED++ L   +   P+ +       Q  + E+  + 
Subjt:  DIDKDTFDLWVKSHPNAKGLWSKPFSHYHALSLIFGNDRASGSLNINPTTHVQSITADREEIRVEEDDMDLNFPEYYVPDTQHLNPTAGQTSMVEDNLQA

Query:  SQTVNITSGNHSGSQSRMSGSKRKKPS
        S      +G  S       GSKRK+PS
Subjt:  SQTVNITSGNHSGSQSRMSGSKRKKPS

TrEMBL top hitse value%identityAlignment
A0A072UYX1 Myb/SANT-like DNA-binding domain protein1.8e-3238.91Show/hide
Query:  RGARHQWTSTEDAILVDALLGLEAQHLTAPNGTFRPGYLGALLLDIQTKMPDTTLNTT-SIECKVRTLKKQYAAIKEMVGPGGSGFGWNEARQCVDIDKD
        +G + QWT+ EDA+LV  LL L      A  GTF+PGY   L   +  K+PD TL     IE +V+ LK  Y+AI++M+GP  SGFGWN+A + + ++K+
Subjt:  RGARHQWTSTEDAILVDALLGLEAQHLTAPNGTFRPGYLGALLLDIQTKMPDTTLNTT-SIECKVRTLKKQYAAIKEMVGPGGSGFGWNEARQCVDIDKD

Query:  TFDLWVKSHPNAKGLWSKPFSHYHALSLIFGNDRASGSLNINPTTHVQSITAD-REEIRVEEDDMDLNFPEYYVPDTQHLNPTAGQTSMVEDNLQASQTV
         +  W KSHPNA GL+ KPF HY  L  +FG D+A G  + +P  H  +I  +        E D+DLN  E    ++Q        T+ V   +  SQ  
Subjt:  TFDLWVKSHPNAKGLWSKPFSHYHALSLIFGNDRASGSLNINPTTHVQSITAD-REEIRVEEDDMDLNFPEYYVPDTQHLNPTAGQTSMVEDNLQASQTV

Query:  NITSGNHSGSQSRMSGSKRKKPSQDL--ELLEALKNLIE
        ++TS N SG     S +KR K + D+   LL +L  L E
Subjt:  NITSGNHSGSQSRMSGSKRKKPSQDL--ELLEALKNLIE

A0A1S3C252 uncharacterized protein At2g29880-like3.3e-3436.29Show/hide
Query:  MAAAGNRGARHQWTSTEDAILVDALLGL-EAQHLTAPNGTFRPGYLGALLLDIQTKMPDTTLNTT-SIECKVRTLKKQYAAIKEMVGPGGSGFGWNEARQ
        MA+  ++  +H+WT+ +D  LV+ LL L E     A N TF+P YL  +   ++ K+P + +  T ++E +V+ LKKQY AI +M+GP  S FGWNE R+
Subjt:  MAAAGNRGARHQWTSTEDAILVDALLGL-EAQHLTAPNGTFRPGYLGALLLDIQTKMPDTTLNTT-SIECKVRTLKKQYAAIKEMVGPGGSGFGWNEARQ

Query:  CVDIDKDTFDLWVKSHPNAKGLWSKPFSHYHALSLIFGNDRASGSLNINPTTHVQSITADREEIRVEEDDMDLNFPEYYVPDTQHLNPTAGQTSMVEDNL
        C++ +K  FD WVK HPNA+GL +KPF++++ L ++FG D+A+G     P   + S TA       EEDDMD+N  ++ +P+   L P +G+        
Subjt:  CVDIDKDTFDLWVKSHPNAKGLWSKPFSHYHALSLIFGNDRASGSLNINPTTHVQSITADREEIRVEEDDMDLNFPEYYVPDTQHLNPTAGQTSMVEDNL

Query:  QASQTVNITSGNHSGSQSRMSGSKRKKPSQDLELLEA
               + S  H    SR S  +R  P   ++   A
Subjt:  QASQTVNITSGNHSGSQSRMSGSKRKKPSQDLELLEA

A0A5A7VHE7 Retrotransposon protein6.2e-3333.8Show/hide
Query:  NRGARHQWTSTEDAILVDALLGLEAQ-HLTAPNGTFRPGYLGALLLDIQTKMPDTTLNTTS-IECKVRTLKKQYAAIKEMVGPGGSGFGWNEARQCVDID
        NR  RH WT  E+  LV+ L+ L +     + NGTFRPGYL  L+  +  K+P   + TT+ I+C+++TLK+ + AI EM GP  SGFGWN+  +C+  +
Subjt:  NRGARHQWTSTEDAILVDALLGLEAQ-HLTAPNGTFRPGYLGALLLDIQTKMPDTTLNTTS-IECKVRTLKKQYAAIKEMVGPGGSGFGWNEARQCVDID

Query:  KDTFDLWVKSHPNAKGLWSKPFSHYHALSLIFGNDRASGSLNINPTTHVQSITADREEIRVEEDDMDLNFPEYYVPDTQHLNPTAGQTSMVEDNLQASQT
        K+ FD WV+SHP AKGL +KPF +Y  L+ +FG +RA+G         V S   D    R +  D + +FP  Y          +    + +DN++AS+ 
Subjt:  KDTFDLWVKSHPNAKGLWSKPFSHYHALSLIFGNDRASGSLNINPTTHVQSITADREEIRVEEDDMDLNFPEYYVPDTQHLNPTAGQTSMVEDNLQASQT

Query:  VNITSGNHSGSQSRMSGSKRKKPSQDLELLEALKNLIEKQPDHFQMYMKEWHTQKNEMLNNAMGDALEKIESLTYMNDKDKATL
           + G  +GS    SGSKRK+ SQ    LE +   +++  +  +  + EW  +     N+   +    +  +  +   D+A L
Subjt:  VNITSGNHSGSQSRMSGSKRKKPSQDLELLEALKNLIEKQPDHFQMYMKEWHTQKNEMLNNAMGDALEKIESLTYMNDKDKATL

A0A5D3BC95 Retrotransposon protein3.3e-3436.29Show/hide
Query:  MAAAGNRGARHQWTSTEDAILVDALLGL-EAQHLTAPNGTFRPGYLGALLLDIQTKMPDTTLNTT-SIECKVRTLKKQYAAIKEMVGPGGSGFGWNEARQ
        MA+  ++  +H+WT+ +D  LV+ LL L E     A N TF+P YL  +   ++ K+P + +  T ++E +V+ LKKQY AI +M+GP  S FGWNE R+
Subjt:  MAAAGNRGARHQWTSTEDAILVDALLGL-EAQHLTAPNGTFRPGYLGALLLDIQTKMPDTTLNTT-SIECKVRTLKKQYAAIKEMVGPGGSGFGWNEARQ

Query:  CVDIDKDTFDLWVKSHPNAKGLWSKPFSHYHALSLIFGNDRASGSLNINPTTHVQSITADREEIRVEEDDMDLNFPEYYVPDTQHLNPTAGQTSMVEDNL
        C++ +K  FD WVK HPNA+GL +KPF++++ L ++FG D+A+G     P   + S TA       EEDDMD+N  ++ +P+   L P +G+        
Subjt:  CVDIDKDTFDLWVKSHPNAKGLWSKPFSHYHALSLIFGNDRASGSLNINPTTHVQSITADREEIRVEEDDMDLNFPEYYVPDTQHLNPTAGQTSMVEDNL

Query:  QASQTVNITSGNHSGSQSRMSGSKRKKPSQDLELLEA
               + S  H    SR S  +R  P   ++   A
Subjt:  QASQTVNITSGNHSGSQSRMSGSKRKKPSQDLELLEA

A0A5D3DQQ3 Retrotransposon protein1.1e-3233.8Show/hide
Query:  NRGARHQWTSTEDAILVDALLGLEAQ-HLTAPNGTFRPGYLGALLLDIQTKMPDTTLNTTS-IECKVRTLKKQYAAIKEMVGPGGSGFGWNEARQCVDID
        NR  RH WT  E+  LV+ L+ L +     + NGTFRPGYL  L+  +  K+P   + TT+ I+C+++TLK+ + AI EM GP  SGFGWN+  +C+  +
Subjt:  NRGARHQWTSTEDAILVDALLGLEAQ-HLTAPNGTFRPGYLGALLLDIQTKMPDTTLNTTS-IECKVRTLKKQYAAIKEMVGPGGSGFGWNEARQCVDID

Query:  KDTFDLWVKSHPNAKGLWSKPFSHYHALSLIFGNDRASGSLNINPTTHVQSITADREEIRVEEDDMDLNFPEYYVPDTQHLNPTAGQTSMVEDNLQASQT
        K+ FD WV+SHP AKGL +KPF +Y  L+ +FG +RA+G         V S   D    R +  D + +FP  Y          +    + +DN++AS+ 
Subjt:  KDTFDLWVKSHPNAKGLWSKPFSHYHALSLIFGNDRASGSLNINPTTHVQSITADREEIRVEEDDMDLNFPEYYVPDTQHLNPTAGQTSMVEDNLQASQT

Query:  VNITSGNHSGSQSRMSGSKRKKPSQDLELLEALKNLIEKQPDHFQMYMKEWHTQKNEMLNNAMGDALEKIESLTYMNDKDKATL
           + G  +GS    SGSKRK+ SQ    LE +   +++  +  +  + EW  +     N+   +    +  +  +   D+A L
Subjt:  VNITSGNHSGSQSRMSGSKRKKPSQDLELLEALKNLIEKQPDHFQMYMKEWHTQKNEMLNNAMGDALEKIESLTYMNDKDKATL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G30140.1 unknown protein5.2e-0824.47Show/hide
Query:  RGARHQWTSTEDAILVDALLGLEAQHLTAPNGTFRPGYLGALLLDIQTKMPDTTLNTTSIECKVRTLKKQYAAIKEMVGPGGSGFGWNEARQCVDIDKDT
        +G  +QWT  E     D L+ L  Q+    +G      + + LL    K      N  +   +++ LK  Y +  ++     SGFGW+   +      + 
Subjt:  RGARHQWTSTEDAILVDALLGLEAQHLTAPNGTFRPGYLGALLLDIQTKMPDTTLNTTSIECKVRTLKKQYAAIKEMVGPGGSGFGWNEARQCVDIDKDT

Query:  FDLWVKSHPNAKGLWSKPFSHYHALSLIFGNDRASGS--LNINPTTHVQSITADREEIRVEEDDMDLNFPEYYVPDTQHLNPTAGQTS
        +  ++K+HPN K + ++   H+  L +IFG+  A+GS  + ++ +T  +  T        E  + D N  E Y    QH +     TS
Subjt:  FDLWVKSHPNAKGLWSKPFSHYHALSLIFGNDRASGS--LNINPTTHVQSITADREEIRVEEDDMDLNFPEYYVPDTQHLNPTAGQTS

AT2G24960.1 unknown protein2.0e-0726.13Show/hide
Query:  NRGARHQWTSTEDAILVDALLGLEAQHLTAPN---GTFRPGYLGALLLDIQTKMPDTTLNTTSIECKVRTLKKQYAAIKEMVGPGGSGFGWNEARQCVDI
        N   R  WT T +   +D +L    +HL   N    TF       +L    +K   +  +   ++ +   L KQY  +K ++  G  GF W++  Q V  
Subjt:  NRGARHQWTSTEDAILVDALLGLEAQHLTAPN---GTFRPGYLGALLLDIQTKMPDTTLNTTSIECKVRTLKKQYAAIKEMVGPGGSGFGWNEARQCVDI

Query:  DKDTFDLWVKSHPNAKGLWSKPFSHYHALSLIFGNDRASG-------SLNINPTTHVQSITADREEIRVEE--DDMDLNFPEYYVPDTQHLNPTAGQTS
        D   + L++K+HP A+   +KP  ++  L LI+G   A G        L I    + +S+    +E    E   +MD  F E  V      N T    S
Subjt:  DKDTFDLWVKSHPNAKGLWSKPFSHYHALSLIFGNDRASG-------SLNINPTTHVQSITADREEIRVEE--DDMDLNFPEYYVPDTQHLNPTAGQTS

AT2G24960.2 unknown protein2.0e-0726.13Show/hide
Query:  NRGARHQWTSTEDAILVDALLGLEAQHLTAPN---GTFRPGYLGALLLDIQTKMPDTTLNTTSIECKVRTLKKQYAAIKEMVGPGGSGFGWNEARQCVDI
        N   R  WT T +   +D +L    +HL   N    TF       +L    +K   +  +   ++ +   L KQY  +K ++  G  GF W++  Q V  
Subjt:  NRGARHQWTSTEDAILVDALLGLEAQHLTAPN---GTFRPGYLGALLLDIQTKMPDTTLNTTSIECKVRTLKKQYAAIKEMVGPGGSGFGWNEARQCVDI

Query:  DKDTFDLWVKSHPNAKGLWSKPFSHYHALSLIFGNDRASG-------SLNINPTTHVQSITADREEIRVEE--DDMDLNFPEYYVPDTQHLNPTAGQTS
        D   + L++K+HP A+   +KP  ++  L LI+G   A G        L I    + +S+    +E    E   +MD  F E  V      N T    S
Subjt:  DKDTFDLWVKSHPNAKGLWSKPFSHYHALSLIFGNDRASG-------SLNINPTTHVQSITADREEIRVEE--DDMDLNFPEYYVPDTQHLNPTAGQTS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGCAGCAGGTAACCGAGGTGCAAGACATCAATGGACGAGCACTGAAGATGCTATTTTAGTTGATGCGTTATTGGGTTTAGAAGCTCAACATCTAACCGCACCAAA
TGGGACCTTTCGTCCCGGTTACTTAGGAGCCCTCTTATTGGATATTCAAACGAAGATGCCAGATACTACGCTAAATACTACGAGTATTGAATGCAAGGTTCGTACCCTAA
AAAAACAGTATGCTGCCATCAAGGAAATGGTGGGCCCGGGTGGAAGTGGTTTCGGTTGGAATGAGGCTAGACAGTGTGTTGATATTGATAAAGACACATTCGATCTTTGG
GTCAAGTCACACCCGAATGCCAAAGGCCTATGGAGCAAGCCATTTTCTCATTACCATGCACTGTCCTTAATATTCGGGAATGATAGAGCATCTGGCTCATTAAACATAAA
TCCTACGACACACGTACAATCAATCACCGCAGATAGAGAGGAGATTAGAGTTGAAGAAGATGACATGGATTTAAATTTTCCAGAATATTATGTGCCAGACACTCAACACT
TGAACCCAACAGCAGGACAAACCAGTATGGTTGAAGACAACCTTCAAGCTTCCCAAACAGTGAATATAACATCAGGAAACCATTCAGGTAGCCAATCTAGAATGAGTGGC
AGTAAAAGGAAGAAGCCATCCCAGGATTTGGAATTATTGGAGGCCTTGAAGAATTTGATTGAAAAGCAGCCCGACCACTTTCAAATGTACATGAAGGAGTGGCACACCCA
AAAGAATGAAATGCTTAATAACGCTATGGGAGACGCGTTAGAAAAAATAGAATCCTTAACTTACATGAATGATAAAGATAAAGCGACATTGGTTTACTTGGTGGCATCCG
ATCATGGTAAGGTGCAGACATTCTTATAG
mRNA sequenceShow/hide mRNA sequence
TCGGCTGGGCGCTGGCGATGCGACGCGACACGGAGCTGGCGAGGTCGCGCGGGCGGGCAAGGGACGATGTGCGGCGAGGCTCGGCGGCTGGGCTGCTTTTGCGCACGCGA
GCTCGGCGACAGGAGCTCTGTTTCTGCGCGCGAACAGGGGCGAAGGGGAAACTCGAATTCGAGCGATCCGAGGGGATTCGACGAGCGATTCTTTGATGGCGAACACCCCT
ACACTAGCTGCACAAAACCCATACCTCAAATTACCTCGACTCGGCCGGAAATGGCGAAGCCCGAAACAGGGGAGAAGAACTCGAATTTTGAAGGAGAAGAAGAAGAAGAA
GAAGTACCTGCTGCTCGGGGTGAGATCGACTTCGAATTTGGGAGAGAAGCTCGGCTGAATCGAGGTTGGCGAGGGAGAAAACGCGAGAGGGAGCTCAGGTCGGTCGAGGA
AGAAAAGAGAAGGAAGAAGAAGGAATCGGGGAAGAAGCCTCGGGCGGCCTGACTTTAAACCCGAAGGCACGAAACAAGGGATATGTATTAAATCGCGCGTTTACATTAAC
AAACGCGCCCAGTGGCTCGGCTGGCAGCGCCCCCTGCCTAGCATGCAGCACGTCCTTGGCTCGAATCTCACTCGACGCATTATTTAATTCCTTATTTATTTTTACCTTTT
TCCAAGCCCAAATGTCTTCACCAAAATTCCCGATCCTATTCAAATCGACTCATAATTTTTCCGGCAATTTACAATTTCATAACCGGCCCTGTCGGCCCGACCCAAAATTA
CCAAAATGCCCCTGGGCCTCCCGGGAAGACCGGGATGAAATTTCCGGGCCGTTACAATTTGTGCACACGATGTCAAGAATCGTATAGTTCGAACACAGTGGGCTCGATCT
GGGGAAACAGTGTCATGACACTTTGGGGAAGTGCTAACTGCCATTCTCCAATGTCACGATGTGTTGCTGAAGAAACCGACACCTATAACCAGTGAGTGCACAGACAATCA
ATGGAAATGGTTTCAGGTACCTAAAAAATAATGATATATTTTTCTTAACATTCTTATACAACGATAAAATATTTGGACCCTTACGAACATCGATTTGCAGAATTGTTTGG
GTGCGTTAGATGGCACTTACATCGAAGTGAAGGTGCGAGAGGAAGACAAAGCTCGCTATAGAACAAGAAAGGGAACTATTGCTACAAACGTTCTCGGGGTATGTTCTCCA
GAAGGAGAGTTCATCTTTATATCAGCTGGATTTGAAGGGTCAGCTGCAGATTCTCGAGTGCTAATAGAATCGCTGGCACAACCTAACGGTTTAAAAGTTCCTCGTGTAAT
ATGGCAGCAGCAGGTAACCGAGGTGCAAGACATCAATGGACGAGCACTGAAGATGCTATTTTAGTTGATGCGTTATTGGGTTTAGAAGCTCAACATCTAACCGCACCAAA
TGGGACCTTTCGTCCCGGTTACTTAGGAGCCCTCTTATTGGATATTCAAACGAAGATGCCAGATACTACGCTAAATACTACGAGTATTGAATGCAAGGTTCGTACCCTAA
AAAAACAGTATGCTGCCATCAAGGAAATGGTGGGCCCGGGTGGAAGTGGTTTCGGTTGGAATGAGGCTAGACAGTGTGTTGATATTGATAAAGACACATTCGATCTTTGG
GTCAAGTCACACCCGAATGCCAAAGGCCTATGGAGCAAGCCATTTTCTCATTACCATGCACTGTCCTTAATATTCGGGAATGATAGAGCATCTGGCTCATTAAACATAAA
TCCTACGACACACGTACAATCAATCACCGCAGATAGAGAGGAGATTAGAGTTGAAGAAGATGACATGGATTTAAATTTTCCAGAATATTATGTGCCAGACACTCAACACT
TGAACCCAACAGCAGGACAAACCAGTATGGTTGAAGACAACCTTCAAGCTTCCCAAACAGTGAATATAACATCAGGAAACCATTCAGGTAGCCAATCTAGAATGAGTGGC
AGTAAAAGGAAGAAGCCATCCCAGGATTTGGAATTATTGGAGGCCTTGAAGAATTTGATTGAAAAGCAGCCCGACCACTTTCAAATGTACATGAAGGAGTGGCACACCCA
AAAGAATGAAATGCTTAATAACGCTATGGGAGACGCGTTAGAAAAAATAGAATCCTTAACTTACATGAATGATAAAGATAAAGCGACATTGGTTTACTTGGTGGCATCCG
ATCATGGTAAGGTGCAGACATTCTTATAG
Protein sequenceShow/hide protein sequence
MAAAGNRGARHQWTSTEDAILVDALLGLEAQHLTAPNGTFRPGYLGALLLDIQTKMPDTTLNTTSIECKVRTLKKQYAAIKEMVGPGGSGFGWNEARQCVDIDKDTFDLW
VKSHPNAKGLWSKPFSHYHALSLIFGNDRASGSLNINPTTHVQSITADREEIRVEEDDMDLNFPEYYVPDTQHLNPTAGQTSMVEDNLQASQTVNITSGNHSGSQSRMSG
SKRKKPSQDLELLEALKNLIEKQPDHFQMYMKEWHTQKNEMLNNAMGDALEKIESLTYMNDKDKATLVYLVASDHGKVQTFL