CuGenDBv2

Clc09G18930 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc09G18930
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Retrotransposon protein
Genome location: ClcChr09:31730821..31731864
RNA-Seq Expression: Clc09G18930
Synteny: Clc09G18930
Gene Ontology terms: NA
InterPro domains: NA
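
The Genome location field packs the sequence ID and the coordinates into one string. A minimal sketch of splitting it apart follows. The function name `parse_location` is hypothetical, and the code assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

# Pattern for strings such as "ClcChr09:31730821..31731864".
LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end, span)."""
    match = LOCATION_RE.match(location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    start = int(match.group("start"))
    end = int(match.group("end"))
    if end < start:
        raise ValueError("end coordinate precedes start coordinate")
    # Assumption: 1-based inclusive coordinates, so span = end - start + 1.
    return match.group("seqid"), start, end, end - start + 1


seqid, start, end, span = parse_location("ClcChr09:31730821..31731864")
print(seqid, start, end, span)
```

Under the inclusive-coordinate assumption, the locus spans 1044 bp on ClcChr09.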


Homology
GenBank top hits (e value, % identity, alignment)
XP_038877407.1 uncharacterized protein LOC120069696 [Benincasa hispida] | e value: 1.3e-79 | % identity: 68.35
Query:  VRSLKKQYNAVSEMLGQSGFDWNEEFKCAQVEREIFDLWVRSHPSAKGMWNKSFPHYDDLSTIFGKDRAIGQSSEDPYMMASNAFREFKDEIRLGMQDCQ
        VRSLKKQYNA+SEML QSGFDWNEEFKC QVEREIF+LWVRSHP+AKGMWNK FPHYDDLST                                   DC 
Subjt:  VRSLKKQYNAVSEMLGQSGFDWNEEFKCAQVEREIFDLWVRSHPSAKGMWNKSFPHYDDLSTIFGKDRAIGQSSEDPYMMASNAFREFKDEIRLGMQDCQ

Query:  TLEVRQTNSPLNLDGMDEETAEQSTGRAT-PTESSQGSKRKRPSFQAELIDIMRSTVEMQNTHMGRLALWQKEKYKLEFARQKEVVNVIYNIDDLTEDDQ
        T EV Q  S LN D +DEE  EQSTGR + P ESS+GSKRKR SFQ E+IDIMRSTVEM +THMGRLA WQK+KY+LEF RQKEVVN IYNID L ED Q
Subjt:  TLEVRQTNSPLNLDGMDEETAEQSTGRAT-PTESSQGSKRKRPSFQAELIDIMRSTVEMQNTHMGRLALWQKEKYKLEFARQKEVVNVIYNIDDLTEDDQ

Query:  VTLIDILVTDIQKTDCFLVVPEHPRKRYCLRLLGRNM
        VTLID++VTDIQKTDCFL VPEH  KRYCLRLLGRNM
Subjt:  VTLIDILVTDIQKTDCFLVVPEHPRKRYCLRLLGRNM

XP_038887234.1 uncharacterized protein LOC120077425 [Benincasa hispida] | e value: 1.9e-94 | % identity: 63.61
Query:  MAGNSKRTKHIWSKVEDAKLVEALLYL-----------------------------------------VRSLKKQYNAVSEMLGQSGFDWNEEFKCAQVE
        M GNSKR+KH+WSKVEDA+LVEALLYL                                         VRSLKKQYNAVSEML QSGF+WNEEFKC QVE
Subjt:  MAGNSKRTKHIWSKVEDAKLVEALLYL-----------------------------------------VRSLKKQYNAVSEMLGQSGFDWNEEFKCAQVE

Query:  REIFDLWVRSHPSAKGMWNKSFPHYDDLSTIFGKDRAIGQSSEDPYMMASNAFREFKDEIRLGMQDCQTLEVRQTNSPLNLDGMDEETAEQSTGRAT-PT
        REIFDLWVRSHP+AKGMW K FPHYDDLS +FGKDRA                            DC T EVRQT SPLN D +DEE AEQSTGRA+ PT
Subjt:  REIFDLWVRSHPSAKGMWNKSFPHYDDLSTIFGKDRAIGQSSEDPYMMASNAFREFKDEIRLGMQDCQTLEVRQTNSPLNLDGMDEETAEQSTGRAT-PT

Query:  ESSQGSKRKRPSFQAELIDIMRSTVEMQNTHMGRLALWQKEKYKLEFARQKEVVNVIYNIDDLTEDDQVTLIDILVTDIQKTDCFLVVPEHPRKRYCLRL
        ESS+GSKRKR SFQ E+IDI++STVEMQ+THMGRLA WQ EKY+LE    KEVVN IYNIDDL E+DQVTLID++VTDIQKTDCFL VPEH RKRYCLRL
Subjt:  ESSQGSKRKRPSFQAELIDIMRSTVEMQNTHMGRLALWQKEKYKLEFARQKEVVNVIYNIDDLTEDDQVTLIDILVTDIQKTDCFLVVPEHPRKRYCLRL

Query:  LGRNM
        LGRNM
Subjt:  LGRNM

XP_038895773.1 uncharacterized protein LOC120083935 [Benincasa hispida] | e value: 2.1e-82 | % identity: 56.81
Query:  NSKRTKHIWSKVEDAKLVEALLYL-----------------------------------------VRSLKKQYNAVSEMLGQSGFDWNEEFKCAQVEREI
        N KR+KH+WSKVEDAK VEALLYL                                         VRSLKKQ NAVSEML QSGFDWNEEFKC QVEREI
Subjt:  NSKRTKHIWSKVEDAKLVEALLYL-----------------------------------------VRSLKKQYNAVSEMLGQSGFDWNEEFKCAQVEREI

Query:  FDLWVRSHPSAKGMWNKSFPHYDDLSTIFGKDRAIGQSSEDPYMMASNAFREFKDEIRLGMQDCQTLEVRQTNSPLNLDGMDEETAEQSTGRATPTESSQ
        FD WVRSHP+AKGMWNK FPHYDDLST+FGK +A+GQSSEDPY+M +NAFREF+DEIRLG QDC T E                                
Subjt:  FDLWVRSHPSAKGMWNKSFPHYDDLSTIFGKDRAIGQSSEDPYMMASNAFREFKDEIRLGMQDCQTLEVRQTNSPLNLDGMDEETAEQSTGRATPTESSQ

Query:  GSKRKRPSFQAELIDIMRSTVEMQNTHMGRLALWQKEKYKLEFARQKEVVNVIYNIDDLTEDDQVTLIDILVTDIQKTDCFLVVPEHPRKRYCLRLLGRN
                                +THMGRLA WQKEKY+LEF R+KEVVN IYNID L EDDQVTLID+LVTDIQKT+CFL VPEH RKRYCLRLLGRN
Subjt:  GSKRKRPSFQAELIDIMRSTVEMQNTHMGRLALWQKEKYKLEFARQKEVVNVIYNIDDLTEDDQVTLIDILVTDIQKTDCFLVVPEHPRKRYCLRLLGRN

Query:  M
        M
Subjt:  M

XP_038896380.1 uncharacterized protein LOC120084641 [Benincasa hispida] | e value: 3.0e-92 | % identity: 62.95
Query:  MAGNSKRTKHIWSKVEDAKLVEALLYL-----------------------------------------VRSLKKQYNAVSEMLGQSGFDWNEEFKCAQVE
        MAG+ KR+KH+WSKVED KLVEALLYL                                         VRSLKKQYNAVSEML QSGF WNEEFKC QVE
Subjt:  MAGNSKRTKHIWSKVEDAKLVEALLYL-----------------------------------------VRSLKKQYNAVSEMLGQSGFDWNEEFKCAQVE

Query:  REIFDLWVRSHPSAKGMWNKSFPHYDDLSTIFGKDRAIGQSSEDPYMMASNAFREFKDEIRLGMQDCQTLEVRQTNSPLNLDGMDEETAEQSTGRATP-T
        +EIFDLWVRSH +AKGMWNKSF HYDDLST+FGKDRA                            +C T EV Q  SPLN D +DEE AEQSTGRA+   
Subjt:  REIFDLWVRSHPSAKGMWNKSFPHYDDLSTIFGKDRAIGQSSEDPYMMASNAFREFKDEIRLGMQDCQTLEVRQTNSPLNLDGMDEETAEQSTGRATP-T

Query:  ESSQGSKRKRPSFQAELIDIMRSTVEMQNTHMGRLALWQKEKYKLEFARQKEVVNVIYNIDDLTEDDQVTLIDILVTDIQKTDCFLVVPEHPRKRYCLRL
        ESS+GSKRKRPSFQAE+IDIMRSTVEMQ+THMGRLA WQKEKY+LEF R+KEVVN IY+ID L EDDQVT ID+LVTDIQKTDCFL VPEH RKRYCL L
Subjt:  ESSQGSKRKRPSFQAELIDIMRSTVEMQNTHMGRLALWQKEKYKLEFARQKEVVNVIYNIDDLTEDDQVTLIDILVTDIQKTDCFLVVPEHPRKRYCLRL

Query:  LGRNM
        L RNM
Subjt:  LGRNM

XP_038899910.1 uncharacterized protein LOC120087100 [Benincasa hispida] | e value: 9.2e-78 | % identity: 72.09
Query:  MLGQSGFDWNEEFKCAQVEREIFDLWVRSHPSAKGMWNKSFPHYDDLSTIFGKDRAIGQSSEDPYMMASNAFREFKDEIRLGMQDCQTLEVRQTNSPLNL
        ML QSGF WNEEFKC QVE+EIF+   RSHP+AKGMWNKSFPHYDDLST+FGKDRA+GQSSEDPY+MA NAFREF+DEIRLG QDC+T EVRQT SPLN 
Subjt:  MLGQSGFDWNEEFKCAQVEREIFDLWVRSHPSAKGMWNKSFPHYDDLSTIFGKDRAIGQSSEDPYMMASNAFREFKDEIRLGMQDCQTLEVRQTNSPLNL

Query:  DGMDEETAEQSTGRAT-PTESSQGSKRKRPSFQAELIDIMRSTVEMQNTHMGRLALWQKEKYKLEFARQKEVVNVIYNIDDLTEDDQVTLIDILVTDIQK
        D +DEE AEQSTGRA+ P E+SQGSKRKRPSFQAE+IDIMRSTVEMQ+THMGRLA WQKEKY+LEF   KEVVN IY+ID L EDD+      L++   +
Subjt:  DGMDEETAEQSTGRAT-PTESSQGSKRKRPSFQAELIDIMRSTVEMQNTHMGRLALWQKEKYKLEFARQKEVVNVIYNIDDLTEDDQVTLIDILVTDIQK

Query:  TDCFLVVPEHPRKRY
        T    V+P   RK++
Subjt:  TDCFLVVPEHPRKRY

TrEMBL top hits (e value, % identity, alignment)
A0A1S3B4L3 uncharacterized protein LOC103485953 | e value: 9.4e-36 | % identity: 34.75
Query:  MAGNSKRTKHIWSKVEDAKLVEALLYL-------------------------------------------VRSLKKQYNAVSEMLGQ--SGFDWNEEFKC
        MA  S+  KH W+K E+ K VE L+ L                                           V+SLKK Y+A++EM G   SGF WNEEF+C
Subjt:  MAGNSKRTKHIWSKVEDAKLVEALLYL-------------------------------------------VRSLKKQYNAVSEMLGQ--SGFDWNEEFKC

Query:  AQVEREIFDLWVRSHPSAKGMWNKSFPHYDDLSTIFGKDRAIGQSSEDPYMMASNAFREFKDEIRLGMQDCQTLEVRQTNS-PLNLDGMDEETAEQSTGR
           ER++FD W++SHP+AKG+ +KSFP+YDDLS +FGKDRA G  SE    + SN    F D I LG    + +    +    ++ D M    A Q++ R
Subjt:  AQVEREIFDLWVRSHPSAKGMWNKSFPHYDDLSTIFGKDRAIGQSSEDPYMMASNAFREFKDEIRLGMQDCQTLEVRQTNS-PLNLDGMDEETAEQSTGR

Query:  ATPTESSQGSKRKRPSFQAELIDIMRSTVEMQNTHMGRLALWQKEKYKLEFARQKEVVNVIYNIDDLTEDDQVTLIDILVTDIQKTDCFLVVPEHPRKRY
              S  SKRKR S + E ++++RS +E  N  +  +A W KEK  +E   + +VV  + +I  L   D+  L+ IL   ++  + FL +P   +  Y
Subjt:  ATPTESSQGSKRKRPSFQAELIDIMRSTVEMQNTHMGRLALWQKEKYKLEFARQKEVVNVIYNIDDLTEDDQVTLIDILVTDIQKTDCFLVVPEHPRKRY

Query:  CLRLL
        C  LL
Subjt:  CLRLL

A0A5A7U0H7 Retrotransposon protein | e value: 9.4e-36 | % identity: 34.75
Query:  MAGNSKRTKHIWSKVEDAKLVEALLYL-------------------------------------------VRSLKKQYNAVSEMLGQ--SGFDWNEEFKC
        MA  S+  KH W+K E+ K VE L+ L                                           V+SLKK Y+A++EM G   SGF WNEEF+C
Subjt:  MAGNSKRTKHIWSKVEDAKLVEALLYL-------------------------------------------VRSLKKQYNAVSEMLGQ--SGFDWNEEFKC

Query:  AQVEREIFDLWVRSHPSAKGMWNKSFPHYDDLSTIFGKDRAIGQSSEDPYMMASNAFREFKDEIRLGMQDCQTLEVRQTNS-PLNLDGMDEETAEQSTGR
           ER++FD W++SHP+AKG+ +KSFP+YDDLS +FGKDRA G  SE    + SN    F D I LG    + +    +    ++ D M    A Q++ R
Subjt:  AQVEREIFDLWVRSHPSAKGMWNKSFPHYDDLSTIFGKDRAIGQSSEDPYMMASNAFREFKDEIRLGMQDCQTLEVRQTNS-PLNLDGMDEETAEQSTGR

Query:  ATPTESSQGSKRKRPSFQAELIDIMRSTVEMQNTHMGRLALWQKEKYKLEFARQKEVVNVIYNIDDLTEDDQVTLIDILVTDIQKTDCFLVVPEHPRKRY
              S  SKRKR S + E ++++RS +E  N  +  +A W KEK  +E   + +VV  + +I  L   D+  L+ IL   ++  + FL +P   +  Y
Subjt:  ATPTESSQGSKRKRPSFQAELIDIMRSTVEMQNTHMGRLALWQKEKYKLEFARQKEVVNVIYNIDDLTEDDQVTLIDILVTDIQKTDCFLVVPEHPRKRY

Query:  CLRLL
        C  LL
Subjt:  CLRLL

A0A5A7UME4 Retrotransposon protein | e value: 3.8e-29 | % identity: 32.57
Query:  MAGNSKRTKHIWSKVEDAKLVEALLYLVRS------------------------------------------LKKQYNAVSEMLGQ--SGFDWNEEFKCA
        M  +S+  KH W+K E+A LVE L+ LV +                                          +K+ ++A++EM G   SGF WN+E KC 
Subjt:  MAGNSKRTKHIWSKVEDAKLVEALLYLVRS------------------------------------------LKKQYNAVSEMLGQ--SGFDWNEEFKCA

Query:  QVEREIFDLWVRSHPSAKGMWNKSFPHYDDLSTIFGKDRAIGQSSEDPYMMASNAFREFKDEIRLGMQDCQTLEVRQTNSPLNLDG-MDEETAEQSTGRA
          E+E+FD W  SHP+AKG+ NKSF HYD+LS +FGKDRA G  +E    + SN    +  E    M D     +      ++ D  M+  TA  S  R 
Subjt:  QVEREIFDLWVRSHPSAKGMWNKSFPHYDDLSTIFGKDRAIGQSSEDPYMMASNAFREFKDEIRLGMQDCQTLEVRQTNSPLNLDG-MDEETAEQSTGRA

Query:  TPTESSQGSKRKRPSFQAELIDIMRSTVEMQNTHMGRLALWQKEKYKLEFARQKEVVNVIYNIDDLTEDDQVTLIDILVTDIQKTDCFLVVPEHPRKRYC
             S GSKRKRP    +  DI+R+ +E  N  + R+A W   + +     ++E+V  +  I +LT  D+  L+ IL+ ++     FL VP+H +  YC
Subjt:  TPTESSQGSKRKRPSFQAELIDIMRSTVEMQNTHMGRLALWQKEKYKLEFARQKEVVNVIYNIDDLTEDDQVTLIDILVTDIQKTDCFLVVPEHPRKRYC

Query:  LRLLGRN
          +L  N
Subjt:  LRLLGRN

A0A5D3C7T4 Uncharacterized protein | e value: 5.4e-31 | % identity: 31.67
Query:  LIYSIFIEQAYAYSIMAGNSKRTKHIWSKVEDAKLVEALLYLVR-------------SLKKQYNAVSEMLGQ--SGFDWNEEFKCAQVEREIFDLWVRSH
        L+  +  + +    + + NSK TKH W+ +ED  LVE LL LV                 KQY A++EM+G   SGF WNE  KC +VE+ +FD WV+ H
Subjt:  LIYSIFIEQAYAYSIMAGNSKRTKHIWSKVEDAKLVEALLYLVR-------------SLKKQYNAVSEMLGQ--SGFDWNEEFKCAQVEREIFDLWVRSH

Query:  PSAKGMWNKSFPHYDDLSTIFGKDRAIGQSSEDPYMMASNAFREF-KDEIRLGMQDCQTLEVRQTNSPLNLDGMDEETAEQSTGRATPTESSQGSKR---
        P+A+G+ NK FP++ DL  +FG+DRA G   + P  M+S   R+  +D++ + ++D             N  G++  + E      T      GS R   
Subjt:  PSAKGMWNKSFPHYDDLSTIFGKDRAIGQSSEDPYMMASNAFREF-KDEIRLGMQDCQTLEVRQTNSPLNLDGMDEETAEQSTGRATPTESSQGSKR---

Query:  KRPSFQAELIDIMRSTVEMQNTHMGRLALWQKEKYKLEFARQKEVVNVIYNIDDLTEDDQVTLIDILVTDIQKTDCFLVVP
        KR S+  +L+D  R+++   +  +G++A WQ+EK ++E +  K +   +  I  +  DD + + + L+ D      FL  P
Subjt:  KRPSFQAELIDIMRSTVEMQNTHMGRLALWQKEKYKLEFARQKEVVNVIYNIDDLTEDDQVTLIDILVTDIQKTDCFLVVP

A0A6J1DW73 uncharacterized protein LOC111025018 | e value: 3.2e-36 | % identity: 38.71
Query:  GFDWNEEFKCAQVEREIFDLWVRSHPSAKGMWNKSFPHYDDLSTIFGKDRAIGQSSEDPYMMASNAFREFKDEIRLGMQDCQTLEVRQTNSPLNLDGMDE
        GF WN++ KC + E+E+FD WV+SHP+AKG+ NK  PHYDDL+  FGKDRA G + + P  MAS+A     ++     QD    +    N+    D ++E
Subjt:  GFDWNEEFKCAQVEREIFDLWVRSHPSAKGMWNKSFPHYDDLSTIFGKDRAIGQSSEDPYMMASNAFREFKDEIRLGMQDCQTLEVRQTNSPLNLDGMDE

Query:  ETAEQSTGRATPTESSQGSKRKRPSFQAELIDIMRSTVEMQNTHMGRLALWQKEKYKLEFARQKEVVNVIYNIDDLTEDDQVTLIDILVTDIQKTDCFLV
        +     T + T   SS GSKRKR  + +E++D++R+ + MQ  H+ ++A W  +K + + AR+K V + +  I +L  +D V L+ IL+T+++K+  FL 
Subjt:  ETAEQSTGRATPTESSQGSKRKRPSFQAELIDIMRSTVEMQNTHMGRLALWQKEKYKLEFARQKEVVNVIYNIDDLTEDDQVTLIDILVTDIQKTDCFLV

Query:  VPEHPRKRYCLRLLGRN
        VP   +K +C++LLG++
Subjt:  VPEHPRKRYCLRLLGRN

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT2G24960.1 unknown protein | e value: 3.5e-06 | % identity: 30
Query:  LLYLVRSLKKQYNAVSEMLGQSGFDWNEEFKCAQVEREIFDLWVRSHPSAKGMWNKSFPHYDDLSTIFGKDRAIGQSSED
        L +    L K Y  +  +L + GF W+E       +  ++D +++ HP A+    KS P Y+DL TIF      G    D
Subjt:  LLYLVRSLKKQYNAVSEMLGQSGFDWNEEFKCAQVEREIFDLWVRSHPSAKGMWNKSFPHYDDLSTIFGKDRAIGQSSED

AT2G24960.2 unknown protein | e value: 7.0e-07 | % identity: 29.58
Query:  RSLKKQYNAVSEMLGQSGFDWNEEFKCAQVEREIFDLWVRSHPSAKGMWNKSFPHYDDLSTIFGKDRAIGQ
        + L++ YN +  +L Q+GF W+        + +I++ ++++HP A+    K+ P Y +L  IFGK+ + G+
Subjt:  RSLKKQYNAVSEMLGQSGFDWNEEFKCAQVEREIFDLWVRSHPSAKGMWNKSFPHYDDLSTIFGKDRAIGQ

AT4G02210.1 unknown protein | e value: 3.5e-06 | % identity: 24.53
Query:  IYSIFIEQAYAYSIMAGNSKRTKHIWSKVEDAKLVEALLYLVRSLKKQYNAVSEMLGQSGFDWNEEFKCAQVEREIFDLWVRSHPSAKGMWNKSFPHYDD
        I  +F +QA+   +   N        +K E    V+ L    +SL++Q+NA+  +L   GF W+ E +    +  ++  ++++H  A+    +  P+Y D
Subjt:  IYSIFIEQAYAYSIMAGNSKRTKHIWSKVEDAKLVEALLYLVRSLKKQYNAVSEMLGQSGFDWNEEFKCAQVEREIFDLWVRSHPSAKGMWNKSFPHYDD

Query:  LSTIFG
        L  + G
Subjt:  LSTIFG

AT4G02210.2 unknown protein | e value: 3.5e-06 | % identity: 24.53
Query:  IYSIFIEQAYAYSIMAGNSKRTKHIWSKVEDAKLVEALLYLVRSLKKQYNAVSEMLGQSGFDWNEEFKCAQVEREIFDLWVRSHPSAKGMWNKSFPHYDD
        I  +F +QA+   +   N        +K E    V+ L    +SL++Q+NA+  +L   GF W+ E +    +  ++  ++++H  A+    +  P+Y D
Subjt:  IYSIFIEQAYAYSIMAGNSKRTKHIWSKVEDAKLVEALLYLVRSLKKQYNAVSEMLGQSGFDWNEEFKCAQVEREIFDLWVRSHPSAKGMWNKSFPHYDD

Query:  LSTIFG
        L  + G
Subjt:  LSTIFG


Sequences
CDS sequence
ATGTATTTAATTTATAGCATTTTCATTGAACAAGCGTATGCATACAGTATAATGGCAGGTAATAGTAAGAGGACGAAGCACATATGGTCTAAGGTGGAGGACGCTAAGTT
GGTGGAAGCCCTACTTTATTTGGTGAGAAGTCTGAAGAAACAATACAATGCAGTATCAGAGATGTTAGGTCAGTCAGGATTCGACTGGAACGAAGAGTTCAAATGTGCCC
AAGTCGAGAGGGAGATTTTCGATCTTTGGGTTCGGAGTCATCCTAGTGCAAAGGGGATGTGGAACAAGTCATTCCCCCATTACGATGACCTCTCCACCATCTTTGGGAAG
GATAGAGCTATAGGGCAATCAAGTGAGGACCCATACATGATGGCGAGTAATGCATTTAGAGAGTTTAAAGATGAGATTCGACTTGGAATGCAGGACTGTCAGACACTTGA
GGTTCGCCAAACAAATTCACCATTAAATCTGGATGGAATGGATGAAGAGACAGCAGAGCAATCTACAGGTAGAGCGACACCTACCGAGTCATCTCAAGGAAGCAAGAGAA
AGAGGCCATCATTCCAAGCTGAATTGATCGACATCATGAGATCAACTGTTGAGATGCAGAACACGCACATGGGTAGACTAGCATTGTGGCAGAAGGAGAAGTATAAGCTG
GAGTTCGCTCGTCAGAAGGAAGTAGTAAATGTCATATACAACATTGACGACTTGACTGAGGATGACCAGGTGACCCTTATTGACATACTTGTCACAGACATTCAGAAGAC
AGATTGCTTCCTTGTAGTACCAGAACACCCAAGGAAAAGGTATTGTCTTCGTCTACTAGGAAGAAACATGTAG
mRNA sequence
ATGTATTTAATTTATAGCATTTTCATTGAACAAGCGTATGCATACAGTATAATGGCAGGTAATAGTAAGAGGACGAAGCACATATGGTCTAAGGTGGAGGACGCTAAGTT
GGTGGAAGCCCTACTTTATTTGGTGAGAAGTCTGAAGAAACAATACAATGCAGTATCAGAGATGTTAGGTCAGTCAGGATTCGACTGGAACGAAGAGTTCAAATGTGCCC
AAGTCGAGAGGGAGATTTTCGATCTTTGGGTTCGGAGTCATCCTAGTGCAAAGGGGATGTGGAACAAGTCATTCCCCCATTACGATGACCTCTCCACCATCTTTGGGAAG
GATAGAGCTATAGGGCAATCAAGTGAGGACCCATACATGATGGCGAGTAATGCATTTAGAGAGTTTAAAGATGAGATTCGACTTGGAATGCAGGACTGTCAGACACTTGA
GGTTCGCCAAACAAATTCACCATTAAATCTGGATGGAATGGATGAAGAGACAGCAGAGCAATCTACAGGTAGAGCGACACCTACCGAGTCATCTCAAGGAAGCAAGAGAA
AGAGGCCATCATTCCAAGCTGAATTGATCGACATCATGAGATCAACTGTTGAGATGCAGAACACGCACATGGGTAGACTAGCATTGTGGCAGAAGGAGAAGTATAAGCTG
GAGTTCGCTCGTCAGAAGGAAGTAGTAAATGTCATATACAACATTGACGACTTGACTGAGGATGACCAGGTGACCCTTATTGACATACTTGTCACAGACATTCAGAAGAC
AGATTGCTTCCTTGTAGTACCAGAACACCCAAGGAAAAGGTATTGTCTTCGTCTACTAGGAAGAAACATGTAG
Protein sequence
MYLIYSIFIEQAYAYSIMAGNSKRTKHIWSKVEDAKLVEALLYLVRSLKKQYNAVSEMLGQSGFDWNEEFKCAQVEREIFDLWVRSHPSAKGMWNKSFPHYDDLSTIFGK
DRAIGQSSEDPYMMASNAFREFKDEIRLGMQDCQTLEVRQTNSPLNLDGMDEETAEQSTGRATPTESSQGSKRKRPSFQAELIDIMRSTVEMQNTHMGRLALWQKEKYKL
EFARQKEVVNVIYNIDDLTEDDQVTLIDILVTDIQKTDCFLVVPEHPRKRYCLRLLGRNM
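
The CDS and protein sequences above can be checked against each other by translating the CDS. A minimal sketch follows, assuming the standard nuclear genetic code. The names `CODON_TABLE` and `translate` are illustrative and not part of any database tooling. Only the first 36 and last 12 nucleotides of the CDS are copied in; the full sequence can be passed the same way.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGTATTTAATTTATAGCATTTTCATTGAACAAGCG"  # first 36 nt of the CDS
cds_end = "AGAAACATGTAG"                             # last 12 nt of the CDS
print(translate(cds_start))  # MYLIYSIFIEQA
print(translate(cds_end))    # RNM
```

Both fragments agree with the listed protein sequence, which begins MYLIYSIFIEQA and ends RNM.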