CuGenDBv2

Lag0022916 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0022916
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Ty3-gypsy retrotransposon protein
Genome location: chr7:40793129..40796365
RNA-Seq Expression: Lag0022916
Synteny: Lag0022916
Gene Ontology terms: NA
InterPro domains: NA
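
The genome location above is written as `chromosome:start..end`. The following is a minimal sketch of how such a string could be split and measured. It assumes the coordinates are 1-based and inclusive, which this page does not state; `parse_location` and `span_length` are hypothetical helper names, not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return chrom, start, end


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr7:40793129..40796365")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the interval covers 3,237 bp on chr7. If the coordinates were 0-based half-open instead, the `+ 1` would have to be dropped.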


Homology
GenBank top hits | e value | % identity | Alignment
KAA0032121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa] | 7.9e-40 | 38.38
Query:  IEESMVVNTTLPKSSSKGKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEEQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV
        + ESM+V  T  KS SK K   +  +H        TL+ERQKK+YPFPD+D+ DMLEQL+E+QLI+LP+CKRPE++ KVDDP YCKYHRVI H VE+CFV
Subjt:  IEESMVVNTTLPKSSSKGKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEEQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV

Query:  LKDLILKLAKEGKIELDLDEVAQSNLATIKGKSK--------HQRK----------------------NDPKKLQPKRKRSKKF----------SQPQQL
        LK+LI KLA+E KIELD+DEVAQ+N   +   S          QRK                      N   K +P     +++            P+++
Subjt:  LKDLILKLAKEGKIELDLDEVAQSNLATIKGKSK--------HQRK----------------------NDPKKLQPKRKRSKKF----------SQPQQL

Query:  DLRLRSHQAS------NYSSFSIPKNEYGHDRRRKSMFGV----------------------HLHSTFSFP-----KAKCLHIEEK---STFDICFDRLK
              H  S      NY S+    N     ++R S+F                        +   TF++      K   + I +K   ST+   FDRLK
Subjt:  DLRLRSHQAS------NYSSFSIPKNEYGHDRRRKSMFGV----------------------HLHSTFSFP-----KAKCLHIEEK---STFDICFDRLK

Query:  VTSDQPKRKMDNLEVKLFDEVNNDKKLQSSIPSRMKRKLSVLINTR-FLEVRRFFVV
        +T+DQ +R+M  L+ K F E N+D K+ S +PSRMKRKLSV INT   L V+  F++
Subjt:  VTSDQPKRKMDNLEVKLFDEVNNDKKLQSSIPSRMKRKLSVLINTR-FLEVRRFFVV

KAA0056121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa] | 1.2e-43 | 44.27
Query:  IEESMVVNTTLPKSSSKGK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEEQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV
        I+ESMVV+ T  KS SK K     R+ +G      TLKERQ+K+YPFPD+D+ DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFV
Subjt:  IEESMVVNTTLPKSSSKGK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEEQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV

Query:  LKDLILKLAKEGKIELDLDEVAQSNLATIKGKSKHQRKNDPKKLQPKRK--------RSKKFSQPQQLDLRLRSHQAS------NYSSFSIPKNEYGHDR
        LK+LILKLA+E KIELD+DEVAQ+N A I+  S   +  D   LQ +R         RS     P+++      H AS      NY S S   N      
Subjt:  LKDLILKLAKEGKIELDLDEVAQSNLATIKGKSKHQRKNDPKKLQPKRK--------RSKKFSQPQQLDLRLRSHQAS------NYSSFSIPKNEYGHDR

Query:  RRKSMFGVHLHSTFSFPKAKCLHIEEKSTFDIC----------------------------FDRLKVTSDQPKRKMDNLEVKLFDEVNNDKKLQSSIPSR
        +R S+F     ST      + L +  K   + C                            FDRLK+T+DQ +R+M + + K F E N+D K+ S +PSR
Subjt:  RRKSMFGVHLHSTFSFPKAKCLHIEEKSTFDIC----------------------------FDRLKVTSDQPKRKMDNLEVKLFDEVNNDKKLQSSIPSR

Query:  MKRKLSVLINTR-FLEVRRFFVV
        MKRKL V INT   L V+  F++
Subjt:  MKRKLSVLINTR-FLEVRRFFVV

XP_031739134.1 uncharacterized protein LOC116402863 [Cucumis sativus] | 1.1e-36 | 65.91
Query:  EETIEESMVVNTTLPKSSSKGK-----RQTNGA--HHLTLKERQKKIYPFPDADIPDMLEQLLEEQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVER
        + T +ESMVVNTT P   SKGK     ++ +G+    LTLKERQ+K+YPFPD+DI DMLEQLLE+QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+
Subjt:  EETIEESMVVNTTLPKSSSKGK-----RQTNGA--HHLTLKERQKKIYPFPDADIPDMLEQLLEEQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVER

Query:  CFVLKDLILKLAKEGKIELDLDEVAQSNLATI
        CFVLK+LIL+LA+E +IELDL+EVAQ+N A +
Subjt:  CFVLKDLILKLAKEGKIELDLDEVAQSNLATI

XP_031742032.1 uncharacterized protein LOC116404025 [Cucumis sativus] | 4.8e-37 | 65.91
Query:  EETIEESMVVNTTLPKSSSKGK-----RQTNGA--HHLTLKERQKKIYPFPDADIPDMLEQLLEEQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVER
        + T++ESMVVNTT P   SKGK     ++ +G+    LTLKERQ+K+YPFPD+DI DMLEQLLE+QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+
Subjt:  EETIEESMVVNTTLPKSSSKGK-----RQTNGA--HHLTLKERQKKIYPFPDADIPDMLEQLLEEQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVER

Query:  CFVLKDLILKLAKEGKIELDLDEVAQSNLATI
        CFVLK+LIL+LA+E +IELDL+EVAQ+N A +
Subjt:  CFVLKDLILKLAKEGKIELDLDEVAQSNLATI

XP_031742390.1 uncharacterized protein LOC116401672 [Cucumis sativus] | 1.1e-36 | 65.91
Query:  EETIEESMVVNTTLPKSSSKGK-----RQTNGA--HHLTLKERQKKIYPFPDADIPDMLEQLLEEQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVER
        + T +ESMVVNTT P   SKGK     ++ +G+    LTLKERQ+K+YPFPD+DI DMLEQLLE+QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+
Subjt:  EETIEESMVVNTTLPKSSSKGK-----RQTNGA--HHLTLKERQKKIYPFPDADIPDMLEQLLEEQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVER

Query:  CFVLKDLILKLAKEGKIELDLDEVAQSNLATI
        CFVLK+LIL+LA+E +IELDL+EVAQ+N A +
Subjt:  CFVLKDLILKLAKEGKIELDLDEVAQSNLATI

TrEMBL top hits | e value | % identity | Alignment
A0A5A7SRE2 Ty3-gypsy retrotransposon protein | 3.8e-40 | 38.38
Query:  IEESMVVNTTLPKSSSKGKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEEQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV
        + ESM+V  T  KS SK K   +  +H        TL+ERQKK+YPFPD+D+ DMLEQL+E+QLI+LP+CKRPE++ KVDDP YCKYHRVI H VE+CFV
Subjt:  IEESMVVNTTLPKSSSKGKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEEQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV

Query:  LKDLILKLAKEGKIELDLDEVAQSNLATIKGKSK--------HQRK----------------------NDPKKLQPKRKRSKKF----------SQPQQL
        LK+LI KLA+E KIELD+DEVAQ+N   +   S          QRK                      N   K +P     +++            P+++
Subjt:  LKDLILKLAKEGKIELDLDEVAQSNLATIKGKSK--------HQRK----------------------NDPKKLQPKRKRSKKF----------SQPQQL

Query:  DLRLRSHQAS------NYSSFSIPKNEYGHDRRRKSMFGV----------------------HLHSTFSFP-----KAKCLHIEEK---STFDICFDRLK
              H  S      NY S+    N     ++R S+F                        +   TF++      K   + I +K   ST+   FDRLK
Subjt:  DLRLRSHQAS------NYSSFSIPKNEYGHDRRRKSMFGV----------------------HLHSTFSFP-----KAKCLHIEEK---STFDICFDRLK

Query:  VTSDQPKRKMDNLEVKLFDEVNNDKKLQSSIPSRMKRKLSVLINTR-FLEVRRFFVV
        +T+DQ +R+M  L+ K F E N+D K+ S +PSRMKRKLSV INT   L V+  F++
Subjt:  VTSDQPKRKMDNLEVKLFDEVNNDKKLQSSIPSRMKRKLSVLINTR-FLEVRRFFVV

A0A5A7ULK6 Retrotransposon gag protein | 1.5e-36 | 62.86
Query:  RKEGRNNEETI----EESMVVNTTLPKSS----SKGKRQTNGA--HHLTLKERQKKIYPFPDADIPDMLEQLLEEQLIELPKCKRPEEMEKVDDPKYCKY
        +KE ++ E+ +    +ESMVVNTT  K S     + K++ +G+    LTLKERQ+K+YPFPD+DI DMLEQLLE+QLI+LP+CKRPE+  KVDDP YCKY
Subjt:  RKEGRNNEETI----EESMVVNTTLPKSS----SKGKRQTNGA--HHLTLKERQKKIYPFPDADIPDMLEQLLEEQLIELPKCKRPEEMEKVDDPKYCKY

Query:  HRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLA
        HRVI HPVE+CFVLK+LIL+LA+E KIELDL+EVAQ+N A
Subjt:  HRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLA

A0A5A7URH1 Ty3-gypsy retrotransposon protein | 5.7e-44 | 44.27
Query:  IEESMVVNTTLPKSSSKGK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEEQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV
        I+ESMVV+ T  KS SK K     R+ +G      TLKERQ+K+YPFPD+D+ DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFV
Subjt:  IEESMVVNTTLPKSSSKGK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEEQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV

Query:  LKDLILKLAKEGKIELDLDEVAQSNLATIKGKSKHQRKNDPKKLQPKRK--------RSKKFSQPQQLDLRLRSHQAS------NYSSFSIPKNEYGHDR
        LK+LILKLA+E KIELD+DEVAQ+N A I+  S   +  D   LQ +R         RS     P+++      H AS      NY S S   N      
Subjt:  LKDLILKLAKEGKIELDLDEVAQSNLATIKGKSKHQRKNDPKKLQPKRK--------RSKKFSQPQQLDLRLRSHQAS------NYSSFSIPKNEYGHDR

Query:  RRKSMFGVHLHSTFSFPKAKCLHIEEKSTFDIC----------------------------FDRLKVTSDQPKRKMDNLEVKLFDEVNNDKKLQSSIPSR
        +R S+F     ST      + L +  K   + C                            FDRLK+T+DQ +R+M + + K F E N+D K+ S +PSR
Subjt:  RRKSMFGVHLHSTFSFPKAKCLHIEEKSTFDIC----------------------------FDRLKVTSDQPKRKMDNLEVKLFDEVNNDKKLQSSIPSR

Query:  MKRKLSVLINTR-FLEVRRFFVV
        MKRKL V INT   L V+  F++
Subjt:  MKRKLSVLINTR-FLEVRRFFVV

A0A5A7V7A0 Retrotransposon gag protein | 8.9e-37 | 36.21
Query:  IEESMVVNTTLPKSSSKGK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEEQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV
        I+ESMVV+ T  KS SK K     R+ +G      TLKERQ+K+YPFPD+D+ DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFV
Subjt:  IEESMVVNTTLPKSSSKGK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEEQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV

Query:  LKDLILKLAKEGKIELDLDEVAQSN---------------------------------------LATIKGKSK---------------HQRKNDPKKLQ-
        LK+LILKLA+E KIELD+DEVAQ+N                                       + TI  ++K                Q+   P  +Q 
Subjt:  LKDLILKLAKEGKIELDLDEVAQSN---------------------------------------LATIKGKSK---------------HQRKNDPKKLQ-

Query:  ----------------PKRKRSKKFSQPQQ--------LDLRL------------------------RSHQAS------NYSSFSIPKNEYGHDRRRKSM
                         K +R+KK   P+         L LRL                          H AS      NY S S   N      +R S+
Subjt:  ----------------PKRKRSKKFSQPQQ--------LDLRL------------------------RSHQAS------NYSSFSIPKNEYGHDRRRKSM

Query:  FGVHLHSTFSFPKAKCLHIEEKSTFDIC----------------------------FDRLKVTSDQPKRKMDNLEVKLFDEVNNDKKLQSSIPSRMKRKL
        F     ST      + L +  K   + C                            FDRLK+T DQ +R+M +L+ K F E N+D K+ S +PSRMKRKL
Subjt:  FGVHLHSTFSFPKAKCLHIEEKSTFDIC----------------------------FDRLKVTSDQPKRKMDNLEVKLFDEVNNDKKLQSSIPSRMKRKL

Query:  SVLINT
        SV INT
Subjt:  SVLINT

A0A5D3C2C8 Retrotransposon gag protein | 5.2e-37 | 35.96
Query:  IEESMVVNTTLPKSSSKGK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEEQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV
        I+ESMVV+ T  KS SK K     R+ +G      TLKERQKK+YPFPD+D+ DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFV
Subjt:  IEESMVVNTTLPKSSSKGK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEEQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFV

Query:  LKDLILKLAKEGKIELDLDEVAQSN---------------------------------------LATIKGKSK---------------HQRKNDPKKLQ-
        LK+LILKLA+E KIELD+DEVAQ+N                                       + TI  ++K                Q+   P  +Q 
Subjt:  LKDLILKLAKEGKIELDLDEVAQSN---------------------------------------LATIKGKSK---------------HQRKNDPKKLQ-

Query:  ----------------PKRKRSKKFSQPQQLD------LRLR--------------------------SHQAS------NYSSFSIPKNEYGHDRRRKSM
                         K +R+KK   P+ +       L+LR                           H AS      NY S S   N      +R S+
Subjt:  ----------------PKRKRSKKFSQPQQLD------LRLR--------------------------SHQAS------NYSSFSIPKNEYGHDRRRKSM

Query:  FGVHLHSTFSFPKAKCLHIEEKSTFDIC----------------------------FDRLKVTSDQPKRKMDNLEVKLFDEVNNDKKLQSSIPSRMKRKL
        F     ST      + L +  K   + C                            FDRLK+ +DQ +R+M +L+ K F E N+D K+ S +PSRMKRKL
Subjt:  FGVHLHSTFSFPKAKCLHIEEKSTFDIC----------------------------FDRLKVTSDQPKRKMDNLEVKLFDEVNNDKKLQSSIPSRMKRKL

Query:  SVLINT
        SV INT
Subjt:  SVLINT

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences
CDS sequence
ATGAGAAAAGAAGGAAGGAACAACGAAGAGACTATAGAAGAATCTATGGTTGTCAACACAACCCTTCCCAAGTCGTCTTCGAAAGGAAAGCGACAAACAAATGGAGCGCA
TCATTTAACTTTAAAGGAAAGACAGAAGAAAATCTATCCTTTCCCTGATGCCGACATCCCTGATATGTTGGAACAACTATTGGAAGAGCAACTGATAGAGCTTCCTAAGT
GTAAACGACCAGAAGAGATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCGAGTTATTGGTCATCCAGTGGAAAGATGTTTCGTCCTAAAGGACTTAATTTTA
AAGCTGGCTAAGGAAGGCAAAATTGAGCTCGACCTTGATGAAGTAGCTCAATCAAATCTTGCTACAATCAAAGGAAAGAGCAAACATCAAAGAAAGAATGATCCTAAGAA
ACTTCAACCCAAGAGGAAGAGAAGTAAAAAGTTTTCTCAACCTCAACAACTGGACCTCCGTCTTCGATCGCATCAAGCCTCCAACTACTCGTCCTTCAGTATTCCAAAGA
ATGAGTATGGCCACGACAGAAGAAGAAAATCAATGTTCGGTGTCCACCTTCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCAACCTTC
GACATCTGTTTTGATCGCCTCAAAGTAACAAGCGATCAACCTAAAAGAAAGATGGATAACTTGGAGGTGAAACTTTTCGATGAAGTAAACAACGATAAGAAGCTTCAAAG
TAGCATCCCGTCACGTATGAAGAGGAAGTTATCTGTTCTCATAAATACAAGGTTCCTTGAAGTTCGAAGGTTCTTCGTTGTATCCTGCTGCATTGTTCCTTCTCCAAGTT
CGAGGGTTCTTAGTTGTACAACTACTACGTTGTTCCTCCTCCAAGTGCGAAGGATCTTATGTGGTGCGTTGTTGCATTGTTTCCTCTTCTCTCAAATTCGATGGTTCTCA
CGCAGCTTTGCTGGAGTTTCTTCTCCCCAAGTTCGAAGGTTCTCACGCGCTCCGTTGCAGTTCCTTCTTTCCAAGGTCGAAGGTTCTCACTCTTTCTCCCCAAGTTCGAA
GGTTCACACACTTCGCTGCAGTTCCTTCTCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCCGCTGCTGCAGTTCATTCT
TCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTGCTTCGCAGCAGTTCCTTCTCTCCAAGGTCGAAGGTTCTCACG
GGCTGCGTTGCAGTTCTTTCTCCCCAAGTTCGAAGGTTCACGCACTTCGCTGCAGTTCCTTCTCCCAAATTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCC
AAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCATGTTGAAGGTTCTCACGCGCTTCGTTGCGGTTCCTTCCTCACAATTCGAAGGTTCTCACGCGCTTC
GCTGCAGTTCCTTCCCTCCAAGTTTGAAGGTTCTCACATCGCTTCGCTGCGATCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCAAAG
TTCGAAGGTTCTCACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGTTGCATTTCCTTCCTCCAAATTCGAAGGTTCTCACATCGCTTCCC
TGCGATCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCCTCCCCAAAGTTCGAAGGTTCTCACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGA
AGGTTCTCACGCGTTTCGTTGCATTTCCTTCCTCCAAGTCTGAAGGTTCACGCACTTTGCTGCAGTTCCTTCTCCCAAATTCGAAGGTTCTCACACATTTCGCTGCAGTT
CCTTCCCCCGAGTTCGAAGGTTCTAACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCAAAGGTTCTCACGCGCTTCGTTGCAGTTCCATCCTCCAAGTTCGAAGGTTCT
CACGCGCTTCGCTGCAGTTCCTTCCCCCAAGTTGAAGGTTCTCACGCGCTTCGTTGCAGTTCCTTCCTCCAAGTTCAAAAGTTCTCACGCGCTTCGCTGCAGTTCCTTCC
CCCCAAGTTCGAAGGTTCTCACGCACTTCGCTGCAGTTTCCTTCTCCAAGTTCGAAGGTCCTCATGCTACGCTGGGCTACACTGCTGCGCTACTTCCTAAAGTCCAAAGA
CGTCAATTGTCCTCACGTTGCGCTGCTTCCTTCTCCAAGTTCGAAGGTCCTCATGCTACGCTCGGCTACACTGCTGCGCTACTTCCTAAAGTCCAAAGACGCGTGGCGGC
GACACAAGTCCAAGGACATGTCCCAAAGCAAGGAACATGTCCCTGTACTCATGCTGAAAGACGTGGCGGCAGCACAAGTCCAAGGACATGTCCCAAAGCAGGGAATATGC
CCCTACACTCGTGCTGA
mRNA sequence
ATGAGAAAAGAAGGAAGGAACAACGAAGAGACTATAGAAGAATCTATGGTTGTCAACACAACCCTTCCCAAGTCGTCTTCGAAAGGAAAGCGACAAACAAATGGAGCGCA
TCATTTAACTTTAAAGGAAAGACAGAAGAAAATCTATCCTTTCCCTGATGCCGACATCCCTGATATGTTGGAACAACTATTGGAAGAGCAACTGATAGAGCTTCCTAAGT
GTAAACGACCAGAAGAGATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCGAGTTATTGGTCATCCAGTGGAAAGATGTTTCGTCCTAAAGGACTTAATTTTA
AAGCTGGCTAAGGAAGGCAAAATTGAGCTCGACCTTGATGAAGTAGCTCAATCAAATCTTGCTACAATCAAAGGAAAGAGCAAACATCAAAGAAAGAATGATCCTAAGAA
ACTTCAACCCAAGAGGAAGAGAAGTAAAAAGTTTTCTCAACCTCAACAACTGGACCTCCGTCTTCGATCGCATCAAGCCTCCAACTACTCGTCCTTCAGTATTCCAAAGA
ATGAGTATGGCCACGACAGAAGAAGAAAATCAATGTTCGGTGTCCACCTTCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCAACCTTC
GACATCTGTTTTGATCGCCTCAAAGTAACAAGCGATCAACCTAAAAGAAAGATGGATAACTTGGAGGTGAAACTTTTCGATGAAGTAAACAACGATAAGAAGCTTCAAAG
TAGCATCCCGTCACGTATGAAGAGGAAGTTATCTGTTCTCATAAATACAAGGTTCCTTGAAGTTCGAAGGTTCTTCGTTGTATCCTGCTGCATTGTTCCTTCTCCAAGTT
CGAGGGTTCTTAGTTGTACAACTACTACGTTGTTCCTCCTCCAAGTGCGAAGGATCTTATGTGGTGCGTTGTTGCATTGTTTCCTCTTCTCTCAAATTCGATGGTTCTCA
CGCAGCTTTGCTGGAGTTTCTTCTCCCCAAGTTCGAAGGTTCTCACGCGCTCCGTTGCAGTTCCTTCTTTCCAAGGTCGAAGGTTCTCACTCTTTCTCCCCAAGTTCGAA
GGTTCACACACTTCGCTGCAGTTCCTTCTCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCCGCTGCTGCAGTTCATTCT
TCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTGCTTCGCAGCAGTTCCTTCTCTCCAAGGTCGAAGGTTCTCACG
GGCTGCGTTGCAGTTCTTTCTCCCCAAGTTCGAAGGTTCACGCACTTCGCTGCAGTTCCTTCTCCCAAATTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCC
AAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCATGTTGAAGGTTCTCACGCGCTTCGTTGCGGTTCCTTCCTCACAATTCGAAGGTTCTCACGCGCTTC
GCTGCAGTTCCTTCCCTCCAAGTTTGAAGGTTCTCACATCGCTTCGCTGCGATCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCAAAG
TTCGAAGGTTCTCACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGTTGCATTTCCTTCCTCCAAATTCGAAGGTTCTCACATCGCTTCCC
TGCGATCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCCTCCCCAAAGTTCGAAGGTTCTCACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGA
AGGTTCTCACGCGTTTCGTTGCATTTCCTTCCTCCAAGTCTGAAGGTTCACGCACTTTGCTGCAGTTCCTTCTCCCAAATTCGAAGGTTCTCACACATTTCGCTGCAGTT
CCTTCCCCCGAGTTCGAAGGTTCTAACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCAAAGGTTCTCACGCGCTTCGTTGCAGTTCCATCCTCCAAGTTCGAAGGTTCT
CACGCGCTTCGCTGCAGTTCCTTCCCCCAAGTTGAAGGTTCTCACGCGCTTCGTTGCAGTTCCTTCCTCCAAGTTCAAAAGTTCTCACGCGCTTCGCTGCAGTTCCTTCC
CCCCAAGTTCGAAGGTTCTCACGCACTTCGCTGCAGTTTCCTTCTCCAAGTTCGAAGGTCCTCATGCTACGCTGGGCTACACTGCTGCGCTACTTCCTAAAGTCCAAAGA
CGTCAATTGTCCTCACGTTGCGCTGCTTCCTTCTCCAAGTTCGAAGGTCCTCATGCTACGCTCGGCTACACTGCTGCGCTACTTCCTAAAGTCCAAAGACGCGTGGCGGC
GACACAAGTCCAAGGACATGTCCCAAAGCAAGGAACATGTCCCTGTACTCATGCTGAAAGACGTGGCGGCAGCACAAGTCCAAGGACATGTCCCAAAGCAGGGAATATGC
CCCTACACTCGTGCTGA
Protein sequence
MRKEGRNNEETIEESMVVNTTLPKSSSKGKRQTNGAHHLTLKERQKKIYPFPDADIPDMLEQLLEEQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLIL
KLAKEGKIELDLDEVAQSNLATIKGKSKHQRKNDPKKLQPKRKRSKKFSQPQQLDLRLRSHQASNYSSFSIPKNEYGHDRRRKSMFGVHLHSTFSFPKAKCLHIEEKSTF
DICFDRLKVTSDQPKRKMDNLEVKLFDEVNNDKKLQSSIPSRMKRKLSVLINTRFLEVRRFFVVSCCIVPSPSSRVLSCTTTTLFLLQVRRILCGALLHCFLFSQIRWFS
RSFAGVSSPQVRRFSRAPLQFLLSKVEGSHSFSPSSKVHTLRCSSFSQIRRFSRASLQFLPPSSKVLRCCSSFFQVRRFSRRFAAVPSSKFEGSHVLRSSSFSPRSKVLT
GCVAVLSPQVRRFTHFAAVPSPKFEGSHALRSAIPSPSSKVLTRFAAVPSSMLKVLTRFVAVPSSQFEGSHALRCSSFPPSLKVLTSLRCDPSSKFEGSHALRSAIPSPK
FEGSHALRAVPSSKFEGSHALRCISFLQIRRFSHRFPAILPPSSKVLTRFALQFPPQSSKVLTRFVQFLPPNSKVLTRFVAFPSSKSEGSRTLLQFLLPNSKVLTHFAAV
PSPEFEGSNALRCSSFLQVQRFSRASLQFHPPSSKVLTRFAAVPSPKLKVLTRFVAVPSSKFKSSHALRCSSFPPSSKVLTHFAAVSFSKFEGPHATLGYTAALLPKVQR
RQLSSRCAASFSKFEGPHATLGYTAALLPKVQRRVAATQVQGHVPKQGTCPCTHAERRGGSTSPRTCPKAGNMPLHSC
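
The protein sequence above can be compared against a translation of the CDS. The following is a minimal sketch of that check. It assumes the standard genetic code (NCBI translation table 1), which this page does not state explicitly; `translate` and `CODON_TABLE` are illustrative names, and only the first 57 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with T, C, A, G
# at each position and the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored; codons containing
    characters other than A, C, G, T are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 57 nucleotides of the CDS sequence listed on this page.
cds_start = "ATGAGAAAAGAAGGAAGGAACAACGAAGAGACTATAGAAGAATCTATGGTTGTCAAC"
print(translate(cds_start))
```

The translated fragment should match the first 19 residues of the protein sequence (`MRKEGRNNEETIEESMVVN`). Passing the full CDS, with its line breaks, works the same way because whitespace is stripped before translation.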