; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0028641 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0028641
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotransposon gag protein
Genome locationchr8:27381485..27386602
RNA-Seq ExpressionLag0028641
SyntenyLag0028641
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035697.1 retrotransposon gag protein [Cucumis melo var. makuwa]1.2e-3347.71Show/hide
Query:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLVTIKEKSK--------HQRKKDPKKLQPMR--------------KRSKKFS
        KVDD  YCKYH VI HPVE+CFVLK+LILKLA E+KI+LD+DE+AQ+N V ++  S          QR+K+  ++                  K     +
Subjt:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLVTIKEKSK--------HQRKKDPKKLQPMR--------------KRSKKFS

Query:  QPQQLDLRLRSHQASNYSSLAFQRMSMAAIEEENQCSMSTSTRPSAFQRLSVSTSKKTRPSTSVFDRLKVTSGQSKRKMDNLEVKLFDEVNSDKKLQSSI
        + +Q        + S   S  FQR+SMA  EEENQ    TST+ SAF+RL +S SKK RPSTS FDRLK+T+ Q +++M  L+ K F E N+D K+ S +
Subjt:  QPQQLDLRLRSHQASNYSSLAFQRMSMAAIEEENQCSMSTSTRPSAFQRLSVSTSKKTRPSTSVFDRLKVTSGQSKRKMDNLEVKLFDEVNSDKKLQSSI

Query:  PSRMKRKFSVLINTEGSL
         SRMKRK SV INTEGSL
Subjt:  PSRMKRKFSVLINTEGSL

KAA0056121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]5.6e-3447.68Show/hide
Query:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLVTIKEKSKHQRKKDPKKLQPMRK--------RSKKFSQPQQLDLRLRSHQA
        KVDDP YCKYHRVI HPVE+CFVLK+LILKLA E+KIELD+DEVAQ+N   I+  S   + KD   LQ  R         RS     P+++      H A
Subjt:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLVTIKEKSKHQRKKDPKKLQPMRK--------RSKKFSQPQQLDLRLRSHQA

Query:  S------NYS---------------------------SLAFQRMSMAAIEEENQCSMSTSTRPSAFQRLSVSTSKKTRPSTSVFDRLKVTSGQSKRKMDN
        S      NY                            S  FQR+S+A  EEENQC     TR S  +RLS+ST KK RPSTS FDRLK+T+ Q +R+M +
Subjt:  S------NYS---------------------------SLAFQRMSMAAIEEENQCSMSTSTRPSAFQRLSVSTSKKTRPSTSVFDRLKVTSGQSKRKMDN

Query:  LEVKLFDEVNSDKKLQSSIPSRMKRKFSVLINTEGSL
         + K F E N D K+ S +PSRMKRK  V INTEGSL
Subjt:  LEVKLFDEVNSDKKLQSSIPSRMKRKFSVLINTEGSL

KAA0065608.1 retrotransposon gag protein [Cucumis melo var. makuwa]7.3e-3446.64Show/hide
Query:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLVTIKEKSK--------HQRKKDPKKLQPMRKRSKKFSQPQQLDLRLRSHQA
        KVDDP YCKYHRVI H +E+CFVLK+LILKLA + KIELD+DEVAQ+N   +   S          Q   +    + +   +   +   ++D    S++ 
Subjt:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLVTIKEKSK--------HQRKKDPKKLQPMRKRSKKFSQPQQLDLRLRSHQA

Query:  SNYS-------------------SLAFQRMSMAAIEEENQCSMSTSTRPSAFQRLSVSTSKKTRPSTSVFDRLKVTSGQSKRKMDNLEVKLFDEVNSDKK
         + S                   S  FQR+SMA  EEE QC  ST TR S F+RLS+STSKK RPSTS FDRLK+T+ Q +++M +L+ K F E N D K
Subjt:  SNYS-------------------SLAFQRMSMAAIEEENQCSMSTSTRPSAFQRLSVSTSKKTRPSTSVFDRLKVTSGQSKRKMDNLEVKLFDEVNSDKK

Query:  LQSSIPSRMKRKFSVLINTEGSL
        + S +PSR KRK SV INTEGSL
Subjt:  LQSSIPSRMKRKFSVLINTEGSL

KAA0065984.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]1.1e-3246.84Show/hide
Query:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLVTIKEKSKHQRKKDPKKLQPMRK--------RSKKFSQPQQLDLRLRSHQA
        KVDDP YCKYHRVI HPVE+CFVLK+LILKLA E KI+LD+DE  +               KD   LQP R         RS     P+++      H  
Subjt:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLVTIKEKSKHQRKKDPKKLQPMRK--------RSKKFSQPQQLDLRLRSHQA

Query:  S------NYS---------------------------SLAFQRMSMAAIEEENQCSMSTSTRPSAFQRLSVSTSKKTRPSTSVFDRLKVTSGQSKRKMDN
        S      NY                            SL FQR+SMA  EEENQC  ST TR SAF+RLS+STSKK RPSTS FDRLK+ + Q +R+M +
Subjt:  S------NYS---------------------------SLAFQRMSMAAIEEENQCSMSTSTRPSAFQRLSVSTSKKTRPSTSVFDRLKVTSGQSKRKMDN

Query:  LEVKLFDEVNSDKKLQSSIPSRMKRKFSVLINTEGSL
        L+ K F E N D K+ S IPS   RK SV IN EGSL
Subjt:  LEVKLFDEVNSDKKLQSSIPSRMKRKFSVLINTEGSL

TYK15207.1 Retrotransposon gag protein [Cucumis melo var. makuwa]1.1e-3246.19Show/hide
Query:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLVTIK-----EKSKHQRKKDPKKLQPMR--------KRSKKFSQPQQLD---
        KVDDP YCKYHRVI HPVE+CF+LK++ILKLA E+KIELD+ EVAQ+N V ++       S H      +++  +         +    ++  +++D   
Subjt:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLVTIK-----EKSKHQRKKDPKKLQPMR--------KRSKKFSQPQQLD---

Query:  -LRLRSH-----QASNYSSLAFQRMSMAAIEEENQCSMSTSTRPSAFQRLSVSTSKKTRPSTSVFDRLKVTSGQSKRKMDNLEVKLFDEVNSDKKLQSSI
         ++ R++     + S   S  FQR SMA  EE+NQC MSTST+ SAF+RLS+STSK+ RP TS FDRLK+T+ Q +R+M  L+ K F E N+D K+ + +
Subjt:  -LRLRSH-----QASNYSSLAFQRMSMAAIEEENQCSMSTSTRPSAFQRLSVSTSKKTRPSTSVFDRLKVTSGQSKRKMDNLEVKLFDEVNSDKKLQSSI

Query:  PSRMKRKFSVLINTE-GSLKFLL
        PS MKRK  V INTE GS+   +
Subjt:  PSRMKRKFSVLINTE-GSLKFLL

TrEMBL top hitse value%identityAlignment
A0A5A7URH1 Ty3-gypsy retrotransposon protein2.7e-3447.68Show/hide
Query:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLVTIKEKSKHQRKKDPKKLQPMRK--------RSKKFSQPQQLDLRLRSHQA
        KVDDP YCKYHRVI HPVE+CFVLK+LILKLA E+KIELD+DEVAQ+N   I+  S   + KD   LQ  R         RS     P+++      H A
Subjt:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLVTIKEKSKHQRKKDPKKLQPMRK--------RSKKFSQPQQLDLRLRSHQA

Query:  S------NYS---------------------------SLAFQRMSMAAIEEENQCSMSTSTRPSAFQRLSVSTSKKTRPSTSVFDRLKVTSGQSKRKMDN
        S      NY                            S  FQR+S+A  EEENQC     TR S  +RLS+ST KK RPSTS FDRLK+T+ Q +R+M +
Subjt:  S------NYS---------------------------SLAFQRMSMAAIEEENQCSMSTSTRPSAFQRLSVSTSKKTRPSTSVFDRLKVTSGQSKRKMDN

Query:  LEVKLFDEVNSDKKLQSSIPSRMKRKFSVLINTEGSL
         + K F E N D K+ S +PSRMKRK  V INTEGSL
Subjt:  LEVKLFDEVNSDKKLQSSIPSRMKRKFSVLINTEGSL

A0A5A7VFA5 Ty3-gypsy retrotransposon protein5.1e-3346.84Show/hide
Query:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLVTIKEKSKHQRKKDPKKLQPMRK--------RSKKFSQPQQLDLRLRSHQA
        KVDDP YCKYHRVI HPVE+CFVLK+LILKLA E KI+LD+DE  +               KD   LQP R         RS     P+++      H  
Subjt:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLVTIKEKSKHQRKKDPKKLQPMRK--------RSKKFSQPQQLDLRLRSHQA

Query:  S------NYS---------------------------SLAFQRMSMAAIEEENQCSMSTSTRPSAFQRLSVSTSKKTRPSTSVFDRLKVTSGQSKRKMDN
        S      NY                            SL FQR+SMA  EEENQC  ST TR SAF+RLS+STSKK RPSTS FDRLK+ + Q +R+M +
Subjt:  S------NYS---------------------------SLAFQRMSMAAIEEENQCSMSTSTRPSAFQRLSVSTSKKTRPSTSVFDRLKVTSGQSKRKMDN

Query:  LEVKLFDEVNSDKKLQSSIPSRMKRKFSVLINTEGSL
        L+ K F E N D K+ S IPS   RK SV IN EGSL
Subjt:  LEVKLFDEVNSDKKLQSSIPSRMKRKFSVLINTEGSL

A0A5D3CA53 Retrotransposon gag protein3.5e-3446.64Show/hide
Query:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLVTIKEKSK--------HQRKKDPKKLQPMRKRSKKFSQPQQLDLRLRSHQA
        KVDDP YCKYHRVI H +E+CFVLK+LILKLA + KIELD+DEVAQ+N   +   S          Q   +    + +   +   +   ++D    S++ 
Subjt:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLVTIKEKSK--------HQRKKDPKKLQPMRKRSKKFSQPQQLDLRLRSHQA

Query:  SNYS-------------------SLAFQRMSMAAIEEENQCSMSTSTRPSAFQRLSVSTSKKTRPSTSVFDRLKVTSGQSKRKMDNLEVKLFDEVNSDKK
         + S                   S  FQR+SMA  EEE QC  ST TR S F+RLS+STSKK RPSTS FDRLK+T+ Q +++M +L+ K F E N D K
Subjt:  SNYS-------------------SLAFQRMSMAAIEEENQCSMSTSTRPSAFQRLSVSTSKKTRPSTSVFDRLKVTSGQSKRKMDNLEVKLFDEVNSDKK

Query:  LQSSIPSRMKRKFSVLINTEGSL
        + S +PSR KRK SV INTEGSL
Subjt:  LQSSIPSRMKRKFSVLINTEGSL

A0A5D3CTF5 Retrotransposon gag protein5.1e-3346.19Show/hide
Query:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLVTIK-----EKSKHQRKKDPKKLQPMR--------KRSKKFSQPQQLD---
        KVDDP YCKYHRVI HPVE+CF+LK++ILKLA E+KIELD+ EVAQ+N V ++       S H      +++  +         +    ++  +++D   
Subjt:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLVTIK-----EKSKHQRKKDPKKLQPMR--------KRSKKFSQPQQLD---

Query:  -LRLRSH-----QASNYSSLAFQRMSMAAIEEENQCSMSTSTRPSAFQRLSVSTSKKTRPSTSVFDRLKVTSGQSKRKMDNLEVKLFDEVNSDKKLQSSI
         ++ R++     + S   S  FQR SMA  EE+NQC MSTST+ SAF+RLS+STSK+ RP TS FDRLK+T+ Q +R+M  L+ K F E N+D K+ + +
Subjt:  -LRLRSH-----QASNYSSLAFQRMSMAAIEEENQCSMSTSTRPSAFQRLSVSTSKKTRPSTSVFDRLKVTSGQSKRKMDNLEVKLFDEVNSDKKLQSSI

Query:  PSRMKRKFSVLINTE-GSLKFLL
        PS MKRK  V INTE GS+   +
Subjt:  PSRMKRKFSVLINTE-GSLKFLL

A0A5D3E4T1 Retrotransposon gag protein6.0e-3447.71Show/hide
Query:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLVTIKEKSK--------HQRKKDPKKLQPMR--------------KRSKKFS
        KVDD  YCKYH VI HPVE+CFVLK+LILKLA E+KI+LD+DE+AQ+N V ++  S          QR+K+  ++                  K     +
Subjt:  KVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLVTIKEKSK--------HQRKKDPKKLQPMR--------------KRSKKFS

Query:  QPQQLDLRLRSHQASNYSSLAFQRMSMAAIEEENQCSMSTSTRPSAFQRLSVSTSKKTRPSTSVFDRLKVTSGQSKRKMDNLEVKLFDEVNSDKKLQSSI
        + +Q        + S   S  FQR+SMA  EEENQ    TST+ SAF+RL +S SKK RPSTS FDRLK+T+ Q +++M  L+ K F E N+D K+ S +
Subjt:  QPQQLDLRLRSHQASNYSSLAFQRMSMAAIEEENQCSMSTSTRPSAFQRLSVSTSKKTRPSTSVFDRLKVTSGQSKRKMDNLEVKLFDEVNSDKKLQSSI

Query:  PSRMKRKFSVLINTEGSL
         SRMKRK SV INTEGSL
Subjt:  PSRMKRKFSVLINTEGSL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCGAGTTATTGGTCATCCAGTGGAAAGATGTTTCGTCCTAAAGGACTTAATTCTAAAGCTGGCTATGGAAAG
AAAAATTGAGCTCGACCTTGATGAAGTAGCTCAATCAAATCTTGTTACAATCAAAGAAAAGAGCAAACATCAAAGAAAGAAGGATCCTAAGAAACTTCAACCCATGAGGA
AGAGAAGTAAAAAGTTTTCTCAACCTCAACAACTGGACCTCCGTCTTCGATCGCATCAAGCCTCCAACTACTCGTCCTTGGCATTCCAAAGAATGAGTATGGCCGCGATA
GAGGAAGAAAATCAATGTTCGATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCTACATCGAAGAAAACTCGACCTTCAACATCTGTTTTTGATCG
CCTCAAAGTAACAAGCGGTCAATCTAAAAGAAAGATGGATAACTTGGAGGTGAAACTTTTTGATGAAGTAAACAGCGATAAAAAGCTTCAAAGTAGCATCCCGTCACGTA
TGAAGAGGAAGTTCTCTGTTCTCATAAATACAGAAGGTTCCTTGAAGTTCCTTCTCTCCAAGTTCGAGGGTCCTTACACTGTACGCTATTGCGTTGTTCCTTCTCCAAGT
TCGAAGGTTCTTCGTTGTATCCTGCTGCGTTGTTCCTTCTCCAAGTTCGAGGGTTCTCAGTTGTACGACTGCTACGTTGTTCCTCCTCCAAGTGTGAAGGATCTTATGTG
GTGCGTTGTTGCATTGTTCCCTCTTCTCTCAAGTTCGATGGTTCTCACGCAGCTTTGCTGGAGTTTCTTCTCCCCAAGTTCGAAGGTTCTCACGCGCTCCGTTGCAGTTC
CTTCTTTTCAAGGTCGAAGGTTCTCACTCGCTGCGTTGCAGTTCTTTCTCCCCAAGTTCGAAGGTTCACGCACTTCGCTGCAGTTCCTTCTCCCAAATTCGAAGGTTCTC
TCGCGCTTCGCTGCAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACATCGCTTCGCTTCGCGCTGCG
CTTCGTTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACTCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGTTCTTTCCTCCAAGTTCGAAGGTTCTCATGCGCTT
CGTTGCTCCTTCCTCCAAGTTCGAAGGTTCTCCACGCGCTGCTGCAGCTCCTTCCTCCAAGTTCGAAGGTTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAG
GCGCCTCTCTCCACTGTTCCTTCCTCCAAGTTCGAAGGTTCTCATGCGCTTCATTGCTACCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCTGCTGCAGCTCCTTCCTC
CAAGTTCGAAGGTTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTACTCCT
TCTCCAAGTTCGAAGGTGTTCTCTCCGCTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCTACTGCTCCTTCT
CCAAGTTCGAAGGTGCTTCTCTCCACCCCTCTTTTTGAAGGTTCGCCACTGAGGTTCTCCTTCTCCAAGTTCGAAGGTTCACCGTTGCTCCTTTTCAAATGTTTGGCGGC
GGTTGACGTCCTCGTTCCGCTTCATCTTCAAATGTTGGCAGTTGACGGCGTTCGCTTCGCTTCATCTTCAAAAATTGATTGTGGTGAAATCACTGCAAGTGAAAAGCTGA
TGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGACAGGTGGTGAAATCACTGCAAGTGAAAAGCTGACGACGACCGTGGTG
ACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACTATGGTGACCACCCCTGCAGGAAACTA
CAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGCAGCTGATGACGATCGTGGTGACCACCCCTGCAGGAAACTACAGTCATGAAAGTGACTGGT
CTAGACAGAGGAGGCCACCATTCATTCATGGGGGATTCAGATTCGGAGATTCAGATCAGAGAACTCAGAGTCCAGAGAATTCTGCCAGAGTCCAAAGTCATCAGAAGTCA
GAGAGTCTAGAGAATTCAGAAGATCCAAGATTCAGAATTCAACCAACTCAAGACTCAGAAGATCATCAAGCCAATCGACCAATCAAGAAGATCAACAAGTCAGTAGGTCG
ATCATCCAAGAGGATCAACAAGCTAACAAGTCGATCCAACAGATCATCAAGCCAACAGGCTGATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGT
CAGCAGGCCGATCATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGTCGATCATCCAAGAAGATCAACAAGCCAACCGACCGATCAAG
AAGATCAACAAGTCAGCAGGCCGATCATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAAGATCAACAAGCCAA
CCGATCGAACAGATCATCAAGCCAACAGGCCGATCCAAGAGATCATCAAGTCAGCAGGCCGATCATCCAAGAGGATCAACAAGCTAACAAGCCGATCCAATAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCGAGTTATTGGTCATCCAGTGGAAAGATGTTTCGTCCTAAAGGACTTAATTCTAAAGCTGGCTATGGAAAG
AAAAATTGAGCTCGACCTTGATGAAGTAGCTCAATCAAATCTTGTTACAATCAAAGAAAAGAGCAAACATCAAAGAAAGAAGGATCCTAAGAAACTTCAACCCATGAGGA
AGAGAAGTAAAAAGTTTTCTCAACCTCAACAACTGGACCTCCGTCTTCGATCGCATCAAGCCTCCAACTACTCGTCCTTGGCATTCCAAAGAATGAGTATGGCCGCGATA
GAGGAAGAAAATCAATGTTCGATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCTACATCGAAGAAAACTCGACCTTCAACATCTGTTTTTGATCG
CCTCAAAGTAACAAGCGGTCAATCTAAAAGAAAGATGGATAACTTGGAGGTGAAACTTTTTGATGAAGTAAACAGCGATAAAAAGCTTCAAAGTAGCATCCCGTCACGTA
TGAAGAGGAAGTTCTCTGTTCTCATAAATACAGAAGGTTCCTTGAAGTTCCTTCTCTCCAAGTTCGAGGGTCCTTACACTGTACGCTATTGCGTTGTTCCTTCTCCAAGT
TCGAAGGTTCTTCGTTGTATCCTGCTGCGTTGTTCCTTCTCCAAGTTCGAGGGTTCTCAGTTGTACGACTGCTACGTTGTTCCTCCTCCAAGTGTGAAGGATCTTATGTG
GTGCGTTGTTGCATTGTTCCCTCTTCTCTCAAGTTCGATGGTTCTCACGCAGCTTTGCTGGAGTTTCTTCTCCCCAAGTTCGAAGGTTCTCACGCGCTCCGTTGCAGTTC
CTTCTTTTCAAGGTCGAAGGTTCTCACTCGCTGCGTTGCAGTTCTTTCTCCCCAAGTTCGAAGGTTCACGCACTTCGCTGCAGTTCCTTCTCCCAAATTCGAAGGTTCTC
TCGCGCTTCGCTGCAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACATCGCTTCGCTTCGCGCTGCG
CTTCGTTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACTCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGTTCTTTCCTCCAAGTTCGAAGGTTCTCATGCGCTT
CGTTGCTCCTTCCTCCAAGTTCGAAGGTTCTCCACGCGCTGCTGCAGCTCCTTCCTCCAAGTTCGAAGGTTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAG
GCGCCTCTCTCCACTGTTCCTTCCTCCAAGTTCGAAGGTTCTCATGCGCTTCATTGCTACCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCTGCTGCAGCTCCTTCCTC
CAAGTTCGAAGGTTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTACTCCT
TCTCCAAGTTCGAAGGTGTTCTCTCCGCTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCTACTGCTCCTTCT
CCAAGTTCGAAGGTGCTTCTCTCCACCCCTCTTTTTGAAGGTTCGCCACTGAGGTTCTCCTTCTCCAAGTTCGAAGGTTCACCGTTGCTCCTTTTCAAATGTTTGGCGGC
GGTTGACGTCCTCGTTCCGCTTCATCTTCAAATGTTGGCAGTTGACGGCGTTCGCTTCGCTTCATCTTCAAAAATTGATTGTGGTGAAATCACTGCAAGTGAAAAGCTGA
TGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGACAGGTGGTGAAATCACTGCAAGTGAAAAGCTGACGACGACCGTGGTG
ACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACTATGGTGACCACCCCTGCAGGAAACTA
CAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGCAGCTGATGACGATCGTGGTGACCACCCCTGCAGGAAACTACAGTCATGAAAGTGACTGGT
CTAGACAGAGGAGGCCACCATTCATTCATGGGGGATTCAGATTCGGAGATTCAGATCAGAGAACTCAGAGTCCAGAGAATTCTGCCAGAGTCCAAAGTCATCAGAAGTCA
GAGAGTCTAGAGAATTCAGAAGATCCAAGATTCAGAATTCAACCAACTCAAGACTCAGAAGATCATCAAGCCAATCGACCAATCAAGAAGATCAACAAGTCAGTAGGTCG
ATCATCCAAGAGGATCAACAAGCTAACAAGTCGATCCAACAGATCATCAAGCCAACAGGCTGATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGT
CAGCAGGCCGATCATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGTCGATCATCCAAGAAGATCAACAAGCCAACCGACCGATCAAG
AAGATCAACAAGTCAGCAGGCCGATCATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAAGATCAACAAGCCAA
CCGATCGAACAGATCATCAAGCCAACAGGCCGATCCAAGAGATCATCAAGTCAGCAGGCCGATCATCCAAGAGGATCAACAAGCTAACAAGCCGATCCAATAG
Protein sequenceShow/hide protein sequence
MEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLVTIKEKSKHQRKKDPKKLQPMRKRSKKFSQPQQLDLRLRSHQASNYSSLAFQRMSMAAI
EEENQCSMSTSTRPSAFQRLSVSTSKKTRPSTSVFDRLKVTSGQSKRKMDNLEVKLFDEVNSDKKLQSSIPSRMKRKFSVLINTEGSLKFLLSKFEGPYTVRYCVVPSPS
SKVLRCILLRCSFSKFEGSQLYDCYVVPPPSVKDLMWCVVALFPLLSSSMVLTQLCWSFFSPSSKVLTRSVAVPSFQGRRFSLAALQFFLPKFEGSRTSLQFLLPNSKVL
SRFAAVPSPKFEGSHVASLQFLPPSLKVLTSLRFALRFVAVPSSKFEGSHSLRCSSFLQVRSSFLQVRRFSCASLLLPPSSKVLHALLQLLPPSSKVPSRASLAPSPSSK
APLSTVPSSKFEGSHALHCYLPPSSKVLSRAAAAPSSKFEGSLTRFARSFSKFEGASLHCSFSKFEGASLHYSFSKFEGVLSAAPSPSSKALLSTAPSPSSKALLSTAPS
PSSKVLLSTPLFEGSPLRFSFSKFEGSPLLLFKCLAAVDVLVPLHLQMLAVDGVRFASSSKIDCGEITASEKLMTTVVTTPAGNYSHQSDWSRQTGGEITASEKLTTTVV
TTPAGNYSHQSDWSRQVVKSLQVKLMTTMVTTPAGNYSHQSDWSRQVVKSLQVQLMTIVVTTPAGNYSHESDWSRQRRPPFIHGGFRFGDSDQRTQSPENSARVQSHQKS
ESLENSEDPRFRIQPTQDSEDHQANRPIKKINKSVGRSSKRINKLTSRSNRSSSQQADPRDQQANRPIKKINKSAGRSSKRSTSQPTDQEDQQVSRSIIQEDQQANRPIK
KINKSAGRSSKRSTSQPTDQEDQQVSRPIIQEDQQANRSNRSSSQQADPRDHQVSRPIIQEDQQANKPIQ