; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0015395 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0015395
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionGag/pol protein
Genome locationchr12:12141268..12150871
RNA-Seq ExpressionLag0015395
SyntenyLag0015395
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]5.9e-7988.37Show/hide
Query:  KYLRRTRDYVLVYGSKDLVLIGYTDSDFQTDKDSRKSTSGSVFTLNGGATVWCSIKQGCIADSTMEAEYVAACEVAKEAVWLRKFLTDLEVILNMHLPIT
        KYLRRTRDY+LVYG+KDL+L GYTDSDFQTDKDSRKSTSGSVFTLNGGA VW SIKQGCIADSTMEAEYVAACE AKEAVWLRKFL DLEV+ NM+LPIT
Subjt:  KYLRRTRDYVLVYGSKDLVLIGYTDSDFQTDKDSRKSTSGSVFTLNGGATVWCSIKQGCIADSTMEAEYVAACEVAKEAVWLRKFLTDLEVILNMHLPIT

Query:  LYCDNSGAVANSKESCSHKRGKHIERKYHLIKEIVHRGDVIVTKIASEHNIVDPFTMALTAKVFESHLEGLG
        LYCDNSGAVANSKE  SHKRGKHIERKYHLI+EIV RGDVIVTKIASEHNI DPFT  LTAKVFE HLE LG
Subjt:  LYCDNSGAVANSKESCSHKRGKHIERKYHLIKEIVHRGDVIVTKIASEHNIVDPFTMALTAKVFESHLEGLG

KAA0042496.1 gag/pol protein [Cucumis melo var. makuwa]5.9e-7988.37Show/hide
Query:  KYLRRTRDYVLVYGSKDLVLIGYTDSDFQTDKDSRKSTSGSVFTLNGGATVWCSIKQGCIADSTMEAEYVAACEVAKEAVWLRKFLTDLEVILNMHLPIT
        KYLRRTRDY+LVYG+KDL+L GYTDSDFQTDKDSRKSTSGSVFTLNGGA VW SIKQGCIADSTMEAEYVAACE AKEAVWLRKFL DLEV+ NM+LPIT
Subjt:  KYLRRTRDYVLVYGSKDLVLIGYTDSDFQTDKDSRKSTSGSVFTLNGGATVWCSIKQGCIADSTMEAEYVAACEVAKEAVWLRKFLTDLEVILNMHLPIT

Query:  LYCDNSGAVANSKESCSHKRGKHIERKYHLIKEIVHRGDVIVTKIASEHNIVDPFTMALTAKVFESHLEGLG
        LYCDNSGAVANSKE  SHKRGKHIERKYHLI+EIV RGDVIVTKIASEHNI DPFT  LTAKVFE HLE LG
Subjt:  LYCDNSGAVANSKESCSHKRGKHIERKYHLIKEIVHRGDVIVTKIASEHNIVDPFTMALTAKVFESHLEGLG

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]5.9e-7988.37Show/hide
Query:  KYLRRTRDYVLVYGSKDLVLIGYTDSDFQTDKDSRKSTSGSVFTLNGGATVWCSIKQGCIADSTMEAEYVAACEVAKEAVWLRKFLTDLEVILNMHLPIT
        KYLRRTRDY+LVYG+KDL+L GYTDSDFQTDKDSRKSTSGSVFTLNGGA VW SIKQGCIADSTMEAEYVAACE AKEAVWLRKFL DLEV+ NM+LPIT
Subjt:  KYLRRTRDYVLVYGSKDLVLIGYTDSDFQTDKDSRKSTSGSVFTLNGGATVWCSIKQGCIADSTMEAEYVAACEVAKEAVWLRKFLTDLEVILNMHLPIT

Query:  LYCDNSGAVANSKESCSHKRGKHIERKYHLIKEIVHRGDVIVTKIASEHNIVDPFTMALTAKVFESHLEGLG
        LYCDNSGAVANSKE  SHKRGKHIERKYHLI+EIV RGDVIVTKIASEHNI DPFT  LTAKVFE HLE LG
Subjt:  LYCDNSGAVANSKESCSHKRGKHIERKYHLIKEIVHRGDVIVTKIASEHNIVDPFTMALTAKVFESHLEGLG

KAA0059678.1 gag/pol protein [Cucumis melo var. makuwa]1.3e-7887.21Show/hide
Query:  KYLRRTRDYVLVYGSKDLVLIGYTDSDFQTDKDSRKSTSGSVFTLNGGATVWCSIKQGCIADSTMEAEYVAACEVAKEAVWLRKFLTDLEVILNMHLPIT
        KYLRRTRDY+LVYG+KDL+L GYTDSDFQTDKDSRKSTS SVFTLNGGA VW SIKQGCIADSTMEAEYVAAC+ AKEAVWLRKFL DLEV+ N +LPIT
Subjt:  KYLRRTRDYVLVYGSKDLVLIGYTDSDFQTDKDSRKSTSGSVFTLNGGATVWCSIKQGCIADSTMEAEYVAACEVAKEAVWLRKFLTDLEVILNMHLPIT

Query:  LYCDNSGAVANSKESCSHKRGKHIERKYHLIKEIVHRGDVIVTKIASEHNIVDPFTMALTAKVFESHLEGLG
        LYCDNSGAVANSKE CSHKRGKHIERKYHLI+EIV RGDVIVTKIASEHNI DPFT  LTAKVFE HLE LG
Subjt:  LYCDNSGAVANSKESCSHKRGKHIERKYHLIKEIVHRGDVIVTKIASEHNIVDPFTMALTAKVFESHLEGLG

KAA0061170.1 gag/pol protein [Cucumis melo var. makuwa]6.5e-7887.79Show/hide
Query:  KYLRRTRDYVLVYGSKDLVLIGYTDSDFQTDKDSRKSTSGSVFTLNGGATVWCSIKQGCIADSTMEAEYVAACEVAKEAVWLRKFLTDLEVILNMHLPIT
        KYLRRTRDY+LVYG+KDL+L GYTDSDFQTDKDSRKSTSGSVFTLN GA VW SIKQGCIADSTMEAEYVAACE AKEAVWLRKFL DLEV+ NM+LPIT
Subjt:  KYLRRTRDYVLVYGSKDLVLIGYTDSDFQTDKDSRKSTSGSVFTLNGGATVWCSIKQGCIADSTMEAEYVAACEVAKEAVWLRKFLTDLEVILNMHLPIT

Query:  LYCDNSGAVANSKESCSHKRGKHIERKYHLIKEIVHRGDVIVTKIASEHNIVDPFTMALTAKVFESHLEGLG
        LYCDNSGAVANSKE  SHKRGKHIERKYHLI+EIV RGDVIVTKIASEHNI DPFT  LTAKVFE HLE LG
Subjt:  LYCDNSGAVANSKESCSHKRGKHIERKYHLIKEIVHRGDVIVTKIASEHNIVDPFTMALTAKVFESHLEGLG

TrEMBL top hitse value%identityAlignment
A0A5A7TKM4 Gag/pol protein2.8e-7988.37Show/hide
Query:  KYLRRTRDYVLVYGSKDLVLIGYTDSDFQTDKDSRKSTSGSVFTLNGGATVWCSIKQGCIADSTMEAEYVAACEVAKEAVWLRKFLTDLEVILNMHLPIT
        KYLRRTRDY+LVYG+KDL+L GYTDSDFQTDKDSRKSTSGSVFTLNGGA VW SIKQGCIADSTMEAEYVAACE AKEAVWLRKFL DLEV+ NM+LPIT
Subjt:  KYLRRTRDYVLVYGSKDLVLIGYTDSDFQTDKDSRKSTSGSVFTLNGGATVWCSIKQGCIADSTMEAEYVAACEVAKEAVWLRKFLTDLEVILNMHLPIT

Query:  LYCDNSGAVANSKESCSHKRGKHIERKYHLIKEIVHRGDVIVTKIASEHNIVDPFTMALTAKVFESHLEGLG
        LYCDNSGAVANSKE  SHKRGKHIERKYHLI+EIV RGDVIVTKIASEHNI DPFT  LTAKVFE HLE LG
Subjt:  LYCDNSGAVANSKESCSHKRGKHIERKYHLIKEIVHRGDVIVTKIASEHNIVDPFTMALTAKVFESHLEGLG

A0A5A7TZD0 Gag/pol protein2.8e-7988.37Show/hide
Query:  KYLRRTRDYVLVYGSKDLVLIGYTDSDFQTDKDSRKSTSGSVFTLNGGATVWCSIKQGCIADSTMEAEYVAACEVAKEAVWLRKFLTDLEVILNMHLPIT
        KYLRRTRDY+LVYG+KDL+L GYTDSDFQTDKDSRKSTSGSVFTLNGGA VW SIKQGCIADSTMEAEYVAACE AKEAVWLRKFL DLEV+ NM+LPIT
Subjt:  KYLRRTRDYVLVYGSKDLVLIGYTDSDFQTDKDSRKSTSGSVFTLNGGATVWCSIKQGCIADSTMEAEYVAACEVAKEAVWLRKFLTDLEVILNMHLPIT

Query:  LYCDNSGAVANSKESCSHKRGKHIERKYHLIKEIVHRGDVIVTKIASEHNIVDPFTMALTAKVFESHLEGLG
        LYCDNSGAVANSKE  SHKRGKHIERKYHLI+EIV RGDVIVTKIASEHNI DPFT  LTAKVFE HLE LG
Subjt:  LYCDNSGAVANSKESCSHKRGKHIERKYHLIKEIVHRGDVIVTKIASEHNIVDPFTMALTAKVFESHLEGLG

A0A5A7UWW4 Gag/pol protein6.3e-7987.21Show/hide
Query:  KYLRRTRDYVLVYGSKDLVLIGYTDSDFQTDKDSRKSTSGSVFTLNGGATVWCSIKQGCIADSTMEAEYVAACEVAKEAVWLRKFLTDLEVILNMHLPIT
        KYLRRTRDY+LVYG+KDL+L GYTDSDFQTDKDSRKSTS SVFTLNGGA VW SIKQGCIADSTMEAEYVAAC+ AKEAVWLRKFL DLEV+ N +LPIT
Subjt:  KYLRRTRDYVLVYGSKDLVLIGYTDSDFQTDKDSRKSTSGSVFTLNGGATVWCSIKQGCIADSTMEAEYVAACEVAKEAVWLRKFLTDLEVILNMHLPIT

Query:  LYCDNSGAVANSKESCSHKRGKHIERKYHLIKEIVHRGDVIVTKIASEHNIVDPFTMALTAKVFESHLEGLG
        LYCDNSGAVANSKE CSHKRGKHIERKYHLI+EIV RGDVIVTKIASEHNI DPFT  LTAKVFE HLE LG
Subjt:  LYCDNSGAVANSKESCSHKRGKHIERKYHLIKEIVHRGDVIVTKIASEHNIVDPFTMALTAKVFESHLEGLG

A0A5A7UYE8 Gag/pol protein2.8e-7988.37Show/hide
Query:  KYLRRTRDYVLVYGSKDLVLIGYTDSDFQTDKDSRKSTSGSVFTLNGGATVWCSIKQGCIADSTMEAEYVAACEVAKEAVWLRKFLTDLEVILNMHLPIT
        KYLRRTRDY+LVYG+KDL+L GYTDSDFQTDKDSRKSTSGSVFTLNGGA VW SIKQGCIADSTMEAEYVAACE AKEAVWLRKFL DLEV+ NM+LPIT
Subjt:  KYLRRTRDYVLVYGSKDLVLIGYTDSDFQTDKDSRKSTSGSVFTLNGGATVWCSIKQGCIADSTMEAEYVAACEVAKEAVWLRKFLTDLEVILNMHLPIT

Query:  LYCDNSGAVANSKESCSHKRGKHIERKYHLIKEIVHRGDVIVTKIASEHNIVDPFTMALTAKVFESHLEGLG
        LYCDNSGAVANSKE  SHKRGKHIERKYHLI+EIV RGDVIVTKIASEHNI DPFT  LTAKVFE HLE LG
Subjt:  LYCDNSGAVANSKESCSHKRGKHIERKYHLIKEIVHRGDVIVTKIASEHNIVDPFTMALTAKVFESHLEGLG

A0A5A7V1F5 Gag/pol protein3.1e-7887.79Show/hide
Query:  KYLRRTRDYVLVYGSKDLVLIGYTDSDFQTDKDSRKSTSGSVFTLNGGATVWCSIKQGCIADSTMEAEYVAACEVAKEAVWLRKFLTDLEVILNMHLPIT
        KYLRRTRDY+LVYG+KDL+L GYTDSDFQTDKDSRKSTSGSVFTLN GA VW SIKQGCIADSTMEAEYVAACE AKEAVWLRKFL DLEV+ NM+LPIT
Subjt:  KYLRRTRDYVLVYGSKDLVLIGYTDSDFQTDKDSRKSTSGSVFTLNGGATVWCSIKQGCIADSTMEAEYVAACEVAKEAVWLRKFLTDLEVILNMHLPIT

Query:  LYCDNSGAVANSKESCSHKRGKHIERKYHLIKEIVHRGDVIVTKIASEHNIVDPFTMALTAKVFESHLEGLG
        LYCDNSGAVANSKE  SHKRGKHIERKYHLI+EIV RGDVIVTKIASEHNI DPFT  LTAKVFE HLE LG
Subjt:  LYCDNSGAVANSKESCSHKRGKHIERKYHLIKEIVHRGDVIVTKIASEHNIVDPFTMALTAKVFESHLEGLG

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.2e-2035.03Show/hide
Query:  KYLRRTRDYVLVYGSKDLV----LIGYTDSDFQTDKDSRKSTSGSVFTL-NGGATVWCSIKQGCIADSTMEAEYVAACEVAKEAVWLRKFLTDLEVILNM
        +YL+ T D  L++  K+L     +IGY DSD+   +  RKST+G +F + +     W + +Q  +A S+ EAEY+A  E  +EA+WL+  LT + +   +
Subjt:  KYLRRTRDYVLVYGSKDLV----LIGYTDSDFQTDKDSRKSTSGSVFTL-NGGATVWCSIKQGCIADSTMEAEYVAACEVAKEAVWLRKFLTDLEVILNM

Query:  HLPITLYCDNSGAVANSKESCSHKRGKHIERKYHLIKEIVHRGDVIVTKIASEHNIVDPFTMALTAKVFESHLEGLG
          PI +Y DN G ++ +     HKR KHI+ KYH  +E V    + +  I +E+ + D FT  L A  F    + LG
Subjt:  HLPITLYCDNSGAVANSKESCSHKRGKHIERKYHLIKEIVHRGDVIVTKIASEHNIVDPFTMALTAKVFESHLEGLG

P0CV72 Secreted RxLR effector protein 1615.5e-1145.78Show/hide
Query:  KYLRRTRDYVLVYGSKDLV-LIGYTDSDFQTDKDSRKSTSGSVFTLNGGATVWCSIKQGCIADSTMEAEYVAACEVAKEAVWL
        +YL+ T+ Y L +       L+GY+D+D+  D +SR+STSG +F LNGG   W S KQ  +A S+ E EY+A  E  +EAVWL
Subjt:  KYLRRTRDYVLVYGSKDLV-LIGYTDSDFQTDKDSRKSTSGSVFTLNGGATVWCSIKQGCIADSTMEAEYVAACEVAKEAVWL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-949.9e-2941.86Show/hide
Query:  KYLRRTRDYVLVYGSKDLVLIGYTDSDFQTDKDSRKSTSGSVFTLNGGATVWCSIKQGCIADSTMEAEYVAACEVAKEAVWLRKFLTDLEVILNMHLPIT
        +YLR T    L +G  D +L GYTD+D   D D+RKS++G +FT +GGA  W S  Q C+A ST EAEY+AA E  KE +WL++FL +L +    ++   
Subjt:  KYLRRTRDYVLVYGSKDLVLIGYTDSDFQTDKDSRKSTSGSVFTLNGGATVWCSIKQGCIADSTMEAEYVAACEVAKEAVWLRKFLTDLEVILNMHLPIT

Query:  LYCDNSGAVANSKESCSHKRGKHIERKYHLIKEIVHRGDVIVTKIASEHNIVDPFTMALTAKVFESHLEGLG
        +YCD+  A+  SK S  H R KHI+ +YH I+E+V    + V KI++  N  D  T  +    FE   E +G
Subjt:  LYCDNSGAVANSKESCSHKRGKHIERKYHLIKEIVHRGDVIVTKIASEHNIVDPFTMALTAKVFESHLEGLG

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.9e-1434.59Show/hide
Query:  YLRRTRDYVLVYGSK-DLVLIGYTDSDFQTDKDSRKSTSGSVFTLNGGATVWCSIKQGCIADSTMEAEYVAACEVAKEAVWLRKFLTDLEVILNMHLPIT
        Y++ T    L Y S+ ++ L  ++D+ FQ+ KD+R+ST+G    L      W S KQ  ++ S+ EAEY A      E +WL +F  +L++ L+   P  
Subjt:  YLRRTRDYVLVYGSK-DLVLIGYTDSDFQTDKDSRKSTSGSVFTLNGGATVWCSIKQGCIADSTMEAEYVAACEVAKEAVWLRKFLTDLEVILNMHLPIT

Query:  LYCDNSGAVANSKESCSHKRGKHIERKYHLIKE
        L+CDN+ A+  +  +  H+R KHIE   H ++E
Subjt:  LYCDNSGAVANSKESCSHKRGKHIERKYHLIKE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTAAAAGATAAGGGTGCTCTGTTAGTTCCTTTTGATTTTGAAATTGAAAAGACTTGCAAAAAGAATAGGAAGGAGAAAAGGGAAAGACTTGTAGCAATGGCCAATCT
AAATCCACAAGAGGAGCTGAAACCGATACGGGACTATTTCCAGCCTACTTTTTCGGATCAACAATCTGGGATAGTCTACGTGCCAATAAATGTGAACAATTTCGAGCTCA
AAACTGGTCTCATCCAGATGGAGGCTAGAAATAGTCTGGATATTTGTGGAACTATGAAGATAAAGCGAGAGACTGGCTTCAATCACTTCCACCTGGAAGTATCACTACGT
GGGACGCGTACAGGAGGTGTACAATCTAATGAGACAGCTGTTCTTGCATCTCAGTCTCAAGAGGAGACCTCTAATGAACAGGTTCTTTATGTTGATAGAAATTCTAACTA
TAGGGGACACCATAACTCTACACCCACACATTACCACCCTAATGTTAGAAATCATGAAAATTTCTCGTATGCTAACAATAAGAATATCTTGAACCCTCCTGGATTTATAT
CTCAAAAGACTGAAAATAAACCTTCCCTTGAAGATATGGTTGGAGCTTTTATTGCAGAGTCCAGCAACAAAACCAACAAGCTGGAGGAAGCAGTGATAGCTATAAATACC
ACAGTAACAGGCCACAGTGCAACAATTAAAAACATAGAAACTCAATTGGGACAGTTGGTGAATGTTGTAAACACTATGCAGAAAGGTAAAACCCCAGCTGAGTCAGAGAA
AACCCAAATGGAGTATTGTAAGGCCATCACAGTGCATCATGTGGAGGAAGTTCAAGTAGCTGCGGCACAAAATATTAATGAACCAAAAGTCACTAAGGAAGAAGTTGAAG
AAGGTTCATCCTCAAACGAAGCTGAAAAGCTTAATTTTTACCCTCTTATACCCTCTCCTGCTGTTTTGGTTCCAAAGCTTAAGAAAAAGAAGAAAAAGAATTATTCAACT
CAATTGAAGAAGTTTCTTGATATCTTTATGAGTTTAAATATTAATTTACCTTTTGCAGAGGCTTTGGAGCAGATGCCCAAAAAGGTGCAAGCAAGAGCACCTCTTGTGAA
TCTCGTGACAGTAGACCTCCTTGAAGCATGCACGGAGCCCAACAAGTATCTAAGGAGAACAAGGGACTATGTGCTCGTGTATGGTTCCAAGGATTTGGTTCTTATAGGAT
ACACTGATTCTGACTTTCAAACTGATAAAGATTCGAGGAAATCTACATCGGGGTCAGTATTCACTCTGAATGGAGGAGCAACAGTATGGTGTAGCATCAAGCAAGGATGC
ATTGCAGACTCGACCATGGAGGCCGAATATGTAGCCGCTTGTGAAGTAGCAAAAGAGGCAGTATGGCTCAGAAAATTCTTGACTGACTTGGAAGTCATTCTGAATATGCA
TCTGCCTATCACCTTGTATTGTGATAATAGCGGCGCAGTTGCAAACTCAAAGGAATCTTGCAGCCATAAGAGAGGCAAGCACATCGAAAGGAAATATCATCTCATTAAGG
AGATTGTGCACCGAGGAGATGTGATTGTCACAAAGATAGCGTCGGAGCACAATATAGTTGATCCATTCACGATGGCTCTCACGGCTAAAGTGTTCGAGAGTCATCTAGAA
GGTCTAGGTGGACAGAAATATTCTACAGTGAGGGAAGTGCAACTACAGGGCTATAGTGGAGTGTCTCGACTCCCACAAGTCGTTCTAAGGTCGGAGGATAGCAGAGAAGA
CACTAGAGGTGGTCCACACACCGTTCGTTATCAAAGGAATCAAAAGGAGTTGCTGAATCAGTCTCCAAAAGCGATCTCAGCATCAATCTGCATATCTCGAAAAGAGGTTA
AGGGAACTGAGGCTGAGCCCAAGGGTAGCTCTCCTCTTGTGACCTCTACAAGTTCTGTGGTCAAAAAGGCCCAAGCCATTGGGCCTGGGGTCGACCTCAACAATGGGCCA
AGGCCAGGGATTGGGCTCAAACCCAATCTCCTTGGGCTTTTCTTCTTCTGGGTCACCGCTGACCTCCACCGGTTGGCCACCATCGTTGACCTCCGACTATCGATCGTCGC
CGACCTCCACCAACCAGCCGCCACTGTCAACCTCCAACTATCGGTCGTCGCCGACCGACCGCCTCCGTCGACCTTCGACTACCGGTCGCCGCCGACCTCTGCCAACCGAC
TGCCACCATCGACCTCGGACTACCGACTGTCACTGACTTCCGTCGACCAATGA
mRNA sequenceShow/hide mRNA sequence
ATGTTAAAAGATAAGGGTGCTCTGTTAGTTCCTTTTGATTTTGAAATTGAAAAGACTTGCAAAAAGAATAGGAAGGAGAAAAGGGAAAGACTTGTAGCAATGGCCAATCT
AAATCCACAAGAGGAGCTGAAACCGATACGGGACTATTTCCAGCCTACTTTTTCGGATCAACAATCTGGGATAGTCTACGTGCCAATAAATGTGAACAATTTCGAGCTCA
AAACTGGTCTCATCCAGATGGAGGCTAGAAATAGTCTGGATATTTGTGGAACTATGAAGATAAAGCGAGAGACTGGCTTCAATCACTTCCACCTGGAAGTATCACTACGT
GGGACGCGTACAGGAGGTGTACAATCTAATGAGACAGCTGTTCTTGCATCTCAGTCTCAAGAGGAGACCTCTAATGAACAGGTTCTTTATGTTGATAGAAATTCTAACTA
TAGGGGACACCATAACTCTACACCCACACATTACCACCCTAATGTTAGAAATCATGAAAATTTCTCGTATGCTAACAATAAGAATATCTTGAACCCTCCTGGATTTATAT
CTCAAAAGACTGAAAATAAACCTTCCCTTGAAGATATGGTTGGAGCTTTTATTGCAGAGTCCAGCAACAAAACCAACAAGCTGGAGGAAGCAGTGATAGCTATAAATACC
ACAGTAACAGGCCACAGTGCAACAATTAAAAACATAGAAACTCAATTGGGACAGTTGGTGAATGTTGTAAACACTATGCAGAAAGGTAAAACCCCAGCTGAGTCAGAGAA
AACCCAAATGGAGTATTGTAAGGCCATCACAGTGCATCATGTGGAGGAAGTTCAAGTAGCTGCGGCACAAAATATTAATGAACCAAAAGTCACTAAGGAAGAAGTTGAAG
AAGGTTCATCCTCAAACGAAGCTGAAAAGCTTAATTTTTACCCTCTTATACCCTCTCCTGCTGTTTTGGTTCCAAAGCTTAAGAAAAAGAAGAAAAAGAATTATTCAACT
CAATTGAAGAAGTTTCTTGATATCTTTATGAGTTTAAATATTAATTTACCTTTTGCAGAGGCTTTGGAGCAGATGCCCAAAAAGGTGCAAGCAAGAGCACCTCTTGTGAA
TCTCGTGACAGTAGACCTCCTTGAAGCATGCACGGAGCCCAACAAGTATCTAAGGAGAACAAGGGACTATGTGCTCGTGTATGGTTCCAAGGATTTGGTTCTTATAGGAT
ACACTGATTCTGACTTTCAAACTGATAAAGATTCGAGGAAATCTACATCGGGGTCAGTATTCACTCTGAATGGAGGAGCAACAGTATGGTGTAGCATCAAGCAAGGATGC
ATTGCAGACTCGACCATGGAGGCCGAATATGTAGCCGCTTGTGAAGTAGCAAAAGAGGCAGTATGGCTCAGAAAATTCTTGACTGACTTGGAAGTCATTCTGAATATGCA
TCTGCCTATCACCTTGTATTGTGATAATAGCGGCGCAGTTGCAAACTCAAAGGAATCTTGCAGCCATAAGAGAGGCAAGCACATCGAAAGGAAATATCATCTCATTAAGG
AGATTGTGCACCGAGGAGATGTGATTGTCACAAAGATAGCGTCGGAGCACAATATAGTTGATCCATTCACGATGGCTCTCACGGCTAAAGTGTTCGAGAGTCATCTAGAA
GGTCTAGGTGGACAGAAATATTCTACAGTGAGGGAAGTGCAACTACAGGGCTATAGTGGAGTGTCTCGACTCCCACAAGTCGTTCTAAGGTCGGAGGATAGCAGAGAAGA
CACTAGAGGTGGTCCACACACCGTTCGTTATCAAAGGAATCAAAAGGAGTTGCTGAATCAGTCTCCAAAAGCGATCTCAGCATCAATCTGCATATCTCGAAAAGAGGTTA
AGGGAACTGAGGCTGAGCCCAAGGGTAGCTCTCCTCTTGTGACCTCTACAAGTTCTGTGGTCAAAAAGGCCCAAGCCATTGGGCCTGGGGTCGACCTCAACAATGGGCCA
AGGCCAGGGATTGGGCTCAAACCCAATCTCCTTGGGCTTTTCTTCTTCTGGGTCACCGCTGACCTCCACCGGTTGGCCACCATCGTTGACCTCCGACTATCGATCGTCGC
CGACCTCCACCAACCAGCCGCCACTGTCAACCTCCAACTATCGGTCGTCGCCGACCGACCGCCTCCGTCGACCTTCGACTACCGGTCGCCGCCGACCTCTGCCAACCGAC
TGCCACCATCGACCTCGGACTACCGACTGTCACTGACTTCCGTCGACCAATGA
Protein sequenceShow/hide protein sequence
MLKDKGALLVPFDFEIEKTCKKNRKEKRERLVAMANLNPQEELKPIRDYFQPTFSDQQSGIVYVPINVNNFELKTGLIQMEARNSLDICGTMKIKRETGFNHFHLEVSLR
GTRTGGVQSNETAVLASQSQEETSNEQVLYVDRNSNYRGHHNSTPTHYHPNVRNHENFSYANNKNILNPPGFISQKTENKPSLEDMVGAFIAESSNKTNKLEEAVIAINT
TVTGHSATIKNIETQLGQLVNVVNTMQKGKTPAESEKTQMEYCKAITVHHVEEVQVAAAQNINEPKVTKEEVEEGSSSNEAEKLNFYPLIPSPAVLVPKLKKKKKKNYST
QLKKFLDIFMSLNINLPFAEALEQMPKKVQARAPLVNLVTVDLLEACTEPNKYLRRTRDYVLVYGSKDLVLIGYTDSDFQTDKDSRKSTSGSVFTLNGGATVWCSIKQGC
IADSTMEAEYVAACEVAKEAVWLRKFLTDLEVILNMHLPITLYCDNSGAVANSKESCSHKRGKHIERKYHLIKEIVHRGDVIVTKIASEHNIVDPFTMALTAKVFESHLE
GLGGQKYSTVREVQLQGYSGVSRLPQVVLRSEDSREDTRGGPHTVRYQRNQKELLNQSPKAISASICISRKEVKGTEAEPKGSSPLVTSTSSVVKKAQAIGPGVDLNNGP
RPGIGLKPNLLGLFFFWVTADLHRLATIVDLRLSIVADLHQPAATVNLQLSVVADRPPPSTFDYRSPPTSANRLPPSTSDYRLSLTSVDQ