CuGenDBv2

Lag0007836 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0007836
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrotrans_gag domain-containing protein
Genome location: chr9:5953825..5961879
RNA-Seq Expression: Lag0007836
Synteny: Lag0007836
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
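
The genome location above follows a `seqid:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this), the string can be split into its parts and the genomic span computed. The names `Region` and `parse_location` are illustrative and not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Region(NamedTuple):
    seqid: str
    start: int
    end: int


def parse_location(text: str) -> Region:
    """Parse a 'seqid:start..end' string such as 'chr9:5953825..5961879'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return Region(seqid, start, end)


region = parse_location("chr9:5953825..5961879")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span_length = region.end - region.start + 1
print(region.seqid, region.start, region.end, span_length)
```

Under the inclusive-coordinate assumption the gene spans 8055 bp on chr9; with half-open coordinates the figure would be one less.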


Homology
GenBank top hits (e value, % identity, alignment)
XP_022926214.1 uncharacterized protein LOC111433394 [Cucurbita moschata] (e value: 3.9e-61; % identity: 51.37)
Query:  IRDGAKAWLNSFAPGSIRTWNELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQGMVDASAG
        +RDGAK+WLN+ A G+I +WN L E FL KYFPP RNA+ R+EIV F+Q ED+T SEAWERFKE+LRKCPHHGLPHCIQMETFYNGLN AT+ +VDASA 
Subjt:  IRDGAKAWLNSFAPGSIRTWNELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQGMVDASAG

Query:  GALLAKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPA-VEPAALVNQVQRKHVSIVVKITT----
        GA+L+KT+NEA+EILERI++N+CQW+DVR    +K + VLEVD +S+I A LA + N L+N+ +       A V   A++NQ   +      +  T    
Subjt:  GALLAKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPA-VEPAALVNQVQRKHVSIVVKITT----

Query:  -----------------------------TSWRNHPNFSWGGQGS-NVQAQQKVN
                                       WRNHPNFSW GQGS N Q   K N
Subjt:  -----------------------------TSWRNHPNFSWGGQGS-NVQAQQKVN

XP_022929949.1 uncharacterized protein LOC111436411 [Cucurbita moschata] (e value: 5.1e-61; % identity: 51.34)
Query:  IIRDGAKAWLNSFAPGSIRTWNELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQGMVDASA
        ++RDGAK+WLN+ APG+I +WN LAENFL KYFPP RNA+ ++EIV F+Q EDET SEA ERFKE+LRKCPHHGLPHCIQMETFYNGLN  T+ +VDASA
Subjt:  IIRDGAKAWLNSFAPGSIRTWNELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQGMVDASA

Query:  GGALLAKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPA-VEPAALVNQVQRKHVSIVVKITT---
         GA+L+KT+NEA+EILERI++N+CQW+DVR    +K + VLEVD +S+I A LA + N L+N+ +       A V  AA +NQ   +      +  T   
Subjt:  GGALLAKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPA-VEPAALVNQVQRKHVSIVVKITT---

Query:  ------------------------------TSWRNHPNFSWGGQG-SNVQAQQKVN-QSGF
                                        WRNHPNFSW GQ   N Q   K N  SGF
Subjt:  ------------------------------TSWRNHPNFSWGGQG-SNVQAQQKVN-QSGF

XP_022960432.1 uncharacterized protein LOC111461168 [Cucurbita moschata] (e value: 2.7e-62; % identity: 54.29)
Query:  IRDGAKAWLNSFAPGSIRTWNELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQGMVDASAG
        +RDGAK+WLN+ AP +I +WN LAE FL KYFPP RNA+ R+EIV F+Q EDET SEAWERFKE+LRKCPHHGLPHCIQMETFYNGLN AT+ +VDASA 
Subjt:  IRDGAKAWLNSFAPGSIRTWNELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQGMVDASAG

Query:  GALLAKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADLAMIANALKNV-----TVI---SHQQPPAVEPAA----------LVNQV
        GA+L+KT+NEA+EILERI++N+CQW+DVR    KK + VLEVD +S+I A LA + N L+N+     T+I   +H      + A             +Q 
Subjt:  GALLAKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADLAMIANALKNV-----TVI---SHQQPPAVEPAA----------LVNQV

Query:  QRKHVSIVVKITTTS----------------WRNHPNFSWGGQGS
         R   SI+      S                WRNHPNFSW GQGS
Subjt:  QRKHVSIVVKITTTS----------------WRNHPNFSWGGQGS

XP_030503898.1 uncharacterized protein LOC115719117 [Cannabis sativa] (e value: 1.2e-62; % identity: 45.16)
Query:  LDPEIERTFRRRRREQ--RRNQNQMDNVSRLPQGPEDPATP--------------RIVCCSKTSRWSKMSSKI------IRDGAKAWLNSFAPGSIRTWN
        +DPEIERTFR+RR+EQ  +++ N  D       G  + A P               +   ++ +R S+ + ++      +RD A+AWLN+  P S+  WN
Subjt:  LDPEIERTFRRRRREQ--RRNQNQMDNVSRLPQGPEDPATP--------------RIVCCSKTSRWSKMSSKI------IRDGAKAWLNSFAPGSIRTWN

Query:  ELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQGMVDASAGGALLAKTFNEAHEILERISTN
        +LAE FL KYFPP RNAK RSEI+ F+QLEDET S+AWERFKELLRKCPHHG+PHCIQ+ETFYNGLN A++ ++DASA GA+L+K++NEA EILERI++N
Subjt:  ELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQGMVDASAGGALLAKTFNEAHEILERISTN

Query:  SCQWSDVRG-TNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAVEPAALV-------NQVQRKHVSIVVKITTTSWRNHPNFSWGGQGSNVQ
        + QWS  R  T++KV  VLEVD ++ + A +A + N LKN+ +    QP       +        NQ   ++ +        +W++HPNFSWGGQG++  
Subjt:  SCQWSDVRG-TNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAVEPAALV-------NQVQRKHVSIVVKITTTSWRNHPNFSWGGQGSNVQ

Query:  AQQKVNQSGF
          Q   +  F
Subjt:  AQQKVNQSGF

XP_030510138.1 uncharacterized protein LOC115724905 [Cannabis sativa] (e value: 2.1e-59; % identity: 49.4)
Query:  IRDGAKAWLNSFAPGSIRTWNELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQGMVDASAG
        +RD A+AWLN+    S+  WN+LAENFL KYFPP RNAK RSEI+ F+QLEDET S+AWERFKELLRKCPHHG+PHCIQ+ETFYNGLN A++ ++DASA 
Subjt:  IRDGAKAWLNSFAPGSIRTWNELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQGMVDASAG

Query:  GALLAKTFNEAHEILERISTNSCQWSDVRG-TNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAV----------------------EPAAL
        GA+L+K++NEA EILERI++N+ QWS  R  T++KV  VLEVD ++ + A +A + N LKN+ +    QP A                        PA++
Subjt:  GALLAKTFNEAHEILERISTNSCQWSDVRG-TNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAV----------------------EPAAL

Query:  V---NQVQRKHVSIVVKITTTSWRNHPNFSWGGQGSNVQAQQKVNQSGF
            NQ   ++ +        +W++HPNFSWGGQG++    Q   +  F
Subjt:  V---NQVQRKHVSIVVKITTTSWRNHPNFSWGGQGSNVQAQQKVNQSGF

TrEMBL top hits (e value, % identity, alignment)
A0A6J1EEI2 uncharacterized protein LOC111433394 (e value: 1.9e-61; % identity: 51.37)
Query:  IRDGAKAWLNSFAPGSIRTWNELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQGMVDASAG
        +RDGAK+WLN+ A G+I +WN L E FL KYFPP RNA+ R+EIV F+Q ED+T SEAWERFKE+LRKCPHHGLPHCIQMETFYNGLN AT+ +VDASA 
Subjt:  IRDGAKAWLNSFAPGSIRTWNELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQGMVDASAG

Query:  GALLAKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPA-VEPAALVNQVQRKHVSIVVKITT----
        GA+L+KT+NEA+EILERI++N+CQW+DVR    +K + VLEVD +S+I A LA + N L+N+ +       A V   A++NQ   +      +  T    
Subjt:  GALLAKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPA-VEPAALVNQVQRKHVSIVVKITT----

Query:  -----------------------------TSWRNHPNFSWGGQGS-NVQAQQKVN
                                       WRNHPNFSW GQGS N Q   K N
Subjt:  -----------------------------TSWRNHPNFSWGGQGS-NVQAQQKVN

A0A6J1EQ90 uncharacterized protein LOC111436411 (e value: 2.5e-61; % identity: 51.34)
Query:  IIRDGAKAWLNSFAPGSIRTWNELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQGMVDASA
        ++RDGAK+WLN+ APG+I +WN LAENFL KYFPP RNA+ ++EIV F+Q EDET SEA ERFKE+LRKCPHHGLPHCIQMETFYNGLN  T+ +VDASA
Subjt:  IIRDGAKAWLNSFAPGSIRTWNELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQGMVDASA

Query:  GGALLAKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPA-VEPAALVNQVQRKHVSIVVKITT---
         GA+L+KT+NEA+EILERI++N+CQW+DVR    +K + VLEVD +S+I A LA + N L+N+ +       A V  AA +NQ   +      +  T   
Subjt:  GGALLAKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPA-VEPAALVNQVQRKHVSIVVKITT---

Query:  ------------------------------TSWRNHPNFSWGGQG-SNVQAQQKVN-QSGF
                                        WRNHPNFSW GQ   N Q   K N  SGF
Subjt:  ------------------------------TSWRNHPNFSWGGQG-SNVQAQQKVN-QSGF

A0A6J1G7Q6 uncharacterized protein LOC111451598 (e value: 4.1e-56; % identity: 50.78)
Query:  IRDGAKAWLNSFAPGSIRTWNELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQGMVDASAG
        +RDGAK+WLN  A G I +WN LAE FL KYFPP R+A+ R+EIV F++ E+ET SEAWERFKE LRKCPHHGLPHCIQ+ETFYNGLN AT+ +VDASA 
Subjt:  IRDGAKAWLNSFAPGSIRTWNELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQGMVDASAG

Query:  GALLAKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADLAMIANALKNV---------------TVI---------------SHQQP
        G +L+KT+NEA+EILERI++N+CQW DVR    KK + VLEVD +S+I A LA + N L+N+               TV+               +  Q 
Subjt:  GALLAKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADLAMIANALKNV---------------TVI---------------SHQQP

Query:  PAVEPAALV---NQVQRKHVSIVVKITT--TSWRNHPNFSWGGQGS-NVQAQQKVN
        P+  PA++    NQ  + +        T    WRNHPNF   GQGS N Q   K N
Subjt:  PAVEPAALV---NQVQRKHVSIVVKITT--TSWRNHPNFSWGGQGS-NVQAQQKVN

A0A6J1H7E4 uncharacterized protein LOC111461168 (e value: 1.3e-62; % identity: 54.29)
Query:  IRDGAKAWLNSFAPGSIRTWNELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQGMVDASAG
        +RDGAK+WLN+ AP +I +WN LAE FL KYFPP RNA+ R+EIV F+Q EDET SEAWERFKE+LRKCPHHGLPHCIQMETFYNGLN AT+ +VDASA 
Subjt:  IRDGAKAWLNSFAPGSIRTWNELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQGMVDASAG

Query:  GALLAKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADLAMIANALKNV-----TVI---SHQQPPAVEPAA----------LVNQV
        GA+L+KT+NEA+EILERI++N+CQW+DVR    KK + VLEVD +S+I A LA + N L+N+     T+I   +H      + A             +Q 
Subjt:  GALLAKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADLAMIANALKNV-----TVI---SHQQPPAVEPAA----------LVNQV

Query:  QRKHVSIVVKITTTS----------------WRNHPNFSWGGQGS
         R   SI+      S                WRNHPNFSW GQGS
Subjt:  QRKHVSIVVKITTTS----------------WRNHPNFSWGGQGS

U5CUI2 Retrotrans_gag domain-containing protein (e value: 1.3e-54; % identity: 58.1)
Query:  IRDGAKAWLNSFAPGSIRTWNELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQGMVDASAG
        +RD A++WLN+  P S+  WN+LAE FL KYFPP RNAK RSEI+ F+QLEDE+ S+AWERFKELLRKCPHHG+PHCIQMETFYNGLN A++ ++DASA 
Subjt:  IRDGAKAWLNSFAPGSIRTWNELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQGMVDASAG

Query:  GALLAKTFNEAHEILERISTNSCQWSDVRG-TNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAVEPAALV
        GA+L+K++NEA EILE I++N+ QWS+ R  T++KV  VLEVD ++ + A +A + N LKN+++ + +    ++PAA +
Subjt:  GALLAKTFNEAHEILERISTNSCQWSDVRG-TNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAVEPAALV

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found
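
The alignment blocks above use a BLAST-style middle row in which a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch under that assumption, percent identity for a single block can be estimated from the query row and the middle row alone. The function name `identity_stats` is hypothetical, and a value computed from one displayed block need not match the page's reported % identity, which presumably covers the whole alignment.

```python
def identity_stats(query: str, midline: str):
    """Return (identical, positives, columns, percent_identity) for one block.

    Assumes BLAST midline conventions: letter = identical residue,
    '+' = conservative substitution, space = mismatch or gap.
    """
    # Trailing spaces are often stripped from the midline, so pad it back.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("midline is longer than the query row")
    columns = len(query)
    if columns == 0:
        raise ValueError("empty alignment block")
    identical = sum(1 for ch in midline if ch.isalpha())
    positives = identical + midline.count("+")
    return identical, positives, columns, 100.0 * identical / columns


# First ten columns of the first GenBank hit shown above.
stats = identity_stats("IRDGAKAWLN", "+RDGAK+WLN")
print(stats)
```

Dividing by the number of alignment columns, gaps included, follows the usual BLAST convention; other tools divide by the shorter sequence length, which gives a different figure.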

Sequences
CDS sequence
ATGAGTGAGAGTGGCCAATACGTCGACTCAATATGCCTTCCTTTTTGGGGACAAGACCGAGTGGGAGGCTGGGGTCATGACAACACAAGAAAGAATCCATCCTTTCCGCT
TCTAGGAGAAGTAGATGAGTGTTCCCCAAGTGGTGACTCCGAGACTTTAACAAAGGGATCATGCCCTCTCATTGGCCCGAGAGGGATTTTTTGTTTAATGCCCCCCAAAA
GGAACCCATCTCCTGTTCGTTCTTCTTCCCTACCGTCAGTGTCGAAACCCGCAGGTCGCCGACCTACACCCTCACCGGCGTCATCTCTTGCATCCCTCGCGTGTTGCTCG
GGTCGAGCAGCAAGTTCCTTCTCGCGTGAGCAGCTGCGGTTTGTCCTCTCTCTCTCGTTTTCTCTTCTTGCTGCCGAACCACAATTGCACTCTTCTCACTGTAGACATAA
TTCTGTGTCCACGGATATCAACCATCAACAGCAAGTTAGCCCTTCACGTGTGTTCGTATCCCAGCTAGGTCAAATTACCGTTTTACCCCTCGGCTACCTCTTGGTCCTTG
AGTACCAATGCTCCTCTAATGAACAACTGTTTGTGGTCCAACCTGCAAAAGAAATCCTCTCGTGCCATAAAGAGGGATACCCCCACTCGCATGTCAACTACATGAACCGT
TGGATCATTACGTTTGTATCAATATACAAAGCGGGCGTATCCATAGTGTCACCAGGACAAGAGGAGCATCGAGGCCTTCGGTACAAATGGTCAAGGATCGGTGTGATGAG
TCTAGCAAGCATCGGTGCACCTTGGTACAAATGGTCAAGGGCGATGCACATTCGAGGCCTTGGTACAAATGGTCAAGGGTCGAACGCCGAGCTCCGTAGAGAGCATTGTG
GCCCTGGGTACAAATGGTCAGGGGACAGTGCGACTCGAAGGATCAGTCTTGGAGAGCCAACTTTGAGAAATATGTTGTTTGCGCATGAAAACATGTTGGTTGCTTGGTTG
TTGTTTGATGGCTTCTTAGGTAGGAGTGTGATAACTTTCCAAAAGAATGATTGCTGGGCGACTGAGGGAGCAAATTCTGTGCTGCAGCAAAGCTGGGAGCAAAACTGCCA
CGTCACAGCTCGCGTGAGTTTGGTGCATGAGCGATCCGCCTGGGTAAGGTTCGAGCTTGATCCAGAAATCGAGAGGACATTCAGGAGAAGAAGGAGAGAGCAGCGCAGAA
ACCAGAATCAAATGGATAACGTGTCGCGTCTTCCGCAGGGTCCTGAAGATCCAGCAACCCCCAGAATCGTTTGCTGCAGCAAAACCAGCCGCTGGAGCAAAATGAGCAGC
AAAATAATCAGAGATGGAGCAAAGGCATGGTTAAATTCTTTTGCTCCAGGATCAATTAGGACATGGAATGAGTTAGCTGAAAATTTTTTGAGTAAATATTTCCCACCAAA
TAGAAATGCTAAATTAAGGAGTGAAATAGTAGGGTTTAGGCAACTTGAAGATGAGACTTTTAGTGAGGCTTGGGAGAGGTTTAAGGAGCTTTTGCGAAAGTGTCCCCACC
ATGGTTTACCTCATTGTATTCAAATGGAAACATTTTACAATGGGTTAAACGGAGCAACCCAAGGTATGGTTGATGCTTCGGCTGGAGGGGCCCTTTTGGCAAAAACTTTT
AATGAAGCCCATGAAATTTTAGAAAGAATATCTACTAATAGTTGTCAGTGGTCAGATGTTAGAGGCACAAATAAAAAGGTTAAGAGTGTGTTAGAGGTTGATGGTGTGTC
CACCATTAGGGCTGATCTTGCAATGATTGCTAACGCTCTTAAGAATGTGACAGTGATTAGTCATCAGCAGCCACCAGCTGTGGAGCCTGCTGCATTGGTGAACCAAGTCC
AGAGGAAGCATGTGTCTATTGTGGTGAAGATCACAACTACGAGTTGGCGCAACCACCCCAACTTCTCATGGGGAGGACAAGGAAGTAATGTGCAAGCACAACAAAAGGTG
AACCAGTCGGGATTTGCTAAAGCGCAGGTATTGCCCAGCAAAATAAGCAGGCTTTGCCCCAGCAAAATTCGGGGAGTTCTCTTGAGGCAATGA
mRNA sequence
ATGAGTGAGAGTGGCCAATACGTCGACTCAATATGCCTTCCTTTTTGGGGACAAGACCGAGTGGGAGGCTGGGGTCATGACAACACAAGAAAGAATCCATCCTTTCCGCT
TCTAGGAGAAGTAGATGAGTGTTCCCCAAGTGGTGACTCCGAGACTTTAACAAAGGGATCATGCCCTCTCATTGGCCCGAGAGGGATTTTTTGTTTAATGCCCCCCAAAA
GGAACCCATCTCCTGTTCGTTCTTCTTCCCTACCGTCAGTGTCGAAACCCGCAGGTCGCCGACCTACACCCTCACCGGCGTCATCTCTTGCATCCCTCGCGTGTTGCTCG
GGTCGAGCAGCAAGTTCCTTCTCGCGTGAGCAGCTGCGGTTTGTCCTCTCTCTCTCGTTTTCTCTTCTTGCTGCCGAACCACAATTGCACTCTTCTCACTGTAGACATAA
TTCTGTGTCCACGGATATCAACCATCAACAGCAAGTTAGCCCTTCACGTGTGTTCGTATCCCAGCTAGGTCAAATTACCGTTTTACCCCTCGGCTACCTCTTGGTCCTTG
AGTACCAATGCTCCTCTAATGAACAACTGTTTGTGGTCCAACCTGCAAAAGAAATCCTCTCGTGCCATAAAGAGGGATACCCCCACTCGCATGTCAACTACATGAACCGT
TGGATCATTACGTTTGTATCAATATACAAAGCGGGCGTATCCATAGTGTCACCAGGACAAGAGGAGCATCGAGGCCTTCGGTACAAATGGTCAAGGATCGGTGTGATGAG
TCTAGCAAGCATCGGTGCACCTTGGTACAAATGGTCAAGGGCGATGCACATTCGAGGCCTTGGTACAAATGGTCAAGGGTCGAACGCCGAGCTCCGTAGAGAGCATTGTG
GCCCTGGGTACAAATGGTCAGGGGACAGTGCGACTCGAAGGATCAGTCTTGGAGAGCCAACTTTGAGAAATATGTTGTTTGCGCATGAAAACATGTTGGTTGCTTGGTTG
TTGTTTGATGGCTTCTTAGGTAGGAGTGTGATAACTTTCCAAAAGAATGATTGCTGGGCGACTGAGGGAGCAAATTCTGTGCTGCAGCAAAGCTGGGAGCAAAACTGCCA
CGTCACAGCTCGCGTGAGTTTGGTGCATGAGCGATCCGCCTGGGTAAGGTTCGAGCTTGATCCAGAAATCGAGAGGACATTCAGGAGAAGAAGGAGAGAGCAGCGCAGAA
ACCAGAATCAAATGGATAACGTGTCGCGTCTTCCGCAGGGTCCTGAAGATCCAGCAACCCCCAGAATCGTTTGCTGCAGCAAAACCAGCCGCTGGAGCAAAATGAGCAGC
AAAATAATCAGAGATGGAGCAAAGGCATGGTTAAATTCTTTTGCTCCAGGATCAATTAGGACATGGAATGAGTTAGCTGAAAATTTTTTGAGTAAATATTTCCCACCAAA
TAGAAATGCTAAATTAAGGAGTGAAATAGTAGGGTTTAGGCAACTTGAAGATGAGACTTTTAGTGAGGCTTGGGAGAGGTTTAAGGAGCTTTTGCGAAAGTGTCCCCACC
ATGGTTTACCTCATTGTATTCAAATGGAAACATTTTACAATGGGTTAAACGGAGCAACCCAAGGTATGGTTGATGCTTCGGCTGGAGGGGCCCTTTTGGCAAAAACTTTT
AATGAAGCCCATGAAATTTTAGAAAGAATATCTACTAATAGTTGTCAGTGGTCAGATGTTAGAGGCACAAATAAAAAGGTTAAGAGTGTGTTAGAGGTTGATGGTGTGTC
CACCATTAGGGCTGATCTTGCAATGATTGCTAACGCTCTTAAGAATGTGACAGTGATTAGTCATCAGCAGCCACCAGCTGTGGAGCCTGCTGCATTGGTGAACCAAGTCC
AGAGGAAGCATGTGTCTATTGTGGTGAAGATCACAACTACGAGTTGGCGCAACCACCCCAACTTCTCATGGGGAGGACAAGGAAGTAATGTGCAAGCACAACAAAAGGTG
AACCAGTCGGGATTTGCTAAAGCGCAGGTATTGCCCAGCAAAATAAGCAGGCTTTGCCCCAGCAAAATTCGGGGAGTTCTCTTGAGGCAATGA
Protein sequence
MSESGQYVDSICLPFWGQDRVGGWGHDNTRKNPSFPLLGEVDECSPSGDSETLTKGSCPLIGPRGIFCLMPPKRNPSPVRSSSLPSVSKPAGRRPTPSPASSLASLACCS
GRAASSFSREQLRFVLSLSFSLLAAEPQLHSSHCRHNSVSTDINHQQQVSPSRVFVSQLGQITVLPLGYLLVLEYQCSSNEQLFVVQPAKEILSCHKEGYPHSHVNYMNR
WIITFVSIYKAGVSIVSPGQEEHRGLRYKWSRIGVMSLASIGAPWYKWSRAMHIRGLGTNGQGSNAELRREHCGPGYKWSGDSATRRISLGEPTLRNMLFAHENMLVAWL
LFDGFLGRSVITFQKNDCWATEGANSVLQQSWEQNCHVTARVSLVHERSAWVRFELDPEIERTFRRRRREQRRNQNQMDNVSRLPQGPEDPATPRIVCCSKTSRWSKMSS
KIIRDGAKAWLNSFAPGSIRTWNELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQGMVDASAGGALLAKTF
NEAHEILERISTNSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAVEPAALVNQVQRKHVSIVVKITTTSWRNHPNFSWGGQGSNVQAQQKV
NQSGFAKAQVLPSKISRLCPSKIRGVLLRQ
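
The protein sequence above should correspond to a translation of the CDS. A minimal sketch of that check, assuming the standard nuclear genetic code (the record does not name a translation table), is given below. `translate` is an illustrative helper written for this example; the two short inputs are the first 30 and last 21 nucleotides of the CDS listed in this record.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a CDS in frame 1; unknown codons become 'X'."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        residues.append(aa)
    return "".join(residues)


cds_start = "ATGAGTGAGAGTGGCCAATACGTCGACTCA"  # first 30 nt of the CDS
cds_end = "GGAGTTCTCTTGAGGCAATGA"             # last 21 nt of the CDS
print(translate(cds_start))  # MSESGQYVDS
print(translate(cds_end))    # GVLLRQ
```

Both fragments agree with the start (`MSESGQYVDS`) and end (`GVLLRQ`) of the protein sequence listed above; passing the full CDS with its line breaks works the same way, since whitespace is stripped before translation.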