; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0011471 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0011471
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionDUF1985 domain-containing protein
Genome locationchr1:25745905..25751387
RNA-Seq ExpressionLag0011471
SyntenyLag0011471
Gene Ontology termsGO:0048856 - anatomical structure development (biological process)
GO:0016020 - membrane (cellular component)
InterPro domainsIPR015410 - Domain of unknown function DUF1985


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0060374.1 uncharacterized protein E6C27_scaffold22G001730 [Cucumis melo var. makuwa]1.2e-4955.62Show/hide
Query:  PPLDMVKLKGRFLHKYFNKDKPIKRSIVSELFSSMEGVKRKDRVRLAKLYFLSNFLLGKQISTGIELDFITLLDDEQLFDSYPWGRIAYTTTIDSIKKSI
        P +D  K+KG+FL KYF  + PI RS VS LF+  + +K +D++++AK+YFL NFLLGKQ +TG + + I LLDDEQ FD+YPWGRI Y   IDSIKKSI
Subjt:  PPLDMVKLKGRFLHKYFNKDKPIKRSIVSELFSSMEGVKRKDRVRLAKLYFLSNFLLGKQISTGIELDFITLLDDEQLFDSYPWGRIAYTTTIDSIKKSI

Query:  KNPEALVVGIAGFPYALLVWAYECIPLLSGPSMICAQRVSSTIPMMNNWVADSHPEWKDLAIKCFENEN
        KNP AL VGI+GF Y+LLVW Y+C+PLL  PS+ CAQ++      + NW+ ++HPEWK+LA + F +E+
Subjt:  KNPEALVVGIAGFPYALLVWAYECIPLLSGPSMICAQRVSSTIPMMNNWVADSHPEWKDLAIKCFENEN

XP_008440207.1 PREDICTED: uncharacterized protein LOC103484737 isoform X1 [Cucumis melo]1.7e-4345.05Show/hide
Query:  PPLDMVKL-KGRFLHKYFNKDKPIKRSIVSELFSSMEGVKRKDRVRLAKLYFLSNFLLGKQISTGIELDFITLLDDEQLFDSYPWGRIAYTTTIDSIKKS
        P +DM K+ KG+F  +YF  +K I+R+ + E+F+ M+  + KD V++AKLY L  F+LGKQ+ TGI  ++  L+DD++ FDSYPWGRI+Y  TID +KK+
Subjt:  PPLDMVKL-KGRFLHKYFNKDKPIKRSIVSELFSSMEGVKRKDRVRLAKLYFLSNFLLGKQISTGIELDFITLLDDEQLFDSYPWGRIAYTTTIDSIKKS

Query:  IKNPEALVVGIAGFPYALLVWAYECIPLLSGPSMICAQRVSSTIPMMNNWVADSHPEWKDLAIKCFENENFEF--IEMTDEEIESMF---HEGNKEKNAD
        IK+ +A  +G+ GFP+AL VWAYE IPLL+  S   A R+S   P MNNW AD HPEWKDL+ K F++E F+   +  T+ E+E  +     G K  N  
Subjt:  IKNPEALVVGIAGFPYALLVWAYECIPLLSGPSMICAQRVSSTIPMMNNWVADSHPEWKDLAIKCFENENFEF--IEMTDEEIESMF---HEGNKEKNAD

Query:  NRFQGETSRGKEKATDERESDH
        N+F  +     +  T   + DH
Subjt:  NRFQGETSRGKEKATDERESDH

XP_008440208.1 PREDICTED: uncharacterized protein LOC103484737 isoform X2 [Cucumis melo]1.7e-4345.05Show/hide
Query:  PPLDMVKL-KGRFLHKYFNKDKPIKRSIVSELFSSMEGVKRKDRVRLAKLYFLSNFLLGKQISTGIELDFITLLDDEQLFDSYPWGRIAYTTTIDSIKKS
        P +DM K+ KG+F  +YF  +K I+R+ + E+F+ M+  + KD V++AKLY L  F+LGKQ+ TGI  ++  L+DD++ FDSYPWGRI+Y  TID +KK+
Subjt:  PPLDMVKL-KGRFLHKYFNKDKPIKRSIVSELFSSMEGVKRKDRVRLAKLYFLSNFLLGKQISTGIELDFITLLDDEQLFDSYPWGRIAYTTTIDSIKKS

Query:  IKNPEALVVGIAGFPYALLVWAYECIPLLSGPSMICAQRVSSTIPMMNNWVADSHPEWKDLAIKCFENENFEF--IEMTDEEIESMF---HEGNKEKNAD
        IK+ +A  +G+ GFP+AL VWAYE IPLL+  S   A R+S   P MNNW AD HPEWKDL+ K F++E F+   +  T+ E+E  +     G K  N  
Subjt:  IKNPEALVVGIAGFPYALLVWAYECIPLLSGPSMICAQRVSSTIPMMNNWVADSHPEWKDLAIKCFENENFEF--IEMTDEEIESMF---HEGNKEKNAD

Query:  NRFQGETSRGKEKATDERESDH
        N+F  +     +  T   + DH
Subjt:  NRFQGETSRGKEKATDERESDH

XP_008440212.1 PREDICTED: uncharacterized protein LOC103484737 isoform X5 [Cucumis melo]1.7e-4345.05Show/hide
Query:  PPLDMVKL-KGRFLHKYFNKDKPIKRSIVSELFSSMEGVKRKDRVRLAKLYFLSNFLLGKQISTGIELDFITLLDDEQLFDSYPWGRIAYTTTIDSIKKS
        P +DM K+ KG+F  +YF  +K I+R+ + E+F+ M+  + KD V++AKLY L  F+LGKQ+ TGI  ++  L+DD++ FDSYPWGRI+Y  TID +KK+
Subjt:  PPLDMVKL-KGRFLHKYFNKDKPIKRSIVSELFSSMEGVKRKDRVRLAKLYFLSNFLLGKQISTGIELDFITLLDDEQLFDSYPWGRIAYTTTIDSIKKS

Query:  IKNPEALVVGIAGFPYALLVWAYECIPLLSGPSMICAQRVSSTIPMMNNWVADSHPEWKDLAIKCFENENFEF--IEMTDEEIESMF---HEGNKEKNAD
        IK+ +A  +G+ GFP+AL VWAYE IPLL+  S   A R+S   P MNNW AD HPEWKDL+ K F++E F+   +  T+ E+E  +     G K  N  
Subjt:  IKNPEALVVGIAGFPYALLVWAYECIPLLSGPSMICAQRVSSTIPMMNNWVADSHPEWKDLAIKCFENENFEF--IEMTDEEIESMF---HEGNKEKNAD

Query:  NRFQGETSRGKEKATDERESDH
        N+F  +     +  T   + DH
Subjt:  NRFQGETSRGKEKATDERESDH

XP_016899363.1 PREDICTED: uncharacterized protein LOC103484737 isoform X3 [Cucumis melo]1.7e-4345.05Show/hide
Query:  PPLDMVKL-KGRFLHKYFNKDKPIKRSIVSELFSSMEGVKRKDRVRLAKLYFLSNFLLGKQISTGIELDFITLLDDEQLFDSYPWGRIAYTTTIDSIKKS
        P +DM K+ KG+F  +YF  +K I+R+ + E+F+ M+  + KD V++AKLY L  F+LGKQ+ TGI  ++  L+DD++ FDSYPWGRI+Y  TID +KK+
Subjt:  PPLDMVKL-KGRFLHKYFNKDKPIKRSIVSELFSSMEGVKRKDRVRLAKLYFLSNFLLGKQISTGIELDFITLLDDEQLFDSYPWGRIAYTTTIDSIKKS

Query:  IKNPEALVVGIAGFPYALLVWAYECIPLLSGPSMICAQRVSSTIPMMNNWVADSHPEWKDLAIKCFENENFEF--IEMTDEEIESMF---HEGNKEKNAD
        IK+ +A  +G+ GFP+AL VWAYE IPLL+  S   A R+S   P MNNW AD HPEWKDL+ K F++E F+   +  T+ E+E  +     G K  N  
Subjt:  IKNPEALVVGIAGFPYALLVWAYECIPLLSGPSMICAQRVSSTIPMMNNWVADSHPEWKDLAIKCFENENFEF--IEMTDEEIESMF---HEGNKEKNAD

Query:  NRFQGETSRGKEKATDERESDH
        N+F  +     +  T   + DH
Subjt:  NRFQGETSRGKEKATDERESDH

TrEMBL top hitse value%identityAlignment
A0A1S3B065 uncharacterized protein LOC103484737 isoform X48.1e-4445.05Show/hide
Query:  PPLDMVKL-KGRFLHKYFNKDKPIKRSIVSELFSSMEGVKRKDRVRLAKLYFLSNFLLGKQISTGIELDFITLLDDEQLFDSYPWGRIAYTTTIDSIKKS
        P +DM K+ KG+F  +YF  +K I+R+ + E+F+ M+  + KD V++AKLY L  F+LGKQ+ TGI  ++  L+DD++ FDSYPWGRI+Y  TID +KK+
Subjt:  PPLDMVKL-KGRFLHKYFNKDKPIKRSIVSELFSSMEGVKRKDRVRLAKLYFLSNFLLGKQISTGIELDFITLLDDEQLFDSYPWGRIAYTTTIDSIKKS

Query:  IKNPEALVVGIAGFPYALLVWAYECIPLLSGPSMICAQRVSSTIPMMNNWVADSHPEWKDLAIKCFENENFEF--IEMTDEEIESMF---HEGNKEKNAD
        IK+ +A  +G+ GFP+AL VWAYE IPLL+  S   A R+S   P MNNW AD HPEWKDL+ K F++E F+   +  T+ E+E  +     G K  N  
Subjt:  IKNPEALVVGIAGFPYALLVWAYECIPLLSGPSMICAQRVSSTIPMMNNWVADSHPEWKDLAIKCFENENFEF--IEMTDEEIESMF---HEGNKEKNAD

Query:  NRFQGETSRGKEKATDERESDH
        N+F  +     +  T   + DH
Subjt:  NRFQGETSRGKEKATDERESDH

A0A1S3B0L9 uncharacterized protein LOC103484737 isoform X58.1e-4445.05Show/hide
Query:  PPLDMVKL-KGRFLHKYFNKDKPIKRSIVSELFSSMEGVKRKDRVRLAKLYFLSNFLLGKQISTGIELDFITLLDDEQLFDSYPWGRIAYTTTIDSIKKS
        P +DM K+ KG+F  +YF  +K I+R+ + E+F+ M+  + KD V++AKLY L  F+LGKQ+ TGI  ++  L+DD++ FDSYPWGRI+Y  TID +KK+
Subjt:  PPLDMVKL-KGRFLHKYFNKDKPIKRSIVSELFSSMEGVKRKDRVRLAKLYFLSNFLLGKQISTGIELDFITLLDDEQLFDSYPWGRIAYTTTIDSIKKS

Query:  IKNPEALVVGIAGFPYALLVWAYECIPLLSGPSMICAQRVSSTIPMMNNWVADSHPEWKDLAIKCFENENFEF--IEMTDEEIESMF---HEGNKEKNAD
        IK+ +A  +G+ GFP+AL VWAYE IPLL+  S   A R+S   P MNNW AD HPEWKDL+ K F++E F+   +  T+ E+E  +     G K  N  
Subjt:  IKNPEALVVGIAGFPYALLVWAYECIPLLSGPSMICAQRVSSTIPMMNNWVADSHPEWKDLAIKCFENENFEF--IEMTDEEIESMF---HEGNKEKNAD

Query:  NRFQGETSRGKEKATDERESDH
        N+F  +     +  T   + DH
Subjt:  NRFQGETSRGKEKATDERESDH

A0A1S3B181 uncharacterized protein LOC103484737 isoform X78.1e-4445.05Show/hide
Query:  PPLDMVKL-KGRFLHKYFNKDKPIKRSIVSELFSSMEGVKRKDRVRLAKLYFLSNFLLGKQISTGIELDFITLLDDEQLFDSYPWGRIAYTTTIDSIKKS
        P +DM K+ KG+F  +YF  +K I+R+ + E+F+ M+  + KD V++AKLY L  F+LGKQ+ TGI  ++  L+DD++ FDSYPWGRI+Y  TID +KK+
Subjt:  PPLDMVKL-KGRFLHKYFNKDKPIKRSIVSELFSSMEGVKRKDRVRLAKLYFLSNFLLGKQISTGIELDFITLLDDEQLFDSYPWGRIAYTTTIDSIKKS

Query:  IKNPEALVVGIAGFPYALLVWAYECIPLLSGPSMICAQRVSSTIPMMNNWVADSHPEWKDLAIKCFENENFEF--IEMTDEEIESMF---HEGNKEKNAD
        IK+ +A  +G+ GFP+AL VWAYE IPLL+  S   A R+S   P MNNW AD HPEWKDL+ K F++E F+   +  T+ E+E  +     G K  N  
Subjt:  IKNPEALVVGIAGFPYALLVWAYECIPLLSGPSMICAQRVSSTIPMMNNWVADSHPEWKDLAIKCFENENFEF--IEMTDEEIESMF---HEGNKEKNAD

Query:  NRFQGETSRGKEKATDERESDH
        N+F  +     +  T   + DH
Subjt:  NRFQGETSRGKEKATDERESDH

A0A1S4DTS6 uncharacterized protein LOC103484737 isoform X38.1e-4445.05Show/hide
Query:  PPLDMVKL-KGRFLHKYFNKDKPIKRSIVSELFSSMEGVKRKDRVRLAKLYFLSNFLLGKQISTGIELDFITLLDDEQLFDSYPWGRIAYTTTIDSIKKS
        P +DM K+ KG+F  +YF  +K I+R+ + E+F+ M+  + KD V++AKLY L  F+LGKQ+ TGI  ++  L+DD++ FDSYPWGRI+Y  TID +KK+
Subjt:  PPLDMVKL-KGRFLHKYFNKDKPIKRSIVSELFSSMEGVKRKDRVRLAKLYFLSNFLLGKQISTGIELDFITLLDDEQLFDSYPWGRIAYTTTIDSIKKS

Query:  IKNPEALVVGIAGFPYALLVWAYECIPLLSGPSMICAQRVSSTIPMMNNWVADSHPEWKDLAIKCFENENFEF--IEMTDEEIESMF---HEGNKEKNAD
        IK+ +A  +G+ GFP+AL VWAYE IPLL+  S   A R+S   P MNNW AD HPEWKDL+ K F++E F+   +  T+ E+E  +     G K  N  
Subjt:  IKNPEALVVGIAGFPYALLVWAYECIPLLSGPSMICAQRVSSTIPMMNNWVADSHPEWKDLAIKCFENENFEF--IEMTDEEIESMF---HEGNKEKNAD

Query:  NRFQGETSRGKEKATDERESDH
        N+F  +     +  T   + DH
Subjt:  NRFQGETSRGKEKATDERESDH

A0A5A7UZA2 DUF1985 domain-containing protein5.8e-5055.62Show/hide
Query:  PPLDMVKLKGRFLHKYFNKDKPIKRSIVSELFSSMEGVKRKDRVRLAKLYFLSNFLLGKQISTGIELDFITLLDDEQLFDSYPWGRIAYTTTIDSIKKSI
        P +D  K+KG+FL KYF  + PI RS VS LF+  + +K +D++++AK+YFL NFLLGKQ +TG + + I LLDDEQ FD+YPWGRI Y   IDSIKKSI
Subjt:  PPLDMVKLKGRFLHKYFNKDKPIKRSIVSELFSSMEGVKRKDRVRLAKLYFLSNFLLGKQISTGIELDFITLLDDEQLFDSYPWGRIAYTTTIDSIKKSI

Query:  KNPEALVVGIAGFPYALLVWAYECIPLLSGPSMICAQRVSSTIPMMNNWVADSHPEWKDLAIKCFENEN
        KNP AL VGI+GF Y+LLVW Y+C+PLL  PS+ CAQ++      + NW+ ++HPEWK+LA + F +E+
Subjt:  KNPEALVVGIAGFPYALLVWAYECIPLLSGPSMICAQRVSSTIPMMNNWVADSHPEWKDLAIKCFENEN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G07240.1 cysteine-type peptidases;cysteine-type peptidases4.3e-0532.29Show/hide
Query:  RKDRVRLAKLYFLSNFLLGKQISTGIELDFITLLDDEQLFDSYPWGRIAYTTTIDSIK-KSIKNPEALVVGIAGFPYALLVWAYECIPLL-SGPSM
        R+ R+R A L  +  FLL       I  D   + +D Q F SYPWGR+++   + SIK + ++      V + G  YAL +   E +P +  GP +
Subjt:  RKDRVRLAKLYFLSNFLLGKQISTGIELDFITLLDDEQLFDSYPWGRIAYTTTIDSIK-KSIKNPEALVVGIAGFPYALLVWAYECIPLL-SGPSM

AT3G31910.1 Domain of unknown function (DUF1985)1.3e-0431.4Show/hide
Query:  KDRVRLAKLYFLSNFLLGKQISTGIELDFITLLDDEQLFDSYPWGRIAYTTTIDSIKKSIKNPEALVVGIAGFPYALLVWAYECIP
        K+R+ + +L  LS  + G    + I L     + D   F+ YPWGR+A+ + I+S+K    + ++ V  I    +AL++W YE +P
Subjt:  KDRVRLAKLYFLSNFLLGKQISTGIELDFITLLDDEQLFDSYPWGRIAYTTTIDSIKKSIKNPEALVVGIAGFPYALLVWAYECIP

AT3G32960.1 Domain of unknown function (DUF1985)2.8e-0430.34Show/hide
Query:  KDRVRLAKLYFLSNFLLGKQISTGIELDFITLLDDEQLFDSYPWGRIAYTTTIDSIKKSI-KNPEALVVGIAGFPYALLVWAYECIPLL
        ++R  LA L  + +  L         ++ +    D +   +YPWG  A+   + SIKK++  N       I GFP AL +W  E IP+L
Subjt:  KDRVRLAKLYFLSNFLLGKQISTGIELDFITLLDDEQLFDSYPWGRIAYTTTIDSIKKSI-KNPEALVVGIAGFPYALLVWAYECIPLL

AT5G28810.1 Domain of unknown function (DUF1985)2.8e-0432.14Show/hide
Query:  RVRLAKLYFLSNFLLGKQISTGIELDFITLLDDEQLFDSYPWGRIAYTTTIDSIKKSIKNPEALVVGIAGFPYALLVWAYECIP
        R+ + +L  LS  + G    + + L     + D   F+ YPWGR+A+ + + S+K    + ++ V  I G   ALLVW YE +P
Subjt:  RVRLAKLYFLSNFLLGKQISTGIELDFITLLDDEQLFDSYPWGRIAYTTTIDSIKKSIKNPEALVVGIAGFPYALLVWAYECIP

AT5G45570.1 Ulp1 protease family protein6.2e-0429.25Show/hide
Query:  VSELFSSMEGV-------KRKDRVRLAKLYFLSNFLLGKQISTGIELDFITLLDDEQLFDSYPWGRIAYTTTIDSIKKSIKNPEALVVGIAGFPYALLVW
        V  LF+ +E V         + R+ + +L  LS  + G    + + L     + D   F+ YPWGR+A+ +   S+K    + ++ V  I G    LLVW
Subjt:  VSELFSSMEGV-------KRKDRVRLAKLYFLSNFLLGKQISTGIELDFITLLDDEQLFDSYPWGRIAYTTTIDSIKKSIKNPEALVVGIAGFPYALLVW

Query:  AYECIP
         YE +P
Subjt:  AYECIP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGTAGGTGATTGCTTCTCCCTTTTAGATTTAGATCTGAAATTAAGCTATCGAAAAGGAAAAAACACAGGCGAGGGATTTTCTCTCGGGCGAAAAGGGGACGAAAA
GGCGAAAAGGGAAAGAAAAAGACGAAAAGGCGAACGAGTTCCGGCGAGGGGTTTTTTCTCCTGCAACCTTGCGAAGGAGTTCCGGCGAAAAAAGGTAGAAGCTTCGCACG
AAGGAGACGAAGAGATATCGAACTTCCGACGACTGGAAAATACCTCTGAATCAATAGACACAGCGATGGACAACATAATCCCACGTGGAGACGTTGAAACAGAACAAACT
AAAGATGTTCCCGGGACCCAGATAGTCCCGTATGGATATTGTTTCGATGATGTGCTTCCTGAGGAAACCGAAGAAATAGAGAATACATCAGATGGCGATGAGGAAACAGC
TGATGAACAAACTGAAGTTGTCTATGAGGAAGATACTCGCATGGAGCCTGATGAAGAAACGAGTCCATTTAGTGGGGAGAAGCGGAAACAGCCACAATACCATCAAAAGG
ATAAAACCCCAAAAAGGAAGAGAATCGAGCAATATGCTACTGCTTGTATGAGAGAAACCCGGCAATCAGCGTCTTCCAAAAGCGTTAAGCAAACATGCCCTCCTCCAAAA
CCTAAAGGAAACAAATCTGTGAAGAAGGCAAACACTTCTTCCAAAACAACCAAGAAGAGGAAGAATCCTGTTGCGGTGAAAAAGACTACAAAGTCTGTAATTTATAGCAG
GGGGTCGATATTGGGGGCACCAAGGGATGAAATGTCTGAATTGTGGAGAACTCCACCATTGGATATGGTAAAGTTGAAAGGGAGGTTCCTCCACAAGTACTTCAACAAAG
ATAAACCCATCAAAAGATCAATAGTGAGTGAACTATTTTCTTCGATGGAGGGGGTAAAAAGGAAGGATAGGGTTAGGTTGGCCAAATTGTATTTCCTATCAAACTTTCTT
CTAGGAAAACAAATTAGCACGGGAATAGAATTGGATTTCATAACTTTGCTTGATGATGAGCAGCTATTTGACAGCTACCCTTGGGGGCGAATCGCATATACCACTACCAT
AGACTCTATAAAGAAATCGATTAAAAATCCTGAAGCTCTAGTGGTAGGAATCGCTGGATTCCCATATGCCTTGCTTGTTTGGGCATATGAGTGCATTCCCCTTTTATCCG
GCCCTTCCATGATCTGTGCACAACGAGTATCTTCCACAATTCCGATGATGAACAATTGGGTAGCTGATAGCCACCCTGAATGGAAAGATCTTGCTATCAAGTGCTTTGAA
AATGAAAATTTTGAGTTTATTGAGATGACGGATGAGGAGATTGAGTCTATGTTCCATGAAGGGAACAAAGAAAAGAATGCAGACAACCGTTTCCAAGGGGAAACTTCCAG
AGGAAAAGAAAAGGCCACAGATGAAAGAGAGAGTGATCATTTTGAAGGGGGAGACATACCTCCTAATGTAGAAGATGTCACTCAAGATATAAGGGCCTTAGTAATTGACT
GCTTTGAAGTTCTTAACTCAAAAATGGACCGGCTATTTGGAGAAATGGAAAATTTGAAAAAAATGATGAACAAGGAGAATAAACAAATGGAGGGACAGGGGGGAAATGAC
AAAAAAGATCACAATGATAAAGATGACGACGGACAAAACAATGGTGAAGATGATCAAAATGATGAAAATGATAGCGAACATATCCCTCAAGAAGATGCTGCAGACAACTC
ATTAGGAAAGGAAATACATATGCCAAGTCAGGATGATGTCGTAAGCACATCTTTCTTGAGGGAAGTTGAAAAAATTGAGAAGGAGGCTAATTTGATTAAGGCCAAAACTG
TTAATGTCAAGAAAGAGATTGGGACTTCTACCGAAACAAGATTTATTCATGCTATAAAAGGAACAGGATCGTTTGCTCAAACAAACAACATGTTCACTAAACGTGAACGA
AGGGTCATCGTTCCTTCTATGATCTTGAGGTCACCATTCACTTCGAAATTCGGGTCAGCAGAAGGAAAAAAAAAAGACACCAAAACCTCCTACGCAAATGAATTTGATGG
GCCAACGTTCAATTTGCTCACGCAATAG
mRNA sequenceShow/hide mRNA sequence
ATGGCGGTAGGTGATTGCTTCTCCCTTTTAGATTTAGATCTGAAATTAAGCTATCGAAAAGGAAAAAACACAGGCGAGGGATTTTCTCTCGGGCGAAAAGGGGACGAAAA
GGCGAAAAGGGAAAGAAAAAGACGAAAAGGCGAACGAGTTCCGGCGAGGGGTTTTTTCTCCTGCAACCTTGCGAAGGAGTTCCGGCGAAAAAAGGTAGAAGCTTCGCACG
AAGGAGACGAAGAGATATCGAACTTCCGACGACTGGAAAATACCTCTGAATCAATAGACACAGCGATGGACAACATAATCCCACGTGGAGACGTTGAAACAGAACAAACT
AAAGATGTTCCCGGGACCCAGATAGTCCCGTATGGATATTGTTTCGATGATGTGCTTCCTGAGGAAACCGAAGAAATAGAGAATACATCAGATGGCGATGAGGAAACAGC
TGATGAACAAACTGAAGTTGTCTATGAGGAAGATACTCGCATGGAGCCTGATGAAGAAACGAGTCCATTTAGTGGGGAGAAGCGGAAACAGCCACAATACCATCAAAAGG
ATAAAACCCCAAAAAGGAAGAGAATCGAGCAATATGCTACTGCTTGTATGAGAGAAACCCGGCAATCAGCGTCTTCCAAAAGCGTTAAGCAAACATGCCCTCCTCCAAAA
CCTAAAGGAAACAAATCTGTGAAGAAGGCAAACACTTCTTCCAAAACAACCAAGAAGAGGAAGAATCCTGTTGCGGTGAAAAAGACTACAAAGTCTGTAATTTATAGCAG
GGGGTCGATATTGGGGGCACCAAGGGATGAAATGTCTGAATTGTGGAGAACTCCACCATTGGATATGGTAAAGTTGAAAGGGAGGTTCCTCCACAAGTACTTCAACAAAG
ATAAACCCATCAAAAGATCAATAGTGAGTGAACTATTTTCTTCGATGGAGGGGGTAAAAAGGAAGGATAGGGTTAGGTTGGCCAAATTGTATTTCCTATCAAACTTTCTT
CTAGGAAAACAAATTAGCACGGGAATAGAATTGGATTTCATAACTTTGCTTGATGATGAGCAGCTATTTGACAGCTACCCTTGGGGGCGAATCGCATATACCACTACCAT
AGACTCTATAAAGAAATCGATTAAAAATCCTGAAGCTCTAGTGGTAGGAATCGCTGGATTCCCATATGCCTTGCTTGTTTGGGCATATGAGTGCATTCCCCTTTTATCCG
GCCCTTCCATGATCTGTGCACAACGAGTATCTTCCACAATTCCGATGATGAACAATTGGGTAGCTGATAGCCACCCTGAATGGAAAGATCTTGCTATCAAGTGCTTTGAA
AATGAAAATTTTGAGTTTATTGAGATGACGGATGAGGAGATTGAGTCTATGTTCCATGAAGGGAACAAAGAAAAGAATGCAGACAACCGTTTCCAAGGGGAAACTTCCAG
AGGAAAAGAAAAGGCCACAGATGAAAGAGAGAGTGATCATTTTGAAGGGGGAGACATACCTCCTAATGTAGAAGATGTCACTCAAGATATAAGGGCCTTAGTAATTGACT
GCTTTGAAGTTCTTAACTCAAAAATGGACCGGCTATTTGGAGAAATGGAAAATTTGAAAAAAATGATGAACAAGGAGAATAAACAAATGGAGGGACAGGGGGGAAATGAC
AAAAAAGATCACAATGATAAAGATGACGACGGACAAAACAATGGTGAAGATGATCAAAATGATGAAAATGATAGCGAACATATCCCTCAAGAAGATGCTGCAGACAACTC
ATTAGGAAAGGAAATACATATGCCAAGTCAGGATGATGTCGTAAGCACATCTTTCTTGAGGGAAGTTGAAAAAATTGAGAAGGAGGCTAATTTGATTAAGGCCAAAACTG
TTAATGTCAAGAAAGAGATTGGGACTTCTACCGAAACAAGATTTATTCATGCTATAAAAGGAACAGGATCGTTTGCTCAAACAAACAACATGTTCACTAAACGTGAACGA
AGGGTCATCGTTCCTTCTATGATCTTGAGGTCACCATTCACTTCGAAATTCGGGTCAGCAGAAGGAAAAAAAAAAGACACCAAAACCTCCTACGCAAATGAATTTGATGG
GCCAACGTTCAATTTGCTCACGCAATAG
Protein sequenceShow/hide protein sequence
MAVGDCFSLLDLDLKLSYRKGKNTGEGFSLGRKGDEKAKRERKRRKGERVPARGFFSCNLAKEFRRKKVEASHEGDEEISNFRRLENTSESIDTAMDNIIPRGDVETEQT
KDVPGTQIVPYGYCFDDVLPEETEEIENTSDGDEETADEQTEVVYEEDTRMEPDEETSPFSGEKRKQPQYHQKDKTPKRKRIEQYATACMRETRQSASSKSVKQTCPPPK
PKGNKSVKKANTSSKTTKKRKNPVAVKKTTKSVIYSRGSILGAPRDEMSELWRTPPLDMVKLKGRFLHKYFNKDKPIKRSIVSELFSSMEGVKRKDRVRLAKLYFLSNFL
LGKQISTGIELDFITLLDDEQLFDSYPWGRIAYTTTIDSIKKSIKNPEALVVGIAGFPYALLVWAYECIPLLSGPSMICAQRVSSTIPMMNNWVADSHPEWKDLAIKCFE
NENFEFIEMTDEEIESMFHEGNKEKNADNRFQGETSRGKEKATDERESDHFEGGDIPPNVEDVTQDIRALVIDCFEVLNSKMDRLFGEMENLKKMMNKENKQMEGQGGND
KKDHNDKDDDGQNNGEDDQNDENDSEHIPQEDAADNSLGKEIHMPSQDDVVSTSFLREVEKIEKEANLIKAKTVNVKKEIGTSTETRFIHAIKGTGSFAQTNNMFTKRER
RVIVPSMILRSPFTSKFGSAEGKKKDTKTSYANEFDGPTFNLLTQ