; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g18360 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g18360
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionPlus3 domain-containing protein
Genome locationchr9:14401845..14403365
RNA-Seq ExpressionMoc09g18360
SyntenyMoc09g18360
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]4.0e-7761.83Show/hide
Query:  ICARKGVGGIVKGSTSIKGWVRKWFFASGEWLAKDESGRPFFDVPIRFRNLLSIRPIPELTQASFDTLKYYNDRFPKGRKIGTLVTDKLLLDSGLLDYNP
        +CARKG  GIVKG TSIKGWVRKWF+ASGEWLAKDES              ++IRP+PELTQASFDTLKYY + FP+GRK+GTLVTDKLLL+SGLLDYNP
Subjt:  ICARKGVGGIVKGSTSIKGWVRKWFFASGEWLAKDESGRPFFDVPIRFRNLLSIRPIPELTQASFDTLKYYNDRFPKGRKIGTLVTDKLLLDSGLLDYNP

Query:  LVRPIEASRPR--------FSSSLKRKSKGRAHALKTVLSTEPTTSTAAQPSAQDNAGQSAELPILVTELDSVGEHSKEKRPRNESEALDISPL-NKVRG
         VRPIE+SRP         F+S++KRKSKG+AHAL+   S++P T     P+++D        P  V EL+S    S+EKRPR+++EA+D+SPL  +VR 
Subjt:  LVRPIEASRPR--------FSSSLKRKSKGRAHALKTVLSTEPTTSTAAQPSAQDNAGQSAELPILVTELDSVGEHSKEKRPRNESEALDISPL-NKVRG

Query:  ESPLKRRRKKKKTTSSFEVGSRGSLPTSHTDLVEDPEARMRGTSDVPMRFWVEPSSSGVKDQ
        E PLKRRRKKKKTTS  EVG+RG LP S  D V+DPEARM GT DV  RF VEPSSSGV+DQ
Subjt:  ESPLKRRRKKKKTTSSFEVGSRGSLPTSHTDLVEDPEARMRGTSDVPMRFWVEPSSSGVKDQ

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]5.4e-9065.67Show/hide
Query:  MFEYGLRLPLHPFAQEFLNRTGLAPTQVAPNGWGVIFALAIL-----------------------SVKRIAKKPGRYYICARKGVGGIVKGSTSIKGWVR
        MFEYGLRLPLHPF QEFL RTGLAP QVAPNGWGVIFALAIL                         KRIAKKPGR+Y+CARKG GGIVKG TSIKGWVR
Subjt:  MFEYGLRLPLHPFAQEFLNRTGLAPTQVAPNGWGVIFALAIL-----------------------SVKRIAKKPGRYYICARKGVGGIVKGSTSIKGWVR

Query:  KWFFASGEWLAKDESGRPFFDVPIRFRNLLSIRPIPELTQASFDTLKYYNDRFPKGRKIGTLVTDKLLLDSGLLDYNPLVRPIEASRP--------RFSS
        KWF+ASGEWLAKDESGR FFDVP RF NL+SIRP+PELTQASFDTLKYY +RFP+GRK+GTLVTD+LLL+SGLLDYNP VRPIE SRP        RF+S
Subjt:  KWFFASGEWLAKDESGRPFFDVPIRFRNLLSIRPIPELTQASFDTLKYYNDRFPKGRKIGTLVTDKLLLDSGLLDYNPLVRPIEASRP--------RFSS

Query:  SLKRKSKGRAHALKTVLSTEPTTSTAAQPSAQDNAGQSAELPILVTELDSVGEHSKEKRPRNESEALD
         +KRKSKGRAHAL+   S++P T     P+++D        P  V EL+S G  S+EKRPR+++EA+D
Subjt:  SLKRKSKGRAHALKTVLSTEPTTSTAAQPSAQDNAGQSAELPILVTELDSVGEHSKEKRPRNESEALD

XP_022158122.1 uncharacterized protein LOC111024680 [Momordica charantia]7.6e-7675.53Show/hide
Query:  MFEYGLRLPLHPFAQEFLNRTGLAPTQVAPNGWGVIFALAIL-----------------------SVKRIAKKPGRYYICARKGVGGIVKGSTSIKGWVR
        MFEYGLRLPLHPF QEFL RTGLAP QVAPNGWGVIFALAIL                         KRIAKKPGR+Y+CARKG GGIVKG TSIKGWVR
Subjt:  MFEYGLRLPLHPFAQEFLNRTGLAPTQVAPNGWGVIFALAIL-----------------------SVKRIAKKPGRYYICARKGVGGIVKGSTSIKGWVR

Query:  KWFFASGEWLAKDESGRPFFDVPIRFRNLLSIRPIPELTQASFDTLKYYNDRFPKGRKIGTLVTDKLLLDSGLLDYNPLVRPIEASRP
        KWF+ASGEWLAKDESGR FFDVP RF NL+SIRP+PELTQASFDTLKYY +RFP+GRK+GTLVTD+LLL+SGLLDYNP VRPIE+SRP
Subjt:  KWFFASGEWLAKDESGRPFFDVPIRFRNLLSIRPIPELTQASFDTLKYYNDRFPKGRKIGTLVTDKLLLDSGLLDYNPLVRPIEASRP

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]2.9e-13167.89Show/hide
Query:  SDFGEDLARRLEFELEEVENFRFFDDGENSGASTSGQGLEYPSKMPEHYLGPLHRGFKILDDILLRIPEEGERADNPPEGWVTLYLKMFEYGLRLPLHPF
        S+   DLARRLE +LEE+EN R  DDGE+S ASTSGQGLEYPS++PEHYLG L RGF I ++ILLR+PEEGERADNPPEGWVTLY KMFEYGLRLPLHPF
Subjt:  SDFGEDLARRLEFELEEVENFRFFDDGENSGASTSGQGLEYPSKMPEHYLGPLHRGFKILDDILLRIPEEGERADNPPEGWVTLYLKMFEYGLRLPLHPF

Query:  AQEFLNRTGLAPTQVAPNGWGVIFALAIL-----------------------SVKRIAKKPGRYYICARKGVGGIVKGSTSIKGWVRKWFFASGEWLAKD
         QEFL RTGLAP QVAPNGWGVIFALAIL                         KRIAKKPGR+Y+CARKG GGIVKG TSIKGWVRKWF+ASGEWLAKD
Subjt:  AQEFLNRTGLAPTQVAPNGWGVIFALAIL-----------------------SVKRIAKKPGRYYICARKGVGGIVKGSTSIKGWVRKWFFASGEWLAKD

Query:  ESGRPFFDVPIRFRNLLSIRPIPELTQASFDTLKYYNDRFPKGRKIGTLVTDKLLLDSGLLDYNPLVRPIEASRPR--------FSSSLKRKSKGRAHAL
        ESGR FFDVP RF NL+SIRP+PELTQASFDTLKYY +RFP+GRK+GTLVTD+LLL+SGLLDYNP VRPIE+SRP         F+S +KRKSKGRAHAL
Subjt:  ESGRPFFDVPIRFRNLLSIRPIPELTQASFDTLKYYNDRFPKGRKIGTLVTDKLLLDSGLLDYNPLVRPIEASRPR--------FSSSLKRKSKGRAHAL

Query:  KTVLSTEPTTSTAAQPSAQDNAGQSAELPILVTELDSVGEHSKEKRPRNESEALD
        +   S++P T     P+++D        P LV EL+S G  S+EKRPR+++EA+D
Subjt:  KTVLSTEPTTSTAAQPSAQDNAGQSAELPILVTELDSVGEHSKEKRPRNESEALD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]1.9e-9873.11Show/hide
Query:  ICARKGVGGIVKGSTSIKGWVRKWFFASGEWLAKDESGRPFFDVPIRFRNLLSIRPIPELTQASFDTLKYYNDRFPKGRKIGTLVTDKLLLDSGLLDYNP
        +CARKG GGIVKG TSIKGWV KWFFASGEWLAKDESGR FFDVP RF NL+SI+ IPEL QA+FDTLK+Y D FP+ RKI TLVTDKLLL+SGLLDYNP
Subjt:  ICARKGVGGIVKGSTSIKGWVRKWFFASGEWLAKDESGRPFFDVPIRFRNLLSIRPIPELTQASFDTLKYYNDRFPKGRKIGTLVTDKLLLDSGLLDYNP

Query:  LVRPIEASRPR--------FSSSLKRKSKGRAHALKTVLSTEPTTSTAAQPSAQDNAGQSAELPILVTELDSVGEHSKEKRPRNESEALDISPLNKVRGE
        LVR IEASRP         F+ S+KRKSKGRAHALKTV+ TEP T T  +  AQ N+G S+ +P  V ELD  G  S EKR R ESEALD+SPLN+VRGE
Subjt:  LVRPIEASRPR--------FSSSLKRKSKGRAHALKTVLSTEPTTSTAAQPSAQDNAGQSAELPILVTELDSVGEHSKEKRPRNESEALDISPLNKVRGE

Query:  SPLKRRRKKKKTTSSFEVGSRGSLPTSHTDLVEDPEARMRGTSDVPMRFWVEPSSSGVKDQVSR
        SPL+RRRKKKKT+SS E G+RG+LPTSH DLV+DPEARMRGTS+V MRF +EPSSSGVKDQVSR
Subjt:  SPLKRRRKKKKTTSSFEVGSRGSLPTSHTDLVEDPEARMRGTSDVPMRFWVEPSSSGVKDQVSR

TrEMBL top hitse value%identityAlignment
A0A6J1C8K9 uncharacterized protein LOC1110092982.0e-7761.83Show/hide
Query:  ICARKGVGGIVKGSTSIKGWVRKWFFASGEWLAKDESGRPFFDVPIRFRNLLSIRPIPELTQASFDTLKYYNDRFPKGRKIGTLVTDKLLLDSGLLDYNP
        +CARKG  GIVKG TSIKGWVRKWF+ASGEWLAKDES              ++IRP+PELTQASFDTLKYY + FP+GRK+GTLVTDKLLL+SGLLDYNP
Subjt:  ICARKGVGGIVKGSTSIKGWVRKWFFASGEWLAKDESGRPFFDVPIRFRNLLSIRPIPELTQASFDTLKYYNDRFPKGRKIGTLVTDKLLLDSGLLDYNP

Query:  LVRPIEASRPR--------FSSSLKRKSKGRAHALKTVLSTEPTTSTAAQPSAQDNAGQSAELPILVTELDSVGEHSKEKRPRNESEALDISPL-NKVRG
         VRPIE+SRP         F+S++KRKSKG+AHAL+   S++P T     P+++D        P  V EL+S    S+EKRPR+++EA+D+SPL  +VR 
Subjt:  LVRPIEASRPR--------FSSSLKRKSKGRAHALKTVLSTEPTTSTAAQPSAQDNAGQSAELPILVTELDSVGEHSKEKRPRNESEALDISPL-NKVRG

Query:  ESPLKRRRKKKKTTSSFEVGSRGSLPTSHTDLVEDPEARMRGTSDVPMRFWVEPSSSGVKDQ
        E PLKRRRKKKKTTS  EVG+RG LP S  D V+DPEARM GT DV  RF VEPSSSGV+DQ
Subjt:  ESPLKRRRKKKKTTSSFEVGSRGSLPTSHTDLVEDPEARMRGTSDVPMRFWVEPSSSGVKDQ

A0A6J1CR42 uncharacterized protein LOC1110138262.6e-9065.67Show/hide
Query:  MFEYGLRLPLHPFAQEFLNRTGLAPTQVAPNGWGVIFALAIL-----------------------SVKRIAKKPGRYYICARKGVGGIVKGSTSIKGWVR
        MFEYGLRLPLHPF QEFL RTGLAP QVAPNGWGVIFALAIL                         KRIAKKPGR+Y+CARKG GGIVKG TSIKGWVR
Subjt:  MFEYGLRLPLHPFAQEFLNRTGLAPTQVAPNGWGVIFALAIL-----------------------SVKRIAKKPGRYYICARKGVGGIVKGSTSIKGWVR

Query:  KWFFASGEWLAKDESGRPFFDVPIRFRNLLSIRPIPELTQASFDTLKYYNDRFPKGRKIGTLVTDKLLLDSGLLDYNPLVRPIEASRP--------RFSS
        KWF+ASGEWLAKDESGR FFDVP RF NL+SIRP+PELTQASFDTLKYY +RFP+GRK+GTLVTD+LLL+SGLLDYNP VRPIE SRP        RF+S
Subjt:  KWFFASGEWLAKDESGRPFFDVPIRFRNLLSIRPIPELTQASFDTLKYYNDRFPKGRKIGTLVTDKLLLDSGLLDYNPLVRPIEASRP--------RFSS

Query:  SLKRKSKGRAHALKTVLSTEPTTSTAAQPSAQDNAGQSAELPILVTELDSVGEHSKEKRPRNESEALD
         +KRKSKGRAHAL+   S++P T     P+++D        P  V EL+S G  S+EKRPR+++EA+D
Subjt:  SLKRKSKGRAHALKTVLSTEPTTSTAAQPSAQDNAGQSAELPILVTELDSVGEHSKEKRPRNESEALD

A0A6J1DWD2 uncharacterized protein LOC1110246803.7e-7675.53Show/hide
Query:  MFEYGLRLPLHPFAQEFLNRTGLAPTQVAPNGWGVIFALAIL-----------------------SVKRIAKKPGRYYICARKGVGGIVKGSTSIKGWVR
        MFEYGLRLPLHPF QEFL RTGLAP QVAPNGWGVIFALAIL                         KRIAKKPGR+Y+CARKG GGIVKG TSIKGWVR
Subjt:  MFEYGLRLPLHPFAQEFLNRTGLAPTQVAPNGWGVIFALAIL-----------------------SVKRIAKKPGRYYICARKGVGGIVKGSTSIKGWVR

Query:  KWFFASGEWLAKDESGRPFFDVPIRFRNLLSIRPIPELTQASFDTLKYYNDRFPKGRKIGTLVTDKLLLDSGLLDYNPLVRPIEASRP
        KWF+ASGEWLAKDESGR FFDVP RF NL+SIRP+PELTQASFDTLKYY +RFP+GRK+GTLVTD+LLL+SGLLDYNP VRPIE+SRP
Subjt:  KWFFASGEWLAKDESGRPFFDVPIRFRNLLSIRPIPELTQASFDTLKYYNDRFPKGRKIGTLVTDKLLLDSGLLDYNPLVRPIEASRP

A0A6J1DXS5 uncharacterized protein LOC1110255021.4e-13167.89Show/hide
Query:  SDFGEDLARRLEFELEEVENFRFFDDGENSGASTSGQGLEYPSKMPEHYLGPLHRGFKILDDILLRIPEEGERADNPPEGWVTLYLKMFEYGLRLPLHPF
        S+   DLARRLE +LEE+EN R  DDGE+S ASTSGQGLEYPS++PEHYLG L RGF I ++ILLR+PEEGERADNPPEGWVTLY KMFEYGLRLPLHPF
Subjt:  SDFGEDLARRLEFELEEVENFRFFDDGENSGASTSGQGLEYPSKMPEHYLGPLHRGFKILDDILLRIPEEGERADNPPEGWVTLYLKMFEYGLRLPLHPF

Query:  AQEFLNRTGLAPTQVAPNGWGVIFALAIL-----------------------SVKRIAKKPGRYYICARKGVGGIVKGSTSIKGWVRKWFFASGEWLAKD
         QEFL RTGLAP QVAPNGWGVIFALAIL                         KRIAKKPGR+Y+CARKG GGIVKG TSIKGWVRKWF+ASGEWLAKD
Subjt:  AQEFLNRTGLAPTQVAPNGWGVIFALAIL-----------------------SVKRIAKKPGRYYICARKGVGGIVKGSTSIKGWVRKWFFASGEWLAKD

Query:  ESGRPFFDVPIRFRNLLSIRPIPELTQASFDTLKYYNDRFPKGRKIGTLVTDKLLLDSGLLDYNPLVRPIEASRPR--------FSSSLKRKSKGRAHAL
        ESGR FFDVP RF NL+SIRP+PELTQASFDTLKYY +RFP+GRK+GTLVTD+LLL+SGLLDYNP VRPIE+SRP         F+S +KRKSKGRAHAL
Subjt:  ESGRPFFDVPIRFRNLLSIRPIPELTQASFDTLKYYNDRFPKGRKIGTLVTDKLLLDSGLLDYNPLVRPIEASRPR--------FSSSLKRKSKGRAHAL

Query:  KTVLSTEPTTSTAAQPSAQDNAGQSAELPILVTELDSVGEHSKEKRPRNESEALD
        +   S++P T     P+++D        P LV EL+S G  S+EKRPR+++EA+D
Subjt:  KTVLSTEPTTSTAAQPSAQDNAGQSAELPILVTELDSVGEHSKEKRPRNESEALD

A0A6J1DZB3 uncharacterized protein LOC1110256659.0e-9973.11Show/hide
Query:  ICARKGVGGIVKGSTSIKGWVRKWFFASGEWLAKDESGRPFFDVPIRFRNLLSIRPIPELTQASFDTLKYYNDRFPKGRKIGTLVTDKLLLDSGLLDYNP
        +CARKG GGIVKG TSIKGWV KWFFASGEWLAKDESGR FFDVP RF NL+SI+ IPEL QA+FDTLK+Y D FP+ RKI TLVTDKLLL+SGLLDYNP
Subjt:  ICARKGVGGIVKGSTSIKGWVRKWFFASGEWLAKDESGRPFFDVPIRFRNLLSIRPIPELTQASFDTLKYYNDRFPKGRKIGTLVTDKLLLDSGLLDYNP

Query:  LVRPIEASRPR--------FSSSLKRKSKGRAHALKTVLSTEPTTSTAAQPSAQDNAGQSAELPILVTELDSVGEHSKEKRPRNESEALDISPLNKVRGE
        LVR IEASRP         F+ S+KRKSKGRAHALKTV+ TEP T T  +  AQ N+G S+ +P  V ELD  G  S EKR R ESEALD+SPLN+VRGE
Subjt:  LVRPIEASRPR--------FSSSLKRKSKGRAHALKTVLSTEPTTSTAAQPSAQDNAGQSAELPILVTELDSVGEHSKEKRPRNESEALDISPLNKVRGE

Query:  SPLKRRRKKKKTTSSFEVGSRGSLPTSHTDLVEDPEARMRGTSDVPMRFWVEPSSSGVKDQVSR
        SPL+RRRKKKKT+SS E G+RG+LPTSH DLV+DPEARMRGTS+V MRF +EPSSSGVKDQVSR
Subjt:  SPLKRRRKKKKTTSSFEVGSRGSLPTSHTDLVEDPEARMRGTSDVPMRFWVEPSSSGVKDQVSR

SwissProt top hitse value%identityAlignment
Q9LEX8 Uncharacterized protein At3g60930, chloroplastic3.3e-0527.33Show/hide
Query:  LDDILLRIPEEGERADNPPEGWVTLYLKMFEYG--LRLPLHPFAQEFLNRTGLAPTQVAPNGWGVIFALAI-----------------LSVKRIAK-KPG
        L  + LR+P   ERAD+PP G+ TLY + F YG  L LP+     E++    +A +Q+       +  + I                 L ++R+ K +  
Subjt:  LDDILLRIPEEGERADNPPEGWVTLYLKMFEYG--LRLPLHPFAQEFLNRTGLAPTQVAPNGWGVIFALAI-----------------LSVKRIAK-KPG

Query:  RYYICARKGVGGIVKGSTSIKGWVRKWFF-ASGEWLAKDESGRPFFDVPIRFRNLLSIRPIPELTQASFDTL
        RYYI   KG   I    +  + +   +FF A  + + +D  G       I  R L  + PIP+   ++F  L
Subjt:  RYYICARKGVGGIVKGSTSIKGWVRKWFF-ASGEWLAKDESGRPFFDVPIRFRNLLSIRPIPELTQASFDTL

Arabidopsis top hitse value%identityAlignment
AT2G15420.1 myosin heavy chain-related2.5e-0831.41Show/hide
Query:  EHYLGPLHRGFKILDD-------ILLRIPEEGE--------RADNPPEGWVTLYLKMF-EYGLRLPLHPFAQEFLNRTGLAPTQVAPNGWGVIFALAILS
        E  + PL  G  + DD       IL   P E E        R   PPEG++ LY   F   GL  PL  F  E+  R  +A +Q+          LAIL 
Subjt:  EHYLGPLHRGFKILDD-------ILLRIPEEGE--------RADNPPEGWVTLYLKMF-EYGLRLPLHPFAQEFLNRTGLAPTQVAPNGWGVIFALAILS

Query:  VK-----------------RIAKKPGRYYICARKGVGGIVKGSTS-IKGWVRKWFF
         +                 R+ + PG YY  A K    IV G+ S I GW R++FF
Subjt:  VK-----------------RIAKKPGRYYICARKGVGGIVKGSTS-IKGWVRKWFF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTCTTCTTCCAATAGTGGTAGCTTAGGTAGCGCAGGTCGGACAATAAGCAGTTCGCCCCCTCAGCCAAGTGATTTTGGGGAGGACTTAGCTCGTAGGTTA
GAGTTCGAACTGGAAGAGGTTGAGAATTTTAGGTTTTTCGATGATGGGGAAAACAGTGGGGCTTCCACCTCGGGCCAGGGTTTGGAATACCCTTCAAAAATGCCC
GAGCACTATCTCGGACCCCTTCATAGGGGGTTTAAAATTCTGGATGACATCCTCCTTAGAATTCCGGAGGAAGGGGAAAGAGCTGACAACCCTCCAGAGGGGTGG
GTCACTCTTTACTTGAAAATGTTCGAGTACGGCCTCAGACTTCCTCTTCACCCTTTTGCCCAGGAGTTCCTAAACCGAACTGGACTAGCTCCTACTCAAGTGGCC
CCCAATGGATGGGGTGTCATTTTTGCCTTGGCCATCCTCTCTGTCAAGAGAATAGCTAAGAAGCCAGGTCGGTACTATATATGCGCAAGGAAAGGCGTAGGTGGA
ATAGTTAAGGGGTCAACCTCTATCAAAGGATGGGTGAGGAAGTGGTTCTTTGCCTCTGGAGAATGGCTGGCAAAGGACGAGTCTGGTCGTCCATTCTTTGACGTT
CCCATTAGGTTTAGGAATTTATTGTCAATCAGACCAATTCCCGAGCTTACTCAAGCTTCCTTTGATACTCTTAAGTATTACAATGATCGTTTTCCAAAGGGCAGG
AAGATCGGAACCCTGGTGACCGACAAGCTGCTTCTTGACTCCGGGTTGTTAGATTACAACCCTTTGGTGCGTCCGATCGAAGCTTCAAGGCCAAGATTCTCCAGC
AGCTTGAAGCGTAAGTCTAAGGGTCGTGCTCACGCCCTCAAGACTGTCCTAAGCACGGAGCCCACAACTTCTACTGCCGCTCAACCTTCAGCTCAGGACAATGCT
GGGCAATCCGCTGAGCTTCCCATTCTAGTGACCGAGCTGGACTCTGTCGGGGAGCACTCCAAAGAGAAGCGCCCAAGGAATGAGTCTGAGGCACTGGACATATCT
CCCCTGAACAAGGTGAGAGGAGAGTCTCCTTTGAAGAGGAGAAGGAAGAAGAAGAAGACCACCTCCTCCTTCGAGGTCGGATCTCGTGGGTCCCTGCCCACGAGC
CATACTGATTTGGTGGAAGACCCCGAAGCTAGGATGAGGGGGACGTCCGATGTGCCAATGCGGTTCTGGGTTGAACCGTCGAGCTCCGGGGTGAAGGACCAGGTG
TCCCGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCGTCTTCTTCCAATAGTGGTAGCTTAGGTAGCGCAGGTCGGACAATAAGCAGTTCGCCCCCTCAGCCAAGTGATTTTGGGGAGGACTTAGCTCGTAGGTTA
GAGTTCGAACTGGAAGAGGTTGAGAATTTTAGGTTTTTCGATGATGGGGAAAACAGTGGGGCTTCCACCTCGGGCCAGGGTTTGGAATACCCTTCAAAAATGCCC
GAGCACTATCTCGGACCCCTTCATAGGGGGTTTAAAATTCTGGATGACATCCTCCTTAGAATTCCGGAGGAAGGGGAAAGAGCTGACAACCCTCCAGAGGGGTGG
GTCACTCTTTACTTGAAAATGTTCGAGTACGGCCTCAGACTTCCTCTTCACCCTTTTGCCCAGGAGTTCCTAAACCGAACTGGACTAGCTCCTACTCAAGTGGCC
CCCAATGGATGGGGTGTCATTTTTGCCTTGGCCATCCTCTCTGTCAAGAGAATAGCTAAGAAGCCAGGTCGGTACTATATATGCGCAAGGAAAGGCGTAGGTGGA
ATAGTTAAGGGGTCAACCTCTATCAAAGGATGGGTGAGGAAGTGGTTCTTTGCCTCTGGAGAATGGCTGGCAAAGGACGAGTCTGGTCGTCCATTCTTTGACGTT
CCCATTAGGTTTAGGAATTTATTGTCAATCAGACCAATTCCCGAGCTTACTCAAGCTTCCTTTGATACTCTTAAGTATTACAATGATCGTTTTCCAAAGGGCAGG
AAGATCGGAACCCTGGTGACCGACAAGCTGCTTCTTGACTCCGGGTTGTTAGATTACAACCCTTTGGTGCGTCCGATCGAAGCTTCAAGGCCAAGATTCTCCAGC
AGCTTGAAGCGTAAGTCTAAGGGTCGTGCTCACGCCCTCAAGACTGTCCTAAGCACGGAGCCCACAACTTCTACTGCCGCTCAACCTTCAGCTCAGGACAATGCT
GGGCAATCCGCTGAGCTTCCCATTCTAGTGACCGAGCTGGACTCTGTCGGGGAGCACTCCAAAGAGAAGCGCCCAAGGAATGAGTCTGAGGCACTGGACATATCT
CCCCTGAACAAGGTGAGAGGAGAGTCTCCTTTGAAGAGGAGAAGGAAGAAGAAGAAGACCACCTCCTCCTTCGAGGTCGGATCTCGTGGGTCCCTGCCCACGAGC
CATACTGATTTGGTGGAAGACCCCGAAGCTAGGATGAGGGGGACGTCCGATGTGCCAATGCGGTTCTGGGTTGAACCGTCGAGCTCCGGGGTGAAGGACCAGGTG
TCCCGTTGA
Protein sequenceShow/hide protein sequence
MSSSSNSGSLGSAGRTISSSPPQPSDFGEDLARRLEFELEEVENFRFFDDGENSGASTSGQGLEYPSKMPEHYLGPLHRGFKILDDILLRIPEEGERADNPPEGW
VTLYLKMFEYGLRLPLHPFAQEFLNRTGLAPTQVAPNGWGVIFALAILSVKRIAKKPGRYYICARKGVGGIVKGSTSIKGWVRKWFFASGEWLAKDESGRPFFDV
PIRFRNLLSIRPIPELTQASFDTLKYYNDRFPKGRKIGTLVTDKLLLDSGLLDYNPLVRPIEASRPRFSSSLKRKSKGRAHALKTVLSTEPTTSTAAQPSAQDNA
GQSAELPILVTELDSVGEHSKEKRPRNESEALDISPLNKVRGESPLKRRRKKKKTTSSFEVGSRGSLPTSHTDLVEDPEARMRGTSDVPMRFWVEPSSSGVKDQV
SR