CuGenDBv2

Moc01g14950 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc01g14950
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr1:9337323..9339682
RNA-Seq Expression: Moc01g14950
Synteny: Moc01g14950
Gene Ontology terms: NA
InterPro domains: NA
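
The genome location above is written as `chromosome:start..end`. A minimal Python sketch for reading that notation follows. It assumes the coordinates are 1-based and inclusive, which is a common convention but is not stated on this page; `Location` and `parse_location` are hypothetical helper names introduced only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates assumed 1-based and inclusive."""
    chrom: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Inclusive length of the interval under the 1-based assumption.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'chrom:start..end' string into a Location."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return Location(chrom, start, end)


loc = parse_location("chr1:9337323..9339682")
print(loc.chrom, loc.start, loc.end, loc.span)  # chr1 9337323 9339682 2360
```

Under that assumption the gene spans 2360 bp on chr1.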


Homology
GenBank top hits (e-value, % identity, alignment)
XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia] | e-value: 5.9e-58 | identity: 45.17%
Query:  MFEYGFYLSVHPLVQEFLVRTGLAPVQVAPNGW----------------------------VACFEVKRISRKPGRYYMCARKDAGGILKGPTSIKKWIG
        MFEYG  L +HP VQEFL RTGLAP QVAPNGW                            +ACFE KRI++KPGR+YMCARK AGGI+KGPTSIK W+ 
Subjt:  MFEYGFYLSVHPLVQEFLVRTGLAPVQVAPNGW----------------------------VACFEVKRISRKPGRYYMCARKDAGGILKGPTSIKKWIG

Query:  KWFFTSGGWLAKNEFNLPFYNVSCRFGNLVAIRPIPQPSEPTFNVLKFFKSTFKSGKQISTLITDKLLLASGLLDYNPLLSSPEAQRPNSELAMVCGFSQ
        KWF+ SG WLAK+E    F++V  RFGNLV+IRP+P+ ++ +F+ LK++K  F  G+++ TL+TD+LLL SGLLDYNP +   E  RPNS LAMVC F+ 
Subjt:  KWFFTSGGWLAKNEFNLPFYNVSCRFGNLVAIRPIPQPSEPTFNVLKFFKSTFKSGKQISTLITDKLLLASGLLDYNPLLSSPEAQRPNSELAMVCGFSQ

Query:  DVTRKN---PRARQTSMAKEASSPAVANPPTK--VEVVEVERGVASPVSPKKSRKKKRKAHHSKDKVREVRAEQRVNPIGDLIESRDSCC
         V RK+     A + + + +  +PAV  P ++    V+E+E       S   SR+K+      +D+   V A+     +  L E  D  C
Subjt:  DVTRKN---PRARQTSMAKEASSPAVANPPTK--VEVVEVERGVASPVSPKKSRKKKRKAHHSKDKVREVRAEQRVNPIGDLIESRDSCC

XP_022158122.1 uncharacterized protein LOC111024680 [Momordica charantia] | e-value: 1.2e-53 | identity: 54.69%
Query:  MFEYGFYLSVHPLVQEFLVRTGLAPVQVAPNGW----------------------------VACFEVKRISRKPGRYYMCARKDAGGILKGPTSIKKWIG
        MFEYG  L +HP VQEFL RTGLAP QVAPNGW                            +ACFE KRI++KPGR+YMCARK AGGI+KGPTSIK W+ 
Subjt:  MFEYGFYLSVHPLVQEFLVRTGLAPVQVAPNGW----------------------------VACFEVKRISRKPGRYYMCARKDAGGILKGPTSIKKWIG

Query:  KWFFTSGGWLAKNEFNLPFYNVSCRFGNLVAIRPIPQPSEPTFNVLKFFKSTFKSGKQISTLITDKLLLASGLLDYNPLLSSPEAQRPNSEL
        KWF+ SG WLAK+E    F++V  RFGNLV+IRP+P+ ++ +F+ LK++K  F  G+++ TL+TD+LLL SGLLDYNP +   E+ RPNSEL
Subjt:  KWFFTSGGWLAKNEFNLPFYNVSCRFGNLVAIRPIPQPSEPTFNVLKFFKSTFKSGKQISTLITDKLLLASGLLDYNPLLSSPEAQRPNSEL

XP_022158650.1 uncharacterized protein LOC111025108 [Momordica charantia] | e-value: 8.8e-54 | identity: 54.64%
Query:  MFEYGFYLSVHPLVQEFLVRTGLAPVQVAPNGW----------------------------VACFEVKRISRKPGRYYMCARKDAGGILKGPTSIKKWIG
        MFEYG  L +HP VQEFL RTGLAP QVAPNGW                            +ACFE KRI++KPGR+YMCARK A GI+KGPTSIK W+ 
Subjt:  MFEYGFYLSVHPLVQEFLVRTGLAPVQVAPNGW----------------------------VACFEVKRISRKPGRYYMCARKDAGGILKGPTSIKKWIG

Query:  KWFFTSGGWLAKNEFNLPFYNVSCRFGNLVAIRPIPQPSEPTFNVLKFFKSTFKSGKQISTLITDKLLLASGLLDYNPLLSSPEAQRPNSELAM
        KWF+ SG WLAK+E    F++V  RFGNLV+IRP+P+ ++ +F+ LK++K  F  G+++ TL+TDKLLL SGLLDYNP +   E+ RPNSEL M
Subjt:  KWFFTSGGWLAKNEFNLPFYNVSCRFGNLVAIRPIPQPSEPTFNVLKFFKSTFKSGKQISTLITDKLLLASGLLDYNPLLSSPEAQRPNSELAM

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia] | e-value: 1.9e-88 | identity: 50.88%
Query:  EFANRLDSELDEEIDNFRFSEDDGDDSNTLTSGQGLKYPFQMPENYLGPLRMRYSISDDIILRLPKEGERVDNPPEGCVTSYLKMFEYGFYLSVHPLVQE
        + A RL+S+L EEI+N R S DDG+DS+  TSGQGL+YP ++PE+YLG LR  ++I ++I+LRLP+EGER DNPPEG VT Y KMFEYG  L +HP VQE
Subjt:  EFANRLDSELDEEIDNFRFSEDDGDDSNTLTSGQGLKYPFQMPENYLGPLRMRYSISDDIILRLPKEGERVDNPPEGCVTSYLKMFEYGFYLSVHPLVQE

Query:  FLVRTGLAPVQVAPNGW----------------------------VACFEVKRISRKPGRYYMCARKDAGGILKGPTSIKKWIGKWFFTSGGWLAKNEFN
        FL RTGLAP QVAPNGW                            +ACFE KRI++KPGR+YMCARK AGGI+KGPTSIK W+ KWF+ SG WLAK+E  
Subjt:  FLVRTGLAPVQVAPNGW----------------------------VACFEVKRISRKPGRYYMCARKDAGGILKGPTSIKKWIGKWFFTSGGWLAKNEFN

Query:  LPFYNVSCRFGNLVAIRPIPQPSEPTFNVLKFFKSTFKSGKQISTLITDKLLLASGLLDYNPLLSSPEAQRPNSELAMVCGFSQDVTRKN---PRARQTS
          F++V  RFGNLV+IRP+P+ ++ +F+ LK++K  F  G+++ TL+TD+LLL SGLLDYNP +   E+ RPNSELAMVCGF+  V RK+     A + +
Subjt:  LPFYNVSCRFGNLVAIRPIPQPSEPTFNVLKFFKSTFKSGKQISTLITDKLLLASGLLDYNPLLSSPEAQRPNSELAMVCGFSQDVTRKN---PRARQTS

Query:  MAKEASSPAVANPPTKVEVVEVERGVASPVSPKKSRKKKRKA
         + + ++PAV  P ++   + +E   +   S +K  + + +A
Subjt:  MAKEASSPAVANPPTKVEVVEVERGVASPVSPKKSRKKKRKA

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia] | e-value: 1.9e-56 | identity: 34.88%
Query:  MCARKDAGGILKGPTSIKKWIGKWFFTSGGWLAKNEFNLPFYNVSCRFGNLVAIRPIPQPSEPTFNVLKFFKSTFKSGKQISTLITDKLLLASGLLDYNP
        MCARK  GGI+KGPTSIK W+GKWFF SG WLAK+E    F++V  RFGNLV+I+ IP+ ++ TF+ LK +K  F   ++I TL+TDKLLL SGLLDYNP
Subjt:  MCARKDAGGILKGPTSIKKWIGKWFFTSGGWLAKNEFNLPFYNVSCRFGNLVAIRPIPQPSEPTFNVLKFFKSTFKSGKQISTLITDKLLLASGLLDYNP

Query:  LLSSPEAQRPNSELAMVCGFSQDVTRKN---PRARQTSMAKEASSPAVANP------------PTKVEVVEV-----------ERGVASPVSP-------
        L+   EA RPNSELAMVCGF+  V RK+     A +T +  E  +P V               PT V  +++           E   A  VSP       
Subjt:  LLSSPEAQRPNSELAMVCGFSQDVTRKN---PRARQTSMAKEASSPAVANP------------PTKVEVVEV-----------ERGVASPVSP-------

Query:  ---KKSRKKKRKAHHSKDKVREVRAEQRVNPIGDLIESRDSCCHLSGHSYDEGRTGGTRPS-----------YCEGTGGFLGGTGG--------------
           ++ RKKK+ +  S+   R        + + D         ++      E  + G +             Y      F+   G               
Subjt:  ---KKSRKKKRKAHHSKDKVREVRAEQRVNPIGDLIESRDSCCHLSGHSYDEGRTGGTRPS-----------YCEGTGGFLGGTGG--------------

Query:  ----------CRCLGRELKKARAEVQAWKSISKAN--KAELKSAQAKV------------------AWHIENLRGTHAMAKCLEKEKFVLMKQNDDLECL
                      GRE   A+    ++ ++  A   K EL  AQ +V                    H  +LR  HA+ K LEKEKF L+K+ DDL  +
Subjt:  ----------CRCLGRELKKARAEVQAWKSISKAN--KAELKSAQAKV------------------AWHIENLRGTHAMAKCLEKEKFVLMKQNDDLECL

Query:  QEDLEGKLRAGDFEVAELKVKLE----LEESKLSHGSFWRNPFANTLTLTGDAGFRFLMNGVKEVTPELNLEL--IKLRYAQKWASGPNGTPDPEDFVDQ
         E+ +  +     E+ +LK +L     LEES   H  F  + FA   +   DAGF+FLM G+    P L ++L  +K +Y++KWASGPNGTPDP+  VD+
Subjt:  QEDLEGKLRAGDFEVAELKVKLE----LEESKLSHGSFWRNPFANTLTLTGDAGFRFLMNGVKEVTPELNLEL--IKLRYAQKWASGPNGTPDPEDFVDQ

Query:  RMEELESDVELNEGED
         + EL+SD    E ED
Subjt:  RMEELESDVELNEGED

TrEMBL top hits (e-value, % identity, alignment)
A0A6J1CR42 uncharacterized protein LOC111013826 | e-value: 2.9e-58 | identity: 45.17%
Query:  MFEYGFYLSVHPLVQEFLVRTGLAPVQVAPNGW----------------------------VACFEVKRISRKPGRYYMCARKDAGGILKGPTSIKKWIG
        MFEYG  L +HP VQEFL RTGLAP QVAPNGW                            +ACFE KRI++KPGR+YMCARK AGGI+KGPTSIK W+ 
Subjt:  MFEYGFYLSVHPLVQEFLVRTGLAPVQVAPNGW----------------------------VACFEVKRISRKPGRYYMCARKDAGGILKGPTSIKKWIG

Query:  KWFFTSGGWLAKNEFNLPFYNVSCRFGNLVAIRPIPQPSEPTFNVLKFFKSTFKSGKQISTLITDKLLLASGLLDYNPLLSSPEAQRPNSELAMVCGFSQ
        KWF+ SG WLAK+E    F++V  RFGNLV+IRP+P+ ++ +F+ LK++K  F  G+++ TL+TD+LLL SGLLDYNP +   E  RPNS LAMVC F+ 
Subjt:  KWFFTSGGWLAKNEFNLPFYNVSCRFGNLVAIRPIPQPSEPTFNVLKFFKSTFKSGKQISTLITDKLLLASGLLDYNPLLSSPEAQRPNSELAMVCGFSQ

Query:  DVTRKN---PRARQTSMAKEASSPAVANPPTK--VEVVEVERGVASPVSPKKSRKKKRKAHHSKDKVREVRAEQRVNPIGDLIESRDSCC
         V RK+     A + + + +  +PAV  P ++    V+E+E       S   SR+K+      +D+   V A+     +  L E  D  C
Subjt:  DVTRKN---PRARQTSMAKEASSPAVANPPTK--VEVVEVERGVASPVSPKKSRKKKRKAHHSKDKVREVRAEQRVNPIGDLIESRDSCC

A0A6J1DWD2 uncharacterized protein LOC111024680 | e-value: 5.6e-54 | identity: 54.69%
Query:  MFEYGFYLSVHPLVQEFLVRTGLAPVQVAPNGW----------------------------VACFEVKRISRKPGRYYMCARKDAGGILKGPTSIKKWIG
        MFEYG  L +HP VQEFL RTGLAP QVAPNGW                            +ACFE KRI++KPGR+YMCARK AGGI+KGPTSIK W+ 
Subjt:  MFEYGFYLSVHPLVQEFLVRTGLAPVQVAPNGW----------------------------VACFEVKRISRKPGRYYMCARKDAGGILKGPTSIKKWIG

Query:  KWFFTSGGWLAKNEFNLPFYNVSCRFGNLVAIRPIPQPSEPTFNVLKFFKSTFKSGKQISTLITDKLLLASGLLDYNPLLSSPEAQRPNSEL
        KWF+ SG WLAK+E    F++V  RFGNLV+IRP+P+ ++ +F+ LK++K  F  G+++ TL+TD+LLL SGLLDYNP +   E+ RPNSEL
Subjt:  KWFFTSGGWLAKNEFNLPFYNVSCRFGNLVAIRPIPQPSEPTFNVLKFFKSTFKSGKQISTLITDKLLLASGLLDYNPLLSSPEAQRPNSEL

A0A6J1DWF1 uncharacterized protein LOC111025108 | e-value: 4.3e-54 | identity: 54.64%
Query:  MFEYGFYLSVHPLVQEFLVRTGLAPVQVAPNGW----------------------------VACFEVKRISRKPGRYYMCARKDAGGILKGPTSIKKWIG
        MFEYG  L +HP VQEFL RTGLAP QVAPNGW                            +ACFE KRI++KPGR+YMCARK A GI+KGPTSIK W+ 
Subjt:  MFEYGFYLSVHPLVQEFLVRTGLAPVQVAPNGW----------------------------VACFEVKRISRKPGRYYMCARKDAGGILKGPTSIKKWIG

Query:  KWFFTSGGWLAKNEFNLPFYNVSCRFGNLVAIRPIPQPSEPTFNVLKFFKSTFKSGKQISTLITDKLLLASGLLDYNPLLSSPEAQRPNSELAM
        KWF+ SG WLAK+E    F++V  RFGNLV+IRP+P+ ++ +F+ LK++K  F  G+++ TL+TDKLLL SGLLDYNP +   E+ RPNSEL M
Subjt:  KWFFTSGGWLAKNEFNLPFYNVSCRFGNLVAIRPIPQPSEPTFNVLKFFKSTFKSGKQISTLITDKLLLASGLLDYNPLLSSPEAQRPNSELAM

A0A6J1DXS5 uncharacterized protein LOC111025502 | e-value: 9.1e-89 | identity: 50.88%
Query:  EFANRLDSELDEEIDNFRFSEDDGDDSNTLTSGQGLKYPFQMPENYLGPLRMRYSISDDIILRLPKEGERVDNPPEGCVTSYLKMFEYGFYLSVHPLVQE
        + A RL+S+L EEI+N R S DDG+DS+  TSGQGL+YP ++PE+YLG LR  ++I ++I+LRLP+EGER DNPPEG VT Y KMFEYG  L +HP VQE
Subjt:  EFANRLDSELDEEIDNFRFSEDDGDDSNTLTSGQGLKYPFQMPENYLGPLRMRYSISDDIILRLPKEGERVDNPPEGCVTSYLKMFEYGFYLSVHPLVQE

Query:  FLVRTGLAPVQVAPNGW----------------------------VACFEVKRISRKPGRYYMCARKDAGGILKGPTSIKKWIGKWFFTSGGWLAKNEFN
        FL RTGLAP QVAPNGW                            +ACFE KRI++KPGR+YMCARK AGGI+KGPTSIK W+ KWF+ SG WLAK+E  
Subjt:  FLVRTGLAPVQVAPNGW----------------------------VACFEVKRISRKPGRYYMCARKDAGGILKGPTSIKKWIGKWFFTSGGWLAKNEFN

Query:  LPFYNVSCRFGNLVAIRPIPQPSEPTFNVLKFFKSTFKSGKQISTLITDKLLLASGLLDYNPLLSSPEAQRPNSELAMVCGFSQDVTRKN---PRARQTS
          F++V  RFGNLV+IRP+P+ ++ +F+ LK++K  F  G+++ TL+TD+LLL SGLLDYNP +   E+ RPNSELAMVCGF+  V RK+     A + +
Subjt:  LPFYNVSCRFGNLVAIRPIPQPSEPTFNVLKFFKSTFKSGKQISTLITDKLLLASGLLDYNPLLSSPEAQRPNSELAMVCGFSQDVTRKN---PRARQTS

Query:  MAKEASSPAVANPPTKVEVVEVERGVASPVSPKKSRKKKRKA
         + + ++PAV  P ++   + +E   +   S +K  + + +A
Subjt:  MAKEASSPAVANPPTKVEVVEVERGVASPVSPKKSRKKKRKA

A0A6J1DZB3 uncharacterized protein LOC111025665 | e-value: 9.2e-57 | identity: 34.88%
Query:  MCARKDAGGILKGPTSIKKWIGKWFFTSGGWLAKNEFNLPFYNVSCRFGNLVAIRPIPQPSEPTFNVLKFFKSTFKSGKQISTLITDKLLLASGLLDYNP
        MCARK  GGI+KGPTSIK W+GKWFF SG WLAK+E    F++V  RFGNLV+I+ IP+ ++ TF+ LK +K  F   ++I TL+TDKLLL SGLLDYNP
Subjt:  MCARKDAGGILKGPTSIKKWIGKWFFTSGGWLAKNEFNLPFYNVSCRFGNLVAIRPIPQPSEPTFNVLKFFKSTFKSGKQISTLITDKLLLASGLLDYNP

Query:  LLSSPEAQRPNSELAMVCGFSQDVTRKN---PRARQTSMAKEASSPAVANP------------PTKVEVVEV-----------ERGVASPVSP-------
        L+   EA RPNSELAMVCGF+  V RK+     A +T +  E  +P V               PT V  +++           E   A  VSP       
Subjt:  LLSSPEAQRPNSELAMVCGFSQDVTRKN---PRARQTSMAKEASSPAVANP------------PTKVEVVEV-----------ERGVASPVSP-------

Query:  ---KKSRKKKRKAHHSKDKVREVRAEQRVNPIGDLIESRDSCCHLSGHSYDEGRTGGTRPS-----------YCEGTGGFLGGTGG--------------
           ++ RKKK+ +  S+   R        + + D         ++      E  + G +             Y      F+   G               
Subjt:  ---KKSRKKKRKAHHSKDKVREVRAEQRVNPIGDLIESRDSCCHLSGHSYDEGRTGGTRPS-----------YCEGTGGFLGGTGG--------------

Query:  ----------CRCLGRELKKARAEVQAWKSISKAN--KAELKSAQAKV------------------AWHIENLRGTHAMAKCLEKEKFVLMKQNDDLECL
                      GRE   A+    ++ ++  A   K EL  AQ +V                    H  +LR  HA+ K LEKEKF L+K+ DDL  +
Subjt:  ----------CRCLGRELKKARAEVQAWKSISKAN--KAELKSAQAKV------------------AWHIENLRGTHAMAKCLEKEKFVLMKQNDDLECL

Query:  QEDLEGKLRAGDFEVAELKVKLE----LEESKLSHGSFWRNPFANTLTLTGDAGFRFLMNGVKEVTPELNLEL--IKLRYAQKWASGPNGTPDPEDFVDQ
         E+ +  +     E+ +LK +L     LEES   H  F  + FA   +   DAGF+FLM G+    P L ++L  +K +Y++KWASGPNGTPDP+  VD+
Subjt:  QEDLEGKLRAGDFEVAELKVKLE----LEESKLSHGSFWRNPFANTLTLTGDAGFRFLMNGVKEVTPELNLEL--IKLRYAQKWASGPNGTPDPEDFVDQ

Query:  RMEELESDVELNEGED
         + EL+SD    E ED
Subjt:  RMEELESDVELNEGED

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGTCGACATCTTCGAGCTCCTCAAGTTCAAATAGTAATAGCTCGTCTAGTAGCGCAGCTCGGAGTCGGGATCCTTCACCCAAGAAGGTCGATTCCGTAGAGGAGTTTGC
GAATAGGTTAGACTCTGAACTAGATGAGGAGATAGATAACTTTAGGTTCTCCGAAGATGACGGAGATGATAGCAATACTTTGACTTCGGGCCAAGGTTTAAAATACCCTT
TCCAGATGCCTGAGAACTACCTCGGCCCCCTTCGTATGAGATATAGTATATCGGATGACATCATACTTAGACTTCCTAAGGAAGGGGAGCGAGTTGACAATCCTCCGGAG
GGGTGTGTAACCTCATACTTAAAGATGTTTGAGTACGGCTTCTACCTGTCCGTTCATCCTCTGGTGCAGGAGTTCCTAGTCCGAACTGGGCTAGCTCCTGTTCAAGTGGC
CCCTAATGGGTGGGTTGCCTGCTTTGAGGTTAAGCGAATCTCTAGGAAGCCGGGGAGGTATTACATGTGTGCTAGGAAGGACGCGGGAGGTATTTTGAAGGGTCCAACCT
CCATAAAGAAGTGGATTGGAAAGTGGTTCTTCACCTCCGGTGGGTGGCTGGCCAAAAACGAGTTCAACCTGCCTTTCTACAATGTCTCTTGTAGGTTTGGGAACTTAGTT
GCCATAAGGCCGATCCCTCAACCTTCCGAGCCGACCTTCAACGTTCTGAAGTTTTTCAAGAGCACGTTTAAGAGTGGAAAGCAGATCAGCACGCTCATAACTGATAAACT
TCTCCTCGCCTCGGGATTGCTTGACTACAATCCCCTTCTCTCCTCACCTGAAGCTCAGAGACCGAACTCAGAATTAGCAATGGTGTGTGGCTTTTCCCAGGATGTTACGC
GCAAGAATCCTCGTGCAAGACAGACCTCCATGGCTAAGGAGGCGTCGAGTCCTGCTGTTGCCAACCCTCCCACCAAGGTTGAGGTGGTTGAGGTCGAGCGCGGAGTAGCT
TCTCCGGTTTCTCCCAAGAAATCTAGGAAGAAGAAGCGTAAGGCCCATCACTCCAAGGACAAGGTGAGGGAGGTACGTGCAGAACAACGAGTCAATCCCATTGGGGACTT
AATCGAGTCCCGAGACTCATGCTGCCATCTATCAGGTCACTCTTATGATGAAGGCCGAACTGGAGGGACGCGACCTTCTTACTGTGAAGGAACGGGAGGCTTCCTTGGCG
GAACTGGAGGCTGTCGCTGCCTTGGAAGGGAGCTCAAAAAGGCTCGGGCTGAGGTCCAGGCATGGAAGTCTATTTCCAAGGCCAATAAGGCTGAGCTCAAGAGTGCCCAG
GCGAAGGTTGCTTGGCACATAGAGAATCTGAGGGGCACGCATGCTATGGCGAAATGCCTCGAAAAGGAGAAGTTTGTGCTGATGAAGCAGAACGACGACCTCGAGTGCCT
CCAAGAAGACCTGGAGGGCAAGCTAAGGGCCGGTGACTTCGAGGTAGCAGAATTGAAGGTTAAGCTAGAGCTCGAAGAGTCCAAGCTTAGCCACGGGTCCTTCTGGAGGA
ATCCTTTCGCAAACACCCTGACTTTGACGGGTGACGCTGGCTTCAGGTTCCTGATGAATGGGGTCAAGGAAGTGACTCCCGAGCTTAACCTTGAGCTCATCAAGCTCAGA
TATGCCCAGAAGTGGGCTTCAGGTCCCAACGGGACTCCTGACCCGGAAGACTTTGTGGATCAACGCATGGAGGAGCTCGAATCTGACGTCGAGCTCAACGAGGGGGAGGA
CACTGGCCCTTCTTCCTAA
mRNA sequence
ATGTCGACATCTTCGAGCTCCTCAAGTTCAAATAGTAATAGCTCGTCTAGTAGCGCAGCTCGGAGTCGGGATCCTTCACCCAAGAAGGTCGATTCCGTAGAGGAGTTTGC
GAATAGGTTAGACTCTGAACTAGATGAGGAGATAGATAACTTTAGGTTCTCCGAAGATGACGGAGATGATAGCAATACTTTGACTTCGGGCCAAGGTTTAAAATACCCTT
TCCAGATGCCTGAGAACTACCTCGGCCCCCTTCGTATGAGATATAGTATATCGGATGACATCATACTTAGACTTCCTAAGGAAGGGGAGCGAGTTGACAATCCTCCGGAG
GGGTGTGTAACCTCATACTTAAAGATGTTTGAGTACGGCTTCTACCTGTCCGTTCATCCTCTGGTGCAGGAGTTCCTAGTCCGAACTGGGCTAGCTCCTGTTCAAGTGGC
CCCTAATGGGTGGGTTGCCTGCTTTGAGGTTAAGCGAATCTCTAGGAAGCCGGGGAGGTATTACATGTGTGCTAGGAAGGACGCGGGAGGTATTTTGAAGGGTCCAACCT
CCATAAAGAAGTGGATTGGAAAGTGGTTCTTCACCTCCGGTGGGTGGCTGGCCAAAAACGAGTTCAACCTGCCTTTCTACAATGTCTCTTGTAGGTTTGGGAACTTAGTT
GCCATAAGGCCGATCCCTCAACCTTCCGAGCCGACCTTCAACGTTCTGAAGTTTTTCAAGAGCACGTTTAAGAGTGGAAAGCAGATCAGCACGCTCATAACTGATAAACT
TCTCCTCGCCTCGGGATTGCTTGACTACAATCCCCTTCTCTCCTCACCTGAAGCTCAGAGACCGAACTCAGAATTAGCAATGGTGTGTGGCTTTTCCCAGGATGTTACGC
GCAAGAATCCTCGTGCAAGACAGACCTCCATGGCTAAGGAGGCGTCGAGTCCTGCTGTTGCCAACCCTCCCACCAAGGTTGAGGTGGTTGAGGTCGAGCGCGGAGTAGCT
TCTCCGGTTTCTCCCAAGAAATCTAGGAAGAAGAAGCGTAAGGCCCATCACTCCAAGGACAAGGTGAGGGAGGTACGTGCAGAACAACGAGTCAATCCCATTGGGGACTT
AATCGAGTCCCGAGACTCATGCTGCCATCTATCAGGTCACTCTTATGATGAAGGCCGAACTGGAGGGACGCGACCTTCTTACTGTGAAGGAACGGGAGGCTTCCTTGGCG
GAACTGGAGGCTGTCGCTGCCTTGGAAGGGAGCTCAAAAAGGCTCGGGCTGAGGTCCAGGCATGGAAGTCTATTTCCAAGGCCAATAAGGCTGAGCTCAAGAGTGCCCAG
GCGAAGGTTGCTTGGCACATAGAGAATCTGAGGGGCACGCATGCTATGGCGAAATGCCTCGAAAAGGAGAAGTTTGTGCTGATGAAGCAGAACGACGACCTCGAGTGCCT
CCAAGAAGACCTGGAGGGCAAGCTAAGGGCCGGTGACTTCGAGGTAGCAGAATTGAAGGTTAAGCTAGAGCTCGAAGAGTCCAAGCTTAGCCACGGGTCCTTCTGGAGGA
ATCCTTTCGCAAACACCCTGACTTTGACGGGTGACGCTGGCTTCAGGTTCCTGATGAATGGGGTCAAGGAAGTGACTCCCGAGCTTAACCTTGAGCTCATCAAGCTCAGA
TATGCCCAGAAGTGGGCTTCAGGTCCCAACGGGACTCCTGACCCGGAAGACTTTGTGGATCAACGCATGGAGGAGCTCGAATCTGACGTCGAGCTCAACGAGGGGGAGGA
CACTGGCCCTTCTTCCTAA
Protein sequence
MSTSSSSSSSNSNSSSSSAARSRDPSPKKVDSVEEFANRLDSELDEEIDNFRFSEDDGDDSNTLTSGQGLKYPFQMPENYLGPLRMRYSISDDIILRLPKEGERVDNPPE
GCVTSYLKMFEYGFYLSVHPLVQEFLVRTGLAPVQVAPNGWVACFEVKRISRKPGRYYMCARKDAGGILKGPTSIKKWIGKWFFTSGGWLAKNEFNLPFYNVSCRFGNLV
AIRPIPQPSEPTFNVLKFFKSTFKSGKQISTLITDKLLLASGLLDYNPLLSSPEAQRPNSELAMVCGFSQDVTRKNPRARQTSMAKEASSPAVANPPTKVEVVEVERGVA
SPVSPKKSRKKKRKAHHSKDKVREVRAEQRVNPIGDLIESRDSCCHLSGHSYDEGRTGGTRPSYCEGTGGFLGGTGGCRCLGRELKKARAEVQAWKSISKANKAELKSAQ
AKVAWHIENLRGTHAMAKCLEKEKFVLMKQNDDLECLQEDLEGKLRAGDFEVAELKVKLELEESKLSHGSFWRNPFANTLTLTGDAGFRFLMNGVKEVTPELNLELIKLR
YAQKWASGPNGTPDPEDFVDQRMEELESDVELNEGEDTGPSS
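
The CDS and protein sequences listed above can be cross-checked by translating the CDS codon by codon. The Python sketch below assumes the standard genetic code (NCBI translation table 1), which the page does not state explicitly. `translate` is a hypothetical helper, and the two inputs are short in-frame fragments copied from the start and end of the CDS rather than the full sequence.

```python
# Standard genetic code (assumed: NCBI translation table 1), laid out in
# the conventional T, C, A, G order for each of the three codon positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate an in-frame DNA coding sequence into one-letter amino acids.

    Whitespace and line breaks are ignored, so a sequence pasted across
    several lines can be passed directly. Codons containing characters
    other than T, C, A, G are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if amino_acid == "*" and to_stop:
            break  # stop codon: end of the open reading frame
        protein.append(amino_acid)
    return "".join(protein)


# First 30 and last 24 nucleotides of the CDS listed on this page.
cds_start = "ATGTCGACATCTTCGAGCTCCTCAAGTTCA"
cds_end = "GAGGACACTGGCCCTTCTTCCTAA"

print(translate(cds_start))  # MSTSSSSSSS
print(translate(cds_end))    # EDTGPSS
```

Both fragments agree with the corresponding ends of the listed protein sequence, which begins with MSTSSSSSSS and ends with EDTGPSS.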