CuGenDBv2

MS003004 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS003004
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Coiled-coil domain-containing protein SCD2
Genome location: scaffold595_1:951328..952582
RNA-Seq Expression: MS003004
Synteny: MS003004
Gene Ontology terms:
GO:0000724 - double-strand break repair via homologous recombination (biological process)
GO:0000911 - cytokinesis by cell plate formation (biological process)
GO:0097196 - Shu complex (cellular component)
GO:0003697 - single-stranded DNA binding (molecular function)
InterPro domains: IPR027417 - P-loop containing nucleoside triphosphate hydrolase
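
The genome location above follows a `seqid:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state this), the span of the gene on the scaffold could be computed as follows. The helper name `parse_location` is hypothetical and not part of any database API.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end).

    Assumption: coordinates are 1-based and inclusive.
    """
    match = re.fullmatch(r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match["seqid"], int(match["start"]), int(match["end"])


seqid, start, end = parse_location("scaffold595_1:951328..952582")
span = end - start + 1  # inclusive span on the scaffold, in base pairs
print(seqid, start, end, span)
```

Under that assumption the locus spans more bases than the CDS listed further down the page contains, which would be consistent with the gene model including non-coding sequence such as introns.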


Homology
GenBank top hits | e value | % identity | Alignment
KAG7025230.1 hypothetical protein SDJN02_11725 [Cucurbita argyrosperma subsp. argyrosperma] | 2.1e-118 | 83.01
Query:  MERFFSVRQIHSHCDSSHSIMLLSGPPSCGKTSLLFQFAFNLGLEGNVTFICKRRKLENKPPYLSQASLSLFLSISGVDPASETFQRIQMKYLEDDEGIK
        MERFFSVRQI + CDSSHSIML+SGPPSCGKTSLLFQFAFNLGLEGNV FIC RRKLENKPPYLSQASLSLFL  SGVDPASETFQRIQMKYLEDDEGI 
Subjt:  MERFFSVRQIHSHCDSSHSIMLLSGPPSCGKTSLLFQFAFNLGLEGNVTFICKRRKLENKPPYLSQASLSLFLSISGVDPASETFQRIQMKYLEDDEGIK

Query:  KYFSAFHLHSTLPAAVVIDDFGDFFMD-------------------RRCQEKYANPRGRDLAMVRTLALCHNAVSIANRSRPCKLVLSDTHHGESPRLIF
        KYFSAFHLH TLP AVVIDDFGDFF +                   RRCQEKYANPRGRDLAMVRTLALCHNA+S+AN+ RPC+LVLSDTHHGESPRLIF
Subjt:  KYFSAFHLHSTLPAAVVIDDFGDFFMD-------------------RRCQEKYANPRGRDLAMVRTLALCHNAVSIANRSRPCKLVLSDTHHGESPRLIF

Query:  IYKRWVPTIFTIRGDGSGWF-MRSINNCGNDCCLRTRSAKYSIALQFLSLEEISEDSGE
        IYKRWVPTIFTIRGDGSGWF +RSINNCGNDCCLRT+SAKYSI+LQFLSLEE+SED  E
Subjt:  IYKRWVPTIFTIRGDGSGWF-MRSINNCGNDCCLRTRSAKYSIALQFLSLEEISEDSGE

XP_011656789.1 uncharacterized protein LOC101205665 [Cucumis sativus] | 5.1e-117 | 85.89
Query:  MERFFSVRQIHSHCDSSHSIMLLSGPPSCGKTSLLFQFAFNLGLEGNVTFICKRRKLENKPPYLSQASLSLFLSISGVDPASETFQRIQMKYLEDDEGIK
        ME+FFSV+QIH+HC+SSHSI L+SGPPSCGKTSLLFQFAFNLGLEGNVTFIC RRKLENKPPYLSQ          GVDP SETFQRIQMKYLEDD+GIK
Subjt:  MERFFSVRQIHSHCDSSHSIMLLSGPPSCGKTSLLFQFAFNLGLEGNVTFICKRRKLENKPPYLSQASLSLFLSISGVDPASETFQRIQMKYLEDDEGIK

Query:  KYFSAFHLHSTLPAAVVIDDFGDFFMDRRCQEKYANPRGRDLAMVRTLALCHNAVSIANRSRPCKLVLSDTHHGESPRLIFIYKRWVPTIFTIRGDGSGW
        KYFSAFHLHSTLP AVVIDDFGDFF +RRCQEKYANPRGRDLAMVRTLALC NAVS+AN+ RPC+LVLSDTHHGESPRLIFIYKRWVPTIFTIRGDG+GW
Subjt:  KYFSAFHLHSTLPAAVVIDDFGDFFMDRRCQEKYANPRGRDLAMVRTLALCHNAVSIANRSRPCKLVLSDTHHGESPRLIFIYKRWVPTIFTIRGDGSGW

Query:  F-MRSINNCGNDCCLRTRSAKYSIALQFLSLEEISEDSGEQ
        F +RSINNCG DCCLRT+ AKYSIALQFLSLEEISEDS EQ
Subjt:  F-MRSINNCGNDCCLRTRSAKYSIALQFLSLEEISEDSGEQ

XP_022148906.1 uncharacterized protein LOC111017459 [Momordica charantia] | 1.2e-129 | 95
Query:  MERFFSVRQIHSHCDSSHSIMLLSGPPSCGKTSLLFQFAFNLGLEGNVTFICKRRKLENKPPYLSQASLSLFLSISGVDPASETFQRIQMKYLEDDEGIK
        MERFFSVRQIHSHCDSSHSIMLLSGPPSCGKTSLLFQFAFNLGLEGNVTFICKRRKLENKPPYLSQ          GVDPASETFQRIQMKYLEDDEGIK
Subjt:  MERFFSVRQIHSHCDSSHSIMLLSGPPSCGKTSLLFQFAFNLGLEGNVTFICKRRKLENKPPYLSQASLSLFLSISGVDPASETFQRIQMKYLEDDEGIK

Query:  KYFSAFHLHSTLPAAVVIDDFGDFFMDRRCQEKYANPRGRDLAMVRTLALCHNAVSIANRSRPCKLVLSDTHHGESPRLIFIYKRWVPTIFTIRGDGSGW
        KYFSAFHLHSTLPAAVVIDDFGDFFMDRRCQEKYANPRGRDLAMVRTLALCHNAVSIANRSRPCKLVLSDTHHGESPRLIFIYKRWVPTIFTIRGDGSGW
Subjt:  KYFSAFHLHSTLPAAVVIDDFGDFFMDRRCQEKYANPRGRDLAMVRTLALCHNAVSIANRSRPCKLVLSDTHHGESPRLIFIYKRWVPTIFTIRGDGSGW

Query:  FMRSINNCGNDCCLRTRSAKYSIALQFLSLEEISEDSGEQ
        FMRSINNCGNDCCLRTR AKYSIALQFLSL+EISEDSGEQ
Subjt:  FMRSINNCGNDCCLRTRSAKYSIALQFLSLEEISEDSGEQ

XP_022959968.1 uncharacterized protein LOC111460863 isoform X1 [Cucurbita moschata] | 2.5e-116 | 86.55
Query:  RFFSVRQIHSHCDSSHSIMLLSGPPSCGKTSLLFQFAFNLGLEGNVTFICKRRKLENKPPYLSQASLSLFLSISGVDPASETFQRIQMKYLEDDEGIKKY
        RFFSVRQI + CDSSHSIML+SGPPSCGKTSLLFQFAFNLGLEGNVTFIC RRKLENKPPYLSQ          GVDPASETFQRIQMKYLEDDEGI KY
Subjt:  RFFSVRQIHSHCDSSHSIMLLSGPPSCGKTSLLFQFAFNLGLEGNVTFICKRRKLENKPPYLSQASLSLFLSISGVDPASETFQRIQMKYLEDDEGIKKY

Query:  FSAFHLHSTLPAAVVIDDFGDFFMDRRCQEKYANPRGRDLAMVRTLALCHNAVSIANRSRPCKLVLSDTHHGESPRLIFIYKRWVPTIFTIRGDGSGWF-
        FSAFHLH TLP AVVIDDFGDFF +RRCQEKYANPRGRDLAMVRTLALCHNA+S+AN+ RPC+LVLSDTHHGESPRLIFIYKRWVPTIFTIRGDGSGWF 
Subjt:  FSAFHLHSTLPAAVVIDDFGDFFMDRRCQEKYANPRGRDLAMVRTLALCHNAVSIANRSRPCKLVLSDTHHGESPRLIFIYKRWVPTIFTIRGDGSGWF-

Query:  MRSINNCGNDCCLRTRSAKYSIALQFLSLEEISEDSGE
        +RSINNCGN+CCLRT+SAKYSIALQFLSLEE+SED  E
Subjt:  MRSINNCGNDCCLRTRSAKYSIALQFLSLEEISEDSGE

XP_038906786.1 uncharacterized protein LOC120092706 isoform X1 [Benincasa hispida] | 4.6e-118 | 86.72
Query:  MERFFSVRQIHSHCDSSHSIMLLSGPPSCGKTSLLFQFAFNLGLEGNVTFICKRRKLENKPPYLSQASLSLFLSISGVDPASETFQRIQMKYLEDDEGIK
        ME+FFSVRQIH+HCDSSHSIML+SGPPSCGKTSLLFQFAFNLGL+GNVTFIC RRKLENKPPYLSQ          GVDP SETFQRIQMKYLEDDEGIK
Subjt:  MERFFSVRQIHSHCDSSHSIMLLSGPPSCGKTSLLFQFAFNLGLEGNVTFICKRRKLENKPPYLSQASLSLFLSISGVDPASETFQRIQMKYLEDDEGIK

Query:  KYFSAFHLHSTLPAAVVIDDFGDFFMDRRCQEKYANPRGRDLAMVRTLALCHNAVSIANRSRPCKLVLSDTHHGESPRLIFIYKRWVPTIFTIRGDGSGW
        KYFSAFHLH TLP AVVIDDFGDFF +RRCQ+KYANPRGRDLAMVRTLALCHNAVS+AN SRPC+LVLSD +HGESPRLIFIYKRWVPTIFTI GDG GW
Subjt:  KYFSAFHLHSTLPAAVVIDDFGDFFMDRRCQEKYANPRGRDLAMVRTLALCHNAVSIANRSRPCKLVLSDTHHGESPRLIFIYKRWVPTIFTIRGDGSGW

Query:  F-MRSINNCGNDCCLRTRSAKYSIALQFLSLEEISEDSGEQ
        F +RSINNCGNDCCLR RSAKYSIALQFLSLEEISEDS EQ
Subjt:  F-MRSINNCGNDCCLRTRSAKYSIALQFLSLEEISEDSGEQ
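
The % identity column can be related to the alignment midline (the unlabeled row between Query and Subjt), where a letter marks an identical residue, `+` a conservative substitution, and a blank a mismatch, gap, or masked position. The sketch below assumes that convention and assumes identity is reported as identical columns over total aligned columns. The function name `percent_identity` is hypothetical, and the input row is a made-up toy example rather than data from this page.

```python
from typing import Iterable


def percent_identity(midline_rows: Iterable[str], aligned_columns: int) -> float:
    """Percentage of aligned columns whose midline character is a letter.

    Assumption: letters in the midline mark identical residues; '+' and
    blanks are not counted as identities.
    """
    identical = sum(ch.isalpha() for row in midline_rows for ch in row)
    return 100.0 * identical / aligned_columns


# Toy midline: 6 identities, one '+', one blank, over 8 aligned columns.
toy_identity = percent_identity(["MER+F SV"], 8)
print(toy_identity)
```

Counting the midline by hand for XP_022148906.1 gives roughly 228 letters over 240 aligned columns, which is consistent with the listed value of 95.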

TrEMBL top hits | e value | % identity | Alignment
A0A0A0K9H1 Uncharacterized protein | 2.5e-117 | 85.89
Query:  MERFFSVRQIHSHCDSSHSIMLLSGPPSCGKTSLLFQFAFNLGLEGNVTFICKRRKLENKPPYLSQASLSLFLSISGVDPASETFQRIQMKYLEDDEGIK
        ME+FFSV+QIH+HC+SSHSI L+SGPPSCGKTSLLFQFAFNLGLEGNVTFIC RRKLENKPPYLSQ          GVDP SETFQRIQMKYLEDD+GIK
Subjt:  MERFFSVRQIHSHCDSSHSIMLLSGPPSCGKTSLLFQFAFNLGLEGNVTFICKRRKLENKPPYLSQASLSLFLSISGVDPASETFQRIQMKYLEDDEGIK

Query:  KYFSAFHLHSTLPAAVVIDDFGDFFMDRRCQEKYANPRGRDLAMVRTLALCHNAVSIANRSRPCKLVLSDTHHGESPRLIFIYKRWVPTIFTIRGDGSGW
        KYFSAFHLHSTLP AVVIDDFGDFF +RRCQEKYANPRGRDLAMVRTLALC NAVS+AN+ RPC+LVLSDTHHGESPRLIFIYKRWVPTIFTIRGDG+GW
Subjt:  KYFSAFHLHSTLPAAVVIDDFGDFFMDRRCQEKYANPRGRDLAMVRTLALCHNAVSIANRSRPCKLVLSDTHHGESPRLIFIYKRWVPTIFTIRGDGSGW

Query:  F-MRSINNCGNDCCLRTRSAKYSIALQFLSLEEISEDSGEQ
        F +RSINNCG DCCLRT+ AKYSIALQFLSLEEISEDS EQ
Subjt:  F-MRSINNCGNDCCLRTRSAKYSIALQFLSLEEISEDSGEQ

A0A1S3CAT0 uncharacterized protein LOC103498887 | 1.6e-116 | 84.65
Query:  MERFFSVRQIHSHCDSSHSIMLLSGPPSCGKTSLLFQFAFNLGLEGNVTFICKRRKLENKPPYLSQASLSLFLSISGVDPASETFQRIQMKYLEDDEGIK
        ME+FFSV+QIH+ C+SSHSI L+SGPPSCGKTSLLFQFAFNLGLEGNVTFIC RRKLENKPPYLSQ          GVDP SETF+RIQMKYLEDD+GIK
Subjt:  MERFFSVRQIHSHCDSSHSIMLLSGPPSCGKTSLLFQFAFNLGLEGNVTFICKRRKLENKPPYLSQASLSLFLSISGVDPASETFQRIQMKYLEDDEGIK

Query:  KYFSAFHLHSTLPAAVVIDDFGDFFMDRRCQEKYANPRGRDLAMVRTLALCHNAVSIANRSRPCKLVLSDTHHGESPRLIFIYKRWVPTIFTIRGDGSGW
        KYFSAFHLHSTLP AVVIDDFGDFF +RRCQ KYANPRGRD+AMVRTLALCHNAVS+AN+ RPC+LVLSDTHHGESPRLIFIYKRWVPTIFTIRGDG+GW
Subjt:  KYFSAFHLHSTLPAAVVIDDFGDFFMDRRCQEKYANPRGRDLAMVRTLALCHNAVSIANRSRPCKLVLSDTHHGESPRLIFIYKRWVPTIFTIRGDGSGW

Query:  F-MRSINNCGNDCCLRTRSAKYSIALQFLSLEEISEDSGEQ
        F +RSINNCGNDCCLRT+ AKYSIALQFLSLEEISEDS E+
Subjt:  F-MRSINNCGNDCCLRTRSAKYSIALQFLSLEEISEDSGEQ

A0A6J1D5D7 uncharacterized protein LOC111017459 | 5.6e-130 | 95
Query:  MERFFSVRQIHSHCDSSHSIMLLSGPPSCGKTSLLFQFAFNLGLEGNVTFICKRRKLENKPPYLSQASLSLFLSISGVDPASETFQRIQMKYLEDDEGIK
        MERFFSVRQIHSHCDSSHSIMLLSGPPSCGKTSLLFQFAFNLGLEGNVTFICKRRKLENKPPYLSQ          GVDPASETFQRIQMKYLEDDEGIK
Subjt:  MERFFSVRQIHSHCDSSHSIMLLSGPPSCGKTSLLFQFAFNLGLEGNVTFICKRRKLENKPPYLSQASLSLFLSISGVDPASETFQRIQMKYLEDDEGIK

Query:  KYFSAFHLHSTLPAAVVIDDFGDFFMDRRCQEKYANPRGRDLAMVRTLALCHNAVSIANRSRPCKLVLSDTHHGESPRLIFIYKRWVPTIFTIRGDGSGW
        KYFSAFHLHSTLPAAVVIDDFGDFFMDRRCQEKYANPRGRDLAMVRTLALCHNAVSIANRSRPCKLVLSDTHHGESPRLIFIYKRWVPTIFTIRGDGSGW
Subjt:  KYFSAFHLHSTLPAAVVIDDFGDFFMDRRCQEKYANPRGRDLAMVRTLALCHNAVSIANRSRPCKLVLSDTHHGESPRLIFIYKRWVPTIFTIRGDGSGW

Query:  FMRSINNCGNDCCLRTRSAKYSIALQFLSLEEISEDSGEQ
        FMRSINNCGNDCCLRTR AKYSIALQFLSL+EISEDSGEQ
Subjt:  FMRSINNCGNDCCLRTRSAKYSIALQFLSLEEISEDSGEQ

A0A6J1H6C8 uncharacterized protein LOC111460863 isoform X1 | 1.2e-116 | 86.55
Query:  RFFSVRQIHSHCDSSHSIMLLSGPPSCGKTSLLFQFAFNLGLEGNVTFICKRRKLENKPPYLSQASLSLFLSISGVDPASETFQRIQMKYLEDDEGIKKY
        RFFSVRQI + CDSSHSIML+SGPPSCGKTSLLFQFAFNLGLEGNVTFIC RRKLENKPPYLSQ          GVDPASETFQRIQMKYLEDDEGI KY
Subjt:  RFFSVRQIHSHCDSSHSIMLLSGPPSCGKTSLLFQFAFNLGLEGNVTFICKRRKLENKPPYLSQASLSLFLSISGVDPASETFQRIQMKYLEDDEGIKKY

Query:  FSAFHLHSTLPAAVVIDDFGDFFMDRRCQEKYANPRGRDLAMVRTLALCHNAVSIANRSRPCKLVLSDTHHGESPRLIFIYKRWVPTIFTIRGDGSGWF-
        FSAFHLH TLP AVVIDDFGDFF +RRCQEKYANPRGRDLAMVRTLALCHNA+S+AN+ RPC+LVLSDTHHGESPRLIFIYKRWVPTIFTIRGDGSGWF 
Subjt:  FSAFHLHSTLPAAVVIDDFGDFFMDRRCQEKYANPRGRDLAMVRTLALCHNAVSIANRSRPCKLVLSDTHHGESPRLIFIYKRWVPTIFTIRGDGSGWF-

Query:  MRSINNCGNDCCLRTRSAKYSIALQFLSLEEISEDSGE
        +RSINNCGN+CCLRT+SAKYSIALQFLSLEE+SED  E
Subjt:  MRSINNCGNDCCLRTRSAKYSIALQFLSLEEISEDSGE

E5GB61 Uncharacterized protein | 1.6e-116 | 84.65
Query:  MERFFSVRQIHSHCDSSHSIMLLSGPPSCGKTSLLFQFAFNLGLEGNVTFICKRRKLENKPPYLSQASLSLFLSISGVDPASETFQRIQMKYLEDDEGIK
        ME+FFSV+QIH+ C+SSHSI L+SGPPSCGKTSLLFQFAFNLGLEGNVTFIC RRKLENKPPYLSQ          GVDP SETF+RIQMKYLEDD+GIK
Subjt:  MERFFSVRQIHSHCDSSHSIMLLSGPPSCGKTSLLFQFAFNLGLEGNVTFICKRRKLENKPPYLSQASLSLFLSISGVDPASETFQRIQMKYLEDDEGIK

Query:  KYFSAFHLHSTLPAAVVIDDFGDFFMDRRCQEKYANPRGRDLAMVRTLALCHNAVSIANRSRPCKLVLSDTHHGESPRLIFIYKRWVPTIFTIRGDGSGW
        KYFSAFHLHSTLP AVVIDDFGDFF +RRCQ KYANPRGRD+AMVRTLALCHNAVS+AN+ RPC+LVLSDTHHGESPRLIFIYKRWVPTIFTIRGDG+GW
Subjt:  KYFSAFHLHSTLPAAVVIDDFGDFFMDRRCQEKYANPRGRDLAMVRTLALCHNAVSIANRSRPCKLVLSDTHHGESPRLIFIYKRWVPTIFTIRGDGSGW

Query:  F-MRSINNCGNDCCLRTRSAKYSIALQFLSLEEISEDSGEQ
        F +RSINNCGNDCCLRT+ AKYSIALQFLSLEEISEDS E+
Subjt:  F-MRSINNCGNDCCLRTRSAKYSIALQFLSLEEISEDSGEQ

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
AT3G23255.1 unknown protein | 2.0e-63 | 52.65
Query:  MERFFSVRQIHSHCDSSH---SIMLLSGPPSCGKTSLLFQFAFNLGLEGNVT---FICKRRKLENKPPYLSQASLSLFLSISGVDPASETFQRIQMKYLE
        +ERFF  ++  S  D +H   ++ LLSGP S GKTSLLFQ A N+      T   FIC R+K+E+ PP+LSQ          G+DP+S+ F RIQ+KY++
Subjt:  MERFFSVRQIHSHCDSSH---SIMLLSGPPSCGKTSLLFQFAFNLGLEGNVT---FICKRRKLENKPPYLSQASLSLFLSISGVDPASETFQRIQMKYLE

Query:  DDEGIKKYFSAFHLH--STLPAAVVIDDFGDFFMDRRCQEKYANPRGRDLAMVRTLALCHNAVSIANRSRPCKLVLSDTHHGESPRLIFIYKRWVPTIFT
        DDEGI+KYF+AFHLH    LP+AV+IDDFGD+F          N R RD+AMVRTLALCHNA+  A ++  C+LVLS+T+HG+SPR +FIYKRW+P IFT
Subjt:  DDEGIKKYFSAFHLH--STLPAAVVIDDFGDFFMDRRCQEKYANPRGRDLAMVRTLALCHNAVSIANRSRPCKLVLSDTHHGESPRLIFIYKRWVPTIFT

Query:  IRGDGSGWFMRSINNCGNDCCLRTRSAKYSIALQFLSLEEISEDS
        I+G G G F+ + N+         RSAKYSIALQ+L LE+I +DS
Subjt:  IRGDGSGWFMRSINNCGNDCCLRTRSAKYSIALQFLSLEEISEDS

AT3G23255.2 unknown protein | 1.3e-65 | 53.47
Query:  MERFFSVRQIHSHCDSSH---SIMLLSGPPSCGKTSLLFQFAFNLGLEGNVT---FICKRRKLENKPPYLSQASLSLFLSISGVDPASETFQRIQMKYLE
        +ERFF  ++  S  D +H   ++ LLSGP S GKTSLLFQ A N+      T   FIC R+K+E+ PP+LSQ          G+DP+S+ F RIQ+KY++
Subjt:  MERFFSVRQIHSHCDSSH---SIMLLSGPPSCGKTSLLFQFAFNLGLEGNVT---FICKRRKLENKPPYLSQASLSLFLSISGVDPASETFQRIQMKYLE

Query:  DDEGIKKYFSAFHLH--STLPAAVVIDDFGDFFMDRRCQEKYANPRGRDLAMVRTLALCHNAVSIANRSRPCKLVLSDTHHGESPRLIFIYKRWVPTIFT
        DDEGI+KYF+AFHLH    LP+AV+IDDFGD+F          N R RD+AMVRTLALCHNA+  ANR   C+LVLS+T+HG+SPR +FIYKRW+P IFT
Subjt:  DDEGIKKYFSAFHLH--STLPAAVVIDDFGDFFMDRRCQEKYANPRGRDLAMVRTLALCHNAVSIANRSRPCKLVLSDTHHGESPRLIFIYKRWVPTIFT

Query:  IRGDGSGWFMRSINNCGNDCCLRTRSAKYSIALQFLSLEEISEDS
        I+G G G F+ + N+         RSAKYSIALQ+L LE+I +DS
Subjt:  IRGDGSGWFMRSINNCGNDCCLRTRSAKYSIALQFLSLEEISEDS


Sequences
CDS sequence
ATGGAGAGGTTCTTCTCGGTGCGGCAAATCCACTCTCACTGCGACTCTTCCCACTCGATCATGCTTCTTTCAGGGCCTCCTTCCTGCGGAAAGACCTCCTTGCTTTTTCA
GTTTGCTTTTAATTTGGGTCTGGAGGGAAATGTAACCTTCATCTGCAAGCGCCGCAAATTAGAGAACAAACCTCCATATCTCTCCCAGGCATCTCTCTCTCTCTTTCTCT
CTATTTCTGGAGTTGATCCAGCCTCCGAAACCTTTCAGCGCATACAAATGAAGTATTTGGAAGACGACGAAGGAATTAAGAAGTACTTCTCTGCATTTCACCTGCATAGT
ACACTTCCTGCGGCAGTTGTTATTGATGATTTTGGAGACTTTTTCATGGACCGGCGATGCCAAGAAAAGTATGCCAATCCTCGTGGAAGAGACTTAGCAATGGTTAGAAC
TCTAGCTCTTTGTCACAATGCTGTGAGCATAGCGAATCGAAGTAGGCCTTGTAAGCTTGTGTTGTCTGATACACACCACGGAGAGTCCCCTAGGTTGATCTTCATATATA
AGAGATGGGTTCCAACCATTTTCACCATTAGAGGTGATGGATCTGGATGGTTTATGAGAAGTATTAACAATTGTGGAAATGACTGTTGTTTGAGAACTAGAAGTGCAAAG
TACTCGATTGCCCTTCAGTTCCTGAGTTTGGAGGAAATTTCTGAGGACAGTGGTGAACAA
mRNA sequence
ATGGAGAGGTTCTTCTCGGTGCGGCAAATCCACTCTCACTGCGACTCTTCCCACTCGATCATGCTTCTTTCAGGGCCTCCTTCCTGCGGAAAGACCTCCTTGCTTTTTCA
GTTTGCTTTTAATTTGGGTCTGGAGGGAAATGTAACCTTCATCTGCAAGCGCCGCAAATTAGAGAACAAACCTCCATATCTCTCCCAGGCATCTCTCTCTCTCTTTCTCT
CTATTTCTGGAGTTGATCCAGCCTCCGAAACCTTTCAGCGCATACAAATGAAGTATTTGGAAGACGACGAAGGAATTAAGAAGTACTTCTCTGCATTTCACCTGCATAGT
ACACTTCCTGCGGCAGTTGTTATTGATGATTTTGGAGACTTTTTCATGGACCGGCGATGCCAAGAAAAGTATGCCAATCCTCGTGGAAGAGACTTAGCAATGGTTAGAAC
TCTAGCTCTTTGTCACAATGCTGTGAGCATAGCGAATCGAAGTAGGCCTTGTAAGCTTGTGTTGTCTGATACACACCACGGAGAGTCCCCTAGGTTGATCTTCATATATA
AGAGATGGGTTCCAACCATTTTCACCATTAGAGGTGATGGATCTGGATGGTTTATGAGAAGTATTAACAATTGTGGAAATGACTGTTGTTTGAGAACTAGAAGTGCAAAG
TACTCGATTGCCCTTCAGTTCCTGAGTTTGGAGGAAATTTCTGAGGACAGTGGTGAACAA
Protein sequence
MERFFSVRQIHSHCDSSHSIMLLSGPPSCGKTSLLFQFAFNLGLEGNVTFICKRRKLENKPPYLSQASLSLFLSISGVDPASETFQRIQMKYLEDDEGIKKYFSAFHLHS
TLPAAVVIDDFGDFFMDRRCQEKYANPRGRDLAMVRTLALCHNAVSIANRSRPCKLVLSDTHHGESPRLIFIYKRWVPTIFTIRGDGSGWFMRSINNCGNDCCLRTRSAK
YSIALQFLSLEEISEDSGEQ
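
The protein sequence should correspond to a codon-by-codon translation of the CDS. The following is a minimal sketch of that check, assuming the standard nuclear genetic code applies. The `translate` helper is hypothetical, and only the first 60 bases of the CDS and the first 20 residues of the protein are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3))


# First 60 bases of the CDS and first 20 residues of the protein from this record.
cds_prefix = "ATGGAGAGGTTCTTCTCGGTGCGGCAAATCCACTCTCACTGCGACTCTTCCCACTCGATC"
protein_prefix = "MERFFSVRQIHSHCDSSHSI"

prefix_matches = translate(cds_prefix) == protein_prefix
print(prefix_matches)
```

To check the whole record, the CDS lines would need to be joined into a single string without line breaks before calling `translate`, and the result compared against the joined protein lines.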