; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc05g15560 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc05g15560
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr5:11761236..11763081
RNA-Seq ExpressionMoc05g15560
SyntenyMoc05g15560
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]1.5e-6336.02Show/hide
Query:  ILNAPITQKICLPTFDKYNGTKDPVDHVKTFESIMDFHAYSYAMKHREFSITLQGLDRKWFRLLAPCSITSWKQLRKAFVAQFAPHKDAKHSDTYIFSIR
        +L API  K   PT   Y+G+KDP D+V+ FES+MDF A S A+K R F I L G  R W+R L   SI+++ QLR+ F+A F+     K + T++ +IR
Subjt:  ILNAPITQKICLPTFDKYNGTKDPVDHVKTFESIMDFHAYSYAMKHREFSITLQGLDRKWFRLLAPCSITSWKQLRKAFVAQFAPHKDAKHSDTYIFSIR

Query:  QRPKETIRDNIKCFLSEQVKLENYIDLLTRSAFC-------------------------------------------------------------NSDAK
        Q+  ET+R+ +  F  EQ+K+ +  D    SA C                                                             N+D K
Subjt:  QRPKETIRDNIKCFLSEQVKLENYIDLLTRSAFC-------------------------------------------------------------NSDAK

Query:  L-------------------------------------------------NLLTDPRPLQKDPEQRDKSKFCQFHQDHGHYTSDCYNMKQQIEGLIRQGY
                                                           LL  P  L+  PE+R K K+C+FH++HGH TSD + +K+QIE LI+ GY
Subjt:  L-------------------------------------------------NLLTDPRPLQKDPEQRDKSKFCQFHQDHGHYTSDCYNMKQQIEGLIRQGY

Query:  SKKYVGKRNRDDNSNLSSSRKEQKKENSKTPTKREDRPHVINTVHGGPSEGQPGHKSKELVCKARH---------------------EGVHLLHNDALVI
         KK+VGK         SS+ K+++++ S+TP +R DRP VINT+ GGPS GQ G K KEL   AR                      E VHL HNDALVI
Subjt:  SKKYVGKRNRDDNSNLSSSRKEQKKENSKTPTKREDRPHVINTVHGGPSEGQPGHKSKELVCKARH---------------------EGVHLLHNDALVI

Query:  APHIDHIKVRRLLINGGASVNILTLSTFKALGWGLAQLKKSPTLLVGFSRESVSPEGCIELHISFGEGKPKL
        AP IDH+ V R+L++GG S NIL+L T+ ALGW  +QLKKSPT LVGFS ESV PEG I+L ++ G+ + ++
Subjt:  APHIDHIKVRRLLINGGASVNILTLSTFKALGWGLAQLKKSPTLLVGFSRESVSPEGCIELHISFGEGKPKL

XP_022148920.1 uncharacterized protein LOC111017470 [Momordica charantia]7.7e-5558.46Show/hide
Query:  LLTDPRPLQKDPEQRDKSKFCQFHQDHGHYTSDCYNMKQQIEGLIRQGYSKKYVGKRNRDDNSNLSSSRKEQKKENSKTPTKREDRPHVINTVHGGPSEG
        LL  P  LQ DPE+R+K K+C+FH+DH H T+ C+ +K+QIEGLI+ GY KK+VGK         +S  K++K++ S+TP  R+DRP VINT+ GGPS G
Subjt:  LLTDPRPLQKDPEQRDKSKFCQFHQDHGHYTSDCYNMKQQIEGLIRQGYSKKYVGKRNRDDNSNLSSSRKEQKKENSKTPTKREDRPHVINTVHGGPSEG

Query:  QPGHKSKELVCKARHEGVHLLHNDALVIAPHIDHIKVRRLLINGGASVNILTLSTFKALGWGLAQLKKSPTLLVGFSRESVSPEGCIELHISFGE
        Q G+K KEL  +AR EG+HL HNDALVIAP IDH+ V+R+L++GGAS NIL+L T+ ALGW  +QLKKSPT L GFSRESVS EGCI+L +  G+
Subjt:  QPGHKSKELVCKARHEGVHLLHNDALVIAPHIDHIKVRRLLINGGASVNILTLSTFKALGWGLAQLKKSPTLLVGFSRESVSPEGCIELHISFGE

XP_022149836.1 uncharacterized protein LOC111018172 [Momordica charantia]1.7e-8352.34Show/hide
Query:  PKHVDAILNAPITQKICLPTFDKYNGTKDPVDHVKTFESIMDFHAYSYAMKHREFSITLQGLDRKWFRLLAPCSITSWKQLRKAFVAQFAPHKDAKHSDT
        P  V  IL+API+QK   PTF+KY+GTKDPVDHV+T+E IMDFHAYS AMK R  SITLQG  RKWFRLLA  SI+S KQLRKAF+AQFA HKDAKHSDT
Subjt:  PKHVDAILNAPITQKICLPTFDKYNGTKDPVDHVKTFESIMDFHAYSYAMKHREFSITLQGLDRKWFRLLAPCSITSWKQLRKAFVAQFAPHKDAKHSDT

Query:  YIFSIRQRPKETIRDNIKCFLSEQVKLENYIDLLTRSAFCN--SDAKLNLLTDPRP---------------------LQKDPE----------QRDKSKF
                      D IK FLSEQ+K+E   DLL RSAFCN  +  KL+     +P                     + K+ +          +++K K 
Subjt:  YIFSIRQRPKETIRDNIKCFLSEQVKLENYIDLLTRSAFCN--SDAKLNLLTDPRP---------------------LQKDPE----------QRDKSKF

Query:  CQFHQDHGHYTSDCYNMKQQIEGLIRQGYSKKYVGKRNRDDNSNLSSSRKEQKKENSKTPTKREDRPHVINTVHGGPSEGQPGHKSKELVCKARHE----
           +  +G   S C         LIRQGY KKYVGKR+R++ SN SSSRKEQK+E SKT  KRED+P +I+T+ GG S+GQ GHK KEL  +A HE    
Subjt:  CQFHQDHGHYTSDCYNMKQQIEGLIRQGYSKKYVGKRNRDDNSNLSSSRKEQKKENSKTPTKREDRPHVINTVHGGPSEGQPGHKSKELVCKARHE----

Query:  -----------------GVHLLHNDALVIAPHIDHIKVRRLLINGGASVNILTLSTFKALGWGLAQLKKSPTLLVGFSRESVSP
                         GVH  HNDALVIA  IDHI +RR+LI+G AS NIL+LST+KAL WG AQLKKSPT LVGFS ESV+P
Subjt:  -----------------GVHLLHNDALVIAPHIDHIKVRRLLINGGASVNILTLSTFKALGWGLAQLKKSPTLLVGFSRESVSP

XP_022155139.1 uncharacterized protein LOC111022280 [Momordica charantia]6.3e-5740.63Show/hide
Query:  ILNAPITQKICLPTFDKYNGTKDPVDHVKTFESIMDFHAYSYAMKHREFSITLQGLDRKWFRLLAPCSITSWKQLRKAFVAQFAPHKDAKHSDTYIFSIR
        I+ API  K   PT   Y+G+KDP D+V+ FE +MDF A + A+K   F I L G  R W R L   SI+++ QLRK F+ QF+     + + T++ +IR
Subjt:  ILNAPITQKICLPTFDKYNGTKDPVDHVKTFESIMDFHAYSYAMKHREFSITLQGLDRKWFRLLAPCSITSWKQLRKAFVAQFAPHKDAKHSDTYIFSIR

Query:  QRPKETIRDNIKCFLSEQVKLENYIDLLTRSAFCNSDAKLNLLTDPRPLQKDPEQ--RDKSKFCQFHQDHGHYTSD--------------------CYNM
        Q+  ET+   +K  L E+        L       +    L   TD    Q D ++  + K K     +D G  +S                     C+ +
Subjt:  QRPKETIRDNIKCFLSEQVKLENYIDLLTRSAFCNSDAKLNLLTDPRPLQKDPEQ--RDKSKFCQFHQDHGHYTSD--------------------CYNM

Query:  KQQIEGLIRQGYSKKYVGKRNRDDNSNLSSSRKEQKKENSKTPTKREDRPHVINTVHGGPSEGQPGHKSKELVCKARH---------------------E
        K+QIE LI+  Y KK+VGK         +S  K+++++ S+TP +REDRP VINT+ GGPS GQ  +K KEL C+AR                      E
Subjt:  KQQIEGLIRQGYSKKYVGKRNRDDNSNLSSSRKEQKKENSKTPTKREDRPHVINTVHGGPSEGQPGHKSKELVCKARH---------------------E

Query:  GVHLLHNDALVIAPHIDHIKVRRLLINGGASVNILTLSTFKALGWGLAQLKKSPTLLVGFSRESVSPEGCIELHISFGE
        GVHL HNDALVIAP IDH+ VRR+L++GGAS NIL+L T+ AL    +QLKKSPT LVGFS ESVSPEGCI+L ++ G+
Subjt:  GVHLLHNDALVIAPHIDHIKVRRLLINGGASVNILTLSTFKALGWGLAQLKKSPTLLVGFSRESVSPEGCIELHISFGE

XP_022158091.1 uncharacterized protein LOC111024660 [Momordica charantia]7.4e-5860.29Show/hide
Query:  EQRDKSKFCQFHQDHGHYTSDCYNMKQQIEGLIRQGYSKKYVGKRNRDDNSNLSSSRKEQKKENSKTPTKREDRPHVINTVHGGPSEGQPGHKSKELVCK
        + RD+S FC FH+ HGHYTS+CY++KQQIEGLIRQGY KKYVGKRN +D+S  SSS+ E+K+E S+ P K ED+P +IN +HG PS+GQ G K KEL  K
Subjt:  EQRDKSKFCQFHQDHGHYTSDCYNMKQQIEGLIRQGYSKKYVGKRNRDDNSNLSSSRKEQKKENSKTPTKREDRPHVINTVHGGPSEGQPGHKSKELVCK

Query:  ARHE---------------------GVHLLHNDALVIAPHIDHIKVRRLLINGGASVNILTLSTFKALGWGLAQLKKSPTLLVGFSRESVSPEGCIELHI
        ARHE                     GVHL HNDALVIA  I+HI V R+LI+GGAS NILTLST+KAL WG AQLKKSPT LVGFS E V+ + CIEL I
Subjt:  ARHE---------------------GVHLLHNDALVIAPHIDHIKVRRLLINGGASVNILTLSTFKALGWGLAQLKKSPTLLVGFSRESVSPEGCIELHI

Query:  SFGEGKPKL
        S GEG+ ++
Subjt:  SFGEGKPKL

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088137.4e-6436.02Show/hide
Query:  ILNAPITQKICLPTFDKYNGTKDPVDHVKTFESIMDFHAYSYAMKHREFSITLQGLDRKWFRLLAPCSITSWKQLRKAFVAQFAPHKDAKHSDTYIFSIR
        +L API  K   PT   Y+G+KDP D+V+ FES+MDF A S A+K R F I L G  R W+R L   SI+++ QLR+ F+A F+     K + T++ +IR
Subjt:  ILNAPITQKICLPTFDKYNGTKDPVDHVKTFESIMDFHAYSYAMKHREFSITLQGLDRKWFRLLAPCSITSWKQLRKAFVAQFAPHKDAKHSDTYIFSIR

Query:  QRPKETIRDNIKCFLSEQVKLENYIDLLTRSAFC-------------------------------------------------------------NSDAK
        Q+  ET+R+ +  F  EQ+K+ +  D    SA C                                                             N+D K
Subjt:  QRPKETIRDNIKCFLSEQVKLENYIDLLTRSAFC-------------------------------------------------------------NSDAK

Query:  L-------------------------------------------------NLLTDPRPLQKDPEQRDKSKFCQFHQDHGHYTSDCYNMKQQIEGLIRQGY
                                                           LL  P  L+  PE+R K K+C+FH++HGH TSD + +K+QIE LI+ GY
Subjt:  L-------------------------------------------------NLLTDPRPLQKDPEQRDKSKFCQFHQDHGHYTSDCYNMKQQIEGLIRQGY

Query:  SKKYVGKRNRDDNSNLSSSRKEQKKENSKTPTKREDRPHVINTVHGGPSEGQPGHKSKELVCKARH---------------------EGVHLLHNDALVI
         KK+VGK         SS+ K+++++ S+TP +R DRP VINT+ GGPS GQ G K KEL   AR                      E VHL HNDALVI
Subjt:  SKKYVGKRNRDDNSNLSSSRKEQKKENSKTPTKREDRPHVINTVHGGPSEGQPGHKSKELVCKARH---------------------EGVHLLHNDALVI

Query:  APHIDHIKVRRLLINGGASVNILTLSTFKALGWGLAQLKKSPTLLVGFSRESVSPEGCIELHISFGEGKPKL
        AP IDH+ V R+L++GG S NIL+L T+ ALGW  +QLKKSPT LVGFS ESV PEG I+L ++ G+ + ++
Subjt:  APHIDHIKVRRLLINGGASVNILTLSTFKALGWGLAQLKKSPTLLVGFSRESVSPEGCIELHISFGEGKPKL

A0A6J1D4A4 uncharacterized protein LOC1110174703.7e-5558.46Show/hide
Query:  LLTDPRPLQKDPEQRDKSKFCQFHQDHGHYTSDCYNMKQQIEGLIRQGYSKKYVGKRNRDDNSNLSSSRKEQKKENSKTPTKREDRPHVINTVHGGPSEG
        LL  P  LQ DPE+R+K K+C+FH+DH H T+ C+ +K+QIEGLI+ GY KK+VGK         +S  K++K++ S+TP  R+DRP VINT+ GGPS G
Subjt:  LLTDPRPLQKDPEQRDKSKFCQFHQDHGHYTSDCYNMKQQIEGLIRQGYSKKYVGKRNRDDNSNLSSSRKEQKKENSKTPTKREDRPHVINTVHGGPSEG

Query:  QPGHKSKELVCKARHEGVHLLHNDALVIAPHIDHIKVRRLLINGGASVNILTLSTFKALGWGLAQLKKSPTLLVGFSRESVSPEGCIELHISFGE
        Q G+K KEL  +AR EG+HL HNDALVIAP IDH+ V+R+L++GGAS NIL+L T+ ALGW  +QLKKSPT L GFSRESVS EGCI+L +  G+
Subjt:  QPGHKSKELVCKARHEGVHLLHNDALVIAPHIDHIKVRRLLINGGASVNILTLSTFKALGWGLAQLKKSPTLLVGFSRESVSPEGCIELHISFGE

A0A6J1D9M1 uncharacterized protein LOC1110181728.5e-8452.34Show/hide
Query:  PKHVDAILNAPITQKICLPTFDKYNGTKDPVDHVKTFESIMDFHAYSYAMKHREFSITLQGLDRKWFRLLAPCSITSWKQLRKAFVAQFAPHKDAKHSDT
        P  V  IL+API+QK   PTF+KY+GTKDPVDHV+T+E IMDFHAYS AMK R  SITLQG  RKWFRLLA  SI+S KQLRKAF+AQFA HKDAKHSDT
Subjt:  PKHVDAILNAPITQKICLPTFDKYNGTKDPVDHVKTFESIMDFHAYSYAMKHREFSITLQGLDRKWFRLLAPCSITSWKQLRKAFVAQFAPHKDAKHSDT

Query:  YIFSIRQRPKETIRDNIKCFLSEQVKLENYIDLLTRSAFCN--SDAKLNLLTDPRP---------------------LQKDPE----------QRDKSKF
                      D IK FLSEQ+K+E   DLL RSAFCN  +  KL+     +P                     + K+ +          +++K K 
Subjt:  YIFSIRQRPKETIRDNIKCFLSEQVKLENYIDLLTRSAFCN--SDAKLNLLTDPRP---------------------LQKDPE----------QRDKSKF

Query:  CQFHQDHGHYTSDCYNMKQQIEGLIRQGYSKKYVGKRNRDDNSNLSSSRKEQKKENSKTPTKREDRPHVINTVHGGPSEGQPGHKSKELVCKARHE----
           +  +G   S C         LIRQGY KKYVGKR+R++ SN SSSRKEQK+E SKT  KRED+P +I+T+ GG S+GQ GHK KEL  +A HE    
Subjt:  CQFHQDHGHYTSDCYNMKQQIEGLIRQGYSKKYVGKRNRDDNSNLSSSRKEQKKENSKTPTKREDRPHVINTVHGGPSEGQPGHKSKELVCKARHE----

Query:  -----------------GVHLLHNDALVIAPHIDHIKVRRLLINGGASVNILTLSTFKALGWGLAQLKKSPTLLVGFSRESVSP
                         GVH  HNDALVIA  IDHI +RR+LI+G AS NIL+LST+KAL WG AQLKKSPT LVGFS ESV+P
Subjt:  -----------------GVHLLHNDALVIAPHIDHIKVRRLLINGGASVNILTLSTFKALGWGLAQLKKSPTLLVGFSRESVSP

A0A6J1DPC9 uncharacterized protein LOC1110222803.0e-5740.63Show/hide
Query:  ILNAPITQKICLPTFDKYNGTKDPVDHVKTFESIMDFHAYSYAMKHREFSITLQGLDRKWFRLLAPCSITSWKQLRKAFVAQFAPHKDAKHSDTYIFSIR
        I+ API  K   PT   Y+G+KDP D+V+ FE +MDF A + A+K   F I L G  R W R L   SI+++ QLRK F+ QF+     + + T++ +IR
Subjt:  ILNAPITQKICLPTFDKYNGTKDPVDHVKTFESIMDFHAYSYAMKHREFSITLQGLDRKWFRLLAPCSITSWKQLRKAFVAQFAPHKDAKHSDTYIFSIR

Query:  QRPKETIRDNIKCFLSEQVKLENYIDLLTRSAFCNSDAKLNLLTDPRPLQKDPEQ--RDKSKFCQFHQDHGHYTSD--------------------CYNM
        Q+  ET+   +K  L E+        L       +    L   TD    Q D ++  + K K     +D G  +S                     C+ +
Subjt:  QRPKETIRDNIKCFLSEQVKLENYIDLLTRSAFCNSDAKLNLLTDPRPLQKDPEQ--RDKSKFCQFHQDHGHYTSD--------------------CYNM

Query:  KQQIEGLIRQGYSKKYVGKRNRDDNSNLSSSRKEQKKENSKTPTKREDRPHVINTVHGGPSEGQPGHKSKELVCKARH---------------------E
        K+QIE LI+  Y KK+VGK         +S  K+++++ S+TP +REDRP VINT+ GGPS GQ  +K KEL C+AR                      E
Subjt:  KQQIEGLIRQGYSKKYVGKRNRDDNSNLSSSRKEQKKENSKTPTKREDRPHVINTVHGGPSEGQPGHKSKELVCKARH---------------------E

Query:  GVHLLHNDALVIAPHIDHIKVRRLLINGGASVNILTLSTFKALGWGLAQLKKSPTLLVGFSRESVSPEGCIELHISFGE
        GVHL HNDALVIAP IDH+ VRR+L++GGAS NIL+L T+ AL    +QLKKSPT LVGFS ESVSPEGCI+L ++ G+
Subjt:  GVHLLHNDALVIAPHIDHIKVRRLLINGGASVNILTLSTFKALGWGLAQLKKSPTLLVGFSRESVSPEGCIELHISFGE

A0A6J1E005 uncharacterized protein LOC1110246603.6e-5860.29Show/hide
Query:  EQRDKSKFCQFHQDHGHYTSDCYNMKQQIEGLIRQGYSKKYVGKRNRDDNSNLSSSRKEQKKENSKTPTKREDRPHVINTVHGGPSEGQPGHKSKELVCK
        + RD+S FC FH+ HGHYTS+CY++KQQIEGLIRQGY KKYVGKRN +D+S  SSS+ E+K+E S+ P K ED+P +IN +HG PS+GQ G K KEL  K
Subjt:  EQRDKSKFCQFHQDHGHYTSDCYNMKQQIEGLIRQGYSKKYVGKRNRDDNSNLSSSRKEQKKENSKTPTKREDRPHVINTVHGGPSEGQPGHKSKELVCK

Query:  ARHE---------------------GVHLLHNDALVIAPHIDHIKVRRLLINGGASVNILTLSTFKALGWGLAQLKKSPTLLVGFSRESVSPEGCIELHI
        ARHE                     GVHL HNDALVIA  I+HI V R+LI+GGAS NILTLST+KAL WG AQLKKSPT LVGFS E V+ + CIEL I
Subjt:  ARHE---------------------GVHLLHNDALVIAPHIDHIKVRRLLINGGASVNILTLSTFKALGWGLAQLKKSPTLLVGFSRESVSPEGCIELHI

Query:  SFGEGKPKL
        S GEG+ ++
Subjt:  SFGEGKPKL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAGCGCGTATGGACACCATGGCTCAGCCGCTGTTCAATGCAGAAGTTCCGCTACATGACTTCCTTGAGCCCGAGCTCCAGCATGATCCAAGGAAAGTGAAAGAAAA
AGAGGGCGAGAACCCTCAAAAGCGACAAGATCAGAATCCAGTCCCCAAATTAGTCCTTGGAGGTAACAACCGACCTAAGCCTTATGACAATCTCCATCCTAAACTTCTGC
CCCAACTCTCCAAGATTCGAGGTCAAAGTAGAACCCGAGGTCCGAAACACGTCGACGCTATCCTCAATGCGCCTATTACCCAGAAGATTTGCCTGCCGACCTTCGACAAA
TACAATGGGACGAAGGATCCTGTTGACCATGTGAAAACCTTCGAGTCCATCATGGACTTTCACGCGTACTCTTATGCGATGAAGCATAGGGAATTTTCCATTACACTACA
GGGACTAGATCGAAAGTGGTTCAGGTTGCTGGCCCCCTGCTCAATTACGAGCTGGAAGCAATTGAGGAAGGCATTTGTAGCCCAGTTTGCACCCCATAAGGATGCAAAAC
ACTCAGATACCTACATCTTCTCAATTCGGCAAAGGCCAAAGGAGACCATCAGAGACAACATCAAGTGTTTCCTCTCGGAACAGGTCAAGTTGGAGAACTACATTGATCTG
CTAACCCGATCTGCTTTCTGCAATAGCGACGCCAAGCTCAATTTGCTAACTGACCCTAGGCCATTGCAGAAGGATCCCGAGCAGCGGGACAAATCAAAGTTCTGTCAGTT
TCACCAGGATCACGGCCACTATACTTCAGATTGCTACAACATGAAGCAACAAATTGAAGGGTTAATTAGACAAGGCTACTCCAAGAAATATGTTGGTAAGAGAAATCGGG
ATGACAACTCAAATCTGTCAAGCAGCAGAAAGGAGCAAAAGAAGGAGAATTCAAAGACTCCTACCAAACGAGAAGACCGACCTCATGTAATCAACACGGTCCATGGGGGA
CCTAGCGAAGGCCAACCTGGCCATAAGAGTAAGGAGTTAGTCTGCAAGGCTAGACACGAAGGTGTCCACCTGCTGCACAATGATGCGCTGGTTATTGCCCCCCATATCGA
CCACATCAAGGTACGCAGATTACTTATTAATGGAGGGGCATCCGTCAACATCCTGACCTTGTCGACCTTCAAGGCATTAGGATGGGGATTAGCTCAACTGAAGAAGAGTC
CAACACTATTGGTTGGATTCTCTAGAGAAAGTGTGAGCCCAGAAGGTTGCATTGAGCTCCACATCTCGTTTGGTGAAGGAAAACCCAAGCTTCCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGCAAGCGCGTATGGACACCATGGCTCAGCCGCTGTTCAATGCAGAAGTTCCGCTACATGACTTCCTTGAGCCCGAGCTCCAGCATGATCCAAGGAAAGTGAAAGAAAA
AGAGGGCGAGAACCCTCAAAAGCGACAAGATCAGAATCCAGTCCCCAAATTAGTCCTTGGAGGTAACAACCGACCTAAGCCTTATGACAATCTCCATCCTAAACTTCTGC
CCCAACTCTCCAAGATTCGAGGTCAAAGTAGAACCCGAGGTCCGAAACACGTCGACGCTATCCTCAATGCGCCTATTACCCAGAAGATTTGCCTGCCGACCTTCGACAAA
TACAATGGGACGAAGGATCCTGTTGACCATGTGAAAACCTTCGAGTCCATCATGGACTTTCACGCGTACTCTTATGCGATGAAGCATAGGGAATTTTCCATTACACTACA
GGGACTAGATCGAAAGTGGTTCAGGTTGCTGGCCCCCTGCTCAATTACGAGCTGGAAGCAATTGAGGAAGGCATTTGTAGCCCAGTTTGCACCCCATAAGGATGCAAAAC
ACTCAGATACCTACATCTTCTCAATTCGGCAAAGGCCAAAGGAGACCATCAGAGACAACATCAAGTGTTTCCTCTCGGAACAGGTCAAGTTGGAGAACTACATTGATCTG
CTAACCCGATCTGCTTTCTGCAATAGCGACGCCAAGCTCAATTTGCTAACTGACCCTAGGCCATTGCAGAAGGATCCCGAGCAGCGGGACAAATCAAAGTTCTGTCAGTT
TCACCAGGATCACGGCCACTATACTTCAGATTGCTACAACATGAAGCAACAAATTGAAGGGTTAATTAGACAAGGCTACTCCAAGAAATATGTTGGTAAGAGAAATCGGG
ATGACAACTCAAATCTGTCAAGCAGCAGAAAGGAGCAAAAGAAGGAGAATTCAAAGACTCCTACCAAACGAGAAGACCGACCTCATGTAATCAACACGGTCCATGGGGGA
CCTAGCGAAGGCCAACCTGGCCATAAGAGTAAGGAGTTAGTCTGCAAGGCTAGACACGAAGGTGTCCACCTGCTGCACAATGATGCGCTGGTTATTGCCCCCCATATCGA
CCACATCAAGGTACGCAGATTACTTATTAATGGAGGGGCATCCGTCAACATCCTGACCTTGTCGACCTTCAAGGCATTAGGATGGGGATTAGCTCAACTGAAGAAGAGTC
CAACACTATTGGTTGGATTCTCTAGAGAAAGTGTGAGCCCAGAAGGTTGCATTGAGCTCCACATCTCGTTTGGTGAAGGAAAACCCAAGCTTCCCTAG
Protein sequenceShow/hide protein sequence
MQARMDTMAQPLFNAEVPLHDFLEPELQHDPRKVKEKEGENPQKRQDQNPVPKLVLGGNNRPKPYDNLHPKLLPQLSKIRGQSRTRGPKHVDAILNAPITQKICLPTFDK
YNGTKDPVDHVKTFESIMDFHAYSYAMKHREFSITLQGLDRKWFRLLAPCSITSWKQLRKAFVAQFAPHKDAKHSDTYIFSIRQRPKETIRDNIKCFLSEQVKLENYIDL
LTRSAFCNSDAKLNLLTDPRPLQKDPEQRDKSKFCQFHQDHGHYTSDCYNMKQQIEGLIRQGYSKKYVGKRNRDDNSNLSSSRKEQKKENSKTPTKREDRPHVINTVHGG
PSEGQPGHKSKELVCKARHEGVHLLHNDALVIAPHIDHIKVRRLLINGGASVNILTLSTFKALGWGLAQLKKSPTLLVGFSRESVSPEGCIELHISFGEGKPKLP