; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g09190 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g09190
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr6:6739681..6741435
RNA-Seq ExpressionMoc06g09190
SyntenyMoc06g09190
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]1.5e-7780.2Show/hide
Query:  RFYMCARKGLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCRFAS
        +++  + + LAKDES              V+IRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVC FAS
Subjt:  RFYMCARKGLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCRFAS

Query:  NVKRKSKGRAHALEAAQSSEPATPAVAGPASEDPAPVIELESSEGPSREKRPRCQTEAADVSSLGEEVREEAPLKRMRKKKKTTSPLEVGARGALPANFG
        NVKRKSKG+AHALEAAQSS+P TPAV GPASEDPAPVIELESS GPSREKRPR QTEA DVS LGEEVREE PLKR RKKKKTTSPLEVGARG LPA+F 
Subjt:  NVKRKSKGRAHALEAAQSSEPATPAVAGPASEDPAPVIELESSEGPSREKRPRCQTEAADVSSLGEEVREEAPLKRMRKKKKTTSPLEVGARGALPANFG

Query:  KK
         +
Subjt:  KK

XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia]1.4e-8358.75Show/hide
Query:  VSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCRFASNVKRKSKGRAHALEAAQSSEPATPAV---
        +SI+P+PEL QA+FDTLK+YK++FPRGRK+GTLVTDKLLLESGLLDYNP VRPIE+SRPNSELAMVC F S+VKRKSKGRAHAL+  QSS+P TPAV   
Subjt:  VSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCRFASNVKRKSKGRAHALEAAQSSEPATPAV---

Query:  -----AGPASEDPAPVIELESSEGPSREKRPRCQTEAADVSSLGEEVREEAPLKRMRKKKKTTSPLEVGARGALPANFGKKSSAALEAASSTMKDELLKA
             AGP+S  P PVIEL+S+   SREKR R ++EA DVS          PL+ +R                                           
Subjt:  -----AGPASEDPAPVIELESSEGPSREKRPRCQTEAADVSSLGEEVREEAPLKRMRKKKKTTSPLEVGARGALPANFGKKSSAALEAASSTMKDELLKA

Query:  HSEVKILKAEVEAKTELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELKMVKECLSNGALLEESFRQHPDFDGFAKD
                   EAK ELLK+E++R KA LRAAHAITKGLEKEKFQLLKEKDDMLQALE K+  +    AELK  KE L+NGALLE +FRQHPDFDGFAKD
Subjt:  HSEVKILKAEVEAKTELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELKMVKECLSNGALLEESFRQHPDFDGFAKD

Query:  FSDAGFKFLMKGIASDMPDL
        FSDAGFKFLMKGIA+D+P L
Subjt:  FSDAGFKFLMKGIASDMPDL

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]1.8e-8679.74Show/hide
Query:  ARDSEEVELLDVDQLLACFEAKRIAKKPNRFYMCARKG-------------------------LAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTL
        ARDSEE ELLDVDQLLACFEAKRIAKKP RFYMCARKG                         LAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTL
Subjt:  ARDSEEVELLDVDQLLACFEAKRIAKKPNRFYMCARKG-------------------------LAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTL

Query:  KYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCRFASNVKRKSKGRAHALEAAQSSEPATPAVAGPASEDPAPVIELESSEGP
        KYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIE SRPNS LAMVCRFAS VKRKSKGRAHALEAAQSS+P TPAV GPASEDPAPVIELESS GP
Subjt:  KYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCRFASNVKRKSKGRAHALEAAQSSEPATPAVAGPASEDPAPVIELESSEGP

Query:  SREKRPR-------CQTEAADVSSLGE
        SREKRPR        QTEAADV  LGE
Subjt:  SREKRPR-------CQTEAADVSSLGE

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]1.9e-8859.94Show/hide
Query:  MSSSFTSDLGSDEDLARRLESELEKIENFRFSDDGEDRGSLS----------LRTSSLGFRRRG---RELTILQRDE-----------------------
        MSSS +S+L  + DLARRLES+LE+IEN R SDDGED  + +          +    LG  RRG    E  +L+  E                       
Subjt:  MSSSFTSDLGSDEDLARRLESELEKIENFRFSDDGEDRGSLS----------LRTSSLGFRRRG---RELTILQRDE-----------------------

Query:  ----------------SLSTSKCLSTASGFSFTLLSKNFSSELARDSEEVELLDVDQLLACFEAKRIAKKPNRFYMCARKG-------------------
                         L+ ++      G  F L    +    ARDSEE EL DVDQLLACFEAKRIAKKP RFYMCARKG                   
Subjt:  ----------------SLSTSKCLSTASGFSFTLLSKNFSSELARDSEEVELLDVDQLLACFEAKRIAKKPNRFYMCARKG-------------------

Query:  ------LAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCRFASNVK
              LAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESSRPNSELAMVC FAS VK
Subjt:  ------LAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCRFASNVK

Query:  RKSKGRAHALEAAQSSEPATPAVAGPASEDPAPVIELESSEGPSREKRPRCQTEAAD
        RKSKGRAHALEAAQSS+PATPAV GPASEDPA VIELESS GPSREKRPR QTEA D
Subjt:  RKSKGRAHALEAAQSSEPATPAVAGPASEDPAPVIELESSEGPSREKRPRCQTEAAD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]1.8e-10253.83Show/hide
Query:  RFYMCARKGLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCRFAS
        +++  + + LAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+HFPR RK+ TLVTDKLLLESGLLDYNP VR IE+SRPNSELAMVC F  
Subjt:  RFYMCARKGLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCRFAS

Query:  NVKRKSKGRAHALEAAQSSEPATPAV--------AGPASEDPAPVIELESSEGPSREKRPRCQTEAADVSSLGEEVREEAPLKRMRKKKKTTSPLEVGAR
        +VKRKSKGRAHAL+    +EP TP V        +GP+S  P PVIEL+ S G S EKR R ++EA DVS L  EVR E+PL+R RKKKKT+S  E GAR
Subjt:  NVKRKSKGRAHALEAAQSSEPATPAV--------AGPASEDPAPVIELESSEGPSREKRPRCQTEAADVSSLGEEVREEAPLKRMRKKKKTTSPLEVGAR

Query:  GALPAN---------------------FG-----------------------------------------------------------------------
        G LP +                     FG                                                                       
Subjt:  GALPAN---------------------FG-----------------------------------------------------------------------

Query:  ---KKSSAALEAASSTMKDELLKAHSEVKILKAEVEAKTELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELKMVKE
           + S AALEAA +T+K ELLKA  EV IL+AEV+AK +LLKKE ++ KA LRAAHAITKGLEKEKFQLLKEKDD+ Q LE K+  +   T ELK +KE
Subjt:  ---KKSSAALEAASSTMKDELLKAHSEVKILKAEVEAKTELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELKMVKE

Query:  CLSNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDL
         L+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP L
Subjt:  CLSNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDL

TrEMBL top hitse value%identityAlignment
A0A6J1C8K9 uncharacterized protein LOC1110092987.4e-7880.2Show/hide
Query:  RFYMCARKGLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCRFAS
        +++  + + LAKDES              V+IRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVC FAS
Subjt:  RFYMCARKGLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCRFAS

Query:  NVKRKSKGRAHALEAAQSSEPATPAVAGPASEDPAPVIELESSEGPSREKRPRCQTEAADVSSLGEEVREEAPLKRMRKKKKTTSPLEVGARGALPANFG
        NVKRKSKG+AHALEAAQSS+P TPAV GPASEDPAPVIELESS GPSREKRPR QTEA DVS LGEEVREE PLKR RKKKKTTSPLEVGARG LPA+F 
Subjt:  NVKRKSKGRAHALEAAQSSEPATPAVAGPASEDPAPVIELESSEGPSREKRPRCQTEAADVSSLGEEVREEAPLKRMRKKKKTTSPLEVGARGALPANFG

Query:  KK
         +
Subjt:  KK

A0A6J1CLV1 uncharacterized protein LOC1110124676.9e-8458.75Show/hide
Query:  VSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCRFASNVKRKSKGRAHALEAAQSSEPATPAV---
        +SI+P+PEL QA+FDTLK+YK++FPRGRK+GTLVTDKLLLESGLLDYNP VRPIE+SRPNSELAMVC F S+VKRKSKGRAHAL+  QSS+P TPAV   
Subjt:  VSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCRFASNVKRKSKGRAHALEAAQSSEPATPAV---

Query:  -----AGPASEDPAPVIELESSEGPSREKRPRCQTEAADVSSLGEEVREEAPLKRMRKKKKTTSPLEVGARGALPANFGKKSSAALEAASSTMKDELLKA
             AGP+S  P PVIEL+S+   SREKR R ++EA DVS          PL+ +R                                           
Subjt:  -----AGPASEDPAPVIELESSEGPSREKRPRCQTEAADVSSLGEEVREEAPLKRMRKKKKTTSPLEVGARGALPANFGKKSSAALEAASSTMKDELLKA

Query:  HSEVKILKAEVEAKTELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELKMVKECLSNGALLEESFRQHPDFDGFAKD
                   EAK ELLK+E++R KA LRAAHAITKGLEKEKFQLLKEKDDMLQALE K+  +    AELK  KE L+NGALLE +FRQHPDFDGFAKD
Subjt:  HSEVKILKAEVEAKTELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELKMVKECLSNGALLEESFRQHPDFDGFAKD

Query:  FSDAGFKFLMKGIASDMPDL
        FSDAGFKFLMKGIA+D+P L
Subjt:  FSDAGFKFLMKGIASDMPDL

A0A6J1CR42 uncharacterized protein LOC1110138268.7e-8779.74Show/hide
Query:  ARDSEEVELLDVDQLLACFEAKRIAKKPNRFYMCARKG-------------------------LAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTL
        ARDSEE ELLDVDQLLACFEAKRIAKKP RFYMCARKG                         LAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTL
Subjt:  ARDSEEVELLDVDQLLACFEAKRIAKKPNRFYMCARKG-------------------------LAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTL

Query:  KYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCRFASNVKRKSKGRAHALEAAQSSEPATPAVAGPASEDPAPVIELESSEGP
        KYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIE SRPNS LAMVCRFAS VKRKSKGRAHALEAAQSS+P TPAV GPASEDPAPVIELESS GP
Subjt:  KYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCRFASNVKRKSKGRAHALEAAQSSEPATPAVAGPASEDPAPVIELESSEGP

Query:  SREKRPR-------CQTEAADVSSLGE
        SREKRPR        QTEAADV  LGE
Subjt:  SREKRPR-------CQTEAADVSSLGE

A0A6J1DXS5 uncharacterized protein LOC1110255029.3e-8959.94Show/hide
Query:  MSSSFTSDLGSDEDLARRLESELEKIENFRFSDDGEDRGSLS----------LRTSSLGFRRRG---RELTILQRDE-----------------------
        MSSS +S+L  + DLARRLES+LE+IEN R SDDGED  + +          +    LG  RRG    E  +L+  E                       
Subjt:  MSSSFTSDLGSDEDLARRLESELEKIENFRFSDDGEDRGSLS----------LRTSSLGFRRRG---RELTILQRDE-----------------------

Query:  ----------------SLSTSKCLSTASGFSFTLLSKNFSSELARDSEEVELLDVDQLLACFEAKRIAKKPNRFYMCARKG-------------------
                         L+ ++      G  F L    +    ARDSEE EL DVDQLLACFEAKRIAKKP RFYMCARKG                   
Subjt:  ----------------SLSTSKCLSTASGFSFTLLSKNFSSELARDSEEVELLDVDQLLACFEAKRIAKKPNRFYMCARKG-------------------

Query:  ------LAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCRFASNVK
              LAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESSRPNSELAMVC FAS VK
Subjt:  ------LAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCRFASNVK

Query:  RKSKGRAHALEAAQSSEPATPAVAGPASEDPAPVIELESSEGPSREKRPRCQTEAAD
        RKSKGRAHALEAAQSS+PATPAV GPASEDPA VIELESS GPSREKRPR QTEA D
Subjt:  RKSKGRAHALEAAQSSEPATPAVAGPASEDPAPVIELESSEGPSREKRPRCQTEAAD

A0A6J1DZB3 uncharacterized protein LOC1110256658.6e-10353.83Show/hide
Query:  RFYMCARKGLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCRFAS
        +++  + + LAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+HFPR RK+ TLVTDKLLLESGLLDYNP VR IE+SRPNSELAMVC F  
Subjt:  RFYMCARKGLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCRFAS

Query:  NVKRKSKGRAHALEAAQSSEPATPAV--------AGPASEDPAPVIELESSEGPSREKRPRCQTEAADVSSLGEEVREEAPLKRMRKKKKTTSPLEVGAR
        +VKRKSKGRAHAL+    +EP TP V        +GP+S  P PVIEL+ S G S EKR R ++EA DVS L  EVR E+PL+R RKKKKT+S  E GAR
Subjt:  NVKRKSKGRAHALEAAQSSEPATPAV--------AGPASEDPAPVIELESSEGPSREKRPRCQTEAADVSSLGEEVREEAPLKRMRKKKKTTSPLEVGAR

Query:  GALPAN---------------------FG-----------------------------------------------------------------------
        G LP +                     FG                                                                       
Subjt:  GALPAN---------------------FG-----------------------------------------------------------------------

Query:  ---KKSSAALEAASSTMKDELLKAHSEVKILKAEVEAKTELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELKMVKE
           + S AALEAA +T+K ELLKA  EV IL+AEV+AK +LLKKE ++ KA LRAAHAITKGLEKEKFQLLKEKDD+ Q LE K+  +   T ELK +KE
Subjt:  ---KKSSAALEAASSTMKDELLKAHSEVKILKAEVEAKTELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELKMVKE

Query:  CLSNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDL
         L+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP L
Subjt:  CLSNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTCCTCTTTTACCAGCGACTTAGGATCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGAAGATAGAAAACTTTAGGTTCTCCGATGACGGGGAGGA
TAGGGGTTCGCTATCCCTGAGAACATCCTCCTTAGGATTCCGGAGGAGGGGGAGAGAGTTGACAATCCTCCAGAGGGATGAGTCACTCTCTACTTCAAAATGTTTGAGTA
CGGCCTCAGGCTTCTCCTTCACCCTTTTGTCCAAGAATTTCTCTTCTGAACTAGCTCGGGATAGTGAAGAGGTCGAGCTGTTGGATGTAGACCAGCTCCTCGCGTGCTTC
GAAGCGAAAAGGATAGCTAAAAAGCCGAATCGGTTCTATATGTGCGCAAGGAAAGGCCTTGCAAAGGACGAGTCGGGTCGTTCCTTCTTTGACGTTCCCACTAGGTTTGG
GAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGCTGAAATATTACAAGGAGCATTTTCCGAGGGGTAGGAAGGTCGGAACCTTGGTGA
CCGACAAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCCGAACTTGCCATGGTTTGCAGATTTGCAAGC
AACGTGAAACGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGGAACCTGCCACTCCTGCCGTGGCAGGGCCAGCCTCGGAAGATCCAGCCCCAGT
GATCGAGCTGGAGTCTTCTGAGGGTCCTTCGAGGGAGAAGCGCCCTAGGTGTCAGACCGAGGCGGCAGACGTCTCGTCCTTGGGCGAGGAGGTGAGGGAGGAGGCCCCTC
TGAAGCGAATGAGGAAGAAGAAGAAGACCACTTCCCCCTTGGAGGTCGGAGCTCGTGGGGCCCTGCCTGCGAACTTCGGAAAGAAGAGTTCTGCTGCCTTGGAGGCTGCC
TCTTCCACCATGAAGGATGAGCTGCTGAAAGCTCACTCTGAGGTGAAAATTTTGAAGGCTGAGGTGGAGGCCAAGACCGAGCTGCTGAAGAAAGAAGAGGACAGACGCAA
GGCCCAGCTCCGAGCTGCCCATGCTATCACCAAGGGCTTGGAGAAGGAGAAGTTCCAACTCCTCAAAGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAAGGAGGAGG
AGCTGAAGCACGCGACTGCTGAGCTGAAGATGGTGAAGGAGTGTCTCAGCAATGGAGCCCTATTGGAGGAATCGTTCAGGCAACATCCTGACTTCGATGGATTTGCCAAA
GACTTCTCTGACGCGGGCTTCAAGTTTCTCATGAAGGGCATTGCTTCCGACATGCCTGACCTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCGTCCTCTTTTACCAGCGACTTAGGATCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGAAGATAGAAAACTTTAGGTTCTCCGATGACGGGGAGGA
TAGGGGTTCGCTATCCCTGAGAACATCCTCCTTAGGATTCCGGAGGAGGGGGAGAGAGTTGACAATCCTCCAGAGGGATGAGTCACTCTCTACTTCAAAATGTTTGAGTA
CGGCCTCAGGCTTCTCCTTCACCCTTTTGTCCAAGAATTTCTCTTCTGAACTAGCTCGGGATAGTGAAGAGGTCGAGCTGTTGGATGTAGACCAGCTCCTCGCGTGCTTC
GAAGCGAAAAGGATAGCTAAAAAGCCGAATCGGTTCTATATGTGCGCAAGGAAAGGCCTTGCAAAGGACGAGTCGGGTCGTTCCTTCTTTGACGTTCCCACTAGGTTTGG
GAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGCTGAAATATTACAAGGAGCATTTTCCGAGGGGTAGGAAGGTCGGAACCTTGGTGA
CCGACAAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCCGAACTTGCCATGGTTTGCAGATTTGCAAGC
AACGTGAAACGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGGAACCTGCCACTCCTGCCGTGGCAGGGCCAGCCTCGGAAGATCCAGCCCCAGT
GATCGAGCTGGAGTCTTCTGAGGGTCCTTCGAGGGAGAAGCGCCCTAGGTGTCAGACCGAGGCGGCAGACGTCTCGTCCTTGGGCGAGGAGGTGAGGGAGGAGGCCCCTC
TGAAGCGAATGAGGAAGAAGAAGAAGACCACTTCCCCCTTGGAGGTCGGAGCTCGTGGGGCCCTGCCTGCGAACTTCGGAAAGAAGAGTTCTGCTGCCTTGGAGGCTGCC
TCTTCCACCATGAAGGATGAGCTGCTGAAAGCTCACTCTGAGGTGAAAATTTTGAAGGCTGAGGTGGAGGCCAAGACCGAGCTGCTGAAGAAAGAAGAGGACAGACGCAA
GGCCCAGCTCCGAGCTGCCCATGCTATCACCAAGGGCTTGGAGAAGGAGAAGTTCCAACTCCTCAAAGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAAGGAGGAGG
AGCTGAAGCACGCGACTGCTGAGCTGAAGATGGTGAAGGAGTGTCTCAGCAATGGAGCCCTATTGGAGGAATCGTTCAGGCAACATCCTGACTTCGATGGATTTGCCAAA
GACTTCTCTGACGCGGGCTTCAAGTTTCTCATGAAGGGCATTGCTTCCGACATGCCTGACCTTTAG
Protein sequenceShow/hide protein sequence
MSSSFTSDLGSDEDLARRLESELEKIENFRFSDDGEDRGSLSLRTSSLGFRRRGRELTILQRDESLSTSKCLSTASGFSFTLLSKNFSSELARDSEEVELLDVDQLLACF
EAKRIAKKPNRFYMCARKGLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCRFAS
NVKRKSKGRAHALEAAQSSEPATPAVAGPASEDPAPVIELESSEGPSREKRPRCQTEAADVSSLGEEVREEAPLKRMRKKKKTTSPLEVGARGALPANFGKKSSAALEAA
SSTMKDELLKAHSEVKILKAEVEAKTELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELKMVKECLSNGALLEESFRQHPDFDGFAK
DFSDAGFKFLMKGIASDMPDL