; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g15780 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g15780
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrotran_gag_3 domain-containing protein
Genome locationchr4:11823309..11828784
RNA-Seq ExpressionMoc04g15780
SyntenyMoc04g15780
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN74819.1 hypothetical protein VITISV_034590 [Vitis vinifera]3.8e-2131.56Show/hide
Query:  MDPPPTINKAFSLVNQEEQQRSINVTSPVP----AAFSSSFAMAIQHSSHVNKPGSKPTSNYKQRKERSMCTHCGLLGHTVDRCYKLHSYPLGYKFQGQR
        M+P P I K FSLV Q+E+Q SIN     P    AA  S+  +AI          S    N K +K+R  C+HCG+LGHTVD+CYKL+ YP GYKF+   
Subjt:  MDPPPTINKAFSLVNQEEQQRSINVTSPVP----AAFSSSFAMAIQHSSHVNKPGSKPTSNYKQRKERSMCTHCGLLGHTVDRCYKLHSYPLGYKFQGQR

Query:  SSFQPHVNSNSASLHSPTSVPPTKFLDFQAMAVSPINSGSDMACQCQQLIQLLQSQLANSKATT----------SDDASASYMAESLFENMSRISPVVVN
         S  PH  + +    S T+       +  A A SP+ S S    QCQQLI LL SQL ++   T          S  +S   ++   F N    S  V++
Subjt:  SSFQPHVNSNSASLHSPTSVPPTKFLDFQAMAVSPINSGSDMACQCQQLIQLLQSQLANSKATT----------SDDASASYMAESLFENMSRISPVVVN

Query:  LP------------NNIKFTME-FKGIFI-------VYELNSLMILVFFMTNT----------PNDHSSSGCSIEVADNSLGCSIEIGASDSNAPR----
                      ++ KF     K +F+        Y+L +L     F++             ND S    S   +D  L   I I ++DS  P     
Subjt:  LP------------NNIKFTME-FKGIFI-------VYELNSLMILVFFMTNT----------PNDHSSSGCSIEVADNSLGCSIEIGASDSNAPR----

Query:  -----HSLHTVKSPSYLQDYDCALLHGDSLPAYYTKHPLQHHVSYSRLSTSHCALEDSHSRRRQLQNSLAEIFKPTT
             H   + ++PSYLQDY C+     S  +  T HPL   + Y +LST H            L N+++  F+PTT
Subjt:  -----HSLHTVKSPSYLQDYDCALLHGDSLPAYYTKHPLQHHVSYSRLSTSHCALEDSHSRRRQLQNSLAEIFKPTT

KAA8517066.1 hypothetical protein F0562_017116 [Nyssa sinensis]1.6e-1930.31Show/hide
Query:  MDPPPTINKAFSLVNQEEQQRSINVTSPVPAAFSSSFAMAIQHSSHVNKPG------SKPTSNYKQRKERSMCTHCGLLGHTVDRCYKLHSYPLGYKFQG
        MDP P IN+ FSL+ QEEQQR    T+P   + +S+  MA    +   K G      S  +++  Q+++R  CTHC +LGHTVDRCYK+H YP GYKF+ 
Subjt:  MDPPPTINKAFSLVNQEEQQRSINVTSPVPAAFSSSFAMAIQHSSHVNKPG------SKPTSNYKQRKERSMCTHCGLLGHTVDRCYKLHSYPLGYKFQG

Query:  QRSSFQPHVNSNSASLHSPTSVPPTKFLDFQAMAVSPINSGSDMACQCQQLIQLLQSQLANSKATTS--DDASASYMAESLFENMSRISPVVVNLPNNIK
                 NSN+A+    TS   +   +     V  +NS      Q QQL+ +L + L+NS   T+  D +  + +A+ + ++   +  V+ +      
Subjt:  QRSSFQPHVNSNSASLHSPTSVPPTKFLDFQAMAVSPINSGSDMACQCQQLIQLLQSQLANSKATTS--DDASASYMAESLFENMSRISPVVVNLPNNIK

Query:  FTMEFKGIFIVYELNSLMILVFFMTNTPNDH----SSSGCSIEVADNSLGCSIEIGASDSNAP--------RHSLHTVKSPSYLQDYDCALLHGDSLPAY
        F  +     +     +  +      + P+ +    SSSG S     ++   +I I    + AP        R S      P+YL+DY C LL G    A 
Subjt:  FTMEFKGIFIVYELNSLMILVFFMTNTPNDH----SSSGCSIEVADNSLGCSIEIGASDSNAP--------RHSLHTVKSPSYLQDYDCALLHGDSLPAY

Query:  YT-KHPLQHHVSYSRLSTSH
         +  +P+ +++SY  LS SH
Subjt:  YT-KHPLQHHVSYSRLSTSH

KAA8536734.1 hypothetical protein F0562_029212 [Nyssa sinensis]5.2e-1832.05Show/hide
Query:  MDPPPTINKAFSLVNQEEQQRSINVTSPVPAAFSSSFAMAIQHSSHVNKPG---------SKPTSNYKQRKERSMCTHCGLLGHTVDRCYKLHSYPLGYK
        MD  P IN+ FSL+ QEEQQR  N++S    + +S+  MA    + V K G         S  +++  Q+++R  CTHC +LGHTVDRCYK+H YP GYK
Subjt:  MDPPPTINKAFSLVNQEEQQRSINVTSPVPAAFSSSFAMAIQHSSHVNKPG---------SKPTSNYKQRKERSMCTHCGLLGHTVDRCYKLHSYPLGYK

Query:  FQGQRSSFQPHVNSNSASLHSPTSVPPTKFLDFQAMAVSPINSGSDMACQCQQLIQLLQSQLANS-KATTSDDASASYMAESLFENMSRISPVVVNLPNN
        F+          NSN+A+    TS   +   +     V  +NS      Q QQL+ +L + L++S K T + D S +     L  +      V   +P +
Subjt:  FQGQRSSFQPHVNSNSASLHSPTSVPPTKFLDFQAMAVSPINSGSDMACQCQQLIQLLQSQLANS-KATTSDDASASYMAESLFENMSRISPVVVNLPNN

Query:  IKFTMEFKGIFIVYELNSLMILVFFMTNTPNDHSSSGCSIEVADNSLGCSIEIGASDSNAPRHSLHTVKSPSYLQDYDCALLHGDSLPAYYT---KHPLQ
        +          IV         V F T+T N        I V D     +I    S     R S      P+YL+DY C LL G   P + +    +P+ 
Subjt:  IKFTMEFKGIFIVYELNSLMILVFFMTNTPNDHSSSGCSIEVADNSLGCSIEIGASDSNAPRHSLHTVKSPSYLQDYDCALLHGDSLPAYYT---KHPLQ

Query:  HHVSYSRLSTSH
        +++SY  LS SH
Subjt:  HHVSYSRLSTSH

KAA8550199.1 hypothetical protein F0562_001883 [Nyssa sinensis]8.0e-1931.52Show/hide
Query:  MDPPPTINKAFSLVNQEEQQRSINVTSPVPAAFSSSFAMAIQHSSHVNKPG---------SKPTSNYKQRKERSMCTHCGLLGHTVDRCYKLHSYPLGYK
        MDP P IN+ FSL+ QEEQQR    T+P   + +S+  MA    + VNK G         S  +++  Q+K++  CTHC + GHTVDRCYK+H YP GYK
Subjt:  MDPPPTINKAFSLVNQEEQQRSINVTSPVPAAFSSSFAMAIQHSSHVNKPG---------SKPTSNYKQRKERSMCTHCGLLGHTVDRCYKLHSYPLGYK

Query:  FQGQRSSFQPHVNSNSASLHSPTSVPPTKFLDFQAMAVSPINSGSDMACQCQQLIQLLQSQLANS-KATTSDDASASYMAESLFENMSRISPVVVNLPNN
        F+          NSN+A+    TS       +        +NS      Q QQL+ +L + L++S K T + D S       L + ++   P +V LP++
Subjt:  FQGQRSSFQPHVNSNSASLHSPTSVPPTKFLDFQAMAVSPINSGSDMACQCQQLIQLLQSQLANS-KATTSDDASASYMAESLFENMSRISPVVVNLPNN

Query:  IKFTMEFKGIFIVYELNSLMILVFFMTNTPND------------HSSSGCSIEVADNSLGCSIEIGASDSNAP--------RHSLHTVKSPSYLQDYDCA
               +  FI+   +S +      +  P D             SSSG S     ++    I +    + AP        R S   +  P+YL+DY C 
Subjt:  IKFTMEFKGIFIVYELNSLMILVFFMTNTPND------------HSSSGCSIEVADNSLGCSIEIGASDSNAP--------RHSLHTVKSPSYLQDYDCA

Query:  LLHGDSLPAYYT-KHPLQHHVSYSRLSTSH
        LL G S  A  +  +P+ +++SY  LS SH
Subjt:  LLHGDSLPAYYT-KHPLQHHVSYSRLSTSH

XP_022154919.1 uncharacterized protein LOC111022065 [Momordica charantia]5.9e-2241.57Show/hide
Query:  MDPPPTINKAFSLVNQEEQQRSI---NVTSPVPAAFSSSFAMAIQHSSHVNKPGSKPTSNYKQRKERSMCTHCGLLGHTVDRCYKLHSYPLGYKFQGQRS
        M+P PTIN+AF+LV QE QQRSI   +VTSP  +A  ++       S+  N   +  ++++ +RK++S+CTHCG+ GHTVD+CYKLH YP GY     RS
Subjt:  MDPPPTINKAFSLVNQEEQQRSI---NVTSPVPAAFSSSFAMAIQHSSHVNKPGSKPTSNYKQRKERSMCTHCGLLGHTVDRCYKLHSYPLGYKFQGQRS

Query:  SFQPHVNSNSASLHSPTSVPPTKFLDFQAMAVSPINSGSDM-ACQCQQLIQLLQSQLANSKATTSDDASASYMAESLF
        S     +SN+ S  S ++  P+K +      +S  NS + + A QCQ+L+ LLQS L  +K  + +D+  S++AE+ F
Subjt:  SFQPHVNSNSASLHSPTSVPPTKFLDFQAMAVSPINSGSDM-ACQCQQLIQLLQSQLANSKATTSDDASASYMAESLF

TrEMBL top hitse value%identityAlignment
A0A2N9I913 Uncharacterized protein6.6e-1927.87Show/hide
Query:  MDPPPTINKAFSLVNQEEQQRSI-NVTSPVPAAFSSSFAMAIQHSSHVNKPGSKPTSNYKQRKERSMCTHCGLLGHTVDRCYKLHSYPLGYKFQGQRSSF
        MDP P INK FSL+ QEE+QRSI ++   +   F  S A+  + S      G++ +  +K  KER  CTHCGLLGHTVD+CYKLH +P GYK +G+  + 
Subjt:  MDPPPTINKAFSLVNQEEQQRSI-NVTSPVPAAFSSSFAMAIQHSSHVNKPGSKPTSNYKQRKERSMCTHCGLLGHTVDRCYKLHSYPLGYKFQGQRSSF

Query:  --QPHVNSNSASLHSPTSVPPTKFLDFQAMAVSPINSGSDMACQCQQLIQLLQSQLANSKATTSDDASASYMAESL--FENMSRISPVVVNLPN------
          Q  +++  ++ H+             A  +SP+      A        +         +   D  +  +M  SL  F  ++ I    V LPN      
Subjt:  --QPHVNSNSASLHSPTSVPPTKFLDFQAMAVSPINSGSDMACQCQQLIQLLQSQLANSKATTSDDASASYMAESL--FENMSRISPVVVNLPN------

Query:  ------NIKFTMEFKGIFIVYELNSLMILVF---------------FMTNTP-----NDHSSSGCSIEVADNSLG------------------CSIEIGA
               +  ++    +  V   +  +I  F               ++ ++P     +  S+S  S    +N++                    S +   
Subjt:  ------NIKFTMEFKGIFIVYELNSLMILVF---------------FMTNTP-----NDHSSSGCSIEVADNSLG------------------CSIEIGA

Query:  SDSNAP-------RHSLHTVKSPSYLQDYDCAL---LHGDSLPAYYTKHPLQHHVSYSRLSTSHCA
        S S++P       R S   VK PSYLQDY C+L   L    L +  T +P+QH +SYS+LS  H A
Subjt:  SDSNAP-------RHSLHTVKSPSYLQDYDCAL---LHGDSLPAYYTKHPLQHHVSYSRLSTSHCA

A0A5J4ZHF9 Uncharacterized protein7.8e-2030.31Show/hide
Query:  MDPPPTINKAFSLVNQEEQQRSINVTSPVPAAFSSSFAMAIQHSSHVNKPG------SKPTSNYKQRKERSMCTHCGLLGHTVDRCYKLHSYPLGYKFQG
        MDP P IN+ FSL+ QEEQQR    T+P   + +S+  MA    +   K G      S  +++  Q+++R  CTHC +LGHTVDRCYK+H YP GYKF+ 
Subjt:  MDPPPTINKAFSLVNQEEQQRSINVTSPVPAAFSSSFAMAIQHSSHVNKPG------SKPTSNYKQRKERSMCTHCGLLGHTVDRCYKLHSYPLGYKFQG

Query:  QRSSFQPHVNSNSASLHSPTSVPPTKFLDFQAMAVSPINSGSDMACQCQQLIQLLQSQLANSKATTS--DDASASYMAESLFENMSRISPVVVNLPNNIK
                 NSN+A+    TS   +   +     V  +NS      Q QQL+ +L + L+NS   T+  D +  + +A+ + ++   +  V+ +      
Subjt:  QRSSFQPHVNSNSASLHSPTSVPPTKFLDFQAMAVSPINSGSDMACQCQQLIQLLQSQLANSKATTS--DDASASYMAESLFENMSRISPVVVNLPNNIK

Query:  FTMEFKGIFIVYELNSLMILVFFMTNTPNDH----SSSGCSIEVADNSLGCSIEIGASDSNAP--------RHSLHTVKSPSYLQDYDCALLHGDSLPAY
        F  +     +     +  +      + P+ +    SSSG S     ++   +I I    + AP        R S      P+YL+DY C LL G    A 
Subjt:  FTMEFKGIFIVYELNSLMILVFFMTNTPNDH----SSSGCSIEVADNSLGCSIEIGASDSNAP--------RHSLHTVKSPSYLQDYDCALLHGDSLPAY

Query:  YT-KHPLQHHVSYSRLSTSH
         +  +P+ +++SY  LS SH
Subjt:  YT-KHPLQHHVSYSRLSTSH

A0A5J5C4X6 Retrotran_gag_3 domain-containing protein3.9e-1931.52Show/hide
Query:  MDPPPTINKAFSLVNQEEQQRSINVTSPVPAAFSSSFAMAIQHSSHVNKPG---------SKPTSNYKQRKERSMCTHCGLLGHTVDRCYKLHSYPLGYK
        MDP P IN+ FSL+ QEEQQR    T+P   + +S+  MA    + VNK G         S  +++  Q+K++  CTHC + GHTVDRCYK+H YP GYK
Subjt:  MDPPPTINKAFSLVNQEEQQRSINVTSPVPAAFSSSFAMAIQHSSHVNKPG---------SKPTSNYKQRKERSMCTHCGLLGHTVDRCYKLHSYPLGYK

Query:  FQGQRSSFQPHVNSNSASLHSPTSVPPTKFLDFQAMAVSPINSGSDMACQCQQLIQLLQSQLANS-KATTSDDASASYMAESLFENMSRISPVVVNLPNN
        F+          NSN+A+    TS       +        +NS      Q QQL+ +L + L++S K T + D S       L + ++   P +V LP++
Subjt:  FQGQRSSFQPHVNSNSASLHSPTSVPPTKFLDFQAMAVSPINSGSDMACQCQQLIQLLQSQLANS-KATTSDDASASYMAESLFENMSRISPVVVNLPNN

Query:  IKFTMEFKGIFIVYELNSLMILVFFMTNTPND------------HSSSGCSIEVADNSLGCSIEIGASDSNAP--------RHSLHTVKSPSYLQDYDCA
               +  FI+   +S +      +  P D             SSSG S     ++    I +    + AP        R S   +  P+YL+DY C 
Subjt:  IKFTMEFKGIFIVYELNSLMILVFFMTNTPND------------HSSSGCSIEVADNSLGCSIEIGASDSNAP--------RHSLHTVKSPSYLQDYDCA

Query:  LLHGDSLPAYYT-KHPLQHHVSYSRLSTSH
        LL G S  A  +  +P+ +++SY  LS SH
Subjt:  LLHGDSLPAYYT-KHPLQHHVSYSRLSTSH

A0A6J1DNP7 uncharacterized protein LOC1110220652.9e-2241.57Show/hide
Query:  MDPPPTINKAFSLVNQEEQQRSI---NVTSPVPAAFSSSFAMAIQHSSHVNKPGSKPTSNYKQRKERSMCTHCGLLGHTVDRCYKLHSYPLGYKFQGQRS
        M+P PTIN+AF+LV QE QQRSI   +VTSP  +A  ++       S+  N   +  ++++ +RK++S+CTHCG+ GHTVD+CYKLH YP GY     RS
Subjt:  MDPPPTINKAFSLVNQEEQQRSI---NVTSPVPAAFSSSFAMAIQHSSHVNKPGSKPTSNYKQRKERSMCTHCGLLGHTVDRCYKLHSYPLGYKFQGQRS

Query:  SFQPHVNSNSASLHSPTSVPPTKFLDFQAMAVSPINSGSDM-ACQCQQLIQLLQSQLANSKATTSDDASASYMAESLF
        S     +SN+ S  S ++  P+K +      +S  NS + + A QCQ+L+ LLQS L  +K  + +D+  S++AE+ F
Subjt:  SFQPHVNSNSASLHSPTSVPPTKFLDFQAMAVSPINSGSDM-ACQCQQLIQLLQSQLANSKATTSDDASASYMAESLF

A5BP26 Uncharacterized protein1.9e-2131.56Show/hide
Query:  MDPPPTINKAFSLVNQEEQQRSINVTSPVP----AAFSSSFAMAIQHSSHVNKPGSKPTSNYKQRKERSMCTHCGLLGHTVDRCYKLHSYPLGYKFQGQR
        M+P P I K FSLV Q+E+Q SIN     P    AA  S+  +AI          S    N K +K+R  C+HCG+LGHTVD+CYKL+ YP GYKF+   
Subjt:  MDPPPTINKAFSLVNQEEQQRSINVTSPVP----AAFSSSFAMAIQHSSHVNKPGSKPTSNYKQRKERSMCTHCGLLGHTVDRCYKLHSYPLGYKFQGQR

Query:  SSFQPHVNSNSASLHSPTSVPPTKFLDFQAMAVSPINSGSDMACQCQQLIQLLQSQLANSKATT----------SDDASASYMAESLFENMSRISPVVVN
         S  PH  + +    S T+       +  A A SP+ S S    QCQQLI LL SQL ++   T          S  +S   ++   F N    S  V++
Subjt:  SSFQPHVNSNSASLHSPTSVPPTKFLDFQAMAVSPINSGSDMACQCQQLIQLLQSQLANSKATT----------SDDASASYMAESLFENMSRISPVVVN

Query:  LP------------NNIKFTME-FKGIFI-------VYELNSLMILVFFMTNT----------PNDHSSSGCSIEVADNSLGCSIEIGASDSNAPR----
                      ++ KF     K +F+        Y+L +L     F++             ND S    S   +D  L   I I ++DS  P     
Subjt:  LP------------NNIKFTME-FKGIFI-------VYELNSLMILVFFMTNT----------PNDHSSSGCSIEVADNSLGCSIEIGASDSNAPR----

Query:  -----HSLHTVKSPSYLQDYDCALLHGDSLPAYYTKHPLQHHVSYSRLSTSHCALEDSHSRRRQLQNSLAEIFKPTT
             H   + ++PSYLQDY C+     S  +  T HPL   + Y +LST H            L N+++  F+PTT
Subjt:  -----HSLHTVKSPSYLQDYDCALLHGDSLPAYYTKHPLQHHVSYSRLSTSHCALEDSHSRRRQLQNSLAEIFKPTT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACCCGCCGCCGACCATTAATAAAGCCTTTTCACTGGTGAATCAAGAAGAGCAACAGCGATCGATTAATGTTACTTCACCTGTTCCTGCTGCCTTTTCTTCTTCGTT
TGCTATGGCTATTCAACATTCATCTCATGTCAACAAGCCTGGTTCTAAGCCTACTTCTAATTATAAGCAGAGAAAGGAACGCTCTATGTGCACTCATTGTGGCCTCTTAG
GGCACACTGTTGATAGGTGTTATAAACTTCACAGCTATCCTCTCGGATATAAATTCCAAGGTCAACGATCGTCTTTTCAACCTCATGTGAATTCTAATTCTGCTAGTTTG
CATTCTCCGACTTCTGTGCCTCCGACAAAGTTCTTGGATTTTCAAGCTATGGCGGTTTCTCCGATTAATTCTGGATCAGATATGGCTTGTCAGTGCCAACAACTGATTCA
ATTGCTTCAGTCTCAGTTGGCCAATTCTAAAGCCACAACTTCTGATGATGCATCTGCTTCTTATATGGCAGAATCATTATTTGAAAACATGTCTCGCATTTCTCCCGTGG
TTGTCAACTTACCGAATAATATAAAGTTTACTATGGAGTTCAAAGGGATTTTCATCGTCTATGAGTTGAATTCTCTAATGATTCTTGTCTTCTTCATGACAAACACTCCA
AATGATCATTCCTCTTCTGGTTGTTCAATTGAGGTTGCTGATAATTCTCTTGGTTGCTCAATTGAAATTGGTGCTTCTGATAGCAATGCTCCGAGGCACTCCTTGCACAC
GGTTAAATCGCCTTCCTACCTCCAAGATTACGATTGTGCACTATTGCACGGAGATTCTTTACCAGCTTATTATACCAAGCATCCTTTACAACATCATGTCTCCTATTCCC
GTTTATCAACCAGTCATTGTGCCTTAGAAGACAGCCACAGCCGCAGAAGACAGTTGCAGAACAGTCTTGCCGAAATCTTTAAGCCCACGACGCCACAGAAGATAGTCACC
GACGACACCGAGAATTTCCATCACGCTGCTGCTCCCATCTCCCGTTGTTTCCATCCCGCCGCCTCTCCCATCCAGCGGCCGCTCCCATCCCACCGCCGCTCCCATCATGC
TATCGCCAAGACTGTTTCTCTTGTGTGTGAGGAAGCAGAGCGTTTTGATGAAAGTGAGGCCTACTGTAATGTTCCCATTATAGTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGACCCGCCGCCGACCATTAATAAAGCCTTTTCACTGGTGAATCAAGAAGAGCAACAGCGATCGATTAATGTTACTTCACCTGTTCCTGCTGCCTTTTCTTCTTCGTT
TGCTATGGCTATTCAACATTCATCTCATGTCAACAAGCCTGGTTCTAAGCCTACTTCTAATTATAAGCAGAGAAAGGAACGCTCTATGTGCACTCATTGTGGCCTCTTAG
GGCACACTGTTGATAGGTGTTATAAACTTCACAGCTATCCTCTCGGATATAAATTCCAAGGTCAACGATCGTCTTTTCAACCTCATGTGAATTCTAATTCTGCTAGTTTG
CATTCTCCGACTTCTGTGCCTCCGACAAAGTTCTTGGATTTTCAAGCTATGGCGGTTTCTCCGATTAATTCTGGATCAGATATGGCTTGTCAGTGCCAACAACTGATTCA
ATTGCTTCAGTCTCAGTTGGCCAATTCTAAAGCCACAACTTCTGATGATGCATCTGCTTCTTATATGGCAGAATCATTATTTGAAAACATGTCTCGCATTTCTCCCGTGG
TTGTCAACTTACCGAATAATATAAAGTTTACTATGGAGTTCAAAGGGATTTTCATCGTCTATGAGTTGAATTCTCTAATGATTCTTGTCTTCTTCATGACAAACACTCCA
AATGATCATTCCTCTTCTGGTTGTTCAATTGAGGTTGCTGATAATTCTCTTGGTTGCTCAATTGAAATTGGTGCTTCTGATAGCAATGCTCCGAGGCACTCCTTGCACAC
GGTTAAATCGCCTTCCTACCTCCAAGATTACGATTGTGCACTATTGCACGGAGATTCTTTACCAGCTTATTATACCAAGCATCCTTTACAACATCATGTCTCCTATTCCC
GTTTATCAACCAGTCATTGTGCCTTAGAAGACAGCCACAGCCGCAGAAGACAGTTGCAGAACAGTCTTGCCGAAATCTTTAAGCCCACGACGCCACAGAAGATAGTCACC
GACGACACCGAGAATTTCCATCACGCTGCTGCTCCCATCTCCCGTTGTTTCCATCCCGCCGCCTCTCCCATCCAGCGGCCGCTCCCATCCCACCGCCGCTCCCATCATGC
TATCGCCAAGACTGTTTCTCTTGTGTGTGAGGAAGCAGAGCGTTTTGATGAAAGTGAGGCCTACTGTAATGTTCCCATTATAGTGTAA
Protein sequenceShow/hide protein sequence
MDPPPTINKAFSLVNQEEQQRSINVTSPVPAAFSSSFAMAIQHSSHVNKPGSKPTSNYKQRKERSMCTHCGLLGHTVDRCYKLHSYPLGYKFQGQRSSFQPHVNSNSASL
HSPTSVPPTKFLDFQAMAVSPINSGSDMACQCQQLIQLLQSQLANSKATTSDDASASYMAESLFENMSRISPVVVNLPNNIKFTMEFKGIFIVYELNSLMILVFFMTNTP
NDHSSSGCSIEVADNSLGCSIEIGASDSNAPRHSLHTVKSPSYLQDYDCALLHGDSLPAYYTKHPLQHHVSYSRLSTSHCALEDSHSRRRQLQNSLAEIFKPTTPQKIVT
DDTENFHHAAAPISRCFHPAASPIQRPLPSHRRSHHAIAKTVSLVCEEAERFDESEAYCNVPIIV