; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g30580 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g30580
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr8:21927713..21932333
RNA-Seq ExpressionMoc08g30580
SyntenyMoc08g30580
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022150863.1 uncharacterized protein LOC111018910 [Momordica charantia]8.0e-10255.1Show/hide
Query:  MNPPNPNMRQPISPNVRIEEIVDRLLVVDDPEVAVPPFNVVLLADDIDREIRLYAAPTFYNFNPVIMEPKIAALKLELK--------------------D
        MNPPNPN+ QPI PNVRIEEIVD + V  + EV VP  NVVLLA  IDREIR YAAPTFYNFNPVI E +I A K ELK                    D
Subjt:  MNPPNPNMRQPISPNVRIEEIVDRLLVVDDPEVAVPPFNVVLLADDIDREIRLYAAPTFYNFNPVIMEPKIAALKLELK--------------------D

Query:  EARTWLESLPVESITSWDNLAGKFLMKYFPPRKNAKYRSEINNFQQFSGES-------------------------IEMYYNGLDNAKRLVIDASANGAL
        EARTWL SLP ESITSWD+LA  FLMKYFPP KNAKYRS+INNFQQF+GES                         IEMYYNGLD+A RLV   S N AL
Subjt:  EARTWLESLPVESITSWDNLAGKFLMKYFPPRKNAKYRSEINNFQQFSGES-------------------------IEMYYNGLDNAKRLVIDASANGAL

Query:  LAKPYDEAFNILERILSNNHSWSDPRAIQGRGSKGLNKSESYSALNSKIENLTDLVMRSMTQQNAAGASAGKANVCHIQGISCYFCKGEHHYNNCPGSSE
        LAKPY EAFNILERI SN HS SD RAIQGRG+K LN+S+SYS  NSKIEN+ DLV RSMTQQ+  GA  GKAN  H QG S  F  G HHYNNCPG+ E
Subjt:  LAKPYDEAFNILERILSNNHSWSDPRAIQGRGSKGLNKSESYSALNSKIENLTDLVMRSMTQQNAAGASAGKANVCHIQGISCYFCKGEHHYNNCPGSSE

Query:  SVYYLGNPLNSRNNSYSNMYNPGWKNHSNYVGAKIMEETMLLGSIAEESGNADGSVGNRSEGRLYGVLPSDIKVPKRDGKKQCNALTLRSGKTLPTAYPN
        SVY LGN  NSRNNSYSN YNPG +NH N    +I EE ML+  + +             +G +Y VL   I   +R+ +   N       +    A   
Subjt:  SVYYLGNPLNSRNNSYSNMYNPGWKNHSNYVGAKIMEETMLLGSIAEESGNADGSVGNRSEGRLYGVLPSDIKVPKRDGKKQCNALTLRSGKTLPTAYPN

Query:  VPRIHKELNQDE
          R+ K+  Q E
Subjt:  VPRIHKELNQDE

XP_022156835.1 uncharacterized protein LOC111023669 [Momordica charantia]1.3e-7070.24Show/hide
Query:  MKYFPPRKNAKYRSEINNFQQFSGES-------------------------IEMYYNGLDNAKRLVIDASANGALLAKPYDEAFNILERILSNNHSWSDP
        MKYFPPRKNAKYRSEI NFQQ   ES                         IE YY GLD+A RLVIDAS NGALL KPY EAFNILERI SNNHSWSDP
Subjt:  MKYFPPRKNAKYRSEINNFQQFSGES-------------------------IEMYYNGLDNAKRLVIDASANGALLAKPYDEAFNILERILSNNHSWSDP

Query:  RAIQGRGSKGLNKSESYSALNSKIENLTDLVMRSMTQQNAAGASAGKANVCHIQGISCYFCKGEHHYNNCPGSSESVYYLGNPLNSRNNSYSNMYNPGWK
        RAIQGRG KGLN+SESY ALNSK+ENLT+LVMRSMTQQN  GAS GKANV HIQGISC FC+GEHHYNN P + ESVYYLGN  N+  NSYSN YNPGW+
Subjt:  RAIQGRGSKGLNKSESYSALNSKIENLTDLVMRSMTQQNAAGASAGKANVCHIQGISCYFCKGEHHYNNCPGSSESVYYLGNPLNSRNNSYSNMYNPGWK

Query:  NHSNY
        NH N+
Subjt:  NHSNY

XP_022157438.1 uncharacterized protein LOC111024136 [Momordica charantia]7.6e-6851.6Show/hide
Query:  MEPKIAALKL---ELKDEARTWLESLPVESITSWDNLAGKFLMKYFPPRKNAKYRSEINNFQQFSGESIEMYYNGLDNAKRLVIDASANGALLAKPYDEA
        +  K+  LKL    L+DEARTWLESLP ESITSWD+LA KFLMKYFPP KNAKYR+EINNFQQF GE                   S N A       EA
Subjt:  MEPKIAALKL---ELKDEARTWLESLPVESITSWDNLAGKFLMKYFPPRKNAKYRSEINNFQQFSGESIEMYYNGLDNAKRLVIDASANGALLAKPYDEA

Query:  FNILERILSNNHSWSDPRAIQGRGSKGLNKSESYSALNSKIENLTDLVMRSMTQQNAAGASAGKANVCHIQGISCYFCKGEHHYNNCPGSSESVYYLGNP
        FNILERI SNNHSW DP+A+QG+ SK L +SESY+ LNSKIENLTDLVMRS+TQQ+ AGAS G  NV  IQGISC F +G+HHYNNCPG+ ES    GN 
Subjt:  FNILERILSNNHSWSDPRAIQGRGSKGLNKSESYSALNSKIENLTDLVMRSMTQQNAAGASAGKANVCHIQGISCYFCKGEHHYNNCPGSSESVYYLGNP

Query:  LNSRNNSYSNMYNPGWKNHSNYVGAKIMEETMLLGSIAEESGNAD-------------GSVGNRSEGRLYGVLPSDIKVPKRDGKKQCNALTLRSGKTLP
            N   SN   P ++      G+    E ++   +A      +             G +      R  G LPSD +VPKRDGK+QC ALTL SGK LP
Subjt:  LNSRNNSYSNMYNPGWKNHSNYVGAKIMEETMLLGSIAEESGNAD-------------GSVGNRSEGRLYGVLPSDIKVPKRDGKKQCNALTLRSGKTLP

Query:  TAYPNVPRIHKE
          + N P + KE
Subjt:  TAYPNVPRIHKE

XP_022158611.1 uncharacterized protein LOC111025065 [Momordica charantia]1.1e-6360.81Show/hide
Query:  KIAALKL---ELKDEARTWLESLPVESITSWDNLAGKFLMKYFPPRKNAKYRSEINNFQQFSGESIEMYYNGLDNAKRLVIDASANG---ALLAKPYDEA
        ++  LKL    L+DEARTWLESLP+ESITSWD+LA KFLMKYFPP KNAKYRSEINNFQQF+GES+       +  KRL+     +G    +  + Y + 
Subjt:  KIAALKL---ELKDEARTWLESLPVESITSWDNLAGKFLMKYFPPRKNAKYRSEINNFQQFSGESIEMYYNGLDNAKRLVIDASANG---ALLAKPYDEA

Query:  FNILERILSNNHSWSDPRAIQGRGSKGLNKSESYSALNSKIENLTDLVMRSMTQQNAAGASAGKANVCHIQGISCYFCKGEHHYNNCPGSSESVYYLGNP
         N   R+        DPRA+QG+ SKGL +SESY+ LNS IENLT LVMRSM QQ++ GA  G ANV  IQGISC FC+G+HHYNNCPG+ ESVYYLGNP
Subjt:  FNILERILSNNHSWSDPRAIQGRGSKGLNKSESYSALNSKIENLTDLVMRSMTQQNAAGASAGKANVCHIQGISCYFCKGEHHYNNCPGSSESVYYLGNP

Query:  LNSRNNSYSNMYNPGWKNHSNY
         N+RNN YSN YNPGW+NH N+
Subjt:  LNSRNNSYSNMYNPGWKNHSNY

XP_022158611.1 uncharacterized protein LOC111025065 [Momordica charantia]2.0e-0482.86Show/hide
Query:  ERNKEVSIILGRPFLATERALVHVHKGELTMRVQD
        + +KEV IILGRPFLAT RALV VHKGELTMRVQD
Subjt:  ERNKEVSIILGRPFLATERALVHVHKGELTMRVQD

XP_022159060.1 uncharacterized protein LOC111025500 [Momordica charantia]7.3e-8762.13Show/hide
Query:  MKYFPPRKNAKYRSEINNFQQFSGES-------------------------IEMYYNGLDNAKRLVIDASANGALLAKPYDEAFNILERILSNNHSWSDP
        MKYFPP KNAKYRSEINNFQQF GES                         I+ YYNGLD+A RLVIDASANGALLAKPY EAFNILERI SNN SWSDP
Subjt:  MKYFPPRKNAKYRSEINNFQQFSGES-------------------------IEMYYNGLDNAKRLVIDASANGALLAKPYDEAFNILERILSNNHSWSDP

Query:  RAIQGRGSKGLNKSESYSALNSKIENLTDLVMRSMTQQNAAGASAGKANVCHIQGISCYFCKGEHHYNNCPGSSESVYYLGNPLNSRNNSYSNMYNPGWK
        RAI G+GSKG N+SES++ALN KIENLTDLVMRSMT Q+  GASAGKANV HIQGISC FC GE+ YNNCPG+ ESV+YLGN  N+ NN YS        
Subjt:  RAIQGRGSKGLNKSESYSALNSKIENLTDLVMRSMTQQNAAGASAGKANVCHIQGISCYFCKGEHHYNNCPGSSESVYYLGNPLNSRNNSYSNMYNPGWK

Query:  NHSNYVGAKIMEETML---------LGSIAEESGNADGSVGNRS---EGRLYGVLPSDIKVPKRDGKKQCNALTLRSGKTLPTAYPNVPRIHKELNQDER
             V  +I +ETML         + S A    N +  VG  +   + +  GVLPSDIKVPKRDGK+QCNALTLRSGKTLPTA+PN   I KELNQDER
Subjt:  NHSNYVGAKIMEETML---------LGSIAEESGNADGSVGNRS---EGRLYGVLPSDIKVPKRDGKKQCNALTLRSGKTLPTAYPNVPRIHKELNQDER

Query:  N
        +
Subjt:  N

TrEMBL top hitse value%identityAlignment
A0A6J1DAK9 uncharacterized protein LOC1110189103.9e-10255.1Show/hide
Query:  MNPPNPNMRQPISPNVRIEEIVDRLLVVDDPEVAVPPFNVVLLADDIDREIRLYAAPTFYNFNPVIMEPKIAALKLELK--------------------D
        MNPPNPN+ QPI PNVRIEEIVD + V  + EV VP  NVVLLA  IDREIR YAAPTFYNFNPVI E +I A K ELK                    D
Subjt:  MNPPNPNMRQPISPNVRIEEIVDRLLVVDDPEVAVPPFNVVLLADDIDREIRLYAAPTFYNFNPVIMEPKIAALKLELK--------------------D

Query:  EARTWLESLPVESITSWDNLAGKFLMKYFPPRKNAKYRSEINNFQQFSGES-------------------------IEMYYNGLDNAKRLVIDASANGAL
        EARTWL SLP ESITSWD+LA  FLMKYFPP KNAKYRS+INNFQQF+GES                         IEMYYNGLD+A RLV   S N AL
Subjt:  EARTWLESLPVESITSWDNLAGKFLMKYFPPRKNAKYRSEINNFQQFSGES-------------------------IEMYYNGLDNAKRLVIDASANGAL

Query:  LAKPYDEAFNILERILSNNHSWSDPRAIQGRGSKGLNKSESYSALNSKIENLTDLVMRSMTQQNAAGASAGKANVCHIQGISCYFCKGEHHYNNCPGSSE
        LAKPY EAFNILERI SN HS SD RAIQGRG+K LN+S+SYS  NSKIEN+ DLV RSMTQQ+  GA  GKAN  H QG S  F  G HHYNNCPG+ E
Subjt:  LAKPYDEAFNILERILSNNHSWSDPRAIQGRGSKGLNKSESYSALNSKIENLTDLVMRSMTQQNAAGASAGKANVCHIQGISCYFCKGEHHYNNCPGSSE

Query:  SVYYLGNPLNSRNNSYSNMYNPGWKNHSNYVGAKIMEETMLLGSIAEESGNADGSVGNRSEGRLYGVLPSDIKVPKRDGKKQCNALTLRSGKTLPTAYPN
        SVY LGN  NSRNNSYSN YNPG +NH N    +I EE ML+  + +             +G +Y VL   I   +R+ +   N       +    A   
Subjt:  SVYYLGNPLNSRNNSYSNMYNPGWKNHSNYVGAKIMEETMLLGSIAEESGNADGSVGNRSEGRLYGVLPSDIKVPKRDGKKQCNALTLRSGKTLPTAYPN

Query:  VPRIHKELNQDE
          R+ K+  Q E
Subjt:  VPRIHKELNQDE

A0A6J1DRG1 uncharacterized protein LOC1110236696.1e-7170.24Show/hide
Query:  MKYFPPRKNAKYRSEINNFQQFSGES-------------------------IEMYYNGLDNAKRLVIDASANGALLAKPYDEAFNILERILSNNHSWSDP
        MKYFPPRKNAKYRSEI NFQQ   ES                         IE YY GLD+A RLVIDAS NGALL KPY EAFNILERI SNNHSWSDP
Subjt:  MKYFPPRKNAKYRSEINNFQQFSGES-------------------------IEMYYNGLDNAKRLVIDASANGALLAKPYDEAFNILERILSNNHSWSDP

Query:  RAIQGRGSKGLNKSESYSALNSKIENLTDLVMRSMTQQNAAGASAGKANVCHIQGISCYFCKGEHHYNNCPGSSESVYYLGNPLNSRNNSYSNMYNPGWK
        RAIQGRG KGLN+SESY ALNSK+ENLT+LVMRSMTQQN  GAS GKANV HIQGISC FC+GEHHYNN P + ESVYYLGN  N+  NSYSN YNPGW+
Subjt:  RAIQGRGSKGLNKSESYSALNSKIENLTDLVMRSMTQQNAAGASAGKANVCHIQGISCYFCKGEHHYNNCPGSSESVYYLGNPLNSRNNSYSNMYNPGWK

Query:  NHSNY
        NH N+
Subjt:  NHSNY

A0A6J1DTD1 uncharacterized protein LOC1110241363.7e-6851.6Show/hide
Query:  MEPKIAALKL---ELKDEARTWLESLPVESITSWDNLAGKFLMKYFPPRKNAKYRSEINNFQQFSGESIEMYYNGLDNAKRLVIDASANGALLAKPYDEA
        +  K+  LKL    L+DEARTWLESLP ESITSWD+LA KFLMKYFPP KNAKYR+EINNFQQF GE                   S N A       EA
Subjt:  MEPKIAALKL---ELKDEARTWLESLPVESITSWDNLAGKFLMKYFPPRKNAKYRSEINNFQQFSGESIEMYYNGLDNAKRLVIDASANGALLAKPYDEA

Query:  FNILERILSNNHSWSDPRAIQGRGSKGLNKSESYSALNSKIENLTDLVMRSMTQQNAAGASAGKANVCHIQGISCYFCKGEHHYNNCPGSSESVYYLGNP
        FNILERI SNNHSW DP+A+QG+ SK L +SESY+ LNSKIENLTDLVMRS+TQQ+ AGAS G  NV  IQGISC F +G+HHYNNCPG+ ES    GN 
Subjt:  FNILERILSNNHSWSDPRAIQGRGSKGLNKSESYSALNSKIENLTDLVMRSMTQQNAAGASAGKANVCHIQGISCYFCKGEHHYNNCPGSSESVYYLGNP

Query:  LNSRNNSYSNMYNPGWKNHSNYVGAKIMEETMLLGSIAEESGNAD-------------GSVGNRSEGRLYGVLPSDIKVPKRDGKKQCNALTLRSGKTLP
            N   SN   P ++      G+    E ++   +A      +             G +      R  G LPSD +VPKRDGK+QC ALTL SGK LP
Subjt:  LNSRNNSYSNMYNPGWKNHSNYVGAKIMEETMLLGSIAEESGNAD-------------GSVGNRSEGRLYGVLPSDIKVPKRDGKKQCNALTLRSGKTLP

Query:  TAYPNVPRIHKE
          + N P + KE
Subjt:  TAYPNVPRIHKE

A0A6J1DXK5 uncharacterized protein LOC1110255003.5e-8762.13Show/hide
Query:  MKYFPPRKNAKYRSEINNFQQFSGES-------------------------IEMYYNGLDNAKRLVIDASANGALLAKPYDEAFNILERILSNNHSWSDP
        MKYFPP KNAKYRSEINNFQQF GES                         I+ YYNGLD+A RLVIDASANGALLAKPY EAFNILERI SNN SWSDP
Subjt:  MKYFPPRKNAKYRSEINNFQQFSGES-------------------------IEMYYNGLDNAKRLVIDASANGALLAKPYDEAFNILERILSNNHSWSDP

Query:  RAIQGRGSKGLNKSESYSALNSKIENLTDLVMRSMTQQNAAGASAGKANVCHIQGISCYFCKGEHHYNNCPGSSESVYYLGNPLNSRNNSYSNMYNPGWK
        RAI G+GSKG N+SES++ALN KIENLTDLVMRSMT Q+  GASAGKANV HIQGISC FC GE+ YNNCPG+ ESV+YLGN  N+ NN YS        
Subjt:  RAIQGRGSKGLNKSESYSALNSKIENLTDLVMRSMTQQNAAGASAGKANVCHIQGISCYFCKGEHHYNNCPGSSESVYYLGNPLNSRNNSYSNMYNPGWK

Query:  NHSNYVGAKIMEETML---------LGSIAEESGNADGSVGNRS---EGRLYGVLPSDIKVPKRDGKKQCNALTLRSGKTLPTAYPNVPRIHKELNQDER
             V  +I +ETML         + S A    N +  VG  +   + +  GVLPSDIKVPKRDGK+QCNALTLRSGKTLPTA+PN   I KELNQDER
Subjt:  NHSNYVGAKIMEETML---------LGSIAEESGNADGSVGNRS---EGRLYGVLPSDIKVPKRDGKKQCNALTLRSGKTLPTAYPNVPRIHKELNQDER

Query:  N
        +
Subjt:  N

A0A6J1E1F3 uncharacterized protein LOC1110250655.5e-6460.81Show/hide
Query:  KIAALKL---ELKDEARTWLESLPVESITSWDNLAGKFLMKYFPPRKNAKYRSEINNFQQFSGESIEMYYNGLDNAKRLVIDASANG---ALLAKPYDEA
        ++  LKL    L+DEARTWLESLP+ESITSWD+LA KFLMKYFPP KNAKYRSEINNFQQF+GES+       +  KRL+     +G    +  + Y + 
Subjt:  KIAALKL---ELKDEARTWLESLPVESITSWDNLAGKFLMKYFPPRKNAKYRSEINNFQQFSGESIEMYYNGLDNAKRLVIDASANG---ALLAKPYDEA

Query:  FNILERILSNNHSWSDPRAIQGRGSKGLNKSESYSALNSKIENLTDLVMRSMTQQNAAGASAGKANVCHIQGISCYFCKGEHHYNNCPGSSESVYYLGNP
         N   R+        DPRA+QG+ SKGL +SESY+ LNS IENLT LVMRSM QQ++ GA  G ANV  IQGISC FC+G+HHYNNCPG+ ESVYYLGNP
Subjt:  FNILERILSNNHSWSDPRAIQGRGSKGLNKSESYSALNSKIENLTDLVMRSMTQQNAAGASAGKANVCHIQGISCYFCKGEHHYNNCPGSSESVYYLGNP

Query:  LNSRNNSYSNMYNPGWKNHSNY
         N+RNN YSN YNPGW+NH N+
Subjt:  LNSRNNSYSNMYNPGWKNHSNY

A0A6J1E1F3 uncharacterized protein LOC1110250659.8e-0582.86Show/hide
Query:  ERNKEVSIILGRPFLATERALVHVHKGELTMRVQD
        + +KEV IILGRPFLAT RALV VHKGELTMRVQD
Subjt:  ERNKEVSIILGRPFLATERALVHVHKGELTMRVQD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATCCGCCTAACCCAAATATGCGCCAGCCTATTTCACCAAATGTGAGGATTGAGGAAATAGTAGACAGGCTTCTTGTTGTTGATGACCCTGAGGTAGCAGTACCCCC
TTTCAATGTTGTATTACTGGCAGATGACATCGACAGAGAAATCAGGTTGTATGCAGCTCCGACATTTTATAATTTCAACCCAGTCATCATGGAGCCAAAAATTGCAGCCC
TAAAATTGGAACTTAAAGATGAAGCGAGGACTTGGTTGGAGTCACTCCCTGTAGAATCGATTACAAGCTGGGACAACTTGGCGGGAAAGTTTTTGATGAAGTACTTCCCG
CCAAGAAAAAATGCTAAGTACAGAAGCGAAATCAACAACTTTCAGCAATTTTCTGGGGAGTCAATCGAAATGTATTACAATGGTTTAGACAATGCGAAGCGCTTGGTCAT
CGATGCGTCAGCAAATGGGGCTTTGCTAGCAAAACCTTATGATGAAGCATTCAATATCTTAGAGAGGATATTGTCGAACAATCATTCATGGTCTGACCCTAGAGCCATCC
AAGGGAGAGGAAGCAAGGGATTGAACAAATCCGAGTCATACTCTGCATTGAATTCAAAGATTGAGAATCTGACAGACTTGGTTATGAGGAGTATGACGCAGCAAAACGCA
GCGGGAGCATCTGCCGGTAAAGCAAACGTCTGCCATATCCAAGGGATTTCTTGTTATTTTTGCAAAGGAGAGCATCATTACAACAATTGCCCTGGCAGTTCAGAGTCGGT
GTACTATTTGGGGAATCCACTAAACAGCAGAAACAACTCATATTCCAATATGTATAACCCCGGCTGGAAGAACCACTCTAATTATGTTGGAGCGAAAATCATGGAGGAAA
CAATGCTGTTAGGCAGCATCGCTGAGGAATCTGGAAATGCAGATGGGTCAGTTGGCAACAGATCTGAAGGTAGGCTTTATGGAGTGCTGCCCAGTGATATTAAGGTGCCT
AAAAGGGATGGGAAGAAACAATGTAATGCTCTGACATTGCGGAGTGGCAAGACATTGCCTACTGCTTATCCAAATGTTCCAAGAATCCATAAAGAGTTAAATCAAGATGA
AAGAAATAAGGAGGTTTCTATCATCCTTGGGAGGCCATTCCTTGCCACTGAAAGAGCACTCGTGCATGTCCATAAAGGAGAGCTGACAATGCGCGTTCAAGATCAAGAAG
TGAAGTTCCCAGTCTATGATTCCATGAAGTTTCCTACTAAATCAGAAGAATGCTCAGTGTTTAAGATTTTAGATGAAGCTTTAATGGAGGAATCAAGTGCAGAAGCAATG
CTGGAGCATAGTGTAAGTAGGGATGTTGGAGCTCAAGAAGGTTTTGGTGATAGGAATAAAGAGATGATGGTAAAGGAATATGCTACCGCGGAGATAGAGACCCTAGCACC
GGGAGAAGAGTATAAAAATGTTGAGCCGTCAACCTCGGTGGATAAAATGCACATCACTGCAGAGGAAAAGCTAGCAAAATTACCTAAAAAAGAGCCACCCGACATCAAAC
CGACTCTAAAAGGTTCTAAAGCAAGGACTAGAAGAAAAATAGAGCTGGAAGCATCATGCAGCAAAGAGAAAAAGGCAAAACCTTCCCAAGTCATTGGCCAAGCTATTTCA
CAGGAGCATGCCCCAGCCGCAAAGAGAGGGAAGAAGACAATAGATCCAGAGAGGAAGATCGACCCACGGGCGACTCGTTGGATGAGAAAGAAGGCAAAGAACATTGACAT
GTTTGACCTGGGAGAGGGAATTCAGTATGTGAATCTCCCTAAAAGTGATGATAAAAAGAAAGCTGTTAGCAATCCTGTGTTCTCAACATTTGCCTTTTACCCTGCCCCGG
TCATGTTTGCCTGCCCATGCCCTAGCATCCCTCTCGTAGTACTGGCCCCAGAGGTAATATGCATGCTCTTGCTTGGGAACAAGCAAGATTTTAAGTTTGGGGGTGTGATA
ATTCTTCAAAAGAGAGTTATTCTACTTAAATACAAGGTTTGGAAAAAGAAGACAAAGCATGAAGAGAGTGCTAAGCCCAGAGAAGAGTGGATGAAGCATGTTCAAGGCAA
ATCAGACGAAAGTTGGAGCGACGCAATGCGGGAGCATGGAATTACTGCGCAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAATCCGCCTAACCCAAATATGCGCCAGCCTATTTCACCAAATGTGAGGATTGAGGAAATAGTAGACAGGCTTCTTGTTGTTGATGACCCTGAGGTAGCAGTACCCCC
TTTCAATGTTGTATTACTGGCAGATGACATCGACAGAGAAATCAGGTTGTATGCAGCTCCGACATTTTATAATTTCAACCCAGTCATCATGGAGCCAAAAATTGCAGCCC
TAAAATTGGAACTTAAAGATGAAGCGAGGACTTGGTTGGAGTCACTCCCTGTAGAATCGATTACAAGCTGGGACAACTTGGCGGGAAAGTTTTTGATGAAGTACTTCCCG
CCAAGAAAAAATGCTAAGTACAGAAGCGAAATCAACAACTTTCAGCAATTTTCTGGGGAGTCAATCGAAATGTATTACAATGGTTTAGACAATGCGAAGCGCTTGGTCAT
CGATGCGTCAGCAAATGGGGCTTTGCTAGCAAAACCTTATGATGAAGCATTCAATATCTTAGAGAGGATATTGTCGAACAATCATTCATGGTCTGACCCTAGAGCCATCC
AAGGGAGAGGAAGCAAGGGATTGAACAAATCCGAGTCATACTCTGCATTGAATTCAAAGATTGAGAATCTGACAGACTTGGTTATGAGGAGTATGACGCAGCAAAACGCA
GCGGGAGCATCTGCCGGTAAAGCAAACGTCTGCCATATCCAAGGGATTTCTTGTTATTTTTGCAAAGGAGAGCATCATTACAACAATTGCCCTGGCAGTTCAGAGTCGGT
GTACTATTTGGGGAATCCACTAAACAGCAGAAACAACTCATATTCCAATATGTATAACCCCGGCTGGAAGAACCACTCTAATTATGTTGGAGCGAAAATCATGGAGGAAA
CAATGCTGTTAGGCAGCATCGCTGAGGAATCTGGAAATGCAGATGGGTCAGTTGGCAACAGATCTGAAGGTAGGCTTTATGGAGTGCTGCCCAGTGATATTAAGGTGCCT
AAAAGGGATGGGAAGAAACAATGTAATGCTCTGACATTGCGGAGTGGCAAGACATTGCCTACTGCTTATCCAAATGTTCCAAGAATCCATAAAGAGTTAAATCAAGATGA
AAGAAATAAGGAGGTTTCTATCATCCTTGGGAGGCCATTCCTTGCCACTGAAAGAGCACTCGTGCATGTCCATAAAGGAGAGCTGACAATGCGCGTTCAAGATCAAGAAG
TGAAGTTCCCAGTCTATGATTCCATGAAGTTTCCTACTAAATCAGAAGAATGCTCAGTGTTTAAGATTTTAGATGAAGCTTTAATGGAGGAATCAAGTGCAGAAGCAATG
CTGGAGCATAGTGTAAGTAGGGATGTTGGAGCTCAAGAAGGTTTTGGTGATAGGAATAAAGAGATGATGGTAAAGGAATATGCTACCGCGGAGATAGAGACCCTAGCACC
GGGAGAAGAGTATAAAAATGTTGAGCCGTCAACCTCGGTGGATAAAATGCACATCACTGCAGAGGAAAAGCTAGCAAAATTACCTAAAAAAGAGCCACCCGACATCAAAC
CGACTCTAAAAGGTTCTAAAGCAAGGACTAGAAGAAAAATAGAGCTGGAAGCATCATGCAGCAAAGAGAAAAAGGCAAAACCTTCCCAAGTCATTGGCCAAGCTATTTCA
CAGGAGCATGCCCCAGCCGCAAAGAGAGGGAAGAAGACAATAGATCCAGAGAGGAAGATCGACCCACGGGCGACTCGTTGGATGAGAAAGAAGGCAAAGAACATTGACAT
GTTTGACCTGGGAGAGGGAATTCAGTATGTGAATCTCCCTAAAAGTGATGATAAAAAGAAAGCTGTTAGCAATCCTGTGTTCTCAACATTTGCCTTTTACCCTGCCCCGG
TCATGTTTGCCTGCCCATGCCCTAGCATCCCTCTCGTAGTACTGGCCCCAGAGGTAATATGCATGCTCTTGCTTGGGAACAAGCAAGATTTTAAGTTTGGGGGTGTGATA
ATTCTTCAAAAGAGAGTTATTCTACTTAAATACAAGGTTTGGAAAAAGAAGACAAAGCATGAAGAGAGTGCTAAGCCCAGAGAAGAGTGGATGAAGCATGTTCAAGGCAA
ATCAGACGAAAGTTGGAGCGACGCAATGCGGGAGCATGGAATTACTGCGCAGTAA
Protein sequenceShow/hide protein sequence
MNPPNPNMRQPISPNVRIEEIVDRLLVVDDPEVAVPPFNVVLLADDIDREIRLYAAPTFYNFNPVIMEPKIAALKLELKDEARTWLESLPVESITSWDNLAGKFLMKYFP
PRKNAKYRSEINNFQQFSGESIEMYYNGLDNAKRLVIDASANGALLAKPYDEAFNILERILSNNHSWSDPRAIQGRGSKGLNKSESYSALNSKIENLTDLVMRSMTQQNA
AGASAGKANVCHIQGISCYFCKGEHHYNNCPGSSESVYYLGNPLNSRNNSYSNMYNPGWKNHSNYVGAKIMEETMLLGSIAEESGNADGSVGNRSEGRLYGVLPSDIKVP
KRDGKKQCNALTLRSGKTLPTAYPNVPRIHKELNQDERNKEVSIILGRPFLATERALVHVHKGELTMRVQDQEVKFPVYDSMKFPTKSEECSVFKILDEALMEESSAEAM
LEHSVSRDVGAQEGFGDRNKEMMVKEYATAEIETLAPGEEYKNVEPSTSVDKMHITAEEKLAKLPKKEPPDIKPTLKGSKARTRRKIELEASCSKEKKAKPSQVIGQAIS
QEHAPAAKRGKKTIDPERKIDPRATRWMRKKAKNIDMFDLGEGIQYVNLPKSDDKKKAVSNPVFSTFAFYPAPVMFACPCPSIPLVVLAPEVICMLLLGNKQDFKFGGVI
ILQKRVILLKYKVWKKKTKHEESAKPREEWMKHVQGKSDESWSDAMREHGITAQ