; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc05g25580 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc05g25580
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionLOW QUALITY PROTEIN: uncharacterized protein LOC110412945
Genome locationchr5:18113856..18115246
RNA-Seq ExpressionMoc05g25580
SyntenyMoc05g25580
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022157438.1 uncharacterized protein LOC111024136 [Momordica charantia]2.2e-11566.38Show/hide
Query:  MIHIVGQFHGHATEVPHQHLKFFMGVWNSFKDEGLSKGVLRLKLFSYSLRGEARTWLESLSSEYITSWDDLTEKFLMKYFLPSKRLFQRCPYHGIPGSIQ
        M+  VG+FHGHATE PHQHLKF MGV NSFKDEGLSK V+RLKLF +SLR EARTWLESL SE ITSWDDL EKFLMKYF P+K                
Subjt:  MIHIVGQFHGHATEVPHQHLKFFMGVWNSFKDEGLSKGVLRLKLFSYSLRGEARTWLESLSSEYITSWDDLTEKFLMKYFLPSKRLFQRCPYHGIPGSIQ

Query:  IETYYKGLDNATRLVIDASPNGALLVKPYAKALNILERISSSNHSWSDHRAIEGKSSKELVESESYTTLNSKIEILTDLVIRSMTQQSSAGALVGKANVN
            Y+   N  +     S N        A+A NILERISS+NHSW D +A++GKSSK LVESESYTTLNSKIE LTDLV+RS+TQQS AGA VG  NVN
Subjt:  IETYYKGLDNATRLVIDASPNGALLVKPYAKALNILERISSSNHSWSDHRAIEGKSSKELVESESYTTLNSKIEILTDLVIRSMTQQSSAGALVGKANVN

Query:  QIQGISCSFCKGDHHYNNCPGNPD--GNQGGHNTGISNAPTFQQKVSYPPGFAYQGQMVEHNQSKGSITSLENIMKQYMANNDATVQSQAASLRNLELQV
        QIQGISCSF +GDHHYNNCPGNP+  GNQGGHN G SNAP FQQK                 QS+GS  SLE +MKQYMANNDATV+ Q + LRNLELQV
Subjt:  QIQGISCSFCKGDHHYNNCPGNPD--GNQGGHNTGISNAPTFQQKVSYPPGFAYQGQMVEHNQSKGSITSLENIMKQYMANNDATVQSQAASLRNLELQV

Query:  GQLAMDLKSRPVGALPSDTEVPKRDSKEQCNALTLRSGKALPPTHPNAPTLTKE
        GQLA DL SRP+GALPSDTEVPKRD KEQC ALTL SGKALPP H NAP L+KE
Subjt:  GQLAMDLKSRPVGALPSDTEVPKRDSKEQCNALTLRSGKALPPTHPNAPTLTKE

XP_022158598.1 uncharacterized protein LOC111025053 [Momordica charantia]7.0e-21090Show/hide
Query:  MIHIVGQFHGHATEVPHQHLKFFMGVWNSFKDEGLSKGVLRLKLFSYSLRGEARTWLESLSSEYITSWDDLTEKFLMKYFLPSKRLFQRCPYHGIPGSIQ
        MIHIVGQFHGHATEVPHQHLKFFMGVWNSFKDEGLSKGVLRLKLFSYSLRGEARTWLESLSSEYITSWDDLTEKFLMKYFLPSKRLFQRCPYHGIPGSIQ
Subjt:  MIHIVGQFHGHATEVPHQHLKFFMGVWNSFKDEGLSKGVLRLKLFSYSLRGEARTWLESLSSEYITSWDDLTEKFLMKYFLPSKRLFQRCPYHGIPGSIQ

Query:  IETYYKGLDNATRLVIDASPNGALLVKPYAKALNILERISSSNHSWSDHRAIEGKSSKELVESESYTTLNSKIEILTDLVIRSMTQQSSAGALVGKANVN
        IETYYKGLDNATRLVIDASPNGALLVKPYAKALNILERISSSNHSWSDHRAIEGKSSKELVESESYTTLNSKIEILTDL  R+ +  ++    +      
Subjt:  IETYYKGLDNATRLVIDASPNGALLVKPYAKALNILERISSSNHSWSDHRAIEGKSSKELVESESYTTLNSKIEILTDLVIRSMTQQSSAGALVGKANVN

Query:  QIQGISCSFCKGDHHYNNCPGNPDGNQGGHNTGISNAPTFQQKVSYPPGFAYQGQMVEHNQSKGSITSLENIMKQYMANNDATVQSQAASLRNLELQVGQ
                        N+      GNQGGHNTGISNAPTFQQKVSYPPGFAYQGQMVEHNQSKGSITSLENIMKQYMANNDATVQSQAASLRNLELQVGQ
Subjt:  QIQGISCSFCKGDHHYNNCPGNPDGNQGGHNTGISNAPTFQQKVSYPPGFAYQGQMVEHNQSKGSITSLENIMKQYMANNDATVQSQAASLRNLELQVGQ

Query:  LAMDLKSRPVGALPSDTEVPKRDSKEQCNALTLRSGKALPPTHPNAPTLTKEPAQIVQGEPQSEQDSEPAEVVVPIPPEQIAEQPKEGQNTSKQSVNPVI
        LAMDLKSRPVGALPSDTEVPKRDSKEQCNALTLRSGKALPPTHPNAPTLTKEPAQIVQGEPQSEQDSEPAEVVVPIPPEQIAEQPKEGQNTSKQSVNPVI
Subjt:  LAMDLKSRPVGALPSDTEVPKRDSKEQCNALTLRSGKALPPTHPNAPTLTKEPAQIVQGEPQSEQDSEPAEVVVPIPPEQIAEQPKEGQNTSKQSVNPVI

Query:  ARAPETGSSQNIIPEKESRQSIKVLWQSTG
        ARAPETGSSQNIIPEKESRQSIKVLWQSTG
Subjt:  ARAPETGSSQNIIPEKESRQSIKVLWQSTG

XP_022158611.1 uncharacterized protein LOC111025065 [Momordica charantia]2.6e-10359.19Show/hide
Query:  MIHIVGQFHGHATEVPHQHLKFFMGVWNSFKDEGLSKGVLRLKLFSYSLRGEARTWLESLSSEYITSWDDLTEKFLMKYFLPS-----------------
        M+  V QFHGHATE PHQHLKFFMGV NSFK+EGLS  VLRLKLF YSLR EARTWLESL  E ITSWDDL EKFLMKYF PS                 
Subjt:  MIHIVGQFHGHATEVPHQHLKFFMGVWNSFKDEGLSKGVLRLKLFSYSLRGEARTWLESLSSEYITSWDDLTEKFLMKYFLPS-----------------

Query:  ----------KRLFQRCPYHGIPGSIQIETYYKGLDNATRLVIDASPNGALLVKPYAKALNILERISSSNHSWSDHRAIEGKSSKELVESESYTTLNSKI
                  KRL Q CP+HGIP  IQIETYYK L++ATRL                                 D RA++GKSSK LVESESYTTLNS I
Subjt:  ----------KRLFQRCPYHGIPGSIQIETYYKGLDNATRLVIDASPNGALLVKPYAKALNILERISSSNHSWSDHRAIEGKSSKELVESESYTTLNSKI

Query:  EILTDLVIRSMTQQSSAGALVGKANVNQIQGISCSFCKGDHHYNNCPGNPD---------------------------------GNQGGHNTGISNAPTF
        E LT LV+RSM QQSS GAL G ANVNQIQGISCSFC+GDHHYNNCPGNP+                                 G+QGGHN G S+AP F
Subjt:  EILTDLVIRSMTQQSSAGALVGKANVNQIQGISCSFCKGDHHYNNCPGNPD---------------------------------GNQGGHNTGISNAPTF

Query:  QQKVSYPPGFAYQGQMVEHNQSKGSITSLENIMKQYMANNDATVQSQAASLRNLELQVGQLAMDLKSRPV
        Q KVSYPPGF  QGQMV   QS+GSI SLE +MKQYMANNDATVQSQA SLRNL+LQVGQLA DLKS+P+
Subjt:  QQKVSYPPGFAYQGQMVEHNQSKGSITSLENIMKQYMANNDATVQSQAASLRNLELQVGQLAMDLKSRPV

XP_022159060.1 uncharacterized protein LOC111025500 [Momordica charantia]1.4e-7761.71Show/hide
Query:  KRLFQRCPYHGIPGSIQIETYYKGLDNATRLVIDASPNGALLVKPYAKALNILERISSSNHSWSDHRAIEGKSSKELVESESYTTLNSKIEILTDLVIRS
        KRL Q+  Y GIP  IQI+TYY GLD+ATRLVIDAS NGALL KPYA+A NILERISS+N SWSD RAI GK SK   ESES+T LN KIE LTDLV+RS
Subjt:  KRLFQRCPYHGIPGSIQIETYYKGLDNATRLVIDASPNGALLVKPYAKALNILERISSSNHSWSDHRAIEGKSSKELVESESYTTLNSKIEILTDLVIRS

Query:  MTQQSSAGALVGKANVNQIQGISCSFCKGDHHYNNCPGNPDGNQGGHNTGISNAPTFQQKVSYPPGFAYQGQMVEHNQSKGSITSLENIMKQYMANNDAT
        MT QS+ GA  GKANV+ IQGISCSFC G++ YNNCPGNP+      N   +    +  + ++  G       VE           +  M +YM NND T
Subjt:  MTQQSSAGALVGKANVNQIQGISCSFCKGDHHYNNCPGNPDGNQGGHNTGISNAPTFQQKVSYPPGFAYQGQMVEHNQSKGSITSLENIMKQYMANNDAT

Query:  VQSQAASLRNLELQVGQLAMDLKSRPVGALPSDTEVPKRDSKEQCNALTLRSGKALPPTHPNAPTLTKE
        VQSQA SLRNLE+QVGQLA DLKS+P G LPSD +VPKRD KEQCNALTLRSGK LP  HPNA  + KE
Subjt:  VQSQAASLRNLELQVGQLAMDLKSRPVGALPSDTEVPKRDSKEQCNALTLRSGKALPPTHPNAPTLTKE

XP_030497803.1 uncharacterized protein LOC115713460 [Cannabis sativa]1.3e-7541.35Show/hide
Query:  MIHIVGQFHGHATEVPHQHLKFFMGVWNSFKDEGLSKGVLRLKLFSYSLRGEARTWLESLSSEYITSWDDLTEKFLMKYFLPS-----------------
        M+  VGQF G  TE PH H++ F+ V +SFK +G+S+  LRLKLF +SLR  AR WL +L  + +T+W+DL EKFL KYF P+                 
Subjt:  MIHIVGQFHGHATEVPHQHLKFFMGVWNSFKDEGLSKGVLRLKLFSYSLRGEARTWLESLSSEYITSWDDLTEKFLMKYFLPS-----------------

Query:  ----------KRLFQRCPYHGIPGSIQIETYYKGLDNATRLVIDASPNGALLVKPYAKALNILERISSSNHSWSDHRAIEGKSSKELVESESYTTLNSKI
                  K L ++CP+HGIP  IQ+ET+Y GL+ A+R+V+DAS NGA+L K Y +A  ILERI+S+N+ WS +RA   +    ++E ++ T L +++
Subjt:  ----------KRLFQRCPYHGIPGSIQIETYYKGLDNATRLVIDASPNGALLVKPYAKALNILERISSSNHSWSDHRAIEGKSSKELVESESYTTLNSKI

Query:  EILTDLVIRSMTQQSSAGALVGKANVNQIQGISCSFCKGDHHYNNCPGNPD-----GNQ----------GGHNTGISNAPTF----QQKVSYPPGFAYQG
          +T+++     +  + G  V  A   Q    SC +C   H + NCP N       GNQ            +N    + P F    Q K S+PPGF+ Q 
Subjt:  EILTDLVIRSMTQQSSAGALVGKANVNQIQGISCSFCKGDHHYNNCPGNPD-----GNQ----------GGHNTGISNAPTF----QQKVSYPPGFAYQG

Query:  QMVEHNQSKGSIT-SLENIMKQYMANNDATVQSQAASLRNLELQVGQLAMDLKSRPVGALPSDTEVPKRDSKEQCNALTLRSGKALPPTHPNAPTLTKEP
        +  + +Q +GS T SLE++M+ YMA ND  +QSQAASLRNLE+Q+GQLA DLK+RP G LPSDTE P+RD KE C A+TLRSGK +      A T +KEP
Subjt:  QMVEHNQSKGSIT-SLENIMKQYMANNDATVQSQAASLRNLELQVGQLAMDLKSRPVGALPSDTEVPKRDSKEQCNALTLRSGKALPPTHPNAPTLTKEP

Query:  AQIVQGEPQSEQDSEPAEVVVPIPPEQIAEQPKEGQNTSKQSVNP
        + I   + + E   +PA   V IPP   A         S Q   P
Subjt:  AQIVQGEPQSEQDSEPAEVVVPIPPEQIAEQPKEGQNTSKQSVNP

TrEMBL top hitse value%identityAlignment
A0A6J1DRG1 uncharacterized protein LOC1110236695.4e-7562Show/hide
Query:  KRLFQRCPYHGIPGSIQIETYYKGLDNATRLVIDASPNGALLVKPYAKALNILERISSSNHSWSDHRAIEGKSSKELVESESYTTLNSKIEILTDLVIRS
        K+L Q+CP+HGIP  IQIE YYKGLD+ATRLVIDAS NGALLVKPYA+A NILERISS+NHSWSD RAI+G+  K L ESESY  LNSK+E LT+LV+RS
Subjt:  KRLFQRCPYHGIPGSIQIETYYKGLDNATRLVIDASPNGALLVKPYAKALNILERISSSNHSWSDHRAIEGKSSKELVESESYTTLNSKIEILTDLVIRS

Query:  MTQQSSAGALVGKANVNQIQGISCSFCKGDHHYNNCPGNPD---------------------------------GNQGGHNTGISNAPTFQQKVSYPPGF
        MTQQ++ GA  GKANV+ IQGISCSFC+G+HHYNN P NP+                                 GNQGG+N G SNAP +QQK SYPP F
Subjt:  MTQQSSAGALVGKANVNQIQGISCSFCKGDHHYNNCPGNPD---------------------------------GNQGGHNTGISNAPTFQQKVSYPPGF

Query:  AYQGQMVEHNQSKGSITSLENIMKQYMANNDATVQSQAASLRNLELQVGQ
        + QGQ+     S+GS  SLEN+MK+ M  ND TVQSQAASLRNLE+QVGQ
Subjt:  AYQGQMVEHNQSKGSITSLENIMKQYMANNDATVQSQAASLRNLELQVGQ

A0A6J1DTD1 uncharacterized protein LOC1110241361.1e-11566.38Show/hide
Query:  MIHIVGQFHGHATEVPHQHLKFFMGVWNSFKDEGLSKGVLRLKLFSYSLRGEARTWLESLSSEYITSWDDLTEKFLMKYFLPSKRLFQRCPYHGIPGSIQ
        M+  VG+FHGHATE PHQHLKF MGV NSFKDEGLSK V+RLKLF +SLR EARTWLESL SE ITSWDDL EKFLMKYF P+K                
Subjt:  MIHIVGQFHGHATEVPHQHLKFFMGVWNSFKDEGLSKGVLRLKLFSYSLRGEARTWLESLSSEYITSWDDLTEKFLMKYFLPSKRLFQRCPYHGIPGSIQ

Query:  IETYYKGLDNATRLVIDASPNGALLVKPYAKALNILERISSSNHSWSDHRAIEGKSSKELVESESYTTLNSKIEILTDLVIRSMTQQSSAGALVGKANVN
            Y+   N  +     S N        A+A NILERISS+NHSW D +A++GKSSK LVESESYTTLNSKIE LTDLV+RS+TQQS AGA VG  NVN
Subjt:  IETYYKGLDNATRLVIDASPNGALLVKPYAKALNILERISSSNHSWSDHRAIEGKSSKELVESESYTTLNSKIEILTDLVIRSMTQQSSAGALVGKANVN

Query:  QIQGISCSFCKGDHHYNNCPGNPD--GNQGGHNTGISNAPTFQQKVSYPPGFAYQGQMVEHNQSKGSITSLENIMKQYMANNDATVQSQAASLRNLELQV
        QIQGISCSF +GDHHYNNCPGNP+  GNQGGHN G SNAP FQQK                 QS+GS  SLE +MKQYMANNDATV+ Q + LRNLELQV
Subjt:  QIQGISCSFCKGDHHYNNCPGNPD--GNQGGHNTGISNAPTFQQKVSYPPGFAYQGQMVEHNQSKGSITSLENIMKQYMANNDATVQSQAASLRNLELQV

Query:  GQLAMDLKSRPVGALPSDTEVPKRDSKEQCNALTLRSGKALPPTHPNAPTLTKE
        GQLA DL SRP+GALPSDTEVPKRD KEQC ALTL SGKALPP H NAP L+KE
Subjt:  GQLAMDLKSRPVGALPSDTEVPKRDSKEQCNALTLRSGKALPPTHPNAPTLTKE

A0A6J1DWK1 uncharacterized protein LOC1110250533.4e-21090Show/hide
Query:  MIHIVGQFHGHATEVPHQHLKFFMGVWNSFKDEGLSKGVLRLKLFSYSLRGEARTWLESLSSEYITSWDDLTEKFLMKYFLPSKRLFQRCPYHGIPGSIQ
        MIHIVGQFHGHATEVPHQHLKFFMGVWNSFKDEGLSKGVLRLKLFSYSLRGEARTWLESLSSEYITSWDDLTEKFLMKYFLPSKRLFQRCPYHGIPGSIQ
Subjt:  MIHIVGQFHGHATEVPHQHLKFFMGVWNSFKDEGLSKGVLRLKLFSYSLRGEARTWLESLSSEYITSWDDLTEKFLMKYFLPSKRLFQRCPYHGIPGSIQ

Query:  IETYYKGLDNATRLVIDASPNGALLVKPYAKALNILERISSSNHSWSDHRAIEGKSSKELVESESYTTLNSKIEILTDLVIRSMTQQSSAGALVGKANVN
        IETYYKGLDNATRLVIDASPNGALLVKPYAKALNILERISSSNHSWSDHRAIEGKSSKELVESESYTTLNSKIEILTDL  R+ +  ++    +      
Subjt:  IETYYKGLDNATRLVIDASPNGALLVKPYAKALNILERISSSNHSWSDHRAIEGKSSKELVESESYTTLNSKIEILTDLVIRSMTQQSSAGALVGKANVN

Query:  QIQGISCSFCKGDHHYNNCPGNPDGNQGGHNTGISNAPTFQQKVSYPPGFAYQGQMVEHNQSKGSITSLENIMKQYMANNDATVQSQAASLRNLELQVGQ
                        N+      GNQGGHNTGISNAPTFQQKVSYPPGFAYQGQMVEHNQSKGSITSLENIMKQYMANNDATVQSQAASLRNLELQVGQ
Subjt:  QIQGISCSFCKGDHHYNNCPGNPDGNQGGHNTGISNAPTFQQKVSYPPGFAYQGQMVEHNQSKGSITSLENIMKQYMANNDATVQSQAASLRNLELQVGQ

Query:  LAMDLKSRPVGALPSDTEVPKRDSKEQCNALTLRSGKALPPTHPNAPTLTKEPAQIVQGEPQSEQDSEPAEVVVPIPPEQIAEQPKEGQNTSKQSVNPVI
        LAMDLKSRPVGALPSDTEVPKRDSKEQCNALTLRSGKALPPTHPNAPTLTKEPAQIVQGEPQSEQDSEPAEVVVPIPPEQIAEQPKEGQNTSKQSVNPVI
Subjt:  LAMDLKSRPVGALPSDTEVPKRDSKEQCNALTLRSGKALPPTHPNAPTLTKEPAQIVQGEPQSEQDSEPAEVVVPIPPEQIAEQPKEGQNTSKQSVNPVI

Query:  ARAPETGSSQNIIPEKESRQSIKVLWQSTG
        ARAPETGSSQNIIPEKESRQSIKVLWQSTG
Subjt:  ARAPETGSSQNIIPEKESRQSIKVLWQSTG

A0A6J1DXK5 uncharacterized protein LOC1110255006.9e-7861.71Show/hide
Query:  KRLFQRCPYHGIPGSIQIETYYKGLDNATRLVIDASPNGALLVKPYAKALNILERISSSNHSWSDHRAIEGKSSKELVESESYTTLNSKIEILTDLVIRS
        KRL Q+  Y GIP  IQI+TYY GLD+ATRLVIDAS NGALL KPYA+A NILERISS+N SWSD RAI GK SK   ESES+T LN KIE LTDLV+RS
Subjt:  KRLFQRCPYHGIPGSIQIETYYKGLDNATRLVIDASPNGALLVKPYAKALNILERISSSNHSWSDHRAIEGKSSKELVESESYTTLNSKIEILTDLVIRS

Query:  MTQQSSAGALVGKANVNQIQGISCSFCKGDHHYNNCPGNPDGNQGGHNTGISNAPTFQQKVSYPPGFAYQGQMVEHNQSKGSITSLENIMKQYMANNDAT
        MT QS+ GA  GKANV+ IQGISCSFC G++ YNNCPGNP+      N   +    +  + ++  G       VE           +  M +YM NND T
Subjt:  MTQQSSAGALVGKANVNQIQGISCSFCKGDHHYNNCPGNPDGNQGGHNTGISNAPTFQQKVSYPPGFAYQGQMVEHNQSKGSITSLENIMKQYMANNDAT

Query:  VQSQAASLRNLELQVGQLAMDLKSRPVGALPSDTEVPKRDSKEQCNALTLRSGKALPPTHPNAPTLTKE
        VQSQA SLRNLE+QVGQLA DLKS+P G LPSD +VPKRD KEQCNALTLRSGK LP  HPNA  + KE
Subjt:  VQSQAASLRNLELQVGQLAMDLKSRPVGALPSDTEVPKRDSKEQCNALTLRSGKALPPTHPNAPTLTKE

A0A6J1E1F3 uncharacterized protein LOC1110250651.2e-10359.19Show/hide
Query:  MIHIVGQFHGHATEVPHQHLKFFMGVWNSFKDEGLSKGVLRLKLFSYSLRGEARTWLESLSSEYITSWDDLTEKFLMKYFLPS-----------------
        M+  V QFHGHATE PHQHLKFFMGV NSFK+EGLS  VLRLKLF YSLR EARTWLESL  E ITSWDDL EKFLMKYF PS                 
Subjt:  MIHIVGQFHGHATEVPHQHLKFFMGVWNSFKDEGLSKGVLRLKLFSYSLRGEARTWLESLSSEYITSWDDLTEKFLMKYFLPS-----------------

Query:  ----------KRLFQRCPYHGIPGSIQIETYYKGLDNATRLVIDASPNGALLVKPYAKALNILERISSSNHSWSDHRAIEGKSSKELVESESYTTLNSKI
                  KRL Q CP+HGIP  IQIETYYK L++ATRL                                 D RA++GKSSK LVESESYTTLNS I
Subjt:  ----------KRLFQRCPYHGIPGSIQIETYYKGLDNATRLVIDASPNGALLVKPYAKALNILERISSSNHSWSDHRAIEGKSSKELVESESYTTLNSKI

Query:  EILTDLVIRSMTQQSSAGALVGKANVNQIQGISCSFCKGDHHYNNCPGNPD---------------------------------GNQGGHNTGISNAPTF
        E LT LV+RSM QQSS GAL G ANVNQIQGISCSFC+GDHHYNNCPGNP+                                 G+QGGHN G S+AP F
Subjt:  EILTDLVIRSMTQQSSAGALVGKANVNQIQGISCSFCKGDHHYNNCPGNPD---------------------------------GNQGGHNTGISNAPTF

Query:  QQKVSYPPGFAYQGQMVEHNQSKGSITSLENIMKQYMANNDATVQSQAASLRNLELQVGQLAMDLKSRPV
        Q KVSYPPGF  QGQMV   QS+GSI SLE +MKQYMANNDATVQSQA SLRNL+LQVGQLA DLKS+P+
Subjt:  QQKVSYPPGFAYQGQMVEHNQSKGSITSLENIMKQYMANNDATVQSQAASLRNLELQVGQLAMDLKSRPV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCCACATAGTGGGTCAGTTTCATGGACATGCTACAGAGGTCCCGCATCAGCATCTGAAGTTCTTTATGGGAGTTTGGAATTCATTCAAGGATGAAGGATTGAGCAA
AGGAGTGTTGAGGCTTAAGCTATTTTCATATTCACTTAGAGGTGAAGCCAGAACATGGTTGGAGTCCCTTTCTTCAGAATATATTACAAGTTGGGATGACTTGACTGAGA
AGTTTTTGATGAAGTATTTCCTACCCAGCAAACGATTATTCCAAAGGTGTCCTTACCATGGGATTCCAGGAAGCATACAGATAGAGACATATTACAAAGGTTTGGATAAT
GCCACACGCTTAGTAATTGATGCATCCCCAAATGGGGCGTTGCTAGTAAAACCCTATGCTAAAGCACTCAATATTTTAGAAAGAATATCATCAAGCAATCACTCATGGTC
TGATCATAGAGCTATTGAAGGAAAATCAAGTAAGGAGCTAGTTGAGTCTGAATCATACACTACATTGAATTCAAAGATTGAGATTCTGACGGACTTGGTAATAAGGAGTA
TGACACAACAAAGTTCAGCTGGAGCATTAGTTGGTAAGGCTAATGTCAATCAAATTCAAGGGATTTCATGTTCTTTCTGCAAGGGAGATCACCATTACAACAACTGCCCT
GGAAATCCGGATGGCAATCAAGGAGGACATAACACTGGAATATCCAATGCTCCAACTTTTCAACAGAAAGTAAGTTATCCTCCTGGTTTTGCGTATCAAGGACAGATGGT
AGAACATAATCAATCAAAAGGATCAATTACATCATTAGAAAATATAATGAAGCAATACATGGCCAATAATGACGCCACTGTGCAAAGCCAGGCTGCATCTTTGAGGAACC
TAGAGTTGCAAGTAGGCCAGTTAGCTATGGATTTGAAGAGCAGACCGGTTGGAGCATTGCCCAGTGATACAGAAGTGCCAAAGAGAGACAGTAAGGAACAATGCAACGCC
CTCACTCTACGAAGTGGGAAAGCATTACCTCCAACACACCCGAACGCTCCAACATTGACCAAAGAGCCTGCTCAAATTGTCCAAGGAGAACCTCAGTCAGAACAGGACAG
TGAGCCAGCAGAAGTAGTCGTACCTATTCCACCAGAGCAAATAGCTGAACAACCAAAGGAGGGTCAAAACACATCCAAGCAATCAGTTAATCCAGTAATTGCTAGAGCAC
CTGAGACAGGGTCTTCACAGAACATAATACCCGAGAAAGAAAGCAGGCAGAGCATTAAAGTGCTCTGGCAGAGTACAGGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGATCCACATAGTGGGTCAGTTTCATGGACATGCTACAGAGGTCCCGCATCAGCATCTGAAGTTCTTTATGGGAGTTTGGAATTCATTCAAGGATGAAGGATTGAGCAA
AGGAGTGTTGAGGCTTAAGCTATTTTCATATTCACTTAGAGGTGAAGCCAGAACATGGTTGGAGTCCCTTTCTTCAGAATATATTACAAGTTGGGATGACTTGACTGAGA
AGTTTTTGATGAAGTATTTCCTACCCAGCAAACGATTATTCCAAAGGTGTCCTTACCATGGGATTCCAGGAAGCATACAGATAGAGACATATTACAAAGGTTTGGATAAT
GCCACACGCTTAGTAATTGATGCATCCCCAAATGGGGCGTTGCTAGTAAAACCCTATGCTAAAGCACTCAATATTTTAGAAAGAATATCATCAAGCAATCACTCATGGTC
TGATCATAGAGCTATTGAAGGAAAATCAAGTAAGGAGCTAGTTGAGTCTGAATCATACACTACATTGAATTCAAAGATTGAGATTCTGACGGACTTGGTAATAAGGAGTA
TGACACAACAAAGTTCAGCTGGAGCATTAGTTGGTAAGGCTAATGTCAATCAAATTCAAGGGATTTCATGTTCTTTCTGCAAGGGAGATCACCATTACAACAACTGCCCT
GGAAATCCGGATGGCAATCAAGGAGGACATAACACTGGAATATCCAATGCTCCAACTTTTCAACAGAAAGTAAGTTATCCTCCTGGTTTTGCGTATCAAGGACAGATGGT
AGAACATAATCAATCAAAAGGATCAATTACATCATTAGAAAATATAATGAAGCAATACATGGCCAATAATGACGCCACTGTGCAAAGCCAGGCTGCATCTTTGAGGAACC
TAGAGTTGCAAGTAGGCCAGTTAGCTATGGATTTGAAGAGCAGACCGGTTGGAGCATTGCCCAGTGATACAGAAGTGCCAAAGAGAGACAGTAAGGAACAATGCAACGCC
CTCACTCTACGAAGTGGGAAAGCATTACCTCCAACACACCCGAACGCTCCAACATTGACCAAAGAGCCTGCTCAAATTGTCCAAGGAGAACCTCAGTCAGAACAGGACAG
TGAGCCAGCAGAAGTAGTCGTACCTATTCCACCAGAGCAAATAGCTGAACAACCAAAGGAGGGTCAAAACACATCCAAGCAATCAGTTAATCCAGTAATTGCTAGAGCAC
CTGAGACAGGGTCTTCACAGAACATAATACCCGAGAAAGAAAGCAGGCAGAGCATTAAAGTGCTCTGGCAGAGTACAGGCTAG
Protein sequenceShow/hide protein sequence
MIHIVGQFHGHATEVPHQHLKFFMGVWNSFKDEGLSKGVLRLKLFSYSLRGEARTWLESLSSEYITSWDDLTEKFLMKYFLPSKRLFQRCPYHGIPGSIQIETYYKGLDN
ATRLVIDASPNGALLVKPYAKALNILERISSSNHSWSDHRAIEGKSSKELVESESYTTLNSKIEILTDLVIRSMTQQSSAGALVGKANVNQIQGISCSFCKGDHHYNNCP
GNPDGNQGGHNTGISNAPTFQQKVSYPPGFAYQGQMVEHNQSKGSITSLENIMKQYMANNDATVQSQAASLRNLELQVGQLAMDLKSRPVGALPSDTEVPKRDSKEQCNA
LTLRSGKALPPTHPNAPTLTKEPAQIVQGEPQSEQDSEPAEVVVPIPPEQIAEQPKEGQNTSKQSVNPVIARAPETGSSQNIIPEKESRQSIKVLWQSTG