; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc10g19960 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc10g19960
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrotran_gag_3 domain-containing protein
Genome locationchr10:14734571..14735383
RNA-Seq ExpressionMoc10g19960
SyntenyMoc10g19960
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0049700.1 T4.5 [Cucumis melo var. makuwa]1.4e-6263.73Show/hide
Query:  MWKFQLAAILKAHKLYGFIDGSTPKPAKFLVSSSDSLSSTSPIVNPAFSDWIAKDHALMTLLNATLSPSALAYMVGCDSSQQVWQTLVKYYSSSSRTNVV
        +WKFQL AILKAHKLYGFIDG+ P P +   ++S S S+  P  NP++ DWIAKD ALMT++NATLSP ALAY+VG  SS+QVW  L K YSS SR+NVV
Subjt:  MWKFQLAAILKAHKLYGFIDGSTPKPAKFLVSSSDSLSSTSPIVNPAFSDWIAKDHALMTLLNATLSPSALAYMVGCDSSQQVWQTLVKYYSSSSRTNVV

Query:  NLKSNLQSISKKPGESIDLYMQRIKELKDKLANVSVLVDNEDLLIYTLNGLPPEFNAFCTSMCTRSQSVSFEELYVLLVYEEAAIDKQTKHDEVFVQSST
        NLKS+LQ+I KKP ESID Y++RIKE+KDKLANVS  ++ EDLLIY LNGLP E+N F TSM TRSQ V+FEEL+VLL  EE+A+ KQ+K D+ + Q + 
Subjt:  NLKSNLQSISKKPGESIDLYMQRIKELKDKLANVSVLVDNEDLLIYTLNGLPPEFNAFCTSMCTRSQSVSFEELYVLLVYEEAAIDKQTKHDEVFVQSST

Query:  LLAN
        LL++
Subjt:  LLAN

XP_008448007.1 PREDICTED: uncharacterized protein LOC103490319 isoform X2 [Cucumis melo]1.4e-6263.73Show/hide
Query:  MWKFQLAAILKAHKLYGFIDGSTPKPAKFLVSSSDSLSSTSPIVNPAFSDWIAKDHALMTLLNATLSPSALAYMVGCDSSQQVWQTLVKYYSSSSRTNVV
        +WKFQL AILKAHKLYGFIDG+ P P +   ++S S S+  P  NP++ DWIAKD ALMT++NATLSP ALAY+VG  SS+QVW  L K YSS SR+NVV
Subjt:  MWKFQLAAILKAHKLYGFIDGSTPKPAKFLVSSSDSLSSTSPIVNPAFSDWIAKDHALMTLLNATLSPSALAYMVGCDSSQQVWQTLVKYYSSSSRTNVV

Query:  NLKSNLQSISKKPGESIDLYMQRIKELKDKLANVSVLVDNEDLLIYTLNGLPPEFNAFCTSMCTRSQSVSFEELYVLLVYEEAAIDKQTKHDEVFVQSST
        NLKS+LQ+I KKP ESID Y++RIKE+KDKLANVS  ++ EDLLIY LNGLP E+N F TSM TRSQ V+FEEL+VLL  EE+A+ KQ+K D+ + Q + 
Subjt:  NLKSNLQSISKKPGESIDLYMQRIKELKDKLANVSVLVDNEDLLIYTLNGLPPEFNAFCTSMCTRSQSVSFEELYVLLVYEEAAIDKQTKHDEVFVQSST

Query:  LLAN
        LL++
Subjt:  LLAN

XP_008448008.1 PREDICTED: uncharacterized protein LOC103490319 isoform X3 [Cucumis melo]1.4e-6263.73Show/hide
Query:  MWKFQLAAILKAHKLYGFIDGSTPKPAKFLVSSSDSLSSTSPIVNPAFSDWIAKDHALMTLLNATLSPSALAYMVGCDSSQQVWQTLVKYYSSSSRTNVV
        +WKFQL AILKAHKLYGFIDG+ P P +   ++S S S+  P  NP++ DWIAKD ALMT++NATLSP ALAY+VG  SS+QVW  L K YSS SR+NVV
Subjt:  MWKFQLAAILKAHKLYGFIDGSTPKPAKFLVSSSDSLSSTSPIVNPAFSDWIAKDHALMTLLNATLSPSALAYMVGCDSSQQVWQTLVKYYSSSSRTNVV

Query:  NLKSNLQSISKKPGESIDLYMQRIKELKDKLANVSVLVDNEDLLIYTLNGLPPEFNAFCTSMCTRSQSVSFEELYVLLVYEEAAIDKQTKHDEVFVQSST
        NLKS+LQ+I KKP ESID Y++RIKE+KDKLANVS  ++ EDLLIY LNGLP E+N F TSM TRSQ V+FEEL+VLL  EE+A+ KQ+K D+ + Q + 
Subjt:  NLKSNLQSISKKPGESIDLYMQRIKELKDKLANVSVLVDNEDLLIYTLNGLPPEFNAFCTSMCTRSQSVSFEELYVLLVYEEAAIDKQTKHDEVFVQSST

Query:  LLAN
        LL++
Subjt:  LLAN

XP_022158689.1 uncharacterized protein LOC111025150 [Momordica charantia]3.6e-7985.56Show/hide
Query:  MWKFQLAAILKAHKLYGFIDGSTPKPAKFLVSSSDSLSSTSPIVNPAFSDWIAKDHALMTLLNATLSPSALAYMVGCDSSQQVWQTLVKYYSSSSRTNVV
        +WKFQL AILKAHKLYGFIDGSTPKPA+FLVS  D  SS  P  NPAFS+WIAKDHALMTLLNA LS SALAY+VGCDSSQQVWQTLVK+YSSSSRTNVV
Subjt:  MWKFQLAAILKAHKLYGFIDGSTPKPAKFLVSSSDSLSSTSPIVNPAFSDWIAKDHALMTLLNATLSPSALAYMVGCDSSQQVWQTLVKYYSSSSRTNVV

Query:  NLKSNLQSISKKPGESIDLYMQRIKELKDKLANVSVLVDNEDLLIYTLNGLPPEFNAFCTSMCTRSQSVSFEELYVLLVYEEAAIDK
        NLKS+LQSISKKPG SID Y+QRIKELKDKLANV VLVDNEDLLIYTLN LPPEFN F TSM TRSQSVSFEEL+VLLV EEAAIDK
Subjt:  NLKSNLQSISKKPGESIDLYMQRIKELKDKLANVSVLVDNEDLLIYTLNGLPPEFNAFCTSMCTRSQSVSFEELYVLLVYEEAAIDK

XP_022159298.1 uncharacterized protein LOC111025709 [Momordica charantia]8.4e-145100Show/hide
Query:  MWKFQLAAILKAHKLYGFIDGSTPKPAKFLVSSSDSLSSTSPIVNPAFSDWIAKDHALMTLLNATLSPSALAYMVGCDSSQQVWQTLVKYYSSSSRTNVV
        MWKFQLAAILKAHKLYGFIDGSTPKPAKFLVSSSDSLSSTSPIVNPAFSDWIAKDHALMTLLNATLSPSALAYMVGCDSSQQVWQTLVKYYSSSSRTNVV
Subjt:  MWKFQLAAILKAHKLYGFIDGSTPKPAKFLVSSSDSLSSTSPIVNPAFSDWIAKDHALMTLLNATLSPSALAYMVGCDSSQQVWQTLVKYYSSSSRTNVV

Query:  NLKSNLQSISKKPGESIDLYMQRIKELKDKLANVSVLVDNEDLLIYTLNGLPPEFNAFCTSMCTRSQSVSFEELYVLLVYEEAAIDKQTKHDEVFVQSST
        NLKSNLQSISKKPGESIDLYMQRIKELKDKLANVSVLVDNEDLLIYTLNGLPPEFNAFCTSMCTRSQSVSFEELYVLLVYEEAAIDKQTKHDEVFVQSST
Subjt:  NLKSNLQSISKKPGESIDLYMQRIKELKDKLANVSVLVDNEDLLIYTLNGLPPEFNAFCTSMCTRSQSVSFEELYVLLVYEEAAIDKQTKHDEVFVQSST

Query:  LLANMATKGQNSNPNIPRVRGFNGGGKGKFPNPNNRSKISSLGTTGGRSRGFSGTAIPFEIQSNQSSSDT
        LLANMATKGQNSNPNIPRVRGFNGGGKGKFPNPNNRSKISSLGTTGGRSRGFSGTAIPFEIQSNQSSSDT
Subjt:  LLANMATKGQNSNPNIPRVRGFNGGGKGKFPNPNNRSKISSLGTTGGRSRGFSGTAIPFEIQSNQSSSDT

TrEMBL top hitse value%identityAlignment
A0A1S3BI58 uncharacterized protein LOC103490319 isoform X26.7e-6363.73Show/hide
Query:  MWKFQLAAILKAHKLYGFIDGSTPKPAKFLVSSSDSLSSTSPIVNPAFSDWIAKDHALMTLLNATLSPSALAYMVGCDSSQQVWQTLVKYYSSSSRTNVV
        +WKFQL AILKAHKLYGFIDG+ P P +   ++S S S+  P  NP++ DWIAKD ALMT++NATLSP ALAY+VG  SS+QVW  L K YSS SR+NVV
Subjt:  MWKFQLAAILKAHKLYGFIDGSTPKPAKFLVSSSDSLSSTSPIVNPAFSDWIAKDHALMTLLNATLSPSALAYMVGCDSSQQVWQTLVKYYSSSSRTNVV

Query:  NLKSNLQSISKKPGESIDLYMQRIKELKDKLANVSVLVDNEDLLIYTLNGLPPEFNAFCTSMCTRSQSVSFEELYVLLVYEEAAIDKQTKHDEVFVQSST
        NLKS+LQ+I KKP ESID Y++RIKE+KDKLANVS  ++ EDLLIY LNGLP E+N F TSM TRSQ V+FEEL+VLL  EE+A+ KQ+K D+ + Q + 
Subjt:  NLKSNLQSISKKPGESIDLYMQRIKELKDKLANVSVLVDNEDLLIYTLNGLPPEFNAFCTSMCTRSQSVSFEELYVLLVYEEAAIDKQTKHDEVFVQSST

Query:  LLAN
        LL++
Subjt:  LLAN

A0A1S4DWT9 uncharacterized protein LOC103490319 isoform X16.7e-6363.73Show/hide
Query:  MWKFQLAAILKAHKLYGFIDGSTPKPAKFLVSSSDSLSSTSPIVNPAFSDWIAKDHALMTLLNATLSPSALAYMVGCDSSQQVWQTLVKYYSSSSRTNVV
        +WKFQL AILKAHKLYGFIDG+ P P +   ++S S S+  P  NP++ DWIAKD ALMT++NATLSP ALAY+VG  SS+QVW  L K YSS SR+NVV
Subjt:  MWKFQLAAILKAHKLYGFIDGSTPKPAKFLVSSSDSLSSTSPIVNPAFSDWIAKDHALMTLLNATLSPSALAYMVGCDSSQQVWQTLVKYYSSSSRTNVV

Query:  NLKSNLQSISKKPGESIDLYMQRIKELKDKLANVSVLVDNEDLLIYTLNGLPPEFNAFCTSMCTRSQSVSFEELYVLLVYEEAAIDKQTKHDEVFVQSST
        NLKS+LQ+I KKP ESID Y++RIKE+KDKLANVS  ++ EDLLIY LNGLP E+N F TSM TRSQ V+FEEL+VLL  EE+A+ KQ+K D+ + Q + 
Subjt:  NLKSNLQSISKKPGESIDLYMQRIKELKDKLANVSVLVDNEDLLIYTLNGLPPEFNAFCTSMCTRSQSVSFEELYVLLVYEEAAIDKQTKHDEVFVQSST

Query:  LLAN
        LL++
Subjt:  LLAN

A0A5D3CLI6 T4.56.7e-6363.73Show/hide
Query:  MWKFQLAAILKAHKLYGFIDGSTPKPAKFLVSSSDSLSSTSPIVNPAFSDWIAKDHALMTLLNATLSPSALAYMVGCDSSQQVWQTLVKYYSSSSRTNVV
        +WKFQL AILKAHKLYGFIDG+ P P +   ++S S S+  P  NP++ DWIAKD ALMT++NATLSP ALAY+VG  SS+QVW  L K YSS SR+NVV
Subjt:  MWKFQLAAILKAHKLYGFIDGSTPKPAKFLVSSSDSLSSTSPIVNPAFSDWIAKDHALMTLLNATLSPSALAYMVGCDSSQQVWQTLVKYYSSSSRTNVV

Query:  NLKSNLQSISKKPGESIDLYMQRIKELKDKLANVSVLVDNEDLLIYTLNGLPPEFNAFCTSMCTRSQSVSFEELYVLLVYEEAAIDKQTKHDEVFVQSST
        NLKS+LQ+I KKP ESID Y++RIKE+KDKLANVS  ++ EDLLIY LNGLP E+N F TSM TRSQ V+FEEL+VLL  EE+A+ KQ+K D+ + Q + 
Subjt:  NLKSNLQSISKKPGESIDLYMQRIKELKDKLANVSVLVDNEDLLIYTLNGLPPEFNAFCTSMCTRSQSVSFEELYVLLVYEEAAIDKQTKHDEVFVQSST

Query:  LLAN
        LL++
Subjt:  LLAN

A0A6J1DYF1 uncharacterized protein LOC1110257094.1e-145100Show/hide
Query:  MWKFQLAAILKAHKLYGFIDGSTPKPAKFLVSSSDSLSSTSPIVNPAFSDWIAKDHALMTLLNATLSPSALAYMVGCDSSQQVWQTLVKYYSSSSRTNVV
        MWKFQLAAILKAHKLYGFIDGSTPKPAKFLVSSSDSLSSTSPIVNPAFSDWIAKDHALMTLLNATLSPSALAYMVGCDSSQQVWQTLVKYYSSSSRTNVV
Subjt:  MWKFQLAAILKAHKLYGFIDGSTPKPAKFLVSSSDSLSSTSPIVNPAFSDWIAKDHALMTLLNATLSPSALAYMVGCDSSQQVWQTLVKYYSSSSRTNVV

Query:  NLKSNLQSISKKPGESIDLYMQRIKELKDKLANVSVLVDNEDLLIYTLNGLPPEFNAFCTSMCTRSQSVSFEELYVLLVYEEAAIDKQTKHDEVFVQSST
        NLKSNLQSISKKPGESIDLYMQRIKELKDKLANVSVLVDNEDLLIYTLNGLPPEFNAFCTSMCTRSQSVSFEELYVLLVYEEAAIDKQTKHDEVFVQSST
Subjt:  NLKSNLQSISKKPGESIDLYMQRIKELKDKLANVSVLVDNEDLLIYTLNGLPPEFNAFCTSMCTRSQSVSFEELYVLLVYEEAAIDKQTKHDEVFVQSST

Query:  LLANMATKGQNSNPNIPRVRGFNGGGKGKFPNPNNRSKISSLGTTGGRSRGFSGTAIPFEIQSNQSSSDT
        LLANMATKGQNSNPNIPRVRGFNGGGKGKFPNPNNRSKISSLGTTGGRSRGFSGTAIPFEIQSNQSSSDT
Subjt:  LLANMATKGQNSNPNIPRVRGFNGGGKGKFPNPNNRSKISSLGTTGGRSRGFSGTAIPFEIQSNQSSSDT

A0A6J1E049 uncharacterized protein LOC1110251501.7e-7985.56Show/hide
Query:  MWKFQLAAILKAHKLYGFIDGSTPKPAKFLVSSSDSLSSTSPIVNPAFSDWIAKDHALMTLLNATLSPSALAYMVGCDSSQQVWQTLVKYYSSSSRTNVV
        +WKFQL AILKAHKLYGFIDGSTPKPA+FLVS  D  SS  P  NPAFS+WIAKDHALMTLLNA LS SALAY+VGCDSSQQVWQTLVK+YSSSSRTNVV
Subjt:  MWKFQLAAILKAHKLYGFIDGSTPKPAKFLVSSSDSLSSTSPIVNPAFSDWIAKDHALMTLLNATLSPSALAYMVGCDSSQQVWQTLVKYYSSSSRTNVV

Query:  NLKSNLQSISKKPGESIDLYMQRIKELKDKLANVSVLVDNEDLLIYTLNGLPPEFNAFCTSMCTRSQSVSFEELYVLLVYEEAAIDK
        NLKS+LQSISKKPG SID Y+QRIKELKDKLANV VLVDNEDLLIYTLN LPPEFN F TSM TRSQSVSFEEL+VLLV EEAAIDK
Subjt:  NLKSNLQSISKKPGESIDLYMQRIKELKDKLANVSVLVDNEDLLIYTLNGLPPEFNAFCTSMCTRSQSVSFEELYVLLVYEEAAIDK

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.5e-0622.08Show/hide
Query:  DWIAKDHALMTLLNATLSPSALAYMVGCDSSQQVWQTLVKYYSSSSRTNVVNLKSNLQSISKKPGESIDLYMQRIKELKDKLANVSVLVDNEDLLIYTLN
        DW   D    + +   LS   +  ++  D+++ +W  L   Y S + TN + LK  L ++    G +   ++     L  +LAN+ V ++ ED  I  LN
Subjt:  DWIAKDHALMTLLNATLSPSALAYMVGCDSSQQVWQTLVKYYSSSSRTNVVNLKSNLQSISKKPGESIDLYMQRIKELKDKLANVSVLVDNEDLLIYTLN

Query:  GLPPEFNAFCTSMCTRSQSVSFEELYVLLVYEEAAIDKQTKHDEVFV---------QSSTLLANMATKGQNSNPNIPRVRG-FNGGGKGKF----PNPNN
         LP  ++   T++     ++  +++   L+  E    K     +  +         +SS        +G++ N +  RVR  +N    G F    PNP  
Subjt:  GLPPEFNAFCTSMCTRSQSVSFEELYVLLVYEEAAIDKQTKHDEVFV---------QSSTLLANMATKGQNSNPNIPRVRG-FNGGGKGKF----PNPNN

Query:  RSKISSLGTTGGRSRGFSGTAIPFEIQSNQS
               G T G+    +  A+   +Q+N +
Subjt:  RSKISSLGTTGGRSRGFSGTAIPFEIQSNQS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.8e-0922.49Show/hide
Query:  MWKFQLAAILKAHKLYGFIDGSTPKPAKFLVSSSDSLSSTSPIVNPAFSDWIAKDHALMTLLNATLSPSALAYMVGCDSSQQVWQTLVKYYSSSSRTNVV
        MW  Q+ A+   ++L GF+DGSTP P   +   +D++    P VNP ++ W  +D  + + +   +S S    +    ++ Q+W+TL K Y++ S  +V 
Subjt:  MWKFQLAAILKAHKLYGFIDGSTPKPAKFLVSSSDSLSSTSPIVNPAFSDWIAKDHALMTLLNATLSPSALAYMVGCDSSQQVWQTLVKYYSSSSRTNVV

Query:  NLKSNLQSISKKPGESIDLYMQRIKELKDKLANVSVLVDNEDLLIYTLNGLPPEFNAFCTSMCTRSQSVSFEELYVLLVYEEAAIDKQTKHDEVFVQSST
         L+                ++ R     D+LA +   +D+++ +   L  LP ++      +  +    S  E++  L+  E+ +      + V + ++ 
Subjt:  NLKSNLQSISKKPGESIDLYMQRIKELKDKLANVSVLVDNEDLLIYTLNGLPPEFNAFCTSMCTRSQSVSFEELYVLLVYEEAAIDKQTKHDEVFVQSST

Query:  LLANMATKGQNSNPNIPRVRGFNGGGKGKFPNPNNRSKISSLGTTGGRS
        +        +N N         N G    + N NNRS      ++G RS
Subjt:  LLANMATKGQNSNPNIPRVRGFNGGGKGKFPNPNNRSKISSLGTTGGRS

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).4.1e-0419.84Show/hide
Query:  WKFQLAAILKAHKLYGFIDGSTPKPAKFLVSSSDSLSSTSPIVNPAFSDWIAKDHALMTLLNATLSPSALAYMVGCDSSQQVWQTLVKYYSSSSRTNVVN
        WK +  + L+  K +GFIDG+ PKP  F               +P +  W   +  +M  L  +++   L  ++  +++ ++W+ L + +       +  
Subjt:  WKFQLAAILKAHKLYGFIDGSTPKPAKFLVSSSDSLSSTSPIVNPAFSDWIAKDHALMTLLNATLSPSALAYMVGCDSSQQVWQTLVKYYSSSSRTNVVN

Query:  LKSNLQSISKKPGESIDLYMQRIKEL
        L+  L ++ ++ G+S++ Y  ++ ++
Subjt:  LKSNLQSISKKPGESIDLYMQRIKEL

AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)7.9e-0827.03Show/hide
Query:  DWIAKDHALMTLLNATLSPSAL-AYMVGCDSSQQVWQTLVKYYSSSSRTNVVNLKSNLQSISKKPGE-SIDLYMQRIKELKDKLANVSVLVDNEDLLIYT
        +W  +D  +   L  TL+P       V   +S+ +W  +   + ++     + L S L+  +K  G+  +  Y +++K+L D L NV V V + +L++Y 
Subjt:  DWIAKDHALMTLLNATLSPSAL-AYMVGCDSSQQVWQTLVKYYSSSSRTNVVNLKSNLQSISKKPGE-SIDLYMQRIKELKDKLANVSVLVDNEDLLIYT

Query:  LNGLPPEFNAFCTSMCTRSQSVSFEELYVLLVYEEAAIDKQTKHDEVFV---QSSTLLA-NMATKGQNSNPNIPRVRGFNGGGKG
        LNGL P+F+     +  R    SF++   +L  EE  + +  K +   V    SST+LA + A    N   +     G+ G G+G
Subjt:  LNGLPPEFNAFCTSMCTRSQSVSFEELYVLLVYEEAAIDKQTKHDEVFV---QSSTLLA-NMATKGQNSNPNIPRVRGFNGGGKG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGGAAGTTTCAACTTGCAGCAATCCTTAAAGCCCATAAACTTTATGGCTTTATAGATGGATCGACACCGAAACCTGCTAAGTTCTTGGTTTCGTCTTCTGATAGTTT
GTCATCTACTTCGCCTATTGTGAATCCGGCTTTTAGCGACTGGATCGCCAAAGATCATGCTCTCATGACTCTTCTGAATGCTACTCTGTCACCATCGGCTCTTGCATATA
TGGTTGGGTGTGATTCGTCTCAACAGGTTTGGCAAACCTTGGTGAAGTACTACTCATCCTCGTCCAGAACAAATGTGGTAAATTTGAAGTCAAATCTTCAGTCTATCAGC
AAGAAACCTGGTGAATCCATTGATCTCTATATGCAGCGAATTAAAGAACTGAAGGACAAACTTGCAAATGTATCTGTTCTTGTTGACAACGAAGATCTGCTCATCTACAC
TCTCAATGGTCTACCACCCGAATTCAATGCATTTTGTACTTCCATGTGCACTCGCTCTCAATCTGTATCATTTGAAGAGCTCTATGTCCTATTAGTTTATGAGGAAGCAG
CGATTGATAAACAGACCAAGCACGATGAAGTCTTTGTTCAGTCTTCTACTCTCCTTGCGAACATGGCTACGAAAGGTCAGAATTCTAACCCTAATATTCCACGTGTTCGA
GGTTTCAATGGTGGTGGTAAAGGAAAATTCCCTAATCCTAATAATCGATCCAAAATATCCTCTCTGGGTACCACTGGTGGTAGAAGCCGTGGCTTTTCAGGTACTGCTAT
ACCATTTGAAATTCAATCGAATCAGTCGTCTTCTGATACTTAA
mRNA sequenceShow/hide mRNA sequence
ATGTGGAAGTTTCAACTTGCAGCAATCCTTAAAGCCCATAAACTTTATGGCTTTATAGATGGATCGACACCGAAACCTGCTAAGTTCTTGGTTTCGTCTTCTGATAGTTT
GTCATCTACTTCGCCTATTGTGAATCCGGCTTTTAGCGACTGGATCGCCAAAGATCATGCTCTCATGACTCTTCTGAATGCTACTCTGTCACCATCGGCTCTTGCATATA
TGGTTGGGTGTGATTCGTCTCAACAGGTTTGGCAAACCTTGGTGAAGTACTACTCATCCTCGTCCAGAACAAATGTGGTAAATTTGAAGTCAAATCTTCAGTCTATCAGC
AAGAAACCTGGTGAATCCATTGATCTCTATATGCAGCGAATTAAAGAACTGAAGGACAAACTTGCAAATGTATCTGTTCTTGTTGACAACGAAGATCTGCTCATCTACAC
TCTCAATGGTCTACCACCCGAATTCAATGCATTTTGTACTTCCATGTGCACTCGCTCTCAATCTGTATCATTTGAAGAGCTCTATGTCCTATTAGTTTATGAGGAAGCAG
CGATTGATAAACAGACCAAGCACGATGAAGTCTTTGTTCAGTCTTCTACTCTCCTTGCGAACATGGCTACGAAAGGTCAGAATTCTAACCCTAATATTCCACGTGTTCGA
GGTTTCAATGGTGGTGGTAAAGGAAAATTCCCTAATCCTAATAATCGATCCAAAATATCCTCTCTGGGTACCACTGGTGGTAGAAGCCGTGGCTTTTCAGGTACTGCTAT
ACCATTTGAAATTCAATCGAATCAGTCGTCTTCTGATACTTAA
Protein sequenceShow/hide protein sequence
MWKFQLAAILKAHKLYGFIDGSTPKPAKFLVSSSDSLSSTSPIVNPAFSDWIAKDHALMTLLNATLSPSALAYMVGCDSSQQVWQTLVKYYSSSSRTNVVNLKSNLQSIS
KKPGESIDLYMQRIKELKDKLANVSVLVDNEDLLIYTLNGLPPEFNAFCTSMCTRSQSVSFEELYVLLVYEEAAIDKQTKHDEVFVQSSTLLANMATKGQNSNPNIPRVR
GFNGGGKGKFPNPNNRSKISSLGTTGGRSRGFSGTAIPFEIQSNQSSSDT