; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc11g26880 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc11g26880
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase
Genome locationchr11:19738367..19741795
RNA-Seq ExpressionMoc11g26880
SyntenyMoc11g26880
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022150853.1 uncharacterized protein LOC111018898 [Momordica charantia]1.6e-4233.63Show/hide
Query:  KIQNDQLMHFLDVLKQLHINIPLVKAIEQMSNYAKILNDILTKKRRFGEFETINSTKECSAILTDKLPQKIWDLGSFTIPISIGGKNVSHAICDLGVSIN
        K Q  Q   FL+VLKQ+HINIPLV+A+EQMS Y K L D+LTKK   GE ET+  TKECS ILT K+ +K+ D  SFTIP+SIGG  +  A+CDLG SIN
Subjt:  KIQNDQLMHFLDVLKQLHINIPLVKAIEQMSNYAKILNDILTKKRRFGEFETINSTKECSAILTDKLPQKIWDLGSFTIPISIGGKNVSHAICDLGVSIN

Query:  LVPLSVYQRLGIGEARPTIEIVDEVPVVAYPDVAVPSRNVVLLADDIDREIRAYAAPTFYNFNPVITEPEIEASKFELKPVMFQMLQTDEGLSKEVLRLK
        L+PLSVY+RLG+G  RPT                     V L   D     R+ A P                 + ++K VM Q                
Subjt:  LVPLSVYQRLGIGEARPTIEIVDEVPVVAYPDVAVPSRNVVLLADDIDREIRAYAAPTFYNFNPVITEPEIEASKFELKPVMFQMLQTDEGLSKEVLRLK

Query:  LFPYLLRHEARTWLESFLQNLLQVGMTWLKFLMKYFPPSKNAKYRSEINNFQQFAGESIETYYKGVDDATRLVIDASANGALLAKPYAEAFNILERISSN
                     ++ F+                 FP                               A  +V+D  A                      
Subjt:  LFPYLLRHEARTWLESFLQNLLQVGMTWLKFLMKYFPPSKNAKYRSEINNFQQFAGESIETYYKGVDDATRLVIDASANGALLAKPYAEAFNILERISSN

Query:  NHSWCDPRAVQGKSSKDKKIPIIHGRPFLATGRALILDEAIMEELETQAMLEHLEAVDAESLADASKEELEDTQSDCMNDNAGFVKRMYESLDLTNPELR
                        D+ +PII GRPFLATGRAL+               + +  + +   +     +  D       D A    R++ESLDL +   +
Subjt:  NHSWCDPRAVQGKSSKDKKIPIIHGRPFLATGRALILDEAIMEELETQAMLEHLEAVDAESLADASKEELEDTQSDCMNDNAGFVKRMYESLDLTNPELR

Query:  LQKPSIEEPSLLELKALSQHLKYAYLGSSETLPIIIAADLPLEKEQMLLNVLKAH
          KPSIEEP  LELK L +HLKYAYLG+S TLPIII ADL  EKE  LL VL+ H
Subjt:  LQKPSIEEPSLLELKALSQHLKYAYLGSSETLPIIIAADLPLEKEQMLLNVLKAH

XP_022150863.1 uncharacterized protein LOC111018910 [Momordica charantia]4.3e-5660.99Show/hide
Query:  EIVDEVPVVAYPDVAVPSRNVVLLADDIDREIRAYAAPTFYNFNPVITEPEIEASKFELKPVMFQMLQTDEGLSKEVLRLKLFPYLLRHEARTWLESFLQ
        EIVD VPV    +V VPS NVVLLA  IDREIRAYAAPTFYNFNPVITE EI A KFELK        +DEG +KEVLRLKLF + LR EARTWL S   
Subjt:  EIVDEVPVVAYPDVAVPSRNVVLLADDIDREIRAYAAPTFYNFNPVITEPEIEASKFELKPVMFQMLQTDEGLSKEVLRLKLFPYLLRHEARTWLESFLQ

Query:  NLLQVGMTWLK-FLMKYFPPSKNAKYRSEINNFQQFAGES-------------------------IETYYKGVDDATRLVIDASANGALLAKPYAEAFNI
          +       + FLMKYFPPSKNAKYRS+INNFQQF GES                         IE YY G+DDATRLV   S N ALLAKPYAEAFNI
Subjt:  NLLQVGMTWLK-FLMKYFPPSKNAKYRSEINNFQQFAGES-------------------------IETYYKGVDDATRLVIDASANGALLAKPYAEAFNI

Query:  LERISSNNHSWCDPRAVQGKSSK
        LERISSN HS  D RA+QG+ +K
Subjt:  LERISSNNHSWCDPRAVQGKSSK

XP_022159030.1 uncharacterized protein LOC111025474 [Momordica charantia]4.9e-5285.71Show/hide
Query:  MKIQNDQLMHFLDVLKQLHINIPLVKAIEQMSNYAKILNDILTKKRRFGEFETINSTKECSAILTDKLPQKIWDLGSFTIPISIGGKNVSHAICDLGVSI
        +K Q+DQL +FLD+LKQLHINIPLVKAIEQMSNYAKIL D LTKKRRFGEF+ I STKECSAILTDKLPQKIWD GSFTIPISIGGKNVSHAICDLGVSI
Subjt:  MKIQNDQLMHFLDVLKQLHINIPLVKAIEQMSNYAKILNDILTKKRRFGEFETINSTKECSAILTDKLPQKIWDLGSFTIPISIGGKNVSHAICDLGVSI

Query:  NLVPLSVYQRLGIGEARPTIEIVDEV
        NLVPLSVYQRLGIGEARPT+   +E+
Subjt:  NLVPLSVYQRLGIGEARPTIEIVDEV

XP_024028757.1 uncharacterized protein LOC112093792 [Morus notabilis]3.5e-4232.31Show/hide
Query:  QNDQLMHFLDVLKQLHINIPLVKAIEQMSNYAKILNDILTKKRRFGEFETINSTKECSAILTDKLPQKIWDLGSFTIPISIGGKNVSHAICDLGVSINLV
        Q+ Q   FLDVLKQLHINIPLV+A+EQM +Y K + DILTKKRR GEFET+  T+ECSAIL ++LP K+ D GSFTIP SIG + +  A+CDLG SINL+
Subjt:  QNDQLMHFLDVLKQLHINIPLVKAIEQMSNYAKILNDILTKKRRFGEFETINSTKECSAILTDKLPQKIWDLGSFTIPISIGGKNVSHAICDLGVSINLV

Query:  PLSVYQRLGIGEARPTIEIVDEVPVVAYPDVAVPSRNVVLLADDIDREIRAYAAPTFYNFNPVITEPEIEASKFELKPVMFQMLQTDEGLSKEVLRLKLF
        P+S++++LGIGE  PT   +                    LAD      R+YA P                                EG  ++VL     
Subjt:  PLSVYQRLGIGEARPTIEIVDEVPVVAYPDVAVPSRNVVLLADDIDREIRAYAAPTFYNFNPVITEPEIEASKFELKPVMFQMLQTDEGLSKEVLRLKLF

Query:  PYLLRHEARTWLESFLQNLLQVGMTWLKFLMKYFPPSKNAKYRSEINNFQQFAGESIETYYKGVDDATRLVIDASANGALLAKPYAEAFNILERISSNNH
                   ++ F+                 FP                               A  +V+D  A                        
Subjt:  PYLLRHEARTWLESFLQNLLQVGMTWLKFLMKYFPPSKNAKYRSEINNFQQFAGESIETYYKGVDDATRLVIDASANGALLAKPYAEAFNILERISSNNH

Query:  SWCDPRAVQGKSSKDKKIPIIHGRPFLATGRALI-----------LDEAI-------------MEELETQAMLEHLEAVDAESLAD---ASKEELEDTQS
                      DK++PII GRPFLATG+ LI            D+ +             +EE     +L+ L A + E        ++E+L D++ 
Subjt:  SWCDPRAVQGKSSKDKKIPIIHGRPFLATGRALI-----------LDEAI-------------MEELETQAMLEHLEAVDAESLAD---ASKEELEDTQS

Query:  DCMNDN-----------AGFVKRMYESLDLTNPELRLQKPSIEEPSLLELKALSQHLKYAYLGSSETLPIIIAADLPLEKEQMLLNVLK
        +  N++           A   +R +ESLDL+   LR  KPS+EEP +LEL+ L  HL+YAYLG S+TLP+IIA+ L   +E  LL VLK
Subjt:  DCMNDN-----------AGFVKRMYESLDLTNPELRLQKPSIEEPSLLELKALSQHLKYAYLGSSETLPIIIAADLPLEKEQMLLNVLK

XP_030502183.1 uncharacterized protein LOC115717351 [Cannabis sativa]2.4e-4334.73Show/hide
Query:  QNDQLMHFLDVLKQLHINIPLVKAIEQMSNYAKILNDILTKKRRFGEFETINSTKECSAILTDKLPQKIWDLGSFTIPISIGGKNVSHAICDLGVSINLV
        ++ Q   FLDVLKQLHINIPLV+A+EQMSNY K L DILTKKRR GEFET+  T+ CSA+L  K+P K+ D GSFTIP SIGG++V  A+CDLG SINL+
Subjt:  QNDQLMHFLDVLKQLHINIPLVKAIEQMSNYAKILNDILTKKRRFGEFETINSTKECSAILTDKLPQKIWDLGSFTIPISIGGKNVSHAICDLGVSINLV

Query:  PLSVYQRLGIGEARPTIEIVDEVPVVAYPDVAVPSRNVVLLADDIDREIRAYAAPTFYNFNPVITEPEIEASKFELKPVMFQMLQTDEGLSKEVLRLKLF
        P+S++++LGIGEARPT   +                    LAD      R+ A P        I +  ++  KF + P  F +L  D    ++V      
Subjt:  PLSVYQRLGIGEARPTIEIVDEVPVVAYPDVAVPSRNVVLLADDIDREIRAYAAPTFYNFNPVITEPEIEASKFELKPVMFQMLQTDEGLSKEVLRLKLF

Query:  PYLLRHEARTWLESFLQNLLQVGMTWLKFLMKYFPPSKN-AKYRSEINNFQQFAGESIETYYKGVDDATRL-VIDASANGALLAKPYAEAFNILERISSN
        P +L            ++ L  G T +    +      N  K    + N  +F  E        +++ +R+ VID+           AE F+        
Subjt:  PYLLRHEARTWLESFLQNLLQVGMTWLKFLMKYFPPSKN-AKYRSEINNFQQFAGESIETYYKGVDDATRL-VIDASANGALLAKPYAEAFNILERISSN

Query:  NHSWCDPRAVQGKSSKDKKIPIIHGRPFLATGRALILDEAIMEELETQAMLEHLEAVDAESLADASKEELEDTQSDCMNDNAGFVKRMYESLDLTNPELR
                    ++ KD+K        F+++           +ELE  +  E  +    E L    K                  K+ +ESL+L     +
Subjt:  NHSWCDPRAVQGKSSKDKKIPIIHGRPFLATGRALILDEAIMEELETQAMLEHLEAVDAESLADASKEELEDTQSDCMNDNAGFVKRMYESLDLTNPELR

Query:  LQKPSIEEPSLLELKALSQHLKYAYLGSSETLPIIIAADLPLEKEQMLLNVLKAH
          KPS +EP  LELK L  HLKYAYLG ++TLP+IIA++L +E E  LL VLK H
Subjt:  LQKPSIEEPSLLELKALSQHLKYAYLGSSETLPIIIAADLPLEKEQMLLNVLKAH

TrEMBL top hitse value%identityAlignment
A0A6J1CS22 uncharacterized protein LOC1110138054.9e-4234.52Show/hide
Query:  KIQNDQLMHFLDVLKQLHINIPLVKAIEQMSNYAKILNDILTKKRRFGEFETINSTKECSAILTDKLPQKIWDLGSFTIPISIGGKNVSHAICDLGVSIN
        K QN +   FLDVLKQLH+N+PLV+A+EQM NY + L +ILTKKR  GE+E +  TK CS ILT K+P K+ D GSFTIP+SIGG+ +   +CD+G SIN
Subjt:  KIQNDQLMHFLDVLKQLHINIPLVKAIEQMSNYAKILNDILTKKRRFGEFETINSTKECSAILTDKLPQKIWDLGSFTIPISIGGKNVSHAICDLGVSIN

Query:  LVPLSVYQRLGIGEARPTIEIVDEVPVVAYPDVAVPSRNVVLLADDIDREIRAYAAPTFYNFNPVITEPEIEASKFELKPVMFQMLQTDEGLSKEVLRLK
        ++PLS+Y +LGI EARPT   +                    LAD      R+   P        I +  ++  KF   P  F +L  D   +KEV  + 
Subjt:  LVPLSVYQRLGIGEARPTIEIVDEVPVVAYPDVAVPSRNVVLLADDIDREIRAYAAPTFYNFNPVITEPEIEASKFELKPVMFQMLQTDEGLSKEVLRLK

Query:  LFPYLLRHEARTWLESFLQNLLQVGMTWLKFLMKYFPPSKNAKYRSEINNFQQFAGESIETYYKGVDDATRLVIDASANGALLAKPYAEAFNILERISSN
          P+L    A          L+ V    L   +      ++ + +  ++N  +F+ ES                              E  ++L+     
Subjt:  LFPYLLRHEARTWLESFLQNLLQVGMTWLKFLMKYFPPSKNAKYRSEINNFQQFAGESIETYYKGVDDATRLVIDASANGALLAKPYAEAFNILERISSN

Query:  NHSWCDPRAVQGKSSKDKKIPIIHGRPFLATGRALILDEAIMEELETQAMLEHLEAVDAESLADASKEELEDTQSDCMNDNAGFVKRMYESLDL
                                           ILDEA+MEELE + MLE LEAV A+S+ +A +EELED +S+C+N N GFVK++YESLD+
Subjt:  NHSWCDPRAVQGKSSKDKKIPIIHGRPFLATGRALILDEAIMEELETQAMLEHLEAVDAESLADASKEELEDTQSDCMNDNAGFVKRMYESLDL

A0A6J1DAJ9 uncharacterized protein LOC1110188987.6e-4333.63Show/hide
Query:  KIQNDQLMHFLDVLKQLHINIPLVKAIEQMSNYAKILNDILTKKRRFGEFETINSTKECSAILTDKLPQKIWDLGSFTIPISIGGKNVSHAICDLGVSIN
        K Q  Q   FL+VLKQ+HINIPLV+A+EQMS Y K L D+LTKK   GE ET+  TKECS ILT K+ +K+ D  SFTIP+SIGG  +  A+CDLG SIN
Subjt:  KIQNDQLMHFLDVLKQLHINIPLVKAIEQMSNYAKILNDILTKKRRFGEFETINSTKECSAILTDKLPQKIWDLGSFTIPISIGGKNVSHAICDLGVSIN

Query:  LVPLSVYQRLGIGEARPTIEIVDEVPVVAYPDVAVPSRNVVLLADDIDREIRAYAAPTFYNFNPVITEPEIEASKFELKPVMFQMLQTDEGLSKEVLRLK
        L+PLSVY+RLG+G  RPT                     V L   D     R+ A P                 + ++K VM Q                
Subjt:  LVPLSVYQRLGIGEARPTIEIVDEVPVVAYPDVAVPSRNVVLLADDIDREIRAYAAPTFYNFNPVITEPEIEASKFELKPVMFQMLQTDEGLSKEVLRLK

Query:  LFPYLLRHEARTWLESFLQNLLQVGMTWLKFLMKYFPPSKNAKYRSEINNFQQFAGESIETYYKGVDDATRLVIDASANGALLAKPYAEAFNILERISSN
                     ++ F+                 FP                               A  +V+D  A                      
Subjt:  LFPYLLRHEARTWLESFLQNLLQVGMTWLKFLMKYFPPSKNAKYRSEINNFQQFAGESIETYYKGVDDATRLVIDASANGALLAKPYAEAFNILERISSN

Query:  NHSWCDPRAVQGKSSKDKKIPIIHGRPFLATGRALILDEAIMEELETQAMLEHLEAVDAESLADASKEELEDTQSDCMNDNAGFVKRMYESLDLTNPELR
                        D+ +PII GRPFLATGRAL+               + +  + +   +     +  D       D A    R++ESLDL +   +
Subjt:  NHSWCDPRAVQGKSSKDKKIPIIHGRPFLATGRALILDEAIMEELETQAMLEHLEAVDAESLADASKEELEDTQSDCMNDNAGFVKRMYESLDLTNPELR

Query:  LQKPSIEEPSLLELKALSQHLKYAYLGSSETLPIIIAADLPLEKEQMLLNVLKAH
          KPSIEEP  LELK L +HLKYAYLG+S TLPIII ADL  EKE  LL VL+ H
Subjt:  LQKPSIEEPSLLELKALSQHLKYAYLGSSETLPIIIAADLPLEKEQMLLNVLKAH

A0A6J1DAK9 uncharacterized protein LOC1110189102.1e-5660.99Show/hide
Query:  EIVDEVPVVAYPDVAVPSRNVVLLADDIDREIRAYAAPTFYNFNPVITEPEIEASKFELKPVMFQMLQTDEGLSKEVLRLKLFPYLLRHEARTWLESFLQ
        EIVD VPV    +V VPS NVVLLA  IDREIRAYAAPTFYNFNPVITE EI A KFELK        +DEG +KEVLRLKLF + LR EARTWL S   
Subjt:  EIVDEVPVVAYPDVAVPSRNVVLLADDIDREIRAYAAPTFYNFNPVITEPEIEASKFELKPVMFQMLQTDEGLSKEVLRLKLFPYLLRHEARTWLESFLQ

Query:  NLLQVGMTWLK-FLMKYFPPSKNAKYRSEINNFQQFAGES-------------------------IETYYKGVDDATRLVIDASANGALLAKPYAEAFNI
          +       + FLMKYFPPSKNAKYRS+INNFQQF GES                         IE YY G+DDATRLV   S N ALLAKPYAEAFNI
Subjt:  NLLQVGMTWLK-FLMKYFPPSKNAKYRSEINNFQQFAGES-------------------------IETYYKGVDDATRLVIDASANGALLAKPYAEAFNI

Query:  LERISSNNHSWCDPRAVQGKSSK
        LERISSN HS  D RA+QG+ +K
Subjt:  LERISSNNHSWCDPRAVQGKSSK

A0A6J1DV77 uncharacterized protein LOC1110238181.9e-4134.53Show/hide
Query:  KIQNDQLMHFLDVLKQLHINIPLVKAIEQMSNYAKILNDILTKKRRFGEFETINSTKECSAILTDKLPQKIWDLGSFTIPISIGGKNVSHAICDLGVSIN
        K Q+ Q   FL+VLKQLHINIPL++A+EQM NY K L DIL KKRR GEFE +  TKE SAILT KLPQK+ D GSFTIP+ IGGKNV HA+CDLG SIN
Subjt:  KIQNDQLMHFLDVLKQLHINIPLVKAIEQMSNYAKILNDILTKKRRFGEFETINSTKECSAILTDKLPQKIWDLGSFTIPISIGGKNVSHAICDLGVSIN

Query:  LVPLSVYQRLGIGEARPTIEIVDEVPVVAYPDVAVPSRNVVLLADDIDREIRAYAAPTFYNFNPVITEPEIEASKFELKPVMFQMLQTDEGLSKEVLRLK
        L+PLSVYQ+LGIGEARP                      + +     DR I                                                 
Subjt:  LVPLSVYQRLGIGEARPTIEIVDEVPVVAYPDVAVPSRNVVLLADDIDREIRAYAAPTFYNFNPVITEPEIEASKFELKPVMFQMLQTDEGLSKEVLRLK

Query:  LFPYLLRHEARTWLESFLQN-LLQVGMTWLKFLMKYFPPSKNAKYRSEINNFQQFAGESIETYYKGVDDATRLVIDASANGALLAKPYAEAFNILERISS
                   T+LE  +++ L+QV     KF+   FP                               A  +++D  A                     
Subjt:  LFPYLLRHEARTWLESFLQN-LLQVGMTWLKFLMKYFPPSKNAKYRSEINNFQQFAGESIETYYKGVDDATRLVIDASANGALLAKPYAEAFNILERISS

Query:  NNHSWCDPRAVQGKSSKDKKIPIIHGRPFLATGRALI------LDEAIMEELETQAMLEHLE-AVDAESLADASKEELEDTQSDCMNDNAGFVKRMYESL
                         DK+IPII GRPFL+TGRALI      L   + ++  T ++   ++  +D E   + S   + D   D M+D      +  E L
Subjt:  NNHSWCDPRAVQGKSSKDKKIPIIHGRPFLATGRALI------LDEAIMEELETQAMLEHLE-AVDAESLADASKEELEDTQSDCMNDNAGFVKRMYESL

Query:  DLTNPEL------RLQ---KPSIEEPSLLELKALSQHLKYAYLGSSETLPIIIAADLPLEKEQMLLNVLKAH
        +    EL      R+Q   +PS+ +   LELK L  HLKYAYLG  ETLP+ IAADL  EKE  L+ +L+ H
Subjt:  DLTNPEL------RLQ---KPSIEEPSLLELKALSQHLKYAYLGSSETLPIIIAADLPLEKEQMLLNVLKAH

A0A6J1DXH8 uncharacterized protein LOC1110254742.4e-5285.71Show/hide
Query:  MKIQNDQLMHFLDVLKQLHINIPLVKAIEQMSNYAKILNDILTKKRRFGEFETINSTKECSAILTDKLPQKIWDLGSFTIPISIGGKNVSHAICDLGVSI
        +K Q+DQL +FLD+LKQLHINIPLVKAIEQMSNYAKIL D LTKKRRFGEF+ I STKECSAILTDKLPQKIWD GSFTIPISIGGKNVSHAICDLGVSI
Subjt:  MKIQNDQLMHFLDVLKQLHINIPLVKAIEQMSNYAKILNDILTKKRRFGEFETINSTKECSAILTDKLPQKIWDLGSFTIPISIGGKNVSHAICDLGVSI

Query:  NLVPLSVYQRLGIGEARPTIEIVDEV
        NLVPLSVYQRLGIGEARPT+   +E+
Subjt:  NLVPLSVYQRLGIGEARPTIEIVDEV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAATCCAAAATGATCAACTCATGCATTTTTTGGATGTGTTGAAGCAACTCCACATCAATATACCCTTAGTTAAGGCTATTGAGCAGATGTCTAACTATGCAAAAAT
TTTGAACGATATCTTGACTAAGAAAAGGAGGTTTGGAGAGTTTGAAACGATAAATTCAACCAAGGAGTGCAGTGCAATTTTAACAGACAAGCTGCCACAGAAAATCTGGG
ATCTAGGGAGTTTCACTATTCCAATCTCTATTGGTGGAAAGAATGTGAGCCATGCTATATGCGATTTGGGTGTAAGCATAAACCTTGTGCCATTATCAGTATATCAGAGG
TTGGGTATTGGTGAAGCAAGACCTACCATAGAAATAGTAGATGAGGTTCCTGTCGTTGCTTACCCTGATGTAGCAGTGCCCTCTCGCAACGTTGTACTCTTAGCAGACGA
CATCGACAGGGAAATTAGAGCATATGCGGCTCCGACATTCTACAACTTCAACCCAGTTATCACGGAGCCAGAAATTGAAGCTTCTAAATTTGAGCTGAAACCAGTGATGT
TTCAGATGCTCCAGACAGATGAAGGATTGAGCAAAGAAGTGCTGAGGCTTAAGCTATTTCCGTATTTGCTTAGACATGAAGCCAGAACATGGTTGGAGTCATTCCTTCAG
AATCTATTACAAGTTGGGATGACTTGGCTGAAGTTTTTGATGAAGTATTTCCCACCCAGCAAAAACGCTAAGTATCGTAGTGAGATCAACAATTTTCAACAATTTGCTGG
GGAATCAATAGAGACATATTACAAAGGTGTGGATGATGCCACACGATTAGTGATTGATGCGTCTGCAAATGGGGCTTTGCTAGCAAAACCCTATGCTGAAGCATTCAATA
TTTTGGAAAGAATATCATCAAATAATCACTCATGGTGTGATCCCAGAGCTGTTCAAGGAAAATCAAGCAAGGATAAGAAGATTCCTATTATTCATGGAAGACCATTCCTT
GCAACGGGGAGAGCTTTGATATTGGATGAAGCAATAATGGAGGAGTTGGAAACACAAGCCATGCTGGAACATCTAGAAGCAGTTGACGCTGAAAGTCTTGCCGACGCATC
TAAAGAGGAACTAGAAGATACCCAGTCTGACTGCATGAATGACAATGCAGGCTTTGTGAAAAGAATGTATGAGTCTTTAGACCTCACAAACCCGGAGCTTAGATTACAGA
AGCCATCTATTGAAGAGCCGTCATTGCTAGAGCTTAAAGCATTGTCGCAACATCTAAAATATGCTTACCTGGGTTCATCAGAGACATTGCCAATTATCATAGCAGCAGAC
TTGCCTTTGGAAAAGGAACAGATGCTGTTGAACGTACTCAAGGCACATTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAAATCCAAAATGATCAACTCATGCATTTTTTGGATGTGTTGAAGCAACTCCACATCAATATACCCTTAGTTAAGGCTATTGAGCAGATGTCTAACTATGCAAAAAT
TTTGAACGATATCTTGACTAAGAAAAGGAGGTTTGGAGAGTTTGAAACGATAAATTCAACCAAGGAGTGCAGTGCAATTTTAACAGACAAGCTGCCACAGAAAATCTGGG
ATCTAGGGAGTTTCACTATTCCAATCTCTATTGGTGGAAAGAATGTGAGCCATGCTATATGCGATTTGGGTGTAAGCATAAACCTTGTGCCATTATCAGTATATCAGAGG
TTGGGTATTGGTGAAGCAAGACCTACCATAGAAATAGTAGATGAGGTTCCTGTCGTTGCTTACCCTGATGTAGCAGTGCCCTCTCGCAACGTTGTACTCTTAGCAGACGA
CATCGACAGGGAAATTAGAGCATATGCGGCTCCGACATTCTACAACTTCAACCCAGTTATCACGGAGCCAGAAATTGAAGCTTCTAAATTTGAGCTGAAACCAGTGATGT
TTCAGATGCTCCAGACAGATGAAGGATTGAGCAAAGAAGTGCTGAGGCTTAAGCTATTTCCGTATTTGCTTAGACATGAAGCCAGAACATGGTTGGAGTCATTCCTTCAG
AATCTATTACAAGTTGGGATGACTTGGCTGAAGTTTTTGATGAAGTATTTCCCACCCAGCAAAAACGCTAAGTATCGTAGTGAGATCAACAATTTTCAACAATTTGCTGG
GGAATCAATAGAGACATATTACAAAGGTGTGGATGATGCCACACGATTAGTGATTGATGCGTCTGCAAATGGGGCTTTGCTAGCAAAACCCTATGCTGAAGCATTCAATA
TTTTGGAAAGAATATCATCAAATAATCACTCATGGTGTGATCCCAGAGCTGTTCAAGGAAAATCAAGCAAGGATAAGAAGATTCCTATTATTCATGGAAGACCATTCCTT
GCAACGGGGAGAGCTTTGATATTGGATGAAGCAATAATGGAGGAGTTGGAAACACAAGCCATGCTGGAACATCTAGAAGCAGTTGACGCTGAAAGTCTTGCCGACGCATC
TAAAGAGGAACTAGAAGATACCCAGTCTGACTGCATGAATGACAATGCAGGCTTTGTGAAAAGAATGTATGAGTCTTTAGACCTCACAAACCCGGAGCTTAGATTACAGA
AGCCATCTATTGAAGAGCCGTCATTGCTAGAGCTTAAAGCATTGTCGCAACATCTAAAATATGCTTACCTGGGTTCATCAGAGACATTGCCAATTATCATAGCAGCAGAC
TTGCCTTTGGAAAAGGAACAGATGCTGTTGAACGTACTCAAGGCACATTAA
Protein sequenceShow/hide protein sequence
MKIQNDQLMHFLDVLKQLHINIPLVKAIEQMSNYAKILNDILTKKRRFGEFETINSTKECSAILTDKLPQKIWDLGSFTIPISIGGKNVSHAICDLGVSINLVPLSVYQR
LGIGEARPTIEIVDEVPVVAYPDVAVPSRNVVLLADDIDREIRAYAAPTFYNFNPVITEPEIEASKFELKPVMFQMLQTDEGLSKEVLRLKLFPYLLRHEARTWLESFLQ
NLLQVGMTWLKFLMKYFPPSKNAKYRSEINNFQQFAGESIETYYKGVDDATRLVIDASANGALLAKPYAEAFNILERISSNNHSWCDPRAVQGKSSKDKKIPIIHGRPFL
ATGRALILDEAIMEELETQAMLEHLEAVDAESLADASKEELEDTQSDCMNDNAGFVKRMYESLDLTNPELRLQKPSIEEPSLLELKALSQHLKYAYLGSSETLPIIIAAD
LPLEKEQMLLNVLKAH