; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g14500 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g14500
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr9:12520068..12533276
RNA-Seq ExpressionMoc09g14500
SyntenyMoc09g14500
Gene Ontology termsGO:0110165 - cellular anatomical structure (cellular component)
GO:0005488 - binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAB5519281.1 hypothetical protein DKX38_023600 [Salix brachista]2.4e-4835.64Show/hide
Query:  EPPTLE--QKPLLSHLKYDDARLVLHIKNSIEGDIIGLVNECEYVKELLKYLEFIYSGKGNVGRIFEICKRFYQLEFGDQSLTNYFMEHKRIYANFNALL
        EPP  E  +KP +     DDARL L I+NSI+ +I+GL+N CE+VKEL+ YLEF+YSGKGN+ R++++CK FY+ E   +SLT YFM+ K+ Y   N LL
Subjt:  EPPTLE--QKPLLSHLKYDDARLVLHIKNSIEGDIIGLVNECEYVKELLKYLEFIYSGKGNVGRIFEICKRFYQLEFGDQSLTNYFMEHKRIYANFNALL

Query:  PDSNDAKVRLAQREQLAIISFLLGLPPRFDVAKDQLL-----------FGDI-----------------------------------------------I
        P S D KV+  QRE++A++SFL GLP   +  K Q+L           F  I                                               I
Subjt:  PDSNDAKVRLAQREQLAIISFLLGLPPRFDVAKDQLL-----------FGDI-----------------------------------------------I

Query:  FCNYCHKRGHTKQECRKLLNKGQKTQSAHVASTPD----NAGKLVTIPAEAFAKFQQYQESLTTSSSNPITVIAESDLKMKKTIGKGRESNGLYTQRESS
         C YCH+ GHTK+ C+KL N  Q+ Q A+V +T      ++ K + + A+ FAKF  YQ+SL  S+ +      E       T     E           
Subjt:  FCNYCHKRGHTKQECRKLLNKGQKTQSAHVASTPD----NAGKLVTIPAEAFAKFQQYQESLTTSSSNPITVIAESDLKMKKTIGKGRESNGLYTQRESS

Query:  QEDDDFLVYCI----VFTSTEKLPSNTSSYVPDPSLPTITQVYSRRQLPTDSCPIPTASSFEDLGISDDLSIALRK
         ED+++L+Y +      T +   P +     P  + P I QVYSRRQ  TD+CP PT     D     DL I LRK
Subjt:  QEDDDFLVYCI----VFTSTEKLPSNTSSYVPDPSLPTITQVYSRRQLPTDSCPIPTASSFEDLGISDDLSIALRK

KAB5533943.1 hypothetical protein DKX38_017029 [Salix brachista]8.4e-4935.53Show/hide
Query:  EPPTLE--QKPLLSHLKYDDARLVLHIKNSIEGDIIGLVNECEYVKELLKYLEFIYSGKGNVGRIFEICKRFYQLEFGDQSLTNYFMEHKRIYANFNALL
        EPP  E  +KP +     DDARL L I+NSI+ +I+GL+N CE+VKEL+ YLEF+YSGKGN+ R++++CK FY+ E   +SLT YFM+ K+ Y   N LL
Subjt:  EPPTLE--QKPLLSHLKYDDARLVLHIKNSIEGDIIGLVNECEYVKELLKYLEFIYSGKGNVGRIFEICKRFYQLEFGDQSLTNYFMEHKRIYANFNALL

Query:  PDSNDAKVRLAQREQLAIISFLLGLPPRFDVAKDQLL-----------FGDI-----------------------------------------------I
        P S D KV+  QRE++A++SFL GLP   +  K Q+L           F  I                                               I
Subjt:  PDSNDAKVRLAQREQLAIISFLLGLPPRFDVAKDQLL-----------FGDI-----------------------------------------------I

Query:  FCNYCHKRGHTKQECRKLLNKGQKTQSAHVASTPD----NAGKLVTIPAEAFAKFQQYQESLTTSSSNPITVIAESDLKMKKTIGKGRESNGLYTQRESS
         C YCH+ GHTK+ C+KL N+ Q+ Q A+V +T      ++ K + + A+ FAKF  YQ+SL  S+ +      E       T     E           
Subjt:  FCNYCHKRGHTKQECRKLLNKGQKTQSAHVASTPD----NAGKLVTIPAEAFAKFQQYQESLTTSSSNPITVIAESDLKMKKTIGKGRESNGLYTQRESS

Query:  QEDDDFLVYCI----VFTSTEKLPSNTSSYVPDPSLPTITQVYSRRQLPTDSCPIPTASSFEDLGISDDLSIALRKGLLQ
         ED+++L+Y +      T +   P +     P  + P I QVYSRRQ  TD+CP  +A    D     DL I LRKG  Q
Subjt:  QEDDDFLVYCI----VFTSTEKLPSNTSSYVPDPSLPTITQVYSRRQLPTDSCPIPTASSFEDLGISDDLSIALRKGLLQ

KAB5561110.1 hypothetical protein DKX38_006067 [Salix brachista]1.4e-4835.7Show/hide
Query:  EPPTLE--QKPLLSHLKYDDARLVLHIKNSIEGDIIGLVNECEYVKELLKYLEFIYSGKGNVGRIFEICKRFYQLEFGDQSLTNYFMEHKRIYANFNALL
        EPP  E  +KP +     DDARL L I+NSI  +I+GL+N CE+VKEL+ YLEF+YSGKGN+ R++++CK FY+ E   +SLT+YFM+ K+ Y   N LL
Subjt:  EPPTLE--QKPLLSHLKYDDARLVLHIKNSIEGDIIGLVNECEYVKELLKYLEFIYSGKGNVGRIFEICKRFYQLEFGDQSLTNYFMEHKRIYANFNALL

Query:  PDSNDAKVRLAQREQLAIISFLLGLPPRFDVAKDQLL-----------FGDI-----------------------------------------------I
        P S D KV+  QRE++A++SFL  LP   +  K Q+L           F  I                                               I
Subjt:  PDSNDAKVRLAQREQLAIISFLLGLPPRFDVAKDQLL-----------FGDI-----------------------------------------------I

Query:  FCNYCHKRGHTKQECRKLLNKGQKTQSAHVASTPD----NAGKLVTIPAEAFAKFQQYQESLTTSSSNPITVIAESDLKMKKTIGKGRESNGLYTQRESS
         C YCH+ GHTK+ C+KL N+ Q+ Q A+V +T      ++ K + + A+ FAKF  YQ+SL  S+ +      E       T     E           
Subjt:  FCNYCHKRGHTKQECRKLLNKGQKTQSAHVASTPD----NAGKLVTIPAEAFAKFQQYQESLTTSSSNPITVIAESDLKMKKTIGKGRESNGLYTQRESS

Query:  QEDDDFLVYCI----VFTSTEKLPSNTSSYVPDPSLPTITQVYSRRQLPTDSCPIPTASSFEDLGISDDLSIALRKGLLQG
         ED+++L+Y +      T +   P +     P  + P I QVYSRRQ  TD+CP PT     D     DL I LRKGL++G
Subjt:  QEDDDFLVYCI----VFTSTEKLPSNTSSYVPDPSLPTITQVYSRRQLPTDSCPIPTASSFEDLGISDDLSIALRKGLLQG

KAF9681460.1 hypothetical protein SADUNF_Sadunf05G0003800 [Salix dunnii]7.1e-4833.03Show/hide
Query:  EPPTLE--QKPLLSHLKYDDARLVLHIKNSIEGDIIGLVNECEYVKELLKYLEFIYSGKGNVGRIFEICKRFYQLEFGDQSLTNYFMEHKRIYANFNALL
        EPP  E  +KP +     DDARL L I+NSI+ +I+GL+N CE+VKEL+ YLEF+YSGKGN+ R++++CK FY+ +   +SLT YFM+ K+ Y   N LL
Subjt:  EPPTLE--QKPLLSHLKYDDARLVLHIKNSIEGDIIGLVNECEYVKELLKYLEFIYSGKGNVGRIFEICKRFYQLEFGDQSLTNYFMEHKRIYANFNALL

Query:  PDSNDAKVRLAQREQLAIISFLLGLPPRFDVAKDQLL-----------FGDI-----------------------------------------------I
        P S D KV+  QRE++A++SFL GLP   + AK Q+L           F  I                                               I
Subjt:  PDSNDAKVRLAQREQLAIISFLLGLPPRFDVAKDQLL-----------FGDI-----------------------------------------------I

Query:  FCNYCHKRGHTKQECRKLLNKGQKTQSAHV----ASTPDNAGKLVTIPAEAFAKFQQYQESLTTS-------------------------------SSNP
         C YCH+ GHTK+ C+KL N+ Q+ Q A+V    ++T  ++ K + + A+ FAKF QYQESL  S                               + NP
Subjt:  FCNYCHKRGHTKQECRKLLNKGQKTQSAHV----ASTPDNAGKLVTIPAEAFAKFQQYQESLTTS-------------------------------SSNP

Query:  IT-----------VIAESDLKMKKTIGKGRESNGLY----------------TQRESSQEDDDFLVYCIVFTST----EKLPSNTSSYVPDPSLPTITQV
         T            +   D    K +G G      Y                   +S  ED+++L+Y +  ++T     + P +     P P+ P I QV
Subjt:  IT-----------VIAESDLKMKKTIGKGRESNGLY----------------TQRESSQEDDDFLVYCIVFTST----EKLPSNTSSYVPDPSLPTITQV

Query:  YSRRQLPTDSCPIPTASSFEDLGISDDLSIALRKGLLQG
        YSRRQ  TD+CP  T     D     DL I LRKGL++G
Subjt:  YSRRQLPTDSCPIPTASSFEDLGISDDLSIALRKGLLQG

RVW66431.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]1.7e-4938.29Show/hide
Query:  DDARLVLHIKNSIEGDIIGLVNECEYVKELLKYLEFIYSGKGNVGRIFEICKRFYQLEFGDQSLTNYFMEHKRIYANFNALLPDSNDAKVRLAQREQLAI
        DDARL L +KNSI  DI+GL++ CE+VKEL+ YL+F+YSGKGNV R++++   F+  E G +SLT YFM+ K++Y   NAL+P S D +V+ AQREQ+A+
Subjt:  DDARLVLHIKNSIEGDIIGLVNECEYVKELLKYLEFIYSGKGNVGRIFEICKRFYQLEFGDQSLTNYFMEHKRIYANFNALLPDSNDAKVRLAQREQLAI

Query:  ISFLLGLPPRFDVAKDQLLFGD-------------------------IIFCNYCHKRGHTKQECRKLLNKGQKTQSAHVAST-----PDNAGKLVTIPAE
        +SFL GLP  F+ AK Q+L G+                          I C YCH+ GHTK+ CRKL N+ ++ Q+A+VA++      D++ K+VT+ AE
Subjt:  ISFLLGLPPRFDVAKDQLLFGD-------------------------IIFCNYCHKRGHTKQECRKLLNKGQKTQSAHVAST-----PDNAGKLVTIPAE

Query:  AFAKFQQYQESLTTSSSNPITVIAE----------------------------SDLKMKKTIGKGRESNGLYTQRESSQEDDDFLVYCIVFTSTEKLPSN
         F+K+ QYQ++L   +S P++ +AE                            SDL  K+T GKG  S+GLY         D+++   +   ST   P  
Subjt:  AFAKFQQYQESLTTSSSNPITVIAE----------------------------SDLKMKKTIGKGRESNGLYTQRESSQEDDDFLVYCIVFTSTEKLPSN

Query:  TSSYVPDPSLPTITQV
            +  PSLP + ++
Subjt:  TSSYVPDPSLPTITQV

TrEMBL top hitse value%identityAlignment
A0A438G2L4 Retrovirus-related Pol polyprotein from transposon TNT 1-948.2e-5038.29Show/hide
Query:  DDARLVLHIKNSIEGDIIGLVNECEYVKELLKYLEFIYSGKGNVGRIFEICKRFYQLEFGDQSLTNYFMEHKRIYANFNALLPDSNDAKVRLAQREQLAI
        DDARL L +KNSI  DI+GL++ CE+VKEL+ YL+F+YSGKGNV R++++   F+  E G +SLT YFM+ K++Y   NAL+P S D +V+ AQREQ+A+
Subjt:  DDARLVLHIKNSIEGDIIGLVNECEYVKELLKYLEFIYSGKGNVGRIFEICKRFYQLEFGDQSLTNYFMEHKRIYANFNALLPDSNDAKVRLAQREQLAI

Query:  ISFLLGLPPRFDVAKDQLLFGD-------------------------IIFCNYCHKRGHTKQECRKLLNKGQKTQSAHVAST-----PDNAGKLVTIPAE
        +SFL GLP  F+ AK Q+L G+                          I C YCH+ GHTK+ CRKL N+ ++ Q+A+VA++      D++ K+VT+ AE
Subjt:  ISFLLGLPPRFDVAKDQLLFGD-------------------------IIFCNYCHKRGHTKQECRKLLNKGQKTQSAHVAST-----PDNAGKLVTIPAE

Query:  AFAKFQQYQESLTTSSSNPITVIAE----------------------------SDLKMKKTIGKGRESNGLYTQRESSQEDDDFLVYCIVFTSTEKLPSN
         F+K+ QYQ++L   +S P++ +AE                            SDL  K+T GKG  S+GLY         D+++   +   ST   P  
Subjt:  AFAKFQQYQESLTTSSSNPITVIAE----------------------------SDLKMKKTIGKGRESNGLYTQRESSQEDDDFLVYCIVFTSTEKLPSN

Query:  TSSYVPDPSLPTITQV
            +  PSLP + ++
Subjt:  TSSYVPDPSLPTITQV

A0A5N5JJ99 Uncharacterized protein1.2e-4835.64Show/hide
Query:  EPPTLE--QKPLLSHLKYDDARLVLHIKNSIEGDIIGLVNECEYVKELLKYLEFIYSGKGNVGRIFEICKRFYQLEFGDQSLTNYFMEHKRIYANFNALL
        EPP  E  +KP +     DDARL L I+NSI+ +I+GL+N CE+VKEL+ YLEF+YSGKGN+ R++++CK FY+ E   +SLT YFM+ K+ Y   N LL
Subjt:  EPPTLE--QKPLLSHLKYDDARLVLHIKNSIEGDIIGLVNECEYVKELLKYLEFIYSGKGNVGRIFEICKRFYQLEFGDQSLTNYFMEHKRIYANFNALL

Query:  PDSNDAKVRLAQREQLAIISFLLGLPPRFDVAKDQLL-----------FGDI-----------------------------------------------I
        P S D KV+  QRE++A++SFL GLP   +  K Q+L           F  I                                               I
Subjt:  PDSNDAKVRLAQREQLAIISFLLGLPPRFDVAKDQLL-----------FGDI-----------------------------------------------I

Query:  FCNYCHKRGHTKQECRKLLNKGQKTQSAHVASTPD----NAGKLVTIPAEAFAKFQQYQESLTTSSSNPITVIAESDLKMKKTIGKGRESNGLYTQRESS
         C YCH+ GHTK+ C+KL N  Q+ Q A+V +T      ++ K + + A+ FAKF  YQ+SL  S+ +      E       T     E           
Subjt:  FCNYCHKRGHTKQECRKLLNKGQKTQSAHVASTPD----NAGKLVTIPAEAFAKFQQYQESLTTSSSNPITVIAESDLKMKKTIGKGRESNGLYTQRESS

Query:  QEDDDFLVYCI----VFTSTEKLPSNTSSYVPDPSLPTITQVYSRRQLPTDSCPIPTASSFEDLGISDDLSIALRK
         ED+++L+Y +      T +   P +     P  + P I QVYSRRQ  TD+CP PT     D     DL I LRK
Subjt:  QEDDDFLVYCI----VFTSTEKLPSNTSSYVPDPSLPTITQVYSRRQLPTDSCPIPTASSFEDLGISDDLSIALRK

A0A5N5KU30 Uncharacterized protein4.1e-4935.53Show/hide
Query:  EPPTLE--QKPLLSHLKYDDARLVLHIKNSIEGDIIGLVNECEYVKELLKYLEFIYSGKGNVGRIFEICKRFYQLEFGDQSLTNYFMEHKRIYANFNALL
        EPP  E  +KP +     DDARL L I+NSI+ +I+GL+N CE+VKEL+ YLEF+YSGKGN+ R++++CK FY+ E   +SLT YFM+ K+ Y   N LL
Subjt:  EPPTLE--QKPLLSHLKYDDARLVLHIKNSIEGDIIGLVNECEYVKELLKYLEFIYSGKGNVGRIFEICKRFYQLEFGDQSLTNYFMEHKRIYANFNALL

Query:  PDSNDAKVRLAQREQLAIISFLLGLPPRFDVAKDQLL-----------FGDI-----------------------------------------------I
        P S D KV+  QRE++A++SFL GLP   +  K Q+L           F  I                                               I
Subjt:  PDSNDAKVRLAQREQLAIISFLLGLPPRFDVAKDQLL-----------FGDI-----------------------------------------------I

Query:  FCNYCHKRGHTKQECRKLLNKGQKTQSAHVASTPD----NAGKLVTIPAEAFAKFQQYQESLTTSSSNPITVIAESDLKMKKTIGKGRESNGLYTQRESS
         C YCH+ GHTK+ C+KL N+ Q+ Q A+V +T      ++ K + + A+ FAKF  YQ+SL  S+ +      E       T     E           
Subjt:  FCNYCHKRGHTKQECRKLLNKGQKTQSAHVASTPD----NAGKLVTIPAEAFAKFQQYQESLTTSSSNPITVIAESDLKMKKTIGKGRESNGLYTQRESS

Query:  QEDDDFLVYCI----VFTSTEKLPSNTSSYVPDPSLPTITQVYSRRQLPTDSCPIPTASSFEDLGISDDLSIALRKGLLQ
         ED+++L+Y +      T +   P +     P  + P I QVYSRRQ  TD+CP  +A    D     DL I LRKG  Q
Subjt:  QEDDDFLVYCI----VFTSTEKLPSNTSSYVPDPSLPTITQVYSRRQLPTDSCPIPTASSFEDLGISDDLSIALRKGLLQ

A0A5N5N0U4 Uncharacterized protein6.9e-4935.7Show/hide
Query:  EPPTLE--QKPLLSHLKYDDARLVLHIKNSIEGDIIGLVNECEYVKELLKYLEFIYSGKGNVGRIFEICKRFYQLEFGDQSLTNYFMEHKRIYANFNALL
        EPP  E  +KP +     DDARL L I+NSI  +I+GL+N CE+VKEL+ YLEF+YSGKGN+ R++++CK FY+ E   +SLT+YFM+ K+ Y   N LL
Subjt:  EPPTLE--QKPLLSHLKYDDARLVLHIKNSIEGDIIGLVNECEYVKELLKYLEFIYSGKGNVGRIFEICKRFYQLEFGDQSLTNYFMEHKRIYANFNALL

Query:  PDSNDAKVRLAQREQLAIISFLLGLPPRFDVAKDQLL-----------FGDI-----------------------------------------------I
        P S D KV+  QRE++A++SFL  LP   +  K Q+L           F  I                                               I
Subjt:  PDSNDAKVRLAQREQLAIISFLLGLPPRFDVAKDQLL-----------FGDI-----------------------------------------------I

Query:  FCNYCHKRGHTKQECRKLLNKGQKTQSAHVASTPD----NAGKLVTIPAEAFAKFQQYQESLTTSSSNPITVIAESDLKMKKTIGKGRESNGLYTQRESS
         C YCH+ GHTK+ C+KL N+ Q+ Q A+V +T      ++ K + + A+ FAKF  YQ+SL  S+ +      E       T     E           
Subjt:  FCNYCHKRGHTKQECRKLLNKGQKTQSAHVASTPD----NAGKLVTIPAEAFAKFQQYQESLTTSSSNPITVIAESDLKMKKTIGKGRESNGLYTQRESS

Query:  QEDDDFLVYCI----VFTSTEKLPSNTSSYVPDPSLPTITQVYSRRQLPTDSCPIPTASSFEDLGISDDLSIALRKGLLQG
         ED+++L+Y +      T +   P +     P  + P I QVYSRRQ  TD+CP PT     D     DL I LRKGL++G
Subjt:  QEDDDFLVYCI----VFTSTEKLPSNTSSYVPDPSLPTITQVYSRRQLPTDSCPIPTASSFEDLGISDDLSIALRKGLLQG

A5B136 Uncharacterized protein3.8e-4740.93Show/hide
Query:  SIVEPPTLEQKPLLSHLK----YDDARLVLHIKNSIEGDIIGLVNECEYVKELLKYLEFIYSGKGNVGRIFEICKRFYQLEFGDQSLTNYFMEHKRIYAN
        S+ +   L ++P   H +     DDARL L +KNSI  DI+GL++ CE+VKEL+ YL+F+YSGKGNV R++++   F+  E G +SLT YFM+ K++Y  
Subjt:  SIVEPPTLEQKPLLSHLK----YDDARLVLHIKNSIEGDIIGLVNECEYVKELLKYLEFIYSGKGNVGRIFEICKRFYQLEFGDQSLTNYFMEHKRIYAN

Query:  FNALLPDSNDAKVRLAQREQLAIISFLLGLPPRFDVAKDQLLFGD-----------------------------------IIFCNYCHKRGHTKQECRKL
         NAL+P S D +V+ AQREQ+A++SFL GLP  F+ AK Q+L G                                     I C YCH+ GHTK+ CRKL
Subjt:  FNALLPDSNDAKVRLAQREQLAIISFLLGLPPRFDVAKDQLLFGD-----------------------------------IIFCNYCHKRGHTKQECRKL

Query:  LNKGQKTQSAHVAST-----PDNAGKLVTIPAEAFAKFQQYQESLTTSSSNPITVIAES
         N+ ++ Q+A+VA++      D++ K+VT+ AE F+K+ QYQ++L   +S P++ +AES
Subjt:  LNKGQKTQSAHVAST-----PDNAGKLVTIPAEAFAKFQQYQESLTTSSSNPITVIAES

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACAACCCGGTTATGGTCCAACCAATAACCGGAAGTCCCTCTCGGCCAATGAGAGGGTGGGGCCCCTTGTTCAAGTCCCGGAGTCAGTATTTAAGGGAACAC
TCATCTACTCCCTTAAAGTCGGGGAGGAGTGAATTCCATCTTGTGAAGTTATGTTCCCAGCTCCCCACTCGGTCTCGTCCCCAAAATGGTAGGAATGTTGAGTCG
GCGACTCCGGCCACTCTCACCCATACAGATCAAAGAACGAGTCCTCACGGGCAGGAGTCTATAACTCACTCAGGATTGAGAATGAGTTGTTTGGTCATCTTAAGA
AATGGCAATCTATTAGTTAATGGTGTTACATCTAAAGATTGCCTATTTTGCGGCCGAAGTGTTGGCACTGGACATAGCAACATTGATGCAGAAGGAGATGGTGAC
GATGAATCAAATGCTAAAAGAATGGCTCTAGAAAGGAGGAATACACTAGCCAAACCTGTGCAGTCCATGCAGCCGGAGTTTATTGCCTCTTCTCCCATTTGCCAA
ATTAATGAACTCGTTTGGGAGTGGGAAGTTGGTAATGACGTGGGGAGTTACCCCACGTCATTCTTCTTTTTCATTGTGAAATGGGAAGCAGAACGGTTCTGCAAC
TCACACATAACAAAAGAAGAAAAAAAACTTCCAATCTCTCAAGGAGCTCTCTATCTGAATACTCTCTCTCATTCCAAACGAGGTGCTCTCACAAGCACGGTCTCG
AGACCTAAGAGGATAGCGAGGAAGACACAGTGGAAGAAAAAGAAGATGGTAGCTATGACCAAATATTGTAGTGAAGCCGTAGGTTATCCTGTACCCATTAAATGT
AGAGATCTCGACAGTTTTATCATCCCTTATTCAATTGGAGGAAAGAACTTAGGTAGAGCTTTATGCAACTTAGTTGCTAGCATTAGCCTTATGCCTTTATCTATT
CTCAAGGAGTTAGGCATAAGAGAAGCTAGACCCCATGACAGAATGAAAGAAGAGTTAGAGGCTGTGGAAGGTGATTGCCCACCTAGAAGTAGCAGAAGAGTTGAA
TCCACCGATGGACAAGATGGAGGCAAAGGAGATGATGACAGAAGAATGCAAGACATTGCAACATCCATTGTGGAACCACCCACTTTGGAACAAAAGCCTTTACTG
TCTCATTTAAAATATGATGATGCTCGACTAGTTCTACATATCAAGAATTCGATCGAGGGTGACATTATTGGCTTGGTTAATGAATGTGAGTATGTTAAAGAGTTG
CTTAAATATTTAGAATTCATTTATTCTGGAAAAGGAAATGTTGGTCGAATATTTGAAATTTGCAAGCGCTTCTACCAACTTGAGTTTGGTGACCAATCGCTTACA
AATTACTTTATGGAACACAAACGAATTTATGCAAACTTTAATGCATTACTCCCAGATAGTAATGATGCAAAAGTTCGGCTTGCCCAACGCGAACAACTAGCAATT
ATCAGTTTTCTTCTTGGTCTTCCACCTAGATTTGATGTGGCCAAAGACCAATTACTCTTTGGAGATATTATTTTTTGCAATTATTGTCATAAGCGTGGTCATACA
AAACAAGAATGTAGAAAGCTATTGAATAAAGGTCAGAAAACACAGTCTGCCCATGTTGCATCTACTCCTGATAATGCTGGCAAGTTAGTTACAATTCCTGCGGAA
GCATTTGCTAAGTTCCAACAGTATCAAGAATCATTGACGACATCGTCCTCTAATCCGATTACTGTCATCGCTGAGTCAGATCTTAAGATGAAGAAGACTATTGGT
AAAGGGCGTGAATCCAATGGCCTCTACACTCAGAGGGAGAGTTCACAAGAAGATGATGACTTTCTTGTCTATTGCATTGTCTTTACTTCTACTGAAAAGCTTCCT
AGCAATACATCTTCATATGTGCCTGATCCTTCTCTTCCCACCATTACTCAAGTTTATTCTCGTCGGCAACTTCCTACGGACTCATGCCCTATACCAACAGCTTCT
TCGTTCGAGGATCTAGGAATAAGTGATGACCTTTCTATTGCTCTTAGAAAAGGTCTGCTTCAAGGCGATGAGACAGAGCATATTGTACATGTCATTAGGGGAGAT
TTAGAGGAAGAGTCCTATAAGGCTCCCATGGAATCTAGTAGCCATGCATGGGAAAACGAGTTCTCGTGTTCTAACGATGATGAGATTGAATGGGGTGGGGAGCAT
TCAATAGAGTCTCCTAGTATGGAGGAAGATGATAGTCCAAATGAAAATGAGGGGATGTACAATGGGGCCGATGGATTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGAACAACCCGGTTATGGTCCAACCAATAACCGGAAGTCCCTCTCGGCCAATGAGAGGGTGGGGCCCCTTGTTCAAGTCCCGGAGTCAGTATTTAAGGGAACAC
TCATCTACTCCCTTAAAGTCGGGGAGGAGTGAATTCCATCTTGTGAAGTTATGTTCCCAGCTCCCCACTCGGTCTCGTCCCCAAAATGGTAGGAATGTTGAGTCG
GCGACTCCGGCCACTCTCACCCATACAGATCAAAGAACGAGTCCTCACGGGCAGGAGTCTATAACTCACTCAGGATTGAGAATGAGTTGTTTGGTCATCTTAAGA
AATGGCAATCTATTAGTTAATGGTGTTACATCTAAAGATTGCCTATTTTGCGGCCGAAGTGTTGGCACTGGACATAGCAACATTGATGCAGAAGGAGATGGTGAC
GATGAATCAAATGCTAAAAGAATGGCTCTAGAAAGGAGGAATACACTAGCCAAACCTGTGCAGTCCATGCAGCCGGAGTTTATTGCCTCTTCTCCCATTTGCCAA
ATTAATGAACTCGTTTGGGAGTGGGAAGTTGGTAATGACGTGGGGAGTTACCCCACGTCATTCTTCTTTTTCATTGTGAAATGGGAAGCAGAACGGTTCTGCAAC
TCACACATAACAAAAGAAGAAAAAAAACTTCCAATCTCTCAAGGAGCTCTCTATCTGAATACTCTCTCTCATTCCAAACGAGGTGCTCTCACAAGCACGGTCTCG
AGACCTAAGAGGATAGCGAGGAAGACACAGTGGAAGAAAAAGAAGATGGTAGCTATGACCAAATATTGTAGTGAAGCCGTAGGTTATCCTGTACCCATTAAATGT
AGAGATCTCGACAGTTTTATCATCCCTTATTCAATTGGAGGAAAGAACTTAGGTAGAGCTTTATGCAACTTAGTTGCTAGCATTAGCCTTATGCCTTTATCTATT
CTCAAGGAGTTAGGCATAAGAGAAGCTAGACCCCATGACAGAATGAAAGAAGAGTTAGAGGCTGTGGAAGGTGATTGCCCACCTAGAAGTAGCAGAAGAGTTGAA
TCCACCGATGGACAAGATGGAGGCAAAGGAGATGATGACAGAAGAATGCAAGACATTGCAACATCCATTGTGGAACCACCCACTTTGGAACAAAAGCCTTTACTG
TCTCATTTAAAATATGATGATGCTCGACTAGTTCTACATATCAAGAATTCGATCGAGGGTGACATTATTGGCTTGGTTAATGAATGTGAGTATGTTAAAGAGTTG
CTTAAATATTTAGAATTCATTTATTCTGGAAAAGGAAATGTTGGTCGAATATTTGAAATTTGCAAGCGCTTCTACCAACTTGAGTTTGGTGACCAATCGCTTACA
AATTACTTTATGGAACACAAACGAATTTATGCAAACTTTAATGCATTACTCCCAGATAGTAATGATGCAAAAGTTCGGCTTGCCCAACGCGAACAACTAGCAATT
ATCAGTTTTCTTCTTGGTCTTCCACCTAGATTTGATGTGGCCAAAGACCAATTACTCTTTGGAGATATTATTTTTTGCAATTATTGTCATAAGCGTGGTCATACA
AAACAAGAATGTAGAAAGCTATTGAATAAAGGTCAGAAAACACAGTCTGCCCATGTTGCATCTACTCCTGATAATGCTGGCAAGTTAGTTACAATTCCTGCGGAA
GCATTTGCTAAGTTCCAACAGTATCAAGAATCATTGACGACATCGTCCTCTAATCCGATTACTGTCATCGCTGAGTCAGATCTTAAGATGAAGAAGACTATTGGT
AAAGGGCGTGAATCCAATGGCCTCTACACTCAGAGGGAGAGTTCACAAGAAGATGATGACTTTCTTGTCTATTGCATTGTCTTTACTTCTACTGAAAAGCTTCCT
AGCAATACATCTTCATATGTGCCTGATCCTTCTCTTCCCACCATTACTCAAGTTTATTCTCGTCGGCAACTTCCTACGGACTCATGCCCTATACCAACAGCTTCT
TCGTTCGAGGATCTAGGAATAAGTGATGACCTTTCTATTGCTCTTAGAAAAGGTCTGCTTCAAGGCGATGAGACAGAGCATATTGTACATGTCATTAGGGGAGAT
TTAGAGGAAGAGTCCTATAAGGCTCCCATGGAATCTAGTAGCCATGCATGGGAAAACGAGTTCTCGTGTTCTAACGATGATGAGATTGAATGGGGTGGGGAGCAT
TCAATAGAGTCTCCTAGTATGGAGGAAGATGATAGTCCAAATGAAAATGAGGGGATGTACAATGGGGCCGATGGATTCTAG
Protein sequenceShow/hide protein sequence
MNNPVMVQPITGSPSRPMRGWGPLFKSRSQYLREHSSTPLKSGRSEFHLVKLCSQLPTRSRPQNGRNVESATPATLTHTDQRTSPHGQESITHSGLRMSCLVILR
NGNLLVNGVTSKDCLFCGRSVGTGHSNIDAEGDGDDESNAKRMALERRNTLAKPVQSMQPEFIASSPICQINELVWEWEVGNDVGSYPTSFFFFIVKWEAERFCN
SHITKEEKKLPISQGALYLNTLSHSKRGALTSTVSRPKRIARKTQWKKKKMVAMTKYCSEAVGYPVPIKCRDLDSFIIPYSIGGKNLGRALCNLVASISLMPLSI
LKELGIREARPHDRMKEELEAVEGDCPPRSSRRVESTDGQDGGKGDDDRRMQDIATSIVEPPTLEQKPLLSHLKYDDARLVLHIKNSIEGDIIGLVNECEYVKEL
LKYLEFIYSGKGNVGRIFEICKRFYQLEFGDQSLTNYFMEHKRIYANFNALLPDSNDAKVRLAQREQLAIISFLLGLPPRFDVAKDQLLFGDIIFCNYCHKRGHT
KQECRKLLNKGQKTQSAHVASTPDNAGKLVTIPAEAFAKFQQYQESLTTSSSNPITVIAESDLKMKKTIGKGRESNGLYTQRESSQEDDDFLVYCIVFTSTEKLP
SNTSSYVPDPSLPTITQVYSRRQLPTDSCPIPTASSFEDLGISDDLSIALRKGLLQGDETEHIVHVIRGDLEEESYKAPMESSSHAWENEFSCSNDDEIEWGGEH
SIESPSMEEDDSPNENEGMYNGADGF