; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc11g26580 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc11g26580
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr11:19582874..19589221
RNA-Seq ExpressionMoc11g26580
SyntenyMoc11g26580
Gene Ontology termsGO:0005737 - cytoplasm (cellular component)
GO:0043231 - intracellular membrane-bounded organelle (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN73189.1 hypothetical protein VITISV_042346 [Vitis vinifera]2.0e-4536.62Show/hide
Query:  IKNSIEGDIIGMVNECEFVKELLKYFEFLYSGKGNVSRIFEICKRFYQSECGDQSLTSYVMEHKNTYAEFNALLPNSNDAKVRLVQREQIAVISFLLGLP
        +KNSI  DI+G+++ CEFVKEL+ Y +FLYSGKGNVSR++++   F+  E G +SLT+Y M+ K  Y E NAL+P S D +V+  QREQ+AV+SFL GLP
Subjt:  IKNSIEGDIIGMVNECEFVKELLKYFEFLYSGKGNVSRIFEICKRFYQSECGDQSLTSYVMEHKNTYAEFNALLPNSNDAKVRLVQREQIAVISFLLGLP

Query:  PKFDVGKDQLLYGSEILGLEE----ALRSHNPSC--HLSLTVLWLDVVQMHT-----------EPGHTKRECRKLLNKGQRTQPAHV-----ASTPDNTG
         +F+  K Q+L GS+I  L+E     LR+ N S   H ++ +L  + +Q              E GHTK+ C KL N+ +R Q A+V     A+  D++ 
Subjt:  PKFDVGKDQLLYGSEILGLEE----ALRSHNPSC--HLSLTVLWLDVVQMHT-----------EPGHTKRECRKLLNKGQRTQPAHV-----ASTPDNTG

Query:  KLVTILAEEFAKFQQYQESLTAS--------------------------------------------------------------TKKTIGKGRESNGLY
        K+VT+  EEF K+ QYQ++L AS                                                              TK+T GKG  S+GLY
Subjt:  KLVTILAEEFAKFQQYQESLTAS--------------------------------------------------------------TKKTIGKGRESNGLY

Query:  TFDTQIPTTTVCTRVPSSFEEHCRL
          D  +P    C    S  E HCRL
Subjt:  TFDTQIPTTTVCTRVPSSFEEHCRL

CAN74964.1 hypothetical protein VITISV_006810 [Vitis vinifera]1.0e-4447.89Show/hide
Query:  IKNSIEGDIIGMVNECEFVKELLKYFEFLYSGKGNVSRIFEICKRFYQSECGDQSLTSYVMEHKNTYAEFNALLPNSNDAKVRLVQREQIAVISFLLGLP
        I+NSI+ +I+G++N CEFVKEL+ Y EFLYSGKGN+SR++++CK FY+ E   +SLT+Y M+ K TY E N LLP S D KV+  QRE++ V+SFL+GLP
Subjt:  IKNSIEGDIIGMVNECEFVKELLKYFEFLYSGKGNVSRIFEICKRFYQSECGDQSLTSYVMEHKNTYAEFNALLPNSNDAKVRLVQREQIAVISFLLGLP

Query:  PKFDVGKDQLLYGSEILGLEEA----LRSHNPSCHLSLTVLWLDVVQMHTEPGHTKRECRKLLNKGQRTQPAHVASTPDNTG----KLVTILAEEFAKFQ
         +F+  K Q+L  SEI  L+E     LR+   S            +Q    PGHTK+ C+KL N  +R Q A+VA+    +     K V + A+EFAKF 
Subjt:  PKFDVGKDQLLYGSEILGLEEA----LRSHNPSCHLSLTVLWLDVVQMHTEPGHTKRECRKLLNKGQRTQPAHVASTPDNTG----KLVTILAEEFAKFQ

Query:  QYQESLTASTKKT
        QYQESL  ST  T
Subjt:  QYQESLTASTKKT

RVW66431.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]6.3e-4740Show/hide
Query:  IKNSIEGDIIGMVNECEFVKELLKYFEFLYSGKGNVSRIFEICKRFYQSECGDQSLTSYVMEHKNTYAEFNALLPNSNDAKVRLVQREQIAVISFLLGLP
        +KNSI  DI+G+++ CEFVKEL+ Y +FLYSGKGNVSR++++   F+  E G +SLT+Y M+ K  Y E NAL+P S D +V+  QREQ+AV+SFL GLP
Subjt:  IKNSIEGDIIGMVNECEFVKELLKYFEFLYSGKGNVSRIFEICKRFYQSECGDQSLTSYVMEHKNTYAEFNALLPNSNDAKVRLVQREQIAVISFLLGLP

Query:  PKFDVGKDQLLYGSEILGLEEALRSHNPSCHLSLTVLWLD----VVQMHTEPGHTKRECRKLLNKGQRTQPAHVAST-----PDNTGKLVTILAEEFAKF
         +F+  K Q+L G      E A R +N   + +      D    V     E GHTK+ CRKL N+ +R Q A+VA++      D++ K+VT+ AEEF+K+
Subjt:  PKFDVGKDQLLYGSEILGLEEALRSHNPSCHLSLTVLWLD----VVQMHTEPGHTKRECRKLLNKGQRTQPAHVAST-----PDNTGKLVTILAEEFAKF

Query:  QQYQESLTAS----------------------------------------TKKTIGKGRESNGLYTFDTQIPTTTVCTRVPSSFEEHCRL
         QYQ++L AS                                        TK+T GKG  S+GLY  D  +P    C    S  E HCRL
Subjt:  QQYQESLTAS----------------------------------------TKKTIGKGRESNGLYTFDTQIPTTTVCTRVPSSFEEHCRL

XP_022850817.1 uncharacterized protein LOC111372670 [Olea europaea var. sylvestris]3.8e-4445.42Show/hide
Query:  IKNSIEGDIIGMVNECEFVKELLKYFEFLYSGKGNVSRIFEICKRFYQSECGDQSLTSYVMEHKNTYAEFNALLPNSNDAKVRLVQREQIAVISFLLGLP
        I+NSI+ ++IG++N CEFVKEL+ Y EFLYS KGNVSRI+E+C+ FY++E   +SLT++ M+ K TY E N LLP S D KV+ +QREQ+AV+SFL GLP
Subjt:  IKNSIEGDIIGMVNECEFVKELLKYFEFLYSGKGNVSRIFEICKRFYQSECGDQSLTSYVMEHKNTYAEFNALLPNSNDAKVRLVQREQIAVISFLLGLP

Query:  PKFDVGKDQLLYGSEILGLEE----ALRSHNPSCHLSLTVLWLD------------------------------VVQMHTEPGHTKRECRKLLNKGQRTQ
         +F+  K Q+L  SEI  L++     LR+ N S      VL                                 V     EPGHTK  CRKL N+ +RTQ
Subjt:  PKFDVGKDQLLYGSEILGLEE----ALRSHNPSCHLSLTVLWLD------------------------------VVQMHTEPGHTKRECRKLLNKGQRTQ

Query:  PAHVASTPDNT-GKLVTILAEEFAKFQQYQESLTASTKKT
         A+VA+TP ++  K V I  +E+AKF QYQESL  S   T
Subjt:  PAHVASTPDNT-GKLVTILAEEFAKFQQYQESLTASTKKT

XP_038882618.1 uncharacterized protein LOC120073824 [Benincasa hispida]8.5e-4448.89Show/hide
Query:  IKNSIEGDIIGMVNECEFVKELLKYFEFLYSGKGNVSRIFEICKRFYQSECGDQSLTSYVMEHKNTYAEFNALLPNSNDAKVRLVQREQIAVISFLLGLP
        IKNSI+ +I+ +VN CE VK+LL+Y +FLYSGK N++R+F++CK  YQ + G++SLTSY ME KNT AEFNAL+P S D KV + + E++ ++SFL+GL 
Subjt:  IKNSIEGDIIGMVNECEFVKELLKYFEFLYSGKGNVSRIFEICKRFYQSECGDQSLTSYVMEHKNTYAEFNALLPNSNDAKVRLVQREQIAVISFLLGLP

Query:  PKFDVGKDQLLYGSEILGLEEA----LRSHN----PSCHLSLTVLW------------LDVVQMH-----TEPG-------HTKRECRKLLNKGQRTQP-
        PK+++ KDQ+L    IL LEEA    LR+       S   S T++             L+  + H     + PG       HTKRECR+LLNKGQR    
Subjt:  PKFDVGKDQLLYGSEILGLEEA----LRSHN----PSCHLSLTVLW------------LDVVQMH-----TEPG-------HTKRECRKLLNKGQRTQP-

Query:  -AHVASTPDNTGKLVTILAEEFAKF
         AHVASTPDN  K +TI AEEFAKF
Subjt:  -AHVASTPDNTGKLVTILAEEFAKF

TrEMBL top hitse value%identityAlignment
A0A438G2L4 Retrovirus-related Pol polyprotein from transposon TNT 1-943.0e-4740Show/hide
Query:  IKNSIEGDIIGMVNECEFVKELLKYFEFLYSGKGNVSRIFEICKRFYQSECGDQSLTSYVMEHKNTYAEFNALLPNSNDAKVRLVQREQIAVISFLLGLP
        +KNSI  DI+G+++ CEFVKEL+ Y +FLYSGKGNVSR++++   F+  E G +SLT+Y M+ K  Y E NAL+P S D +V+  QREQ+AV+SFL GLP
Subjt:  IKNSIEGDIIGMVNECEFVKELLKYFEFLYSGKGNVSRIFEICKRFYQSECGDQSLTSYVMEHKNTYAEFNALLPNSNDAKVRLVQREQIAVISFLLGLP

Query:  PKFDVGKDQLLYGSEILGLEEALRSHNPSCHLSLTVLWLD----VVQMHTEPGHTKRECRKLLNKGQRTQPAHVAST-----PDNTGKLVTILAEEFAKF
         +F+  K Q+L G      E A R +N   + +      D    V     E GHTK+ CRKL N+ +R Q A+VA++      D++ K+VT+ AEEF+K+
Subjt:  PKFDVGKDQLLYGSEILGLEEALRSHNPSCHLSLTVLWLD----VVQMHTEPGHTKRECRKLLNKGQRTQPAHVAST-----PDNTGKLVTILAEEFAKF

Query:  QQYQESLTAS----------------------------------------TKKTIGKGRESNGLYTFDTQIPTTTVCTRVPSSFEEHCRL
         QYQ++L AS                                        TK+T GKG  S+GLY  D  +P    C    S  E HCRL
Subjt:  QQYQESLTAS----------------------------------------TKKTIGKGRESNGLYTFDTQIPTTTVCTRVPSSFEEHCRL

A0A438GNT2 Uncharacterized protein1.0e-4245.54Show/hide
Query:  IKNSIEGDIIGMVNECEFVKELLKYFEFLYSGKGNVSRIFEICKRFYQSECGDQSLTSYVMEHKNTYAEFNALLPNSNDAKVRLVQREQIAVISFLLGLP
        +KNSI  DI+G+++ C+FVKEL+ Y +FLYS KG+VSR++++   F+  E G +SLT+Y M+ K  Y E NAL+P S D +V+  QREQ+ V+SFL GLP
Subjt:  IKNSIEGDIIGMVNECEFVKELLKYFEFLYSGKGNVSRIFEICKRFYQSECGDQSLTSYVMEHKNTYAEFNALLPNSNDAKVRLVQREQIAVISFLLGLP

Query:  PKFDVGKDQLLYGSEILGLEE----ALRSHNPSC--HLSLTVLWLDVVQMHTEPGHTKRECRKLLNKGQRTQPAHVAST-----PDNTGKLVTILAEEFA
         +F+  K Q+L GS+I  L+E     LR+ N S   H ++ V      +     GHTK+ CRKL N+ +R Q A+VA++      D++ K+VT++A+EFA
Subjt:  PKFDVGKDQLLYGSEILGLEE----ALRSHNPSC--HLSLTVLWLDVVQMHTEPGHTKRECRKLLNKGQRTQPAHVAST-----PDNTGKLVTILAEEFA

Query:  KFQQYQESLTAST
        K+ QYQ++L AST
Subjt:  KFQQYQESLTAST

A5B136 Uncharacterized protein1.0e-4246.33Show/hide
Query:  IKNSIEGDIIGMVNECEFVKELLKYFEFLYSGKGNVSRIFEICKRFYQSECGDQSLTSYVMEHKNTYAEFNALLPNSNDAKVRLVQREQIAVISFLLGLP
        +KNSI  DI+G+++ CEFVKEL+ Y +FLYSGKGNVSR++++   F+  E G +SLT+Y M+ K  Y E NAL+P S D +V+  QREQ+AV+SFL GLP
Subjt:  IKNSIEGDIIGMVNECEFVKELLKYFEFLYSGKGNVSRIFEICKRFYQSECGDQSLTSYVMEHKNTYAEFNALLPNSNDAKVRLVQREQIAVISFLLGLP

Query:  PKFDVGKDQLLYGSEILGL------EEALRSHNPSCHLSLTVLWLD-----VVQMHTEPGHTKRECRKLLNKGQRTQPAHVAST-----PDNTGKLVTIL
         +F+  K Q+L GS+I  L      E A R +N   + +      D     V     E GHTK+ CRKL N+ +R Q A+VA++      D++ K+VT+ 
Subjt:  PKFDVGKDQLLYGSEILGL------EEALRSHNPSCHLSLTVLWLD-----VVQMHTEPGHTKRECRKLLNKGQRTQPAHVAST-----PDNTGKLVTIL

Query:  AEEFAKFQQYQESLTAST
        AEEF+K+ QYQ++L AST
Subjt:  AEEFAKFQQYQESLTAST

A5BSK6 Integrase catalytic domain-containing protein4.8e-4547.89Show/hide
Query:  IKNSIEGDIIGMVNECEFVKELLKYFEFLYSGKGNVSRIFEICKRFYQSECGDQSLTSYVMEHKNTYAEFNALLPNSNDAKVRLVQREQIAVISFLLGLP
        I+NSI+ +I+G++N CEFVKEL+ Y EFLYSGKGN+SR++++CK FY+ E   +SLT+Y M+ K TY E N LLP S D KV+  QRE++ V+SFL+GLP
Subjt:  IKNSIEGDIIGMVNECEFVKELLKYFEFLYSGKGNVSRIFEICKRFYQSECGDQSLTSYVMEHKNTYAEFNALLPNSNDAKVRLVQREQIAVISFLLGLP

Query:  PKFDVGKDQLLYGSEILGLEEA----LRSHNPSCHLSLTVLWLDVVQMHTEPGHTKRECRKLLNKGQRTQPAHVASTPDNTG----KLVTILAEEFAKFQ
         +F+  K Q+L  SEI  L+E     LR+   S            +Q    PGHTK+ C+KL N  +R Q A+VA+    +     K V + A+EFAKF 
Subjt:  PKFDVGKDQLLYGSEILGLEEA----LRSHNPSCHLSLTVLWLDVVQMHTEPGHTKRECRKLLNKGQRTQPAHVASTPDNTG----KLVTILAEEFAKFQ

Query:  QYQESLTASTKKT
        QYQESL  ST  T
Subjt:  QYQESLTASTKKT

A5C970 Uncharacterized protein9.7e-4636.62Show/hide
Query:  IKNSIEGDIIGMVNECEFVKELLKYFEFLYSGKGNVSRIFEICKRFYQSECGDQSLTSYVMEHKNTYAEFNALLPNSNDAKVRLVQREQIAVISFLLGLP
        +KNSI  DI+G+++ CEFVKEL+ Y +FLYSGKGNVSR++++   F+  E G +SLT+Y M+ K  Y E NAL+P S D +V+  QREQ+AV+SFL GLP
Subjt:  IKNSIEGDIIGMVNECEFVKELLKYFEFLYSGKGNVSRIFEICKRFYQSECGDQSLTSYVMEHKNTYAEFNALLPNSNDAKVRLVQREQIAVISFLLGLP

Query:  PKFDVGKDQLLYGSEILGLEE----ALRSHNPSC--HLSLTVLWLDVVQMHT-----------EPGHTKRECRKLLNKGQRTQPAHV-----ASTPDNTG
         +F+  K Q+L GS+I  L+E     LR+ N S   H ++ +L  + +Q              E GHTK+ C KL N+ +R Q A+V     A+  D++ 
Subjt:  PKFDVGKDQLLYGSEILGLEE----ALRSHNPSC--HLSLTVLWLDVVQMHT-----------EPGHTKRECRKLLNKGQRTQPAHV-----ASTPDNTG

Query:  KLVTILAEEFAKFQQYQESLTAS--------------------------------------------------------------TKKTIGKGRESNGLY
        K+VT+  EEF K+ QYQ++L AS                                                              TK+T GKG  S+GLY
Subjt:  KLVTILAEEFAKFQQYQESLTAS--------------------------------------------------------------TKKTIGKGRESNGLY

Query:  TFDTQIPTTTVCTRVPSSFEEHCRL
          D  +P    C    S  E HCRL
Subjt:  TFDTQIPTTTVCTRVPSSFEEHCRL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATCATAAAATTGTGGAAGTGGCGGAAGAGGTTGCGCTGATTGTTGAGCCTGAAAGCATTGCACGAGGTAGTCAAATTGTGGCTGATTCTTTACTTGAAATCAATGA
AACTCGCCTGACAGAAGACAAACACCCTCCTCGAGTGATCGAGGTCAGTTATCAACTTGGCGTGGGGAAGACCACTTGTCGTGCTACCACGTCTAGCATGCTTGATGCTT
TGTTTCATGATTCGCCCAACGTCGATAGACTTATCAGTGATGATGGTGTACAACAATATAGCCTGTTCTTTGGTGACATCGCTCATGGTTTTGTGCCCATTAGGGTTTTG
TTCATTAGTATCCAACTCCAACTCACCACTGTCGCTGCCACCGTCGCCCACTACTGCCGTCGTCGTCGCCCACTACCGTTGCCGCCACTATTGCCATCGTCGACGTCGCT
CACTGCTGTTTGTTGGTCGTGCGTTTTCGTCGCCGACGTTGCTGTTTGGGGTTTCCGTCTCACCATCGTTCACGCTTCTGTCGCCGCTAGCTGCCGCCCATCACCGTTGA
CATGCGCCTGCGTCACTAGCCATCGCCCTCTGTCGCCATCAACGTGCGCTACCGTCACTGTTGGTCTTTTGGTGTCATGGATCAAGAATTCGATTGAGGGTGACATTATT
GGCATGGTCAACGAGTGCGAGTTTGTTAAAGAATTGCTTAAATACTTTGAATTCCTTTATTCTGGAAAAGGAAATGTTAGTCGAATATTTGAAATCTGCAAGCGCTTCTA
CCAATCTGAGTGTGGTGACCAATCGCTTACGAGTTACGTTATGGAACACAAAAATACTTATGCAGAGTTTAATGCATTACTCCCAAATAGTAATGATGCAAAAGTTCGGC
TTGTCCAACGCGAACAAATAGCAGTTATAAGTTTTCTTCTTGGTCTTCCACCTAAATTTGATGTGGGCAAAGATCAATTACTCTATGGTTCGGAAATTCTAGGTTTAGAG
GAGGCATTGAGAAGTCACAACCCGTCCTGTCATCTCAGTCTAACAGTGCTTTGGTTGGACGTAGTACAAATGCATACCGAGCCTGGACATACAAAGCGAGAATGTAGAAA
GCTGTTGAATAAAGGTCAGAGAACACAGCCTGCACATGTTGCATCTACTCCTGATAATACTGGCAAGTTGGTTACGATTCTTGCGGAAGAGTTTGCTAAGTTCCAACAGT
ATCAAGAGTCATTGACGGCATCGACGAAGAAGACTATTGGTAAAGGGCGTGAATCCAATGGCCTCTACACATTTGATACACAAATCCCTACAACTACTGTTTGCACTCGA
GTACCATCTTCTTTCGAAGAACATTGTCGTTTAGCAGGTGCGGCGGTTCTGTTCGCGACAGCAGCGTTCAGTAGGGCGGCGGTTCCGTTCGTGACAGCAGCGTGCGGCGG
CTTCTGTTCGGGACAGTATCGTGTGTGGCGGCTCCCGTTTGTGACAGTAGCATGCAGTAGGTGCGGCGACTCCATTGGTGACAGCGGCGTGCAGCAGGTGTTGAATCGAC
GAGTGTTTTGGAAGCATAGCAGTGTCTTTGTTAGTTGTGGTTGGTTAGAATGTTGTGAAGCATCTGTTGGATCTTATGTTTCTGAGTGTGACTGCTGTGTCGATTTTCTT
AGAGTCGATACGACTCGAGGAGTTAGACTTGGAGGACTAACATGGGGTGGTGATTATGAGCAACAGTTCAAGACCTTGGGGATAAATGGCAAGGTCGAACGCCAAGCTTT
GGTAGAGAGATGTGTAAACAGGACCCTCGACCGTGGTGATGACGTGGAGGAGACCTGA
mRNA sequenceShow/hide mRNA sequence
ATGAATCATAAAATTGTGGAAGTGGCGGAAGAGGTTGCGCTGATTGTTGAGCCTGAAAGCATTGCACGAGGTAGTCAAATTGTGGCTGATTCTTTACTTGAAATCAATGA
AACTCGCCTGACAGAAGACAAACACCCTCCTCGAGTGATCGAGGTCAGTTATCAACTTGGCGTGGGGAAGACCACTTGTCGTGCTACCACGTCTAGCATGCTTGATGCTT
TGTTTCATGATTCGCCCAACGTCGATAGACTTATCAGTGATGATGGTGTACAACAATATAGCCTGTTCTTTGGTGACATCGCTCATGGTTTTGTGCCCATTAGGGTTTTG
TTCATTAGTATCCAACTCCAACTCACCACTGTCGCTGCCACCGTCGCCCACTACTGCCGTCGTCGTCGCCCACTACCGTTGCCGCCACTATTGCCATCGTCGACGTCGCT
CACTGCTGTTTGTTGGTCGTGCGTTTTCGTCGCCGACGTTGCTGTTTGGGGTTTCCGTCTCACCATCGTTCACGCTTCTGTCGCCGCTAGCTGCCGCCCATCACCGTTGA
CATGCGCCTGCGTCACTAGCCATCGCCCTCTGTCGCCATCAACGTGCGCTACCGTCACTGTTGGTCTTTTGGTGTCATGGATCAAGAATTCGATTGAGGGTGACATTATT
GGCATGGTCAACGAGTGCGAGTTTGTTAAAGAATTGCTTAAATACTTTGAATTCCTTTATTCTGGAAAAGGAAATGTTAGTCGAATATTTGAAATCTGCAAGCGCTTCTA
CCAATCTGAGTGTGGTGACCAATCGCTTACGAGTTACGTTATGGAACACAAAAATACTTATGCAGAGTTTAATGCATTACTCCCAAATAGTAATGATGCAAAAGTTCGGC
TTGTCCAACGCGAACAAATAGCAGTTATAAGTTTTCTTCTTGGTCTTCCACCTAAATTTGATGTGGGCAAAGATCAATTACTCTATGGTTCGGAAATTCTAGGTTTAGAG
GAGGCATTGAGAAGTCACAACCCGTCCTGTCATCTCAGTCTAACAGTGCTTTGGTTGGACGTAGTACAAATGCATACCGAGCCTGGACATACAAAGCGAGAATGTAGAAA
GCTGTTGAATAAAGGTCAGAGAACACAGCCTGCACATGTTGCATCTACTCCTGATAATACTGGCAAGTTGGTTACGATTCTTGCGGAAGAGTTTGCTAAGTTCCAACAGT
ATCAAGAGTCATTGACGGCATCGACGAAGAAGACTATTGGTAAAGGGCGTGAATCCAATGGCCTCTACACATTTGATACACAAATCCCTACAACTACTGTTTGCACTCGA
GTACCATCTTCTTTCGAAGAACATTGTCGTTTAGCAGGTGCGGCGGTTCTGTTCGCGACAGCAGCGTTCAGTAGGGCGGCGGTTCCGTTCGTGACAGCAGCGTGCGGCGG
CTTCTGTTCGGGACAGTATCGTGTGTGGCGGCTCCCGTTTGTGACAGTAGCATGCAGTAGGTGCGGCGACTCCATTGGTGACAGCGGCGTGCAGCAGGTGTTGAATCGAC
GAGTGTTTTGGAAGCATAGCAGTGTCTTTGTTAGTTGTGGTTGGTTAGAATGTTGTGAAGCATCTGTTGGATCTTATGTTTCTGAGTGTGACTGCTGTGTCGATTTTCTT
AGAGTCGATACGACTCGAGGAGTTAGACTTGGAGGACTAACATGGGGTGGTGATTATGAGCAACAGTTCAAGACCTTGGGGATAAATGGCAAGGTCGAACGCCAAGCTTT
GGTAGAGAGATGTGTAAACAGGACCCTCGACCGTGGTGATGACGTGGAGGAGACCTGA
Protein sequenceShow/hide protein sequence
MNHKIVEVAEEVALIVEPESIARGSQIVADSLLEINETRLTEDKHPPRVIEVSYQLGVGKTTCRATTSSMLDALFHDSPNVDRLISDDGVQQYSLFFGDIAHGFVPIRVL
FISIQLQLTTVAATVAHYCRRRRPLPLPPLLPSSTSLTAVCWSCVFVADVAVWGFRLTIVHASVAASCRPSPLTCACVTSHRPLSPSTCATVTVGLLVSWIKNSIEGDII
GMVNECEFVKELLKYFEFLYSGKGNVSRIFEICKRFYQSECGDQSLTSYVMEHKNTYAEFNALLPNSNDAKVRLVQREQIAVISFLLGLPPKFDVGKDQLLYGSEILGLE
EALRSHNPSCHLSLTVLWLDVVQMHTEPGHTKRECRKLLNKGQRTQPAHVASTPDNTGKLVTILAEEFAKFQQYQESLTASTKKTIGKGRESNGLYTFDTQIPTTTVCTR
VPSSFEEHCRLAGAAVLFATAAFSRAAVPFVTAACGGFCSGQYRVWRLPFVTVACSRCGDSIGDSGVQQVLNRRVFWKHSSVFVSCGWLECCEASVGSYVSECDCCVDFL
RVDTTRGVRLGGLTWGGDYEQQFKTLGINGKVERQALVERCVNRTLDRGDDVEET