CuGenDBv2

Moc01g14610 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc01g14610
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr1:9068121..9074843
RNA-Seq Expression: Moc01g14610
Synteny: Moc01g14610
Gene Ontology terms: GO:0005488 - binding (molecular function)
InterPro domains: NA
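
The genome location listed above uses a `chromosome:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page itself does not say so), the string can be parsed and the length of the gene span computed as follows. The helper names `parse_location` and `span_length` are hypothetical and are not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split a 'chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

def span_length(location):
    """Length of the span in base pairs."""
    # Assumption: coordinates are 1-based and inclusive, hence the +1.
    _, start, end = parse_location(location)
    return end - start + 1

print(span_length("chr1:9068121..9074843"))  # 6723
```

Under that assumption the locus spans 6723 bp on chr1.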


Homology
GenBank top hits (e value, % identity, alignment)
KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] (e value: 5.3e-40; identity: 43.38%)
Query:  LQGHDLDKFIGPEAQIPPEFIRS--EGESSSTAIINKEFLNWKRRGKLITSWLLGSMTEEILSQMLECETANEVWTILNNLFSSRNLARVMELKSKPENL
        L+ +DL+ F+  E++ P +++ S     +S+T   N  +  WKR+ +LI+SWLLGSM+EEIL+QML C++A E+W  L  +FSSR LA+ M+ K+K  N+
Subjt:  LQGHDLDKFIGPEAQIPPEFIRS--EGESSSTAIINKEFLNWKRRGKLITSWLLGSMTEEILSQMLECETANEVWTILNNLFSSRNLARVMELKSKPENL

Query:  KKGSLNLKDYFLKVKTIADSLAAASKKLSKNDDIMHLLAGLGIDFDVTVSVISAGKEIPTLQEVYSLLLAQEARNERNNAQINSDASVSSINVTTQDHQK
        KKGS+ LK+YFLK+    D+LA+ +K +S +D I+++LAGLG D+   +SVISA  + P++QEV SLLL QE++NE   +++ S+ ++ S+N+ TQ  +K
Subjt:  KKGSLNLKDYFLKVKTIADSLAAASKKLSKNDDIMHLLAGLGIDFDVTVSVISAGKEIPTLQEVYSLLLAQEARNERNNAQINSDASVSSINVTTQDHQK

Query:  RGNFSNSAETRTNWNNNEA
         G  S     + N++NN +
Subjt:  RGNFSNSAETRTNWNNNEA

XP_022136882.1 dr1-associated corepressor homolog isoform X1 [Momordica charantia] (e value: 1.3e-41; identity: 55.26%)
Query:  RRGKLITSWLLGSMTEEILSQMLECETANEVWTILNNLFSSRNLARVMELKSKPENLKKGSLNLKDYFLKVKTIADSLAAASKKLSKNDDIMHLLAGLGI
        ++ KLITSWL  SM EEIL +M+ C TA EVW IL NL++SRNLARVM+LKSK EN+KKG+L LKDYF KVK + DSLAAA KK++  D IMH+L GL  
Subjt:  RRGKLITSWLLGSMTEEILSQMLECETANEVWTILNNLFSSRNLARVMELKSKPENLKKGSLNLKDYFLKVKTIADSLAAASKKLSKNDDIMHLLAGLGI

Query:  DFDVTVSVISAGKEIPTLQEVYSLLLAQEARNERNNAQINSDASVSSINVTTQ----------DHQK------RGNFSNSAETRTNWNNN
        +F+ TVSVISA  +  TLQEVYSLLL+ E RNERN+  IN+D ++ S+N+T Q          D Q+      R   S +   R NWN+N
Subjt:  DFDVTVSVISAGKEIPTLQEVYSLLLAQEARNERNNAQINSDASVSSINVTTQ----------DHQK------RGNFSNSAETRTNWNNN

XP_022136883.1 dr1-associated corepressor homolog isoform X2 [Momordica charantia] (e value: 1.3e-41; identity: 55.26%)
Query:  RRGKLITSWLLGSMTEEILSQMLECETANEVWTILNNLFSSRNLARVMELKSKPENLKKGSLNLKDYFLKVKTIADSLAAASKKLSKNDDIMHLLAGLGI
        ++ KLITSWL  SM EEIL +M+ C TA EVW IL NL++SRNLARVM+LKSK EN+KKG+L LKDYF KVK + DSLAAA KK++  D IMH+L GL  
Subjt:  RRGKLITSWLLGSMTEEILSQMLECETANEVWTILNNLFSSRNLARVMELKSKPENLKKGSLNLKDYFLKVKTIADSLAAASKKLSKNDDIMHLLAGLGI

Query:  DFDVTVSVISAGKEIPTLQEVYSLLLAQEARNERNNAQINSDASVSSINVTTQ----------DHQK------RGNFSNSAETRTNWNNN
        +F+ TVSVISA  +  TLQEVYSLLL+ E RNERN+  IN+D ++ S+N+T Q          D Q+      R   S +   R NWN+N
Subjt:  DFDVTVSVISAGKEIPTLQEVYSLLLAQEARNERNNAQINSDASVSSINVTTQ----------DHQK------RGNFSNSAETRTNWNNN

XP_022154487.1 uncharacterized protein LOC111021757 [Momordica charantia] (e value: 7.4e-50; identity: 53.3%)
Query:  ELNPTLQGHDLDKFIGPEAQIPPEFIR-SEGESSSTAI-INKEFLNWKRRGKLITSWLLGSMTEEILSQMLECETANEVWTILNNLFSSRNLARVMELKS
        ++   LQG+ L+ +I      P +F++ +E ESSS+++  N  +  W ++ KLI++WLLGSM E+ILSQML+C++A E+WT+L  +F+SR LARVM+LK 
Subjt:  ELNPTLQGHDLDKFIGPEAQIPPEFIR-SEGESSSTAI-INKEFLNWKRRGKLITSWLLGSMTEEILSQMLECETANEVWTILNNLFSSRNLARVMELKS

Query:  KPENLKKGSLNLKDYFLKVKTIADSLAAASKKLSKNDDIMHLLAGLGIDFDVTVSVISAGKEIPTLQEVYSLLLAQEARNERNNAQINSDASVSSINVTT
        K EN KKG+L+LKDYFLK+K + DSLA A KKLS  D IMH+LAGLG +FD  +SVI+A     TLQEV SLLL QE RNERN   INSD S+ S+N+T 
Subjt:  KPENLKKGSLNLKDYFLKVKTIADSLAAASKKLSKNDDIMHLLAGLGIDFDVTVSVISAGKEIPTLQEVYSLLLAQEARNERNNAQINSDASVSSINVTT

Query:  QDHQKRGNFSNS
         D  K+ N   S
Subjt:  QDHQKRGNFSNS

XP_022156747.1 uncharacterized protein LOC111023586 [Momordica charantia] (e value: 2.6e-47; identity: 35.76%)
Query:  VEDVMMNKHALRIEPSLKLHELNPTLQGHDLDKFIGPEAQIPPEFIRS-EG-ESSSTAIINKEFLNWKRRGKLITSWLLGSMTEEILSQMLECETANEVW
        +E++M+ +  +  +   +  ++   +QGH L+++I  + + P  FI++ +G  SS+T   N E+ +W ++ KLI+ WLLGSM+EEILSQML+C    E+W
Subjt:  VEDVMMNKHALRIEPSLKLHELNPTLQGHDLDKFIGPEAQIPPEFIRS-EG-ESSSTAIINKEFLNWKRRGKLITSWLLGSMTEEILSQMLECETANEVW

Query:  TILNNLFSSRNLARVMELKSKPENLKKGSLNLKDYFLKVKTIADSLAAASKKLSKNDDIMHLLAGLGIDFDVTVSVISAGKEIPTLQEVYSLLLAQEARN
        T+L   F+SRNLARVM+LKSK EN+KKGS+NLK+YFLK+K + DSLA A K+L  +D IMH+LA LG +FD  VSVIS  K   ++QE  S        +
Subjt:  TILNNLFSSRNLARVMELKSKPENLKKGSLNLKDYFLKVKTIADSLAAASKKLSKNDDIMHLLAGLGIDFDVTVSVISAGKEIPTLQEVYSLLLAQEARN

Query:  ERNNAQINSDASVSSINVTTQDHQKRGNFSNSAE------TRTNWNNNEAEEIIDPITIGIEVEFGVLTLES--------------NVSYILHTRASLLH
             Q+ S    SS +   Q +   G F  S           ++N +         T  +  +FG  +L S              N+S I H  ++LL 
Subjt:  ERNNAQINSDASVSSINVTTQDHQKRGNFSNSAE------TRTNWNNNEAEEIIDPITIGIEVEFGVLTLES--------------NVSYILHTRASLLH

Query:  STSPQLSSN-TFLLKNLLH--------------------------------DIQSGQILLMGKVNDGMYEFSLTKTSS-APVSAHISYCNKVGIALSAFT
        S S   SS   F L+NLLH                                D+ +GQ+L  G V+D +Y+F L K SS  P S   +  N   I  S   
Subjt:  STSPQLSSN-TFLLKNLLH--------------------------------DIQSGQILLMGKVNDGMYEFSLTKTSS-APVSAHISYCNKVGIALSAFT

Query:  SQNSHKSTIHSLNDSCNFSIPIASVSSVLDIWHWRLGH-LTLNTVQNVPDSCN
         Q S+   +H+         PI   +SVLDIWH R GH   L  VQ V  SCN
Subjt:  SQNSHKSTIHSLNDSCNFSIPIASVSSVLDIWHWRLGH-LTLNTVQNVPDSCN

TrEMBL top hits (e value, % identity, alignment)
A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 2.6e-40; identity: 43.38%)
Query:  LQGHDLDKFIGPEAQIPPEFIRS--EGESSSTAIINKEFLNWKRRGKLITSWLLGSMTEEILSQMLECETANEVWTILNNLFSSRNLARVMELKSKPENL
        L+ +DL+ F+  E++ P +++ S     +S+T   N  +  WKR+ +LI+SWLLGSM+EEIL+QML C++A E+W  L  +FSSR LA+ M+ K+K  N+
Subjt:  LQGHDLDKFIGPEAQIPPEFIRS--EGESSSTAIINKEFLNWKRRGKLITSWLLGSMTEEILSQMLECETANEVWTILNNLFSSRNLARVMELKSKPENL

Query:  KKGSLNLKDYFLKVKTIADSLAAASKKLSKNDDIMHLLAGLGIDFDVTVSVISAGKEIPTLQEVYSLLLAQEARNERNNAQINSDASVSSINVTTQDHQK
        KKGS+ LK+YFLK+    D+LA+ +K +S +D I+++LAGLG D+   +SVISA  + P++QEV SLLL QE++NE   +++ S+ ++ S+N+ TQ  +K
Subjt:  KKGSLNLKDYFLKVKTIADSLAAASKKLSKNDDIMHLLAGLGIDFDVTVSVISAGKEIPTLQEVYSLLLAQEARNERNNAQINSDASVSSINVTTQDHQK

Query:  RGNFSNSAETRTNWNNNEA
         G  S     + N++NN +
Subjt:  RGNFSNSAETRTNWNNNEA

A0A6J1C6N9 dr1-associated corepressor homolog isoform X1 (e value: 6.1e-42; identity: 55.26%)
Query:  RRGKLITSWLLGSMTEEILSQMLECETANEVWTILNNLFSSRNLARVMELKSKPENLKKGSLNLKDYFLKVKTIADSLAAASKKLSKNDDIMHLLAGLGI
        ++ KLITSWL  SM EEIL +M+ C TA EVW IL NL++SRNLARVM+LKSK EN+KKG+L LKDYF KVK + DSLAAA KK++  D IMH+L GL  
Subjt:  RRGKLITSWLLGSMTEEILSQMLECETANEVWTILNNLFSSRNLARVMELKSKPENLKKGSLNLKDYFLKVKTIADSLAAASKKLSKNDDIMHLLAGLGI

Query:  DFDVTVSVISAGKEIPTLQEVYSLLLAQEARNERNNAQINSDASVSSINVTTQ----------DHQK------RGNFSNSAETRTNWNNN
        +F+ TVSVISA  +  TLQEVYSLLL+ E RNERN+  IN+D ++ S+N+T Q          D Q+      R   S +   R NWN+N
Subjt:  DFDVTVSVISAGKEIPTLQEVYSLLLAQEARNERNNAQINSDASVSSINVTTQ----------DHQK------RGNFSNSAETRTNWNNN

A0A6J1C8R2 dr1-associated corepressor homolog isoform X2 (e value: 6.1e-42; identity: 55.26%)
Query:  RRGKLITSWLLGSMTEEILSQMLECETANEVWTILNNLFSSRNLARVMELKSKPENLKKGSLNLKDYFLKVKTIADSLAAASKKLSKNDDIMHLLAGLGI
        ++ KLITSWL  SM EEIL +M+ C TA EVW IL NL++SRNLARVM+LKSK EN+KKG+L LKDYF KVK + DSLAAA KK++  D IMH+L GL  
Subjt:  RRGKLITSWLLGSMTEEILSQMLECETANEVWTILNNLFSSRNLARVMELKSKPENLKKGSLNLKDYFLKVKTIADSLAAASKKLSKNDDIMHLLAGLGI

Query:  DFDVTVSVISAGKEIPTLQEVYSLLLAQEARNERNNAQINSDASVSSINVTTQ----------DHQK------RGNFSNSAETRTNWNNN
        +F+ TVSVISA  +  TLQEVYSLLL+ E RNERN+  IN+D ++ S+N+T Q          D Q+      R   S +   R NWN+N
Subjt:  DFDVTVSVISAGKEIPTLQEVYSLLLAQEARNERNNAQINSDASVSSINVTTQ----------DHQK------RGNFSNSAETRTNWNNN

A0A6J1DLT9 uncharacterized protein LOC111021757 (e value: 3.6e-50; identity: 53.3%)
Query:  ELNPTLQGHDLDKFIGPEAQIPPEFIR-SEGESSSTAI-INKEFLNWKRRGKLITSWLLGSMTEEILSQMLECETANEVWTILNNLFSSRNLARVMELKS
        ++   LQG+ L+ +I      P +F++ +E ESSS+++  N  +  W ++ KLI++WLLGSM E+ILSQML+C++A E+WT+L  +F+SR LARVM+LK 
Subjt:  ELNPTLQGHDLDKFIGPEAQIPPEFIR-SEGESSSTAI-INKEFLNWKRRGKLITSWLLGSMTEEILSQMLECETANEVWTILNNLFSSRNLARVMELKS

Query:  KPENLKKGSLNLKDYFLKVKTIADSLAAASKKLSKNDDIMHLLAGLGIDFDVTVSVISAGKEIPTLQEVYSLLLAQEARNERNNAQINSDASVSSINVTT
        K EN KKG+L+LKDYFLK+K + DSLA A KKLS  D IMH+LAGLG +FD  +SVI+A     TLQEV SLLL QE RNERN   INSD S+ S+N+T 
Subjt:  KPENLKKGSLNLKDYFLKVKTIADSLAAASKKLSKNDDIMHLLAGLGIDFDVTVSVISAGKEIPTLQEVYSLLLAQEARNERNNAQINSDASVSSINVTT

Query:  QDHQKRGNFSNS
         D  K+ N   S
Subjt:  QDHQKRGNFSNS

A0A6J1DSS1 uncharacterized protein LOC111023586 (e value: 1.3e-47; identity: 35.76%)
Query:  VEDVMMNKHALRIEPSLKLHELNPTLQGHDLDKFIGPEAQIPPEFIRS-EG-ESSSTAIINKEFLNWKRRGKLITSWLLGSMTEEILSQMLECETANEVW
        +E++M+ +  +  +   +  ++   +QGH L+++I  + + P  FI++ +G  SS+T   N E+ +W ++ KLI+ WLLGSM+EEILSQML+C    E+W
Subjt:  VEDVMMNKHALRIEPSLKLHELNPTLQGHDLDKFIGPEAQIPPEFIRS-EG-ESSSTAIINKEFLNWKRRGKLITSWLLGSMTEEILSQMLECETANEVW

Query:  TILNNLFSSRNLARVMELKSKPENLKKGSLNLKDYFLKVKTIADSLAAASKKLSKNDDIMHLLAGLGIDFDVTVSVISAGKEIPTLQEVYSLLLAQEARN
        T+L   F+SRNLARVM+LKSK EN+KKGS+NLK+YFLK+K + DSLA A K+L  +D IMH+LA LG +FD  VSVIS  K   ++QE  S        +
Subjt:  TILNNLFSSRNLARVMELKSKPENLKKGSLNLKDYFLKVKTIADSLAAASKKLSKNDDIMHLLAGLGIDFDVTVSVISAGKEIPTLQEVYSLLLAQEARN

Query:  ERNNAQINSDASVSSINVTTQDHQKRGNFSNSAE------TRTNWNNNEAEEIIDPITIGIEVEFGVLTLES--------------NVSYILHTRASLLH
             Q+ S    SS +   Q +   G F  S           ++N +         T  +  +FG  +L S              N+S I H  ++LL 
Subjt:  ERNNAQINSDASVSSINVTTQDHQKRGNFSNSAE------TRTNWNNNEAEEIIDPITIGIEVEFGVLTLES--------------NVSYILHTRASLLH

Query:  STSPQLSSN-TFLLKNLLH--------------------------------DIQSGQILLMGKVNDGMYEFSLTKTSS-APVSAHISYCNKVGIALSAFT
        S S   SS   F L+NLLH                                D+ +GQ+L  G V+D +Y+F L K SS  P S   +  N   I  S   
Subjt:  STSPQLSSN-TFLLKNLLH--------------------------------DIQSGQILLMGKVNDGMYEFSLTKTSS-APVSAHISYCNKVGIALSAFT

Query:  SQNSHKSTIHSLNDSCNFSIPIASVSSVLDIWHWRLGH-LTLNTVQNVPDSCN
         Q S+   +H+         PI   +SVLDIWH R GH   L  VQ V  SCN
Subjt:  SQNSHKSTIHSLNDSCNFSIPIASVSSVLDIWHWRLGH-LTLNTVQNVPDSCN

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). (e value: 1.3e-04; identity: 25%)
Query:  WKRRGKLITSWLLGSMTEEILSQMLECETANEVWTILNNLFSSRNLARVMELKSKPENLKKGSLNLKDYFLKVKTI
        W++   ++  WL+ SMT+++L  ++  ETA+++W  L  +F      ++ +L+ +   L++G  ++++YF K+  +
Subjt:  WKRRGKLITSWLLGSMTEEILSQMLECETANEVWTILNNLFSSRNLARVMELKSKPENLKKGSLNLKDYFLKVKTI

AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) (e value: 7.2e-11; identity: 25.87%)
Query:  NKEFLNWKRRGKLITSWLLGSMT-EEILSQMLECETANEVWTILNNLFSSRNLARVMELKSKPENLKKGSLNLKDYFLKVKTIADSLAAASKKLSKNDDI
        N   +NW++R  ++   L G++T ++     +   T+ ++W  + N F +   AR + L S+      G + + DY+ K+K +ADSL      ++  + +
Subjt:  NKEFLNWKRRGKLITSWLLGSMT-EEILSQMLECETANEVWTILNNLFSSRNLARVMELKSKPENLKKGSLNLKDYFLKVKTIADSLAAASKKLSKNDDI

Query:  MHLLAGLGIDFDVTVSVISAGKEIPTLQEVYSLLLAQEARNER
        M++L GL   FD  ++VI   +  P+  +  ++L  +E R +R
Subjt:  MHLLAGLGIDFDVTVSVISAGKEIPTLQEVYSLLLAQEARNER


Sequences
CDS sequence
ATGAGAGATAATAAACCACAAAGGAAGGTTGTGTTAAATGAGATTTCCGATGAAACTACAAATACATCAACAATAGTTGTTGATAAAGCTAGCACTTCAACAAGAGTTGT
TGATGGCGATAGTACATCACGTCAGTCACATCCATTTCAAGAGTTGAGAGTGCCTCGACATAGTGGTCCATTAGGTCTCCTTGCTAGGCAATCTGTCAATTTGTCTTGCG
GTTCAATTGGGGTTAGTGGAATCTGCCCAAATCCAATGAGGACTGAGTCTCCAAAATTTAGCAAAGTGGAACGATTACCACCTTTGAATGCAATGTGTTTGGGCAAAATA
AAGAATGATTCCCAATCCATAGCAATATGGAGGTCACCAAGGTTACGGAGTGGAGTTGAAGATGTCATGATGAATAAGCATGCATTACGAATTGAACCATCCCTCAAACT
ACATGAGTTGAACCCAACGCTGCAAGGTCATGACTTAGACAAATTTATCGGTCCAGAGGCACAAATTCCACCAGAATTCATCAGATCTGAAGGTGAATCATCTTCCACTG
CAATTATAAATAAAGAATTTCTTAATTGGAAGAGACGAGGCAAATTAATCACTTCATGGCTCCTTGGGTCCATGACTGAAGAAATTTTATCACAGATGCTCGAGTGTGAA
ACAGCCAACGAAGTCTGGACAATTCTGAATAATTTGTTTTCTTCACGTAATTTAGCTAGAGTTATGGAATTGAAATCAAAACCAGAGAATTTAAAGAAAGGAAGTCTCAA
TCTTAAGGATTATTTCCTAAAGGTAAAAACTATTGCAGATTCGTTGGCTGCCGCAAGTAAGAAACTCTCAAAGAATGATGATATTATGCATCTTCTTGCTGGTCTTGGAA
TTGACTTTGATGTTACGGTTTCTGTAATTTCGGCTGGAAAAGAAATTCCAACACTCCAAGAGGTTTATTCACTTCTCTTAGCTCAAGAAGCACGAAATGAGAGGAATAAT
GCACAAATTAATTCTGATGCATCTGTATCTTCTATTAATGTTACCACCCAAGACCATCAGAAGAGAGGAAATTTTTCGAATTCTGCAGAAACTAGAACCAACTGGAACAA
TAACGAGGCAGAGGAAATAATCGATCCAATAACAATTGGAATCGAGGTCGAATTTGGAGTATTAACTCTAGAATCCAATGTCAGTTATATTCTTCATACTCGTGCTTCTC
TACTTCATTCTACTTCTCCTCAATTGTCTTCCAATACTTTTCTCCTCAAGAATCTTCTTCATGATATCCAATCTGGACAAATACTTCTCATGGGTAAAGTCAATGATGGG
ATGTACGAATTCTCCTTGACAAAGACCTCTTCCGCCCCTGTCTCTGCTCATATTTCTTATTGTAATAAAGTTGGTATTGCTTTATCGGCTTTTACTTCTCAGAATTCTCA
TAAATCTACTATTCATTCATTAAATGACAGTTGTAATTTTTCTATTCCTATTGCTTCTGTTTCTTCTGTATTAGACATATGGCATTGGCGTCTTGGCCACCTTACTCTTA
ATACCGTGCAAAATGTTCCTGATTCATGTAATATTTCCTATTCTCGAAATAAAATACCATTGGAACCTGTGCTCTTGAGCCAGTATGTAGTTCTTCTCCATCGCATTCTT
TACCTTCGTGTTCTGCCTCTTTACCTTCCAGTTCTACGTCACTAG
mRNA sequence
ATGAGAGATAATAAACCACAAAGGAAGGTTGTGTTAAATGAGATTTCCGATGAAACTACAAATACATCAACAATAGTTGTTGATAAAGCTAGCACTTCAACAAGAGTTGT
TGATGGCGATAGTACATCACGTCAGTCACATCCATTTCAAGAGTTGAGAGTGCCTCGACATAGTGGTCCATTAGGTCTCCTTGCTAGGCAATCTGTCAATTTGTCTTGCG
GTTCAATTGGGGTTAGTGGAATCTGCCCAAATCCAATGAGGACTGAGTCTCCAAAATTTAGCAAAGTGGAACGATTACCACCTTTGAATGCAATGTGTTTGGGCAAAATA
AAGAATGATTCCCAATCCATAGCAATATGGAGGTCACCAAGGTTACGGAGTGGAGTTGAAGATGTCATGATGAATAAGCATGCATTACGAATTGAACCATCCCTCAAACT
ACATGAGTTGAACCCAACGCTGCAAGGTCATGACTTAGACAAATTTATCGGTCCAGAGGCACAAATTCCACCAGAATTCATCAGATCTGAAGGTGAATCATCTTCCACTG
CAATTATAAATAAAGAATTTCTTAATTGGAAGAGACGAGGCAAATTAATCACTTCATGGCTCCTTGGGTCCATGACTGAAGAAATTTTATCACAGATGCTCGAGTGTGAA
ACAGCCAACGAAGTCTGGACAATTCTGAATAATTTGTTTTCTTCACGTAATTTAGCTAGAGTTATGGAATTGAAATCAAAACCAGAGAATTTAAAGAAAGGAAGTCTCAA
TCTTAAGGATTATTTCCTAAAGGTAAAAACTATTGCAGATTCGTTGGCTGCCGCAAGTAAGAAACTCTCAAAGAATGATGATATTATGCATCTTCTTGCTGGTCTTGGAA
TTGACTTTGATGTTACGGTTTCTGTAATTTCGGCTGGAAAAGAAATTCCAACACTCCAAGAGGTTTATTCACTTCTCTTAGCTCAAGAAGCACGAAATGAGAGGAATAAT
GCACAAATTAATTCTGATGCATCTGTATCTTCTATTAATGTTACCACCCAAGACCATCAGAAGAGAGGAAATTTTTCGAATTCTGCAGAAACTAGAACCAACTGGAACAA
TAACGAGGCAGAGGAAATAATCGATCCAATAACAATTGGAATCGAGGTCGAATTTGGAGTATTAACTCTAGAATCCAATGTCAGTTATATTCTTCATACTCGTGCTTCTC
TACTTCATTCTACTTCTCCTCAATTGTCTTCCAATACTTTTCTCCTCAAGAATCTTCTTCATGATATCCAATCTGGACAAATACTTCTCATGGGTAAAGTCAATGATGGG
ATGTACGAATTCTCCTTGACAAAGACCTCTTCCGCCCCTGTCTCTGCTCATATTTCTTATTGTAATAAAGTTGGTATTGCTTTATCGGCTTTTACTTCTCAGAATTCTCA
TAAATCTACTATTCATTCATTAAATGACAGTTGTAATTTTTCTATTCCTATTGCTTCTGTTTCTTCTGTATTAGACATATGGCATTGGCGTCTTGGCCACCTTACTCTTA
ATACCGTGCAAAATGTTCCTGATTCATGTAATATTTCCTATTCTCGAAATAAAATACCATTGGAACCTGTGCTCTTGAGCCAGTATGTAGTTCTTCTCCATCGCATTCTT
TACCTTCGTGTTCTGCCTCTTTACCTTCCAGTTCTACGTCACTAG
Protein sequence
MRDNKPQRKVVLNEISDETTNTSTIVVDKASTSTRVVDGDSTSRQSHPFQELRVPRHSGPLGLLARQSVNLSCGSIGVSGICPNPMRTESPKFSKVERLPPLNAMCLGKI
KNDSQSIAIWRSPRLRSGVEDVMMNKHALRIEPSLKLHELNPTLQGHDLDKFIGPEAQIPPEFIRSEGESSSTAIINKEFLNWKRRGKLITSWLLGSMTEEILSQMLECE
TANEVWTILNNLFSSRNLARVMELKSKPENLKKGSLNLKDYFLKVKTIADSLAAASKKLSKNDDIMHLLAGLGIDFDVTVSVISAGKEIPTLQEVYSLLLAQEARNERNN
AQINSDASVSSINVTTQDHQKRGNFSNSAETRTNWNNNEAEEIIDPITIGIEVEFGVLTLESNVSYILHTRASLLHSTSPQLSSNTFLLKNLLHDIQSGQILLMGKVNDG
MYEFSLTKTSSAPVSAHISYCNKVGIALSAFTSQNSHKSTIHSLNDSCNFSIPIASVSSVLDIWHWRLGHLTLNTVQNVPDSCNISYSRNKIPLEPVLLSQYVVLLHRIL
YLRVLPLYLPVLRH
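
The protein sequence above should correspond to a codon-by-codon translation of the CDS. The following is a minimal sketch of that check, assuming the standard nuclear genetic code (NCBI translation table 1) and a CDS that starts in frame; the `translate` function is a hypothetical helper written for illustration, not a CuGenDB tool.

```python
# Standard genetic code (NCBI table 1), built in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate an in-frame CDS, stopping at the first stop codon."""
    # Remove whitespace/line breaks and normalize case.
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        # Codons with ambiguous bases are reported as 'X'.
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

# First eight codons of the CDS listed on this page.
print(translate("ATGAGAGATAATAAACCACAAAGG"))  # MRDNKPQR
```

The first eight codons of the CDS give `MRDNKPQR`, which matches the start of the protein sequence. Passing the full CDS (line breaks are ignored) allows the same comparison over the whole record.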