; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc02g18780 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc02g18780
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr2:14006289..14008446
RNA-Seq ExpressionMoc02g18780
SyntenyMoc02g18780
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_024033610.1 uncharacterized protein LOC112095733 [Citrus clementina]8.1e-5640.84Show/hide
Query:  MQVADSFKVEGVGKETMRLKLFLYSLRDIAGARLYSLPVESITTWNDLAENFLIH--------------------------ESWERFKELLKKCPHHRIP
        ++V+D+FK+ G  +E +RL+LF +SLRD A A L SLP +SITTW+DLA+ FL+                           ++WERFKELL++CPHH IP
Subjt:  MQVADSFKVEGVGKETMRLKLFLYSLRDIAGARLYSLPVESITTWNDLAENFLIH--------------------------ESWERFKELLKKCPHHRIP

Query:  CCIHIENYCNGLNEVTQLVIDASENGALLSKPYAEAFDILERISRNKHQWLKYR------------VDLLTNSWLADIVMKSATESEAGASKATVNAVQS
        CCI +E   NGLN+ T+L++DAS NGALL K Y EA++ILERI+ N +QW   R            VD LT +  A +   +        + ATVN +  
Subjt:  CCIHIENYCNGLNEVTQLVIDASENGALLSKPYAEAFDILERISRNKHQWLKYR------------VDLLTNSWLADIVMKSATESEAGASKATVNAVQS

Query:  ALCQYCEGEHQFENYPGNP----------------YPPGFSGQNQQSTQKPPETL-SLEDMFTAYMTKNDVNVQSQVALLQSQVASLRNMEVQIYQLVTE
          C YC   H F+N PGNP                 PPGF  QNQ+      + L SLE +   Y+ +N+       A++QS   SLRN+E QI QL   
Subjt:  ALCQYCEGEHQFENYPGNP----------------YPPGFSGQNQQSTQKPPETL-SLEDMFTAYMTKNDVNVQSQVALLQSQVASLRNMEVQIYQLVTE

Query:  LKNRLRGALPSDTEERKRDGKEQCKALTLKNGE
        L NR +G+LPS+TE  +R+GKE CK + L++G+
Subjt:  LKNRLRGALPSDTEERKRDGKEQCKALTLKNGE

XP_030494802.1 uncharacterized protein LOC115710583 [Cannabis sativa]3.9e-5840.33Show/hide
Query:  MQVADSFKVEGVGKETMRLKLFLYSLRDIAGARLYSLPVESITTWNDLAENFL--------------------------IHESWERFKELLKKCPHHRIP
        ++V+DSFK++GV +E +R KLF +SLRD A A L +LP +S+T WNDLAE FL                            ++WERFKE+L+KCPHH IP
Subjt:  MQVADSFKVEGVGKETMRLKLFLYSLRDIAGARLYSLPVESITTWNDLAENFL--------------------------IHESWERFKELLKKCPHHRIP

Query:  CCIHIENYCNGLNEVTQLVIDASENGALLSKPYAEAFDILERISRNKHQWLKYR------------VDLLT--NSWLADI--VMKSATESEAGASKATVN
         CI +E + NGLN  +++V+DAS NGA+LSK Y EAF+ILERI+ N +QW   R            VD LT   + +A +  ++K+     +    AT+ 
Subjt:  CCIHIENYCNGLNEVTQLVIDASENGALLSKPYAEAFDILERISRNKHQWLKYR------------VDLLT--NSWLADI--VMKSATESEAGASKATVN

Query:  AVQSALCQYCEGEHQFENYPGNP---------------------YPPGFSGQNQQSTQKPP---ETLSLEDMFTAYMTKNDVNVQSQVALLQSQVASLRN
          + + C YC   H FEN P NP                     +PPGFS Q +      P   +T SLE +   YM KND       A++QSQ ASL+N
Subjt:  AVQSALCQYCEGEHQFENYPGNP---------------------YPPGFSGQNQQSTQKPP---ETLSLEDMFTAYMTKNDVNVQSQVALLQSQVASLRN

Query:  MEVQIYQLVTELKNRLRGALPSDTEERKRDGKEQCKALTLKNGETPKTDKARASSSKPTDKLKHPGQ
        +E+Q+ QL  +LKNR +G LPSDTE  +RDGKE CKA+TL++G+  +++ A A+ SK +  ++  G+
Subjt:  MEVQIYQLVTELKNRLRGALPSDTEERKRDGKEQCKALTLKNGETPKTDKARASSSKPTDKLKHPGQ

XP_030497803.1 uncharacterized protein LOC115713460 [Cannabis sativa]1.5e-5739.65Show/hide
Query:  MQVADSFKVEGVGKETMRLKLFLYSLRDIAGARLYSLPVESITTWNDLAENFL--------------------------IHESWERFKELLKKCPHHRIP
        ++V+DSFK++GV +E +RLKLF +SLRD A A L +LP +S+T WNDLAE FL                            ++WERFKELL+KCPHH IP
Subjt:  MQVADSFKVEGVGKETMRLKLFLYSLRDIAGARLYSLPVESITTWNDLAENFL--------------------------IHESWERFKELLKKCPHHRIP

Query:  CCIHIENYCNGLNEVTQLVIDASENGALLSKPYAEAFDILERISRNKHQWLKYR------------VDLLTNSWLADIVMKSATES-EAGASKATVNAVQ
         CI +E + NGLN  +++V+DAS NGA+LSK Y EAF+ILERI+ N +QW   R            VD LT        M +  ++   G S     A+Q
Subjt:  CCIHIENYCNGLNEVTQLVIDASENGALLSKPYAEAFDILERISRNKHQWLKYR------------VDLLTNSWLADIVMKSATES-EAGASKATVNAVQ

Query:  SA--LCQYCEGEHQFENYP-----------------GNPY------------------------PPGFSGQNQQSTQKPP---ETLSLEDMFTAYMTKND
         A   C YC   H FEN P                  NPY                        PPGFS Q +      P   +T SLE +   YM KND
Subjt:  SA--LCQYCEGEHQFENYP-----------------GNPY------------------------PPGFSGQNQQSTQKPP---ETLSLEDMFTAYMTKND

Query:  VNVQSQVALLQSQVASLRNMEVQIYQLVTELKNRLRGALPSDTEERKRDGKEQCKALTLKNGETPKTDKARASSSKPTDKLKHPGQPTAPATT-TELRPL
                ++QSQ ASLRN+EVQ+ QL  +LKNR +G LPSDTE  +RDGKE CKA+TL++G+  +++ A   S +P+   K       PAT+  E+ P+
Subjt:  VNVQSQVALLQSQVASLRNMEVQIYQLVTELKNRLRGALPSDTEERKRDGKEQCKALTLKNGETPKTDKARASSSKPTDKLKHPGQPTAPATT-TELRPL

Query:  V
        V
Subjt:  V

XP_030507648.1 uncharacterized protein LOC115722545 [Cannabis sativa]2.4e-5540.43Show/hide
Query:  MQVADSFKVEGVGKETMRLKLFLYSLRDIAGARLYSLPVESITTWNDLAENFL--------------------------IHESWERFKELLKKCPHHRIP
        ++V+DSFK++GV +E +RLKLF +SLRD A A L +LP +S+T WNDLAE FL                            ++WERFKELL+KCPHH IP
Subjt:  MQVADSFKVEGVGKETMRLKLFLYSLRDIAGARLYSLPVESITTWNDLAENFL--------------------------IHESWERFKELLKKCPHHRIP

Query:  CCIHIENYCNGLNEVTQLVIDASENGALLSKPYAEAFDILERISRNKHQWLKYR------------VDLLTNSWLADIVMKSATES-EAGASKATVNAVQ
         CI +E + NGLN +T++V+DAS NGA+LSK Y EAF+ILERI+ N +QW   R            VD LT        M +  ++   G S     A+Q
Subjt:  CCIHIENYCNGLNEVTQLVIDASENGALLSKPYAEAFDILERISRNKHQWLKYR------------VDLLTNSWLADIVMKSATES-EAGASKATVNAVQ

Query:  SA--LCQYCEGEHQFENYP-----------------GNPY-----------------------------------PPGFSGQNQQSTQKPPETLSLEDMF
         A   C YC   H FEN P                  NPY                                   PPGFS Q +    +  +T SLE + 
Subjt:  SA--LCQYCEGEHQFENYP-----------------GNPY-----------------------------------PPGFSGQNQQSTQKPPETLSLEDMF

Query:  TAYMTKNDVNVQSQVALLQSQVASLRNMEVQIYQLVTELKNRLRGALPSDTEERKRDGKEQCKALTLKNGE
          YM KND       A++QSQ ASLRN+EVQ+ QL  +LKNR +G LPSDTE  +RDGKE CKA+TL++G+
Subjt:  TAYMTKNDVNVQSQVALLQSQVASLRNMEVQIYQLVTELKNRLRGALPSDTEERKRDGKEQCKALTLKNGE

XP_030510138.1 uncharacterized protein LOC115724905 [Cannabis sativa]1.1e-5540.27Show/hide
Query:  MQVADSFKVEGVGKETMRLKLFLYSLRDIAGARLYSLPVESITTWNDLAENFL--------------------------IHESWERFKELLKKCPHHRIP
        ++V+DSFK++GV +E +RLKLF +SLRD A A L +LP +S+T WNDLAENFL                            ++WERFKELL+KCPHH IP
Subjt:  MQVADSFKVEGVGKETMRLKLFLYSLRDIAGARLYSLPVESITTWNDLAENFL--------------------------IHESWERFKELLKKCPHHRIP

Query:  CCIHIENYCNGLNEVTQLVIDASENGALLSKPYAEAFDILERISRNKHQWLKYR------------VDLLTNSWLADIVMKSATES-EAGASKATVNAVQ
         CI +E + NGLN  +++V+DAS NGA+LSK Y EAF+ILERI+ N +QW   R            VD LT        M +  ++   G S     A+Q
Subjt:  CCIHIENYCNGLNEVTQLVIDASENGALLSKPYAEAFDILERISRNKHQWLKYR------------VDLLTNSWLADIVMKSATES-EAGASKATVNAVQ

Query:  SA--LCQYCEGEHQFENYPGNP---------------------------------------------------YPPGFSGQNQQSTQKPPETLSLEDMFT
         A   C YC   H FEN P NP                                                   +PPGFS Q  Q   +  +T SLE +  
Subjt:  SA--LCQYCEGEHQFENYPGNP---------------------------------------------------YPPGFSGQNQQSTQKPPETLSLEDMFT

Query:  AYMTKNDVNVQSQVALLQSQVASLRNMEVQIYQLVTELKNRLRGALPSDTEERKRDGKEQCKALTLKNGE
         YM KND       A++QSQ ASLRN+EVQ+ QL  +LKNR +G LPSDTE  +RD KE CKA+TL++G+
Subjt:  AYMTKNDVNVQSQVALLQSQVASLRNMEVQIYQLVTELKNRLRGALPSDTEERKRDGKEQCKALTLKNGE

TrEMBL top hitse value%identityAlignment
A0A5B6VWJ0 Retroelement pol polyprotein-like7.9e-4134.38Show/hide
Query:  YSLRDIAGARLYSLPVESITTWNDLAENFLI--------------------------HESWERFKELLKKCPHHRIPCCIHIENYCNGLNEVTQLVIDAS
        +SLRD A A L SLP  SI+TW +LAE FL+                          +E+WERFKELL+KCPHH IP CI +E + NGL   T++V+DAS
Subjt:  YSLRDIAGARLYSLPVESITTWNDLAENFLI--------------------------HESWERFKELLKKCPHHRIPCCIHIENYCNGLNEVTQLVIDAS

Query:  ENGALLSKPYAEAFDILERISRNKHQWLKYR------------VDLLTN-----SWLADIVMKSATESEAGASKATVNAVQSALCQYCEGEHQFENYPGN
         NGALLSK Y EA++I+ERI+ N +QW   R            VD +T+     S ++ +     T      +    N  ++    YC   H  E  P N
Subjt:  ENGALLSKPYAEAFDILERISRNKHQWLKYR------------VDLLTN-----SWLADIVMKSATESEAGASKATVNAVQSALCQYCEGEHQFENYPGN

Query:  PYPPGFSGQNQQS----------------------------------TQKPPETL-----------------SLEDMFTAYMTKNDVNVQSQVALLQSQV
        P    + G   Q+                                  TQ  P  L                 SLE +   YM KND       AL+QSQ 
Subjt:  PYPPGFSGQNQQS----------------------------------TQKPPETL-----------------SLEDMFTAYMTKNDVNVQSQVALLQSQV

Query:  ASLRNMEVQIYQLVTELKNRLRGALPSDTEERKRDGKEQCKALTLKNGE-----TPKTDKARASSSKPTDKLKHPGQPTAP
        A+L+N+E Q+ QL TEL+NRL+GALPSDTE  +  GKE CKALTL++ +     T + +K +A++    +       P +P
Subjt:  ASLRNMEVQIYQLVTELKNRLRGALPSDTEERKRDGKEQCKALTLKNGE-----TPKTDKARASSSKPTDKLKHPGQPTAP

A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129452.6e-3933.33Show/hide
Query:  MQVADSFKVEGVGKETMRLKLFLYSLRDIAGARLYSLPVESITTWNDLAENFL--------------------------IHESWERFKELLKKCPHHRIP
        +++ D+FK  GV  + +RL+LF +SLRD A + L SLP  SITTW DLA+ FL                          ++E+WERFKELL++CPHH IP
Subjt:  MQVADSFKVEGVGKETMRLKLFLYSLRDIAGARLYSLPVESITTWNDLAENFL--------------------------IHESWERFKELLKKCPHHRIP

Query:  CCIHIENYCNGLNEVTQLVIDASENGALLSKPYAEAFDILERISRNKHQWLKYRVD--LLTNSWLADIVMKSATESEAGASKAT---VNAVQSAL--CQY
          + ++ + NGL    + +IDA+  GAL+SK   +A+++LE ++ N +QW   R        ++  D +    T+  A + K     V+AVQ++L  C+ 
Subjt:  CCIHIENYCNGLNEVTQLVIDASENGALLSKPYAEAFDILERISRNKHQWLKYRVD--LLTNSWLADIVMKSATESEAGASKAT---VNAVQSAL--CQY

Query:  CEGEHQFENYP----------------GNPY-----------------------------PPGFSGQNQQSTQKPPETLSLEDMFTAYMTKNDVNVQSQV
        C   H ++  P                 NPY                             PPGF  Q Q   Q P +   LE++   Y++K D       
Subjt:  CEGEHQFENYP----------------GNPY-----------------------------PPGFSGQNQQSTQKPPETLSLEDMFTAYMTKNDVNVQSQV

Query:  ALLQSQVASLRNMEVQIYQLVTELKNRLRGALPSDTEERKRDGKEQCKALTLKNGETPK--TDKARASSSKPTDK
        A++QSQ ASLRN+E Q+ QL   + NR +G+LPSDT+   + GKEQC+A+TL++G+  +    KA  S  +  DK
Subjt:  ALLQSQVASLRNMEVQIYQLVTELKNRLRGALPSDTEERKRDGKEQCKALTLKNGETPK--TDKARASSSKPTDK

A0A6J1DWK1 uncharacterized protein LOC1110250531.5e-4440.11Show/hide
Query:  MQVADSFKVEGVGKETMRLKLFLYSLRDIAGARLYSLPVESITTWNDLAENFLIHESWERFKELLKKCPHHRIPCCIHIENYCNGLNEVTQLVIDASENG
        M V +SFK EG+ K  +RLKLF YSLR  A   L SL  E IT+W+DL E FL+ + +   K L ++CP+H IP  I IE Y  GL+  T+LVIDAS NG
Subjt:  MQVADSFKVEGVGKETMRLKLFLYSLRDIAGARLYSLPVESITTWNDLAENFLIHESWERFKELLKKCPHHRIPCCIHIENYCNGLNEVTQLVIDASENG

Query:  ALLSKPYAEAFDILERISRNKHQWLKYRVDLLTNSWLADIVMKSATESEAGASKATVNAVQSALCQYCEGEHQFEN-------------YPGN-------
        ALL KPYA+A +ILERIS + H W  +R           I  KS+ E     S  T+N+    L         + N             + GN       
Subjt:  ALLSKPYAEAFDILERISRNKHQWLKYRVDLLTNSWLADIVMKSATESEAGASKATVNAVQSALCQYCEGEHQFEN-------------YPGN-------

Query:  -----------PYPPGFS--GQNQQSTQKPPETLSLEDMFTAYMTKNDVNVQSQVALLQSQVASLRNMEVQIYQLVTELKNRLRGALPSDTEERKRDGKE
                    YPPGF+  GQ  +  Q      SLE++   YM  ND       A +QSQ ASLRN+E+Q+ QL  +LK+R  GALPSDTE  KRD KE
Subjt:  -----------PYPPGFS--GQNQQSTQKPPETLSLEDMFTAYMTKNDVNVQSQVALLQSQVASLRNMEVQIYQLVTELKNRLRGALPSDTEERKRDGKE

Query:  QCKALTLKNGETPKTDKARASSSKPTDKLKHPGQPTAPATTTELRPLVDVHQGE
        QC ALTL++G                 K   P  P AP  T E   +V   QGE
Subjt:  QCKALTLKNGETPKTDKARASSSKPTDKLKHPGQPTAPATTTELRPLVDVHQGE

A0A6J1DXK5 uncharacterized protein LOC1110255007.9e-4141.44Show/hide
Query:  IHESWERFKELLKKCPHHRIPCCIHIENYCNGLNEVTQLVIDASENGALLSKPYAEAFDILERISRNKHQW----------------------LKYRVDL
        + ESWERFK L++K  +  IP CI I+ Y NGL++ T+LVIDAS NGALL+KPYAEAF+ILERIS N   W                      L  +++ 
Subjt:  IHESWERFKELLKKCPHHRIPCCIHIENYCNGLNEVTQLVIDASENGALLSKPYAEAFDILERISRNKHQW----------------------LKYRVDL

Query:  LTNSWLADIVMKSATESE---AGASKATVNAVQSALCQYCEGEHQFENYPGNPYPPGFSGQNQQSTQKPPETLSLEDMFTAYMTKNDVNVQSQVAL----
        LT     D+VM+S T      A A KA V+ +Q   C +C GE+++ N PGNP    + G N Q+ +  P ++         +T   V ++ +  L    
Subjt:  LTNSWLADIVMKSATESE---AGASKATVNAVQSALCQYCEGEHQFENYPGNPYPPGFSGQNQQSTQKPPETLSLEDMFTAYMTKNDVNVQSQVAL----

Query:  -----LQSQVASLRNMEVQIYQLVTELKNRLRGALPSDTEERKRDGKEQCKALTLKNGETPKT
             +QSQ  SLRN+E+Q+ QL T+LK++ +G LPSD +  KRDGKEQC ALTL++G+T  T
Subjt:  -----LQSQVASLRNMEVQIYQLVTELKNRLRGALPSDTEERKRDGKEQCKALTLKNGETPKT

A0A6J1G7Q6 uncharacterized protein LOC1114515981.0e-4035.09Show/hide
Query:  VADSFKVEGVGKETMRLKLFLYSLRDIAGARLYSLPVESITTWNDLAENFL--------------------------IHESWERFKELLKKCPHHRIPCC
        V+DSF+ +GV K+ +RL  F YSLRD A + L  L +  I +WN LAE FL                          + E+WERFKE L+KCPHH +P C
Subjt:  VADSFKVEGVGKETMRLKLFLYSLRDIAGARLYSLPVESITTWNDLAENFL--------------------------IHESWERFKELLKKCPHHRIPCC

Query:  IHIENYCNGLNEVTQLVIDASENGALLSKPYAEAFDILERISRNKHQWLKYR------------VDLLTN---------SWLADIVMKSATESEAGASKA
        I IE + NGLN  T+ V+DAS NG +LSK Y EA++ILERI+ N  QW+  R            VD L++         + L ++     +  +A A  A
Subjt:  IHIENYCNGLNEVTQLVIDASENGALLSKPYAEAFDILERISRNKHQWLKYR------------VDLLTN---------SWLADIVMKSATESEAGASKA

Query:  TVNAVQSAL--CQYCEGEHQFENYPGNP---------------------------------------------------YPPGFSGQN------QQSTQK
        TV  +Q+A   C YC  +H F+  P NP                                                   YPPGF  QN      QQ+T +
Subjt:  TVNAVQSAL--CQYCEGEHQFENYPGNP---------------------------------------------------YPPGFSGQN------QQSTQK

Query:  PPETLS--------LEDMFTAYMTKNDVNVQSQVALLQSQVASLRNMEVQIYQLVTELKNRLRGALPSDTEERKRDGKE
           T          LE +   YM +ND       A++QSQ  SLRN+EVQ+ QL  EL+NR  G LP+DTE  KR+G E
Subjt:  PPETLS--------LEDMFTAYMTKNDVNVQSQVALLQSQVASLRNMEVQIYQLVTELKNRLRGALPSDTEERKRDGKE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAGTGGCAGATTCATTTAAAGTAGAAGGAGTCGGTAAGGAGACTATGCGCCTGAAGCTATTTCTTTACTCTTTGAGGGATATTGCTGGAGCACGGTTGTATTCCCT
ACCAGTTGAGTCGATCACCACATGGAATGATTTAGCAGAAAATTTTTTGATACACGAATCATGGGAAAGATTCAAGGAATTGCTTAAGAAGTGTCCTCACCATCGGATAC
CATGCTGTATACATATAGAAAATTATTGTAATGGACTGAATGAGGTCACTCAGCTGGTGATAGATGCCTCGGAAAATGGGGCTCTACTGTCAAAACCCTATGCGGAAGCT
TTTGACATCCTAGAGAGAATTTCACGTAACAAGCACCAATGGTTGAAATATAGGGTTGATTTATTAACAAATTCATGGTTGGCTGATATAGTCATGAAAAGTGCAACTGA
AAGTGAGGCTGGAGCCTCAAAAGCCACGGTGAATGCTGTGCAAAGTGCTCTCTGTCAATATTGTGAAGGCGAACACCAATTTGAGAATTATCCAGGCAATCCCTATCCAC
CGGGTTTCTCAGGACAGAATCAACAATCCACTCAAAAGCCTCCAGAGACGTTGAGCTTGGAAGACATGTTCACGGCATACATGACGAAGAATGATGTTAATGTACAGAGC
CAGGTCGCATTGCTGCAGAGTCAGGTAGCATCCTTGCGAAATATGGAAGTCCAGATTTACCAATTGGTGACAGAATTGAAAAACAGGCTAAGGGGAGCGCTTCCAAGCGA
CACCGAGGAGCGAAAAAGGGACGGCAAGGAGCAATGCAAGGCTTTAACACTGAAAAATGGAGAGACGCCTAAGACAGACAAAGCTAGAGCATCATCGTCCAAACCAACTG
ACAAGCTGAAGCATCCGGGACAACCCACTGCTCCAGCAACCACAACAGAGTTACGACCCTTGGTGGATGTGCATCAAGGAGAAGTCACTATGAGGGTGCAGGATCAAGAG
ATTAAATTTTCGATATATGACTCCATGAAATATCCCTCGGATGCTGAGGAGTGTGCTTTCTTACGAGTGCTAGATGAAGCTGTGATGGCAATGCTAAGTGTAGAGGTTAT
GCTGGAGCACCAGAACAACGAGTTGAACAGCATGATGGAACAAGCAGATGAAATTTGCCAAGAAATTCTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGCAAGTGGCAGATTCATTTAAAGTAGAAGGAGTCGGTAAGGAGACTATGCGCCTGAAGCTATTTCTTTACTCTTTGAGGGATATTGCTGGAGCACGGTTGTATTCCCT
ACCAGTTGAGTCGATCACCACATGGAATGATTTAGCAGAAAATTTTTTGATACACGAATCATGGGAAAGATTCAAGGAATTGCTTAAGAAGTGTCCTCACCATCGGATAC
CATGCTGTATACATATAGAAAATTATTGTAATGGACTGAATGAGGTCACTCAGCTGGTGATAGATGCCTCGGAAAATGGGGCTCTACTGTCAAAACCCTATGCGGAAGCT
TTTGACATCCTAGAGAGAATTTCACGTAACAAGCACCAATGGTTGAAATATAGGGTTGATTTATTAACAAATTCATGGTTGGCTGATATAGTCATGAAAAGTGCAACTGA
AAGTGAGGCTGGAGCCTCAAAAGCCACGGTGAATGCTGTGCAAAGTGCTCTCTGTCAATATTGTGAAGGCGAACACCAATTTGAGAATTATCCAGGCAATCCCTATCCAC
CGGGTTTCTCAGGACAGAATCAACAATCCACTCAAAAGCCTCCAGAGACGTTGAGCTTGGAAGACATGTTCACGGCATACATGACGAAGAATGATGTTAATGTACAGAGC
CAGGTCGCATTGCTGCAGAGTCAGGTAGCATCCTTGCGAAATATGGAAGTCCAGATTTACCAATTGGTGACAGAATTGAAAAACAGGCTAAGGGGAGCGCTTCCAAGCGA
CACCGAGGAGCGAAAAAGGGACGGCAAGGAGCAATGCAAGGCTTTAACACTGAAAAATGGAGAGACGCCTAAGACAGACAAAGCTAGAGCATCATCGTCCAAACCAACTG
ACAAGCTGAAGCATCCGGGACAACCCACTGCTCCAGCAACCACAACAGAGTTACGACCCTTGGTGGATGTGCATCAAGGAGAAGTCACTATGAGGGTGCAGGATCAAGAG
ATTAAATTTTCGATATATGACTCCATGAAATATCCCTCGGATGCTGAGGAGTGTGCTTTCTTACGAGTGCTAGATGAAGCTGTGATGGCAATGCTAAGTGTAGAGGTTAT
GCTGGAGCACCAGAACAACGAGTTGAACAGCATGATGGAACAAGCAGATGAAATTTGCCAAGAAATTCTCTAA
Protein sequenceShow/hide protein sequence
MQVADSFKVEGVGKETMRLKLFLYSLRDIAGARLYSLPVESITTWNDLAENFLIHESWERFKELLKKCPHHRIPCCIHIENYCNGLNEVTQLVIDASENGALLSKPYAEA
FDILERISRNKHQWLKYRVDLLTNSWLADIVMKSATESEAGASKATVNAVQSALCQYCEGEHQFENYPGNPYPPGFSGQNQQSTQKPPETLSLEDMFTAYMTKNDVNVQS
QVALLQSQVASLRNMEVQIYQLVTELKNRLRGALPSDTEERKRDGKEQCKALTLKNGETPKTDKARASSSKPTDKLKHPGQPTAPATTTELRPLVDVHQGEVTMRVQDQE
IKFSIYDSMKYPSDAEECAFLRVLDEAVMAMLSVEVMLEHQNNELNSMMEQADEICQEIL