; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0025572 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0025572
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetroelement pol polyprotein-like
Genome locationchr10:15468682..15478007
RNA-Seq ExpressionLag0025572
SyntenyLag0025572
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_024949903.1 uncharacterized protein LOC112496715 [Citrus sinensis]3.3e-3642.52Show/hide
Query:  GLDQASKALVNASANGSFLKKPAKEAHAILDTIATNNRHWGEIEPTILKNLVKVVETEANLTMQAQIKAIHSMMMDLTMGNQANIAPANAISSPCCDICS
        GL+Q+++ +V+ASANG+ L K   EA+ IL+ IA NN  W        +    V   +A   + AQ+ ++  M+  +T       A  N IS   C  C 
Subjt:  GLDQASKALVNASANGSFLKKPAKEAHAILDTIATNNRHWGEIEPTILKNLVKVVETEANLTMQAQIKAIHSMMMDLTMGNQANIAPANAISSPCCDICS

Query:  EEHITDNCPLNPTSIFYVGPQGNQNRWNPYSATYN------LGTSNAGTNQFGKASIGFN--SQPTQQQQKNFQPVQVHNASQESNLEALIKEYMTRNDA
        E H+ DNCP NPTS+ YVG    QN+ NPYS TYN      L  S    NQ   A  G N  +QP    Q+N + + ++N  Q S+LE LIK+Y+ RN+A
Subjt:  EEHITDNCPLNPTSIFYVGPQGNQNRWNPYSATYN------LGTSNAGTNQFGKASIGFN--SQPTQQQQKNFQPVQVHNASQESNLEALIKEYMTRNDA

Query:  TV-------RNLEVQIGQIAQEIKNRPQGTLPSKTENPHREGKKQYKAVTLRSG
         V       RNLE QIGQ+A  + NRPQG+LPS TENP REGK+  K + LRSG
Subjt:  TV-------RNLEVQIGQIAQEIKNRPQGTLPSKTENPHREGKKQYKAVTLRSG

XP_030497803.1 uncharacterized protein LOC115713460 [Cannabis sativa]3.3e-3637.84Show/hide
Query:  GLDQASKALVNASANGSFLKKPAKEAHAILDTIATNNRHWGEIEPTILKNLVKVVETEANLTMQAQIKAIHSMMMDLTMGNQANIAPANAISSPCCDICS
        GL+ AS+ +++ASANG+ L K   EA  IL+ IA+NN  W        + +  V+E +A   + AQ+ ++ +++ ++ MG     A A   +   C  C 
Subjt:  GLDQASKALVNASANGSFLKKPAKEAHAILDTIATNNRHWGEIEPTILKNLVKVVETEANLTMQAQIKAIHSMMMDLTMGNQANIAPANAISSPCCDICS

Query:  EEHITDNCPLNPTSIFYVGPQGNQNRWNPYSATYNLGTS---NAGTNQFGKASI--GFNSQPTQQQQKNFQPVQVHNASQESNLEALIKEYMTRND----
        + H  +NCP N  S+ YVG Q      NPYS +YN       N      GK S   GF+ QP  QQ    QP      SQ S+LE+L+++YM +ND    
Subjt:  EEHITDNCPLNPTSIFYVGPQGNQNRWNPYSATYNLGTS---NAGTNQFGKASI--GFNSQPTQQQQKNFQPVQVHNASQESNLEALIKEYMTRND----

Query:  ---ATVRNLEVQIGQIAQEIKNRPQGTLPSKTENPHREGKKQYKAVTLRSGLEYDGPEYPINQEVRKIPEEVSEKSTKKRSGVLSPTRCKKVPSAS
           A++RNLEVQ+GQ+A ++KNRPQGTLPS TENP R+GK+  KAVTLRSG              + I   V+   +K+ S +      KK P+ S
Subjt:  ---ATVRNLEVQIGQIAQEIKNRPQGTLPSKTENPHREGKKQYKAVTLRSGLEYDGPEYPINQEVRKIPEEVSEKSTKKRSGVLSPTRCKKVPSAS

XP_030498047.1 uncharacterized protein LOC115713707 [Cannabis sativa]1.0e-3739.85Show/hide
Query:  GLDQASKALVNASANGSFLKKPAKEAHAILDTIATNNRHWGEIEPTILKNLVKVVETEANLTMQAQIKAIHSMMMDLTMGNQANIAPANAISSPCCDICS
        GL+ AS+ +++ASANG+   K   EA  I++ IA+NN  W        + +  V+E +A   + AQ+ ++ +++ ++ MG     A A   +   C  C 
Subjt:  GLDQASKALVNASANGSFLKKPAKEAHAILDTIATNNRHWGEIEPTILKNLVKVVETEANLTMQAQIKAIHSMMMDLTMGNQANIAPANAISSPCCDICS

Query:  EEHITDNCPLNPTSIFYVGPQGNQNRWNPYSATYN-------------LGTSNAGTNQFGKASI--GFNSQPTQQQQKNFQPVQVHNASQESNLEALIKE
        + H  +NCP NP S+ YVG Q      NPYS +YN              G S++G    GK S   GF+ QP  QQ  + QP      SQ S+LE+L+++
Subjt:  EEHITDNCPLNPTSIFYVGPQGNQNRWNPYSATYN-------------LGTSNAGTNQFGKASI--GFNSQPTQQQQKNFQPVQVHNASQESNLEALIKE

Query:  YMTRNDATV-------RNLEVQIGQIAQEIKNRPQGTLPSKTENPHREGKKQYKAVTLRSG
        YM +NDA +       RNLEVQ+GQ+A  +KNRPQGTLPS TENP R+GK+  KA+TLRSG
Subjt:  YMTRNDATV-------RNLEVQIGQIAQEIKNRPQGTLPSKTENPHREGKKQYKAVTLRSG

XP_030509259.1 uncharacterized protein LOC115723937 [Cannabis sativa]1.1e-3934.85Show/hide
Query:  GLDQASKALVNASANGSFLKKPAKEAHAILDTIATNNRHWGEIEPTILKNLVKVVETEANLTMQAQIKAIHSMMMDLTMGNQANIAPANAISSPCCDICS
        GL+ AS+ +++ASANG+ L K   EA  IL+ IA+NN  W        + +  V+E +A   + AQ+ ++ +++ ++ MG     A A   +   C  C 
Subjt:  GLDQASKALVNASANGSFLKKPAKEAHAILDTIATNNRHWGEIEPTILKNLVKVVETEANLTMQAQIKAIHSMMMDLTMGNQANIAPANAISSPCCDICS

Query:  EEHITDNCPLNPTSIFYVGPQGNQNRWNPYSATYN-------------LGTSNAGTNQF-GKASI--GFNSQPTQQQQKNFQPVQVHNASQESNLEALIK
        + H  +NCP NP S+ YVG Q      NPYS +YN              G S++G  Q  GK S   GF+ QP  QQ    QP      SQ S+LE+L++
Subjt:  EEHITDNCPLNPTSIFYVGPQGNQNRWNPYSATYN-------------LGTSNAGTNQF-GKASI--GFNSQPTQQQQKNFQPVQVHNASQESNLEALIK

Query:  EYMTRNDATV-------RNLEVQIGQIAQEIKNRPQGTLPSKTENPHREGKKQYKAVTLRSGLEYDGPEYPINQEVRKIPEEVSEKSTKKRSGVLSPTRC
        +YM +NDA +       RNLEVQ+GQ+A ++KNRPQGTLPS TENP R+GK+  KAVTLRSG              + I   V+   +K+ S +      
Subjt:  EYMTRNDATV-------RNLEVQIGQIAQEIKNRPQGTLPSKTENPHREGKKQYKAVTLRSGLEYDGPEYPINQEVRKIPEEVSEKSTKKRSGVLSPTRC

Query:  KKVPSASLG------AHLNRGYLIKSIF--TVDTQDSPTVDVFNDWLTIPLLCVHEFLHTNCSSRIVEAIQEL
        KK P+ S           NR +  + +F     T  S   +    W     L V + LH N    +VEA++++
Subjt:  KKVPSASLG------AHLNRGYLIKSIF--TVDTQDSPTVDVFNDWLTIPLLCVHEFLHTNCSSRIVEAIQEL

XP_030510138.1 uncharacterized protein LOC115724905 [Cannabis sativa]3.0e-3739.85Show/hide
Query:  GLDQASKALVNASANGSFLKKPAKEAHAILDTIATNNRHWGEIEPTILKNLVKVVETEANLTMQAQIKAIHSMMMDLTMGNQANIAPANAISSPCCDICS
        GL+ AS+ +++ASANG+ L K   EA  IL+ IA+NN  W        + +  V+E +A   + AQ+ ++ +++ ++ MG     A A   +   C  C 
Subjt:  GLDQASKALVNASANGSFLKKPAKEAHAILDTIATNNRHWGEIEPTILKNLVKVVETEANLTMQAQIKAIHSMMMDLTMGNQANIAPANAISSPCCDICS

Query:  EEHITDNCPLNPTSIFYVGPQGNQNRWNPYSATYN-------------LGTSNAGTNQFGKASI--GFNSQPTQQQQKNFQPVQVHNASQESNLEALIKE
        + H  +NCP NP S+ YVG Q      NPYS +YN              G S++G    GK S   GF+ QP Q Q            SQ S+LE+L+++
Subjt:  EEHITDNCPLNPTSIFYVGPQGNQNRWNPYSATYN-------------LGTSNAGTNQFGKASI--GFNSQPTQQQQKNFQPVQVHNASQESNLEALIKE

Query:  YMTRNDATV-------RNLEVQIGQIAQEIKNRPQGTLPSKTENPHREGKKQYKAVTLRSG
        YM +NDA +       RNLEVQ+GQ+A ++KNRPQGTLPS TENP R+ K+  KAVTLRSG
Subjt:  YMTRNDATV-------RNLEVQIGQIAQEIKNRPQGTLPSKTENPHREGKKQYKAVTLRSG

TrEMBL top hitse value%identityAlignment
A0A5B6VWJ0 Retroelement pol polyprotein-like4.7e-2833.85Show/hide
Query:  GLDQASKALVNASANGSFLKKPAKEAHAILDTIATNNRHWGEIEPTILKNLVKVVETEANLTMQAQIKAIHSMMMDLTMGNQANIA--PANAISSPCCDI
        GL   ++ +V+ASANG+ L K   EA+ I++ IA+NN  W        + +  + E +A  ++ +Q+ +I SM  +LT     + A  P N   +     
Subjt:  GLDQASKALVNASANGSFLKKPAKEAHAILDTIATNNRHWGEIEPTILKNLVKVVETEANLTMQAQIKAIHSMMMDLTMGNQANIA--PANAISSPCCDI

Query:  CSEEHITDNCPLNPTSIFYVGPQGNQNRW------NPYSATYNLGTSNAGTNQFGKASIGFNSQPTQQQQKNFQPVQVH---NASQESNLEALIKEYMTR
        C E H+ + CP NP S++Y+G Q NQNR       N Y++++      + +NQ G  +    +QP   Q  NF P QV     A   ++LE+L+K YM +
Subjt:  CSEEHITDNCPLNPTSIFYVGPQGNQNRW------NPYSATYNLGTSNAGTNQFGKASIGFNSQPTQQQQKNFQPVQVH---NASQESNLEALIKEYMTR

Query:  ND-------ATVRNLEVQIGQIAQEIKNRPQGTLPSKTENPHREGKKQYKAVTLRS---------------GLEYDGPEYPINQEVRKIPEEVSEKSTKK
        ND       AT++NLE Q+GQ+A E++NR QG LPS TENP   GK+  KA+TLRS                   D  E   + E    PE  S K  K 
Subjt:  ND-------ATVRNLEVQIGQIAQEIKNRPQGTLPSKTENPHREGKKQYKAVTLRS---------------GLEYDGPEYPINQEVRKIPEEVSEKSTKK

Query:  RSGVLSPTRCKKVPSASLGAHL
         SG ++  +    P+ SL   L
Subjt:  RSGVLSPTRCKKVPSASLGAHL

A0A6J1DAE9 uncharacterized protein LOC1110185141.2e-2833.56Show/hide
Query:  LDQASKALVNASANGSFLKKPAKEAHAILDTIATNNRHW--GEIEPTILK-NLVKVVETEANLTMQAQIKAIHSMMMDLTMGNQA---------NIAPAN
        LD  +K ++N +ANG+F KK   E   IL+ +A++N  W     +P   K +   V+  +   +MQ ++  ++ M+ +L +  ++           +P  
Subjt:  LDQASKALVNASANGSFLKKPAKEAHAILDTIATNNRHW--GEIEPTILK-NLVKVVETEANLTMQAQIKAIHSMMMDLTMGNQA---------NIAPAN

Query:  AISSPCCDICSEEHITDNCPLNPTSIFYVGPQGNQNRWNPYSATYNLGT------SNAGTNQFGKASIGFNSQ------------------PTQQQQKNF
         I+   C  CS+ H+ +NCP NP S +YVG  G    +NPYS TYN G       S  G      A+ G N Q                  P QQ  +N 
Subjt:  AISSPCCDICSEEHITDNCPLNPTSIFYVGPQGNQNRWNPYSATYNLGT------SNAGTNQFGKASIGFNSQ------------------PTQQQQKNF

Query:  QPVQVHNASQESNLEALIKEYMTRND--------------ATVRNLEVQIGQIAQEIKNRPQGTLPSKTENPHREGKKQYKAVTLRSGLEYDGPEYPI
        +       +  ++LE + KEYM RND              A +RNLEVQ+GQ A ++K RPQG+ P  TE   R+G +Q KAVTLRSGL Y+GP+ P+
Subjt:  QPVQVHNASQESNLEALIKEYMTRND--------------ATVRNLEVQIGQIAQEIKNRPQGTLPSKTENPHREGKKQYKAVTLRSGLEYDGPEYPI

A0A6J1DY39 uncharacterized protein LOC1110256535.0e-3032.37Show/hide
Query:  NEQLNAQWSGAWFLQDAQVKVEGSVGISVLIEC-----GLDQASKALVNASANGSFLKKPAKEAHAILDTIATNNRHW-GEIEPTILKNL--VKVVETEA
        NE +N  W      +D  +    ++GI   ++      G D  +K ++N +ANG F  K   E   ILD ++ +N  W  E   T  K      V+  + 
Subjt:  NEQLNAQWSGAWFLQDAQVKVEGSVGISVLIEC-----GLDQASKALVNASANGSFLKKPAKEAHAILDTIATNNRHW-GEIEPTILKNL--VKVVETEA

Query:  NLTMQAQIKAIHSMMMDLTMGN--------QANIAPANAISSPCCDICSEEHITDNCPLNPTSIFYVGPQGNQNRWNPYSATYNLG--------------
          +MQ QI  I  M+ ++   N          N +P   I+   C  C + H ++NCP NP+S++YVG Q NQ ++NPYS TYN G              
Subjt:  NLTMQAQIKAIHSMMMDLTMGN--------QANIAPANAISSPCCDICSEEHITDNCPLNPTSIFYVGPQGNQNRWNPYSATYNLG--------------

Query:  TSNAGTNQFGKASI---GFNSQPT-------QQQQKNF-QPVQVHNASQESNLEALI------------------------KEYMTRNDATVRNLEVQIG
        ++  G NQ  K +    GF + P          QQKN+ QP Q + ++ E  ++ LI                        K+YM RND TVR LE+Q+G
Subjt:  TSNAGTNQFGKASI---GFNSQPT-------QQQQKNF-QPVQVHNASQESNLEALI------------------------KEYMTRNDATVRNLEVQIG

Query:  QIAQEIKNRPQGTLPSKTENPHREGKKQYKAVTLRSGLEYDGPEYP
        Q+  E++ RPQG+LPS TE P R GK+   ++  RSGL+Y+GP  P
Subjt:  QIAQEIKNRPQGTLPSKTENPHREGKKQYKAVTLRSGLEYDGPEYP

A0A6J1DYG0 uncharacterized protein LOC1110257642.7e-3136.59Show/hide
Query:  LVNASANGSFLKKPAKEAHAILDTIATNNRHWGEIEPTIL---KNLVKVVETEANLTMQAQIKAIHSMMMDLTMG--------------NQANIAPANAI
        ++N +ANG+F KK   E   IL+ +A++N  W           ++   V+  +   +MQ +   ++  + ++ +G              +    AP   +
Subjt:  LVNASANGSFLKKPAKEAHAILDTIATNNRHWGEIEPTIL---KNLVKVVETEANLTMQAQIKAIHSMMMDLTMG--------------NQANIAPANAI

Query:  SSPCCDICSEEHITDNCPLNPTSIFYVGPQGNQNRWNPYSATYNLG---------TSNAGTNQFGKASIGFNSQ----PTQQ----------QQKNFQPV
        +   C  CSE HI D CP NP S+FYVG  GN   +NPYS TYN G             G+  F +     N Q    PTQQ          Q+    PV
Subjt:  SSPCCDICSEEHITDNCPLNPTSIFYVGPQGNQNRWNPYSATYNLG---------TSNAGTNQFGKASIGFNSQ----PTQQ----------QQKNFQPV

Query:  QVHNASQESNLEALIKEYMTRNDATV-------RNLEVQIGQIAQEIKNRPQGTLPSKTENPHREGKKQYKAVTLRSGLEYDGPEYP
        Q +N    SNLE ++KEYM R DA +       RN E Q+GQ+A E+KNRPQG+ P  TE P REGK+Q KAVTLRSGL YD P  P
Subjt:  QVHNASQESNLEALIKEYMTRNDATV-------RNLEVQIGQIAQEIKNRPQGTLPSKTENPHREGKKQYKAVTLRSGLEYDGPEYP

A0A6J1G7Q6 uncharacterized protein LOC1114515984.3e-2938.7Show/hide
Query:  GLDQASKALVNASANGSFLKKPAKEAHAILDTIATNNRHWGEIEPTILKNLVKVVETEANLTMQAQIKAIHSMMMDLTMGNQANI-APANA------ISS
        GL+ A+K +V+ASANG  L K   EA+ IL+ IA+NN  W ++     K   +V+E +A  ++ AQ+ ++ +++ +L  G  + I APA+        ++
Subjt:  GLDQASKALVNASANGSFLKKPAKEAHAILDTIATNNRHWGEIEPTILKNLVKVVETEANLTMQAQIKAIHSMMMDLTMGNQANI-APANA------ISS

Query:  PCCDICSEEHITDNCPLNPTSIFYVGPQGNQN--RWNPYSATYNLGTSN------AGTNQFGK---------ASIGFNSQPT-QQQQKNFQPVQVHNASQ
          C  C E+H  D CP NP SIFYVG Q +Q   + NP S TYN G  N       G   + +            G  +Q T   QQ   Q      A  
Subjt:  PCCDICSEEHITDNCPLNPTSIFYVGPQGNQN--RWNPYSATYNLGTSN------AGTNQFGK---------ASIGFNSQPT-QQQQKNFQPVQVHNASQ

Query:  ESN--LEALIKEYMTRNDATV-------RNLEVQIGQIAQEIKNRPQGTLPSKTENPHREG
         S   LE+LIKEYM RNDA +       RNLEVQ+GQ+A E++NRP G LP+ TE P REG
Subjt:  ESN--LEALIKEYMTRNDATV-------RNLEVQIGQIAQEIKNRPQGTLPSKTENPHREG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTACAATAAACAAAAATTTGGAGAAATCCAAATTGAGGATTTGGGAATAGGTGGATTGGAGTATGAGCATAAAGATGTAGGTGAGATTTCTAGTTTTAAGAGGAG
TTTTGAATCCTTAGAGCCAATAGATAAGAAATTCAAGCCTATTGAACCTTATAATTCATTGACATTGCTCCAGCAACCTGAGATTAGGAAATCCTTCATTGATGAGCGGT
TATTTACTATTTTGGGAGCACAGCCGGAGCCCACTGTTGATCAACTAGAACGGGAGTTCTACGCCAACATCGATGAAAATGAAGGATTCTTGGTTATTGTTCGTAGAGTT
GTTGTCGACTGGAGCCTTGGAGTGATTAATTCCTTATTTAATTTGCAAGACTTCCCCCATGCTGTTTTCAACAAACTATTAGTTGCTTCCTCAAACGAGCAACTGAATGC
CCAGTGGAGCGGTGCTTGGTTTTTGCAGGATGCTCAGGTAAAGGTTGAAGGTAGTGTTGGAATATCTGTTTTGATTGAGTGTGGATTGGATCAAGCTTCAAAAGCTCTAG
TCAATGCGTCTGCGAATGGATCTTTCTTGAAGAAGCCCGCGAAAGAAGCACATGCCATATTAGACACAATAGCCACGAATAACCGACATTGGGGGGAGATTGAACCTACA
ATTCTGAAGAATCTAGTGAAAGTTGTAGAGACAGAAGCGAATTTGACTATGCAAGCTCAGATTAAAGCCATTCACAGCATGATGATGGACTTGACCATGGGCAACCAAGC
AAACATCGCTCCTGCGAATGCTATCTCTTCTCCTTGTTGTGATATTTGTAGTGAAGAACATATCACTGATAATTGTCCATTAAACCCTACATCTATTTTTTATGTAGGAC
CACAGGGAAATCAAAACCGATGGAATCCGTATTCAGCCACGTACAACCTAGGTACCAGTAATGCCGGTACAAACCAATTCGGAAAAGCATCAATAGGATTTAATTCCCAA
CCTACACAACAGCAGCAGAAAAATTTCCAACCAGTCCAAGTTCATAATGCAAGCCAAGAGTCAAATCTAGAGGCTCTAATCAAAGAATACATGACAAGGAATGATGCCAC
TGTAAGGAACTTGGAAGTGCAAATTGGACAAATAGCTCAAGAGATCAAGAATAGACCACAAGGGACATTGCCTAGCAAAACTGAAAATCCTCACCGTGAAGGCAAGAAGC
AGTACAAGGCAGTTACCTTAAGAAGTGGATTGGAGTATGATGGACCAGAATACCCCATAAATCAAGAAGTTAGGAAAATCCCGGAAGAAGTTTCAGAGAAATCTACAAAA
AAGAGGTCCGGGGTTTTATCCCCTACAAGATGTAAAAAAGTCCCCAGTGCATCCTTGGGTGCACATCTCAACAGAGGATACCTCATAAAGTCTATCTTTACAGTCGACAC
TCAAGACTCTCCAACGGTTGATGTATTCAATGACTGGCTCACCATTCCGTTGCTTTGCGTTCATGAGTTTCTTCATACTAACTGTTCGTCTCGTATTGTAGAAGCAATTC
AGGAACTCTCTCTAAACTATTCCCAACTGTCAATTGTCTCAGACTCTAGGTACATGTACCAATCAAACTCATTTCCTTTCAGTGTTCGGATAAATTTTTTGGCAAGCAAG
TCTCCCCGAGTTCCAGCGTTCTCGAAAGTTTCAACGAAGTGGGCGATGCTTTGGATTGCCCTTTCCAACAAACTGTTGGAACTTGGGAGGTTGATACCCAGTGGCCATTC
TCAAATTGTCAATCCTCTTTGTATAAGGTCCCAGTCAACCGAACCGCAAATTAGGGTTTATGCTTTGACATGGGTCGTTGTTAGGTTCCTAGACGTCGCGACGCTGTCAC
AGCGTCTCGACGCTAAGGAGATAGCGTCTCGACGCTGTCTCTGTTGCCGCCCAATTCAGAAAAGGAAAGCAGCGTCGAGACGCTCCAAGAGCAGCGTCCCGACGCTGGCC
TACGGATGGCTCTCTTTTGACTTCTTCTTGGTTGAATCTTTTCCGTTCTTTAGCCAATTTTCATGTCCCGAACACGTATTTAGCTCCAATTTCTCGATTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGTACAATAAACAAAAATTTGGAGAAATCCAAATTGAGGATTTGGGAATAGGTGGATTGGAGTATGAGCATAAAGATGTAGGTGAGATTTCTAGTTTTAAGAGGAG
TTTTGAATCCTTAGAGCCAATAGATAAGAAATTCAAGCCTATTGAACCTTATAATTCATTGACATTGCTCCAGCAACCTGAGATTAGGAAATCCTTCATTGATGAGCGGT
TATTTACTATTTTGGGAGCACAGCCGGAGCCCACTGTTGATCAACTAGAACGGGAGTTCTACGCCAACATCGATGAAAATGAAGGATTCTTGGTTATTGTTCGTAGAGTT
GTTGTCGACTGGAGCCTTGGAGTGATTAATTCCTTATTTAATTTGCAAGACTTCCCCCATGCTGTTTTCAACAAACTATTAGTTGCTTCCTCAAACGAGCAACTGAATGC
CCAGTGGAGCGGTGCTTGGTTTTTGCAGGATGCTCAGGTAAAGGTTGAAGGTAGTGTTGGAATATCTGTTTTGATTGAGTGTGGATTGGATCAAGCTTCAAAAGCTCTAG
TCAATGCGTCTGCGAATGGATCTTTCTTGAAGAAGCCCGCGAAAGAAGCACATGCCATATTAGACACAATAGCCACGAATAACCGACATTGGGGGGAGATTGAACCTACA
ATTCTGAAGAATCTAGTGAAAGTTGTAGAGACAGAAGCGAATTTGACTATGCAAGCTCAGATTAAAGCCATTCACAGCATGATGATGGACTTGACCATGGGCAACCAAGC
AAACATCGCTCCTGCGAATGCTATCTCTTCTCCTTGTTGTGATATTTGTAGTGAAGAACATATCACTGATAATTGTCCATTAAACCCTACATCTATTTTTTATGTAGGAC
CACAGGGAAATCAAAACCGATGGAATCCGTATTCAGCCACGTACAACCTAGGTACCAGTAATGCCGGTACAAACCAATTCGGAAAAGCATCAATAGGATTTAATTCCCAA
CCTACACAACAGCAGCAGAAAAATTTCCAACCAGTCCAAGTTCATAATGCAAGCCAAGAGTCAAATCTAGAGGCTCTAATCAAAGAATACATGACAAGGAATGATGCCAC
TGTAAGGAACTTGGAAGTGCAAATTGGACAAATAGCTCAAGAGATCAAGAATAGACCACAAGGGACATTGCCTAGCAAAACTGAAAATCCTCACCGTGAAGGCAAGAAGC
AGTACAAGGCAGTTACCTTAAGAAGTGGATTGGAGTATGATGGACCAGAATACCCCATAAATCAAGAAGTTAGGAAAATCCCGGAAGAAGTTTCAGAGAAATCTACAAAA
AAGAGGTCCGGGGTTTTATCCCCTACAAGATGTAAAAAAGTCCCCAGTGCATCCTTGGGTGCACATCTCAACAGAGGATACCTCATAAAGTCTATCTTTACAGTCGACAC
TCAAGACTCTCCAACGGTTGATGTATTCAATGACTGGCTCACCATTCCGTTGCTTTGCGTTCATGAGTTTCTTCATACTAACTGTTCGTCTCGTATTGTAGAAGCAATTC
AGGAACTCTCTCTAAACTATTCCCAACTGTCAATTGTCTCAGACTCTAGGTACATGTACCAATCAAACTCATTTCCTTTCAGTGTTCGGATAAATTTTTTGGCAAGCAAG
TCTCCCCGAGTTCCAGCGTTCTCGAAAGTTTCAACGAAGTGGGCGATGCTTTGGATTGCCCTTTCCAACAAACTGTTGGAACTTGGGAGGTTGATACCCAGTGGCCATTC
TCAAATTGTCAATCCTCTTTGTATAAGGTCCCAGTCAACCGAACCGCAAATTAGGGTTTATGCTTTGACATGGGTCGTTGTTAGGTTCCTAGACGTCGCGACGCTGTCAC
AGCGTCTCGACGCTAAGGAGATAGCGTCTCGACGCTGTCTCTGTTGCCGCCCAATTCAGAAAAGGAAAGCAGCGTCGAGACGCTCCAAGAGCAGCGTCCCGACGCTGGCC
TACGGATGGCTCTCTTTTGACTTCTTCTTGGTTGAATCTTTTCCGTTCTTTAGCCAATTTTCATGTCCCGAACACGTATTTAGCTCCAATTTCTCGATTTAA
Protein sequenceShow/hide protein sequence
MEYNKQKFGEIQIEDLGIGGLEYEHKDVGEISSFKRSFESLEPIDKKFKPIEPYNSLTLLQQPEIRKSFIDERLFTILGAQPEPTVDQLEREFYANIDENEGFLVIVRRV
VVDWSLGVINSLFNLQDFPHAVFNKLLVASSNEQLNAQWSGAWFLQDAQVKVEGSVGISVLIECGLDQASKALVNASANGSFLKKPAKEAHAILDTIATNNRHWGEIEPT
ILKNLVKVVETEANLTMQAQIKAIHSMMMDLTMGNQANIAPANAISSPCCDICSEEHITDNCPLNPTSIFYVGPQGNQNRWNPYSATYNLGTSNAGTNQFGKASIGFNSQ
PTQQQQKNFQPVQVHNASQESNLEALIKEYMTRNDATVRNLEVQIGQIAQEIKNRPQGTLPSKTENPHREGKKQYKAVTLRSGLEYDGPEYPINQEVRKIPEEVSEKSTK
KRSGVLSPTRCKKVPSASLGAHLNRGYLIKSIFTVDTQDSPTVDVFNDWLTIPLLCVHEFLHTNCSSRIVEAIQELSLNYSQLSIVSDSRYMYQSNSFPFSVRINFLASK
SPRVPAFSKVSTKWAMLWIALSNKLLELGRLIPSGHSQIVNPLCIRSQSTEPQIRVYALTWVVVRFLDVATLSQRLDAKEIASRRCLCCRPIQKRKAASRRSKSSVPTLA
YGWLSFDFFLVESFPFFSQFSCPEHVFSSNFSI